Tag: Machine Learning

Classifying text using LLMs

Text classification is one of the most common use cases in Natural Language Processing, with numerous practical applications – now easier to access with Large Language Models. Companies use text classification in multiple scenarios to become more efficient: Tagging large volumes of data: reducing manual labor with better filtering, automatically organizing large volumes of […]

Written on Aug 29, 2023

Making Money with Mediocre AI Models: A Guide for Business Stakeholders

In the world of AI, it’s easy to assume that only the most accurate models can bring value to your business. However, this is far from the truth. In fact, even mediocre models can be transformed into money-making machines with the right strategies. In this article, we’ll explore three real-life examples of how we turned […]

Written on Aug 15, 2023

Increasing Efficiency with Active Learning

The problem: Labeling data is boring (and expensive). So there you are. You have collected your data, analyzed it, processed it, and built your sophisticated model architecture. After many hours of training and evaluating, you have come to a very unpleasant conclusion: you need more data. Before you readjust your budget to fit the extra […]

Written on Mar 3, 2023

NILG.AI among the winning startups of HODCON Challenge 2022

We are proud to announce that NILG.AI was among the winning startups of the open call for the Hands on Data 2022 Conference. Hands on Data is an open innovation initiative that creates matchmaking opportunities between major corporations in the Ruhr area and global startups. From a pool of over 100 applications of startups from […]

Written on Feb 13, 2023

How to deal with the annoying implications of changing data sources

Let’s discuss a common scenario in AI consulting. The client provides access to data sources in formats such as CSVs or databases that aren’t in a production environment. Why? Usually, they’re exploring the value of the project, do not want to disclose too much data and want to prevent technical problems from happening at the […]

Written on Nov 20, 2022

Stop removing outliers just because!

Outliers are data points that stand out for being different from the remaining data distribution. An outlier can be: an odd value in a feature; a data point distant from the centroid of the data; or a data point in a region of low density, but between areas of high density. Suppose you have been working […]

Written on Nov 14, 2022

Duplicate detection in text data

A common use case seen across several industries is the creation of systems capable of detecting the similarity between pairs of objects – images and texts. For example, duplicate detection in marketplaces, or recommendation systems that show similar objects to the ones the user has searched for, can use such systems. They can also be […]

Written on Oct 25, 2022

Turning classes into inputs

Let’s face it, we have all worked on an ML project where we had to predict a ridiculously high number of classes, large enough to leave an embarrassingly small number of observations per class. Most people model these tasks as a multiclass classification problem where, for each input observation, we must predict […]

Written on Sep 22, 2022

Privacy Preserving Machine Learning

This article reports my work at NILG.AI during a curricular internship on privacy-preserving Machine Learning. Trip data is any type of data that connects the origin and destination of a person’s travel and is generated in countless ways as we move about our day and interact with systems connected to the internet. But why is […]

Written on Aug 16, 2022

New internships in AI: Privacy-preserving ML and Similarity Learning

We are proud to share that NILG.AI has partnered with the Faculty of Engineering of the University of Porto (FEUP) through internships in AI as part of curricular units! We have received two new interns, Margarida Vieira and Beatriz Lopes, both enrolled in the Bachelor in Informatics and Computing, tackling two very distinct challenges with […]

Written on Apr 21, 2022