Farid Hamid · theanonymousfrog.hashnode.dev · Sep 2, 2024
Can Data Science unlock new insights into CGM driven Diabetes management?
For those interested in the code behind this analysis, check out my GitHub repository here (with a link to the analysis in a Google Colab Python notebook). Context: Having lived with the chronic illness of insulin-dependent (Type 1) Diabetes for over 25 years...
29 reads · BERTopic

AASHIF · aashif.hashnode.dev · Jun 24, 2024
UDA - Unstructured Document Analysis
UDA: A Benchmark Suite for Retrieval-Augmented Generation in Real-world Document Analysis. Introduction: In recent years, the use of Retrieval-Augmented Generation (RAG) has significantly enhanced the capabilities of Large Language Models (LLMs), enabl...
Information Retrieval

Vanshika Kumar · vanshikakumar.hashnode.dev · Apr 10, 2024
How to Text mine in R using NLP techniques
Introduction: Text mining, also known as intelligent text analysis, text data mining, or knowledge discovery in text (KDT), is the process of extracting significant patterns and meaning from unstructured text data. Natural language processing (NLP), dat...
1 like · R Language

freeCodeCamp for freeCodeCamp · freecodecamp.org · Jan 5, 2020
A Deep Dive into Word Embeddings for Sentiment Analysis
By Bert Carremans. When applying one-hot encoding to words, we end up with sparse (containing many zeros) vectors of high dimensionality. On large data sets, this could cause performance issues. Additionally, one-hot encoding does not take into accou...
keras

freeCodeCamp for freeCodeCamp · freecodecamp.org · Jun 13, 2019
Sentiment Analysis with Text Mining
By Bert Carremans. In this tutorial, I will explore some text mining techniques for sentiment analysis. We'll look at how to prepare textual data. After that, we will try two different classifiers to infer the tweets' sentiment. We will tune the hyperp...
Data Science

freeCodeCamp for freeCodeCamp · freecodecamp.org · Mar 7, 2019
How to extract keywords from text with TF-IDF and Python’s Scikit-Learn
By Kavita Ganesan. Back in 2006, when I had to use TF-IDF for keyword extraction in Java, I ended up writing all of the code from scratch. Neither Data Science nor GitHub were a thing back then and libraries were just limited. The world is much differ...
Data Science

freeCodeCamp for freeCodeCamp · freecodecamp.org · Oct 23, 2018
An easy way to make word clouds for data scientists
By Kavita Ganesan. About a year ago, I looked high and low for a Python word cloud library that I could use from within my Jupyter notebook. I needed it to be flexible enough to use counts or tfidf when needed or just accept a set of words and corresp...
Data Science

freeCodeCamp for freeCodeCamp · freecodecamp.org · Jul 26, 2018
Quick tips for constructing custom stop word lists
By Kavita Ganesan. In natural language processing (NLP) and text mining applications, stop words are used to eliminate unimportant words, allowing applications to focus on the important words instead. Stop words are a set of commonly used words in any...
Data Science