Welcome to Profoundly Mundane! This blog discusses engineering in the context of Natural Language Processing (NLP). While the resources and information available have grown greatly in the last few years, many 'simple' subjects remain under-discussed, which makes it hard to understand the trade-offs involved in selecting NLP methods, unlike in more established engineering fields. The goal of this blog is to explore the hidden depth within the seemingly mundane aspects of NLP and to make the different design choices more accessible.
Posts
- Deep dive: On the Theoretical Limitations of Embedding-Based Retrieval
- Temperature, Tokens, and Long Tales/Tails
- WIP: Using Landmarks to Extract Spans with Prompting
- Practical Tidbits: Taking a Magnifying Glass to (Text) Classifier Performance
- Work In Progress: LLMs for ETL
- Improving the NLP Tool Kit: Characterization
- Fun with Words: A Foray into Solving NYT Connections via Decomposition
- Fun With Words: NYT Connections
- Quick and Dirty Metric to Imperial Conversions (How to Entertain Yourself as an American Driving in a Metric Country)
- Negative Result: Improving Fixed Vocab Text Representations
- Practical Tidbits: To Pickle or Not to Pickle
- Practical Tidbits: Selecting MinHash Hyperparameters for Deduplication
- Practical Tidbits: ElasticSearch with custom Embeddings (Vectors) for Versions Greater than 7.6
- Original Work: “Nudging” Active Learning to Learn Minority Classes
- An In-depth Discussion of Textual Similarity: Taking a look at the toolkit
- A Dream: An Easy Way to Work with Documents and (implicitly) Structured Text
- An In-depth Discussion of Textual Similarity: Characteristics and When They Matter
- An In-depth Discussion of Textual Similarity: Starting the Conversation
Subscribe via RSS