
Hedwig



An LLM-based administrative assistant that mimics user writing styles and pre-writes email replies for executives, boosting professional and personal productivity by up to 25%


Project Motivation

Professionals receive hundreds of emails a day:

  • Many go unread
  • They are not organized or prioritized
  • Important documents (tax, federal, bank statements) can be missed

Busy professionals can take a long time to respond to all these emails, and the worry of having to respond can be very stressful. This is compounded by subpar response quality and workstream delays.

How can Hedwig help?

As an LLM-powered administrative assistant, Hedwig handles the following:

  • Pre-writes emails for users according to their personal writing styles
  • Picks out relevant emails from the past emails of the user
  • Gains context around the incoming email from previous responses
  • Identifies the relationship between sender and responder to craft responses
  • Ensures data security by storing data within the existing organizational infrastructure

Project Development

Table of Contents

  1. Concept Overview
  2. Data Overview
  3. Personalization Framework
  4. Prompt Fine-Tuning
  5. User Interface
  6. Acknowledgements

1. Concept Overview

The basic unit of data in our analysis is an email-reply pair. Any email-reply pair can be divided into three elements:

  • Global Context
  • Local Context
  • Pair Style Relationship

Hedwig taps into these three elements through a proprietary personalization framework while using an LLM to reply to any incoming email.
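To make the idea concrete, here is a minimal sketch of how an email-reply pair and its three elements might be represented in code. The class and field names are illustrative assumptions, not taken from the actual codebase.

```python
from dataclasses import dataclass, field

@dataclass
class EmailReplyPair:
    """One unit of data: an incoming email and the reply the user wrote to it."""
    incoming_email: str   # body of the email received
    user_reply: str       # the reply the user actually sent
    sender: str           # who sent the incoming email
    replier: str          # whose writing style Hedwig mimics

    # The three elements Hedwig builds around the pair
    global_context: list[str] = field(default_factory=list)  # similar past emails by the replier
    local_context: list[str] = field(default_factory=list)   # earlier messages in the same thread
    pair_style: list[str] = field(default_factory=list)      # adjectives describing how the two write to each other
```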


2. Data Overview

2A - Enron Dataset

The Enron email dataset was used for project development. This email dataset contains approximately 500,000 emails generated by employees of the Enron Corporation. The Federal Energy Regulatory Commission obtained it while investigating Enron's collapse.

The original data comes as highly dense JSON files.

For our purposes, we cleaned and transformed the data to build message-reply pairs using the following rules:

  • Emails with the same subject and sender-recipient pair were grouped into threads
  • Rather than focusing on individual emails, we emphasized email-reply conversations
  • Empty emails and emails containing only emojis or attachments were removed

Roughly 25,000 message-reply conversations were identified, which became our modeling dataset.
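The following is a hedged sketch of how the cleaning rules above could be applied with pandas. The file name and column names (subject, sender, recipient, body, date) are assumptions and will differ from the actual Enron schema used in the notebook.

```python
import pandas as pd

# Assumed schema: subject, sender, recipient, body, date (the real Enron dump differs).
emails = pd.read_json("enron_emails.json", lines=True)

# Remove empty emails (and, in practice, emails that are only emojis/attachments)
emails = emails[emails["body"].str.strip().str.len() > 0]

# Group emails with the same normalized subject and sender-recipient pair into threads
emails["thread_key"] = (
    emails["subject"].str.lower().str.replace(r"^(re:|fwd?:)\s*", "", regex=True)
    + "|"
    + emails[["sender", "recipient"]].apply(lambda r: "|".join(sorted(r)), axis=1)
)

# Within each thread, pair every message with the next reply in chronological order
emails = emails.sort_values("date")
emails["reply_body"] = emails.groupby("thread_key")["body"].shift(-1)
pairs = emails.dropna(subset=["reply_body"])[["thread_key", "body", "reply_body"]]
```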

Enron EDA + Cleaning Notebook

2B - Human Verification Data

We used close to 250 personal emails to mimic a local testing environment for our tool. This helped refine our prompts further and add human context to our analysis.

3. Personalization Framework

Most LLMs suffer from two problems when generating text:

  • There's often no single-point source of information
  • Information can be out of date

We add a retrieval-augmented generation (RAG) framework to our LLM as the personalization layer. That way, the LLM always refers to our primary-source data, which greatly reduces hallucinations.

The following modular framework embeds personalization elements in each step of the email generation process.

Personalization Framework Notebook


Level 0 - Memory

Before we even look at an incoming email, we can gather knowledge that helps put guardrails around our LLM:

  • Who is the sender?
  • On whose behalf is the LLM generating the response?
  • What is the average length of an email that the replier typically sends?

These three memory features are reinforced at every step of the personalization framework.
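A minimal sketch of how these memory features could be collected, assuming a simple helper function; the names and structure are illustrative, not the project's actual implementation.

```python
def build_memory(sender: str, replier: str, past_replies: list[str]) -> dict:
    """Collect the three Level 0 memory features reinforced at every step of the framework."""
    avg_len = sum(len(r.split()) for r in past_replies) / max(len(past_replies), 1)
    return {
        "sender": sender,                    # who wrote the incoming email
        "replier": replier,                  # on whose behalf the LLM is replying
        "avg_reply_length": round(avg_len),  # typical reply length, in words
    }
```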


Level 1 – Global Context

The goal of this level is to find past emails relevant to any incoming query.

Every human is unique:

  • Hedwig assigns independent vector universes to each individual user

  • All the emails written by that user in the past are converted to vector embeddings

  • Collections of each user's email embeddings are stored in a vector database (Chroma)

  • Any incoming email is embedded using the same model

  • Maximal Marginal Relevance Search is implemented to find the most relevant emails between the incoming query and the user vector store.

  • MMR search selects emails that are most similar to the incoming email while also optimizing for diversity among the retrieved examples.
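Below is a minimal sketch of the per-user vector store and MMR retrieval, assuming LangChain's Chroma wrapper and OpenAI embeddings; the actual embedding model and library calls used by Hedwig may differ.

```python
from langchain_community.vectorstores import Chroma
from langchain_openai import OpenAIEmbeddings

def retrieve_global_context(past_user_emails: list[str], user_id: str, incoming_email: str):
    """Build a per-user Chroma collection and retrieve relevant past emails via MMR."""
    embeddings = OpenAIEmbeddings()

    # One independent collection ("vector universe") per user
    user_store = Chroma.from_texts(
        texts=past_user_emails,
        embedding=embeddings,
        collection_name=f"user_{user_id}",
    )

    # Maximal Marginal Relevance: similar to the incoming email, but penalizing near-duplicates
    return user_store.max_marginal_relevance_search(incoming_email, k=4, fetch_k=20)
```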


VectorStore Creation Notebook


Level 2 - Local Context

There can be multiple parallel conversations on the same topic.

Two components define an email thread:

  • The sender-receiver pair
  • The subject line

Even with an MMR-backed RAG system, the LLM can add more information to the email reply than required, often accompanied by hallucinations. To provide guardrails for our output, we update the retriever to emphasize recent emails from the ongoing thread (if any).
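A hedged sketch of how such a thread-aware retriever could work: restrict retrieval to the ongoing thread via a metadata filter and prefer the most recent messages, falling back to global MMR retrieval when no thread exists. The metadata keys (thread_key, date) are assumptions.

```python
def get_local_context(user_store, incoming_email: str, thread_key: str, k: int = 4):
    """Prefer recent emails from the ongoing thread; fall back to global MMR retrieval."""
    # Restrict retrieval to the current thread via a metadata filter
    thread_hits = user_store.max_marginal_relevance_search(
        incoming_email, k=k, filter={"thread_key": thread_key}
    )
    if thread_hits:
        # Emphasize the most recent messages in the thread
        return sorted(thread_hits, key=lambda d: d.metadata["date"], reverse=True)[:k]
    return user_store.max_marginal_relevance_search(incoming_email, k=k)
```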


Notebook for finding threads.


Level 3 - Hyper-Personalization

Level 3A - Pair Style

The latest chronological emails between the user and the sender (irrespective of thread) are extracted, and a dedicated LLM endpoint is used to extract adjectives describing the writing style the two users share.

Inspiration was taken from the Chicago Manual of Style, "the venerable, time-tested guide to style, usage, and grammar in an accessible online format."
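For illustration, a hypothetical prompt for the dedicated style-extraction endpoint might look like the following; the exact wording used in the project will differ.

```python
STYLE_PROMPT = """Below are the most recent emails exchanged between {sender} and {replier}.
Describe the writing style {replier} uses with {sender} in five adjectives
(for example: formal, terse, warm, direct, playful).

Emails:
{recent_emails}

Adjectives:"""
```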

Level 3B - Merging Style and Context

The final text generation chain takes:

  • The contextual email from Level 2
  • The writing-style adjectives from Level 3A
  • Memory, which is integrated with each step of the LLM chain and reinforced in the final prompt

This step gives us our final output.
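An illustrative version of the final prompt that merges the Level 2 context, the Level 3A style adjectives, and the Level 0 memory; the placeholder names are assumptions, not the project's actual template.

```python
FINAL_PROMPT = """You are drafting an email reply on behalf of {replier}.

Incoming email from {sender}:
{incoming_email}

Relevant past emails and thread context:
{local_context}

{replier} typically writes to {sender} in a style that is: {style_adjectives}.
Keep the reply to roughly {avg_reply_length} words.

Reply:"""
```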

4. Prompt Fine-Tuning

Rather than fine-tuning the LLM itself, we focused on tuning model parameters and refining the prompt to extract maximum generative power without the GPU costs of model fine-tuning.


We built a multi-prompt approach, with each LLM endpoint dedicated to a single task.
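As an illustration, each endpoint can carry its own sampling parameters instead of fine-tuning the model; the endpoint names and values below are assumptions, not the settings used in the project.

```python
ENDPOINT_PARAMS = {
    "style_extraction":  {"temperature": 0.0, "max_tokens": 64},   # deterministic, short output
    "context_retrieval": {"temperature": 0.2, "max_tokens": 256},  # conservative retrieval/summarization step
    "reply_generation":  {"temperature": 0.7, "max_tokens": 512},  # more creative final draft
}
```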


5. User Interface

  • The base notebook is located here.
  • The entire codebase was productized using Streamlit and deployed on HuggingFace Spaces

Streamlit Integration Notebook
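A minimal Streamlit sketch of what the interface could look like; the widget layout and the generate_reply wrapper are illustrative assumptions, not the deployed app's code.

```python
import streamlit as st

def generate_reply(email: str, replier: str) -> str:
    # Placeholder for Hedwig's full personalization + generation chain
    return f"(draft reply on behalf of {replier})"

st.title("Hedwig - your email assistant")

incoming_email = st.text_area("Paste the incoming email")
replier = st.text_input("Reply on behalf of (user id)")

if st.button("Draft reply") and incoming_email:
    draft = generate_reply(incoming_email, replier)
    st.text_area("Suggested reply", value=draft, height=300)
```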

6. Acknowledgements

Website LinkedIn

LinkedIn
