Skip to content

EYAIChallenge/CRM-Document-Extraction

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 

Repository files navigation

alt text

Logo AI Challenge 2025 | Community Connection Graph Challenge


🧠 Challenge: Community Connection Graph

In this strategic consulting challenge, your team will develop an innovative solution that transforms unstructured data into actionable community intelligence.

You'll be working with a dataset of physical forms containing citizen profile information (personal data, interests, preferences, diseases, event reviews) that have been captured through mobile photography in non-ideal conditions (blurry images, poor lighting, varied quality).

Objective

Design an end-to-end intelligence platform that:

  • Extracts unstructured information
  • Standardizes it
  • Builds a powerful graph database that integrates with a CRM platform
    → This will optimize company interactions with clients.

Go beyond basic data processing—integrate real-time community event data to:

  • Create a dynamic recommendation engine
  • Identify relevant upcoming events for specific community segments, or other relevant information that can be referenced
  • Suggest tailored content based on each client's profile

Show how advanced OCR, graph architecture, and data enrichment can turn fragmented inputs into a cohesive community intelligence system that drives meaningful engagement opportunities.


🗂️ Dataset

The dataset includes 30 .jpeg images
Captured using mobile phones, each image contains unstructured profile data about individual citizens, such as interests and preferences


🧩 Consulting Mindset Expectations

  • Data Transformation Strategists
    Convert imperfect inputs into structured, valuable intelligence.

  • Connection Architects
    Design systems that reveal non-obvious relationships and opportunities.
    Create intuitive visualizations of complex relationship networks.

  • Sell the Solution, Not Just the Process
    Don’t just explain what you built — present it as a valuable solution for the client.
    Highlight business impact and propose clear strategic next steps.


📦 Deliverables

  • ✅ A working end-to-end prototype demonstrating the complete data journey.
  • Organized, well-documented, and reproducible code.
  • ✅ A strategic presentation pitching your solution to the judging panel as EY executive stakeholders.
  • (Optional) A live demo to showcase your solution in action.

⚠️ Important Submission Requirement ⚠️

✅ Submit Before the 14h00 Deadline

📁 Submit a .zip folder containing:

  • Your Google Colab notebook with all cells run and outputs shown
  • Screenshots of all external tools/visualizations used

📧 Email to: [email protected]
📌 Subject: CRM Document Extraction – GroupName
📋 Include all team members’ names in the email

Only one submission per group


💡 Tips for Competitors

  • 🔧 Optimize Data Processing
    Develop a robust pipeline for refining imperfect images into structured intelligence, including preprocessing, OCR enhancement, and data cleaning.

  • 🌐 Strategic Graph Design
    Build an efficient graph database linking citizens, interests, locations, and events to uncover valuable insights.

  • 🚀 Intelligent Data Enrichment
    Use SerpAPI to dynamically integrate community events
    Build smart recommendation algorithms

  • 🛠️ Handle Data Quality Issues
    Address challenges like formatting inconsistencies, missing fields, and handwriting variations with strong validation mechanisms.

  • 📊 Impactful Visualizations
    Build clear, intuitive views that highlight key relationships for non-technical users.

  • 🎯 Tell a Strategic Story
    In your pitch, don’t just explain the tech. Show how your solution solves real problems and delivers lasting value.


🛠 Tech & Tools

🚨 Mandatory Requirement
All development must be in Google Colab using Python

You are free to choose any:

  • 📚 Libraries: Use any tool you need, such as Pandas, Scikit-learn, LangChain, etc.
  • 📈 Visualization tools: Python-based tools (Matplotlib, Seaborn), Power BI, Tableau
  • 🤖 AI Assistants: Feel free to consult ChatGPT, GitHub Copilot, Gemini, etc.

⏱ Time Management & Rules

  • ⏳ You have 4 hours to complete the challenge
    🔒 No extensions. Use it wisely.

  • 🗣 Present a 5-minute consulting pitch
    🎯 Simulate a client-facing delivery. You’ll be evaluated on your solution and how well you communicate it.

  • 👥 Each group may request:

    • 1 technical support session (up to 5 minutes)
    • 1 business support session (up to 5 minutes)

💬 Assistants won’t give you the solution — they'll guide your thinking


📋 Strategy & Workflow Tips

This is a consulting-style challenge. Time is tight — here’s how to win:

  1. 🧑‍🤝‍🧑 Assign roles early
    E.g., data engineer, business analyst, presenter

  2. ⏱️ Work in parallel
    Don’t wait on each other — split & conquer

  3. 🎤 Start building the presentation early
    Don’t leave it for the final 10 minutes

  4. Be realistic
    It’s better to deliver a focused, clear, and well-explained solution than a rushed or overly complex one.

🧠 Judging is based on teamwork, structure, and strategic thinking, not just technical skills


💬 Final Thought

This challenge represents an opportunity to demonstrate how emerging technologies can transform imperfect data into strategic intelligence that drives community engagement. The most successful teams will balance technical sophistication with practical implementation considerations, creating a solution that could realistically be deployed in community development initiatives. Your approach should demonstrate how technology can strengthen community connections and create more targeted, relevant engagement opportunities.

🏆 Success means:

  • Mastering imperfect data
  • Leveraging cutting-edge tech
  • Delivering a deployable, business-focused solution

Your tech isn’t just a tool — it’s a bridge to stronger community engagement


🏁 Brought to you by EY AI Challenge

About

EY AI Challenge 2025 | CRM Document Extraction Challenge

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published