GitHub - rohancsalvi/B.Tech-Final-Year-Project: Final Year Project

B.Tech (Computer Science & Engineering) Final Year Project - VNIT, Nagpur

Team Members -

Abdul Sattar Mapara
Saket Chopade
Rohan Salvi
Pritam Kumar Sahoo

Guided by -

Dr. U.A. Deshpande Sir (VNIT, Nagpur)
Dr. Sagar Sunkle Sir (TRDDC, Pune)

About the Project

The aim of the project is to gather time-stamped factual information about a given topic/entity from a given set of documents (Brokerage Reports).

More precisely, given a set of documents (brokerage reports in PDF format), about a company or a bank (or any organization) published over a period of 1-2 years, it is expected that factual information about that company, or a bank (or any entity) to be extracted (in the form of semi-structured statements) and classified as an increasing or decreasing trend. The extracted facts are expected to be grouped by date/month.

Summary of Tasks Accomplished

Collecting and Processing the reports
1. Brokerage Reports collected from - trendlyne.com
2. PDF -> Text conversion
3. Text -> Sentence (Sentence Tokenization)
4. Pass through spaCy pipeline for tokenization (into tokens), lemmatization, Part of Speech Tagging, Dependency Parse tree generation, Named Entity Recognition
Extraction of Date/Timestamp
1. Using Named Entity Recognition
2. Using Metadata associated with the reports
Extraction of Facts in the form of Semi Structured Statements
1. Using Textacy library
2. Using Dependency Parse tree generated by spaCy (Custom Approach)
3. Explored relation extraction using Stanford Open IE
Sentiment Analysis (Sentence Classification)
1. Dictionary based approach
2. Machine learning based approach (using Support Vector Machines)
3. Deep learning based approach (using Convolutional Neural Networks)
Note: Conversion of words to numbers done using custom word2vec model
Application (using Flask framework) for demonstration of the project

About this Repository

This repository contains the source code written during the project for accomplishing the required tasks and experimentation.

This branch (master) contains the source code of the application developed for demonstration.

Demonstration

Video - Download Final-Year-Project-Demo OR View Final-Year-Project-Demo

Name		Name	Last commit message	Last commit date
Latest commit History 27 Commits
templates		templates
.gitignore		.gitignore
Final-Year-Project-Demo.mp4		Final-Year-Project-Demo.mp4
README.md		README.md
keywords.txt		keywords.txt
main.py		main.py
sentences_key.txt		sentences_key.txt
svo.py		svo.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

B.Tech (Computer Science & Engineering) Final Year Project - VNIT, Nagpur

Team Members -

Guided by -

About the Project

Summary of Tasks Accomplished

About this Repository

Demonstration

About

Releases

Packages

Languages

rohancsalvi/B.Tech-Final-Year-Project

Folders and files

Latest commit

History

Repository files navigation

B.Tech (Computer Science & Engineering) Final Year Project - VNIT, Nagpur

Team Members -

Guided by -

About the Project

Summary of Tasks Accomplished

About this Repository

Demonstration

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages