Senior Design Team sddec22-19 • Text extractions from documents into Elasticsearch

Project Overview

Client:

BuilderTrend

Project Scope:

As part of the "Global Search" initiative at Buildertrend, we want to enable our customers to search not only through data stored in our database, but also through text within files that they've uploaded. That includes Word, Powerpoint, PDF, and other file types common to Buildertrend projects.

To do this, we'll be utilizing Elasticsearch as a data store and search technology. How the text is extracted from the files and stored into Elastic is the primary goal of this project.

Project Goals:

  • Find a tool that can extract text out of numerous different file types. The more file types, the better. We can provide a list of the top file types used in Buildertrend.
  • Write an application that utilizes this tool and pushes the data into Elasticsearch
  • The application should be able to demonstrate the search functionality with a simple UI

Team Members

Bruce Bitwayiki

Backend Architecture Design

Bruce is a senior in Computer Engineering with interest in cloud computing and machine learning. He will be working on the backend implementation and testing for this project.

Jared Hayashi

Backend Architecture Design & Client Communcication

Jared is a senior in Software Engineering. He will be designing the backend implementation.

Rushal Sohal

Frontend Architecture Design

Rushal is a senior in Computer Engineering with interest in AI and Deep Learning. He will be working on the frontend architecture.

Tiffany Mayberry

Frontend Architecture Design & Faculty Advisor Communcication

Tiffany is a senior in Software Engineering. She will be working on designing the user interface of the application.



Reports

491 Reports (Weekly)

Report 1 (1/25/22 - 2/6/22)
Report 2 (2/7/22 - 2/13/22)
Report 3 (2/14/22 - 2/20/22)
Report 4 (2/21/22 - 2/27/22)
Report 5 (2/28/22 - 3/6/22)
Report 6 (3/7/22 - 3/11/22)
Report 7 (3/21/22 - 3/27/22)
Report 8 (3/28/22 - 4/3/22)
Report 9 (4/4/22 - 4/10/22)

492 Reports (Biweekly)

Report 10 (8/22/22 - 9/15/22)
Report 11 (9/16/22 - 9/30/22)
Report 12 (9/30/22 - 10/11/22)
Report 13 (10/11/22 - 10/25/22)
Report 14 (10/26/22 - 11/8/22)
Report 15 (11/9/22 - 11/22/22)


491 Final Design Documents

Final Design Document
491 Final Presentation


492 Final Design Documents

Final Report
Poster
492 Final Presentation


Design Documents

Team Characteristics
Team Contract
Professionalism Document
Project Requirements
Project Plan
Project Design
Project Testing


Lightning Talks

Elevator Pitch
Requirements Pitch
Project Plan Pitch
Project Design Pitch
Testing Plan Pitch