
Shanghai Jiao Tong University EE208 Introduction to Electrical Engineering (Class C) course final project: a search engine

currybur/EE208-Search-Engine

Final Project of Introduction to Electrical Engineering (Class C) (EE208)

We implemented a search engine (both text-based and image-based) and set up a website to demonstrate it. Team members: Hua Zeyu, Liu Hanwen, and Liu Yuwei. 🌈🌈🌈


Dataset

We crawled text and pictures from the official websites of the National Museum of China, the Palace Museum, and the Shaanxi History Museum.
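The extraction step of such a crawl might look roughly like the sketch below. It is not the project's crawler: it uses only the Python standard library, and the class name `PageExtractor`, the sample HTML, and the `museum.example` URL are all made up for illustration. Fetching pages and following links are left out.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class PageExtractor(HTMLParser):
    """Collects visible text and absolute image URLs from one HTML page."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.text_parts = []
        self.image_urls = []
        self._skip_depth = 0  # greater than 0 while inside <script> or <style>

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1
        elif tag == "img":
            src = dict(attrs).get("src")
            if src:
                # Resolve relative paths against the page URL.
                self.image_urls.append(urljoin(self.base_url, src))

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep non-empty text that is not inside a script or style block.
        if self._skip_depth == 0 and data.strip():
            self.text_parts.append(data.strip())


# Hypothetical page content and URL, for illustration only.
html = """<html><body><h1>Bronze Ding</h1>
<script>var x = 1;</script>
<img src="/img/ding.jpg"><p>Shang dynasty ritual vessel.</p></body></html>"""

extractor = PageExtractor("https://museum.example/collection/")
extractor.feed(html)
print(extractor.text_parts)
print(extractor.image_urls)
```

The collected text would go to the text index and the image URLs to the picture library.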

Web Framework

The website is built with web.py.

User Interface Design

Thanks to 🌸 Zeyu, our website looks very polished!

Text-based Search

We use Lucene to build the text index and to run searches. Word segmentation is done with jieba.
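To show the idea behind indexing and searching, here is a toy inverted index in plain Python. It is not Lucene and not the project's code: `tokenize` is a whitespace stand-in for the segmentation step (where jieba would be used for Chinese text), and the class name and sample documents are invented for this sketch.

```python
from collections import defaultdict


def tokenize(text):
    """Stand-in tokenizer: lowercase and split on whitespace.

    For Chinese text, a word segmenter such as jieba would replace this.
    """
    return text.lower().split()


class InvertedIndex:
    """Toy inverted index: maps each term to the set of documents containing it."""

    def __init__(self):
        self.postings = defaultdict(set)

    def add(self, doc_id, text):
        for term in tokenize(text):
            self.postings[term].add(doc_id)

    def search(self, query):
        """Return sorted ids of documents that contain every query term."""
        terms = tokenize(query)
        if not terms:
            return []
        result = set(self.postings.get(terms[0], set()))
        for term in terms[1:]:
            result &= self.postings.get(term, set())
        return sorted(result)


# Made-up documents, for illustration only.
index = InvertedIndex()
index.add(1, "bronze vessel from the Shang dynasty")
index.add(2, "porcelain vase from the Ming dynasty")
index.add(3, "bronze mirror")

print(index.search("bronze"))          # [1, 3]
print(index.search("dynasty bronze"))  # [1]
```

A real engine such as Lucene adds relevance scoring, persistent storage, and analyzers on top of this basic structure.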

Image-based Search

Based on this blog, we use structural and color features to match a query picture against the picture library. Query pictures that are not stored in the library can also be matched to similar stored pictures.
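One way to combine a color feature with a structural feature is sketched below. The features are assumptions chosen for illustration (a coarse RGB histogram and an average-hash style fingerprint), not necessarily the ones from the blog or the project. Images are plain nested lists of `(r, g, b)` tuples and are assumed to have already been resized to the same dimensions; in practice a library such as OpenCV would handle loading and resizing.

```python
def color_histogram(pixels, bins=4):
    """Normalized RGB histogram with `bins` levels per channel."""
    hist = [0.0] * (bins ** 3)
    step = 256 // bins
    count = 0
    for row in pixels:
        for r, g, b in row:
            idx = (r // step) * bins * bins + (g // step) * bins + (b // step)
            hist[idx] += 1
            count += 1
    return [h / count for h in hist]


def average_hash(pixels):
    """Structural fingerprint: one bit per pixel, set when brighter than the mean."""
    gray = [(r + g + b) / 3 for row in pixels for r, g, b in row]
    mean = sum(gray) / len(gray)
    return [1 if v > mean else 0 for v in gray]


def similarity(a, b):
    """Average of histogram intersection and hash agreement, in [0, 1]."""
    ha, hb = color_histogram(a), color_histogram(b)
    color_score = sum(min(x, y) for x, y in zip(ha, hb))
    sa, sb = average_hash(a), average_hash(b)
    struct_score = sum(x == y for x, y in zip(sa, sb)) / len(sa)
    return 0.5 * color_score + 0.5 * struct_score


def best_match(query, library):
    """Return the name of the library picture most similar to the query."""
    return max(library, key=lambda name: similarity(query, library[name]))


# Tiny made-up 2x2 "pictures", for illustration only.
RED, BLUE, BLACK = (255, 0, 0), (0, 0, 255), (0, 0, 0)
library = {
    "red_top": [[RED, RED], [BLACK, BLACK]],
    "blue_left": [[BLUE, BLACK], [BLUE, BLACK]],
}
# A query that is not in the library but is close to "red_top".
query = [[(250, 10, 10), (250, 10, 10)], [(5, 5, 5), (5, 5, 5)]]

print(best_match(query, library))  # red_top
```

Because the query differs slightly from every stored picture, this also shows how a picture that is not in the library can still be matched to its nearest neighbor.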

Additional Functions

Liu Hanwen also implemented some multimedia functions, such as voice-text conversion, OCR, and face detection, which have not been integrated into the final version of our website.


How to Use This Repo

requirements:

  • Lucene
  • web.py
  • opencv
  • other packages imported by the code (not listed here)

It has been a long time since we finished this work, and I have forgotten the details... You can set up your environment by following the code and the error messages. Good luck 😜!
