User-Interest-Extraction-API

parameter:

event: these are available options
- rest
- tourist
- Movies
- TechNews
- SportsNews
- ithome
- taiwan_attractions
- taiwan_restaurant
- yahoosport
- yahoomovie
- politics
- art
num: please don't query num bigger than 20, some type of event json didn't prepare for that much.

API url:

example : 140.120.13.243:12435/api/?event=SportsNews&num=20

Usage

run Django
- . venv/bin/activate
- cd user_interest_api_server/
- nohup python3 manage.py runserver 0.0.0.0:8000 &
run Crawler
- nohup python3 manage.py scrapy_background.py &
- nohup python3 yahoo_sport.py &
- nohup python3 comprehensiveCrawler.py &

index.html provide a web service by loading json When you start to look up those data.
you need to run scrapy command first to get those json file.

scrapy:
- scrapy crawl Attractions -o attractions.json -t json from tripadvisor
- scrapy crawl restaurant -o restaurant.json -t json from tripadvisor
- scrapy crawl taiwan_attractions -o taiwan_attractions.json -t json from taiwan tripadvisor
- scrapy crawl taiwan_restaurant -o taiwan_restaurant.json -t json from taiwan restaurant
- scrapy crawl ithome -o ithome.json -t json from ithome
- scrapy crawl yahoomovie -o yahoomovie.json -t json from movie board of yahoo
  - use python scrapy_background.py to activate
- scrapy crawl artemperor -o art.json -t json from movie board of yahoo
  - use python scrapy_background.py to activate
- scrapy crawl politics -o politics.json -t json from movie board of yahoo
  - use python scrapy_background.py to activate
seperate crawlers:
- comprehensiveCrawler.py is a web crawler which can get the information from ESPN, CNET and IMDB, and organize them into JSON files for sports news, tech news and movie inforamtion respectively. The JSON files will be created in ./result/.
- python yahoo_sport.py from sport board of yahoo

Momo-Product-API

parameter:

proportion: the proportion of 20 product categories.
the categories sequence is shown below.
- tissue
- notebook
- lodging
- fragrance
- sportswear
- makeup
- health
- organicfood
- watch
- underwear
- girlshoes
- pregnant
- appliances
- camping
- bag
- book
- video
- stationery
- religion
- anime
num: the number of displays.
please don't input num bigger than 200.
currently there are only 10 data in each category.

API url:

example : 140.120.13.243:12435/momoapi/?proportion=[1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1]&num=100

Name		Name	Last commit message	Last commit date
Latest commit History 80 Commits
restaurant+attractions		restaurant+attractions
user_interest_api_server		user_interest_api_server
.gitignore		.gitignore
README.md		README.md
comprehensiveCrawler.py		comprehensiveCrawler.py
index.html		index.html
phantomjs		phantomjs
requirements.txt		requirements.txt
yahoo_sports.py		yahoo_sports.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

User-Interest-Extraction-API

parameter:

API url:

Usage

Momo-Product-API

parameter:

API url:

About

Releases

Packages

Languages

maxkaoppp/User-Interest-Extraction-API

Folders and files

Latest commit

History

Repository files navigation

User-Interest-Extraction-API

parameter:

API url:

Usage

Momo-Product-API

parameter:

API url:

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages