Full-Stack Web Scraper for ML Using Node.js and MySQL

The documentation in this repository describes a full-stack web scraping platform for use in machine learning.

Architecture

We first break the architecture into four distinct components: Front-End, API, Scrapers and Database. The front-end connects to the API through a form, which the user fills in with inputs such as a YouTube URL. The API then calls the scrapers, which pull the necessary data and save it to the database. Finally, the data is served back to the front-end.

The tech stack is as follows:

Front-End - JavaScript
API - Express
Scraper - Puppeteer
Database - MySQL (TypeORM)

We also need Node.js, npm and MySQL installed.

The architecture consists of the following four components:

Front End

For the front-end we will have a header, an input box and a button. Below these we will have boxes that render the relevant information from the JSON returned by the API. The form sends the user's input to the API.
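The rendering and submit steps described above could look roughly like the sketch below. It is only an illustration: the function names, the `/creators` endpoint and the `creator` CSS class are assumptions, not taken from the repository. The record fields (name, avatar, channelURL) follow the Database section.

```javascript
// Escape text before placing it into HTML so scraped values cannot inject markup.
function escapeHtml(value) {
  return String(value)
    .replace(/&/g, '&amp;')
    .replace(/</g, '&lt;')
    .replace(/>/g, '&gt;')
    .replace(/"/g, '&quot;');
}

// Turn the JSON array returned by the API into one box per record.
function renderCreators(creators) {
  return creators
    .map((c) => [
      '<div class="creator">',
      `  <img src="${escapeHtml(c.avatar)}" alt="">`,
      `  <a href="${escapeHtml(c.channelURL)}">${escapeHtml(c.name)}</a>`,
      '</div>',
    ].join('\n'))
    .join('\n');
}

// Send the URL typed into the input box to the API.
// The endpoint path is a hypothetical default.
async function submitChannel(channelURL, endpoint = '/creators') {
  const response = await fetch(endpoint, {
    method: 'POST',
    headers: { 'Content-Type': 'application/json' },
    body: JSON.stringify({ channelURL }),
  });
  return response.json();
}
```

In the browser, the button's click handler would call `submitChannel` with the input box value and then assign the result of `renderCreators` to the `innerHTML` of the container element.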

API

We create a single route with two methods, GET and POST. We use Node.js and Express, a simple back-end framework.
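A minimal sketch of one route that answers both GET and POST is shown below. To keep it runnable without installing anything, it uses Node's built-in `http` module in place of Express; with Express, the same two handlers would be registered through `app.get` and `app.post`. The `/creators` path, the in-memory array standing in for the database, and the handler names are all assumptions made for illustration.

```javascript
const http = require('node:http');

// In-memory stand-in for the database (assumption for this sketch).
const creators = [];

// GET: return everything stored so far.
function handleGet() {
  return { status: 200, body: creators };
}

// POST: validate the payload and store a new record.
function handlePost(payload) {
  if (!payload || typeof payload.channelURL !== 'string') {
    return { status: 400, body: { error: 'channelURL is required' } };
  }
  const creator = { id: creators.length + 1, channelURL: payload.channelURL };
  creators.push(creator);
  return { status: 201, body: creator };
}

// Write a { status, body } result as a JSON response.
function send(res, result) {
  res.writeHead(result.status, { 'Content-Type': 'application/json' });
  res.end(JSON.stringify(result.body));
}

const server = http.createServer((req, res) => {
  if (req.url !== '/creators') {
    return send(res, { status: 404, body: { error: 'not found' } });
  }
  if (req.method === 'GET') {
    return send(res, handleGet());
  }
  if (req.method === 'POST') {
    let raw = '';
    req.on('data', (chunk) => { raw += chunk; });
    req.on('end', () => {
      let payload = null;
      try { payload = JSON.parse(raw); } catch { payload = null; }
      send(res, handlePost(payload));
    });
    return undefined;
  }
  return send(res, { status: 405, body: { error: 'method not allowed' } });
});

// Uncomment to start the server on a port of your choice:
// server.listen(3000);
```

Keeping the handlers separate from the HTTP wiring makes them easy to call directly in tests and to move into Express later.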

Scraper

This function takes in a URL, reaches out to YouTube, fetches the relevant data and then stores it in the database.
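The fetch, extract and store steps could be sketched as follows. This is not the repository's scraper: it assumes the channel page exposes its name and avatar through `og:title` and `og:image` meta tags, and the function names and the `save` callback are hypothetical. Puppeteer is loaded only when `loadWithPuppeteer` is actually called, so the parsing part can be tried without it.

```javascript
// Read the content attribute of a <meta property="..."> tag, or null if absent.
// Assumes the property attribute appears before the content attribute.
function metaContent(html, property) {
  const pattern = new RegExp(
    `<meta[^>]*property="${property}"[^>]*content="([^"]*)"`,
    'i',
  );
  const match = html.match(pattern);
  return match ? match[1] : null;
}

// Build the record that will be stored (fields follow the Database section).
function parseChannel(html, channelURL) {
  return {
    name: metaContent(html, 'og:title'),
    avatar: metaContent(html, 'og:image'),
    channelURL,
  };
}

// Load the rendered page with Puppeteer and return its HTML.
async function loadWithPuppeteer(url) {
  const puppeteer = require('puppeteer'); // required lazily; must be installed
  const browser = await puppeteer.launch();
  try {
    const page = await browser.newPage();
    await page.goto(url, { waitUntil: 'networkidle2' });
    return await page.content();
  } finally {
    await browser.close();
  }
}

// Fetch the page, extract the record, and hand it to a save callback.
async function scrapeChannel(channelURL, save, loadPage = loadWithPuppeteer) {
  const html = await loadPage(channelURL);
  const creator = parseChannel(html, channelURL);
  await save(creator);
  return creator;
}
```

Passing the page loader and the save step as arguments lets the scraper be exercised with canned HTML, which is useful because live pages change without notice.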

Database

We use MySQL here. For each record we store id, name, avatar and channelURL.
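One way to describe these four columns to TypeORM from plain JavaScript is an `EntitySchema`. The sketch below is an assumption about how the table might be declared: the entity name, table name and column types are illustrative and are not taken from the repository. TypeORM is required lazily so the options object can be inspected without it.

```javascript
// Column definitions in the shape TypeORM's EntitySchema expects.
// Names and types are illustrative assumptions.
const creatorSchemaOptions = {
  name: 'Creator',
  tableName: 'creators',
  columns: {
    id: { primary: true, type: 'int', generated: true }, // auto-increment key
    name: { type: 'varchar' },
    avatar: { type: 'varchar' },
    channelURL: { type: 'varchar' },
  },
};

// Build the entity; requires the typeorm package to be installed.
function defineCreatorEntity() {
  const { EntitySchema } = require('typeorm');
  return new EntitySchema(creatorSchemaOptions);
}
```

The resulting entity would then be listed in the `entities` array of the TypeORM connection configuration for the MySQL database.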

To run the program

First go into the server directory and initialise the npm project

$ npm init

Install all the necessary packages

$ npm install express
$ npm install body-parser

Run the index.js script

$ node index.js

Thanks to Aron from Uber
