bigData-project

所属分类:大数据
开发工具:Java
文件大小:69KB
下载次数:0
上传日期:2020-05-07 20:10:05
上 传 者sh-1993
说明:  2019-2020年Unibo-切塞纳校区大数据课程项目
(Project for the Big Data course 2019-2020 at Unibo - Campus di Cesena)

文件列表:
LICENSE (1072, 2020-04-21)
build.gradle (2085, 2020-04-21)
gradle (0, 2020-04-21)
gradle\wrapper (0, 2020-04-21)
gradle\wrapper\gradle-wrapper.jar (55190, 2020-04-21)
gradle\wrapper\gradle-wrapper.properties (231, 2020-04-21)
gradlew (5305, 2020-04-21)
gradlew.bat (2269, 2020-04-21)
settings.gradle (37, 2020-04-21)
src (0, 2020-04-21)
src\main (0, 2020-04-21)
src\main\java (0, 2020-04-21)
src\main\java\MapReduceJob.java (5116, 2020-04-21)
src\main\java\combiners (0, 2020-04-21)
src\main\java\combiners\FlightCombiner.java (816, 2020-04-21)
src\main\java\mappers (0, 2020-04-21)
src\main\java\mappers\FlightMapper.java (1262, 2020-04-21)
src\main\java\mappers\JoinMapperAirline.java (1147, 2020-04-21)
src\main\java\mappers\JoinMapperKpi.java (943, 2020-04-21)
src\main\java\mappers\SortMapper.java (770, 2020-04-21)
src\main\java\reducers (0, 2020-04-21)
src\main\java\reducers\FlightReducer.java (783, 2020-04-21)
src\main\java\reducers\JoinReducer.java (868, 2020-04-21)
src\main\java\reducers\SortReducer.java (463, 2020-04-21)
src\main\java\utils (0, 2020-04-21)
src\main\java\utils\AirlineKpiWritable.java (1275, 2020-04-21)
src\main\java\utils\Flight.java (1673, 2020-04-21)
src\main\java\utils\FlightDataWritable.java (1400, 2020-04-21)
src\main\scala (0, 2020-04-21)
src\main\scala\Main.scala (1333, 2020-04-21)
src\main\scala\model (0, 2020-04-21)
src\main\scala\model\Airline.scala (371, 2020-04-21)
src\main\scala\model\Flight.scala (835, 2020-04-21)
src\main\scala\model\Log.scala (378, 2020-04-21)
src\main\scala\model\Utils.scala (206, 2020-04-21)
src\main\scala\spark (0, 2020-04-21)
src\main\scala\spark\SparkJob.scala (2772, 2020-04-21)
... ...

# bigData-project This project is for the final exam of the course of Big Data 2019/2020. The goal of the project is to create two jobs (1 MapReduce / 1 Spark) on an Hadoop cluster. ## Dataset The dataset is illustrated [here](https://www.kaggle.com/usdot/flight-delays). ## Jobs Both the jobs are based on the same query: * Rank of the best airlines based on a KPI obtained with a relationship between total flights's delay minutes and distance in KM (total delay minutes / distance KM)

近期下载者

相关文件


收藏者