# 大数据应用与开发 **Repository Path**: niit_edu_cn_0/big-data ## Basic Information - **Project Name**: 大数据应用与开发 - **Description**: 大数据应用与开发教程 - **Primary Language**: Python - **License**: Apache-2.0 - **Default Branch**: master - **Homepage**: None - **GVP Project**: No ## Statistics - **Stars**: 1 - **Forks**: 0 - **Created**: 2025-03-04 - **Last Updated**: 2026-02-09 ## Categories & Tags **Categories**: Uncategorized **Tags**: None ## README # 大数据应用与开发 #### [1 大数据环境准备](https://gitee.com/niit_edu_cn_0/big-data/raw/master/experiment%20documents/1.%20%E7%BB%AA%E8%AE%BA.md) #### 2 大数据环境搭建 ##### [2.1 安装JAVA 1.8](https://gitee.com/niit_edu_cn_0/big-data/blob/master/experiment%20documents/2.1%20Java%20%E5%AE%89%E8%A3%85.md) ##### [2.2 配置SSH免密登录](https://gitee.com/niit_edu_cn_0/big-data/blob/master/experiment%20documents/%202.2%20SSH%E5%85%8D%E5%AF%86%E7%99%BB%E5%BD%95.md) ##### [2.3 安装Python3.9](https://gitee.com/niit_edu_cn_0/big-data/edit/master/install%20python.md) ##### 2.3 idea 开发环境安装 #### 3 安装Hadoop框架 ##### [3.1 Hadoop 环境配置](https://gitee.com/niit_edu_cn_0/big-data/blob/master/experiment%20documents/3.hadoop%E7%8E%AF%E5%A2%83%E6%90%AD%E5%BB%BA.md) ##### [3.2 HDFS单词计数](https://gitee.com/niit_edu_cn_0/big-data/blob/master/experiment%20documents/3.2%20HDFS%E5%8D%95%E8%AF%8D%E8%AE%A1%E6%95%B0%E5%AE%9E%E9%AA%8C.md) #### 4 大数据生态圈 ##### [4.1 ZooKeeper 安装](https://gitee.com/niit_edu_cn_0/big-data/blob/master/experiment%20documents/4.1%20Zookeeper%20%E5%AE%89%E8%A3%85%E9%85%8D%E7%BD%AE.md) ##### [4.2 Flume 安装](https://gitee.com/niit_edu_cn_0/big-data/blob/master/experiment%20documents/4.1%20Flume%20%E5%AE%89%E8%A3%85.md) ##### 4.3 Hbase 安装 ##### [4.4 Hive 安装](https://gitee.com/niit_edu_cn_0/big-data/blob/master/experiment%20documents/4.2%20Hive%20%E5%AE%89%E8%A3%85%E5%AE%9E%E9%AA%8C.md) #### 5 Spark 环境搭建 ##### [5.1 Spark 安装配置](https://gitee.com/niit_edu_cn_0/big-data/blob/master/experiment%20documents/5.1%20Spark%E5%AE%89%E8%A3%85%E9%85%8D%E7%BD%AE.md) ##### [5.2 RDD算子](https://gitee.com/niit_edu_cn_0/big-data/blob/master/experiment%20documents/5.2%20RDD%E7%AE%97%E5%AD%90%E5%AE%9E%E9%AA%8C.md) ##### 5.3 Mllib 机器学习库 ##### 5.4 并行机器学习算法实例 #### 扩展实验 ##### [1. HuDi 数据湖安装配置实验](https://gitee.com/niit_edu_cn_0/big-data/blob/master/experiment%20documents/hudi%20%E9%85%8D%E7%BD%AE%E5%AE%9E%E9%AA%8C.md) #### 课程说明 实验课时:24 总学时:24 任课教师: 夏吉安(计算机与软件学院) E-mail:xiagyan@niit.edu.cn 手机:13851814532 课程周数:11-18周 上课学期:2024-2025学年第2学期 上课班级:软件2433-Web开发方向1班,软件2433-Web开发方向2班