大数据开发面试题

1.MapReduce 工作原理:https://blog.csdn.net/m0_37558366/article/details/89500539

2.MapReduce、Hive、Spark中数据倾斜:https://blog.csdn.net/lzw2016/article/details/89284124

3.数据建模概述:https://www.cnblogs.com/lpdeboke/p/14898148.html

4.hive中内部表和外部表的区别:https://blog.csdn.net/aimee12345/article/details/82493139

5.hive中小文件产生的原因?如何避免?:https://blog.csdn.net/aaaaajiboke/article/details/86646651

6.hive中的静态分区和动态分区:https://blog.csdn.net/a200822146085/article/details/89841387

7.hive中常见存储格式;https://blog.csdn.net/qq_43665254/article/details/112756767

8.hive常见的建表方式:https://blog.csdn.net/qq_43665254/article/details/112759682

9.hive性能调优https://www.cnblogs.com/ITtangtang/p/7683028.html

9.python基础面试题:https://www.cnblogs.com/lpdeboke/p/11347990.html

10.hive面试全集:https://docs.qq.com/doc/DUW9uQVNjUUhOeUFO

原文地址:https://www.cnblogs.com/lpdeboke/p/14898141.html