Apache Pig 0.10.1已经发布,这个版本并没有什么特性,更多的是为了稳定性以及一些bug的修正
显著变化:
1. 代码独立以及支持多种格式的下载.
下载地址: http://www.fightrice.com/mirrors/apache/pig/pig-0.10.1/
可以通过tar.gz,rpm,deb等方式下载获取
2. 更好的支持 Apache Hadoop 0.23.x/2.x
从0.10.0开始,pig就已经加入了Maven仓库.你可以通过以下方式配置pom.xml来使用pig0.10.1
<dependency> <groupId>org.apache.pig</groupId> <artifactId>pig</artifactId> <version>0.10.1</version> <classifier>h2</classifier> </dependency>
同时Pig团队修正了Pig运行在Hadoop 0.23X/2.X版本上的一些Bug( PIG-3035, PIG-2783, PIG-2761, PIG-2912, PIG-2791)
3.更好的支持Oracle JDK7.0
4.单元测试的错误修正
具体修改如下:
IMPROVEMENTS
PIG-2907: Publish pig jars for Hadoop2/23 to maven (rohini)
PIG-3019: Need a target in build.xml for source releases (gates)
PIG-2794: Pig test: add utils to simplify testing on Windows (jgordon via gates)
PIG-2908: Fix unit tests to work with jdk7 (rohini via dvryaboy)
PIG-2852: Update documentation regarding parallel local mode execution (cheolsoo via jcoveney)
PIG-2712: Pig does not call OutputCommitter.abortJob() on the underlying OutputFormat (rohini via gates)
PIG-2727: PigStorage Source tagging does not need pig.splitCombination to be turned off (prkommireddi via dvryaboy)
PIG-2711: e2e harness: cache benchmark results between test runs (thw via daijy)
PIG-2680: TOBAG output schema reporting (andy schlaikjer via jcoveney)
PIG-2650: Convenience mock Loader and Storer to simplify unit testing of Pig scripts (julien)
BUG FIXES
PIG-3107: bin and autocomplete are missing in src release (daijy)
PIG-3106: Missing license header in several java file (daijy)
PIG-3099: Pig unit test fixes for TestGrunt(1), TestStore(2), TestEmptyInputDir(3) (vikram.dixit via daijy)
PIG-3035: With latest version of hadoop23 pig does not return the correct exception stack trace from backend (rohini)
PIG-2953: "which" utility does not exist on Windows (daijy)
PIG-2960: Increase the timeout for unit test (daijy)
PIG-2958: Pig tests do not appear to have a logger attached (daijy)
PIG-2942: DevTests, TestLoad has a false failure on Windows (jgordon via daijy)
PIG-2943: DevTests, Refactor Windows checks to use new Util.WINDOWS method for code health (jgordon via dvryaboy)
PIG-2801: grunt "sh" command should invoke the shell implicitly instead of calling exec directly with the command tokens
(jgordon via daijy)
PIG-2800: pig.additional.jars path separator should align with File.pathSeparator instead of being hard-coded to ":"
(jgordon via azaroth)
PIG-2798: pig streaming tests assume interpreters are auto-resolved (jgordon via daijy)
PIG-2797: Tests should not create their own file URIs through string concatenation, should use Util.generateURI
instead (jgordon via daijy)
PIG-2796: Local temporary paths are not always valid HDFS path names (jgordon via daijy)
PIG-2795: Fix test cases that generate pig scripts with "load " + pathStr to encode "\" in the path (jgordon via daijy)
PIG-2940: HBaseStorage store fails in secure cluster (cheolsoo via daijy)
PIG-2821: HBaseStorage should work with secure hbase (rohini via daijy)
PIG-2890: Revert PIG-2578 (dvryaboy)
PIG-2859: Fix few e2e test failures (rohini via daijy)
PIG-2729: Macro expansion does not use pig.import.search.path - UnitTest borked (johannesch via daijy)
PIG-2791: Pig does not work with Namenode Federation (rohini via daijy)
PIG-2783: Fix Iterator_1 e2e test for Hadoop 23 (rohini via daijy)
PIG-2761: With hadoop23 importing modules inside python script does not work (rohini via daijy)
PIG-2759: Typo in document "Built In Functions" (daijy)
PIG-2745: Pig e2e test RubyUDFs fails in MR mode when running from tarball (cheolsoo via daijy)
PIG-2741: Python script throws an NameError: name 'Configuration' is not defined in case cache dir is not created
(knoguchi via daijy)
PIG-2669: Pig release should include pig-default.properties after rebuild (daijy)
PIG-2739: PyList should map to Bag automatically in Jython (daijy)
PIG-2730: TFileStorage getStatistics incorrectly throws an exception instead of returning null (traviscrawford via daijy)
PIG-2717: Tuple field mangled during flattening (daijy)
PIG-2721: Wrong output generated while loading bags as input (knoguchi via daijy)
PIG-2912: Pig should clone JobConf while creating JobContextImpl and TaskAttemptContextImpl in Hadoop23 (rohini via daijy)
PIG-2775: Register jar does not goes to classpath in some cases (daijy)
粗鲁的翻译自:http://hortonworks.com/blog/apache-pig-0-10-1-released/
更多技术文章、感悟、分享、勾搭,请用微信扫描:
相关推荐
iotdb-0.10.1
Apache Mahout 项目旨在帮助开发人员更加方便快捷地创建智能应用程序。Mahout 的创始者 Grant Ingersoll 介绍了机器学习的基本概念,并演示了如何使用 Mahout 来实现文档集群、提出建议和组织内容。
Tez是Apache最新的支持DAG作业的开源计算框架,它可以将多个有依赖的作业转换为一个作业从而大幅提升DAG作业的性能。Tez并不直接面向最终用户——事实上它允许开发者为最终用户构建性能更快、扩展性更好的应用程序。...
Tez是Apache开源的支持DAG作业的计算框架,它直接源于...(1)Apache二级开源项目(源代码今天发布的) (2)运行在YARN之上 (3) 适用于DAG(有向图)应用(同Impala、Dremel和Drill一样,可用于替换Hive/Pig等
1、可以根据excel存储的邻接矩阵生成图数据结构的节点拓扑形式,进行可视化 2、可以用于实体关系图的绘制
最新版log4cxx工具包,用于c++环境下log日志的打印输出,适用于linux环境,使用时需要配合log4j配置文件,定义日志输出级别和输出路径
tez-0.10.1-SNAPSHOT-minimal.tar.gz
这个是把checktrain在WIN系统上运行,进行苹果越狱工作,会制作 启动盘的朋友,可以下载下去使用。有4G的就够用的了。
Flask是一个使用 Python 编写的轻量级 Web 应用框架。其 WSGI 工具箱采用 Werkzeug ,模板引擎则使用 Jinja2 。Flask使用 BSD 授权。 Flask也被称为 “microframework” ,因为它使用简单的核心,用 extension 增加...
servo-core-0.10.1.jar
checkra1n 0.10.1.iso苹果越狱下载 这个是把checktrain在WIN系统上运行,进行苹果越狱工作,会制作 启动盘的朋友,可以下载下去使用。有4G的就够用的了。
这回是API文档,上次那个没检查,里面的东西都是坑。
tez-0.10.1-SNAPSHOT.tar.gz
已编译好的github上最新的tez-ui-0.10.1 比apache官网上的都要高一个版本,解决了很多bug
Bazel的安装包,官网下载速度慢,可使用此链接 先下载Bazel的安装包 https://github.com/bazelbuild/bazel/releases,我下载的是..../bazel-0.10.1-installer-linux-x86_64.sh 注意:bazel版本不能过高,否则会报错
资源来自pypi官网。 资源全名:statsmodels-0.10.1-cp37-cp37m-manylinux1_x86_64.whl
基于github tez最新版本编译,编译日期为2021-03-22,压缩包带有tar包和tez-ui的war包
Python下的SCIPY模块,用于进行科学计算,包含大量的函数,十分方便,适用于Python 2.5版本和Windows系统。
scipy for python2.5,用于进行科学计算,包含大量的函数,十分方便,适用于Python 2.5版本和Windows系统。
scipy-0.10.1.zip