HADOOP-11934 |
Use of JavaKeyStoreProvider in LdapGroupsMapping causes infinite loop |
Blocker |
security |
Mike Yoder |
Larry McCay |
HADOOP-11932 |
MetricsSinkAdapter hangs when being stopped |
Critical |
. |
Jian He |
Brahma Reddy Battula |
HADOOP-11802 |
DomainSocketWatcher thread terminates sometimes after there is an I/O error during requestShortCircuitShm |
Major |
. |
Eric Payne |
Colin Patrick McCabe |
HADOOP-11730 |
Regression: s3n read failure recovery broken |
Major |
fs/s3 |
Takenori Sato |
Takenori Sato |
HADOOP-11674 |
oneByteBuf in CryptoInputStream and CryptoOutputStream should be non static |
Critical |
io |
Sean Busbey |
Sean Busbey |
HADOOP-11604 |
Prevent ConcurrentModificationException while closing domain sockets during shutdown of DomainSocketWatcher thread. |
Critical |
net |
Liang Xie |
Chris Nauroth |
HADOOP-11491 |
HarFs incorrectly declared as requiring an authority |
Critical |
fs |
Gera Shegalov |
Brahma Reddy Battula |
HADOOP-11482 |
Use correct UGI when KMSClientProvider is called by a proxy user |
Major |
. |
Arun Suresh |
Arun Suresh |
HADOOP-11368 |
Fix SSLFactory truststore reloader thread leak in KMSClientProvider |
Major |
kms |
Arun Suresh |
Arun Suresh |
HADOOP-11350 |
The size of header buffer of HttpServer is too small when HTTPS is enabled |
Major |
security |
Benoy Antony |
Benoy Antony |
HADOOP-11343 |
Overflow is not properly handled in caclulating final iv for AES CTR |
Blocker |
security |
Jerry Chen |
Jerry Chen |
HADOOP-11333 |
Fix deadlock in DomainSocketWatcher when the notification pipe is full |
Major |
. |
zhaoyunjiong |
zhaoyunjiong |
HADOOP-11316 |
“mvn package -Pdist,docs -DskipTests -Dtar” fails because of non-ascii characters |
Blocker |
. |
Tsuyoshi Ozawa |
Tsuyoshi Ozawa |
HADOOP-11295 |
RPC Server Reader thread can’t shutdown if RPCCallQueue is full |
Major |
. |
Ming Ma |
Ming Ma |
HADOOP-11238 |
Update the NameNode’s Group Cache in the background when possible |
Minor |
. |
Chris Li |
Chris Li |
HADOOP-10786 |
Fix UGI#reloginFromKeytab on Java 8 |
Major |
security |
Tobi Vollebregt |
Stephen Chu |
HADOOP-8151 |
Error handling in snappy decompressor throws invalid exceptions |
Major |
io, native |
Todd Lipcon |
Matt Foley |
HDFS-8863 |
The remaining space check in BlockPlacementPolicyDefault is flawed |
Critical |
. |
Kihwal Lee |
Kihwal Lee |
HDFS-8846 |
Add a unit test for INotify functionality across a layout version upgrade |
Major |
namenode |
Zhe Zhang |
Zhe Zhang |
HDFS-8486 |
DN startup may cause severe data loss |
Blocker |
datanode |
Daryn Sharp |
Daryn Sharp |
HDFS-8480 |
Fix performance and timeout issues in HDFS-7929 by using hard-links to preserve old edit logs instead of copying them |
Critical |
. |
Zhe Zhang |
Zhe Zhang |
HDFS-8431 |
hdfs crypto class not found in Windows |
Critical |
scripts |
Sumana Sathish |
Anu Engineer |
HDFS-8404 |
Pending block replication can get stuck using older genstamp |
Major |
namenode |
Nathan Roberts |
Nathan Roberts |
HDFS-8270 |
create() always retried with hardcoded timeout when file already exists with open lease |
Major |
hdfs-client |
Andrey Stepachev |
J.Andreina |
HDFS-8245 |
Standby namenode doesn’t process DELETED_BLOCK if the add block request is in edit log. |
Major |
. |
Rushabh S Shah |
Rushabh S Shah |
HDFS-8219 |
setStoragePolicy with folder behavior is different after cluster restart |
Major |
. |
Peter Shi |
Surendra Singh Lilhore |
HDFS-8127 |
NameNode Failover during HA upgrade can cause DataNode to finalize upgrade |
Blocker |
ha |
Jing Zhao |
Jing Zhao |
HDFS-8072 |
Reserved RBW space is not released if client terminates while writing block |
Major |
datanode |
Arpit Agarwal |
Arpit Agarwal |
HDFS-8070 |
Pre-HDFS-7915 DFSClient cannot use short circuit on post-HDFS-7915 DataNode |
Blocker |
caching |
Gopal V |
Colin Patrick McCabe |
HDFS-8046 |
Allow better control of getContentSummary |
Major |
. |
Kihwal Lee |
Kihwal Lee |
HDFS-7999 |
FsDatasetImpl#createTemporary sometimes holds the FSDatasetImpl lock for a very long time |
Major |
. |
zhouyingchao |
zhouyingchao |
HDFS-7980 |
Incremental BlockReport will dramatically slow down the startup of a namenode |
Major |
. |
Hui Zheng |
Walter Su |
HDFS-7960 |
The full block report should prune zombie storages even if they’re not empty |
Critical |
. |
Lei (Eddy) Xu |
Colin Patrick McCabe |
HDFS-7930 |
commitBlockSynchronization() does not remove locations |
Blocker |
namenode |
Konstantin Shvachko |
Yi Liu |
HDFS-7929 |
inotify unable fetch pre-upgrade edit log segments once upgrade starts |
Major |
. |
Zhe Zhang |
Zhe Zhang |
HDFS-7915 |
The DataNode can sometimes allocate a ShortCircuitShm slot and fail to tell the DFSClient about it because of a network error |
Major |
. |
Colin Patrick McCabe |
Colin Patrick McCabe |
HDFS-7894 |
Rolling upgrade readiness is not updated in jmx until query command is issued. |
Critical |
. |
Kihwal Lee |
Brahma Reddy Battula |
HDFS-7885 |
Datanode should not trust the generation stamp provided by client |
Critical |
datanode |
vitthal (Suhas) Gogate |
Tsz Wo Nicholas Sze |
HDFS-7884 |
NullPointerException in BlockSender |
Blocker |
datanode |
Tsz Wo Nicholas Sze |
Brahma Reddy Battula |
HDFS-7871 |
NameNodeEditLogRoller can keep printing “Swallowing exception” message |
Critical |
. |
Jing Zhao |
Jing Zhao |
HDFS-7830 |
DataNode does not release the volume lock when adding a volume fails. |
Major |
datanode |
Lei (Eddy) Xu |
Lei (Eddy) Xu |
HDFS-7788 |
Post-2.6 namenode may not start up with an image containing inodes created with an old release. |
Blocker |
. |
Kihwal Lee |
Rushabh S Shah |
HDFS-7763 |
fix zkfc hung issue due to not catching exception in a corner case |
Major |
ha |
Liang Xie |
Liang Xie |
HDFS-7742 |
favoring decommissioning node for replication can cause a block to stay underreplicated for long periods |
Major |
namenode |
Nathan Roberts |
Nathan Roberts |
HDFS-7733 |
NFS: readdir/readdirplus return null directory attribute on failure |
Major |
nfs |
Arpit Agarwal |
Arpit Agarwal |
HDFS-7714 |
Simultaneous restart of HA NameNodes and DataNode can cause DataNode to register successfully with only one NameNode. |
Major |
datanode |
Chris Nauroth |
Vinayakumar B |
HDFS-7707 |
Edit log corruption due to delayed block removal again |
Major |
namenode |
Yongjun Zhang |
Yongjun Zhang |
HDFS-7610 |
Fix removal of dynamically added DN volumes |
Major |
datanode |
Lei (Eddy) Xu |
Lei (Eddy) Xu |
HDFS-7609 |
Avoid retry cache collision when Standby NameNode loading edits |
Critical |
namenode |
Carrey Zhan |
Ming Ma |
HDFS-7596 |
NameNode should prune dead storages from storageMap |
Major |
namenode |
Arpit Agarwal |
Arpit Agarwal |
HDFS-7587 |
Edit log corruption can happen if append fails with a quota violation |
Blocker |
namenode |
Kihwal Lee |
Jing Zhao |
HDFS-7575 |
Upgrade should generate a unique storage ID for each volume |
Critical |
. |
Lars Francke |
Arpit Agarwal |
HDFS-7552 |
change FsVolumeList toString() to fix TestDataNodeVolumeFailureToleration |
Major |
datanode, test |
Liang Xie |
Liang Xie |
HDFS-7533 |
Datanode sometimes does not shutdown on receiving upgrade shutdown command |
Major |
. |
Kihwal Lee |
Eric Payne |
HDFS-7503 |
Namenode restart after large deletions can cause slow processReport (due to logging) |
Major |
namenode |
Arpit Agarwal |
Arpit Agarwal |
HDFS-7489 |
Incorrect locking in FsVolumeList#checkDirs can hang datanodes |
Critical |
datanode |
Noah Lorang |
Noah Lorang |
HDFS-7470 |
SecondaryNameNode need twice memory when calling reloadFromImageFile |
Major |
namenode |
zhaoyunjiong |
zhaoyunjiong |
HDFS-7443 |
Datanode upgrade to BLOCKID_BASED_LAYOUT fails if duplicate block files are present in the same volume |
Blocker |
. |
Kihwal Lee |
Colin Patrick McCabe |
HDFS-7425 |
NameNode block deletion logging uses incorrect appender. |
Minor |
namenode |
Chris Nauroth |
Chris Nauroth |
HDFS-7263 |
Snapshot read can reveal future bytes for appended files. |
Major |
hdfs-client |
Konstantin Shvachko |
Tao Luo |
HDFS-7235 |
DataNode#transferBlock should report blocks that don’t exist using reportBadBlock |
Major |
datanode, namenode |
Yongjun Zhang |
Yongjun Zhang |
HDFS-7225 |
Remove stale block invalidation work when DN re-registers with different UUID |
Major |
namenode |
Zhe Zhang |
Zhe Zhang |
HDFS-7213 |
processIncrementalBlockReport performance degradation |
Critical |
namenode |
Daryn Sharp |
Eric Payne |
HDFS-7009 |
Active NN and standby NN have different live nodes |
Major |
datanode |
Ming Ma |
Ming Ma |
HDFS-4882 |
Prevent the Namenode’s LeaseManager from looping forever in checkLeases |
Critical |
hdfs-client, namenode |
Zesheng Wu |
Ravi Prakash |
HDFS-3443 |
Fix NPE when namenode transition to active during startup by adding checkNNStartup() in NameNodeRpcServer |
Major |
auto-failover, ha |
suja s |
Vinayakumar B |
MAPREDUCE-6361 |
NPE issue in shuffle caused by concurrent issue between copySucceeded() in one thread and copyFailed() in another thread on the same host |
Critical |
. |
Junping Du |
Junping Du |
MAPREDUCE-6324 |
Uber jobs fail to update AMRM token when it rolls over |
Blocker |
mr-am |
Jason Lowe |
Jason Lowe |
MAPREDUCE-6303 |
Read timeout when retrying a fetch error can be fatal to a reducer |
Blocker |
. |
Jason Lowe |
Jason Lowe |
MAPREDUCE-6300 |
Task list sort by task id broken |
Minor |
. |
Siqi Li |
Siqi Li |
MAPREDUCE-6238 |
MR2 can’t run local jobs with -libjars command options which is a regression from MR1 |
Critical |
mrv2 |
zhihai xu |
zhihai xu |
MAPREDUCE-6237 |
Multiple mappers with DBInputFormat don’t work because of reusing conections |
Major |
mrv2 |
Kannan Rajah |
Kannan Rajah |
MAPREDUCE-6230 |
MR AM does not survive RM restart if RM activated a new AMRM secret key |
Blocker |
mr-am |
Jason Lowe |
Jason Lowe |
MAPREDUCE-6166 |
Reducers do not validate checksum of map outputs when fetching directly to disk |
Major |
mrv2 |
Eric Payne |
Eric Payne |
MAPREDUCE-5649 |
Reduce cannot use more than 2G memory for the final merge |
Major |
mrv2 |
stanley shi |
Gera Shegalov |
YARN-4047 |
ClientRMService getApplications has high scheduler lock contention |
Major |
resourcemanager |
Jason Lowe |
Jason Lowe |
YARN-3999 |
RM hangs on draining events |
Major |
. |
Jian He |
Jian He |
YARN-3990 |
AsyncDispatcher may overloaded with RMAppNodeUpdateEvent when Node is connected/disconnected |
Critical |
resourcemanager |
Rohith Sharma K S |
Bibin A Chundatt |
YARN-3850 |
NM fails to read files from full disks which can lead to container logs being lost and other issues |
Blocker |
log-aggregation, nodemanager |
Varun Saxena |
Varun Saxena |
YARN-3832 |
Resource Localization fails on a cluster due to existing cache directories |
Critical |
nodemanager |
Ranga Swamy |
Brahma Reddy Battula |
YARN-3733 |
Fix DominantRC#compare() does not work as expected if cluster resource is empty |
Blocker |
resourcemanager |
Bibin A Chundatt |
Rohith Sharma K S |
YARN-3725 |
App submission via REST API is broken in secure mode due to Timeline DT service address is empty |
Blocker |
resourcemanager, timelineserver |
Zhijie Shen |
Zhijie Shen |
YARN-3585 |
NodeManager cannot exit on SHUTDOWN event triggered and NM recovery is enabled |
Critical |
. |
Peng Zhang |
Rohith Sharma K S |
YARN-3526 |
ApplicationMaster tracking URL is incorrectly redirected on a QJM cluster |
Major |
resourcemanager, webapp |
Weiwei Yang |
Weiwei Yang |
YARN-3493 |
RM fails to come up with error “Failed to load/recover state” when mem settings are changed |
Critical |
yarn |
Sumana Sathish |
Jian He |
YARN-3464 |
Race condition in LocalizerRunner kills localizer before localizing all resources |
Critical |
nodemanager |
zhihai xu |
zhihai xu |
YARN-3393 |
Getting application(s) goes wrong when app finishes before starting the attempt |
Critical |
timelineserver |
Zhijie Shen |
Zhijie Shen |
YARN-3369 |
Missing NullPointer check in AppSchedulingInfo causes RM to die |
Blocker |
resourcemanager |
Giovanni Matteo Fumarola |
Brahma Reddy Battula |
YARN-3287 |
TimelineClient kerberos authentication failure uses wrong login context. |
Major |
. |
Jonathan Eagles |
Daryn Sharp |
YARN-3267 |
Timelineserver applies the ACL rules after applying the limit on the number of records |
Major |
. |
Prakash Ramachandran |
Chang Li |
YARN-3251 |
Fix CapacityScheduler deadlock when computing absolute max avail capacity (short term fix for 2.6.1) |
Blocker |
. |
Jason Lowe |
Craig Welch |
YARN-3242 |
Asynchrony in ZK-close can lead to ZKRMStateStore watcher receiving events for old client |
Critical |
resourcemanager |
zhihai xu |
zhihai xu |
YARN-3239 |
WebAppProxy does not support a final tracking url which has query fragments and params |
Major |
. |
Hitesh Shah |
Jian He |
YARN-3238 |
Connection timeouts to nodemanagers are retried at multiple levels |
Blocker |
. |
Jason Lowe |
Jason Lowe |
YARN-3231 |
FairScheduler: Changing queueMaxRunningApps interferes with pending jobs |
Critical |
. |
Siqi Li |
Siqi Li |
YARN-3227 |
Timeline renew delegation token fails when RM user’s TGT is expired |
Critical |
. |
Jonathan Eagles |
Zhijie Shen |
YARN-3222 |
RMNodeImpl#ReconnectNodeTransition should send scheduler events in sequential order |
Critical |
resourcemanager |
Rohith Sharma K S |
Rohith Sharma K S |
YARN-3207 |
secondary filter matches entites which do not have the key being filtered for. |
Major |
timelineserver |
Prakash Ramachandran |
Zhijie Shen |
YARN-3103 |
AMRMClientImpl does not update AMRM token properly |
Blocker |
client |
Jason Lowe |
Jason Lowe |
YARN-3094 |
reset timer for liveness monitors after RM recovery |
Major |
resourcemanager |
Jun Gong |
Jun Gong |
YARN-3055 |
The token is not renewed properly if it’s shared by jobs (oozie) in DelegationTokenRenewer |
Blocker |
security |
Yi Liu |
Daryn Sharp |
YARN-3024 |
LocalizerRunner should give DIE action when all resources are localized |
Major |
nodemanager |
Chengbing Liu |
Chengbing Liu |
YARN-2997 |
NM keeps sending already-sent completed containers to RM until containers are removed from context |
Major |
nodemanager |
Chengbing Liu |
Chengbing Liu |
YARN-2992 |
ZKRMStateStore crashes due to session expiry |
Blocker |
resourcemanager |
Karthik Kambatla |
Karthik Kambatla |
YARN-2978 |
ResourceManager crashes with NPE while getting queue info |
Critical |
. |
Jason Tufo |
Varun Saxena |
YARN-2964 |
RM prematurely cancels tokens for jobs that submit jobs (oozie) |
Blocker |
resourcemanager |
Daryn Sharp |
Jian He |
YARN-2952 |
Incorrect version check in RMStateStore |
Major |
. |
Jian He |
Rohith Sharma K S |
YARN-2922 |
ConcurrentModificationException in CapacityScheduler’s LeafQueue |
Major |
capacityscheduler, resourcemanager, scheduler |
Jason Tufo |
Rohith Sharma K S |
YARN-2917 |
Potential deadlock in AsyncDispatcher when system.exit called in AsyncDispatcher#dispatch and AsyscDispatcher#serviceStop from shutdown hook |
Critical |
resourcemanager |
Rohith Sharma K S |
Rohith Sharma K S |
YARN-2910 |
FSLeafQueue can throw ConcurrentModificationException |
Major |
fairscheduler |
Wilfred Spiegelenburg |
Wilfred Spiegelenburg |
YARN-2906 |
CapacitySchedulerPage shows HTML tags for a queue’s Active Users |
Major |
capacityscheduler |
Jason Lowe |
Jason Lowe |
YARN-2905 |
AggregatedLogsBlock page can infinitely loop if the aggregated log file is corrupted |
Blocker |
. |
Jason Lowe |
Varun Saxena |
YARN-2894 |
When ACL’s are enabled, if RM switches then application can not be viewed from web. |
Major |
resourcemanager |
Rohith Sharma K S |
Rohith Sharma K S |
YARN-2890 |
MiniYarnCluster should turn on timeline service if configured to do so |
Major |
. |
Mit Desai |
Mit Desai |
YARN-2874 |
Dead lock in “DelegationTokenRenewer” which blocks RM to execute any further apps |
Blocker |
resourcemanager |
Naganarasimha G R |
Naganarasimha G R |
YARN-2865 |
Application recovery continuously fails with “Application with id already present. Cannot duplicate” |
Critical |
resourcemanager |
Rohith Sharma K S |
Rohith Sharma K S |
YARN-2856 |
Application recovery throw InvalidStateTransitonException: Invalid event: ATTEMPT_KILLED at ACCEPTED |
Critical |
resourcemanager |
Rohith Sharma K S |
Rohith Sharma K S |
YARN-2816 |
NM fail to start with NPE during container recovery |
Major |
nodemanager |
zhihai xu |
zhihai xu |
YARN-2637 |
maximum-am-resource-percent could be respected for both LeafQueue/User when trying to activate applications. |
Critical |
resourcemanager |
Wangda Tan |
Craig Welch |
YARN-2414 |
RM web UI: app page will crash if app is failed before any attempt has been created |
Major |
webapp |
Zhijie Shen |
Wangda Tan |
YARN-2340 |
NPE thrown when RM restart after queue is STOPPED. There after RM can not recovery application’s and remain in standby |
Critical |
resourcemanager, scheduler |
Nishan Shetty |
Rohith Sharma K S |
YARN-2246 |
Job History Link in RM UI is redirecting to the URL which contains Job Id twice |
Major |
webapp |
Devaraj K |
Devaraj K |