gpt4 book ai didi

java - HBase Java Client批处理/放入CDH 4.6的速度很慢

转载 作者:行者123 更新时间:2023-12-02 21:46:44 25 4
gpt4 key购买 nike

我正在使用HBase来存储CDH4(当前为4.5)管理的应用程序日志,并且升级到cdh 4.6(与4.7相同)后,插入速度非常慢。我发现客户端正在连接到regionserver并立即关闭连接(使用CDh 4.5我没有遇到相同的问题)

RegionServer日志:

13:46:08,428 INFO org.apache.zookeeper.ZooKeeper: Initiating client connection, connectString=ZK03:2181,ZK02:2181,ZK01:2181 sessionTimeout=60000 watcher=hconnection
13:46:08,429 INFO org.apache.hadoop.hbase.zookeeper.RecoverableZooKeeper: The identifier of this process is 19573@NODE01
13:46:08,429 INFO org.apache.zookeeper.ClientCnxn: Opening socket connection to server ZK03/10.1.243.170:2181. Will not attempt to authenticate using SASL (java.lang.SecurityException: Unable to locate a login configuration)
13:46:08,429 INFO org.apache.zookeeper.ClientCnxn: Socket connection established to ZK03/10.1.243.170:2181, initiating session
13:46:08,431 INFO org.apache.zookeeper.ClientCnxn: Session establishment complete on server ZK03/10.1.243.170:2181, sessionid = 0x146a9fec35171f0, negotiated timeout = 60000
13:46:08,538 INFO org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation: Closed zookeeper sessionid=0x146a9fec35171f0
13:46:08,540 INFO org.apache.zookeeper.ZooKeeper: Session: 0x146a9fec35171f0 closed
13:46:08,540 INFO org.apache.zookeeper.ClientCnxn: EventThread shut down

客户端连接类:
private void initConnection(Configuration hConf) throws RuntimeException {
try {
//HConnectionManager.create(hConf);
hConnection = HConnectionManager.createConnection(hConf);
} catch (ZooKeeperConnectionException e) {
logAndThrow("Failed to init connection " + e.getMessage());
}
}

public Connection(Configuration hConf) {
initConnection(hConf);
}

public void closeConnection() throws IOException {
hConnection.close();
}

public HTableInterface getHTableInterface(String tableName) throws IOException {
HTableInterface htable = hConnection.getTable(tableName);
htable.setAutoFlush(false, true);
htable.setWriteBufferSize(1024*1024*12);
return htable;
}

进口:
Put put = new Put(rowKey.get(), tsWhole);
mainTableBuffer.add(put);
if(cfg_.maxBatchBufferSize <= mainTableBuffer.size()) {
mainTableInterface_.batch(mainTableBuffer);
mainTableBuffer.clear();
}

最佳答案

看来我已经找到问题了。创建辅助索引时它在协处理器中。
这是用于插入secondaryIndexTable的实际代码

 public void postBatchMutate(ObserverContext<RegionCoprocessorEnvironment> c, MiniBatchOperationInProgress<Pair<Mutation, Integer>> miniBatchOp) throws IOException {

HTableInterface searchTableInterface = c.getEnvironment().getTable(tableName);
try {
searchTableInterface.batch(mutationsBuffer);
} catch (InterruptedException e) {
logger.error("Caught exception while executing batch on table " + currSearchTName, e);
} finally {
searchTableInterface.close();
}
}

问题似乎是使用环境连接进行插入。启动时创建连接
hConnection = HConnectionManager.createConnection(hConf);

并且在postBarchMutate中用于获取表
HTableInterface htable = hConnection.getTable(tableName);

现在可以使用,但是仍然不知道为什么使用环境连接是错误的,为什么连接总是关闭

关于java - HBase Java Client批处理/放入CDH 4.6的速度很慢,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/24631708/

25 4 0
Copyright 2021 - 2024 cfsdn All Rights Reserved 蜀ICP备2022000587号
广告合作:1813099741@qq.com 6ren.com