提交 · v5.5.2 · Archie Kelly / kafka

9月 17, 2020
- Set Confluent to 5.5.2, Kafka to 5.5.2. · 22ca4dd1
  由 Confluent Jenkins Bot 创作于 4年前
  
  2 个标签
  
  22ca4dd1
9月 16, 2020

Merge branch '2.5' of https://github.com/confluentinc/kafka into 2.5 · 7219f77f
由 Confluent Jenkins Bot 创作于 4年前

7219f77f

KAFKA-10292: Set min.insync.replicas to 1 of __consumer_offsets (#9286) · 64b24cef

由 Bruno Cadonna 创作于 4年前

The test StreamsBrokerBounceTest.test_all_brokers_bounce() fails on
2.5 because in the last stage of the test there is only one broker
left and the offset commit cannot succeed because the
min.insync.replicas of __consumer_offsets is set to 2 and acks is
set to all. This causes a time out and extends the closing of the
Kafka Streams client to beyond the duration passed to the close
method of the client.

This affects especially the 2.5 branch since there Kafka Streams
commits offsets for each task, i.e., close() needs to wait for the
timeout for each task. In 2.6 and trunk the offset commit is done
per thread, so close() does only need to wait for one time out per
stream thread.

I opened this PR on trunk, since the test could also become
flaky on trunk and we want to avoid diverging system tests across
branches.

A more complete solution would be to improve the test by defining
a better success criteria.

Reviewers: Guozhang Wang <wangguoz@gmail.com>

64b24cef

8月 20, 2020

Merge branch '2.5' of https://github.com/confluentinc/kafka into 2.5 · feb0dcf4
由 Confluent Jenkins Bot 创作于 4年前

feb0dcf4

KAFKA-10308: Fix flaky core/round_trip_fault_test.py (#9079) · 8bde3d47

由 Chia-Ping Tsai 创作于 4年前

Creating a topic may fail (due to timeout) in running system tests. However, `RoundTripWorker` does not ignore `TopicExistsException` which makes `round_trip_fault_test.py` be a flaky one.

More specifically, a network exception can cause the `CreateTopics` request to reach Kafka but Trogdor retry it
and hit a `TopicAlreadyExists` exception on the retry, failing the test.

Reviewers: Ismael Juma <ismael@juma.me.uk>

8bde3d47

SEC-1307: Backporting log4j migration to confluent-repackaged version (#401) · 5a7acd8b

由 Nitesh Mor 创作于 4年前

* MINOR: log4j migration to confluent repackaged version (#362)

Context: log4j v1 has reached end of life many years ago, and is affected by CVE-2019-17571
Confluent repackaged version of log4j fixes the security vulnerabilities.

Reviewers: Ismael Juma <ismael@juma.me.uk>, Jeff Kim <jeff.kim@confluent.io>

* SEC-1334: update confluent-log4j version (#384)

5a7acd8b

8月 19, 2020
- Merge branch '2.5' of https://github.com/confluentinc/kafka into 2.5 · 5262716a
  由 Confluent Jenkins Bot 创作于 4年前
  
  5262716a
8月 18, 2020

MINOR: Use new version of ducktape · dbe24f8c

由 Andrew Egelhofer 创作于 4年前

ducktape diff: https://github.com/confluentinc/ducktape/compare/v0.7.8...v0.7.9



- bcrypt (a dependency of ducktape) dropped Python2.7 support.
ducktape-0.7.9 now pins bcrypt to a Python2.7-supported version.

Author: Andrew Egelhofer <aegelhofer@confluent.io>

Reviewers: Dhruvil Shah <dhruvil@confluent.io>, Manikumar Reddy <manikumar.reddy@gmail.com>

Closes #9192 from andrewegel/trunk

(cherry picked from commit f6c26eaa)
Signed-off-by: Manikumar Reddy <manikumar.reddy@gmail.com>

dbe24f8c

8月 16, 2020
- Merge branch '2.5' of https://github.com/confluentinc/kafka into 2.5 · 7e394656
  由 Confluent Jenkins Bot 创作于 4年前
  
  7e394656
- KAFKA-10404; Use higher poll timeout to avoid rebalance in testCoordinatorFailover (#9183) · 786b885d
  由 Rajini Sivaram 创作于 4年前
  
  Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com>
  786b885d
- Merge branch '2.5' of https://github.com/confluentinc/kafka into 2.5 · 1eb3e1d9
  由 Confluent Jenkins Bot 创作于 4年前
  
  1eb3e1d9
- KAFKA-8033; Wait for NoOffsetForPartitionException in testFetchInvalidOffset (#9184) · 817f7915
  由 Rajini Sivaram 创作于 4年前
  
  Reviewers: Ismael Juma <ismael@juma.me.uk>
  817f7915
- KAFKA-9516; Increase timeout in testNonBlockingProducer to make it more reliable (#9181) · 6bb81ae0
  由 Rajini Sivaram 创作于 4年前
  
  Reviewers: Ismael Juma <ismael@juma.me.uk>
  6bb81ae0
8月 13, 2020
- Merge branch '2.5' of https://github.com/confluentinc/kafka into 2.5 · f80e859b
  由 Confluent Jenkins Bot 创作于 4年前
  
  f80e859b
- MINOR: Ensure same version of scala library is used for compile and at runtime (#9168) · 1560effc
  由 Rajini Sivaram 创作于 4年前
  
  Reviewers: Ismael Juma <ismael@juma.me.uk>
  1560effc
- KAFKA-9066: Retain metrics for failed tasks (#8502) (#8854) · 89978e1d
  由 Randall Hauch 创作于 4年前
  
  Author: Chris Egerton <chrise@confluent.io> Reviewers: Nigel Liang <nigel@nigelliang.com>, Konstantine Karantasis <konstantine@confluent.io>, Randall Hauch <rhauch@gmail.com>
  89978e1d
8月 11, 2020

Merge remote-tracking branch 'apache/2.5' into sync-upstream-11-aug-2020 · 16daa490
由 Rajini Sivaram 创作于 4年前

16daa490

MINOR: Ensure a single version of scala-library is used (#9155) · f3257cf7

由 Stanislav Kozlovski 创作于 4年前

This patch ensures we use a force resolution strategy for the scala-library dependency.

I've tested this locally and saw a difference in the output.

With the change (using 2.4 and the jackson library 2.10.5):
```
./core/build/dependant-libs-2.12.10/scala-java8-compat_2.12-0.9.0.jar
./core/build/dependant-libs-2.12.10/scala-collection-compat_2.12-2.1.2.jar
./core/build/dependant-libs-2.12.10/scala-reflect-2.12.10.jar
./core/build/dependant-libs-2.12.10/scala-logging_2.12-3.9.2.jar
./core/build/dependant-libs-2.12.10/scala-library-2.12.10.jar
```

Without (using 2.4 and the jackson library 2.10.5):
```
 find . -name 'scala*.jar'
./core/build/dependant-libs-2.12.10/scala-java8-compat_2.12-0.9.0.jar
./core/build/dependant-libs-2.12.10/scala-collection-compat_2.12-2.1.2.jar
./core/build/dependant-libs-2.12.10/scala-reflect-2.12.10.jar
./core/build/dependant-libs-2.12.10/scala-logging_2.12-3.9.2.jar
./core/build/dependant-libs-2.12.10/scala-library-2.12.12.jar
```

Reviewers: Ismael Juma <ismael@juma.me.uk>

f3257cf7

8月 10, 2020
- MINOR: Update 2.5 branch version to 2.5.2-SNAPSHOT · cd05b69c
  由 John Roesler 创作于 4年前
  
  cd05b69c
- Merge tag '2.5.1' into 2.5 · 9b19dbed
  由 John Roesler 创作于 4年前
  
  Tag for 2.5.1 release
  9b19dbed
8月 01, 2020

KAFKA-10173: Use SmokeTest for upgrade system tests (#8938) (#8993) · 2601c672

由 John Roesler 创作于 4年前

Replaces the previous upgrade test's trivial Streams app
with the commonly used SmokeTest, exercising many more
features. Also adjust the test matrix to test upgrading
from each released version since 2.0 to the current branch.

Reviewers: Guozhang Wang <wangguoz@gmail.com>

2601c672

7月 29, 2020

KAFKA-10224: Update jersey license from CDDL to EPLv2 (#9089) · 1ff25663

由 Rens Groothuijsen 创作于 4年前

Update jersey license from CDDL to EPLv2

Author: Rens Groothuijsen <l.groothuijsen@alumni.maastrichtuniversity.nl>
Reviewer: Randall Hauch <rhauch@gmail.com>

1ff25663

7月 28, 2020

MINOR: Remove staticmethod tag to be able to use logger of instance (#9086) · 2de640c7

由 Bruno Cadonna 创作于 4年前

A system test failed with the following error: global name 'self' is not defined

The reason was that `self` was accessed to log a message in a static method. This commit makes the method an instance method.

Reviewer: Matthias J. Sax <matthias@confluent.io>

2de640c7

7月 26, 2020

KAFKA-10158: Fix flaky testDescribeUnderReplicatedPartitionsWhenReassignmentIsInProgress (#9022) · e9b58cf7

由 Brian Byrne 创作于 4年前

Set `replica.fetch.max.bytes` to `1` and produce multiple record batches to allow
for throttling to take place. This helps avoid a race condition where the
reassignment would complete more quickly than expected causing an
assertion to fail some times.

Reviewers: Lucas Bradstreet <lucas@confluent.io>, Jason Gustafson <jason@confluent.io>, Chia-Ping Tsai <chia7712@gmail.com>, Ismael Juma <ismael@juma.me.uk>

e9b58cf7

7月 24, 2020
- Bump version to 2.5.1 · 0efa8fb0
  由 John Roesler 创作于 4年前
  
  2 个标签
  
  0efa8fb0
7月 23, 2020

MINOR: Revert "KAFKA-10191 fix flaky StreamsOptimizedTest (#8913)" (#9053) · 7f9187fe

由 John Roesler 创作于 4年前

This reverts commit edce73fe,
which depends on features of the reset tool that are not present in 2.5.

Reviewers: A. Sophie Blee-Goldman <sophie@confluent.io>, Boyang Chen <boyang@confluent.io>, Chia-Ping Tsai <chia7712@gmail.com>

7f9187fe

7月 20, 2020

KAFKA-10295: Wait for connector recovery in test_bounce (#9043) · ef49076b
由 Greg Harris 创作于 4年前
```
Signed-off-by: Greg Harris <gregh@confluent.io>
```
ef49076b

KAFKA-10286: Connect system tests should wait for workers to join group (#9040) · f93056ea

由 Greg Harris 创作于 4年前

Currently, the system tests `connect_distributed_test` and `connect_rest_test` only wait for the REST api to come up.
The startup of the worker includes an asynchronous process for joining the worker group and syncing with other workers.
There are some situations in which this sync takes an unusually long time, and the test continues without all workers up.
This leads to flakey test failures, as worker joins are not given sufficient time to timeout and retry without waiting explicitly.

This changes the `ConnectDistributedTest` to wait for the Joined group message to be printed to the logs before continuing with tests. I've activated this behavior by default, as it's a superset of the checks that were performed by default before.

This log message is present in every version of DistributedHerder that I could find, in slightly different forms, but always with `Joined group` at the beginning of the log message. This change should be safe to backport to any branch.

Signed-off-by: Greg Harris <gregh@confluent.io>
Author: Greg Harris <gregh@confluent.io>
Reviewer: Randall Hauch <rhauch@gmail.com>

f93056ea

7月 18, 2020
- set dev version to 2.5.1 (#9035) · bac44009
  由 John Roesler 创作于 4年前
  
  Reviewers: David Arthur <mumrah@gmail.com>
  bac44009
7月 15, 2020
- ST-3402: Refactored Jenkinsfile to get secrets from Vault instead of Jenkins... · 21e17cd1
  由 elismaga 创作于 4年前
  
  ST-3402: Refactored Jenkinsfile to get secrets from Vault instead of Jenkins credential store (#361)
  21e17cd1
7月 11, 2020

KAFKA-10191 fix flaky StreamsOptimizedTest (#8913) · edce73fe

由 Chia-Ping Tsai 创作于 4年前

Call KafkaStreams#cleanUp to reset local state before starting application up the second run.

Reviewers: A. Sophie Blee-Goldman <sophie@confluent.io>, Boyang Chen <boyang@confluent.io>, John Roesler <john@confluent.io>

edce73fe

7月 10, 2020

MINOR: prune the metadata upgrade test matrix (#8971) · 8d810bf1

由 John Roesler 创作于 4年前

Most of the values in the metadata upgrade test matrix are just testing
the upgrade/downgrade path between two previous releases. This is
unnecessary. We run the tests for all supported branches, so what we
should test is the up-/down-gradability of released versions with respect
to the current branch.

Reviewers: Guozhang Wang <wangguoz@gmail.com>

8d810bf1

7月 09, 2020

KAFKA-10134: Use long poll if we do not have fetchable partitions (#8934) · 48446042

由 Guozhang Wang 创作于 4年前

The intention of using poll(0) is to not block on rebalance but still return some data; however, `updateAssignmentMetadataIfNeeded` have three different logic: 1) discover coordinator if necessary, 2) join-group if necessary, 3) refresh metadata and fetch position if necessary. We only want to make 2) to be non-blocking but not others, since e.g. when the coordinator is down, then heartbeat would expire and cause the consumer to fetch with timeout 0 as well, causing unnecessarily high CPU.

Since splitting this function is a rather big change to make as a last minute blocker fix for 2.6, so I made a smaller change to make updateAssignmentMetadataIfNeeded has an optional boolean flag to indicate if 2) above should wait until either expired or complete, otherwise do not wait on the join-group future and just poll with zero timer.

Reviewers: Jason Gustafson <jason@confluent.io>

48446042

7月 08, 2020

KAFKA-10221: Backport fix for KAFKA-9603 to 2.5 (#8987) · 17f9f3ae

由 Bruno Cadonna 创作于 4年前

KAFKA-9603 reports that the number of open files keeps increasing
in RocksDB. The reason is that bulk loading is turned on but
never turned off in segmented state stores for standby tasks.

This bug was fixed in 2.6 through PR #8661 by using code
that is not present in 2.5. So cherry-picking was not possible.

Reviewers: John Roesler <vvcephei@apache.org>

17f9f3ae

7月 07, 2020

KAFKA-10239: Make GroupInstanceId ignorable in DescribeGroups (#8989) · c2f26a28

由 Boyang Chen 创作于 4年前

This is a bug fix for older admin clients using static membership and call DescribeGroups.
By making groupInstanceId ignorable, it would not crash upon handling the response.

Added test coverages for DescribeGroups, and some side cleanups.

Reviewers: Jason Gustafson <jason@confluent.io>

c2f26a28

7月 02, 2020

MINOR: Update Netty to 4.1.50.Final (#8972) · 39769643

由 Ismael Juma 创作于 4年前

This includes important fixes. Netty is required by ZooKeeper if TLS is
enabled.

I verified that the netty jars were changed from 4.1.48 to 4.1.50 with
this PR, `find . -name '*netty*'`:

```text
./core/build/dependant-libs-2.13.3/netty-handler-4.1.50.Final.jar
./core/build/dependant-libs-2.13.3/netty-transport-native-epoll-4.1.50.Final.jar
./core/build/dependant-libs-2.13.3/netty-codec-4.1.50.Final.jar
./core/build/dependant-libs-2.13.3/netty-transport-native-unix-common-4.1.50.Final.jar
./core/build/dependant-libs-2.13.3/netty-transport-4.1.50.Final.jar
./core/build/dependant-libs-2.13.3/netty-resolver-4.1.50.Final.jar
./core/build/dependant-libs-2.13.3/netty-buffer-4.1.50.Final.jar
./core/build/dependant-libs-2.13.3/netty-common-4.1.50.Final.jar
```

Note that the previous netty exclude no longer worked since we upgraded
to ZooKeeper 3.5.x as it switched to Netty 4 which has different module names.
Also, the Netty dependency is needed by ZooKeeper for TLS support so we
cannot exclude it.

Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com>

39769643

6月 30, 2020

KAFKA-10212: Describing a topic with the TopicCommand fails if unauthorised to... · e643c364

由 David Jacot 创作于 4年前

KAFKA-10212: Describing a topic with the TopicCommand fails if unauthorised to use ListPartitionReassignments API

Since https://issues.apache.org/jira/browse/KAFKA-8834, describing topics with the TopicCommand requires privileges to use ListPartitionReassignments or fails to describe the topics with the following error:

> Error while executing topic command : Cluster authorization failed. 

This is a quite hard restriction has most of the secure clusters do not authorize non admin members to access ListPartitionReassignments.

This patch catches the `ClusterAuthorizationException` exception and gracefully fails back. We already do this when the API is not available so it remains consistent.

Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com>

e643c364

KAFKA-9509: Increase timeout when consuming records to fix flaky test in MM2 (#8894) · 06341cfb

由 showuon 创作于 4年前

A simple increase in the timeout of the consumer that verifies that records have been replicated seems to fix the integration tests in `MirrorConnectorsIntegrationTest` that have been failing more often recently. 

Reviewers: Ryanne Dolan <ryannedolan@gmail.com>, Sanjana Kaundinya <skaundinya@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>, Konstantine Karantasis <konstantine@confluent.io>

06341cfb

Bump CCS version to 5.5.2 · c25955ca
由 Andrew Egelhofer 创作于 4年前

c25955ca

6月 27, 2020

KAFKA-10173: Fix suppress changelog binary schema compatibility (#8905) · 1d07fb8c

由 John Roesler 创作于 4年前

We inadvertently changed the binary schema of the suppress buffer changelog
in 2.4.0 without bumping the schema version number. As a result, it is impossible
to upgrade from 2.3.x to 2.4+ if you are using suppression.

* Refactor the schema compatibility test to use serialized data from older versions
as a more foolproof compatibility test.
* Refactor the upgrade system test to use the smoke test application so that we
actually exercise a significant portion of the Streams API during upgrade testing
* Add more recent versions to the upgrade system test matrix
* Fix the compatibility bug by bumping the schema version to 3

Reviewers: Chia-Ping Tsai <chia7712@gmail.com>, Guozhang Wang <wangguoz@gmail.com>

1d07fb8c