Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Release 2.50.0 lyft #65

Open
wants to merge 10,000 commits into
base: master
Choose a base branch
from
Open
Show file tree
Hide file tree
Changes from 250 commits
Commits
Show all changes
10000 commits
Select commit Hold shift + click to select a range
e03a14c
Update stale_dataflow_prebuilt_image_cleaner.sh to clean containers u…
celeste-zeng Aug 2, 2023
8ef0cc1
Use correct concurrency condition to avoid incorrect interrupts (#27815)
damccorm Aug 2, 2023
afa4371
Fix bug causing elapsed time to be negative (#27812)
connorbrett Aug 2, 2023
4d49529
Fix gradle command not found error for Python ValidatesContainer Data…
celeste-zeng Aug 2, 2023
427e0a1
[#24789] Mention prism is the new default Go SDK runner in CHANGES.md…
lostluck Aug 2, 2023
13dd74c
Update REVIEWERS.yml (#27820)
manavgarg Aug 2, 2023
0bd6a21
Implement implicit config as a preparsing option.
robertwb Aug 2, 2023
a606c53
Nest all transform-specific arguments in an explicit 'config' parameter.
robertwb Aug 2, 2023
55f4133
Add Neo4j Resource manager. (#27787)
pranavbhandari24 Aug 3, 2023
553ac16
Add flags to build multiarch Java SDK containers and test (#27760)
celeste-zeng Aug 3, 2023
9c2ce6c
Add Splunk Resource manager to manage Splunk resources in tests. (#27…
pranavbhandari24 Aug 3, 2023
324366f
refactor dicomio_test.py (#27819)
svetakvsundhar Aug 3, 2023
4199f44
Github Workflow Replacement for Jenkins Job, beam_PreCommit_GoPortabl…
magicgoody Aug 3, 2023
66dd8b3
Update `it` modules. (#27817)
pranavbhandari24 Aug 3, 2023
0c15645
Explicitly load go license container locally (#27824)
celeste-zeng Aug 3, 2023
42cdd61
Merge pull request #27774: Override encoding positions for nested sch…
reuvenlax Aug 3, 2023
6d35cda
Added GitHub Workflow Replacement for Jenkins job_PreCommit_Kotlin_Ex…
aleksandr-dudko Aug 3, 2023
fd0e22a
merge toBeamRowFieldValue into toBeamValue for null value support (#2…
jon-esperanza Aug 3, 2023
bb31077
[Security] Upgrade snappy-java to 1.1.10.3 (#27827)
bvolpato Aug 3, 2023
fede6d4
Bump go.mongodb.org/mongo-driver from 1.12.0 to 1.12.1 in /sdks (#27826)
dependabot[bot] Aug 3, 2023
966bb2f
Upgrade Kryo coder from Kryo v4 to Kryo v5 (#27803)
Gadiguibou Aug 3, 2023
b817b5e
Merge pull request #27823 Nest all transform-specific arguments in an…
robertwb Aug 3, 2023
39d3c6f
[Python] Add Hugging Face PRs to CHANGES.md (#27829)
riteshghorse Aug 3, 2023
5d32ae2
[BEAM-5559] Upgrade version of guava to 32.1.1-jre for all vendored a…
hlteoh37 Aug 3, 2023
ef25972
Added GitHub Workflow Replacement for Jenkins job_PreCommit_Portable_…
aleksandr-dudko Aug 3, 2023
a0d73b8
Add model manager class to hold many models at once (#27603)
damccorm Aug 3, 2023
f0a172b
Bump actions/setup-go from 3 to 4 (#27678)
dependabot[bot] Aug 3, 2023
ef2ca7a
Fix comment trigger against PR branch (#27758)
andreydevyatkin Aug 3, 2023
1372572
Migrate "beam_PreCommit_Java_Examples_Dataflow_Java11" Jenkins job to…
andreydevyatkin Aug 3, 2023
ead1cef
defaults and programming guide (#27832)
lostluck Aug 3, 2023
d8cab61
Bump golang.org/x/net from 0.12.0 to 0.13.0 in /sdks (#27798)
dependabot[bot] Aug 3, 2023
30ae912
Add buildx flag to pick up the load flag. (#27833)
celeste-zeng Aug 3, 2023
2151b73
Migrate "beam_PreCommit_Java_Spark3_Versions" Jenkins job to Github A…
andreydevyatkin Aug 3, 2023
86426c8
Merge pull request #27835: Enable combiner lifting for the Group tran…
reuvenlax Aug 3, 2023
9080909
Merge pull request #27627: Fix Neo4jIO result consumption
chamikaramj Aug 3, 2023
cc80490
Use a lighter size estimation for StateFetchingIterators (use number …
bvolpato Jul 11, 2023
771d772
Fix weight tests to consider the element overhead of 8
bvolpato Aug 3, 2023
f7bdf0e
Changes based on the recent code review
bvolpato Aug 4, 2023
d108da4
Push multi-arch Python SDK containers built by GH tests to GCR and re…
celeste-zeng Aug 4, 2023
488279c
Bump commons-io to 2.13.0 (#27776)
bvolpato Aug 4, 2023
b40ca79
propagate fewKeys to hot-key path
reuvenlax Aug 1, 2023
1f90261
Merge pull request #27845: propagate fewKeys to hot-key path
reuvenlax Aug 4, 2023
5289d6b
Move changes from #27635 to the right section in CHANGES.md
Gadiguibou Aug 4, 2023
f953d71
Bump github.com/fsouza/fake-gcs-server from 1.47.2 to 1.47.3 in /sdks…
dependabot[bot] Aug 4, 2023
7e365d7
Update settings.gradle.kts to include project `arm` (#27831)
celeste-zeng Aug 4, 2023
887560b
Merge pull request #27849: Move changes from #27635 to the right sect…
aromanenko-dev Aug 4, 2023
5541412
Setup Java Flink runner PreCommit (#27721)
Abacn Aug 4, 2023
1877c88
Remove reconciled partitions from list of missing partitions (#27755)
tonytanger Aug 4, 2023
408d766
[Python]Don't stage beam SDK in Stager (#27436)
AnandInguva Aug 4, 2023
76290b8
Add logging at error to Bootloader logger (#27813)
jrmccluskey Aug 4, 2023
fb6d167
Allow usage of GCThrashing settings on Fn Harness (#27850)
bvolpato Aug 4, 2023
cff332d
Change repo to push, make steps for diff versions run in parallel. (#…
celeste-zeng Aug 4, 2023
8cfff66
Rebuild generated protobuf code to reflect fnapi proto update for dat…
zechenj18 Aug 4, 2023
4c40276
Merge pull request #27461 Light cache weigher for StateFetchingIterat…
robertwb Aug 4, 2023
8bd2c97
Add javaHome for building Java 11 and 17 multi-arch containers. (#27861)
celeste-zeng Aug 4, 2023
c654dc0
Add setup Java for different Java versions. (#27863)
celeste-zeng Aug 5, 2023
7754ab9
Fix google-api-services-bigquery to match new google-cloud-bigquery (…
bvolpato Aug 5, 2023
9ca6c33
Fix broken url for 'Tour of Beam' (#27412)
acejune Aug 7, 2023
03f8830
Bump golang.org/x/oauth2 from 0.10.0 to 0.11.0 in /sdks (#27870)
dependabot[bot] Aug 7, 2023
4bae68c
[AWS] Adjust interface of SqsIO.writeBatches to make entryMapper opti…
Aug 7, 2023
73e015b
Update versions based on libraries-bom, and add comment to keep consi…
bvolpato Aug 7, 2023
ee97da5
Ensure that if grpc stream terminates that DirectStreamObserver does …
scwhittle Aug 7, 2023
347f84c
Fix action run on base branch (#27860)
Abacn Aug 7, 2023
23d44a4
Fix routing header issue in Firestore Beam connector (#27858)
pl04351820 Aug 7, 2023
3b112e8
[#24789] Remainder of changes from #27550. (#27822)
lostluck Aug 7, 2023
d227b81
Bump beam-master legacy container to beam-master-20230807 (#27881)
bvolpato Aug 7, 2023
450d801
Update example for BundleProcessResponse (#27834)
hjtran Aug 7, 2023
7726283
Also use config pattern for yaml providers.
robertwb Aug 7, 2023
95a5697
Move code contribution guide into repo (#27883)
damccorm Aug 7, 2023
29ea352
Add an option to the FnApi harness PIPELINE_OPTIONS_FILE that overrid…
byronellis Aug 8, 2023
0f847ef
adds check to accomodate startswith filter on matrix jobs (#27893)
volatilemolotov Aug 8, 2023
34cabc0
update all jobs with new name and phrase matrix, required for rerun (…
volatilemolotov Aug 8, 2023
7824f2c
Add integration tests for Storage Write API schema update feature (#2…
ahmedabu98 Aug 8, 2023
014a193
Replace StorageV1 client with GCS client (#27878)
BjornPrime Aug 8, 2023
07f32a2
Add message for new prism users. (#27886)
lostluck Aug 8, 2023
1b2c38d
[prism] Avoid prism locking on data send on ProcessBundle failure. (#…
lostluck Aug 8, 2023
381e314
Temp changes to see the failure of validatesContainer ARM (#27885)
celeste-zeng Aug 8, 2023
8b42b8d
reset permission on .config (#27894)
volatilemolotov Aug 8, 2023
052202f
Use guave 32.1.2
Abacn Aug 3, 2023
c29485f
Checkstyle and errorprone fixes for new guava version
Abacn Aug 3, 2023
fc88537
Bump github.com/aws/aws-sdk-go-v2/credentials in /sdks (#27890)
dependabot[bot] Aug 8, 2023
89616ef
add Pod UID to environment (#27898)
volatilemolotov Aug 8, 2023
5815afb
Add logs when staging file/artifact can not be found (#27882)
bvolpato Aug 8, 2023
d0843a0
Add PubsubWriteIT tests, with validations to bounded writes (#27286)
bvolpato Aug 8, 2023
cfd7335
Fix formatting for Python ValidatesContainer Dataflow ARM (#27902)
celeste-zeng Aug 8, 2023
0571d5f
Remove extraneous warnings from notebook output (#27904)
damccorm Aug 8, 2023
c51aa85
Add `credentials` as a parameter to builder in Resource managers. (#2…
pranavbhandari24 Aug 8, 2023
4681486
Set connection attributes in SingleStoreIO (#27487)
AdalbertMemSQL Aug 8, 2023
e4b7461
Add MLTransform and Vertex Private endpoints to CHANGES.md (#27903)
damccorm Aug 8, 2023
bc42bb9
Create MemoryMonitorOptions class for GC thrashing parameter (#27867)
bvolpato Aug 8, 2023
71706b2
Automatically clean up old metrics containers (#27811)
damccorm Aug 8, 2023
4eed089
Implement go xlang wrapper for reading from Bigtable (#27778)
steveniemitz Aug 8, 2023
9d7a937
[Python] Remove harmless warnings from example notebooks (#27901)
riteshghorse Aug 8, 2023
1b6e55b
Merge pull request #27895 from Abacn/vendorguava
Abacn Aug 8, 2023
266ae5b
Only include WordCount for PostCommit Java Examples Dataflow ARM. (#2…
celeste-zeng Aug 8, 2023
4ef5366
Bump github.com/testcontainers/testcontainers-go in /sdks (#27873)
dependabot[bot] Aug 8, 2023
eba3de3
Bump google.golang.org/api from 0.134.0 to 0.135.0 in /sdks (#27889)
dependabot[bot] Aug 8, 2023
36f4bd8
Made changes to scripts to release multi-arch Beam SDK containers (#2…
celeste-zeng Aug 8, 2023
215f1d0
Set non-playground Go versions to use 1.21.0 (#27900)
lostluck Aug 8, 2023
3d02b3b
Replace remaining old guava namespace happens after #27895 (#27907)
Abacn Aug 8, 2023
eb7fd7e
Add PreCommit Java IO AWS job (#27846)
Amar3tto Aug 8, 2023
8489bbc
[yaml] Fix BigQueryIO for yaml pipeline compatibility (#27809)
Polber Aug 8, 2023
a0a297d
Try a second time to delete old images (#27905)
damccorm Aug 8, 2023
5a7e666
Update changelog to mention new SqsIO.writeBatches (relates to #21429).
mosche Aug 9, 2023
0019cd5
Merge pull request #27921: Update changelog to mention new SqsIO.writ…
aromanenko-dev Aug 9, 2023
a51e38c
Add PreCommit Java IO AWS2 job (#27877)
Amar3tto Aug 9, 2023
0837428
update runner images to 8b42b8dc (#27922)
volatilemolotov Aug 9, 2023
2cc3ebc
Bump github.com/tetratelabs/wazero from 1.3.1 to 1.4.0 in /sdks (#27913)
dependabot[bot] Aug 9, 2023
b6849ba
Bump github.com/aws/aws-sdk-go-v2/service/s3 in /sdks (#27915)
dependabot[bot] Aug 9, 2023
4e36b55
Use grafana 10.0.3 (#27909)
liferoad Aug 9, 2023
863e93f
Bump github.com/aws/aws-sdk-go-v2/config in /sdks (#27912)
dependabot[bot] Aug 9, 2023
8e67c70
Give liferoad issue triage permission (#27927)
damccorm Aug 9, 2023
4122e96
expose path to gcloud folder in setup action (#27923)
volatilemolotov Aug 9, 2023
aee0eb7
Add GoogleAdsIO for reading from Google Ads (#27681)
sjvanrossum Aug 9, 2023
3ffa6f9
Revert "expose path to gcloud folder in setup action (#27923)" (#27935)
damccorm Aug 9, 2023
0a8e3cc
Unpin google-cloud-storage. (#27931)
tvalentyn Aug 9, 2023
7d56d01
Fix license script install pyyml in Py311 (#27934)
Abacn Aug 9, 2023
fd6040d
Clean up image cleaner comments (#27933)
damccorm Aug 9, 2023
0e8e45c
Fixed dockstring by removing unknown var blocking (#27862)
ArjunGHUB Aug 9, 2023
357c3e6
Clarify timestamp behavior for BatchElements (#27937)
damccorm Aug 9, 2023
f07fe40
Make all Beam released container images multi-arch. (#27936)
celeste-zeng Aug 9, 2023
6b2d395
Refactor yaml providers for more eager and explicit errors.
robertwb Aug 9, 2023
36ddc1d
Bump Java containers to beam-master-20230809 (#27938)
Abacn Aug 9, 2023
0a7b999
Exclude StorageApiSinkSchemaUpdate integration tests from dataflow po…
Abacn Aug 9, 2023
c65c703
Rename run_full_pipeline to run_portable_pipeline.
robertwb Aug 9, 2023
051872e
Add mypy typing to registration dict.
robertwb Aug 9, 2023
1755dd5
Merge pull request #27512 Remove more legacy Runner v1 cruft.
robertwb Aug 9, 2023
49ed58f
Remove python examples from codecov (#27940)
damccorm Aug 10, 2023
6f60a6c
Fix stateful processing using direct runner with type checks enabled …
sadovnychyi Aug 10, 2023
71e060c
Update how we specify artifact modes (#27939)
damccorm Aug 10, 2023
b5f260d
add expose gcloud kubelet path to setup action (#27952)
volatilemolotov Aug 10, 2023
e3c58d4
Pre 2.50.0 Cut image requirement update. (#27947)
lostluck Aug 10, 2023
f392a21
Bump github.com/fsouza/fake-gcs-server from 1.47.3 to 1.47.4 in /sdks…
dependabot[bot] Aug 10, 2023
ba04f60
Merge pull request #27866: Allow writing protos directly to the stora…
reuvenlax Aug 10, 2023
fe974de
Fix setup action checkout (#27954)
damccorm Aug 10, 2023
b34abc4
Refactor terraform GKE and add bitnami kafka module (#27949)
damondouglas Aug 10, 2023
c994fdd
Update BOM (#27956)
reuvenlax Aug 10, 2023
47e8de3
Merge pull request #27884 Also use config pattern for yaml providers.
robertwb Aug 10, 2023
8caa7ce
Set Dataflow container to release version.
Aug 10, 2023
50e2495
Revert #27878 (#27968) (#27970)
damccorm Aug 11, 2023
c42daee
Cherry pick #27842 (#28015)
lostluck Aug 15, 2023
8293bbd
[Cherry Pick #28083][prism] support single external env pipelines. (#…
lostluck Aug 22, 2023
ff12f23
[Cherry-pick #28036] Add libgeos-dev for ARM components of multi-arch…
lostluck Aug 22, 2023
b1d245d
Pin hugging face docs (#28106) (#28115)
damccorm Aug 22, 2023
28d20e4
Loosen environment validation for RCs (#28110)
tvalentyn Aug 22, 2023
8f762e3
Add back docker prune step between python container pushes (#28033) (…
damccorm Aug 22, 2023
7fda4d4
add python_requires block needed for beam-release
nji302 Sep 25, 2023
9008da2
update version.py
nji302 Nov 6, 2023
2c62a61
[LYFT] Custom translators for Flink Kinesis and Kafka connectors
tweise Aug 17, 2018
00ecbe9
[LYFT] disable sonatype staging repo
mxm Mar 30, 2020
08fb8ed
[LYFT] [STRMCMP-1182] config for lyft repo mappings
maghamravi Jan 18, 2021
81bba24
Using alternate path for maven settings
nji302 Nov 7, 2023
ae9cad1
add maven repos to validate-runner
nji302 Nov 7, 2023
35e6166
disable warnings as errors config and nullchecker
nji302 Nov 8, 2023
dc0575a
set Java11
nji302 Nov 8, 2023
804ac2f
add pipeline options to satisfy new method arguments
nji302 Nov 8, 2023
fac1c09
bump deps to 1.17
nji302 Nov 8, 2023
7ba0b9f
[LYFT] LyftPythonEnvironmentFactory that spawns Python SDK processes …
tweise Aug 31, 2018
eaa8830
[LYFT] Fix from_runner_api_parameter signature for custom transforms
tweise May 7, 2020
0758989
[LYFT][STRMCMP-947] Using Kafka builders (#38)
maghamravi Jun 4, 2020
acfd75f
fixing after cherry-pick
nji302 Nov 8, 2023
e85b520
[LYFT] Allow the use of SQL via the embedded Java environment
mxm May 22, 2020
ab7f8d3
delete duplicate method
nji302 Nov 8, 2023
80823bb
[LYFT][STRMCMP-996] Adding translations for S3 and kinesis input sources
premsantosh Aug 31, 2020
4e95a69
[LYFT][STRMCMP-1155] S3 event parsing failures (#50)
maghamravi Dec 3, 2020
90b257d
[LYFT][STRMCMP-1143] Add an S3 connector for the beam python SDK (#48)
glaksh100 Dec 3, 2020
89e15ef
[LYFT] [STRMCMP-1161] Adding way to capture application env to kinesi…
premsantosh Dec 22, 2020
2bbd9f1
[LYFT][STRMCMP-1152] Code refactoring for s3AndKinesis PTransform (#49)
maghamravi Nov 30, 2020
62ea2dd
add s3 and hadoop deps
nji302 Nov 8, 2023
ffe0f2a
ignore test broken by new versions of dependency
nji302 Nov 9, 2023
6ecf903
add flink 1.17
nji302 Nov 10, 2023
8e6d11f
exclude log4j from dependencies
nji302 Nov 10, 2023
a1ba939
cython as a prereq
nji302 Nov 10, 2023
5a9cbeb
[LYFT] [STRMCMP-1594] Support for multiple kafka topics as input (#58)
maghamravi Dec 5, 2022
4b2eaba
[LYFT] [STRMCMP-1594] Support for multiple kafka topics as input - 2
maghamravi Dec 7, 2022
558ec8b
[LYFT] [STRMCMP-1594] Fixup: Using instance variables (#62)
maghamravi Jan 27, 2023
c2b13f9
[RTOP-997] [LYFT] Add retries for ephemeral unavailable failures (#63)
maghamravi Mar 14, 2023
1974e1a
fix test override
nji302 Nov 14, 2023
6e809fa
bump version after setting shaded dep to compiled scope
nji302 Nov 14, 2023
2a912c9
Revert "Checkstyle and errorprone fixes for new guava version"
nji302 Nov 15, 2023
3f25a28
Revert "Use guave 32.1.2"
nji302 Nov 15, 2023
dab79ed
fixing guava version
nji302 Nov 15, 2023
c9d0a61
Revert "fixing guava version"
nji302 Nov 16, 2023
20e3cce
Revert "Revert "Use guave 32.1.2""
nji302 Nov 16, 2023
e17eff8
Revert "Revert "Checkstyle and errorprone fixes for new guava version""
nji302 Nov 16, 2023
24f0705
patch empty savepoint behavior
nji302 Nov 17, 2023
4fae2c6
use .post instead of +lyft
nji302 Nov 17, 2023
95d52e2
updating dependencies
nji302 Dec 5, 2023
ca29692
log KafkaInput parameters
nji302 Dec 6, 2023
d2950b6
whitespace change
nji302 Dec 6, 2023
131ce7e
fix bug where maxOutOfOrdernessMillis can be null
nji302 Dec 6, 2023
bf5483a
bump dependencies to latest lyft versions
nji302 Dec 18, 2023
7f67002
relax protobuf dependency
nji302 Feb 5, 2024
9922393
Revert "relax protobuf dependency"
nji302 Feb 6, 2024
cc785f0
bump to latest versions
nji302 Feb 14, 2024
40c69ae
deploy occurred at changes
maheepm-lyft Apr 16, 2024
8b6c3e1
Merge pull request #66 from lyft/occured_at_fix
nji302 Apr 16, 2024
75bc29a
try different version of grpcio-tools
nji302 Apr 19, 2024
059f35d
Revert "try different version of grpcio-tools"
nji302 Apr 19, 2024
7da1f46
enable kafka auto partition discovery
nji302 May 9, 2024
a19a1b5
add KinesisRecordTimestampExtractor watermark assigner in portable tr…
nji302 Jun 12, 2024
55415bc
test event base changes with custom assigner
nji302 Jun 12, 2024
bd7617f
semicolon
nji302 Jun 12, 2024
274bd02
Revert "test event base changes with custom assigner"
nji302 Jun 12, 2024
195d7a4
Revert "Revert "test event base changes with custom assigner""
nji302 Jun 12, 2024
a323266
fix versions
nji302 Jun 12, 2024
5ef95b3
Revert "fix versions"
nji302 Jun 13, 2024
375b6ef
Revert "Revert "Revert "test event base changes with custom assigner"""
nji302 Jun 13, 2024
e8ca15d
Revert "Revert "test event base changes with custom assigner""
nji302 Jun 13, 2024
d8dd335
Revert "semicolon"
nji302 Jun 13, 2024
60f5087
Revert "test event base changes with custom assigner"
nji302 Jun 13, 2024
0d3d116
Revert "add KinesisRecordTimestampExtractor watermark assigner in por…
nji302 Jun 13, 2024
c5b1cb8
parse event_base in correct block of code
nji302 Jun 13, 2024
7bd3aeb
remove unneeded import
nji302 Jun 13, 2024
ccc8e0d
removed logs
maheepm-lyft Jun 13, 2024
8fc90db
Invoke pyarrow_hotfix to alleviate concerns due to CVE-2023-47248. (#…
tvalentyn Nov 11, 2023
ed20488
Bump Pyarrow version to include 14.0.0 (#29536)
AnandInguva Dec 3, 2023
a0f6632
added logging to help waittimeaggregation debugging
maheepm-lyft Oct 3, 2024
2cbca45
Merge pull request #67 from lyft/rtop_logging
aniruddha-lyft Oct 3, 2024
0b2c8b0
Revert "added logging to help waittimeaggregation debugging"
maheepm-lyft Oct 7, 2024
366bbdc
Merge pull request #68 from lyft/revert_rtop_logging
maheepm-lyft Oct 7, 2024
247dab9
Revert "Merge pull request #68 from lyft/revert_rtop_logging"
maheepm-lyft Oct 9, 2024
8b736ff
remove stream.tell()
maheepm-lyft Oct 9, 2024
afc2c82
Merge pull request #69 from lyft/rtop_logging
maheepm-lyft Oct 9, 2024
8765f5a
added more logs to bundle_processor.py
maheepm-lyft Oct 14, 2024
a7d5045
Merge pull request #70 from lyft/rtop_logging
maheepm-lyft Oct 14, 2024
9c7b3d5
Revert "added more logs to bundle_processor.py"
maheepm-lyft Oct 14, 2024
9541350
Revert "remove stream.tell()"
maheepm-lyft Oct 14, 2024
757de00
remove all logs
maheepm-lyft Oct 14, 2024
439b4e0
Merge pull request #71 from lyft/wta_logging
maheepm-lyft Oct 14, 2024
195aac6
added back rtop logs + more
maheepm-lyft Oct 15, 2024
8947470
add coder_impl logging
maheepm-lyft Oct 15, 2024
6e8e9f1
coder_impl logging
maheepm-lyft Oct 15, 2024
64a4ba1
Merge pull request #72 from lyft/wta_logging
maheepm-lyft Oct 15, 2024
0cb18ea
remove pipeline.py logs
maheepm-lyft Oct 15, 2024
eeeecd8
Merge pull request #73 from lyft/wta_logging
maheepm-lyft Oct 15, 2024
13692c3
update pom
jingweiz-lyft Oct 22, 2024
43c8bab
remove rtop logs
maheepm-lyft Oct 24, 2024
fb80136
Revert "update pom"
maheepm-lyft Oct 24, 2024
490450e
[RTOP-2169] Adding support for kinesis consumer synchronization
kjbilton Nov 7, 2024
90d2e9b
add extra condition
kjbilton Nov 7, 2024
aa7d08d
update param name
kjbilton Nov 11, 2024
519fdfb
Add missing import
kjbilton Nov 11, 2024
e8bc339
Merge pull request #74 from lyft/kbilton/add_kinesis_watermark_tracker
kjbilton Nov 11, 2024
f31cd96
Update S3Input to align with S3AndKinesisInput
kjbilton Dec 30, 2024
c027b7f
update variable name for consistency
kjbilton Jan 3, 2025
a7a0715
Merge pull request #75 from lyft/kbilton/update_s3_input
kjbilton Jan 3, 2025
0ba21f7
Use KafkaSource instead of FlinkKafkaConsumer (#77)
ardakuyumcu Feb 21, 2025
d36efc1
Add v2 Kafka source (#80)
kjbilton Feb 22, 2025
72640e5
[RTOP-2408] Adding Kafka source watermark alignment (#76)
kjbilton Feb 26, 2025
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
The table of contents is too big for display.
Diff view
Diff view
  •  
  •  
  •  
The diff you're trying to view is too large. We only load the first 3000 changed files.
113 changes: 113 additions & 0 deletions .asf.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,113 @@
# Licensed to the Apache Software Foundation (ASF) under one or more
# contributor license agreements. See the NOTICE file distributed with
# this work for additional information regarding copyright ownership.
# The ASF licenses this file to You under the Apache License, Version 2.0
# (the "License"); you may not use this file except in compliance with
# the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.

# This file configures github and jira notifications based on
# https://s.apache.org/asfyaml-notify

github:
description: "Apache Beam is a unified programming model for Batch and Streaming data processing."
homepage: https://beam.apache.org/
labels:
- batch
- beam
- big-data
- golang
- java
- python
- sql
- streaming
features:
# Enable issue management
issues: true
# Enable projects for project management boards
projects: true

# Give some users issue triage permissions
collaborators:
- pcoet
- olehborysevych
- rshamunov
- andreydevyatkin
- liferoad

enabled_merge_buttons:
squash: true
merge: true
rebase: false

protected_branches:
master: {}
release-2.49.0: {}
release-2.48.0: {}
release-2.47.0: {}
release-2.46.0: {}
release-2.45.0: {}
release-2.44.0: {}
release-2.43.0: {}
release-2.42.0: {}
release-2.41.0: {}
release-2.40.0: {}
release-2.39.0: {}
release-2.38.0: {}
release-2.37.0: {}
release-2.36.0: {}
release-2.35.0: {}
release-2.34.0: {}
release-2.33.0: {}
release-2.32.0: {}
release-2.31.0: {}
release-2.30.0: {}
release-2.29.0: {}
release-2.28.0: {}
release-2.27.0: {}
release-2.26.0: {}
release-2.25.0: {}
release-2.24.0: {}
release-2.23.0: {}
release-2.22.0: {}
release-2.21.0: {}
release-2.20.0: {}
release-2.19.0: {}
release-2.18.0: {}
release-2.17.0: {}
release-2.16.0: {}
release-2.15.0: {}
release-2.14.0: {}
release-2.13.0: {}
release-2.12.0: {}
release-2.11.0: {}
release-2.10.0: {}
release-2.8.0: {}
release-2.8.0: {}
release-2.7.0: {}
release-2.6.0: {}
release-2.5.0: {}
release-2.4.0: {}
release-2.3.0: {}
release-2.2.0: {}
release-2.1.1: {}
release-2.1.0: {}
release-0.6.0: {}
release-0.5.0: {}
release-0.4.0: {}
release-0.4.0-incubating: {}
release-0.3.0-incubating: {}
release-0.2.0-incubating: {}
release-0.1.0-incubating: {}

notifications:
commits: [email protected]
issues: [email protected]
pullrequests: [email protected]
8 changes: 5 additions & 3 deletions .gitattributes
Original file line number Diff line number Diff line change
Expand Up @@ -31,6 +31,8 @@ Dockerfile text
# git repository
.gitignore export-ignore
.gitattributes export-ignore
gradlew export-ignore
gradlew.bat export-ignore
/gradle export-ignore
/gradlew* export-ignore
**/gradle export-ignore

# Website is not part of archive
/website export-ignore
100 changes: 100 additions & 0 deletions .github/ACTIONS.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,100 @@
<!--
Licensed to the Apache Software Foundation (ASF) under one
or more contributor license agreements. See the NOTICE file
distributed with this work for additional information
regarding copyright ownership. The ASF licenses this file
to you under the Apache License, Version 2.0 (the
"License"); you may not use this file except in compliance
with the License. You may obtain a copy of the License at
http://www.apache.org/licenses/LICENSE-2.0
Unless required by applicable law or agreed to in writing,
software distributed under the License is distributed on an
"AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
KIND, either express or implied. See the License for the
specific language governing permissions and limitations
under the License.
-->

> **PLEASE update this file if you add new GitHub Action or change name/trigger phrase of a GitHub Action.**
## About GitHub Actions Runners and Self-hosted Runners
According to GitHub Docs, we can define a GitHub-hosted runner and a self-hosted runner as the following:
* A [GitHub-hosted runner](https://docs.github.com/en/actions/using-github-hosted-runners/about-github-hosted-runners) is a new virtual machine (VM) hosted by GitHub with the runner application and other tools preinstalled, and is available with Ubuntu Linux, Windows, or macOS operating systems.
* A [self-hosted runner](https://docs.github.com/en/actions/hosting-your-own-runners/about-self-hosted-runners) is a system that you deploy and manage to execute jobs from GitHub Actions on GitHub.com.

## Apache Beam GitHub Actions

Currently, we have both GitHub-hosted and self-hosted runners for running the GitHub Actions workflows, hosted on Google Cloud Platform(GCP) Virtual Machines and Google Kubernetes Engine(GKE). The majority of our workflows that run in Ubuntu and Windows run in self-hosted runners, except for those that runs on MacOS and the `Monitor Self-Hosted Runners Status` workflow that monitors our GCP self-hosted runners.

### Getting Started with self-hosted runners
* Refer to [this README](./gh-actions-self-hosted-runners/README.md) for the steps for creating your own self-hosted runners for testing your workflows.
* Depending on your workflow's needs, it must specify the following `runs-on` tags to run in the specified operating system:
* Ubuntu 20.04 self-hosted runner: `[self-hosted, ubuntu-20.04]`
* Windows Server 2019 self-hosted runner: `[self-hosted, windows-server-2019]`
* MacOS GitHub-hosted runner: `macos-latest`
* Every workflow that tests the source code, needs to have the workflow trigger `pull_request_target` instead of `pull_request`.
* The workflow must have set read permissions for all the available scopes and jobs: `permissions: read-all`. It must be set at the top of the `jobs` directive.
* For those workflows that have the `pull_request_target` trigger, in the checkout step must be added a ref to `${{ github.event.pull_request.head.sha }}`
``` yaml
- name: Checkout code
uses: actions/checkout@v#
with:
ref: ${{ github.event.pull_request.head.sha }}
```
* If your workflow runs successfully in a GitHub-hosted runner but not in the self-hosted runner, it might need a new installation step.
```yaml
- name: Setup Node
uses: actions/setup-node@v3
with:
node-version: 16
```
* You can find the GitHub-hosted runner installations in the following links:
* [Ubuntu-20.04](https://github.com/actions/runner-images/blob/main/images/linux/Ubuntu2004-Readme.md#installed-apt-packages)
* [Windows-2019](https://github.com/actions/runner-images/blob/main/images/win/Windows2019-Readme.md)

#### GitHub Actions Example
```yaml
name: GitHub Actions Example
on:
pull_request_target:
branches: ['master']
permissions: read-all
jobs:
github-actions-example:
runs-on: [self-hosted, ubuntu-20.04]
steps:
- name: Check out repository code
uses: actions/checkout@v2
with:
ref: ${{ github.event.pull_request.head.sha }}
- run: echo "This job is now running on a ubuntu server hosted by Apache Beam!"
- name: Setup Node
uses: actions/setup-node@v3
with:
node-version: 16
- name: Install npm dependencies
run: npm ci
working-directory: 'scripts/ci/your-path'
- name: Run Node.js code
run: npm run functionName
env:
VAR_1: my-var
working-directory: 'scripts/ci/your-path'
```

#### IMPORTANT for Committers
* A **detailed review** for changes in the workflows is needed due to important **security concerns**.
* **DO NOT** Approve and Run changes in the workflows in the PR Conversation tab, under "Workflow(s) awaiting approval".
* For approving the updates in the workflows, you should go to the Repository Actions and filter All Workflows by `action_required`. The search will display the workflows that need to be reviewed before running. **Please make sure reviewing the file that is referenced by the workflow.**
* Seed job will be emulated using the `Approve and Run` built-in feature of GitHub Actions, since the workflows will use the `pull_request_target` directive; no modifications would be allowed either for new or existent jobs unless a committer explicitly approves the job from GitHub Actions UI.
### Issue Management

Phrases self-assign, close, or manage labels on an issue:
| Phrase | Effect |
|--------|--------|
| `.take-issue` | Self-assign the issue |
| `.close-issue` | Close the issue as completed |
| `.close-issue not_planned` | Close the issue as not-planned |
| `.reopen-issue` | Reopen a closed issue |
| `.add-labels` | Add comma separated labels to the issue (e.g. `add-labels l1, 'l2 with spaces'`) |
| `.remove-labels` | Remove comma separated labels to the issue (e.g. `remove-labels l1, 'l2 with spaces'`) |
| `.set-labels` | Sets comma separated labels to the issue and removes any other labels (e.g. `set-labels l1, 'l2 with spaces'`) |
75 changes: 75 additions & 0 deletions .github/ISSUE_TEMPLATE/bug.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,75 @@
#
# Licensed to the Apache Software Foundation (ASF) under one or more
# contributor license agreements. See the NOTICE file distributed with
# this work for additional information regarding copyright ownership.
# The ASF licenses this file to You under the Apache License, Version 2.0
# (the "License"); you may not use this file except in compliance with
# the License. You may obtain a copy of the License at
#
# https://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
#

name: Bug Report
description: File a bug report
title: "[Bug]: "
labels: ["bug", "awaiting triage"]
assignees:
- octocat
body:
- type: markdown
attributes:
value: |
Thanks for taking the time to fill out this bug report! Once you've created an issue, you can self-assign by commenting `.take-issue` and you can close by commenting `.close-issue`.
Anyone can reopen a closed issue by commenting `.reopen-issue`.
You can also add/remove labels by commenting `.add-labels label1,label2,'label 3 with spaces'` or `.remove-labels label1,label2,'label 3 with spaces'`,
or with `.set-labels label1,label2,'label 3 with spaces'` (which removes any labels not in that set).
- type: textarea
id: what-happened
attributes:
label: What happened?
description: Please describe your issue, including the version of Beam and any other languages or tools being used.
placeholder: |
Please include steps to reproduce your issue.

If this isn't a bug and you have a question or support request, please email [email protected] with a description of the problem instead of opening this issue.
validations:
required: true
- type: dropdown
id: priority
attributes:
label: Issue Priority
description: What priority is this bug? See https://beam.apache.org/contribute/issue-priorities for the meaning and expectations around issue priorities.
options:
- "Priority: 3 (minor)"
- "Priority: 2 (default / most bugs should be filed as P2)"
- "Priority: 1 (data loss / total loss of function)"
- "Priority: 0 (outage / urgent vulnerability)"
validations:
required: true
- type: checkboxes
id: component
attributes:
label: Issue Components
description: Which languages, SDKs, or features are related to your report? (check all that apply)
options:
- label: "Component: Python SDK"
- label: "Component: Java SDK"
- label: "Component: Go SDK"
- label: "Component: Typescript SDK"
- label: "Component: IO connector"
- label: "Component: Beam examples"
- label: "Component: Beam playground"
- label: "Component: Beam katas"
- label: "Component: Website"
- label: "Component: Spark Runner"
- label: "Component: Flink Runner"
- label: "Component: Samza Runner"
- label: "Component: Twister2 Runner"
- label: "Component: Hazelcast Jet Runner"
- label: "Component: Google Cloud Dataflow Runner"
81 changes: 81 additions & 0 deletions .github/ISSUE_TEMPLATE/failing_test.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,81 @@
#
# Licensed to the Apache Software Foundation (ASF) under one or more
# contributor license agreements. See the NOTICE file distributed with
# this work for additional information regarding copyright ownership.
# The ASF licenses this file to You under the Apache License, Version 2.0
# (the "License"); you may not use this file except in compliance with
# the License. You may obtain a copy of the License at
#
# https://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
#

name: Failing Test
description: Report a failing test
title: "[Failing Test]: "
labels: ["bug", "awaiting triage", "failing test"]
assignees:
- octocat
body:
- type: markdown
attributes:
value: |
Thanks for taking the time to fill out this feature request! Once you've created an issue, you can self-assign by commenting `.take-issue` and you can close by commenting `.close-issue`.
Anyone can reopen a closed issue by commenting `.reopen-issue`.
You can also add/remove labels by commenting `.add-labels label1,label2,'label 3 with spaces'` or `.remove-labels label1,label2,'label 3 with spaces'`,
or with `.set-labels label1,label2,'label 3 with spaces'` (which removes any labels not in that set).
- type: textarea
id: what-happened
attributes:
label: What happened?
description: Please name which test is failing.
placeholder: |
If possible include: when the test started failing, a jenkins link, and any initial debugging you've done.

If this isn't a bug and you have a question or support request, please email [email protected] with a description of the problem instead of opening this issue.
- type: dropdown
id: failure
attributes:
label: Issue Failure
options:
- "Failure: Test is flaky"
- "Failure: Test is continually failing"
validations:
required: true
- type: dropdown
id: priority
attributes:
label: Issue Priority
description: What priority is this bug? A permanently failing test should be marked P1. See https://beam.apache.org/contribute/issue-priorities for the meaning and expectations around issue priorities.
options:
- "Priority: 2 (backlog / disabled test but we think the product is healthy)"
- "Priority: 1 (unhealthy code / failing or flaky postcommit so we cannot be sure the product is healthy)"
- "Priority: 0 (outage / failing precommit test impacting development)"
validations:
required: true
- type: checkboxes
id: component
attributes:
label: Issue Components
description: Which languages, SDKs, or features are related to your report? (check all that apply)
options:
- label: "Component: Python SDK"
- label: "Component: Java SDK"
- label: "Component: Go SDK"
- label: "Component: Typescript SDK"
- label: "Component: IO connector"
- label: "Component: Beam examples"
- label: "Component: Beam playground"
- label: "Component: Beam katas"
- label: "Component: Website"
- label: "Component: Spark Runner"
- label: "Component: Flink Runner"
- label: "Component: Samza Runner"
- label: "Component: Twister2 Runner"
- label: "Component: Hazelcast Jet Runner"
- label: "Component: Google Cloud Dataflow Runner"
Loading