Skip to content
Draft
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
264 commits
Select commit Hold shift + click to select a range
6e03fd8
feat: added get_document and list_document functions (#7)
galz10 Oct 3, 2022
87b65aa
Bump protobuf from 3.20.1 to 3.20.2 in /.kokoro (#6)
dependabot[bot] Oct 6, 2022
3e20634
chore: updated comments (#8)
galz10 Oct 7, 2022
5ccef75
feat: wrapped tables (#9)
galz10 Oct 18, 2022
992672c
Chore: fix missing changes (#15)
galz10 Oct 19, 2022
d19df84
feat: added helper functions to DocumentWrapper (#12)
galz10 Oct 19, 2022
3c112d5
feat: added client_info to storage client (#10)
galz10 Oct 27, 2022
ad93a8b
chore: update readme (#4)
galz10 Oct 28, 2022
c3455c6
chore: refactored classes (#14)
galz10 Oct 31, 2022
7990697
feat: refactor code (#17)
dizcology Nov 15, 2022
14e0242
chore: made print_gcs_document_tree accessible (#18)
galz10 Nov 15, 2022
e73ce4a
chore: add unit test for Entity (#19)
dizcology Nov 15, 2022
75c82ec
chore: fixed to_dataframe header issue (#20)
galz10 Nov 24, 2022
2842d09
chore: release 0.1.0-alpha (#22)
galz10 Dec 5, 2022
0583b87
chore: updated readme (#24)
galz10 Dec 6, 2022
6de8028
chore: added tests for page.py (#23)
galz10 Dec 7, 2022
5871c5c
chore(deps): bump certifi from 2022.9.24 to 2022.12.7 in /.kokoro (#26)
dependabot[bot] Dec 9, 2022
cf7432e
chore: update repo-metadata.json (#29)
parthea Jan 11, 2023
0128ccf
chore: update client_documentation and issue_tracker in .repo-metadat…
parthea Jan 24, 2023
7b63e71
chore: changed gcs_prefix pattern comment (#21)
galz10 Jan 24, 2023
86b5f04
chore: documentation changes (#33)
galz10 Jan 30, 2023
b111653
chore: updated testing constraints (#34)
galz10 Jan 30, 2023
4daab05
docs: fix docs arrangement (#35)
dandhlee Jan 31, 2023
b809c1e
chore(main): release 0.1.0-alpha (#25)
release-please[bot] Jan 31, 2023
eeb1f98
chore: fix readme release issues (#38)
galz10 Jan 31, 2023
f1fa28a
chore: minor refactoring of GCS Functions in document wrapper (#39)
holtskinner Jan 31, 2023
ac3df1c
chore: update docs url (#41)
parthea Feb 1, 2023
03587a0
chore: documentation fixes (#44)
galz10 Feb 1, 2023
f68f660
docs(samples): added quickstart sample (#27)
galz10 Feb 1, 2023
6d64547
chore(deps): update all dependencies (#46)
renovate-bot Feb 2, 2023
ee19b02
chore: changed test file name for quickstart sample (#47)
galz10 Feb 2, 2023
d41a14a
chore: minor refactoring for readability/simplicity (#42)
holtskinner Feb 2, 2023
7800f73
fix: Updated Pip install name in README (#52)
holtskinner Feb 8, 2023
319a11d
chore(main): release 0.1.1-alpha (#45)
release-please[bot] Feb 8, 2023
3855ca9
chore(deps): update dependency google-cloud-documentai to v2 (#49)
renovate-bot Feb 8, 2023
4567ee3
chore(deps): update dependency google-cloud-documentai to v2.12.0 (#55)
renovate-bot Feb 9, 2023
3ad4af8
feat: Added Support for Form Fields (#48)
holtskinner Feb 14, 2023
b02670f
samples: Added Table Parsing Sample (#54)
holtskinner Feb 14, 2023
f114624
feat: Add PDF Splitter (#51)
holtskinner Feb 15, 2023
7f07b90
feat: Add `entities_to_dict()` and `entities_to_bigquery()` to `Docum…
holtskinner Feb 15, 2023
29d62af
chore(main): release 0.2.0-alpha (#56)
release-please[bot] Feb 15, 2023
af424ac
docs: Update to README (#58)
holtskinner Feb 15, 2023
9428869
chore(main): release 0.2.1-alpha (#59)
release-please[bot] Feb 16, 2023
77ec05c
chore(deps): update dependency google-cloud-documentai to v2.13.0 (#61)
renovate-bot Feb 22, 2023
fccd7ce
chore(deps): update dependency google-cloud-bigquery to v3.6.0 (#62)
renovate-bot Feb 22, 2023
308137c
feat: added docproto to AnnotateFile convertor (#63)
galz10 Feb 24, 2023
05b8f56
chore: added init file to converter folder (#64)
galz10 Feb 27, 2023
4ada5bb
chore(main): release 0.3.0-alpha (#65)
release-please[bot] Feb 27, 2023
448389a
build(deps): Update to .kokoro/requirements.txt for dependbot alerts …
holtskinner Mar 1, 2023
1bbe055
feat: Added Batch creation for Cloud Storage documents. (#66)
holtskinner Mar 1, 2023
2494980
fix: Handle Edge Case where GCS Shards are out of order (#69)
holtskinner Mar 2, 2023
281b017
refactor: Moved `print_gcs_document_tree()` to utilities. (#71)
holtskinner Mar 2, 2023
5f4a303
chore(deps): update dependency pytest to v7.2.2 (#73)
renovate-bot Mar 6, 2023
0263e01
chore(deps): update dependency google-cloud-documentai to v2.14.0 (#74)
renovate-bot Mar 9, 2023
09bcb1a
feat: add config based annotation converter (#72)
galz10 Mar 9, 2023
9dc1751
feat: added list_gcs_document_tree (#75)
galz10 Mar 9, 2023
4b322ac
chore(main): release 0.4.0-alpha (#70)
release-please[bot] Mar 10, 2023
838fd6d
chore: Add `PythonDependency` to auto-approve.yml (#77)
holtskinner Mar 13, 2023
502077f
test: Update Utilities and Tests to improve coverage and address lint…
holtskinner Mar 17, 2023
ef93340
chore: Update Owlbot filenames (#80)
holtskinner Mar 17, 2023
d48a22e
chore(deps): update dependency google-cloud-bigquery to v3.7.0 (#79)
renovate-bot Mar 17, 2023
9e666dd
chore(main): release 0.4.1-alpha (#86)
release-please[bot] Mar 21, 2023
adf5e59
chore(deps): update dependency google-cloud-documentai to v2.15.0 (#87)
renovate-bot Mar 24, 2023
790ef0e
chore(deps): update dependency google-cloud-bigquery to v3.8.0 (#89)
renovate-bot Mar 27, 2023
db731f6
chore(deps): update dependency google-cloud-bigquery to v3.9.0 (#90)
renovate-bot Mar 28, 2023
867d569
chore(deps): update dependency google-cloud-storage to v2.8.0 (#92)
renovate-bot Mar 30, 2023
1cd526f
feat: Update Max Files per Batch Request to 1000 (#91)
holtskinner Apr 3, 2023
b348009
chore: changed convert_external_annotation test (#95)
galz10 Apr 6, 2023
32d83f3
chore(deps): Update Dependencies for pandas and pikepdf (#93)
holtskinner Apr 6, 2023
940a847
feat: Added Export Images functionality (#96)
holtskinner Apr 6, 2023
6df4751
feat: Add Import Document from Batch Process Metadata & Operation (#88)
holtskinner Apr 7, 2023
c2390fe
chore(main): release 0.5.0-alpha (#94)
release-please[bot] Apr 7, 2023
5ad1a90
chore(deps): update dependency pytest to v7.3.0 (#100)
renovate-bot Apr 10, 2023
8769391
chore: changed get_storage_client to add module name for analytics (#…
galz10 Apr 13, 2023
28a8f08
feat: Add blocks to PageWrapper (#107)
sharmanirudh Apr 17, 2023
425bc14
chore(deps): update all dependencies (#105)
renovate-bot Apr 17, 2023
a1436bd
feat: Added `form_fields_to_bigquery()` method (#104)
holtskinner Apr 17, 2023
dee790b
chore(main): release 0.6.0-alpha (#108)
release-please[bot] Apr 17, 2023
1f209ae
chore(deps): update dependency google-cloud-bigquery to v3.10.0 (#109)
renovate-bot Apr 19, 2023
2c20547
chore(deps): update dependency google-cloud-storage to v2.9.0 (#116)
renovate-bot May 23, 2023
118d26c
build(deps): bump requests from 2.28.1 to 2.31.0 in /synthtool/gcp/te…
gcf-owl-bot[bot] May 25, 2023
b7e18f8
feat: added text_annotation to vision conversion (#114)
galz10 May 31, 2023
a59cbf6
chore(main): release 0.7.0-alpha (#119)
release-please[bot] May 31, 2023
f4cb585
chore(deps): update dependency google-cloud-bigquery to v3.11.0 (#120)
renovate-bot Jun 1, 2023
c1f0627
build(deps): bump cryptography from 39.0.1 to 41.0.0 in /synthtool/gc…
gcf-owl-bot[bot] Jun 3, 2023
afb3c32
feat: add convert_document_to_annotate_file_json (#124)
galz10 Jun 7, 2023
929a39e
chore(main): release 0.8.0-alpha (#125)
release-please[bot] Jun 8, 2023
21602a7
chore(deps): update dependency pytest to v7.3.2 (#126)
renovate-bot Jun 12, 2023
f305858
chore(deps): update dependency google-cloud-bigquery to v3.11.1 (#127)
renovate-bot Jun 14, 2023
d501312
chore(deps): update dependency google-cloud-documentai to v2.16.0 (#130)
renovate-bot Jun 20, 2023
e11ddb1
refactor: Restructuring of classes to reduce duplication and increase…
holtskinner Jun 23, 2023
9a43f18
chore(deps): update all dependencies (#131)
renovate-bot Jun 26, 2023
9973e59
chore: remove pinned Sphinx version [autoapprove] (#133)
gcf-owl-bot[bot] Jun 27, 2023
7833b5b
chore(deps): update dependency google-cloud-storage to v2.10.0 (#132)
renovate-bot Jun 27, 2023
2535c7d
chore(deps): update dependency google-cloud-bigquery to v3.11.3 (#134)
renovate-bot Jun 27, 2023
c42fe90
chore: store artifacts in placer (#135)
gcf-owl-bot[bot] Jun 28, 2023
8fe98bd
feat: added hOCR export functionality (#123)
galz10 Jun 28, 2023
40be61f
chore(deps): update dependency google-cloud-documentai to v2.16.1 (#138)
renovate-bot Jul 5, 2023
6acebab
refactor: Refactor of hocr functions and fixing lost changes (#137)
holtskinner Jul 7, 2023
ea487be
chore(main): release 0.9.0-alpha (#136)
release-please[bot] Jul 7, 2023
892007c
build(deps): [autoapprove] bump cryptography from 41.0.0 to 41.0.2 (#…
gcf-owl-bot[bot] Jul 17, 2023
4ba382b
chore(deps): update all dependencies (#140)
renovate-bot Jul 17, 2023
fe55937
chore(deps): update dependency google-cloud-bigquery to v3.11.4 (#143)
renovate-bot Jul 20, 2023
b5ce391
refactor: Major refactoring of functions to improve readability, effi…
holtskinner Jul 20, 2023
56e4bae
build(deps): [autoapprove] bump pygments from 2.13.0 to 2.15.0 (#146)
gcf-owl-bot[bot] Jul 21, 2023
548ec2a
build(deps): [autoapprove] bump certifi from 2022.12.7 to 2023.7.22 (…
gcf-owl-bot[bot] Jul 25, 2023
f6bd33f
chore(deps): update dependency google-cloud-documentai to v2.18.0 (#149)
renovate-bot Jul 26, 2023
e369396
chore(main): release 0.9.1-alpha (#152)
release-please[bot] Jul 27, 2023
eccd940
chore: [autoapprove] Pin flake8 version (#153)
gcf-owl-bot[bot] Aug 1, 2023
62a1723
build: [autoapprove] bump cryptography from 41.0.2 to 41.0.3 (#155)
gcf-owl-bot[bot] Aug 2, 2023
bde9a66
fix: Update noxfile.py (#129)
holtskinner Aug 8, 2023
850851e
feat: Add export merged sharded Document proto (#145)
holtskinner Aug 9, 2023
6abd11e
refactor: Remove `table.to_csv()` method and add more export options …
holtskinner Aug 9, 2023
51c7fcb
chore(main): release 0.10.0-alpha (#157)
release-please[bot] Aug 9, 2023
7c0c837
chore(deps): update all dependencies (#162)
renovate-bot Sep 5, 2023
fbb57e9
fix: Add handling for documents missing all layout elements. (#161)
holtskinner Sep 5, 2023
a2e1bb1
chore(main): release 0.10.1-alpha (#164)
release-please[bot] Sep 5, 2023
397dc4c
chore(deps): update all dependencies (#165)
renovate-bot Sep 8, 2023
91e0833
chore(deps): update all dependencies (#166)
renovate-bot Sep 19, 2023
d7f0bb8
chore(deps): update all dependencies (#167)
renovate-bot Sep 19, 2023
756d766
chore(deps): update all dependencies (#170)
renovate-bot Sep 28, 2023
50b7dfc
chore: [autoapprove] bump cryptography from 41.0.3 to 41.0.4 (#172)
gcf-owl-bot[bot] Oct 3, 2023
b277159
fix: Change `ocr_line` `<span>` to include all `ocr_word` (#169)
ralscha Oct 3, 2023
768b3b3
chore(main): release 0.10.2-alpha (#174)
release-please[bot] Oct 3, 2023
6507484
chore(deps): update all dependencies (#175)
renovate-bot Oct 3, 2023
5b189dd
fix: `docai_utilities.py` to return `Optional` (#176)
holtskinner Oct 6, 2023
68c51d1
chore(main): release 0.10.3-alpha (#177)
release-please[bot] Oct 6, 2023
07e3543
chore: [autoapprove] Update `black` and `isort` to latest versions (#…
gcf-owl-bot[bot] Oct 9, 2023
ee362be
chore(deps): update all dependencies (#180)
renovate-bot Oct 11, 2023
cba079c
chore(deps): update all dependencies (#181)
renovate-bot Oct 13, 2023
778d11c
feat: Update Toolbox for OCR 2.0 features (#171)
holtskinner Oct 18, 2023
c7c4193
chore(main): release 0.11.0-alpha (#183)
release-please[bot] Oct 18, 2023
43ba96e
chore: rename rst files to avoid conflict with service names (#184)
gcf-owl-bot[bot] Oct 18, 2023
e67a2ed
fix: Empty Page SubElements (#186)
holtskinner Oct 23, 2023
4211708
chore(main): release 0.11.1-alpha (#187)
release-please[bot] Oct 23, 2023
025cf69
chore(deps): update all dependencies (#188)
renovate-bot Oct 24, 2023
9b6e7c8
chore(deps): update all dependencies (#189)
renovate-bot Oct 30, 2023
2cc5a17
chore(deps): update all dependencies (#190)
renovate-bot Oct 31, 2023
d3e7f78
chore(deps): update all dependencies (#191)
renovate-bot Nov 2, 2023
40bae13
chore: update docfx minimum Python version (#192)
gcf-owl-bot[bot] Nov 3, 2023
c7cb3d4
fix: Updates to hOCR Template to follow hOCR Spec (#195)
holtskinner Nov 7, 2023
81b07cd
chore(main): release 0.11.2-alpha (#196)
release-please[bot] Nov 7, 2023
6ed5581
chore: bump urllib3 from 1.26.12 to 1.26.18 (#197)
gcf-owl-bot[bot] Nov 8, 2023
0a26cb7
chore(python): Add Python 3.12 (#203)
gcf-owl-bot[bot] Nov 27, 2023
9470b5b
feat: Introduce compatibility with native namespace packages (#201)
parthea Nov 27, 2023
d759953
chore: bump cryptography from 41.0.5 to 41.0.6 in /synthtool/gcp/temp…
gcf-owl-bot[bot] Nov 29, 2023
674f10a
build: treat warnings as errors (#208)
holtskinner Nov 29, 2023
9f3a3e4
fix: Implement lazy-loading of properties to speed up initialization …
holtskinner Nov 29, 2023
d7cf548
build: Ignore `DeprecationWarning` from google-api-core (#211)
holtskinner Nov 29, 2023
097dd49
build: ignore warning from google-auth (#212)
parthea Nov 30, 2023
a8c81b3
fix: Prevent sorting entities labeled in Document AI Workbench (#200)
lababidi Nov 30, 2023
627877c
chore(main): release 0.12.0-alpha (#204)
release-please[bot] Nov 30, 2023
5de2305
build: update warning filters (#219)
parthea Dec 8, 2023
6aa2d57
build: update actions/checkout and actions/setup-python (#220)
gcf-owl-bot[bot] Dec 10, 2023
1204767
chore(deps): update dependency google-cloud-documentai to v2.21.0 (#221)
renovate-bot Dec 10, 2023
8002041
chore(deps): update dependency google-cloud-bigquery to v3.14.0 (#222)
renovate-bot Dec 12, 2023
6edd983
chore(deps): update dependency google-cloud-storage to v2.14.0 (#225)
renovate-bot Dec 12, 2023
a188afc
chore(deps): update dependency google-cloud-documentai to v2.21.1 (#226)
renovate-bot Dec 13, 2023
b6f51db
chore(deps): update dependency google-cloud-bigquery to v3.14.1 (#227)
renovate-bot Dec 14, 2023
558b774
build: update actions/upload-artifact and actions/download-artifact (…
gcf-owl-bot[bot] Dec 15, 2023
642f58d
chore(deps): update dependency pytest to v7.4.4 (#230)
renovate-bot Jan 2, 2024
1fd7574
chore(deps): update dependency google-cloud-bigquery to v3.15.0 (#232)
renovate-bot Jan 11, 2024
a1328ef
chore(deps): update dependency google-cloud-bigquery to v3.16.0 (#234)
renovate-bot Jan 12, 2024
6714e98
build(python): fix `docs` and `docfx` builds (#235)
gcf-owl-bot[bot] Jan 17, 2024
09239cc
Add ignore warning for PyArrow/Pandas (#238)
holtskinner Jan 24, 2024
3bab8d5
chore(deps): update dependency google-cloud-bigquery to v3.17.0 (#242)
renovate-bot Jan 24, 2024
763f8f1
chore(deps): update dependency google-cloud-bigquery to v3.17.1 (#243)
renovate-bot Jan 25, 2024
08cd660
fix: Add `Python37DeprecationWarning` (#241)
holtskinner Feb 2, 2024
cc9d37d
chore(main): release 0.12.1-alpha (#248)
release-please[bot] Feb 2, 2024
090981f
fix: Reduce API polling for `Document.from_batch_process_operation()`…
holtskinner Feb 2, 2024
c1b76fa
chore(main): release 0.12.2-alpha (#250)
release-please[bot] Feb 2, 2024
7f1d8cc
build(deps): bump cryptography from 41.0.6 to 42.0.0 in /synthtool/gc…
gcf-owl-bot[bot] Feb 6, 2024
b1cdb8b
chore(deps): update all dependencies (#244)
renovate-bot Feb 6, 2024
5ff48b3
chore(deps): update all dependencies (#255)
renovate-bot Feb 7, 2024
1160dd8
build(deps): bump cryptography from 42.0.0 to 42.0.2 in .kokoro (#258)
gcf-owl-bot[bot] Feb 20, 2024
2dfcdf6
chore(deps): update all dependencies (#256)
renovate-bot Feb 23, 2024
778421f
feat: Added `gcs_uri` parameter to `Document.from_gcs()` to allow imp…
holtskinner Feb 26, 2024
cb89d36
chore(main): release 0.13.0-alpha (#263)
release-please[bot] Feb 26, 2024
01843e4
build(deps): bump cryptography from 42.0.2 to 42.0.4 in .kokoro (#265)
gcf-owl-bot[bot] Feb 27, 2024
d8aadc4
fix: Changed `client_info` import and added new quickstart samples (#…
holtskinner Mar 4, 2024
103d6cd
chore(main): release 0.13.1-alpha (#269)
release-please[bot] Mar 4, 2024
a8c463d
fix: Add trailing slash if not present for `gcs_prefix` in `Document.…
holtskinner Mar 8, 2024
872e550
chore(main): release 0.13.2-alpha (#275)
release-please[bot] Mar 8, 2024
381f65d
fix: Drop Python 3.7 Support (#262)
holtskinner Mar 8, 2024
76b0198
chore(deps): update all dependencies (#260)
renovate-bot Mar 11, 2024
02af25b
fix: Escape html special characters in `hocr_document_template.xml.j2…
holtskinner Mar 11, 2024
4d24ed3
chore(deps): Fix possible dependency conflict with `google-api-core` …
holtskinner Mar 11, 2024
2106eba
chore(deps): update dependency google-cloud-bigquery to v3.19.0 (#282)
renovate-bot Mar 11, 2024
18b30af
chore(main): release 0.13.3-alpha (#276)
release-please[bot] Mar 11, 2024
baa83c4
chore(python): update dependencies in /.kokoro (#287)
gcf-owl-bot[bot] Mar 18, 2024
1b8a8fc
chore(python): add requirements for docs build (#288)
gcf-owl-bot[bot] Mar 18, 2024
1aa5689
chore(deps): update dependency google-cloud-storage to v2.16.0 (#289)
renovate-bot Mar 19, 2024
e37b2b0
chore(deps): update dependency google-cloud-documentai to v2.24.2 (#290)
renovate-bot Mar 22, 2024
f501652
chore(deps): update all dependencies (#294)
renovate-bot Mar 28, 2024
2ecbf7b
chore(deps): update all dependencies (#297)
renovate-bot Apr 2, 2024
0320059
chore(deps): update all dependencies (#299)
renovate-bot Apr 3, 2024
cc9e867
chore(python): bump idna from 3.4 to 3.7 in .kokoro (#304)
gcf-owl-bot[bot] Apr 12, 2024
21b0745
chore(deps): update all dependencies (#309)
renovate-bot May 21, 2024
d6f7de1
refactor: Change read-only properties to `@cached_property` for reada…
holtskinner Jun 12, 2024
932e0ba
fix: Change `if` condition typo in `_get_children_of_element()` (#313)
holtskinner Jun 13, 2024
ffb64b7
chore(main): release 0.13.4-alpha (#314)
release-please[bot] Jun 13, 2024
d48ffa2
Update README.rst for Python 3.7 (#298)
holtskinner Jun 13, 2024
3ede3cb
fix: Refactor page.py to improve performance and organization (#316)
holtskinner Jul 2, 2024
02ca614
chore(main): release 0.13.5-alpha (#319)
release-please[bot] Jul 2, 2024
df67ed5
chore(deps): update all dependencies (#315)
renovate-bot Jul 3, 2024
df02c0f
chore(python): Use latest python runtime in prerelease_deps session (…
gcf-owl-bot[bot] Jul 4, 2024
f2301c8
test: Remove try/except for mock import (#281)
holtskinner Jul 8, 2024
3504df9
chore(deps): update all dependencies (#322)
renovate-bot Jul 8, 2024
ec08e7d
chore(python): use python 3.10 for docs build (#330)
gcf-owl-bot[bot] Jul 8, 2024
98e7bc1
chore(deps): update dependency google-cloud-documentai to v2.29.2 (#331)
renovate-bot Jul 9, 2024
9cc07cc
fix: Add support for Classifier entities (#333)
holtskinner Jul 16, 2024
e91fdb0
feat: Add Support for Layout Parser Documents (#334)
holtskinner Jul 16, 2024
cb11ead
chore(main): release 0.14.0-alpha (#337)
release-please[bot] Jul 25, 2024
d38bfe4
chore(deps): update all dependencies (#338)
renovate-bot Jul 25, 2024
0cba81f
chore(deps): update dependency google-cloud-documentai to v2.29.3 (#340)
renovate-bot Jul 31, 2024
cdd6211
chore(deps): update dependency google-cloud-documentai to v2.30.0 (#342)
renovate-bot Aug 7, 2024
d9edfa9
chore(python): fix docs build (#341)
gcf-owl-bot[bot] Aug 8, 2024
072f09d
chore(deps): update all dependencies (#347)
renovate-bot Sep 4, 2024
dd2e21f
chore(python): update unittest workflow template (#348)
gcf-owl-bot[bot] Sep 4, 2024
93fe2b2
build(python): release script update (#351)
gcf-owl-bot[bot] Sep 16, 2024
563ba62
chore: Update setup.py to accept Document AI client library >=3.0.0 (…
holtskinner Oct 25, 2024
8bb7344
chore(deps): update all dependencies (#352)
renovate-bot Oct 25, 2024
5364fb9
build: use multiScm for Kokoro release builds (#358)
gcf-owl-bot[bot] Oct 31, 2024
8c97cb1
chore: Add proto-plus dependency for Python 3.13 (#356)
holtskinner Oct 31, 2024
caae6d9
chore(python): remove obsolete release scripts and config files (#359)
gcf-owl-bot[bot] Nov 11, 2024
22824e4
chore(python): update dependencies in .kokoro/docker/docs (#360)
gcf-owl-bot[bot] Nov 12, 2024
bd77489
chore(deps): update all dependencies (#357)
renovate-bot Nov 15, 2024
5c3f9c2
chore(python): update dependencies in .kokoro/docker/docs (#363)
gcf-owl-bot[bot] Dec 17, 2024
52f114f
chore(main): release 0.14.1-alpha (#365)
release-please[bot] Dec 17, 2024
8b7b897
chore(python): Update the python version in docs presubmit to use 3.1…
gcf-owl-bot[bot] Jan 3, 2025
498a357
chore(python): exclude .github/workflows/unittest.yml in renovate con…
gcf-owl-bot[bot] Jan 9, 2025
04a95f9
chore(deps): bump jinja2 from 3.1.4 to 3.1.5 in /.kokoro (#369)
dependabot[bot] Jan 9, 2025
29b9541
chore(python): fix docs publish build (#372)
gcf-owl-bot[bot] Jan 16, 2025
f69f663
chore(deps): bump cryptography from 43.0.1 to 44.0.1 in /.kokoro (#376)
dependabot[bot] Feb 11, 2025
baefcea
chore(python): conditionally load credentials in .kokoro/build.sh (#377)
gcf-owl-bot[bot] Feb 24, 2025
81ee999
chore(deps): update all dependencies (#362)
renovate-bot Mar 5, 2025
8ad946f
chore: remove unused files (#379)
parthea Mar 5, 2025
ebc6dc5
fix: allow google-cloud-storage 3.x (#378)
parthea Mar 5, 2025
7119b1f
chore(main): release 0.14.2-alpha (#381)
release-please[bot] Mar 5, 2025
ad93114
fix: remove setup.cfg configuration for creating universal wheels (#383)
parthea Mar 14, 2025
fa8038f
feat: Add support for Python 3.14 (#390)
daniel-sanche Dec 11, 2025
b810dd8
chore(main): release 0.15.0-alpha (#384)
release-please[bot] Dec 15, 2025
ce0b4c5
fix: Update `storage.Blob.from_string()` to `from_uri()` (#385)
holtskinner Jan 14, 2026
5cab8dc
chore(main): release 0.15.1-alpha (#393)
release-please[bot] Jan 14, 2026
eabdf35
Merge remote-tracking branch 'remote.googleapis/python-documentai-too…
parthea Mar 2, 2026
71a7a89
Trigger owlbot post-processor
parthea Mar 2, 2026
729e243
build: google-cloud-documentai-toolbox migration: adjust owlbot-relat…
parthea Mar 2, 2026
784a580
tests: fix build
parthea Mar 2, 2026
7328538
chore: delete unused files
chalmerlowe Mar 3, 2026
781d121
chore: delete unused files, part 2
chalmerlowe Mar 3, 2026
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
16 changes: 16 additions & 0 deletions packages/google-cloud-documentai-toolbox/.OwlBot.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,16 @@
# Copyright 2021 Google LLC
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.


begin-after-commit-hash: 92006bb3cdc84677aa93c7f5235424ec2b157146
39 changes: 39 additions & 0 deletions packages/google-cloud-documentai-toolbox/.coveragerc
Original file line number Diff line number Diff line change
@@ -0,0 +1,39 @@
# -*- coding: utf-8 -*-
#
# Copyright 2024 Google LLC
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# https://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.

# Generated by synthtool. DO NOT EDIT!
[run]
branch = True
omit =
google/__init__.py
google/cloud/__init__.py

[report]
fail_under = 98
show_missing = True
exclude_lines =
# Re-enable the standard pragma
pragma: NO COVER
# Ignore debug-only repr
def __repr__
# Ignore abstract methods
raise NotImplementedError
omit =
*/gapic/*.py
*/proto/*.py
*/core/*.py
*/site-packages/*.py
google/cloud/__init__.py
33 changes: 33 additions & 0 deletions packages/google-cloud-documentai-toolbox/.flake8
Original file line number Diff line number Diff line change
@@ -0,0 +1,33 @@
# -*- coding: utf-8 -*-
#
# Copyright 2024 Google LLC
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# https://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.

# Generated by synthtool. DO NOT EDIT!
[flake8]
ignore = E203, E231, E266, E501, W503
exclude =
# Exclude generated code.
**/proto/**
**/gapic/**
**/services/**
**/types/**
*_pb2.py

# Standard linting exemptions.
**/.nox/**
__pycache__,
.git,
*.pyc,
conf.py
14 changes: 14 additions & 0 deletions packages/google-cloud-documentai-toolbox/.repo-metadata.json
Original file line number Diff line number Diff line change
@@ -0,0 +1,14 @@
{
"name": "documentai-toolbox",
"name_pretty": "Document AI Toolbox",
"issue_tracker": "https://github.com/googleapis/python-documentai-toolbox/issues",
"client_documentation": "https://cloud.google.com/python/docs/reference/documentai-toolbox/latest",
"release_level": "preview",
"language": "python",
"library_type": "OTHER",
"repo": "googleapis/python-documentai-toolbox",
"distribution_name": "google-cloud-documentai-toolbox",
"requires_billing": true,
"default_version": "v1",
"codeowner_team": "@googleapis/cdpe-cloudai"
}
307 changes: 307 additions & 0 deletions packages/google-cloud-documentai-toolbox/CHANGELOG.md

Large diffs are not rendered by default.

285 changes: 285 additions & 0 deletions packages/google-cloud-documentai-toolbox/CONTRIBUTING.rst
Original file line number Diff line number Diff line change
@@ -0,0 +1,285 @@
.. Generated by synthtool. DO NOT EDIT!
############
Contributing
############

#. **Please sign one of the contributor license agreements below.**
#. Fork the repo, develop and test your code changes, add docs.
#. Make sure that your commit messages clearly describe the changes.
#. Send a pull request. (Please Read: `Faster Pull Request Reviews`_)

.. _Faster Pull Request Reviews: https://github.com/kubernetes/community/blob/master/contributors/guide/pull-requests.md#best-practices-for-faster-reviews

.. contents:: Here are some guidelines for hacking on the Google Cloud Client libraries.

***************
Adding Features
***************

In order to add a feature:

- The feature must be documented in both the API and narrative
documentation.

- The feature must work fully on the following CPython versions:
3.8, 3.9, 3.10, 3.11, 3.12, 3.13 and 3.14 on both UNIX and Windows.

- The feature must not add unnecessary dependencies (where
"unnecessary" is of course subjective, but new dependencies should
be discussed).

****************************
Using a Development Checkout
****************************

You'll have to create a development environment using a Git checkout:

- While logged into your GitHub account, navigate to the
``python-documentai-toolbox`` `repo`_ on GitHub.

- Fork and clone the ``python-documentai-toolbox`` repository to your GitHub account by
clicking the "Fork" button.

- Clone your fork of ``python-documentai-toolbox`` from your GitHub account to your local
computer, substituting your account username and specifying the destination
as ``hack-on-python-documentai-toolbox``. E.g.::

$ cd ${HOME}
$ git clone git@github.com:USERNAME/python-documentai-toolbox.git hack-on-python-documentai-toolbox
$ cd hack-on-python-documentai-toolbox
# Configure remotes such that you can pull changes from the googleapis/python-documentai-toolbox
# repository into your local repository.
$ git remote add upstream git@github.com:googleapis/python-documentai-toolbox.git
# fetch and merge changes from upstream into main
$ git fetch upstream
$ git merge upstream/main

Now your local repo is set up such that you will push changes to your GitHub
repo, from which you can submit a pull request.

To work on the codebase and run the tests, we recommend using ``nox``,
but you can also use a ``virtualenv`` of your own creation.

.. _repo: https://github.com/googleapis/python-documentai-toolbox

Using ``nox``
=============

We use `nox <https://nox.readthedocs.io/en/latest/>`__ to instrument our tests.

- To test your changes, run unit tests with ``nox``::
$ nox -s unit

- To run a single unit test::

$ nox -s unit-3.14 -- -k <name of test>


.. note::

The unit tests and system tests are described in the
``noxfile.py`` files in each directory.

.. nox: https://pypi.org/project/nox/

*****************************************
I'm getting weird errors... Can you help?
*****************************************

If the error mentions ``Python.h`` not being found,
install ``python-dev`` and try again.
On Debian/Ubuntu::

$ sudo apt-get install python-dev

************
Coding Style
************
- We use the automatic code formatter ``black``. You can run it using
the nox session ``blacken``. This will eliminate many lint errors. Run via::

$ nox -s blacken

- PEP8 compliance is required, with exceptions defined in the linter configuration.
If you have ``nox`` installed, you can test that you have not introduced
any non-compliant code via::

$ nox -s lint

- In order to make ``nox -s lint`` run faster, you can set some environment
variables::

export GOOGLE_CLOUD_TESTING_REMOTE="upstream"
export GOOGLE_CLOUD_TESTING_BRANCH="main"

By doing this, you are specifying the location of the most up-to-date
version of ``python-documentai-toolbox``. The
remote name ``upstream`` should point to the official ``googleapis``
checkout and the branch should be the default branch on that remote (``main``).

- This repository contains configuration for the
`pre-commit <https://pre-commit.com/>`__ tool, which automates checking
our linters during a commit. If you have it installed on your ``$PATH``,
you can enable enforcing those checks via:

.. code-block:: bash

$ pre-commit install
pre-commit installed at .git/hooks/pre-commit

Exceptions to PEP8:

- Many unit tests use a helper method, ``_call_fut`` ("FUT" is short for
"Function-Under-Test"), which is PEP8-incompliant, but more readable.
Some also use a local variable, ``MUT`` (short for "Module-Under-Test").

********************
Running System Tests
********************

- To run system tests, you can execute::

# Run all system tests
$ nox -s system

# Run a single system test
$ nox -s system-3.14 -- -k <name of test>


.. note::

System tests are only configured to run under Python 3.9 and 3.14.
For expediency, we do not run them in older versions of Python 3.

This alone will not run the tests. You'll need to change some local
auth settings and change some configuration in your project to
run all the tests.

- System tests will be run against an actual project. You should use local credentials from gcloud when possible. See `Best practices for application authentication <https://cloud.google.com/docs/authentication/best-practices-applications#local_development_and_testing_with_the>`__. Some tests require a service account. For those tests see `Authenticating as a service account <https://cloud.google.com/docs/authentication/production>`__.

*************
Test Coverage
*************

- The codebase *must* have 100% test statement coverage after each commit.
You can test coverage via ``nox -s cover``.

******************************************************
Documentation Coverage and Building HTML Documentation
******************************************************

If you fix a bug, and the bug requires an API or behavior modification, all
documentation in this package which references that API or behavior must be
changed to reflect the bug fix, ideally in the same commit that fixes the bug
or adds the feature.

Build the docs via:

$ nox -s docs

*************************
Samples and code snippets
*************************

Code samples and snippets live in the `samples/` catalogue. Feel free to
provide more examples, but make sure to write tests for those examples.
Each folder containing example code requires its own `noxfile.py` script
which automates testing. If you decide to create a new folder, you can
base it on the `samples/snippets` folder (providing `noxfile.py` and
the requirements files).

The tests will run against a real Google Cloud Project, so you should
configure them just like the System Tests.

- To run sample tests, you can execute::

# Run all tests in a folder
$ cd samples/snippets
$ nox -s py-3.8

# Run a single sample test
$ cd samples/snippets
$ nox -s py-3.8 -- -k <name of test>

********************************************
Note About ``README`` as it pertains to PyPI
********************************************

The `description on PyPI`_ for the project comes directly from the
``README``. Due to the reStructuredText (``rst``) parser used by
PyPI, relative links which will work on GitHub (e.g. ``CONTRIBUTING.rst``
instead of
``https://github.com/googleapis/python-documentai-toolbox/blob/main/CONTRIBUTING.rst``)
may cause problems creating links or rendering the description.

.. _description on PyPI: https://pypi.org/project/google-cloud-documentai-toolbox


*************************
Supported Python Versions
*************************

We support:

- `Python 3.8`_
- `Python 3.9`_
- `Python 3.10`_
- `Python 3.11`_
- `Python 3.12`_
- `Python 3.13`_
- `Python 3.14`_

.. _Python 3.8: https://docs.python.org/3.8/
.. _Python 3.9: https://docs.python.org/3.9/
.. _Python 3.10: https://docs.python.org/3.10/
.. _Python 3.11: https://docs.python.org/3.11/
.. _Python 3.12: https://docs.python.org/3.12/
.. _Python 3.13: https://docs.python.org/3.13/
.. _Python 3.14: https://docs.python.org/3.14/


Supported versions can be found in our ``noxfile.py`` `config`_.

.. _config: https://github.com/googleapis/python-documentai-toolbox/blob/main/noxfile.py


We also explicitly decided to support Python 3 beginning with version 3.8.
Reasons for this include:

- Encouraging use of newest versions of Python 3
- Taking the lead of `prominent`_ open-source `projects`_
- `Unicode literal support`_ which allows for a cleaner codebase that
works in both Python 2 and Python 3

.. _prominent: https://docs.djangoproject.com/en/1.9/faq/install/#what-python-version-can-i-use-with-django
.. _projects: http://flask.pocoo.org/docs/0.10/python3/
.. _Unicode literal support: https://www.python.org/dev/peps/pep-0414/

**********
Versioning
**********

This library follows `Semantic Versioning`_.

.. _Semantic Versioning: http://semver.org/

Some packages are currently in major version zero (``0.y.z``), which means that
anything may change at any time and the public API should not be considered
stable.

******************************
Contributor License Agreements
******************************

Before we can accept your pull requests you'll need to sign a Contributor
License Agreement (CLA):

- **If you are an individual writing original source code** and **you own the
intellectual property**, then you'll need to sign an
`individual CLA <https://developers.google.com/open-source/cla/individual>`__.
- **If you work for a company that wants to allow you to contribute your work**,
then you'll need to sign a
`corporate CLA <https://developers.google.com/open-source/cla/corporate>`__.

You can sign these electronically (just scroll to the bottom). After that,
we'll be able to accept your pull requests.
Loading
Loading