WAVE: Feed Item extract performs poorly due to Sharing accessing check
Last updated 2022-02-10 ·Reference W-3404021 ·Reported By 10 users
Summary
When trying to extract Feed Items, users may receive the following error (or similar): There was an error executing the XXX node: Get (org specific URL here): net/http: timeout awaiting response headers
Feed Items are not fully supported at this point, and the types of issues the customers can run into include (a) data flow execution fails with a time-out error, or (b) data flow execution succeeds, but the resulting feed item dataset contains only a subset of feed items from the org.
Dataflow jobs can potentially process a maximum of 100,000 FeedItem records but that limit can be reached when much fewer FeedItem records are processed if Unlisted Groups has been enabled in the Org.
Repro
1. Create Dataflow that extracts Feed Items.
2. Run Dataflow.
3. Dataflow will complete with error message similar to the one above.
Workaround
If Unlisted Groups has been enabled in the Org and an "OPERATION_TOO_LARGE" error is returned when extracting fewer than 100K FeedItem records you can add "enablePKChunking:false" on the sfdcDigest node to workaround the error.
Is it Fixed?
Any unreleased services, features, statuses, or dates referenced in this or other public statements are not currently available and may not be delivered on time or at all. Customers who purchase our services should make their purchase decisions based upon features that are currently available.