
Unable to remove properties from the deals schema #119

Open
dkarzon opened this issue May 15, 2020 · 6 comments

@dkarzon

dkarzon commented May 15, 2020

I am trying to set up a HubSpot tap with a Postgres target, and I keep getting an error about Postgres trying to create a table with more than 1600 columns in it, even though at the time my schema only had 4 properties.

However, going through the code, it looks like when deals is selected as a stream, the schema is automatically built from the JSON output of this API call: https://api.hubapi.com/properties/v1/deals/properties
See code here: https://github.com/singer-io/tap-hubspot/blob/master/tap_hubspot/__init__.py#L191
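For context, the dynamic schema construction can be sketched roughly like this (a simplified, hypothetical rendition; the function name is illustrative, and the real logic is in the linked `__init__.py`):

```python
# Simplified sketch of how tap-hubspot derives the deals schema from the
# properties API response (illustrative only; see the linked __init__.py
# for the actual implementation).

def schema_from_properties(properties: list) -> dict:
    """Fold every property returned by
    https://api.hubapi.com/properties/v1/deals/properties into the stream
    schema -- note that catalog selection is never consulted here."""
    schema = {"type": "object", "properties": {}}
    for prop in properties:
        schema["properties"][prop["name"]] = {"type": ["null", "string"]}
    return schema

# Every property the portal defines ends up in the emitted schema:
api_response = [{"name": "dealname"}, {"name": "dealstage"}]
print(sorted(schema_from_properties(api_response)["properties"]))
# ['dealname', 'dealstage']
```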

Is there a way to modify the computed schema for the deals stream at all, to remove the properties that I don't need? I haven't been able to find a way to do that so far.

My deals schema:

{
"streams": [
    {
        "stream": "deals",
        "tap_stream_id": "deals",
        "key_properties": ["dealId"],
        "schema": {
            "type": "object",
            "properties": {
                "portalId": {
                    "type": [
                        "null",
                        "integer"
                    ]
                },
                "dealId": {
                    "type": [
                        "null",
                        "integer"
                    ]
                },
                "dealname": {
                    "type": [
                        "null",
                        "string"
                    ]
                },
                "dealstage": {
                    "type": [
                        "null",
                        "string"
                    ]
                }
            }
        },
        "metadata": [
            {
                "breadcrumb": [ ],
                "metadata": {
                    "selected": true,
                    "table-key-properties": [
                        "dealId"
                    ],
                    "forced-replication-method": "INCREMENTAL",
                    "valid-replication-keys": [
                        "hs_lastmodifieddate"
                    ]
                }
            }
        ]
    }
]}
@zyanichaimaa

Have you fixed your problem?

@gmontanola

The same is happening to me. Has anyone had any luck with this?

@briansloane
Contributor

Are you using the catalog to choose the fields that you want via selected metadata? That should allow you to limit the fields that get emitted even though the schema has all the properties in it.
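For anyone trying this, per-field selection lives in the catalog's metadata entries, keyed by breadcrumb. A hypothetical entry deselecting one property would look like this (the property name is illustrative):

```json
{
    "breadcrumb": ["properties", "some_unwanted_property"],
    "metadata": {
        "selected": false
    }
}
```

Note that, as the comments below observe, this limits which fields get emitted but does not change the schema the tap sends to the target.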

@gmontanola

Yes, I am! I've selected only 4-5 properties using "selected": true, and the others are explicitly set to false.

@gmontanola

gmontanola commented Sep 9, 2020

Well, I've done some testing and:

  1. The schema is generated using all the available properties for an object (not just the selected ones, as @dkarzon described).
  2. The 1600-column limit is reached because each property is an object with 4 keys (value, timestamp, source, sourceId), and this results in 4 new columns per property.
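The arithmetic checks out: with the 4-key explosion, a portal needs only around 400 deal properties to hit Postgres's hard 1600-column ceiling. A quick sanity check (the property count of 400 is a hypothetical example):

```python
# Why ~400 HubSpot deal properties blow past Postgres's column limit when
# each property is flattened into 4 sub-columns by the target.
SUB_KEYS = ("value", "timestamp", "source", "sourceId")
POSTGRES_COLUMN_LIMIT = 1600  # hard per-table limit in Postgres

def flattened_columns(num_properties: int) -> int:
    """Columns a flattening target must create for the deals table."""
    return num_properties * len(SUB_KEYS)

print(flattened_columns(400))  # 1600 -- already at the Postgres ceiling
```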

@staufman

staufman commented Feb 4, 2021

In case anyone else is hitting this, it's a real bummer. I'm not intimately familiar with the code in this repo but for now, I went into tap_hubspot/__init__.py (locally) to line 149 and changed it from if extras: to if False and extras:.

Yes, it's a hack and yes, I don't quite understand the ramifications of not syncing the extra data associated with properties. At the same time, it prevents the explosion of columns needed to pipe the data in Postgres which might be all people need.
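To make the effect of that change concrete, here is a hypothetical sketch (not the tap's actual code) of what the `extras` branch contributes to each property's schema:

```python
# Sketch of what skipping the `extras` branch changes (illustrative only;
# the real logic lives in tap_hubspot/__init__.py around line 149).

def property_schema(include_extras: bool) -> dict:
    """With extras, each property is an object of 4 sub-fields, which a SQL
    target like target-postgres flattens into 4 columns; without extras,
    it stays a single scalar column."""
    value_type = {"type": ["null", "string"]}
    if include_extras:  # the branch the `if False and extras:` patch disables
        return {
            "type": "object",
            "properties": {
                "value": value_type,
                "timestamp": {"type": ["null", "integer"]},
                "source": value_type,
                "sourceId": value_type,
            },
        }
    return value_type

print(len(property_schema(True)["properties"]))  # 4 columns per property
print("properties" in property_schema(False))    # False -> 1 column per property
```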
