Add processor type #449
Replies: 4 comments 4 replies
-
Dataplane could provide data source management (#453 is built on it) to manage and reuse various data connections, such as MySQL, Greenplum, etc. When using a SQL processor, you could then select which data connection to run against.
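To make the idea concrete, here is a minimal sketch of named, reusable connections that a SQL processor step selects by name. Everything in it is an assumption for illustration: `register_connection`, `get_connection` and `run_sql_processor` are hypothetical names, not Dataplane APIs, and the stdlib `sqlite3` in-memory database stands in for MySQL or Greenplum so the example runs without extra dependencies.

```python
import sqlite3
from typing import Callable, Dict, List, Tuple

# Hypothetical registry: connection name -> factory that opens the connection.
# A real setup would register MySQL, Greenplum, etc. instead of SQLite.
ConnectionFactory = Callable[[], sqlite3.Connection]
CONNECTIONS: Dict[str, ConnectionFactory] = {}
_OPEN: Dict[str, sqlite3.Connection] = {}


def register_connection(name: str, factory: ConnectionFactory) -> None:
    """Make a connection available to processors under a given name."""
    CONNECTIONS[name] = factory


def get_connection(name: str) -> sqlite3.Connection:
    """Open the named connection on first use and reuse it afterwards."""
    if name not in _OPEN:
        _OPEN[name] = CONNECTIONS[name]()
    return _OPEN[name]


def run_sql_processor(connection_name: str, statement: str) -> List[Tuple]:
    """Hypothetical SQL processor step: select a connection, run one statement."""
    conn = get_connection(connection_name)
    rows = conn.execute(statement).fetchall()
    conn.commit()
    return rows


# Assumed connection name "warehouse"; SQLite in memory as a stand-in database.
register_connection("warehouse", lambda: sqlite3.connect(":memory:"))
run_sql_processor("warehouse", "CREATE TABLE orders (id INTEGER, amount REAL)")
run_sql_processor("warehouse", "INSERT INTO orders VALUES (1, 9.5), (2, 20.0)")
total = run_sql_processor("warehouse", "SELECT SUM(amount) FROM orders")
print(total)
```

The point of the registry is that several processors can refer to the same connection by name, so credentials and connection setup live in one place rather than in each step.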
  
-
For ETL-related processors, data lineage should also be considered (#451).
  
-
An HTTP processor would mainly be used to call serverless functions (such as OpenFaaS), which are also part of data processing.
  
-
Hi @ChenpiDog, thank you for all the good feedback on lineage, data management and processors.

For SQL statements, I am busy working on a Dataplane Python library that will make it easy to use SQL to pull data, perform transformations and push data. Is there a reason why you wouldn't use SQL inside Python? I think it makes sense with SQLAlchemy because adapters are available for the different databases, including Greenplum, for example: https://pypi.org/project/sqlalchemy-greenplum/

For the HTTP processor, using requests in Python can achieve that if it is a short-running process (see example below). If it is a long-running process, I agree we need a delayed response, i.e. send a request to the serverless function with a callback hook to resume at the next step in the pipeline once the process is complete. I can definitely add that to our roadmap, and I think it is quite a powerful capability in general.

import requests
# Short-running call: the request blocks until the response arrives.
response = requests.get('https://w3schools.com/python/demopage.htm')
print(response.text)
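For the long-running case, the callback idea could look roughly like the sketch below. It is only an illustration under stated assumptions: `PipelineRun`, the `http:` step prefix and `fake_send` are hypothetical, nothing here is an existing Dataplane feature, and the HTTP request is simulated in memory so the example runs without a network.

```python
import uuid
from typing import Callable, List, Optional, Tuple


class PipelineRun:
    """Hypothetical pipeline that pauses on an HTTP step until a callback arrives."""

    def __init__(self, steps: List[str], send_request: Callable[[str, str], None]) -> None:
        self.steps = steps
        self.send_request = send_request
        self.position = 0
        self.waiting_token: Optional[str] = None
        self.log: List[str] = []

    def run(self) -> None:
        """Run steps in order; stop at an 'http:' step and wait for resume()."""
        while self.position < len(self.steps):
            step = self.steps[self.position]
            if step.startswith("http:"):
                # Hand the function a one-time token it must send back when done.
                self.waiting_token = uuid.uuid4().hex
                self.send_request(step[len("http:"):], self.waiting_token)
                self.log.append(f"waiting: {step}")
                return
            self.log.append(f"ran: {step}")
            self.position += 1

    def resume(self, token: str) -> bool:
        """Callback hook: continue with the next step if the token matches."""
        if self.waiting_token is None or token != self.waiting_token:
            return False
        self.waiting_token = None
        self.log.append(f"completed: {self.steps[self.position]}")
        self.position += 1
        self.run()
        return True


sent: List[Tuple[str, str]] = []


def fake_send(function_name: str, callback_token: str) -> None:
    # Stand-in for an HTTP POST to the serverless function with a callback URL.
    sent.append((function_name, callback_token))


pipeline = PipelineRun(["extract", "http:resize-images", "load"], fake_send)
pipeline.run()                   # pauses at the HTTP step
pipeline.resume(sent[0][1])      # the function calls back when it has finished
print(pipeline.log)
```

In a real deployment the token would be embedded in a callback URL and the paused state would be persisted, so a worker restart does not lose the waiting run.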
  
-
Dataplane could consider adding SQL and HTTP processor types, especially SQL, which is very common in ETL.