kafka-connect-ftp | Kafka Connect Source for FTP server | Pub Sub library
kandi X-RAY | kafka-connect-ftp Summary
kandi X-RAY | kafka-connect-ftp Summary
Monitors files on an FTP server and feeds changes into Kafka. Remote directories of interest are to be provided. On a specified interval, the list of files in the directories is refreshed. Files are downloaded when they were not known before, or when their timestamp or size are changed. Only files with a timestamp younger than the specified maximum age are considered. Hashes of the files are maintained and used to check for content changes. Changed files are then fed into Kafka, either as a whole (update) or only the appended part (tail), depending on the configuration. Optionally, file bodies can be transformed through a pluggable system prior to putting it into Kafka.
Support
Quality
Security
License
Reuse
Top functions reviewed by kandi - BETA
Currently covering the most popular Java, JavaScript and Python libraries. See a Sample of kafka-connect-ftp
kafka-connect-ftp Key Features
kafka-connect-ftp Examples and Code Snippets
Community Discussions
Trending Discussions on kafka-connect-ftp
QUESTION
I am using Kafka connect with the kafka-connect-ftp plugin by Landoop (https://docs.lenses.io/connectors/source/ftp.html .)
Is there a way to use connect.ftp.monitor.tail
with a recursive path definition, in order to track all files in a directory recursively?
I've tried to do something like
...ANSWER
Answered 2019-Sep-11 at 07:03Looks like the code only supports single star glob patterns, and matches on filenames, not directories
Sounds like a good idea for a github issue, though
QUESTION
While running the connectors for kafka-connect-ftp showing this below error
...ANSWER
Answered 2018-Jan-16 at 19:01There are two main issues with this connector:
- It bundles, along with its dependencies, classes from Connect's API. E.g. classes in the package
org.apache.kafka.connect
. This is not advised and such dependencies should be marked as provided. - The actual reason that the connector fails with classloading issues is that it is depending (at least according to its
master
branch) on a version of Apache Kafka that probably does not match the version of the deployed Connect worker. Specifically, it depends onkafkaVersion = '0.10.2.0'
which is not the latest. Kafka Connect, in its recent versions that offer classloading isolation, will ignore what it considers system classes, such as classes inorg.apache.kafka.connect
when they are imported by connectors' jars. Instead, it will load such classes from the Kafka Connect jars that ship with Apache Kafka.
The above issues may cause classloading failures, as the ones you observe.
Ideally, they should be addressed at the connector level.
Workarounds you may apply are:
- build the connector code from source, after upgrading the Kafka version it depends upon and also marking its Kafka dependencies as provided (including Kafka Connect and Kafka Clients dependencies). Or,
- downgrade your deployed Kafka Connect version to match exactly the version that the connector currently depends upon.
A similar issue has been recorded here: kafka mongodb sink connector not starting
Community Discussions, Code Snippets contain sources that include Stack Exchange Network
Vulnerabilities
No vulnerabilities reported
Install kafka-connect-ftp
Support
Reuse Trending Solutions
Find, review, and download reusable Libraries, Code Snippets, Cloud APIs from over 650 million Knowledge Items
Find more librariesStay Updated
Subscribe to our newsletter for trending solutions and developer bootcamps
Share this Page