名称:ftpreader
协议 | 是否支持 |
---|---|
FTP | 支持 |
SFTP | 支持 |
FTP服务搭建 windows:地址 linux:地址 sftp服务搭建 windows:地址 linux:地址
- protocol
- 描述:ftp服务器协议,目前支持传输协议有
ftp
、sftp
- 必选:是
- 字段类型:string
- 默认值:无
- 描述:ftp服务器协议,目前支持传输协议有
- host
- 描述:ftp服务器地址
- 必选:是
- 字段类型:string
- 默认值:无
- port
- 描述:ftp服务器端口
- 必选:否
- 字段类型:int
- 默认值:若传输协议是sftp协议,默认值是22;若传输协议是标准ftp协议,默认值是21
- username
- 描述:ftp服务器访问用户名
- 必选:是
- 字段类型:string
- 默认值:无
- password
- 描述:ftp服务器访问密码
- 必选:否
- 字段类型:string
- 默认值:无
- path
- 描述:远程FTP文件系统的路径信息,注意这里可以支持填写多个路径,多个路径以
,
隔开 - 必选:是
- 字段类型:string
- 默认值:无
- 描述:远程FTP文件系统的路径信息,注意这里可以支持填写多个路径,多个路径以
- fieldDelimiter
- 描述:读取的字段分隔符
- 必选:是
- 字段类型:string
- 默认值:
,
- encoding
- 描述:读取文件的编码配置
- 必选:否
- 字段类型:string
- 默认值:
UTF-8
- isFirstLineHeader
- 描述:首行是否为标题行,如果是则不读取第一行
- 必选:否
- 字段类型:boolean
- 默认值:false
- timeout
- 描述:连接超时时间,单位毫秒
- 必选:否
- 字段类型:long
- 默认值:5000
- column
- 描述:需要读取的字段
- 格式:支持2种格式
1.读取全部字段,如果字段数量很多,可以使用下面的写法:
"column":["*"]
2.指定具体信息:
"column": [{
"index": 0,
"type": "datetime",
"format": "yyyy-MM-dd hh:mm:ss",
"value": "value"
}]
- 属性说明:
- index:字段索引
- type:字段类型,ftp读取的为文本文件,本质上都是字符串类型,这里可以指定要转成的类型
- format:如果字段是时间字符串,可以指定时间的格式,将字段类型转为日期格式返回
- value:如果没有指定index,则会把value的值作为常量列返回,如果指定了index,当读取的字段的值为null时,会以此value值作为默认值返回
- 必选:是
- 字段类型:数组
- 默认值:无
- connectPattern
- 描述:协议为ftp时的连接模式,可选
pasv
,port
,参数含义可参考:模式说明 - 必选:否
- 字段类型:string
- 默认值:
PASV
- 描述:协议为ftp时的连接模式,可选
- privateKeyPath
- 描述:私钥文件路径
- 必选:否
- 字段类型:string
- 默认值:无
{
"job": {
"content": [
{
"reader": {
"parameter": {
"path": "/data/ftp/flinkx/file1.csv",
"protocol": "sftp",
"port": 22,
"isFirstLineHeader": true,
"host": "localhost",
"column": [
{
"index": 0,
"type": "string"
},
{
"index": 1,
"type": "string"
},
{
"index": 2,
"type": "int"
},
{
"index": 3,
"type": "int"
}
],
"password": "pass",
"fieldDelimiter": ",",
"encoding": "utf-8",
"username": "user"
},
"name": "ftpreader"
},
"writer": {
"parameter": {},
"name": "streamwriter"
}
}
],
"setting": {
"restore": {
"maxRowNumForCheckpoint": 0,
"isRestore": false,
"restoreColumnName": "",
"restoreColumnIndex": 0
},
"errorLimit": {
"record": 100
},
"speed": {
"bytes": 0,
"channel": 1
}
}
}
}
{
"job": {
"content": [
{
"reader": {
"parameter": {
"path": "/data/ftp/flinkx/dir1",
"protocol": "sftp",
"port": 22,
"isFirstLineHeader": true,
"host": "localhost",
"column": [
{
"index": 0,
"type": "string"
},
{
"index": 1,
"type": "string"
},
{
"index": 2,
"type": "int"
},
{
"index": 3,
"type": "int"
}
],
"password": "pass",
"fieldDelimiter": ",",
"encoding": "utf-8",
"username": "user"
},
"name": "ftpreader"
},
"writer": {
"parameter": {},
"name": "streamwriter"
}
}
],
"setting": {
"speed": {
"channel": 1
}
}
}
}
{
"job": {
"content": [
{
"reader": {
"parameter": {
"path": "/data/ftp/flinkx/dir1,/data/ftp/flinkx/dir2",
"protocol": "sftp",
"port": 22,
"isFirstLineHeader": true,
"host": "host",
"column": [
{
"index": 0,
"type": "string"
},
{
"index": 1,
"type": "string"
},
{
"index": 2,
"type": "int"
},
{
"index": 3,
"type": "int"
}
],
"password": "pass",
"fieldDelimiter": ",",
"encoding": "utf-8",
"username": "user"
},
"name": "ftpreader"
},
"writer": {
"parameter": {},
"name": "streamwriter"
}
}
],
"setting": {
"speed": {
"channel": 1
}
}
}
}
{
"job": {
"content": [
{
"reader": {
"parameter": {
"isFirstLineHeader": false,
"column": [
{
"index": 0,
"type": "STRING",
"key": 0
},
{
"index": 1,
"type": "STRING",
"key": 1
}
],
"fieldDelimiter": ",",
"encoding": "utf-8",
"path": "/data/a.csv",
"protocol": "ftp",
"password": "passwd",
"connectMode": "PORT",
"port": 21,
"host": "host",
"username": "usname"
},
"name": "ftpreader"
},
"writer": {
"parameter": {
"print": true
},
"name": "streamwriter"
}
}
],
"setting": {
"speed": {
"channel": 1
}
}
}
}