`tapshell` Usage

tapshell Usage
1. tapshell Command
2. tapshell Lib

`tapshell` Command

Display the resources

To display the resources in the Tapdata system, you can use the command show

`show datasources`

To list the datasources in the Tapdata system, use the command show datasources and get the return like below.

id     status     database_type        name
03f6b1: ready      mysql               test-mysql

Filed	Description
id	Datasource id
status	Datasource status(ready/invalid)
database_type	The database type of datasource
name	The name of datasource

`show jobs`

To list the jobs in the Tapdata system, use the command show jobs and get the return like below.

a2e1d7: Mysql-2-MongoDB                             draft    custom/initial_sync+cdc

Filed	Description
id	Job id
Name	The name of job
Status	The status of job
type	The type of job - cluster-clone/initial_sync - cluster-clone/initial_sync+cdc - custom/initial_sync - custom/initial_sync+cdc

`show apis`

To list the APIs in the Tapdata system, use the command show apis and get the return like below.

api_name             tablename            basePath             status     test url
AAAA                 AUTO_CLAIM_SOURCE_111 AAAA                 active     http://v1-52.devops.tapdata.net#/apiDocAndTest?id=AAAA_v1

Filed	Description
api_name	The name of api
tablename	The table of api
basePath	The base path of api
Status	The status of api
test url	The test url of api

`show tables`

After appoint a datasource by using use datasource_name, you can use show tables to list the tables of this datasource.

DataSource Command

Switch DataSource

To switch Datasource in the context, issue the use <datasource> helper, as in the following example.

use <datasource>

After appoint a datasource , you can use the command below to list table and get the schema of table:

show tables

desc <table>

Delete DataSource

To delete a DataSource ,use the command below.

delete datasource <datasource>

Job Command

`stop job <JobID/JobName>`

Stop the job with specified JobID/JobName.

`start job <JobID/JobName>`

Start the job with specified JobID/JobName.

`status job <JobID/JobName>`

Get the status of specified job.

`delete job <JobID/JobName>`

Delete the specified job.

`logs job <JobID/JobName> [options]`

Get log of specified job.

Options

Name	Required	Example	Description
t	No	t=2	The duration of tail method
limit	No	limit=5	Specify the last lines when the tail method started
tail	no	tail=True	Tail method(Default:false)
level	no	level=error	The log level (Default:info)

`monitor job <JobID/JobName>`

Display the job monitor status in real-time.

job Car_shop_job2 status: running, total sync table: 1, finished synced: 1, total rows: 4, finished rows: 4, speed: 0, estimated time: 0:00:00

Filed	Description
job	The name of job
Status	The status of job
total sync table	The number of table that the job has total synced
finished synced	The number of table that the job has total synced
total rows	The total rows of the tables in this job
finished rows	The total rows that the job has already synced
Speed	The real-time speed of this job
estimated time	The estimated time of this job

Load/Reload DataSource schema

`validate datasource <DataSourceID/DataSourceName>`

Load and Reload the schema of specified datasource.

`tapshell` Lib

Understand the Tapdata Shell Lib Syntax

The Tapdata shell Library consists of configuration and action. Each configuration declare the attributes of the library as they pass through the pipeline.

To create an Lib or make some action, use the following syntax in the Tapdata Shell:

<variable> = <lib>.[configurations]
<variable>.<action>

Example

Car_shop = DataSource("mysql","Car_shop").host("127.0.0.1").port(3306).username('root').password('password').db('Car_shop')
Car_shop.validate()
Car_shop.save()

DataSource

The DataSource Libary is used to create , declare ,validate, save and delete a DataSource.

Declare and configure a new DataSource:

My_DataSource = DataSource(<database_type>,<name>,<source_type>).[configurations]

Name	Required	Example	Description
database_type	Yes	mysql	The database type of the DataSource. Please refer to Pre-Build Connectors for details
name	Yes	“My_DS”	The name of DataSource(Must be unique)
source_type	no	source_and_target (Default) source target	The type of DataSource. For the types supported by each database, please refer to Pre-Build Connectors .
configurations	Yes		Each database type has its own configurations, please refer to Pre-Build Connectors

Trigger a DataSource action:

My_DataSource.<action>

Action

Name	Description
save()	Save the configured DataSource（Will do Validation at the same time）
validate()	Validate the configured DataSource
delete()	Delete the specific DataSource

Example

Create a mysql DataSource.

Car_shop = DataSource("mysql","Car_shop").host("127.0.0.1").port(3306).username('root').password('password').db('Car_shop')
Car_shop.save()

Pipeline

The Pipeline Libary is used to configure a job and trigger a job action.

Declare and configure a new DataSource:

My_Pipeline = Pipeline(<name>).[configurations]

Field	Required	Example	Description
name	Yes	“My_Pipeline”	Job name

Configurations

Field	Required	Example	Description
`readFrom(<Source>)`	Yes	readFrom(Car_shop.Customer)	Declare the Pipeline’s read source.Please refer to Source for Source’s details
`writeTo(<Sink>, <Relation>)`	Yes	writeTo(Car_shop.Customer_v1) writeTo(Car_shop.Customer_v1,writeMode=upsert)	Declare the Pipeline’s destination.Please refer to Sink and Relation for more details
`rename(string, string)`	No	rename(“filed1”,”filed2”)
`js(string)`	No	js(‘record[“updated”]=new Date() ;return record;’) js(“/mypath/modify.js”)	Please refer to JS Usage for more details
`filter(string)`	No	filter(‘’)	Only supported when the readFrom source is a single table
`filterColumn([])`	No	filterColumn([column1,column2],FilterType.keep or FilterType.delete)	Only supported when the readFrom source is a single table

Trigger a Job action:

My_Pipeline.<action>

Action

Name	Description
start()	Save the configured DataSource（Will do Validation at the same time）
stop()	Validate the configured DataSource
delete()	Delete the specific DataSource

Example

Car_shop_job1 = Pipeline("Car_shop_job1").readFrom(Car_shop.Customer).writeTo(Car_shop.Customer_v1,writeMode=upsert)

Car_shop_job1.start()

Source Type

Source is used to define and describe a specified DataSource to readFrom.The simplest way to use Source is<DataSource>.<table> like Car_shop.Customers which has the same effect as Source(Car_shop,Customers).

Source(<DataSource>, <table(string or [] or table_re="regular expression")>, <sql(string)>)

Field	Required	Example	Description
`DataSource`	Yes	Car_shop	DataSource name
`table`	Yes	Customers [Customers,Orders] table_re=”Customer.*”	The table of the DataSource. Can use regular expression by table_re.
`sql`	No	” select * from Customers where name=’Edison’ “	Filter conditions of table (Using SQL) Only supported when the table field is not a array .

Example

Declare two tables (Customer and Orders) to sync from the Car_shop DataSource.

Source(Car_shop, [Customers,Orders])
Source(Car_shop, table_re="Customer.*")

Sync table Customer from the Car_shop DataSource with codition.

Source(Car_shop, Customers,"select * from Customers where name='Edison'")

Sink Type

Sink is used to define and describe a specified DataSource to writeTo.The simplest way to use Sink is<DataSource>.<table> like Car_shop_1.Customers which has the same effect as Sink(Car_shop_1,Customers).

Sink(<DataSource>, <table(string or [])>)

Field	Required	Example	Description
`DataSource`	Yes	Car_shop_1	DataSource name
`table`	No	Customers [“Customers”,”Orders”]	The table of the DataSource. When not declare the table means move all the table decalre in the Source Type

Example

Declare two tables (Customer and Orders) for the writeTo destination Car_shop DataSource.

Sink(Car_shop, ["Customers","Orders"])

When not declare the table means move all the table decalre in the Source Type

Sink(Car_shop)

Relation Type

Based on different ReadFrom Source, the available relationship types are also different

SingleTableRelation

SingleTableRelation is used to define and describe the configuration of writeTo when the ReadFrom Source table is signle.

writeMode=<writeMode>,association=[(<column1>,<column2>)],path=<path>,array_key=<array_key>

Field	Required	Example	Description
`writeMode`	No	writeMode=upsert	Write Mode. insert/upsert/update/merge_embed Default is insert
`association`	No	association=[(“id”, “id”)]	The association of write mode.The specified fields will be used for matching.
`path`	No
`array_key`	No

MultiTableRelation

MultiTableRelation is used to define and describe the configuration of writeTo when the ReadFrom Source table is multiple.

prefix=<writeMode>,subfix=<subfix>,drop_type=<drop_type>

Field	Required	Example	Description
`prefix`	No	prefix=”abc_”
`subfix`	No	subfix=”xyz_”
`drop_type`	No	drop_type=DropType.data	Processing method when a table with the same name exists at the target end. DropType.no_drop, DropType.data, DropType.all Default is DropType.no_drop

tapshell Usage

Table of contents

tapshell Command

Display the resources

show datasources

show jobs

show apis

show tables

DataSource Command

Switch DataSource

Delete DataSource

Job Command

stop job <JobID/JobName>

start job <JobID/JobName>

status job <JobID/JobName>

delete job <JobID/JobName>

logs job <JobID/JobName> [options]

Options

monitor job <JobID/JobName>

Load/Reload DataSource schema

validate datasource <DataSourceID/DataSourceName>

tapshell Lib

Understand the Tapdata Shell Lib Syntax

Example

DataSource

Action

Example

Pipeline

Configurations

Action

Example

Source Type

Example

Sink Type

Example

Relation Type

SingleTableRelation

MultiTableRelation

`tapshell` Usage

`tapshell` Command

`show datasources`

`show jobs`

`show apis`

`show tables`

`stop job <JobID/JobName>`

`start job <JobID/JobName>`

`status job <JobID/JobName>`

`delete job <JobID/JobName>`

`logs job <JobID/JobName> [options]`

`monitor job <JobID/JobName>`

`validate datasource <DataSourceID/DataSourceName>`

`tapshell` Lib