; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0010561 (gene) of Snake gourd v1 genome

Gene IDTan0010561
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionEnzymatic polyprotein
Genome locationLG01:48407411..48413726
RNA-Seq ExpressionTan0010561
SyntenyTan0010561
Gene Ontology termsGO:0005488 - binding (molecular function)
GO:0016787 - hydrolase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0049709.1 hypothetical protein E6C27_scaffold76G00530 [Cucumis melo var. makuwa]2.5e-0734.08Show/hide
Query:  IDKNESIIRFHSEYGIGKSPTSSSN-IPAPMSA---------INTQWIRFPSDSMESLTP-----PPRPSASLMRPT-AVTPMRPSVSPTTPSSSQQDLQ
        ++K  +  R +S    G  P+SSSN  P+PMSA               R  S S+   +P     PP+PS +L+RP+  V  MR  VSP++   S     
Subjt:  IDKNESIIRFHSEYGIGKSPTSSSN-IPAPMSA---------INTQWIRFPSDSMESLTP-----PPRPSASLMRPT-AVTPMRPSVSPTTPSSSQQDLQ

Query:  SLQPLLKPSILPKENAEFLNSKSKLLAALAQTTTEADLQKILNQVTSIASLSSPQNSEVEEDD-----EYDINDPFLDS
        +    + P           + +SKLLAALAQ TT+AD Q++L+   + +S S   +S ++ED+     EYD++DP+LDS
Subjt:  SLQPLLKPSILPKENAEFLNSKSKLLAALAQTTTEADLQKILNQVTSIASLSSPQNSEVEEDD-----EYDINDPFLDS

KAA0057417.1 Enzymatic polyprotein [Cucumis melo var. makuwa]2.5e-0751.79Show/hide
Query:  GSTILIEANLDKSSITVPKSLSWEQITRNPTWKLTEAFTPPKRNSNLAQIIEHTDG
        G T+L+E N++KSS+T+P++L W+++T+NP WKL    TP KR+S  A IIE  DG
Subjt:  GSTILIEANLDKSSITVPKSLSWEQITRNPTWKLTEAFTPPKRNSNLAQIIEHTDG

KAA0066178.1 hypothetical protein E6C27_scaffold21G001880 [Cucumis melo var. makuwa]9.6e-0751.79Show/hide
Query:  GSTILIEANLDKSSITVPKSLSWEQITRNPTWKLTEAFTPPKRNSNLAQIIEHTDG
        G T+L+E N+ KSS+T+PK+L WE+I +NP WKL     P KR S  A I E  DG
Subjt:  GSTILIEANLDKSSITVPKSLSWEQITRNPTWKLTEAFTPPKRNSNLAQIIEHTDG

TYJ98361.1 hypothetical protein E5676_scaffold232G00950 [Cucumis melo var. makuwa]5.1e-0833.8Show/hide
Query:  RFHSEYGIGKSPTSSSN-IPAPMS------------AINTQ------WIRFPSDSMESLTPPPRPSASLMRPTA-VTPMRPSVSPTTPSSS---------
        R +S    G  P+SSSN  P+PMS             I ++      WIR P   MES T PPRPS +L+ P   V  MR S SP++   S         
Subjt:  RFHSEYGIGKSPTSSSN-IPAPMS------------AINTQ------WIRFPSDSMESLTPPPRPSASLMRPTA-VTPMRPSVSPTTPSSS---------

Query:  ------------------QQDLQSLQPLLK---------PSILPKENAEFLNSKSKLLAALAQTTTEADLQKILNQVTSIASLSSPQNSEVEEDD-----
                          Q+ +    P+++           I  ++NA+FLN K K LAAL Q T +AD Q++L+   + +S S P  S ++ED+     
Subjt:  ------------------QQDLQSLQPLLK---------PSILPKENAEFLNSKSKLLAALAQTTTEADLQKILNQVTSIASLSSPQNSEVEEDD-----

Query:  EYDINDPFLDSQP
        EYD++DPFLDSQP
Subjt:  EYDINDPFLDSQP

XP_023520850.1 uncharacterized protein LOC111784362 [Cucurbita pepo subsp. pepo]1.3e-0856.9Show/hide
Query:  ADFLSIKNKTKRNAFFQRYDENQRSEIRSKWYSFMEEIEQNIPFFTWMKQSENEINIC
        ADF+S  N +KRNAFFQ Y E +R+E+R++WYS ME I++NIPFF W +  EN + IC
Subjt:  ADFLSIKNKTKRNAFFQRYDENQRSEIRSKWYSFMEEIEQNIPFFTWMKQSENEINIC

TrEMBL top hitse value%identityAlignment
A0A5A7U8A0 Uncharacterized protein1.2e-0734.08Show/hide
Query:  IDKNESIIRFHSEYGIGKSPTSSSN-IPAPMSA---------INTQWIRFPSDSMESLTP-----PPRPSASLMRPT-AVTPMRPSVSPTTPSSSQQDLQ
        ++K  +  R +S    G  P+SSSN  P+PMSA               R  S S+   +P     PP+PS +L+RP+  V  MR  VSP++   S     
Subjt:  IDKNESIIRFHSEYGIGKSPTSSSN-IPAPMSA---------INTQWIRFPSDSMESLTP-----PPRPSASLMRPT-AVTPMRPSVSPTTPSSSQQDLQ

Query:  SLQPLLKPSILPKENAEFLNSKSKLLAALAQTTTEADLQKILNQVTSIASLSSPQNSEVEEDD-----EYDINDPFLDS
        +    + P           + +SKLLAALAQ TT+AD Q++L+   + +S S   +S ++ED+     EYD++DP+LDS
Subjt:  SLQPLLKPSILPKENAEFLNSKSKLLAALAQTTTEADLQKILNQVTSIASLSSPQNSEVEEDD-----EYDINDPFLDS

A0A5A7URX9 Enzymatic polyprotein1.2e-0751.79Show/hide
Query:  GSTILIEANLDKSSITVPKSLSWEQITRNPTWKLTEAFTPPKRNSNLAQIIEHTDG
        G T+L+E N++KSS+T+P++L W+++T+NP WKL    TP KR+S  A IIE  DG
Subjt:  GSTILIEANLDKSSITVPKSLSWEQITRNPTWKLTEAFTPPKRNSNLAQIIEHTDG

A0A5A7VEN9 Uncharacterized protein4.6e-0751.79Show/hide
Query:  GSTILIEANLDKSSITVPKSLSWEQITRNPTWKLTEAFTPPKRNSNLAQIIEHTDG
        G T+L+E N+ KSS+T+PK+L WE+I +NP WKL     P KR S  A I E  DG
Subjt:  GSTILIEANLDKSSITVPKSLSWEQITRNPTWKLTEAFTPPKRNSNLAQIIEHTDG

A0A5D3BI61 Uncharacterized protein2.5e-0833.8Show/hide
Query:  RFHSEYGIGKSPTSSSN-IPAPMS------------AINTQ------WIRFPSDSMESLTPPPRPSASLMRPTA-VTPMRPSVSPTTPSSS---------
        R +S    G  P+SSSN  P+PMS             I ++      WIR P   MES T PPRPS +L+ P   V  MR S SP++   S         
Subjt:  RFHSEYGIGKSPTSSSN-IPAPMS------------AINTQ------WIRFPSDSMESLTPPPRPSASLMRPTA-VTPMRPSVSPTTPSSS---------

Query:  ------------------QQDLQSLQPLLK---------PSILPKENAEFLNSKSKLLAALAQTTTEADLQKILNQVTSIASLSSPQNSEVEEDD-----
                          Q+ +    P+++           I  ++NA+FLN K K LAAL Q T +AD Q++L+   + +S S P  S ++ED+     
Subjt:  ------------------QQDLQSLQPLLK---------PSILPKENAEFLNSKSKLLAALAQTTTEADLQKILNQVTSIASLSSPQNSEVEEDD-----

Query:  EYDINDPFLDSQP
        EYD++DPFLDSQP
Subjt:  EYDINDPFLDSQP

A0A5D3C4I7 Movement protein4.6e-0750Show/hide
Query:  GSTILIEANLDKSSITVPKSLSWEQITRNPTWKLTEAFTPPKRNSNLAQIIEHTDG
        G T+L+E N++KSS+T+P++L W+++T+NP WKL    TP KR+S  A I E  DG
Subjt:  GSTILIEANLDKSSITVPKSLSWEQITRNPTWKLTEAFTPPKRNSNLAQIIEHTDG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTGATCAGCGAGGCAAAGCCTCATCTTCAAAGGGCAAAAGCCCAACCTCTTCTTCAAACATTCCAGCTCCCCATGAGCACCGATCATTACGCATGGGATCGGGGTT
TCACTCGGGATCAACAATCCTTATAGAAGCAAATCTAGATAAGTCCTCAATTACTGTCCCAAAAAGTTTATCTTGGGAACAAATCACTAGAAATCCAACGTGGAAGCTTA
CGGAAGCTTTCACTCCACCAAAAAGAAATTCAAATTTAGCCCAAATTATCGAACATACAGATGGGGCTGATTTCCTTTCAATAAAAAACAAAACAAAGAGGAATGCCTTT
TTCCAAAGATATGATGAAAACCAAAGATCTGAAATAAGGTCAAAATGGTATTCATTTATGGAAGAAATAGAGCAAAATATTCCATTCTTTACATGGATGAAACAATCTGA
AAACGAAATCAATATATGTATAGATAAGAACGAATCAATTATTCGATTTCATTCTGAATATGGAATAGGCAAAAGCCCAACCTCTTCTTCAAACATTCCAGCTCCCATGA
GCGCGATCAATACGCAATGGATCCGGTTTCCCTCAGATTCAATGGAGTCATTAACTCCACCACCGAGGCCATCGGCCTCCCTCATGCGGCCTACTGCCGTTACACCAATG
AGACCTTCTGTCTCACCTACTACTCCATCTTCGTCTCAACAGGATCTTCAATCCCTACAACCTTTGTTGAAGCCGTCAATCCTCCCAAAAGAAAATGCAGAGTTCCTCAA
TTCCAAATCAAAGTTGCTAGCTGCCTTGGCACAAACAACAACAGAGGCAGATTTACAAAAAATTTTGAATCAAGTTACTTCAATAGCATCTTTGTCTTCTCCACAAAACT
CTGAAGTGGAAGAAGATGATGAATATGATATCAATGATCCGTTTTTAGATTCCCAACCAATGTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGTGATCAGCGAGGCAAAGCCTCATCTTCAAAGGGCAAAAGCCCAACCTCTTCTTCAAACATTCCAGCTCCCCATGAGCACCGATCATTACGCATGGGATCGGGGTT
TCACTCGGGATCAACAATCCTTATAGAAGCAAATCTAGATAAGTCCTCAATTACTGTCCCAAAAAGTTTATCTTGGGAACAAATCACTAGAAATCCAACGTGGAAGCTTA
CGGAAGCTTTCACTCCACCAAAAAGAAATTCAAATTTAGCCCAAATTATCGAACATACAGATGGGGCTGATTTCCTTTCAATAAAAAACAAAACAAAGAGGAATGCCTTT
TTCCAAAGATATGATGAAAACCAAAGATCTGAAATAAGGTCAAAATGGTATTCATTTATGGAAGAAATAGAGCAAAATATTCCATTCTTTACATGGATGAAACAATCTGA
AAACGAAATCAATATATGTATAGATAAGAACGAATCAATTATTCGATTTCATTCTGAATATGGAATAGGCAAAAGCCCAACCTCTTCTTCAAACATTCCAGCTCCCATGA
GCGCGATCAATACGCAATGGATCCGGTTTCCCTCAGATTCAATGGAGTCATTAACTCCACCACCGAGGCCATCGGCCTCCCTCATGCGGCCTACTGCCGTTACACCAATG
AGACCTTCTGTCTCACCTACTACTCCATCTTCGTCTCAACAGGATCTTCAATCCCTACAACCTTTGTTGAAGCCGTCAATCCTCCCAAAAGAAAATGCAGAGTTCCTCAA
TTCCAAATCAAAGTTGCTAGCTGCCTTGGCACAAACAACAACAGAGGCAGATTTACAAAAAATTTTGAATCAAGTTACTTCAATAGCATCTTTGTCTTCTCCACAAAACT
CTGAAGTGGAAGAAGATGATGAATATGATATCAATGATCCGTTTTTAGATTCCCAACCAATGTAA
Protein sequenceShow/hide protein sequence
MSDQRGKASSSKGKSPTSSSNIPAPHEHRSLRMGSGFHSGSTILIEANLDKSSITVPKSLSWEQITRNPTWKLTEAFTPPKRNSNLAQIIEHTDGADFLSIKNKTKRNAF
FQRYDENQRSEIRSKWYSFMEEIEQNIPFFTWMKQSENEINICIDKNESIIRFHSEYGIGKSPTSSSNIPAPMSAINTQWIRFPSDSMESLTPPPRPSASLMRPTAVTPM
RPSVSPTTPSSSQQDLQSLQPLLKPSILPKENAEFLNSKSKLLAALAQTTTEADLQKILNQVTSIASLSSPQNSEVEEDDEYDINDPFLDSQPM