CuGenDBv2

Tan0020298 (gene) of Snake gourd v1 genome

Gene ID: Tan0020298
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Enzymatic polyprotein
Genome location: LG05:47588723..47590313
RNA-Seq Expression: Tan0020298
Synteny: Tan0020298
Gene Ontology terms: GO:0003824 - catalytic activity (molecular function)
InterPro domains: NA


Homology
GenBank top hits
CAN69162.1 hypothetical protein VITISV_016584 [Vitis vinifera]; e-value 1.8e-21; identity 26.89%
Query:  MNTNISPRALRSSPKGSIILIEEN-LDKSSITVPKSLSWEQITRNPTWKLMEAFTP-PKRNSNLAQIIEHTDGAVEIQFS--------------EETELT
        MNT++   AL +SPKG       + LDKS   +PK + W+ +     W    A     +R+ ++ QI+++ DG  E+ FS              E +  +
Subjt:  MNTNISPRALRSSPKGSIILIEEN-LDKSSITVPKSLSWEQITRNPTWKLMEAFTP-PKRNSNLAQIIEHTDGAVEIQFS--------------EETELT

Query:  EAFTPPK--------------RNSNLAQIIEHTDAA------------------SEADSKFYFDRSNSLRVKSVNIEQNVGNVHYE--------------
         +  P +              +N  L  +  +T+ A                  S   S  Y    N++ +   + E N   +  +              
Subjt:  EAFTPPK--------------RNSNLAQIIEHTDAA------------------SEADSKFYFDRSNSLRVKSVNIEQNVGNVHYE--------------

Query:  ------AQPQSPTQTNMDN-RSINWYPQPSFPDIQFEEKTQMTQAVYDGLAIHEWNVDGISDYLIINVINEMLIASNAYRQRGK-RDHEIAQLLVAGFTG
               +P +    N++  ++ N+YP+P+FPD+QFEE+ Q TQA Y    I+EWN+DG+++Y I+  + EM + S AY+   +  DH +AQ +VAGFTG
Subjt:  ------AQPQSPTQTNMDN-RSINWYPQPSFPDIQFEEKTQMTQAVYDGLAIHEWNVDGISDYLIINVINEMLIASNAYRQRGK-RDHEIAQLLVAGFTG

Query:  QLNQW
        QL  W
Subjt:  QLNQW

CAN75928.1 hypothetical protein VITISV_021028 [Vitis vinifera]; e-value 7.6e-25; identity 31.85%
Query:  MNTNISPRALRSSPKGSIILIEEN-LDKSSITVPKSLSWEQITRNPTWKLMEAFTP-PKRNSNLAQIIEHTDGAVEIQFSE-----ETELTEAFTPPKRN
        MNT++   AL +SPKG       + LDKS   +PK + W+ +     W    A     +R+ ++ QI+++ DG  E+ FS+      +     + P + +
Subjt:  MNTNISPRALRSSPKGSIILIEEN-LDKSSITVPKSLSWEQITRNPTWKLMEAFTP-PKRNSNLAQIIEHTDGAVEIQFSE-----ETELTEAFTPPKRN

Query:  SNLAQIIEHTDAASEADSKFYFDRSNSLRVK---SVNIEQNVGN-VHYEAQPQSPTQTNMDN-RSINWYPQPSFPDIQFEEKTQMTQAVYDGLAIHEWNV
        S  + I   T    E  S       N   VK    ++ E+     +    +P +    N++  ++ N+YP+P+FPD+QFEE+ Q TQA Y    I+EWN+
Subjt:  SNLAQIIEHTDAASEADSKFYFDRSNSLRVK---SVNIEQNVGN-VHYEAQPQSPTQTNMDN-RSINWYPQPSFPDIQFEEKTQMTQAVYDGLAIHEWNV

Query:  DGISDYLIINVINEMLIASNAYRQRGK-RDHEIAQLLVAGFTGQLNQW
        D +++Y I+  + EM + S AY+   +  DH +AQ +VAGFTGQL  W
Subjt:  DGISDYLIINVINEMLIASNAYRQRGK-RDHEIAQLLVAGFTGQLNQW

RVW30385.1 RING finger and transmembrane domain-containing protein 2 [Vitis vinifera]; e-value 6.4e-24; identity 28.67%
Query:  MNTNISPRALRSSPKGSIILI-EENLDKSSITVPKSLSWEQITRNPTWKLMEAF-TPPKRNSNLAQIIEHTDGAVEIQFSEE------------------
        M T++   AL +SPKG   L   ++++KS+I +P  + W+++    +W    A  T  +R+  + QI+++ DG  ++ FS                    
Subjt:  MNTNISPRALRSSPKGSIILI-EENLDKSSITVPKSLSWEQITRNPTWKLMEAF-TPPKRNSNLAQIIEHTDGAVEIQFSEE------------------

Query:  --------TELTEAFTPPKRNSNLAQIIEHTDAA-----SEADSKFYFDRSNSLRVKSVNIEQNVGNVHYE----------AQPQSPTQTNMDNRSINWY
                T   E  +   +N  L  +  HT+ A      E DS     +  S ++ S    Q +  + Y            +    ++ N + +  N+Y
Subjt:  --------TELTEAFTPPKRNSNLAQIIEHTDAA-----SEADSKFYFDRSNSLRVKSVNIEQNVGNVHYE----------AQPQSPTQTNMDNRSINWY

Query:  PQPSFPDIQFEEKTQMTQAVYDGLAIHEWNVDGISDYLIINVINEMLIASNAYRQRGK-RDHEIAQLLVAGFTGQLNQW
        P+P+FPD+QFEE+ Q TQA Y    I+EWN+DG+ +Y I+  + EM + S AY+   +  DH +AQ +VAGF GQL  W
Subjt:  PQPSFPDIQFEEKTQMTQAVYDGLAIHEWNVDGISDYLIINVINEMLIASNAYRQRGK-RDHEIAQLLVAGFTGQLNQW

RVX10070.1 hypothetical protein CK203_013013 [Vitis vinifera]; e-value 4.2e-23; identity 29.88%
Query:  ILIEENLDKSSITVPKSLSWEQITRNPTWKLMEAFTP-PKRNSNLAQIIEHTDGAVEIQFSEETELTEAFTPPKRNSNLAQIIEHTDAASEADS---KFY
        +L + N   SS+  P  + W  +T + TW   +   P P ++     I +   G V IQF           PPK     ++I     A     S   ++ 
Subjt:  ILIEENLDKSSITVPKSLSWEQITRNPTWKLMEAFTP-PKRNSNLAQIIEHTDGAVEIQFSEETELTEAFTPPKRNSNLAQIIEHTDAASEADS---KFY

Query:  FD-RSNSLRVKSVNIEQNVGNVHYEAQ----PQSPTQTNM-------------------------DNRSINWYPQPSFPDIQFEEKTQMTQAVYDGLAIH
         D R   +++ +V+   N+    Y  Q    P SPT + M                          +R+ N++P+P+ PD+Q+EE++Q+ Q+ YDG  I+
Subjt:  FD-RSNSLRVKSVNIEQNVGNVHYEAQ----PQSPTQTNM-------------------------DNRSINWYPQPSFPDIQFEEKTQMTQAVYDGLAIH

Query:  EWNVDGISDYLIINVINEMLIASNAYRQRGKRDHEIAQLLVAGFTGQLNQW
        EWN+DG+SD+ ++N++ EM++AS AY+ R   D  I   L+AGFTGQL  W
Subjt:  EWNVDGISDYLIINVINEMLIASNAYRQRGKRDHEIAQLLVAGFTGQLNQW

XP_023520850.1 uncharacterized protein LOC111784362 [Cucurbita pepo subsp. pepo]; e-value 2.5e-28; identity 68.48%
Query:  TQTNMDNRSINWYPQPSFPDIQFEEKTQMTQAVYDGLAIHEWNVDGISDYLIINVINEMLIASNAYRQRG-KRDHEIAQLLVAGFTGQLNQW
        +Q N  N+  NWYPQPSFPDIQFEEKT  TQA YDGLAI+EWN+DG+SDYLI+NV+NEM++A+ AY+ +G K DH+IAQ+LV GFTGQL  W
Subjt:  TQTNMDNRSINWYPQPSFPDIQFEEKTQMTQAVYDGLAIHEWNVDGISDYLIINVINEMLIASNAYRQRG-KRDHEIAQLLVAGFTGQLNQW

TrEMBL top hits
A0A438D4J2 RING finger and transmembrane domain-containing protein 2; e-value 3.1e-24; identity 28.67%
Query:  MNTNISPRALRSSPKGSIILI-EENLDKSSITVPKSLSWEQITRNPTWKLMEAF-TPPKRNSNLAQIIEHTDGAVEIQFSEE------------------
        M T++   AL +SPKG   L   ++++KS+I +P  + W+++    +W    A  T  +R+  + QI+++ DG  ++ FS                    
Subjt:  MNTNISPRALRSSPKGSIILI-EENLDKSSITVPKSLSWEQITRNPTWKLMEAF-TPPKRNSNLAQIIEHTDGAVEIQFSEE------------------

Query:  --------TELTEAFTPPKRNSNLAQIIEHTDAA-----SEADSKFYFDRSNSLRVKSVNIEQNVGNVHYE----------AQPQSPTQTNMDNRSINWY
                T   E  +   +N  L  +  HT+ A      E DS     +  S ++ S    Q +  + Y            +    ++ N + +  N+Y
Subjt:  --------TELTEAFTPPKRNSNLAQIIEHTDAA-----SEADSKFYFDRSNSLRVKSVNIEQNVGNVHYE----------AQPQSPTQTNMDNRSINWY

Query:  PQPSFPDIQFEEKTQMTQAVYDGLAIHEWNVDGISDYLIINVINEMLIASNAYRQRGK-RDHEIAQLLVAGFTGQLNQW
        P+P+FPD+QFEE+ Q TQA Y    I+EWN+DG+ +Y I+  + EM + S AY+   +  DH +AQ +VAGF GQL  W
Subjt:  PQPSFPDIQFEEKTQMTQAVYDGLAIHEWNVDGISDYLIINVINEMLIASNAYRQRGK-RDHEIAQLLVAGFTGQLNQW

A0A438JM71 Uncharacterized protein; e-value 2.0e-23; identity 29.88%
Query:  ILIEENLDKSSITVPKSLSWEQITRNPTWKLMEAFTP-PKRNSNLAQIIEHTDGAVEIQFSEETELTEAFTPPKRNSNLAQIIEHTDAASEADS---KFY
        +L + N   SS+  P  + W  +T + TW   +   P P ++     I +   G V IQF           PPK     ++I     A     S   ++ 
Subjt:  ILIEENLDKSSITVPKSLSWEQITRNPTWKLMEAFTP-PKRNSNLAQIIEHTDGAVEIQFSEETELTEAFTPPKRNSNLAQIIEHTDAASEADS---KFY

Query:  FD-RSNSLRVKSVNIEQNVGNVHYEAQ----PQSPTQTNM-------------------------DNRSINWYPQPSFPDIQFEEKTQMTQAVYDGLAIH
         D R   +++ +V+   N+    Y  Q    P SPT + M                          +R+ N++P+P+ PD+Q+EE++Q+ Q+ YDG  I+
Subjt:  FD-RSNSLRVKSVNIEQNVGNVHYEAQ----PQSPTQTNM-------------------------DNRSINWYPQPSFPDIQFEEKTQMTQAVYDGLAIH

Query:  EWNVDGISDYLIINVINEMLIASNAYRQRGKRDHEIAQLLVAGFTGQLNQW
        EWN+DG+SD+ ++N++ EM++AS AY+ R   D  I   L+AGFTGQL  W
Subjt:  EWNVDGISDYLIINVINEMLIASNAYRQRGKRDHEIAQLLVAGFTGQLNQW

A0A5A7URX9 Enzymatic polyprotein; e-value 9.4e-21; identity 40%
Query:  MNTNISPRALRSSPKGSIILIEENLDKSSITVPKSLSWEQITRNPTWKLMEAFTPPKRNSNLAQIIEHTDGAVEIQFSEETELTEAFTPPKRNSNLAQII
        M+TN+SP+AL  SPKG  +L+E N++KSS+T+P++L W+++T+NP WKL    TP KR+S  A IIE  DG VE+QF+         + P+ +  ++   
Subjt:  MNTNISPRALRSSPKGSIILIEENLDKSSITVPKSLSWEQITRNPTWKLMEAFTPPKRNSNLAQIIEHTDGAVEIQFSEETELTEAFTPPKRNSNLAQII

Query:  EHTDAASEADSKFYFDRSNSLRVKSVNIEQNVGNVHYEAQPQ--SPTQTNMDNRS
          +   +E   +    RS S+R  SV+    + +VHYE + +  SPTQ+NM+ RS
Subjt:  EHTDAASEADSKFYFDRSNSLRVKSVNIEQNVGNVHYEAQPQ--SPTQTNMDNRS

A5AL87 Uncharacterized protein; e-value 8.5e-22; identity 26.89%
Query:  MNTNISPRALRSSPKGSIILIEEN-LDKSSITVPKSLSWEQITRNPTWKLMEAFTP-PKRNSNLAQIIEHTDGAVEIQFS--------------EETELT
        MNT++   AL +SPKG       + LDKS   +PK + W+ +     W    A     +R+ ++ QI+++ DG  E+ FS              E +  +
Subjt:  MNTNISPRALRSSPKGSIILIEEN-LDKSSITVPKSLSWEQITRNPTWKLMEAFTP-PKRNSNLAQIIEHTDGAVEIQFS--------------EETELT

Query:  EAFTPPK--------------RNSNLAQIIEHTDAA------------------SEADSKFYFDRSNSLRVKSVNIEQNVGNVHYE--------------
         +  P +              +N  L  +  +T+ A                  S   S  Y    N++ +   + E N   +  +              
Subjt:  EAFTPPK--------------RNSNLAQIIEHTDAA------------------SEADSKFYFDRSNSLRVKSVNIEQNVGNVHYE--------------

Query:  ------AQPQSPTQTNMDN-RSINWYPQPSFPDIQFEEKTQMTQAVYDGLAIHEWNVDGISDYLIINVINEMLIASNAYRQRGK-RDHEIAQLLVAGFTG
               +P +    N++  ++ N+YP+P+FPD+QFEE+ Q TQA Y    I+EWN+DG+++Y I+  + EM + S AY+   +  DH +AQ +VAGFTG
Subjt:  ------AQPQSPTQTNMDN-RSINWYPQPSFPDIQFEEKTQMTQAVYDGLAIHEWNVDGISDYLIINVINEMLIASNAYRQRGK-RDHEIAQLLVAGFTG

Query:  QLNQW
        QL  W
Subjt:  QLNQW

A5AQL0 CCHC-type domain-containing protein; e-value 3.7e-25; identity 31.85%
Query:  MNTNISPRALRSSPKGSIILIEEN-LDKSSITVPKSLSWEQITRNPTWKLMEAFTP-PKRNSNLAQIIEHTDGAVEIQFSE-----ETELTEAFTPPKRN
        MNT++   AL +SPKG       + LDKS   +PK + W+ +     W    A     +R+ ++ QI+++ DG  E+ FS+      +     + P + +
Subjt:  MNTNISPRALRSSPKGSIILIEEN-LDKSSITVPKSLSWEQITRNPTWKLMEAFTP-PKRNSNLAQIIEHTDGAVEIQFSE-----ETELTEAFTPPKRN

Query:  SNLAQIIEHTDAASEADSKFYFDRSNSLRVK---SVNIEQNVGN-VHYEAQPQSPTQTNMDN-RSINWYPQPSFPDIQFEEKTQMTQAVYDGLAIHEWNV
        S  + I   T    E  S       N   VK    ++ E+     +    +P +    N++  ++ N+YP+P+FPD+QFEE+ Q TQA Y    I+EWN+
Subjt:  SNLAQIIEHTDAASEADSKFYFDRSNSLRVK---SVNIEQNVGN-VHYEAQPQSPTQTNMDN-RSINWYPQPSFPDIQFEEKTQMTQAVYDGLAIHEWNV

Query:  DGISDYLIINVINEMLIASNAYRQRGK-RDHEIAQLLVAGFTGQLNQW
        D +++Y I+  + EM + S AY+   +  DH +AQ +VAGFTGQL  W
Subjt:  DGISDYLIINVINEMLIASNAYRQRGK-RDHEIAQLLVAGFTGQLNQW

SwissProt top hits
No hits found
Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGAATACAAACATTTCTCCGAGAGCTTTAAGGTCTTCCCCAAAGGGATCAATAATCCTTATAGAAGAAAATCTAGATAAGTCCTCAATTACTGTCCCAAAAAGTTTATC
TTGGGAACAAATCACTAGAAATCCAACGTGGAAGCTTATGGAAGCTTTCACTCCACCAAAAAGAAATTCAAATTTAGCCCAAATTATCGAACATACAGATGGAGCCGTTG
AAATACAATTCTCAGAAGAAACAGAGCTTACGGAAGCTTTCACTCCACCAAAAAGAAATTCAAATTTAGCCCAAATTATCGAACATACAGATGCAGCCTCAGAAGCCGAC
TCAAAATTTTATTTTGATCGATCTAACTCGTTGCGAGTAAAATCAGTCAATATAGAACAAAATGTAGGAAATGTTCATTATGAAGCACAACCACAATCTCCAACCCAAAC
AAACATGGATAATCGATCTATAAATTGGTATCCTCAACCATCATTCCCAGATATTCAATTTGAAGAAAAAACACAAATGACTCAAGCCGTTTATGATGGATTAGCCATCC
ACGAATGGAATGTGGACGGAATATCTGATTATCTAATCATAAATGTAATAAATGAGATGTTGATCGCCTCCAATGCCTATAGACAAAGAGGCAAAAGAGATCATGAGATA
GCACAACTTCTCGTTGCCGGATTCACAGGACAATTAAATCAATGGTGA
mRNA sequence
ATGAATACAAACATTTCTCCGAGAGCTTTAAGGTCTTCCCCAAAGGGATCAATAATCCTTATAGAAGAAAATCTAGATAAGTCCTCAATTACTGTCCCAAAAAGTTTATC
TTGGGAACAAATCACTAGAAATCCAACGTGGAAGCTTATGGAAGCTTTCACTCCACCAAAAAGAAATTCAAATTTAGCCCAAATTATCGAACATACAGATGGAGCCGTTG
AAATACAATTCTCAGAAGAAACAGAGCTTACGGAAGCTTTCACTCCACCAAAAAGAAATTCAAATTTAGCCCAAATTATCGAACATACAGATGCAGCCTCAGAAGCCGAC
TCAAAATTTTATTTTGATCGATCTAACTCGTTGCGAGTAAAATCAGTCAATATAGAACAAAATGTAGGAAATGTTCATTATGAAGCACAACCACAATCTCCAACCCAAAC
AAACATGGATAATCGATCTATAAATTGGTATCCTCAACCATCATTCCCAGATATTCAATTTGAAGAAAAAACACAAATGACTCAAGCCGTTTATGATGGATTAGCCATCC
ACGAATGGAATGTGGACGGAATATCTGATTATCTAATCATAAATGTAATAAATGAGATGTTGATCGCCTCCAATGCCTATAGACAAAGAGGCAAAAGAGATCATGAGATA
GCACAACTTCTCGTTGCCGGATTCACAGGACAATTAAATCAATGGTGA
Protein sequence
MNTNISPRALRSSPKGSIILIEENLDKSSITVPKSLSWEQITRNPTWKLMEAFTPPKRNSNLAQIIEHTDGAVEIQFSEETELTEAFTPPKRNSNLAQIIEHTDAASEAD
SKFYFDRSNSLRVKSVNIEQNVGNVHYEAQPQSPTQTNMDNRSINWYPQPSFPDIQFEEKTQMTQAVYDGLAIHEWNVDGISDYLIINVINEMLIASNAYRQRGKRDHEI
AQLLVAGFTGQLNQW
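
The protein sequence above can be checked against the CDS by translating it codon by codon. The following is a minimal sketch, assuming the standard genetic code applies to this gene; the function name `translate_cds` and the variable names are hypothetical and are not part of CuGenDB. Only the first 27 and last 9 bases of the CDS are used here, to keep the example short.

```python
from itertools import product

# Standard genetic code (assumed here), built from the conventional
# T, C, A, G codon ordering with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Hypothetical helper for illustration only.
    """
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 27 bases and last 9 bases of the CDS listed for Tan0020298.
cds_start = "ATGAATACAAACATTTCTCCGAGAGCT"
cds_end = "CAATGGTGA"

print(translate_cds(cds_start))  # MNTNISPRA
print(translate_cds(cds_end))    # QW
```

Passing the full CDS (with its line breaks) to the same function should reproduce the 235-residue protein sequence listed above, since 708 bases correspond to 235 codons plus one stop codon.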