; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0019187 (gene) of Snake gourd v1 genome

Gene IDTan0019187
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionEnzymatic polyprotein
Genome locationLG01:20687655..20688652
RNA-Seq ExpressionTan0019187
SyntenyTan0019187
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0037876.1 zf-CCHC domain-containing protein/MP domain-containing protein [Cucumis melo var. makuwa]6.3e-4235.78Show/hide
Query:  MSSRPSVCSDTRTTTRGRLVTRSKSMRTSVDFDQPVPDIQYEKFEGSISPTQTYMERKSGYIYNQINVLSTDMNEITVGDELGEKMYDDYINNYVRQKDW
        MSSRPS  S     +    + RS+S+R SVDF   +PD+ YEK +GS+SPTQ+ MER+S  +YNQINV+S D        E   + Y  YI+ +++    
Subjt:  MSSRPSVCSDTRTTTRGRLVTRSKSMRTSVDFDQPVPDIQYEKFEGSISPTQTYMERKSGYIYNQINVLSTDMNEITVGDELGEKMYDDYINNYVRQKDW

Query:  ANRPKFLMKEEFL------------ALVRPGRPQEKLNVLKTPITWVTSCGKEVSSDYPPEAEASFSHALVPTSTIISSPYKTIDEEKTKTVGVREIKNI
          +P FL   +F+            AL +  +   ++ ++K    WVT+ GKEV+S+YPPE EA FSH  +P   ++SSPYKTI+E+K + VGVREIKNI
Subjt:  ANRPKFLMKEEFL------------ALVRPGRPQEKLNVLKTPITWVTSCGKEVSSDYPPEAEASFSHALVPTSTIISSPYKTIDEEKTKTVGVREIKNI

Query:  QQQLNYSNKV-----------------------------------------------DTEDLLSEINKKISKMNVSTDHKEVPETSKKSISFGVSQKPTG
        Q QLN++NK                                                D  D L+EIN++++ ++++   K   E  +  +          
Subjt:  QQQLNYSNKV-----------------------------------------------DTEDLLSEINKKISKMNVSTDHKEVPETSKKSISFGVSQKPTG

Query:  NINMIR--GIDDSSTSKILPIRTYNTDMKNHYFIPSPPDLG
         INMI+   +  +S SKILP+  +  DMKNHY  PSPPDLG
Subjt:  NINMIR--GIDDSSTSKILPIRTYNTDMKNHYFIPSPPDLG

KAA0041674.1 movement protein [Cucumis melo var. makuwa]2.1e-4540.92Show/hide
Query:  MSSRPSVCSDTRTTTRGRLVTRSKSMRTSVDFDQPVPDIQYEKFEGSISPTQTYMERKSGYIYNQINVLSTDMNEITVGDELGEKMYDDYINNYVR----
        MSSRPS  S    T+    + RS+S+R SVDF   +PD+ YEK +GS+ PTQ+ ME++S  +YNQINV+S +        E   + Y  YI+ +++    
Subjt:  MSSRPSVCSDTRTTTRGRLVTRSKSMRTSVDFDQPVPDIQYEKFEGSISPTQTYMERKSGYIYNQINVLSTDMNEITVGDELGEKMYDDYINNYVR----

Query:  -QKDWANRPKF---LMKEEFL---ALVRPGRPQEKLNVLKTPITWVTSCGKEVSSDYPPEAEASFSHALVPTSTIISSPYKTIDEEKTKTVGVREIKNIQ
         +K +   P F   ++K E +   ALV+  +   ++ ++K    WVT+ GKE++S+YPPE EA F H  +P   ++SSPYKTI+E+K + VGVREIKNIQ
Subjt:  -QKDWANRPKF---LMKEEFL---ALVRPGRPQEKLNVLKTPITWVTSCGKEVSSDYPPEAEASFSHALVPTSTIISSPYKTIDEEKTKTVGVREIKNIQ

Query:  QQLNYSN----------KVDTEDLLSEINKKISKMNVSTDHKEVPETSKKSISFGVSQKPTGNINMIR--GIDDSSTSKILPIRTYNTDMKNHYFIPSPP
         QLN+++          K D  D L+EINK+++ ++++ D K   E           Q+P G INMI+   +  +S SKILP+  +  DMKNHY  PSPP
Subjt:  QQLNYSN----------KVDTEDLLSEINKKISKMNVSTDHKEVPETSKKSISFGVSQKPTGNINMIR--GIDDSSTSKILPIRTYNTDMKNHYFIPSPP

Query:  DLG
        DLG
Subjt:  DLG

KAA0050195.1 polyprotein [Cucumis melo var. makuwa]2.8e-4253.37Show/hide
Query:  SSRPSVCSDTRTTTRGRLVTRSKSMRTSVDFDQPVPDIQYEKFEGSISPTQTYMERKSGYIYNQINVLSTDMNEITVGDELGEKMYDDYINNYVRQKDWA
        S R S+ S+ R TTR R +T+S SMR  VDF+Q +PD QYEK EGS+SPTQT MER+SG IYNQIN ++ +      G  L ++ Y+ YI+ Y+ QK+W 
Subjt:  SSRPSVCSDTRTTTRGRLVTRSKSMRTSVDFDQPVPDIQYEKFEGSISPTQTYMERKSGYIYNQINVLSTDMNEITVGDELGEKMYDDYINNYVRQKDWA

Query:  NRPKFLMKEEFLALVRPGRPQEKLNVLKTPITWVTSCGKEVSSDYPPEAEASFSHALVPTSTIISSPYKTIDEEKTKT
        NRPKFL K EF+A V     +  + V+  P+TW T+ GKE+ S+YPPE EASF H  +P++TI+ S YKTIDEE+TKT
Subjt:  NRPKFLMKEEFLALVRPGRPQEKLNVLKTPITWVTSCGKEVSSDYPPEAEASFSHALVPTSTIISSPYKTIDEEKTKT

KAA0050445.1 hypothetical protein E6C27_scaffold175G00510 [Cucumis melo var. makuwa]8.2e-4237.65Show/hide
Query:  MSSRPSVCSDTRTTTRGRLVTRSKSMRTSVDFDQPVPDIQYEKFEGSISPTQTYMERKSGYIYNQINVLSTDMNEITVGDELGEKMYDDYINNYVR----
        MSSRPS  S    T+    + RS+S+R SVDF   +PD+ YEK +GS+SPTQ+ MER+S  +YNQINV+S D        E   + Y  YI+ +++    
Subjt:  MSSRPSVCSDTRTTTRGRLVTRSKSMRTSVDFDQPVPDIQYEKFEGSISPTQTYMERKSGYIYNQINVLSTDMNEITVGDELGEKMYDDYINNYVR----

Query:  -QKDWANRPKF---LMKEEFL---ALVRPGRPQEKLNVLKTPITWVTSCGKEVSSDYPPEAEASFSHALVPTSTIISSPYKTIDEEKTKTVGVREIKNIQ
         +K +   P F   ++K E     ALV+  +   ++ ++K    WVT+ GKE++S YPPE EA F H   P   ++SSPYKTI+E+K + VGVREIKNIQ
Subjt:  -QKDWANRPKF---LMKEEFL---ALVRPGRPQEKLNVLKTPITWVTSCGKEVSSDYPPEAEASFSHALVPTSTIISSPYKTIDEEKTKTVGVREIKNIQ

Query:  QQLNYSNKV-----------------------------------------------DTEDLLSEINKKISKMNVSTDHKEVPETSKKSISFGVSQKPTGN
         QLN++NKV                                               D  D L EINK++  ++++ + K   E           Q+  G 
Subjt:  QQLNYSNKV-----------------------------------------------DTEDLLSEINKKISKMNVSTDHKEVPETSKKSISFGVSQKPTGN

Query:  INMIR--GIDDSSTSKILPIRTYNTDMKNHYFIPSPPDLG
        INMI+   +  +STSKILP+  +  DMKNHY  PSPPDLG
Subjt:  INMIR--GIDDSSTSKILPIRTYNTDMKNHYFIPSPPDLG

TYK01213.1 Enzymatic polyprotein [Cucumis melo var. makuwa]1.8e-4137.9Show/hide
Query:  MSSRPSVCSDTRT-TTRGRLVTRSKSMRTSVDFDQPVPDIQYEKFEGSISPTQTYMERKSGYIYNQINVLSTDMNEITVGDELGEKMYDDYINNYVRQKD
        MSSRPS  S  R+ ++    + RS+S+R S+DF + +P + Y+K + S+SPTQ+ MER++  +YNQINV+S++ +     ++  ++ Y  Y++ ++    
Subjt:  MSSRPSVCSDTRT-TTRGRLVTRSKSMRTSVDFDQPVPDIQYEKFEGSISPTQTYMERKSGYIYNQINVLSTDMNEITVGDELGEKMYDDYINNYVRQKD

Query:  WANRPKFLMKEEFL------------ALVRPGRPQEKLNVLKTPITWVTSCGKEVSSDYPPEAEASFSHALVPTSTIISSPYKTIDEEKTKTVGVREIKN
           +P FL   +F+            ALV+      ++ ++K    WVT+ GKE++S+YPPE EA F+H ++P   ++SSPYKTIDE K + VGVREIKN
Subjt:  WANRPKFLMKEEFL------------ALVRPGRPQEKLNVLKTPITWVTSCGKEVSSDYPPEAEASFSHALVPTSTIISSPYKTIDEEKTKTVGVREIKN

Query:  IQQQLNYSNKV------------------DTEDLLSEINKKISKMNVSTDHKEVPETSKKSISFGVSQ-KPTGNINMIR--GIDDSSTSKILPIRTYNTD
        IQ QLNY+NK                   D  D L+EINK ++ + ++ D K VP    +S     ++ +    INMI+   +  +STSK+LP+  +   
Subjt:  IQQQLNYSNKV------------------DTEDLLSEINKKISKMNVSTDHKEVPETSKKSISFGVSQ-KPTGNINMIR--GIDDSSTSKILPIRTYNTD

Query:  MKNHYFIPSPPDLG
        MKNHY  PSPPDLG
Subjt:  MKNHYFIPSPPDLG

TrEMBL top hitse value%identityAlignment
A0A5A7T6R6 Zf-CCHC domain-containing protein/MP domain-containing protein3.0e-4235.78Show/hide
Query:  MSSRPSVCSDTRTTTRGRLVTRSKSMRTSVDFDQPVPDIQYEKFEGSISPTQTYMERKSGYIYNQINVLSTDMNEITVGDELGEKMYDDYINNYVRQKDW
        MSSRPS  S     +    + RS+S+R SVDF   +PD+ YEK +GS+SPTQ+ MER+S  +YNQINV+S D        E   + Y  YI+ +++    
Subjt:  MSSRPSVCSDTRTTTRGRLVTRSKSMRTSVDFDQPVPDIQYEKFEGSISPTQTYMERKSGYIYNQINVLSTDMNEITVGDELGEKMYDDYINNYVRQKDW

Query:  ANRPKFLMKEEFL------------ALVRPGRPQEKLNVLKTPITWVTSCGKEVSSDYPPEAEASFSHALVPTSTIISSPYKTIDEEKTKTVGVREIKNI
          +P FL   +F+            AL +  +   ++ ++K    WVT+ GKEV+S+YPPE EA FSH  +P   ++SSPYKTI+E+K + VGVREIKNI
Subjt:  ANRPKFLMKEEFL------------ALVRPGRPQEKLNVLKTPITWVTSCGKEVSSDYPPEAEASFSHALVPTSTIISSPYKTIDEEKTKTVGVREIKNI

Query:  QQQLNYSNKV-----------------------------------------------DTEDLLSEINKKISKMNVSTDHKEVPETSKKSISFGVSQKPTG
        Q QLN++NK                                                D  D L+EIN++++ ++++   K   E  +  +          
Subjt:  QQQLNYSNKV-----------------------------------------------DTEDLLSEINKKISKMNVSTDHKEVPETSKKSISFGVSQKPTG

Query:  NINMIR--GIDDSSTSKILPIRTYNTDMKNHYFIPSPPDLG
         INMI+   +  +S SKILP+  +  DMKNHY  PSPPDLG
Subjt:  NINMIR--GIDDSSTSKILPIRTYNTDMKNHYFIPSPPDLG

A0A5A7U9B4 Polyprotein1.4e-4253.37Show/hide
Query:  SSRPSVCSDTRTTTRGRLVTRSKSMRTSVDFDQPVPDIQYEKFEGSISPTQTYMERKSGYIYNQINVLSTDMNEITVGDELGEKMYDDYINNYVRQKDWA
        S R S+ S+ R TTR R +T+S SMR  VDF+Q +PD QYEK EGS+SPTQT MER+SG IYNQIN ++ +      G  L ++ Y+ YI+ Y+ QK+W 
Subjt:  SSRPSVCSDTRTTTRGRLVTRSKSMRTSVDFDQPVPDIQYEKFEGSISPTQTYMERKSGYIYNQINVLSTDMNEITVGDELGEKMYDDYINNYVRQKDWA

Query:  NRPKFLMKEEFLALVRPGRPQEKLNVLKTPITWVTSCGKEVSSDYPPEAEASFSHALVPTSTIISSPYKTIDEEKTKT
        NRPKFL K EF+A V     +  + V+  P+TW T+ GKE+ S+YPPE EASF H  +P++TI+ S YKTIDEE+TKT
Subjt:  NRPKFLMKEEFLALVRPGRPQEKLNVLKTPITWVTSCGKEVSSDYPPEAEASFSHALVPTSTIISSPYKTIDEEKTKT

A0A5A7UA27 Uncharacterized protein4.0e-4237.65Show/hide
Query:  MSSRPSVCSDTRTTTRGRLVTRSKSMRTSVDFDQPVPDIQYEKFEGSISPTQTYMERKSGYIYNQINVLSTDMNEITVGDELGEKMYDDYINNYVR----
        MSSRPS  S    T+    + RS+S+R SVDF   +PD+ YEK +GS+SPTQ+ MER+S  +YNQINV+S D        E   + Y  YI+ +++    
Subjt:  MSSRPSVCSDTRTTTRGRLVTRSKSMRTSVDFDQPVPDIQYEKFEGSISPTQTYMERKSGYIYNQINVLSTDMNEITVGDELGEKMYDDYINNYVR----

Query:  -QKDWANRPKF---LMKEEFL---ALVRPGRPQEKLNVLKTPITWVTSCGKEVSSDYPPEAEASFSHALVPTSTIISSPYKTIDEEKTKTVGVREIKNIQ
         +K +   P F   ++K E     ALV+  +   ++ ++K    WVT+ GKE++S YPPE EA F H   P   ++SSPYKTI+E+K + VGVREIKNIQ
Subjt:  -QKDWANRPKF---LMKEEFL---ALVRPGRPQEKLNVLKTPITWVTSCGKEVSSDYPPEAEASFSHALVPTSTIISSPYKTIDEEKTKTVGVREIKNIQ

Query:  QQLNYSNKV-----------------------------------------------DTEDLLSEINKKISKMNVSTDHKEVPETSKKSISFGVSQKPTGN
         QLN++NKV                                               D  D L EINK++  ++++ + K   E           Q+  G 
Subjt:  QQLNYSNKV-----------------------------------------------DTEDLLSEINKKISKMNVSTDHKEVPETSKKSISFGVSQKPTGN

Query:  INMIR--GIDDSSTSKILPIRTYNTDMKNHYFIPSPPDLG
        INMI+   +  +STSKILP+  +  DMKNHY  PSPPDLG
Subjt:  INMIR--GIDDSSTSKILPIRTYNTDMKNHYFIPSPPDLG

A0A5A7VKK8 Enzymatic polyprotein8.9e-4237.9Show/hide
Query:  MSSRPSVCSDTRT-TTRGRLVTRSKSMRTSVDFDQPVPDIQYEKFEGSISPTQTYMERKSGYIYNQINVLSTDMNEITVGDELGEKMYDDYINNYVRQKD
        MSSRPS  S  R+ ++    + RS+S+R S+DF + +P + Y+K + S+SPTQ+ MER++  +YNQINV+S++ +     ++  ++ Y  Y++ ++    
Subjt:  MSSRPSVCSDTRT-TTRGRLVTRSKSMRTSVDFDQPVPDIQYEKFEGSISPTQTYMERKSGYIYNQINVLSTDMNEITVGDELGEKMYDDYINNYVRQKD

Query:  WANRPKFLMKEEFL------------ALVRPGRPQEKLNVLKTPITWVTSCGKEVSSDYPPEAEASFSHALVPTSTIISSPYKTIDEEKTKTVGVREIKN
           +P FL   +F+            ALV+      ++ ++K    WVT+ GKE++S+YPPE EA F+H ++P   ++SSPYKTIDE K + VGVREIKN
Subjt:  WANRPKFLMKEEFL------------ALVRPGRPQEKLNVLKTPITWVTSCGKEVSSDYPPEAEASFSHALVPTSTIISSPYKTIDEEKTKTVGVREIKN

Query:  IQQQLNYSNKV------------------DTEDLLSEINKKISKMNVSTDHKEVPETSKKSISFGVSQ-KPTGNINMIR--GIDDSSTSKILPIRTYNTD
        IQ QLNY+NK                   D  D L+EINK ++ + ++ D K VP    +S     ++ +    INMI+   +  +STSK+LP+  +   
Subjt:  IQQQLNYSNKV------------------DTEDLLSEINKKISKMNVSTDHKEVPETSKKSISFGVSQ-KPTGNINMIR--GIDDSSTSKILPIRTYNTD

Query:  MKNHYFIPSPPDLG
        MKNHY  PSPPDLG
Subjt:  MKNHYFIPSPPDLG

A0A5D3C4I7 Movement protein1.0e-4540.92Show/hide
Query:  MSSRPSVCSDTRTTTRGRLVTRSKSMRTSVDFDQPVPDIQYEKFEGSISPTQTYMERKSGYIYNQINVLSTDMNEITVGDELGEKMYDDYINNYVR----
        MSSRPS  S    T+    + RS+S+R SVDF   +PD+ YEK +GS+ PTQ+ ME++S  +YNQINV+S +        E   + Y  YI+ +++    
Subjt:  MSSRPSVCSDTRTTTRGRLVTRSKSMRTSVDFDQPVPDIQYEKFEGSISPTQTYMERKSGYIYNQINVLSTDMNEITVGDELGEKMYDDYINNYVR----

Query:  -QKDWANRPKF---LMKEEFL---ALVRPGRPQEKLNVLKTPITWVTSCGKEVSSDYPPEAEASFSHALVPTSTIISSPYKTIDEEKTKTVGVREIKNIQ
         +K +   P F   ++K E +   ALV+  +   ++ ++K    WVT+ GKE++S+YPPE EA F H  +P   ++SSPYKTI+E+K + VGVREIKNIQ
Subjt:  -QKDWANRPKF---LMKEEFL---ALVRPGRPQEKLNVLKTPITWVTSCGKEVSSDYPPEAEASFSHALVPTSTIISSPYKTIDEEKTKTVGVREIKNIQ

Query:  QQLNYSN----------KVDTEDLLSEINKKISKMNVSTDHKEVPETSKKSISFGVSQKPTGNINMIR--GIDDSSTSKILPIRTYNTDMKNHYFIPSPP
         QLN+++          K D  D L+EINK+++ ++++ D K   E           Q+P G INMI+   +  +S SKILP+  +  DMKNHY  PSPP
Subjt:  QQLNYSN----------KVDTEDLLSEINKKISKMNVSTDHKEVPETSKKSISFGVSQKPTGNINMIR--GIDDSSTSKILPIRTYNTDMKNHYFIPSPP

Query:  DLG
        DLG
Subjt:  DLG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTAGTCGCCCTAGTGTATGTTCAGACACCAGGACAACTACTAGAGGGAGACTAGTCACTAGGTCCAAATCCATGCGAACGTCTGTAGACTTTGATCAGCCAGTCCC
AGATATTCAATATGAAAAATTTGAAGGATCCATTTCTCCGACTCAAACTTATATGGAGCGGAAATCTGGATATATATATAATCAAATCAATGTTCTCTCTACAGATATGA
ATGAAATAACTGTAGGCGATGAACTAGGCGAAAAGATGTATGATGACTACATCAACAATTATGTCCGCCAAAAAGATTGGGCCAATCGACCTAAATTCTTGATGAAAGAA
GAATTCCTAGCCTTAGTAAGGCCAGGAAGACCTCAAGAAAAACTGAATGTCTTAAAGACGCCCATTACATGGGTTACTTCCTGTGGAAAAGAAGTATCTTCCGACTATCC
TCCTGAAGCAGAAGCTAGCTTCTCTCATGCATTGGTACCTACCTCTACAATAATCTCTTCACCATACAAAACTATCGATGAAGAGAAGACCAAAACTGTTGGAGTCAGAG
AGATAAAAAATATCCAACAACAACTCAACTATTCAAATAAGGTAGACACAGAAGATCTTCTCTCTGAAATAAATAAGAAGATCTCAAAGATGAATGTTTCTACAGATCAC
AAAGAAGTGCCAGAAACATCAAAGAAAAGTATCAGTTTTGGAGTTTCCCAAAAACCAACTGGTAACATCAACATGATTAGAGGAATTGATGATTCTTCTACTTCAAAAAT
CCTTCCTATAAGAACATACAACACTGATATGAAGAATCATTACTTTATACCATCTCCTCCTGATCTAGGGATGGGATGA
mRNA sequenceShow/hide mRNA sequence
ATGAGTAGTCGCCCTAGTGTATGTTCAGACACCAGGACAACTACTAGAGGGAGACTAGTCACTAGGTCCAAATCCATGCGAACGTCTGTAGACTTTGATCAGCCAGTCCC
AGATATTCAATATGAAAAATTTGAAGGATCCATTTCTCCGACTCAAACTTATATGGAGCGGAAATCTGGATATATATATAATCAAATCAATGTTCTCTCTACAGATATGA
ATGAAATAACTGTAGGCGATGAACTAGGCGAAAAGATGTATGATGACTACATCAACAATTATGTCCGCCAAAAAGATTGGGCCAATCGACCTAAATTCTTGATGAAAGAA
GAATTCCTAGCCTTAGTAAGGCCAGGAAGACCTCAAGAAAAACTGAATGTCTTAAAGACGCCCATTACATGGGTTACTTCCTGTGGAAAAGAAGTATCTTCCGACTATCC
TCCTGAAGCAGAAGCTAGCTTCTCTCATGCATTGGTACCTACCTCTACAATAATCTCTTCACCATACAAAACTATCGATGAAGAGAAGACCAAAACTGTTGGAGTCAGAG
AGATAAAAAATATCCAACAACAACTCAACTATTCAAATAAGGTAGACACAGAAGATCTTCTCTCTGAAATAAATAAGAAGATCTCAAAGATGAATGTTTCTACAGATCAC
AAAGAAGTGCCAGAAACATCAAAGAAAAGTATCAGTTTTGGAGTTTCCCAAAAACCAACTGGTAACATCAACATGATTAGAGGAATTGATGATTCTTCTACTTCAAAAAT
CCTTCCTATAAGAACATACAACACTGATATGAAGAATCATTACTTTATACCATCTCCTCCTGATCTAGGGATGGGATGA
Protein sequenceShow/hide protein sequence
MSSRPSVCSDTRTTTRGRLVTRSKSMRTSVDFDQPVPDIQYEKFEGSISPTQTYMERKSGYIYNQINVLSTDMNEITVGDELGEKMYDDYINNYVRQKDWANRPKFLMKE
EFLALVRPGRPQEKLNVLKTPITWVTSCGKEVSSDYPPEAEASFSHALVPTSTIISSPYKTIDEEKTKTVGVREIKNIQQQLNYSNKVDTEDLLSEINKKISKMNVSTDH
KEVPETSKKSISFGVSQKPTGNINMIRGIDDSSTSKILPIRTYNTDMKNHYFIPSPPDLGMG