; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0009258 (gene) of Snake gourd v1 genome

Gene IDTan0009258
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionTy3-gypsy retrotransposon protein
Genome locationLG03:58975130..58977543
RNA-Seq ExpressionTan0009258
SyntenyTan0009258
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0016787 - hydrolase activity (molecular function)
InterPro domainsIPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GEV76700.1 retrotransposon protein, putative, Ty3-gypsy subclass [Tanacetum cinerariifolium]1.6e-2432.46Show/hide
Query:  QIGNSGGDRPQC---KECGKRPLGGYTSGRAYASTGRDIYGSDSVVLCTLPLLGHLAFVLFDSQFYAVFYSATFAKVI---------------ASGS-LG
        + GN+G  +  C   K  G     G   GR YA  GRD     +V+  T  L  H A +LFD+     F S TF+ +I               A G  +G
Subjt:  QIGNSGGDRPQC---KECGKRPLGGYTSGRAYASTGRDIYGSDSVVLCTLPLLGHLAFVLFDSQFYAVFYSATFAKVI---------------ASGS-LG

Query:  VLACILDARCTEISVTSHYKKRIH------ALCIDYEELNKVTIKNKYP-SRIDDLFDRFTGEHVFSKIDPRS-LPSIEDREEDIPKTKIEITLWALRVH
             L    T+ +      KR+         CIDY ELNK+T+KN+YP  RIDDLFD+  G  V+ KID RS    +  REEDIPKT            
Subjt:  VLACILDARCTEISVTSHYKKRIH------ALCIDYEELNKVTIKNKYP-SRIDDLFDRFTGEHVFSKIDPRS-LPSIEDREEDIPKTKIEITLWALRVH

Query:  GDVVWFDQRPCAFMDLMNRFIQGVFDTFVIVFIDDILIYSG------------------------------W--------HVVSESWVSL-IRKVEEVTR
                 P  FMDLMNR  +   D FVIVFIDDI+IYS                               W        HV+    + +   K+E +  
Subjt:  GDVVWFDQRPCAFMDLMNRFIQGVFDTFVIVFIDDILIYSG------------------------------W--------HVVSESWVSL-IRKVEEVTR

Query:  WPRPTTDSE-----------KVVGRMRCKDGKVIAYASRQLK
        W  P T +E           K +G +  ++ KVIAYAS+QLK
Subjt:  WPRPTTDSE-----------KVVGRMRCKDGKVIAYASRQLK

GEW72254.1 putative reverse transcriptase domain-containing protein [Tanacetum cinerariifolium]4.2e-2558.72Show/hide
Query:  LCIDYEELNKVTIKNKYP-SRIDDLFDRFTGEHVFSKIDPR-SLPSIEDREEDIPKTKIEITLWALRVHGDVVWFDQRPCAFMDLMNRFIQGVFDTFVIV
        +CIDY ELNK+T+KN+YP  RIDDLFD+  G  V+SKID R     +  REEDIPK   + +LW LR+ G  +WFD+R   FMDLMNR  +   D FVIV
Subjt:  LCIDYEELNKVTIKNKYP-SRIDDLFDRFTGEHVFSKIDPR-SLPSIEDREEDIPKTKIEITLWALRVHGDVVWFDQRPCAFMDLMNRFIQGVFDTFVIV

Query:  FIDDILIYS
        FIDDILIYS
Subjt:  FIDDILIYS

KAA0051239.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]3.6e-2440.74Show/hide
Query:  CIDYEELNKVTIKNKYP-SRIDDLFDRFTGEHVFSKIDPRS-LPSIEDREEDIPKTKIEITLWALRVHGDVVWFDQRPCAFMDLMNRFIQGVFDTFVIVF
        CIDY ELNKVT+KNKYP SRIDDLFD+  G  VFSKID RS    +  ++ D+PKT                     P  FMDLMNR  +   DTFVIVF
Subjt:  CIDYEELNKVTIKNKYP-SRIDDLFDRFTGEHVFSKIDPRS-LPSIEDREEDIPKTKIEITLWALRVHGDVVWFDQRPCAFMDLMNRFIQGVFDTFVIVF

Query:  IDDILIYSG----------------WHVVSESWVSL-IRKVEEVTRWPRPTT--------------------------------------DSEKVVGRMR
        IDDILIYS                  HVVS++ VS+   K+E VT WPRP+T                                       S+K +G + 
Subjt:  IDDILIYSG----------------WHVVSESWVSL-IRKVEEVTRWPRPTT--------------------------------------DSEKVVGRMR

Query:  CKDGKVIAYASRQLKT
         + GKV+AYASRQLK+
Subjt:  CKDGKVIAYASRQLKT

TYK11149.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]1.6e-2445.71Show/hide
Query:  LCIDYEELNKVTIKNKYP-SRIDDLFDRFTGEHVFSKIDPRSLPSIEDREEDIPKTKIEITLWALRVHGDVVWFDQRPCAFMDLMNRFIQGVFDTFVIVF
        LCIDY ELNKVT+KN YP  RIDDLFD+  G  VFSKID +    ++ ++ D+PKT                     P  FMDLMNR  +   DTFVIVF
Subjt:  LCIDYEELNKVTIKNKYP-SRIDDLFDRFTGEHVFSKIDPRSLPSIEDREEDIPKTKIEITLWALRVHGDVVWFDQRPCAFMDLMNRFIQGVFDTFVIVF

Query:  IDDILIYSGWHVVSESWVSLIRKVEEVTRWPRPTTDSE---------------KVVGRMRCKDGKVIAYASRQLK
        IDDILIYS   V  E     + K+E +T WPRP+T SE               K +G +  + G V+AYASRQLK
Subjt:  IDDILIYSGWHVVSESWVSLIRKVEEVTRWPRPTTDSE---------------KVVGRMRCKDGKVIAYASRQLK

TYK22669.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]2.7e-2439.47Show/hide
Query:  LCIDYEELNKVTIKNKYP-SRIDDLFDRFTGEHVFSKIDPRS-LPSIEDREEDIPKTKIEITLWALRVHGDVVWFDQRPCAFMDLMNRFIQGVFDTFVIV
        LCIDY ELNKVT+KN+YP  RIDDLFD+  G  VFSKID RS    +  R+ DIPKT                     P  FMDLMNR  +   D+FVIV
Subjt:  LCIDYEELNKVTIKNKYP-SRIDDLFDRFTGEHVFSKIDPRS-LPSIEDREEDIPKTKIEITLWALRVHGDVVWFDQRPCAFMDLMNRFIQGVFDTFVIV

Query:  FIDDILIYSG------------------------------W--------HVVSESWVSL-IRKVEEVTRWPRPTTDSE----------------------
        FIDDILIYS                               W        HVVS   VS+ + K+E VT WPRP+T SE                      
Subjt:  FIDDILIYSG------------------------------W--------HVVSESWVSL-IRKVEEVTRWPRPTTDSE----------------------

Query:  -----KVVGRMRCKDGKVIAYASRQLKT
             K +G M  + G+V+AYASRQLK+
Subjt:  -----KVVGRMRCKDGKVIAYASRQLKT

TrEMBL top hitse value%identityAlignment
A0A5A7T6T6 Pol protein1.7e-2441.28Show/hide
Query:  LCIDYEELNKVTIKNKYP-SRIDDLFDRFTGEHVFSKIDPRS-LPSIEDREEDIPKTKIEITLWALRVHGDVVWFDQRPCAFMDLMNRFIQGVFDTFVIV
        LCIDY ELNKVT+KN+YP  RIDDLFD+  G  VFSKID RS    +  R+ DIPKT                        FMDLMNR  +   DTFVIV
Subjt:  LCIDYEELNKVTIKNKYP-SRIDDLFDRFTGEHVFSKIDPRS-LPSIEDREEDIPKTKIEITLWALRVHGDVVWFDQRPCAFMDLMNRFIQGVFDTFVIV

Query:  FIDDILIYSG------------------------------W--------HVVSESWVSL-IRKVEEVTRWPRPTTDSE-----------------KVVGR
        FIDDILIYS                               W        HVVS   VS+   K+E VT WPRP+T SE                 K +G 
Subjt:  FIDDILIYSG------------------------------W--------HVVSESWVSL-IRKVEEVTRWPRPTTDSE-----------------KVVGR

Query:  MRCKDGKVIAYASRQLKT
        +  + GKV+AYASRQLK+
Subjt:  MRCKDGKVIAYASRQLKT

A0A5A7UCL9 Ty3-gypsy retrotransposon protein1.7e-2440.74Show/hide
Query:  CIDYEELNKVTIKNKYP-SRIDDLFDRFTGEHVFSKIDPRS-LPSIEDREEDIPKTKIEITLWALRVHGDVVWFDQRPCAFMDLMNRFIQGVFDTFVIVF
        CIDY ELNKVT+KNKYP SRIDDLFD+  G  VFSKID RS    +  ++ D+PKT                     P  FMDLMNR  +   DTFVIVF
Subjt:  CIDYEELNKVTIKNKYP-SRIDDLFDRFTGEHVFSKIDPRS-LPSIEDREEDIPKTKIEITLWALRVHGDVVWFDQRPCAFMDLMNRFIQGVFDTFVIVF

Query:  IDDILIYSG----------------WHVVSESWVSL-IRKVEEVTRWPRPTT--------------------------------------DSEKVVGRMR
        IDDILIYS                  HVVS++ VS+   K+E VT WPRP+T                                       S+K +G + 
Subjt:  IDDILIYSG----------------WHVVSESWVSL-IRKVEEVTRWPRPTT--------------------------------------DSEKVVGRMR

Query:  CKDGKVIAYASRQLKT
         + GKV+AYASRQLK+
Subjt:  CKDGKVIAYASRQLKT

A0A5D3CLQ9 Ty3-gypsy retrotransposon protein7.7e-2545.71Show/hide
Query:  LCIDYEELNKVTIKNKYP-SRIDDLFDRFTGEHVFSKIDPRSLPSIEDREEDIPKTKIEITLWALRVHGDVVWFDQRPCAFMDLMNRFIQGVFDTFVIVF
        LCIDY ELNKVT+KN YP  RIDDLFD+  G  VFSKID +    ++ ++ D+PKT                     P  FMDLMNR  +   DTFVIVF
Subjt:  LCIDYEELNKVTIKNKYP-SRIDDLFDRFTGEHVFSKIDPRSLPSIEDREEDIPKTKIEITLWALRVHGDVVWFDQRPCAFMDLMNRFIQGVFDTFVIVF

Query:  IDDILIYSGWHVVSESWVSLIRKVEEVTRWPRPTTDSE---------------KVVGRMRCKDGKVIAYASRQLK
        IDDILIYS   V  E     + K+E +T WPRP+T SE               K +G +  + G V+AYASRQLK
Subjt:  IDDILIYSGWHVVSESWVSLIRKVEEVTRWPRPTTDSE---------------KVVGRMRCKDGKVIAYASRQLK

A0A5D3D066 Reverse transcriptase2.3e-2446.45Show/hide
Query:  LCIDYEELNKVTIKNKYP-SRIDDLFDRFTGEHVFSKIDPRS-LPSIEDREEDIPKTKIEITLWALRVHGDVVWFDQRPCAFMDLMNRFIQGVFDTFVIV
        LCIDY +LNKVTI+NKYP  RIDDLFD+  G  +FSKID RS    ++ RE +I KT                     P  FMDLMNR      D FVIV
Subjt:  LCIDYEELNKVTIKNKYP-SRIDDLFDRFTGEHVFSKIDPRS-LPSIEDREEDIPKTKIEITLWALRVHGDVVWFDQRPCAFMDLMNRFIQGVFDTFVIV

Query:  FIDDILIYSG---------WHVVSESWVSL-IRKVEEVTRWPRPT--TDSEKVVGRMRCK----------DGKVIAYASRQLK
        FIDDIL+YS           HVVS   VS+  +KVE V  W RPT  T++ K     +C+          DG VIAYASRQLK
Subjt:  FIDDILIYSG---------WHVVSESWVSL-IRKVEEVTRWPRPT--TDSEKVVGRMRCK----------DGKVIAYASRQLK

A0A5D3D4K2 Pol protein1.7e-2441.28Show/hide
Query:  LCIDYEELNKVTIKNKYP-SRIDDLFDRFTGEHVFSKIDPRS-LPSIEDREEDIPKTKIEITLWALRVHGDVVWFDQRPCAFMDLMNRFIQGVFDTFVIV
        LCIDY ELNKVT+KN+YP  RIDDLFD+  G  VFSKID RS    +  R+ DIPKT                        FMDLMNR  +   DTFVIV
Subjt:  LCIDYEELNKVTIKNKYP-SRIDDLFDRFTGEHVFSKIDPRS-LPSIEDREEDIPKTKIEITLWALRVHGDVVWFDQRPCAFMDLMNRFIQGVFDTFVIV

Query:  FIDDILIYSG------------------------------W--------HVVSESWVSL-IRKVEEVTRWPRPTTDSE-----------------KVVGR
        FIDDILIYS                               W        HVVS   VS+   K+E VT WPRP+T SE                 K +G 
Subjt:  FIDDILIYSG------------------------------W--------HVVSESWVSL-IRKVEEVTRWPRPTTDSE-----------------KVVGR

Query:  MRCKDGKVIAYASRQLKT
        +  + GKV+AYASRQLK+
Subjt:  MRCKDGKVIAYASRQLKT

SwissProt top hitse value%identityAlignment
P20825 Retrovirus-related Pol polyprotein from transposon 2973.3e-0427.03Show/hide
Query:  HALCIDYEELNKVTIKNKYP-SRIDDLFDRFTGEHVFSKID-PRSLPSIEDREEDIPKTKIEITLWALRVHGDVVWFDQRPCAFMDLMNRFIQGVFDTFV
        + + IDY +LN++TI ++YP   +D++  +      F+ ID  +    IE  EE I KT                     P  F   MN  ++ + +   
Subjt:  HALCIDYEELNKVTIKNKYP-SRIDDLFDRFTGEHVFSKID-PRSLPSIEDREEDIPKTKIEITLWALRVHGDVVWFDQRPCAFMDLMNRFIQGVFDTFV

Query:  IVFIDDILIYS
        +V++DDI+I+S
Subjt:  IVFIDDILIYS

P31843 RNA-directed DNA polymerase homolog2.1e-1134.65Show/hide
Query:  LCIDYEELNKVTIKNKYP-SRIDDLFDRFTGEHVFSKIDPRS-LPSIEDREEDIPKTKIEITLWALRVHGDVVWFDQRPCAFMDLMNRFIQGVFDTFVIV
        +CIDY  L KVTIKNKYP  R+DDLFDR      F+K+D RS    +   + D PKT       +                F +LMN  +    D FV+V
Subjt:  LCIDYEELNKVTIKNKYP-SRIDDLFDRFTGEHVFSKIDPRS-LPSIEDREEDIPKTKIEITLWALRVHGDVVWFDQRPCAFMDLMNRFIQGVFDTFVIV

Query:  FIDDILIYSGWHVVSESWVSLIRKVEE
        ++DD+++Y+ +       +  +R V E
Subjt:  FIDDILIYSGWHVVSESWVSLIRKVEE

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGCGCCACGAGGACCGGAGGTGTCCTGTGCTACCTTTATCTTTGAGGATGACGCGTATTTATGGTGGGAGTCCGCTTGAGGAACCATTGTCGCCGCCCGTGAGAAA
CGATTACACTGAATCTGTTCAAAGAAGCATTCTGGCGGAAATTCTATCCGCTGCTGACTCGTTCACGAGAAGCGAAGCGGAATTTTCTGGTATGCGTAACCGGAGGGGCA
GTGGAGCTCACAGCGTCCCAGCAGACACCGTGATACTGCGAGGCATTTTTCGTACAAGTTCGGTCGAAACCACAAGTCAACAGATCGGAAATAGCGGCGGAGATAGGCCA
CAGTGCAAAGAATGTGGTAAGCGACCTTTGGGCGGTTATACATCGGGGAGAGCCTATGCAAGCACCGGTAGAGACATCTATGGCTCGGACTCGGTGGTTCTGTGTACACT
TCCATTACTTGGGCACCTTGCCTTTGTATTGTTTGATTCTCAGTTCTACGCAGTCTTTTATTCTGCAACCTTTGCGAAAGTTATTGCATCGGGGAGCTTGGGCGTCTTAG
CTTGCATACTAGATGCAAGGTGCACAGAGATCTCGGTGACATCGCATTACAAAAAGCGGATCCATGCGCTGTGTATAGATTACGAGGAGTTGAATAAAGTTACGATCAAG
AACAAATACCCCTCCCGAATAGATGACTTGTTCGATCGATTCACAGGAGAACATGTGTTCTCCAAGATCGATCCGAGATCGCTACCATCGATTGAGGATCGTGAAGAGGA
TATTCCTAAAACAAAGATTGAGATCACGTTATGGGCATTACGAGTTCACGGTGATGTCGTTTGGTTTGACCAACGCCCCTGCGCTTTCATGGACTTGATGAACCGATTTA
TTCAAGGAGTTTTTGATACGTTCGTGATTGTGTTCATCGATGATATATTGATATACTCCGGATGGCATGTGGTGTCCGAAAGTTGGGTCTCGCTGATCCGTAAAGTGGAG
GAGGTCACACGCTGGCCTCGTCCGACCACCGATTCTGAGAAAGTGGTTGGGCGTATGCGATGCAAAGACGGTAAAGTAATAGCTTATGCATCTCGACAATTGAAGACGCT
TACGAGAGGAACTATCCAACTCATGACACGGAACTAG
mRNA sequenceShow/hide mRNA sequence
ATGAAGCGCCACGAGGACCGGAGGTGTCCTGTGCTACCTTTATCTTTGAGGATGACGCGTATTTATGGTGGGAGTCCGCTTGAGGAACCATTGTCGCCGCCCGTGAGAAA
CGATTACACTGAATCTGTTCAAAGAAGCATTCTGGCGGAAATTCTATCCGCTGCTGACTCGTTCACGAGAAGCGAAGCGGAATTTTCTGGTATGCGTAACCGGAGGGGCA
GTGGAGCTCACAGCGTCCCAGCAGACACCGTGATACTGCGAGGCATTTTTCGTACAAGTTCGGTCGAAACCACAAGTCAACAGATCGGAAATAGCGGCGGAGATAGGCCA
CAGTGCAAAGAATGTGGTAAGCGACCTTTGGGCGGTTATACATCGGGGAGAGCCTATGCAAGCACCGGTAGAGACATCTATGGCTCGGACTCGGTGGTTCTGTGTACACT
TCCATTACTTGGGCACCTTGCCTTTGTATTGTTTGATTCTCAGTTCTACGCAGTCTTTTATTCTGCAACCTTTGCGAAAGTTATTGCATCGGGGAGCTTGGGCGTCTTAG
CTTGCATACTAGATGCAAGGTGCACAGAGATCTCGGTGACATCGCATTACAAAAAGCGGATCCATGCGCTGTGTATAGATTACGAGGAGTTGAATAAAGTTACGATCAAG
AACAAATACCCCTCCCGAATAGATGACTTGTTCGATCGATTCACAGGAGAACATGTGTTCTCCAAGATCGATCCGAGATCGCTACCATCGATTGAGGATCGTGAAGAGGA
TATTCCTAAAACAAAGATTGAGATCACGTTATGGGCATTACGAGTTCACGGTGATGTCGTTTGGTTTGACCAACGCCCCTGCGCTTTCATGGACTTGATGAACCGATTTA
TTCAAGGAGTTTTTGATACGTTCGTGATTGTGTTCATCGATGATATATTGATATACTCCGGATGGCATGTGGTGTCCGAAAGTTGGGTCTCGCTGATCCGTAAAGTGGAG
GAGGTCACACGCTGGCCTCGTCCGACCACCGATTCTGAGAAAGTGGTTGGGCGTATGCGATGCAAAGACGGTAAAGTAATAGCTTATGCATCTCGACAATTGAAGACGCT
TACGAGAGGAACTATCCAACTCATGACACGGAACTAG
Protein sequenceShow/hide protein sequence
MKRHEDRRCPVLPLSLRMTRIYGGSPLEEPLSPPVRNDYTESVQRSILAEILSAADSFTRSEAEFSGMRNRRGSGAHSVPADTVILRGIFRTSSVETTSQQIGNSGGDRP
QCKECGKRPLGGYTSGRAYASTGRDIYGSDSVVLCTLPLLGHLAFVLFDSQFYAVFYSATFAKVIASGSLGVLACILDARCTEISVTSHYKKRIHALCIDYEELNKVTIK
NKYPSRIDDLFDRFTGEHVFSKIDPRSLPSIEDREEDIPKTKIEITLWALRVHGDVVWFDQRPCAFMDLMNRFIQGVFDTFVIVFIDDILIYSGWHVVSESWVSLIRKVE
EVTRWPRPTTDSEKVVGRMRCKDGKVIAYASRQLKTLTRGTIQLMTRN