; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh09G012780 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh09G012780
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionReverse transcriptase
Genome locationCmo_Chr09:11285371..11294587
RNA-Seq ExpressionCmoCh09G012780
SyntenyCmoCh09G012780
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0004519 - endonuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6601747.1 hypothetical protein SDJN03_06980, partial [Cucurbita argyrosperma subsp. sororia]1.1e-9878.99Show/hide
Query:  MDVFKNLDPGTSNVDLGNEGQPVEEIAPAEAVPEPAAQSASRDQPTVVITLEALQSLIESRVDQAMQSRVDQAVQAALVGLGSQAAPTVPVSGQTTLVSE
        M VFK+LD GTSNVD  NE QPVE   P EA P+P A SASRDQ TVVITLEALQSL+ESRVDQA+Q+RVDQ  QAAL GLGSQAAPT  +S Q T VS+
Subjt:  MDVFKNLDPGTSNVDLGNEGQPVEEIAPAEAVPEPAAQSASRDQPTVVITLEALQSLIESRVDQAMQSRVDQAVQAALVGLGSQAAPTVPVSGQTTLVSE

Query:  APGVGVQTVIPPTRLTELPGTAVVTEAPSRVVTYGRRCMTEESEYIRDFMKLGPPTFGGKGTDPEAAEWWLECVETKFTFYNCPENHKVLCATYLLEGPA
        AP      VIPPT  TELPGTAVVTEAP +VVTYGR+CMTEESEYIR FMKL PPTFGGKGTDPEAAEWWLEC+ETKFTF+NCPENHKVLCATYLLEGPA
Subjt:  APGVGVQTVIPPTRLTELPGTAVVTEAPSRVVTYGRRCMTEESEYIRDFMKLGPPTFGGKGTDPEAAEWWLECVETKFTFYNCPENHKVLCATYLLEGPA

Query:  HFWWKSKKPKMEAGGAPITWAAFRHEFCEKYYPALARL
        HF WKSKKPKM+AGGA I WA  +HEFCEKYYPALARL
Subjt:  HFWWKSKKPKMEAGGAPITWAAFRHEFCEKYYPALARL

XP_022930572.1 uncharacterized protein LOC111436980 [Cucurbita moschata]4.9e-11288.09Show/hide
Query:  MQSRVDQAVQAALVGLGSQAAPTVPVSGQTTLVSEAPGVGVQTVIPPTRLTELPGTAVVTEAPSRVVTYGRRCMTEESEYIRDFMKLGPPTFGGKGTDPE
        MQSRVDQAVQAAL GLGSQAAPT PV GQTTLVSEAPGVGVQTVIPPTRLTELP                     EESEYIRDFMKL PPTFGGKGTDPE
Subjt:  MQSRVDQAVQAALVGLGSQAAPTVPVSGQTTLVSEAPGVGVQTVIPPTRLTELPGTAVVTEAPSRVVTYGRRCMTEESEYIRDFMKLGPPTFGGKGTDPE

Query:  AAEWWLECVETKFTFYNCPENHKVLCATYLLEGPAHFWWKSKKPKMEAGGAPITWAAFRHEFCEKYYPALARLRNRKAFMQLEQGNRSVEEYEAEFTRLS
        AAEWW+EC+ETKFTFYNCPENHKVLCATYLLEGPAHFWWKSKKPKMEAGGAPITWAAFRHEFCEKYYPALARLRNRKAFMQLEQGNRSVEEYEAEFTRLS
Subjt:  AAEWWLECVETKFTFYNCPENHKVLCATYLLEGPAHFWWKSKKPKMEAGGAPITWAAFRHEFCEKYYPALARLRNRKAFMQLEQGNRSVEEYEAEFTRLS

Query:  RFVPLMVATEEEKTDLFIQGLRQEIQGSVSAHASQ
        RFVPLMVATEEEKTDLFIQGLRQEIQGSVSAHAS+
Subjt:  RFVPLMVATEEEKTDLFIQGLRQEIQGSVSAHASQ

XP_022933364.1 uncharacterized protein LOC111440728, partial [Cucurbita moschata]3.7e-12381.29Show/hide
Query:  MTEESEYIRDFMKLGPPTFGGKGTDPEAAEWWLECVETKFTFYNCPENHKVLCATYLLEGPAHFWWKSKKPKMEAGGAPITWAAFRHEFCE---------
        MTEESEYIRDFMKL PPTFGGKGTDPEAAEWWLEC+ET F FYNCPE+HKVLCATYLLEGPA+FWWKSKKPKM AGGAPITWAAFRHEFCE         
Subjt:  MTEESEYIRDFMKLGPPTFGGKGTDPEAAEWWLECVETKFTFYNCPENHKVLCATYLLEGPAHFWWKSKKPKMEAGGAPITWAAFRHEFCE---------

Query:  --------------KYYPALARLRNRKAFMQLEQGNRSVEEYEAEFTRLSRFVPLMVATEEEKTDLFIQGLRQEIQGSVSAHASQDFVTAYNAAVKLDAS
                      KYYPALARLRNRKAFMQLEQGNRSVEEYE EFT LSRFVPLM ATEEEKTDLFIQGLRQEIQGSVSAHASQDFVTAYNAAVKLDAS
Subjt:  --------------KYYPALARLRNRKAFMQLEQGNRSVEEYEAEFTRLSRFVPLMVATEEEKTDLFIQGLRQEIQGSVSAHASQDFVTAYNAAVKLDAS

Query:  TPRNNQGSSSQARPSSKRKFLQISTGSQQQTVPQRVDRRPYSGEGRAGCSKCGLIHAGSCNAADKICYNCGKTGHLAR
        TPRNNQGSSS                  QQTVPQRVDRRPYSGEGRAGCSKCGL HAGSCNAADKICYNCGKTGHLAR
Subjt:  TPRNNQGSSSQARPSSKRKFLQISTGSQQQTVPQRVDRRPYSGEGRAGCSKCGLIHAGSCNAADKICYNCGKTGHLAR

XP_022962669.1 uncharacterized protein LOC111463090 [Cucurbita moschata]1.0e-9397.27Show/hide
Query:  MEAGGAPITWAAFRHEFCEKYYPALARLRNRKAFMQLEQGNRSVEEYEAEFTRLSRFVPLMVATEEEKTDLFIQGLRQEIQGSVSAHASQDFVTAYNAAV
        MEAGGAPITWAAFRHEFCEKYYPALARLRNRKAFMQLEQGNRSVEEYEAEFTRLSRFVPLMVATEEEKTDLFIQGLRQEIQGSVSAHASQDFVTAYNAAV
Subjt:  MEAGGAPITWAAFRHEFCEKYYPALARLRNRKAFMQLEQGNRSVEEYEAEFTRLSRFVPLMVATEEEKTDLFIQGLRQEIQGSVSAHASQDFVTAYNAAV

Query:  KLDASTPRNNQGSSSQARPSSKRKFLQISTGSQQQTVPQRVDRRPYSGEGRAGCSKCGLIHAGSCNAADKICYNCGKTGHLAR
        KLDASTPRNNQGSSSQ RPS KRKFLQISTGSQQQTVPQRVDRR YSGEGRAGCSKCGL HAGSCNA DKICYNCGKTGHLAR
Subjt:  KLDASTPRNNQGSSSQARPSSKRKFLQISTGSQQQTVPQRVDRRPYSGEGRAGCSKCGLIHAGSCNAADKICYNCGKTGHLAR

XP_023521106.1 uncharacterized protein LOC111784731 [Cucurbita pepo subsp. pepo]6.3e-8393.18Show/hide
Query:  MDVFKNLDPGTSNVDLGNEGQPVEEIAPAEAVPEPAAQSASRDQPTVVITLEALQSLIESRVDQAMQSRVDQAVQAALVGLGSQAAPTVPVSGQTTLVSE
        M VFKNLDPGTSNVDLGNEGQPVE  APAEA+PEPAAQSASRDQPTVVITLEALQSLIES+VDQA+QSRVDQAVQAAL GLGSQAAPT PVSGQTTLVS+
Subjt:  MDVFKNLDPGTSNVDLGNEGQPVEEIAPAEAVPEPAAQSASRDQPTVVITLEALQSLIESRVDQAMQSRVDQAVQAALVGLGSQAAPTVPVSGQTTLVSE

Query:  APGVGVQTVIPPTRLTELPGTAVVTEAPSRVVTYGRRCMTEESEYIRDFMKLGPPTFGGKGTDPEAAEWWLECVET
        APGVGVQTVIPPTRLTELPGTAVVTEA SRVVTYGRRCMTEESEYIRDFMKL PPTFGGKGTDPEAAEWWLEC+ET
Subjt:  APGVGVQTVIPPTRLTELPGTAVVTEAPSRVVTYGRRCMTEESEYIRDFMKLGPPTFGGKGTDPEAAEWWLECVET

TrEMBL top hitse value%identityAlignment
A0A5C7GTM0 CCHC-type domain-containing protein5.4e-5628.47Show/hide
Query:  FMKLGPPTFGGKGTDPEAAEWWLECVETKFTFYNCPENHKVLCATYLLEGPAHFWWKSKKPKMEAG---GAPITWAAFRHEFCEKYYPALARLRNRKAFM
        F KLGPP F G   DP AAE WL+ +E  FT   CP+  KV  A+++LE  A  WW +    ++        ITW  F++ F EKY+P     +  + F+
Subjt:  FMKLGPPTFGGKGTDPEAAEWWLECVETKFTFYNCPENHKVLCATYLLEGPAHFWWKSKKPKMEAG---GAPITWAAFRHEFCEKYYPALARLRNRKAFM

Query:  QLEQGNRSVEEYEAEFTRLSRFVPLMVATEEEKTDLFIQGLRQEIQGSVSAHASQDFVTAYNAAV----------KLDASTPRNNQGSSSQARPSSKR--
         L+QGN+SV EYE +FT LSRFV  ++  +E K   F+ GL  +I+  V       +    + A+          K      RNNQ    +   + +   
Subjt:  QLEQGNRSVEEYEAEFTRLSRFVPLMVATEEEKTDLFIQGLRQEIQGSVSAHASQDFVTAYNAAV----------KLDASTPRNNQGSSSQARPSSKR--

Query:  KFLQISTGSQQQTVPQRVDRRPYSGEGRAGCSKCGLIHAGSCNAADKICYNCGKTGHLARSL--------------------------------------
        +F + + G  +       D      +    C +CG  H+G C  + K C+ CGK+ H  +                                        
Subjt:  KFLQISTGSQQQTVPQRVDRRPYSGEGRAGCSKCGLIHAGSCNAADKICYNCGKTGHLARSL--------------------------------------

Query:  -------------------------------------------LILSIVTP------VNRVGDERALQV---------------DMDVFKNLD---PGTS
                                                   L ++I TP      V +V     L V               D D+   +D      +
Subjt:  -------------------------------------------LILSIVTP------VNRVGDERALQV---------------DMDVFKNLD---PGTS

Query:  NVDLGNE----GQPVEE--------------IAPAEAVPEPAAQSASRDQP---------------------TVVITLEALQS-----------------
        ++D  ++     +P E+              I+  +      A + SR                         +V+T  A +S                 
Subjt:  NVDLGNE----GQPVEE--------------IAPAEAVPEPAAQSASRDQP---------------------TVVITLEALQS-----------------

Query:  ---LIESRV-----------DQAMQSRV-------------------------------DQAVQAALATIQMAPFEALYGRRCRTPIYWEEVGSKPLLGP
           L ES V           D  M+  +                                + V   +++I M P+EALYGR+CR+PI W+EVG + LLGP
Subjt:  ---LIESRV-----------DQAMQSRV-------------------------------DQAVQAALATIQMAPFEALYGRRCRTPIYWEEVGSKPLLGP

Query:  DLLRTTNE--AIQKIKKRILTAQSRQKSYADIRRKDLEFEVGDHVFLKIAPVRGVLRFGRKGKLSPRFIGPFEILERVGPVAYRLTLPPALDAVHNVFHV
        +L++ T +   I+ IK+R+  AQSRQKSYAD RR+DLEFE GD VFLK++P +GV RFG+KGKLSPRFIGPFE+LER+G VAYR+ LPP L  +HNVFHV
Subjt:  DLLRTTNE--AIQKIKKRILTAQSRQKSYADIRRKDLEFEVGDHVFLKIAPVRGVLRFGRKGKLSPRFIGPFEILERVGPVAYRLTLPPALDAVHNVFHV

Query:  SMLRSNCEYVMHV
        S+LR       HV
Subjt:  SMLRSNCEYVMHV

A0A6J1EQX1 uncharacterized protein LOC1114369802.4e-11288.09Show/hide
Query:  MQSRVDQAVQAALVGLGSQAAPTVPVSGQTTLVSEAPGVGVQTVIPPTRLTELPGTAVVTEAPSRVVTYGRRCMTEESEYIRDFMKLGPPTFGGKGTDPE
        MQSRVDQAVQAAL GLGSQAAPT PV GQTTLVSEAPGVGVQTVIPPTRLTELP                     EESEYIRDFMKL PPTFGGKGTDPE
Subjt:  MQSRVDQAVQAALVGLGSQAAPTVPVSGQTTLVSEAPGVGVQTVIPPTRLTELPGTAVVTEAPSRVVTYGRRCMTEESEYIRDFMKLGPPTFGGKGTDPE

Query:  AAEWWLECVETKFTFYNCPENHKVLCATYLLEGPAHFWWKSKKPKMEAGGAPITWAAFRHEFCEKYYPALARLRNRKAFMQLEQGNRSVEEYEAEFTRLS
        AAEWW+EC+ETKFTFYNCPENHKVLCATYLLEGPAHFWWKSKKPKMEAGGAPITWAAFRHEFCEKYYPALARLRNRKAFMQLEQGNRSVEEYEAEFTRLS
Subjt:  AAEWWLECVETKFTFYNCPENHKVLCATYLLEGPAHFWWKSKKPKMEAGGAPITWAAFRHEFCEKYYPALARLRNRKAFMQLEQGNRSVEEYEAEFTRLS

Query:  RFVPLMVATEEEKTDLFIQGLRQEIQGSVSAHASQ
        RFVPLMVATEEEKTDLFIQGLRQEIQGSVSAHAS+
Subjt:  RFVPLMVATEEEKTDLFIQGLRQEIQGSVSAHASQ

A0A6J1EZK0 uncharacterized protein LOC1114407281.8e-12381.29Show/hide
Query:  MTEESEYIRDFMKLGPPTFGGKGTDPEAAEWWLECVETKFTFYNCPENHKVLCATYLLEGPAHFWWKSKKPKMEAGGAPITWAAFRHEFCE---------
        MTEESEYIRDFMKL PPTFGGKGTDPEAAEWWLEC+ET F FYNCPE+HKVLCATYLLEGPA+FWWKSKKPKM AGGAPITWAAFRHEFCE         
Subjt:  MTEESEYIRDFMKLGPPTFGGKGTDPEAAEWWLECVETKFTFYNCPENHKVLCATYLLEGPAHFWWKSKKPKMEAGGAPITWAAFRHEFCE---------

Query:  --------------KYYPALARLRNRKAFMQLEQGNRSVEEYEAEFTRLSRFVPLMVATEEEKTDLFIQGLRQEIQGSVSAHASQDFVTAYNAAVKLDAS
                      KYYPALARLRNRKAFMQLEQGNRSVEEYE EFT LSRFVPLM ATEEEKTDLFIQGLRQEIQGSVSAHASQDFVTAYNAAVKLDAS
Subjt:  --------------KYYPALARLRNRKAFMQLEQGNRSVEEYEAEFTRLSRFVPLMVATEEEKTDLFIQGLRQEIQGSVSAHASQDFVTAYNAAVKLDAS

Query:  TPRNNQGSSSQARPSSKRKFLQISTGSQQQTVPQRVDRRPYSGEGRAGCSKCGLIHAGSCNAADKICYNCGKTGHLAR
        TPRNNQGSSS                  QQTVPQRVDRRPYSGEGRAGCSKCGL HAGSCNAADKICYNCGKTGHLAR
Subjt:  TPRNNQGSSSQARPSSKRKFLQISTGSQQQTVPQRVDRRPYSGEGRAGCSKCGLIHAGSCNAADKICYNCGKTGHLAR

A0A6J1HHR1 uncharacterized protein LOC1114630905.0e-9497.27Show/hide
Query:  MEAGGAPITWAAFRHEFCEKYYPALARLRNRKAFMQLEQGNRSVEEYEAEFTRLSRFVPLMVATEEEKTDLFIQGLRQEIQGSVSAHASQDFVTAYNAAV
        MEAGGAPITWAAFRHEFCEKYYPALARLRNRKAFMQLEQGNRSVEEYEAEFTRLSRFVPLMVATEEEKTDLFIQGLRQEIQGSVSAHASQDFVTAYNAAV
Subjt:  MEAGGAPITWAAFRHEFCEKYYPALARLRNRKAFMQLEQGNRSVEEYEAEFTRLSRFVPLMVATEEEKTDLFIQGLRQEIQGSVSAHASQDFVTAYNAAV

Query:  KLDASTPRNNQGSSSQARPSSKRKFLQISTGSQQQTVPQRVDRRPYSGEGRAGCSKCGLIHAGSCNAADKICYNCGKTGHLAR
        KLDASTPRNNQGSSSQ RPS KRKFLQISTGSQQQTVPQRVDRR YSGEGRAGCSKCGL HAGSCNA DKICYNCGKTGHLAR
Subjt:  KLDASTPRNNQGSSSQARPSSKRKFLQISTGSQQQTVPQRVDRRPYSGEGRAGCSKCGLIHAGSCNAADKICYNCGKTGHLAR

A0A7J0DJJ1 Retrotrans_gag domain-containing protein4.6e-5529.68Show/hide
Query:  IRDFMKLGPPTFGGKGTDPEAAEWWLECVETKFTFYNCPENHKVLCATYLLEGPAHFWWKSKKPKMEAGGAPI-TWAAFRHEFCEKYYPALARLRNRKAF
        I+ F +L PPTF G   DP AAE WL  +E  F    C +  KV+ AT++ EG A  WW+ KKP       P+  W  F   F E+Y+  + R +  + F
Subjt:  IRDFMKLGPPTFGGKGTDPEAAEWWLECVETKFTFYNCPENHKVLCATYLLEGPAHFWWKSKKPKMEAGGAPI-TWAAFRHEFCEKYYPALARLRNRKAF

Query:  MQLEQGNRSVEEYEAEFTRLSRFVPLMVATEEEKTDLFIQGLRQEIQGSV---------SAHASQDFV---------TAYNAAVKLDASTPRNNQGSSSQ
        + L+QGN  V  Y A+F  LSR+ P +V+TE  K   F  GLR  I+  V           H  +D           T+ +A+V+   +   NNQ    +
Subjt:  MQLEQGNRSVEEYEAEFTRLSRFVPLMVATEEEKTDLFIQGLRQEIQGSV---------SAHASQDFV---------TAYNAAVKLDASTPRNNQGSSSQ

Query:  A-------RPSSKRKFLQI------------STGSQQQTVPQRVDRRPYSGEGRAGCSKCGLIHAGSCNAADKICYNC----------------------
        A        P++    L I             +GS    V    +    +            + +G     D++  +C                      
Subjt:  A-------RPSSKRKFLQI------------STGSQQQTVPQRVDRRPYSGEGRAGCSKCGLIHAGSCNAADKICYNC----------------------

Query:  ------------------------GKTGHLARSLLILSIVTP---VNRVGDERALQ----------VDMDV-------------FKNLDPGTSNV--DLG
                                    H   +   + +V P   ++ +   R +Q          VD  V             F +  PG +N+  D  
Subjt:  ------------------------GKTGHLARSLLILSIVTP---VNRVGDERALQ----------VDMDV-------------FKNLDPGTSNV--DLG

Query:  NEGQPV-------EEIAPAEAVPEPAAQSASRDQPTVVITLEALQSLI------------------ESRVDQAMQSRVDQAVQAALATIQMAPFEALYGR
        +    V        ++   + +     +  + D  T++ T+ A  +LI                  E  V+      +     +  A+I MAP+EALYGR
Subjt:  NEGQPV-------EEIAPAEAVPEPAAQSASRDQPTVVITLEALQSLI------------------ESRVDQAMQSRVDQAVQAALATIQMAPFEALYGR

Query:  RCRTPIYWEEVGSKPLLGPDLLRTTNEAIQKIKKRILTAQSRQKSYADIRRKDLEFEVGDHVFLKIAPVRGVLRFGRKGKLSPRFIGPFEILERVGPVAY
        +CR+PI W EVG + +LGP++++ T + I+ I++R+ TAQSRQKSYA++RR+DLEF  GDHVFLKI+P +G+ RFG++GKL PR+IGPFEIL+R+G VAY
Subjt:  RCRTPIYWEEVGSKPLLGPDLLRTTNEAIQKIKKRILTAQSRQKSYADIRRKDLEFEVGDHVFLKIAPVRGVLRFGRKGKLSPRFIGPFEILERVGPVAY

Query:  RLTLPPALDAVHNVFHVSMLRSNCEYVMHV
         + LPP L  VH+VFHV MLR       HV
Subjt:  RLTLPPALDAVHNVFHVSMLRSNCEYVMHV

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein2.9e-0631.08Show/hide
Query:  SRVDQAVQAAL-ATIQMAPFEALYGRRCRTPIYWEEVGSKPLLGPDLLRTTNEAIQKIKKRILTAQSRQKSYADIRRKDL-EFEVGDHVFLKIAPVRGVL
        S V Q+   A+ +  QM PFE ++  R    +   E+ S      +  + T +  Q +K+ + T   + K Y D++ +++ EF+ GD V +K     G L
Subjt:  SRVDQAVQAAL-ATIQMAPFEALYGRRCRTPIYWEEVGSKPLLGPDLLRTTNEAIQKIKKRILTAQSRQKSYADIRRKDL-EFEVGDHVFLKIAPVRGVL

Query:  RFGRKGKLSPRFIGPFEILERVGPVAYRLTLPPALDAV-HNVFHVSML
           +  KL+P F GPF +L++ GP  Y L LP ++  +  + FHVS L
Subjt:  RFGRKGKLSPRFIGPFEILERVGPVAYRLTLPPALDAV-HNVFHVSML

P0CT35 Transposon Tf2-2 polyprotein2.9e-0631.08Show/hide
Query:  SRVDQAVQAAL-ATIQMAPFEALYGRRCRTPIYWEEVGSKPLLGPDLLRTTNEAIQKIKKRILTAQSRQKSYADIRRKDL-EFEVGDHVFLKIAPVRGVL
        S V Q+   A+ +  QM PFE ++  R    +   E+ S      +  + T +  Q +K+ + T   + K Y D++ +++ EF+ GD V +K     G L
Subjt:  SRVDQAVQAAL-ATIQMAPFEALYGRRCRTPIYWEEVGSKPLLGPDLLRTTNEAIQKIKKRILTAQSRQKSYADIRRKDL-EFEVGDHVFLKIAPVRGVL

Query:  RFGRKGKLSPRFIGPFEILERVGPVAYRLTLPPALDAV-HNVFHVSML
           +  KL+P F GPF +L++ GP  Y L LP ++  +  + FHVS L
Subjt:  RFGRKGKLSPRFIGPFEILERVGPVAYRLTLPPALDAV-HNVFHVSML

P0CT36 Transposon Tf2-3 polyprotein2.9e-0631.08Show/hide
Query:  SRVDQAVQAAL-ATIQMAPFEALYGRRCRTPIYWEEVGSKPLLGPDLLRTTNEAIQKIKKRILTAQSRQKSYADIRRKDL-EFEVGDHVFLKIAPVRGVL
        S V Q+   A+ +  QM PFE ++  R    +   E+ S      +  + T +  Q +K+ + T   + K Y D++ +++ EF+ GD V +K     G L
Subjt:  SRVDQAVQAAL-ATIQMAPFEALYGRRCRTPIYWEEVGSKPLLGPDLLRTTNEAIQKIKKRILTAQSRQKSYADIRRKDL-EFEVGDHVFLKIAPVRGVL

Query:  RFGRKGKLSPRFIGPFEILERVGPVAYRLTLPPALDAV-HNVFHVSML
           +  KL+P F GPF +L++ GP  Y L LP ++  +  + FHVS L
Subjt:  RFGRKGKLSPRFIGPFEILERVGPVAYRLTLPPALDAV-HNVFHVSML

P0CT41 Transposon Tf2-12 polyprotein2.9e-0631.08Show/hide
Query:  SRVDQAVQAAL-ATIQMAPFEALYGRRCRTPIYWEEVGSKPLLGPDLLRTTNEAIQKIKKRILTAQSRQKSYADIRRKDL-EFEVGDHVFLKIAPVRGVL
        S V Q+   A+ +  QM PFE ++  R    +   E+ S      +  + T +  Q +K+ + T   + K Y D++ +++ EF+ GD V +K     G L
Subjt:  SRVDQAVQAAL-ATIQMAPFEALYGRRCRTPIYWEEVGSKPLLGPDLLRTTNEAIQKIKKRILTAQSRQKSYADIRRKDL-EFEVGDHVFLKIAPVRGVL

Query:  RFGRKGKLSPRFIGPFEILERVGPVAYRLTLPPALDAV-HNVFHVSML
           +  KL+P F GPF +L++ GP  Y L LP ++  +  + FHVS L
Subjt:  RFGRKGKLSPRFIGPFEILERVGPVAYRLTLPPALDAV-HNVFHVSML

Q9UR07 Transposon Tf2-11 polyprotein2.9e-0631.08Show/hide
Query:  SRVDQAVQAAL-ATIQMAPFEALYGRRCRTPIYWEEVGSKPLLGPDLLRTTNEAIQKIKKRILTAQSRQKSYADIRRKDL-EFEVGDHVFLKIAPVRGVL
        S V Q+   A+ +  QM PFE ++  R    +   E+ S      +  + T +  Q +K+ + T   + K Y D++ +++ EF+ GD V +K     G L
Subjt:  SRVDQAVQAAL-ATIQMAPFEALYGRRCRTPIYWEEVGSKPLLGPDLLRTTNEAIQKIKKRILTAQSRQKSYADIRRKDL-EFEVGDHVFLKIAPVRGVL

Query:  RFGRKGKLSPRFIGPFEILERVGPVAYRLTLPPALDAV-HNVFHVSML
           +  KL+P F GPF +L++ GP  Y L LP ++  +  + FHVS L
Subjt:  RFGRKGKLSPRFIGPFEILERVGPVAYRLTLPPALDAV-HNVFHVSML

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAATTTTTGCTTCGGGACACAACGGGTATTAAAGGAAAAGATTGGGAAAAATTGCAGGGAGGAGAGGTTGAAGACGAGCGCGTGAAGGACACGCGAGGCGCGTGGGA
GGCCGCGAGTGGATGTGCGACTCTCAGCGGTATCTCGCCTCGGTTAGATATGGACGTATTTAAGAATTTAGACCCTGGTACATCCAATGTGGACTTGGGAAATGAGGGGC
AGCCTGTAGAGGAAATTGCTCCAGCAGAGGCGGTTCCGGAGCCTGCTGCTCAGTCGGCATCCAGAGATCAGCCGACTGTTGTGATTACTTTGGAAGCATTACAATCATTG
ATTGAGAGTCGAGTAGATCAGGCAATGCAGAGCCGGGTGGATCAAGCGGTTCAGGCAGCCCTTGTTGGTCTTGGAAGCCAGGCGGCTCCAACAGTACCTGTATCGGGCCA
GACGACATTGGTGTCTGAAGCACCAGGAGTAGGTGTTCAGACAGTAATACCTCCAACACGGTTGACAGAGCTACCTGGTACAGCTGTGGTGACAGAGGCACCATCGCGGG
TAGTAACTTATGGTCGACGATGTATGACAGAAGAGAGTGAGTACATACGAGATTTCATGAAACTTGGCCCGCCAACTTTTGGAGGAAAGGGGACTGATCCGGAGGCAGCT
GAATGGTGGTTGGAATGTGTTGAAACAAAATTTACATTCTACAACTGCCCAGAGAATCATAAAGTGTTGTGTGCTACGTATTTGTTGGAGGGGCCAGCCCATTTTTGGTG
GAAATCAAAGAAGCCGAAGATGGAGGCTGGTGGAGCCCCAATCACATGGGCGGCCTTTAGACATGAGTTTTGTGAGAAGTATTATCCTGCTCTAGCGAGATTGCGAAACC
GAAAAGCATTTATGCAATTGGAACAGGGTAATAGGTCAGTAGAGGAGTATGAGGCGGAATTCACACGATTATCTAGATTTGTTCCTCTTATGGTTGCTACTGAGGAAGAG
AAGACGGATCTCTTTATTCAGGGTTTGAGGCAAGAAATACAGGGATCCGTGTCTGCCCATGCTTCTCAAGATTTTGTTACGGCTTATAATGCTGCGGTCAAGTTAGATGC
TAGCACCCCGAGGAATAATCAGGGTTCTAGCAGTCAGGCACGGCCAAGTAGTAAACGAAAATTTCTTCAAATATCCACGGGTTCGCAACAGCAGACGGTACCACAACGAG
TTGATCGCCGGCCTTATTCCGGTGAAGGGAGGGCAGGTTGTAGCAAGTGTGGGTTAATTCATGCGGGTTCTTGTAATGCTGCAGACAAGATATGTTATAACTGTGGCAAG
ACTGGTCATCTCGCCAGAAGTCTCTTAATATTGTCGATAGTAACTCCTGTTAATCGCGTTGGCGATGAGCGGGCGTTACAAGTTGATATGGACGTATTTAAGAATTTAGA
CCCTGGTACATCCAATGTGGACTTGGGAAATGAGGGGCAGCCTGTAGAGGAAATTGCTCCAGCAGAGGCGGTTCCGGAGCCTGCTGCTCAGTCGGCATCTAGAGATCAGC
CGACTGTTGTGATTACTTTGGAAGCATTACAATCATTGATTGAGAGTCGAGTAGATCAGGCAATGCAGAGCCGGGTGGATCAAGCGGTTCAGGCAGCCCTTGCAACTATC
CAGATGGCTCCTTTTGAAGCACTTTATGGACGGAGGTGTCGAACTCCAATTTATTGGGAGGAGGTAGGTAGTAAACCGCTATTAGGTCCAGATTTGCTACGAACGACCAA
TGAGGCGATTCAGAAGATTAAGAAGAGAATTTTGACCGCCCAAAGTCGTCAGAAGAGTTATGCAGATATCAGAAGGAAGGATTTAGAGTTTGAGGTGGGGGATCATGTAT
TTCTGAAGATTGCACCAGTTCGTGGTGTTTTGCGATTTGGTAGAAAGGGGAAGCTCAGTCCACGCTTTATAGGACCTTTTGAGATCCTTGAAAGAGTTGGCCCAGTAGCC
TATAGATTGACATTACCACCAGCGTTGGATGCTGTACATAATGTTTTTCATGTTTCGATGCTGAGGAGTAACTGTGAGTATGTTATGCATGTTAATGGAAAGGTTATGAC
TCTAGATATGCTTATTGGTGGATGA
mRNA sequenceShow/hide mRNA sequence
ATGGAATTTTTGCTTCGGGACACAACGGGTATTAAAGGAAAAGATTGGGAAAAATTGCAGGGAGGAGAGGTTGAAGACGAGCGCGTGAAGGACACGCGAGGCGCGTGGGA
GGCCGCGAGTGGATGTGCGACTCTCAGCGGTATCTCGCCTCGGTTAGATATGGACGTATTTAAGAATTTAGACCCTGGTACATCCAATGTGGACTTGGGAAATGAGGGGC
AGCCTGTAGAGGAAATTGCTCCAGCAGAGGCGGTTCCGGAGCCTGCTGCTCAGTCGGCATCCAGAGATCAGCCGACTGTTGTGATTACTTTGGAAGCATTACAATCATTG
ATTGAGAGTCGAGTAGATCAGGCAATGCAGAGCCGGGTGGATCAAGCGGTTCAGGCAGCCCTTGTTGGTCTTGGAAGCCAGGCGGCTCCAACAGTACCTGTATCGGGCCA
GACGACATTGGTGTCTGAAGCACCAGGAGTAGGTGTTCAGACAGTAATACCTCCAACACGGTTGACAGAGCTACCTGGTACAGCTGTGGTGACAGAGGCACCATCGCGGG
TAGTAACTTATGGTCGACGATGTATGACAGAAGAGAGTGAGTACATACGAGATTTCATGAAACTTGGCCCGCCAACTTTTGGAGGAAAGGGGACTGATCCGGAGGCAGCT
GAATGGTGGTTGGAATGTGTTGAAACAAAATTTACATTCTACAACTGCCCAGAGAATCATAAAGTGTTGTGTGCTACGTATTTGTTGGAGGGGCCAGCCCATTTTTGGTG
GAAATCAAAGAAGCCGAAGATGGAGGCTGGTGGAGCCCCAATCACATGGGCGGCCTTTAGACATGAGTTTTGTGAGAAGTATTATCCTGCTCTAGCGAGATTGCGAAACC
GAAAAGCATTTATGCAATTGGAACAGGGTAATAGGTCAGTAGAGGAGTATGAGGCGGAATTCACACGATTATCTAGATTTGTTCCTCTTATGGTTGCTACTGAGGAAGAG
AAGACGGATCTCTTTATTCAGGGTTTGAGGCAAGAAATACAGGGATCCGTGTCTGCCCATGCTTCTCAAGATTTTGTTACGGCTTATAATGCTGCGGTCAAGTTAGATGC
TAGCACCCCGAGGAATAATCAGGGTTCTAGCAGTCAGGCACGGCCAAGTAGTAAACGAAAATTTCTTCAAATATCCACGGGTTCGCAACAGCAGACGGTACCACAACGAG
TTGATCGCCGGCCTTATTCCGGTGAAGGGAGGGCAGGTTGTAGCAAGTGTGGGTTAATTCATGCGGGTTCTTGTAATGCTGCAGACAAGATATGTTATAACTGTGGCAAG
ACTGGTCATCTCGCCAGAAGTCTCTTAATATTGTCGATAGTAACTCCTGTTAATCGCGTTGGCGATGAGCGGGCGTTACAAGTTGATATGGACGTATTTAAGAATTTAGA
CCCTGGTACATCCAATGTGGACTTGGGAAATGAGGGGCAGCCTGTAGAGGAAATTGCTCCAGCAGAGGCGGTTCCGGAGCCTGCTGCTCAGTCGGCATCTAGAGATCAGC
CGACTGTTGTGATTACTTTGGAAGCATTACAATCATTGATTGAGAGTCGAGTAGATCAGGCAATGCAGAGCCGGGTGGATCAAGCGGTTCAGGCAGCCCTTGCAACTATC
CAGATGGCTCCTTTTGAAGCACTTTATGGACGGAGGTGTCGAACTCCAATTTATTGGGAGGAGGTAGGTAGTAAACCGCTATTAGGTCCAGATTTGCTACGAACGACCAA
TGAGGCGATTCAGAAGATTAAGAAGAGAATTTTGACCGCCCAAAGTCGTCAGAAGAGTTATGCAGATATCAGAAGGAAGGATTTAGAGTTTGAGGTGGGGGATCATGTAT
TTCTGAAGATTGCACCAGTTCGTGGTGTTTTGCGATTTGGTAGAAAGGGGAAGCTCAGTCCACGCTTTATAGGACCTTTTGAGATCCTTGAAAGAGTTGGCCCAGTAGCC
TATAGATTGACATTACCACCAGCGTTGGATGCTGTACATAATGTTTTTCATGTTTCGATGCTGAGGAGTAACTGTGAGTATGTTATGCATGTTAATGGAAAGGTTATGAC
TCTAGATATGCTTATTGGTGGATGA
Protein sequenceShow/hide protein sequence
MEFLLRDTTGIKGKDWEKLQGGEVEDERVKDTRGAWEAASGCATLSGISPRLDMDVFKNLDPGTSNVDLGNEGQPVEEIAPAEAVPEPAAQSASRDQPTVVITLEALQSL
IESRVDQAMQSRVDQAVQAALVGLGSQAAPTVPVSGQTTLVSEAPGVGVQTVIPPTRLTELPGTAVVTEAPSRVVTYGRRCMTEESEYIRDFMKLGPPTFGGKGTDPEAA
EWWLECVETKFTFYNCPENHKVLCATYLLEGPAHFWWKSKKPKMEAGGAPITWAAFRHEFCEKYYPALARLRNRKAFMQLEQGNRSVEEYEAEFTRLSRFVPLMVATEEE
KTDLFIQGLRQEIQGSVSAHASQDFVTAYNAAVKLDASTPRNNQGSSSQARPSSKRKFLQISTGSQQQTVPQRVDRRPYSGEGRAGCSKCGLIHAGSCNAADKICYNCGK
TGHLARSLLILSIVTPVNRVGDERALQVDMDVFKNLDPGTSNVDLGNEGQPVEEIAPAEAVPEPAAQSASRDQPTVVITLEALQSLIESRVDQAMQSRVDQAVQAALATI
QMAPFEALYGRRCRTPIYWEEVGSKPLLGPDLLRTTNEAIQKIKKRILTAQSRQKSYADIRRKDLEFEVGDHVFLKIAPVRGVLRFGRKGKLSPRFIGPFEILERVGPVA
YRLTLPPALDAVHNVFHVSMLRSNCEYVMHVNGKVMTLDMLIGG