; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0010665 (gene) of Chayote v1 genome

Gene IDSed0010665
OrganismSechium edule (Chayote v1)
DescriptionReverse transcriptase
Genome locationLG03:40964733..40966378
RNA-Seq ExpressionSed0010665
SyntenySed0010665
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004518 - nuclease activity (molecular function)
GO:0016779 - nucleotidyltransferase activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035574.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]7.0e-16056.74Show/hide
Query:  MDMMDRVFKSFLDSFVIVFIDDILIYSRSKEDHAEYLRRVLAALRENQLYAKFSKCDFWLEQVAFLGHVVSRDGIAVDPAKIEAVSSLAADFRLRDTQLF
        MD+M+RVF+ FLD+FVIVFIDDILIYS+++  H E+LR +L  LR+N+LYAKFSKC+FWL+QV+FLGHVVS+ G++VDPAKIEAV+S      + + + F
Subjt:  MDMMDRVFKSFLDSFVIVFIDDILIYSRSKEDHAEYLRRVLAALRENQLYAKFSKCDFWLEQVAFLGHVVSRDGIAVDPAKIEAVSSLAADFRLRDTQLF

Query:  --------WF-------------------------------------GELLPPV-----------------LGSRVFAAKIWRHYPYGERIQMFTDHQSL
                WF                                     G+++                    L + VFA KIWRHY YGE+IQ+FTDH+SL
Subjt:  --------WF-------------------------------------GELLPPV-----------------LGSRVFAAKIWRHYPYGERIQMFTDHQSL

Query:  KYLFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKTSHSAALLSSQPRLRREFERAEVAVLLGEITARVPQLTVQPSIRQRIIDSQLSDPA
        KY FTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRK SHSAAL++ Q  L R+ ERAE+AV +G +T ++ QLTVQP++RQRIID+Q+++P 
Subjt:  KYLFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKTSHSAALLSSQPRLRREFERAEVAVLLGEITARVPQLTVQPSIRQRIIDSQLSDPA

Query:  LKRLFDRVSAEGDFDFVFSPDSGLAYRGRLCVPDVAGLRAELLSEAHSSPFAWHPGGTKMYQDLKQTFWWMGMKRNVAEFVSRCLTCQQVKAPRQRPAGL
        L        A    +F  S D GL +  RLCVP  + ++ ELLSEAHSSPF+ HPG TKMYQDLK+ +WW  MKR VAEFVS+CL CQQVKAPRQ+P GL
Subjt:  LKRLFDRVSAEGDFDFVFSPDSGLAYRGRLCVPDVAGLRAELLSEAHSSPFAWHPGGTKMYQDLKQTFWWMGMKRNVAEFVSRCLTCQQVKAPRQRPAGL

Query:  LQPLSVPQWKWEEISMDFISGLPRTPRKYSQIWVIVDRLTKSAHFIPGRDTYTVEQWAQLFLEQIIRYHGVPVRIVSDQDSRFTSRFWRAHQKGLGT
        LQPLS+P+WKWE +SMDFI+GLPRT R ++ IWV+VDR TKSAHF+PG+ TYT  +WAQL++ +I+R HGVPV IVSD+D+RFTS+FW+  Q  +GT
Subjt:  LQPLSVPQWKWEEISMDFISGLPRTPRKYSQIWVIVDRLTKSAHFIPGRDTYTVEQWAQLFLEQIIRYHGVPVRIVSDQDSRFTSRFWRAHQKGLGT

KAA0035938.1 pol protein [Cucumis melo var. makuwa]1.5e-16259.05Show/hide
Query:  MDMMDRVFKSFLDSFVIVFIDDILIYSRSKEDHAEYLRRVLAALRENQLYAKFSKCDFWLEQVAFLGHVVSRDGIAVDPAKIEAVS--------------
        MD+M+RVF+ FLD+FVIVFIDDILIYS+++ +H E+LR VL  LR+N+LYAKFSKC+FWL+QV+FLGHVVS+ G++VDPAKIEAV+              
Subjt:  MDMMDRVFKSFLDSFVIVFIDDILIYSRSKEDHAEYLRRVLAALRENQLYAKFSKCDFWLEQVAFLGHVVSRDGIAVDPAKIEAVS--------------

Query:  -SLAADFR-------------------------LRDTQLFWFGELLPP---VLGSRVFAAKIWRHYPYGERIQMFTDHQSLKYLFTQKELNMRQRRWLEL
          LA  +R                             QL    +  P     L + VFA KIWRHY YGE+IQ+FTDH+SLKY FTQKELNMRQRRWLEL
Subjt:  -SLAADFR-------------------------LRDTQLFWFGELLPP---VLGSRVFAAKIWRHYPYGERIQMFTDHQSLKYLFTQKELNMRQRRWLEL

Query:  VKDYDCEILYHPGKANVVADALSRKTSHSAALLSSQPRLRREFERAEVAVLLGEITARVPQLTVQPSIRQRIIDSQLSDPALKRLFDRVSAEGDFDFVFS
        VKDYDCEILYHPGKANVV DALSRK SHSAAL++ Q  L R+ ERAE+AV +G +T ++ QLTVQP++RQRIID+Q +DP L        A    +F  S
Subjt:  VKDYDCEILYHPGKANVVADALSRKTSHSAALLSSQPRLRREFERAEVAVLLGEITARVPQLTVQPSIRQRIIDSQLSDPALKRLFDRVSAEGDFDFVFS

Query:  PDSGLAYRGRLCVPDVAGLRAELLSEAHSSPFAWHPGGTKMYQDLKQTFWWMGMKRNVAEFVSRCLTCQQVKAPRQRPAGLLQPLSVPQWKWEEISMDFI
         D GL +  RLCVP  + ++ ELL+EAHSSPF+ HPG TKMYQDLK+ +WW  MKR VAEFVS+CL CQQVKAPRQ+PAGLLQPLS+P+WKWE +SMDFI
Subjt:  PDSGLAYRGRLCVPDVAGLRAELLSEAHSSPFAWHPGGTKMYQDLKQTFWWMGMKRNVAEFVSRCLTCQQVKAPRQRPAGLLQPLSVPQWKWEEISMDFI

Query:  SGLPRTPRKYSQIWVIVDRLTKSAHFIPGRDTYTVEQWAQLFLEQIIRYHGVPVRIVSDQDSRFTSRFWRAHQKGLGTKCAFKDDF
        +GLPRT R ++ IWV+VDRLTKSAHF+PG+ TYT  +WAQL++ +I+R HGVPV IVSD+D+RFTS+FW+  Q  +GT+  F   F
Subjt:  SGLPRTPRKYSQIWVIVDRLTKSAHFIPGRDTYTVEQWAQLFLEQIIRYHGVPVRIVSDQDSRFTSRFWRAHQKGLGTKCAFKDDF

KAA0041490.1 hypothetical protein E6C27_scaffold6G00780 [Cucumis melo var. makuwa]2.8e-16160.25Show/hide
Query:  MDMMDRVFKSFLDSFVIVFIDDILIYSRSKEDHAEYLRRVLAALRENQLYAKFSKCDFWLEQVAFLGHVVSRDGIAVDPAKIEAVSS-------------
        MD+M+RVFK FLDSFVIVFI DIL+YS+++ +H E+LR+VL  LR N+LYAKFSKC+FWL++V FLGHVVS +G++VDPAKIEAV++             
Subjt:  MDMMDRVFKSFLDSFVIVFIDDILIYSRSKEDHAEYLRRVLAALRENQLYAKFSKCDFWLEQVAFLGHVVSRDGIAVDPAKIEAVSS-------------

Query:  --LAADFR------------LRDTQLFWFGELLPP---VLGSRVFAAKIWRHYPYGERIQMFTDHQSLKYLFTQKELNMRQRRWLELVKDYDCEILYHPG
          LA  +R                QL    +  P     L + VFA KIWRHY YGE+IQ++TDH+SLKY FTQKELNMR RRWLELVKDYDCEILYHPG
Subjt:  --LAADFR------------LRDTQLFWFGELLPP---VLGSRVFAAKIWRHYPYGERIQMFTDHQSLKYLFTQKELNMRQRRWLELVKDYDCEILYHPG

Query:  KANVVADALSRKTSHSAALLSSQPRLRREFERAEVAVLLGEITARVPQLTVQPSIRQRIIDSQLSDPALKRLFDRVSAEGDFDFVFSPDSGLAYRGRLCV
        KANVVADALSRK +HSAAL++ Q  L R+FERAE+AV +GE+T+++ QL+VQP++RQRII +QL+DP L      V      DF  S D GL + GRLCV
Subjt:  KANVVADALSRKTSHSAALLSSQPRLRREFERAEVAVLLGEITARVPQLTVQPSIRQRIIDSQLSDPALKRLFDRVSAEGDFDFVFSPDSGLAYRGRLCV

Query:  PDVAGLRAELLSEAHSSPFAWHPGGTKMYQDLKQTFWWMGMKRNVAEFVSRCLTCQQVKAPRQRPAGLLQPLSVPQWKWEEISMDFISGLPRTPRKYSQI
        P+ + ++ ELL+EAHSSPF  HPG TKMYQDL+  +WW  MKR VA+FVSRCL CQQVKAPRQRPAGLLQPLSVP WKWE +SMDFI+GLP+T + Y+ I
Subjt:  PDVAGLRAELLSEAHSSPFAWHPGGTKMYQDLKQTFWWMGMKRNVAEFVSRCLTCQQVKAPRQRPAGLLQPLSVPQWKWEEISMDFISGLPRTPRKYSQI

Query:  WVIVDRLTKSAHFIPGRDTYTVEQWAQLFLEQIIRYHGVPVRIVSDQDSRFTSRFWRAHQKGLGTKCAFKDDF
        WV+VDRLTKSAHF+ G+ TYT  +W QL++ +I+R HGVPV I+SD+D+RFTS+FW+  Q  LGT+  F   F
Subjt:  WVIVDRLTKSAHFIPGRDTYTVEQWAQLFLEQIIRYHGVPVRIVSDQDSRFTSRFWRAHQKGLGTKCAFKDDF

KAA0047001.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]2.4e-16058.08Show/hide
Query:  MDMMDRVFKSFLDSFVIVFIDDILIYSRSKEDHAEYLRRVLAALRENQLYAKFSKCDFWLEQVAFLGHVVSRDGIAVDPAKIEAVSSLAADFRLRD----
        MD+M RVF+ FLD+F+IVFIDDILIYS+++ +H E+LR VL  LR+N+LYAKFSKC+FWL+QV+FLGHVVS+ G+ VDPAKIEAV+       + +    
Subjt:  MDMMDRVFKSFLDSFVIVFIDDILIYSRSKEDHAEYLRRVLAALRENQLYAKFSKCDFWLEQVAFLGHVVSRDGIAVDPAKIEAVSSLAADFRLRD----

Query:  ---------------------------------------TQLFWFGELLPP---VLGSRVFAAKIWRHYPYGERIQMFTDHQSLKYLFTQKELNMRQRRW
                                                QL    +  P     L + VFA KIWRHY YGE+IQ+FTD++SLKY FTQKELNMRQRRW
Subjt:  ---------------------------------------TQLFWFGELLPP---VLGSRVFAAKIWRHYPYGERIQMFTDHQSLKYLFTQKELNMRQRRW

Query:  LELVKDYDCEILYHPGKANVVADALSRKTSHSAALLSSQPRLRREFERAEVAVLLGEITARVPQLTVQPSIRQRIIDSQLSDPALKRLFDRVSAEGDFDF
        LELVKDYDCEILYHPGKANVVADALSRK SHSAAL++ Q  L R+ ERAE+AV +G +T ++ QLTVQP++RQRIID+Q +DP L        A    +F
Subjt:  LELVKDYDCEILYHPGKANVVADALSRKTSHSAALLSSQPRLRREFERAEVAVLLGEITARVPQLTVQPSIRQRIIDSQLSDPALKRLFDRVSAEGDFDF

Query:  VFSPDSGLAYRGRLCVPDVAGLRAELLSEAHSSPFAWHPGGTKMYQDLKQTFWWMGMKRNVAEFVSRCLTCQQVKAPRQRPAGLLQPLSVPQWKWEEISM
          S D GL +  RLCVP  + ++ ELLSEAHSSPF+ HPG TKMYQDLK+ +WW  MKR VAEFVSRCL CQQVKAPRQ+PAGLLQPLS+P+WKWE +SM
Subjt:  VFSPDSGLAYRGRLCVPDVAGLRAELLSEAHSSPFAWHPGGTKMYQDLKQTFWWMGMKRNVAEFVSRCLTCQQVKAPRQRPAGLLQPLSVPQWKWEEISM

Query:  DFISGLPRTPRKYSQIWVIVDRLTKSAHFIPGRDTYTVEQWAQLFLEQIIRYHGVPVRIVSDQDSRFTSRFWRAHQKGLGTKCAFKDDF
        DFI+GLPRT R ++ IWV+VDRLTKSAHF+PG+  YT  +WAQL++ +I+R HGVPV IVSD+D+RFTS+FW+  Q  +GT+  F   F
Subjt:  DFISGLPRTPRKYSQIWVIVDRLTKSAHFIPGRDTYTVEQWAQLFLEQIIRYHGVPVRIVSDQDSRFTSRFWRAHQKGLGTKCAFKDDF

TYK04315.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]1.4e-16057.99Show/hide
Query:  MDMMDRVFKSFLDSFVIVFIDDILIYSRSKEDHAEYLRRVLAALRENQLYAKFSKCDFWLEQVAFLGHVVSRDGIAVDPAKIEAVSSLAADFRLRDTQLF
        MD+M+RVF+ FLD+FVIVFIDDILIYS+++ +H E+LR VL  LR+N+LYAKFSKC+FWL+QV+FLGHVVS+ G++VDPAKIEAV+S      + + + F
Subjt:  MDMMDRVFKSFLDSFVIVFIDDILIYSRSKEDHAEYLRRVLAALRENQLYAKFSKCDFWLEQVAFLGHVVSRDGIAVDPAKIEAVSSLAADFRLRDTQLF

Query:  WFGELLP-----------------------------------------PV----LGSRVFAAKIWRHYPYGERIQMFTDHQSLKYLFTQKELNMRQRRWL
           E L                                          P     L + VFA KIWRHY YGE+IQ+FTDH+SLKY FTQKELNMRQRRWL
Subjt:  WFGELLP-----------------------------------------PV----LGSRVFAAKIWRHYPYGERIQMFTDHQSLKYLFTQKELNMRQRRWL

Query:  ELVKDYDCEILYHPGKANVVADALSRKTSHSAALLSSQPRLRREFERAEVAVLLGEITARVPQLTVQPSIRQRIIDSQLSDPALKRLFDRVSAEGDFDFV
        ELVKDYDCEILYHPGKANVVADALS+K SHSAAL++ Q  L R+ ERAE+AV +G +T ++ QLTVQP++RQRIID+Q +DP L        A    +F 
Subjt:  ELVKDYDCEILYHPGKANVVADALSRKTSHSAALLSSQPRLRREFERAEVAVLLGEITARVPQLTVQPSIRQRIIDSQLSDPALKRLFDRVSAEGDFDFV

Query:  FSPDSGLAYRGRLCVPDVAGLRAELLSEAHSSPFAWHPGGTKMYQDLKQTFWWMGMKRNVAEFVSRCLTCQQVKAPRQRPAGLLQPLSVPQWKWEEISMD
         S D G  +  RLCVP  + ++ ELLSEAHSSPF+ HPG TK+YQDLK+ +WW  MKR VAEFVS+CL CQQVKAPRQ+PAGLLQPLSVP+WKWE +SMD
Subjt:  FSPDSGLAYRGRLCVPDVAGLRAELLSEAHSSPFAWHPGGTKMYQDLKQTFWWMGMKRNVAEFVSRCLTCQQVKAPRQRPAGLLQPLSVPQWKWEEISMD

Query:  FISGLPRTPRKYSQIWVIVDRLTKSAHFIPGRDTYTVEQWAQLFLEQIIRYHGVPVRIVSDQDSRFTSRFWRAHQKGLGTKCAFKDDF
        FI+GLPRT R ++ IWV+VDRLTK  HF+PG+ T+T  +WAQL++ +I+R HGVPV IVSD+D+RFTS+FW+  Q  +GT+  F   F
Subjt:  FISGLPRTPRKYSQIWVIVDRLTKSAHFIPGRDTYTVEQWAQLFLEQIIRYHGVPVRIVSDQDSRFTSRFWRAHQKGLGTKCAFKDDF

TrEMBL top hitse value%identityAlignment
A0A5A7SW90 Reverse transcriptase3.4e-16056.74Show/hide
Query:  MDMMDRVFKSFLDSFVIVFIDDILIYSRSKEDHAEYLRRVLAALRENQLYAKFSKCDFWLEQVAFLGHVVSRDGIAVDPAKIEAVSSLAADFRLRDTQLF
        MD+M+RVF+ FLD+FVIVFIDDILIYS+++  H E+LR +L  LR+N+LYAKFSKC+FWL+QV+FLGHVVS+ G++VDPAKIEAV+S      + + + F
Subjt:  MDMMDRVFKSFLDSFVIVFIDDILIYSRSKEDHAEYLRRVLAALRENQLYAKFSKCDFWLEQVAFLGHVVSRDGIAVDPAKIEAVSSLAADFRLRDTQLF

Query:  --------WF-------------------------------------GELLPPV-----------------LGSRVFAAKIWRHYPYGERIQMFTDHQSL
                WF                                     G+++                    L + VFA KIWRHY YGE+IQ+FTDH+SL
Subjt:  --------WF-------------------------------------GELLPPV-----------------LGSRVFAAKIWRHYPYGERIQMFTDHQSL

Query:  KYLFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKTSHSAALLSSQPRLRREFERAEVAVLLGEITARVPQLTVQPSIRQRIIDSQLSDPA
        KY FTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRK SHSAAL++ Q  L R+ ERAE+AV +G +T ++ QLTVQP++RQRIID+Q+++P 
Subjt:  KYLFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKTSHSAALLSSQPRLRREFERAEVAVLLGEITARVPQLTVQPSIRQRIIDSQLSDPA

Query:  LKRLFDRVSAEGDFDFVFSPDSGLAYRGRLCVPDVAGLRAELLSEAHSSPFAWHPGGTKMYQDLKQTFWWMGMKRNVAEFVSRCLTCQQVKAPRQRPAGL
        L        A    +F  S D GL +  RLCVP  + ++ ELLSEAHSSPF+ HPG TKMYQDLK+ +WW  MKR VAEFVS+CL CQQVKAPRQ+P GL
Subjt:  LKRLFDRVSAEGDFDFVFSPDSGLAYRGRLCVPDVAGLRAELLSEAHSSPFAWHPGGTKMYQDLKQTFWWMGMKRNVAEFVSRCLTCQQVKAPRQRPAGL

Query:  LQPLSVPQWKWEEISMDFISGLPRTPRKYSQIWVIVDRLTKSAHFIPGRDTYTVEQWAQLFLEQIIRYHGVPVRIVSDQDSRFTSRFWRAHQKGLGT
        LQPLS+P+WKWE +SMDFI+GLPRT R ++ IWV+VDR TKSAHF+PG+ TYT  +WAQL++ +I+R HGVPV IVSD+D+RFTS+FW+  Q  +GT
Subjt:  LQPLSVPQWKWEEISMDFISGLPRTPRKYSQIWVIVDRLTKSAHFIPGRDTYTVEQWAQLFLEQIIRYHGVPVRIVSDQDSRFTSRFWRAHQKGLGT

A0A5A7SZD6 Pol protein7.3e-16359.05Show/hide
Query:  MDMMDRVFKSFLDSFVIVFIDDILIYSRSKEDHAEYLRRVLAALRENQLYAKFSKCDFWLEQVAFLGHVVSRDGIAVDPAKIEAVS--------------
        MD+M+RVF+ FLD+FVIVFIDDILIYS+++ +H E+LR VL  LR+N+LYAKFSKC+FWL+QV+FLGHVVS+ G++VDPAKIEAV+              
Subjt:  MDMMDRVFKSFLDSFVIVFIDDILIYSRSKEDHAEYLRRVLAALRENQLYAKFSKCDFWLEQVAFLGHVVSRDGIAVDPAKIEAVS--------------

Query:  -SLAADFR-------------------------LRDTQLFWFGELLPP---VLGSRVFAAKIWRHYPYGERIQMFTDHQSLKYLFTQKELNMRQRRWLEL
          LA  +R                             QL    +  P     L + VFA KIWRHY YGE+IQ+FTDH+SLKY FTQKELNMRQRRWLEL
Subjt:  -SLAADFR-------------------------LRDTQLFWFGELLPP---VLGSRVFAAKIWRHYPYGERIQMFTDHQSLKYLFTQKELNMRQRRWLEL

Query:  VKDYDCEILYHPGKANVVADALSRKTSHSAALLSSQPRLRREFERAEVAVLLGEITARVPQLTVQPSIRQRIIDSQLSDPALKRLFDRVSAEGDFDFVFS
        VKDYDCEILYHPGKANVV DALSRK SHSAAL++ Q  L R+ ERAE+AV +G +T ++ QLTVQP++RQRIID+Q +DP L        A    +F  S
Subjt:  VKDYDCEILYHPGKANVVADALSRKTSHSAALLSSQPRLRREFERAEVAVLLGEITARVPQLTVQPSIRQRIIDSQLSDPALKRLFDRVSAEGDFDFVFS

Query:  PDSGLAYRGRLCVPDVAGLRAELLSEAHSSPFAWHPGGTKMYQDLKQTFWWMGMKRNVAEFVSRCLTCQQVKAPRQRPAGLLQPLSVPQWKWEEISMDFI
         D GL +  RLCVP  + ++ ELL+EAHSSPF+ HPG TKMYQDLK+ +WW  MKR VAEFVS+CL CQQVKAPRQ+PAGLLQPLS+P+WKWE +SMDFI
Subjt:  PDSGLAYRGRLCVPDVAGLRAELLSEAHSSPFAWHPGGTKMYQDLKQTFWWMGMKRNVAEFVSRCLTCQQVKAPRQRPAGLLQPLSVPQWKWEEISMDFI

Query:  SGLPRTPRKYSQIWVIVDRLTKSAHFIPGRDTYTVEQWAQLFLEQIIRYHGVPVRIVSDQDSRFTSRFWRAHQKGLGTKCAFKDDF
        +GLPRT R ++ IWV+VDRLTKSAHF+PG+ TYT  +WAQL++ +I+R HGVPV IVSD+D+RFTS+FW+  Q  +GT+  F   F
Subjt:  SGLPRTPRKYSQIWVIVDRLTKSAHFIPGRDTYTVEQWAQLFLEQIIRYHGVPVRIVSDQDSRFTSRFWRAHQKGLGTKCAFKDDF

A0A5A7TDX1 Reverse transcriptase1.4e-16160.25Show/hide
Query:  MDMMDRVFKSFLDSFVIVFIDDILIYSRSKEDHAEYLRRVLAALRENQLYAKFSKCDFWLEQVAFLGHVVSRDGIAVDPAKIEAVSS-------------
        MD+M+RVFK FLDSFVIVFI DIL+YS+++ +H E+LR+VL  LR N+LYAKFSKC+FWL++V FLGHVVS +G++VDPAKIEAV++             
Subjt:  MDMMDRVFKSFLDSFVIVFIDDILIYSRSKEDHAEYLRRVLAALRENQLYAKFSKCDFWLEQVAFLGHVVSRDGIAVDPAKIEAVSS-------------

Query:  --LAADFR------------LRDTQLFWFGELLPP---VLGSRVFAAKIWRHYPYGERIQMFTDHQSLKYLFTQKELNMRQRRWLELVKDYDCEILYHPG
          LA  +R                QL    +  P     L + VFA KIWRHY YGE+IQ++TDH+SLKY FTQKELNMR RRWLELVKDYDCEILYHPG
Subjt:  --LAADFR------------LRDTQLFWFGELLPP---VLGSRVFAAKIWRHYPYGERIQMFTDHQSLKYLFTQKELNMRQRRWLELVKDYDCEILYHPG

Query:  KANVVADALSRKTSHSAALLSSQPRLRREFERAEVAVLLGEITARVPQLTVQPSIRQRIIDSQLSDPALKRLFDRVSAEGDFDFVFSPDSGLAYRGRLCV
        KANVVADALSRK +HSAAL++ Q  L R+FERAE+AV +GE+T+++ QL+VQP++RQRII +QL+DP L      V      DF  S D GL + GRLCV
Subjt:  KANVVADALSRKTSHSAALLSSQPRLRREFERAEVAVLLGEITARVPQLTVQPSIRQRIIDSQLSDPALKRLFDRVSAEGDFDFVFSPDSGLAYRGRLCV

Query:  PDVAGLRAELLSEAHSSPFAWHPGGTKMYQDLKQTFWWMGMKRNVAEFVSRCLTCQQVKAPRQRPAGLLQPLSVPQWKWEEISMDFISGLPRTPRKYSQI
        P+ + ++ ELL+EAHSSPF  HPG TKMYQDL+  +WW  MKR VA+FVSRCL CQQVKAPRQRPAGLLQPLSVP WKWE +SMDFI+GLP+T + Y+ I
Subjt:  PDVAGLRAELLSEAHSSPFAWHPGGTKMYQDLKQTFWWMGMKRNVAEFVSRCLTCQQVKAPRQRPAGLLQPLSVPQWKWEEISMDFISGLPRTPRKYSQI

Query:  WVIVDRLTKSAHFIPGRDTYTVEQWAQLFLEQIIRYHGVPVRIVSDQDSRFTSRFWRAHQKGLGTKCAFKDDF
        WV+VDRLTKSAHF+ G+ TYT  +W QL++ +I+R HGVPV I+SD+D+RFTS+FW+  Q  LGT+  F   F
Subjt:  WVIVDRLTKSAHFIPGRDTYTVEQWAQLFLEQIIRYHGVPVRIVSDQDSRFTSRFWRAHQKGLGTKCAFKDDF

A0A5A7U077 Reverse transcriptase1.2e-16058.08Show/hide
Query:  MDMMDRVFKSFLDSFVIVFIDDILIYSRSKEDHAEYLRRVLAALRENQLYAKFSKCDFWLEQVAFLGHVVSRDGIAVDPAKIEAVSSLAADFRLRD----
        MD+M RVF+ FLD+F+IVFIDDILIYS+++ +H E+LR VL  LR+N+LYAKFSKC+FWL+QV+FLGHVVS+ G+ VDPAKIEAV+       + +    
Subjt:  MDMMDRVFKSFLDSFVIVFIDDILIYSRSKEDHAEYLRRVLAALRENQLYAKFSKCDFWLEQVAFLGHVVSRDGIAVDPAKIEAVSSLAADFRLRD----

Query:  ---------------------------------------TQLFWFGELLPP---VLGSRVFAAKIWRHYPYGERIQMFTDHQSLKYLFTQKELNMRQRRW
                                                QL    +  P     L + VFA KIWRHY YGE+IQ+FTD++SLKY FTQKELNMRQRRW
Subjt:  ---------------------------------------TQLFWFGELLPP---VLGSRVFAAKIWRHYPYGERIQMFTDHQSLKYLFTQKELNMRQRRW

Query:  LELVKDYDCEILYHPGKANVVADALSRKTSHSAALLSSQPRLRREFERAEVAVLLGEITARVPQLTVQPSIRQRIIDSQLSDPALKRLFDRVSAEGDFDF
        LELVKDYDCEILYHPGKANVVADALSRK SHSAAL++ Q  L R+ ERAE+AV +G +T ++ QLTVQP++RQRIID+Q +DP L        A    +F
Subjt:  LELVKDYDCEILYHPGKANVVADALSRKTSHSAALLSSQPRLRREFERAEVAVLLGEITARVPQLTVQPSIRQRIIDSQLSDPALKRLFDRVSAEGDFDF

Query:  VFSPDSGLAYRGRLCVPDVAGLRAELLSEAHSSPFAWHPGGTKMYQDLKQTFWWMGMKRNVAEFVSRCLTCQQVKAPRQRPAGLLQPLSVPQWKWEEISM
          S D GL +  RLCVP  + ++ ELLSEAHSSPF+ HPG TKMYQDLK+ +WW  MKR VAEFVSRCL CQQVKAPRQ+PAGLLQPLS+P+WKWE +SM
Subjt:  VFSPDSGLAYRGRLCVPDVAGLRAELLSEAHSSPFAWHPGGTKMYQDLKQTFWWMGMKRNVAEFVSRCLTCQQVKAPRQRPAGLLQPLSVPQWKWEEISM

Query:  DFISGLPRTPRKYSQIWVIVDRLTKSAHFIPGRDTYTVEQWAQLFLEQIIRYHGVPVRIVSDQDSRFTSRFWRAHQKGLGTKCAFKDDF
        DFI+GLPRT R ++ IWV+VDRLTKSAHF+PG+  YT  +WAQL++ +I+R HGVPV IVSD+D+RFTS+FW+  Q  +GT+  F   F
Subjt:  DFISGLPRTPRKYSQIWVIVDRLTKSAHFIPGRDTYTVEQWAQLFLEQIIRYHGVPVRIVSDQDSRFTSRFWRAHQKGLGTKCAFKDDF

A0A5D3BYT1 Reverse transcriptase6.8e-16157.99Show/hide
Query:  MDMMDRVFKSFLDSFVIVFIDDILIYSRSKEDHAEYLRRVLAALRENQLYAKFSKCDFWLEQVAFLGHVVSRDGIAVDPAKIEAVSSLAADFRLRDTQLF
        MD+M+RVF+ FLD+FVIVFIDDILIYS+++ +H E+LR VL  LR+N+LYAKFSKC+FWL+QV+FLGHVVS+ G++VDPAKIEAV+S      + + + F
Subjt:  MDMMDRVFKSFLDSFVIVFIDDILIYSRSKEDHAEYLRRVLAALRENQLYAKFSKCDFWLEQVAFLGHVVSRDGIAVDPAKIEAVSSLAADFRLRDTQLF

Query:  WFGELLP-----------------------------------------PV----LGSRVFAAKIWRHYPYGERIQMFTDHQSLKYLFTQKELNMRQRRWL
           E L                                          P     L + VFA KIWRHY YGE+IQ+FTDH+SLKY FTQKELNMRQRRWL
Subjt:  WFGELLP-----------------------------------------PV----LGSRVFAAKIWRHYPYGERIQMFTDHQSLKYLFTQKELNMRQRRWL

Query:  ELVKDYDCEILYHPGKANVVADALSRKTSHSAALLSSQPRLRREFERAEVAVLLGEITARVPQLTVQPSIRQRIIDSQLSDPALKRLFDRVSAEGDFDFV
        ELVKDYDCEILYHPGKANVVADALS+K SHSAAL++ Q  L R+ ERAE+AV +G +T ++ QLTVQP++RQRIID+Q +DP L        A    +F 
Subjt:  ELVKDYDCEILYHPGKANVVADALSRKTSHSAALLSSQPRLRREFERAEVAVLLGEITARVPQLTVQPSIRQRIIDSQLSDPALKRLFDRVSAEGDFDFV

Query:  FSPDSGLAYRGRLCVPDVAGLRAELLSEAHSSPFAWHPGGTKMYQDLKQTFWWMGMKRNVAEFVSRCLTCQQVKAPRQRPAGLLQPLSVPQWKWEEISMD
         S D G  +  RLCVP  + ++ ELLSEAHSSPF+ HPG TK+YQDLK+ +WW  MKR VAEFVS+CL CQQVKAPRQ+PAGLLQPLSVP+WKWE +SMD
Subjt:  FSPDSGLAYRGRLCVPDVAGLRAELLSEAHSSPFAWHPGGTKMYQDLKQTFWWMGMKRNVAEFVSRCLTCQQVKAPRQRPAGLLQPLSVPQWKWEEISMD

Query:  FISGLPRTPRKYSQIWVIVDRLTKSAHFIPGRDTYTVEQWAQLFLEQIIRYHGVPVRIVSDQDSRFTSRFWRAHQKGLGTKCAFKDDF
        FI+GLPRT R ++ IWV+VDRLTK  HF+PG+ T+T  +WAQL++ +I+R HGVPV IVSD+D+RFTS+FW+  Q  +GT+  F   F
Subjt:  FISGLPRTPRKYSQIWVIVDRLTKSAHFIPGRDTYTVEQWAQLFLEQIIRYHGVPVRIVSDQDSRFTSRFWRAHQKGLGTKCAFKDDF

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein8.3e-3923.65Show/hide
Query:  DSFVIVFIDDILIYSRSKEDHAEYLRRVLAALRENQLYAKFSKCDFWLEQVAFLGHVVSRDGIAVDPAKIEAVSSLAADFRLRDTQLFW-----------
        +S V+ ++DDILI+S+S+ +H ++++ VL  L+   L    +KC+F   QV F+G+ +S  G       I+ V         ++ + F            
Subjt:  DSFVIVFIDDILIYSRSKEDHAEYLRRVLAALRENQLYAKFSKCDFWLEQVAFLGHVVSRDGIAVDPAKIEAVSSLAADFRLRDTQLFW-----------

Query:  --------FGELL-------------------------PPVLGSRVFAAKI-------------------------------------------------
                   LL                         PPVL    F+ KI                                                 
Subjt:  --------FGELL-------------------------PPVLGSRVFAAKI-------------------------------------------------

Query:  --------WRHYPYG--ERIQMFTDHQSLKYLFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKTSHSAALLSSQPRLRREFERAEVAV
                WRHY     E  ++ TDH++L    T +    N R  RW   ++D++ EI Y PG AN +ADALSR       ++     + ++ E   +  
Subjt:  --------WRHYPYG--ERIQMFTDHQSLKYLFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKTSHSAALLSSQPRLRREFERAEVAV

Query:  LLGEITARVPQLTVQPSIRQRIIDSQLSDPALKRLFDRVSAEGDFDFVFSPDSGLAYRGRLCVPDVAGLRAELLSEAHSSPFAWHPGGTKMYQDLKQTFW
                V Q+++    + +++    +D  L  L +      + +        +  + ++ +P+   L   ++ + H      HPG   +   + + F 
Subjt:  LLGEITARVPQLTVQPSIRQRIIDSQLSDPALKRLFDRVSAEGDFDFVFSPDSGLAYRGRLCVPDVAGLRAELLSEAHSSPFAWHPGGTKMYQDLKQTFW

Query:  WMGMKRNVAEFVSRCLTCQQVKAPRQRPAGLLQPLSVPQWKWEEISMDFISGLPRTPRKYSQIWVIVDRLTKSAHFIPGRDTYTVEQWAQLFLEQIIRYH
        W G+++ + E+V  C TCQ  K+   +P G LQP+   +  WE +SMDFI+ LP +   Y+ ++V+VDR +K A  +P   + T EQ A++F +++I Y 
Subjt:  WMGMKRNVAEFVSRCLTCQQVKAPRQRPAGLLQPLSVPQWKWEEISMDFISGLPRTPRKYSQIWVIVDRLTKSAHFIPGRDTYTVEQWAQLFLEQIIRYH

Query:  GVPVRIVSDQDSRFTSRFWR
        G P  I++D D  FTS+ W+
Subjt:  GVPVRIVSDQDSRFTSRFWR

P0CT35 Transposon Tf2-2 polyprotein8.3e-3923.65Show/hide
Query:  DSFVIVFIDDILIYSRSKEDHAEYLRRVLAALRENQLYAKFSKCDFWLEQVAFLGHVVSRDGIAVDPAKIEAVSSLAADFRLRDTQLFW-----------
        +S V+ ++DDILI+S+S+ +H ++++ VL  L+   L    +KC+F   QV F+G+ +S  G       I+ V         ++ + F            
Subjt:  DSFVIVFIDDILIYSRSKEDHAEYLRRVLAALRENQLYAKFSKCDFWLEQVAFLGHVVSRDGIAVDPAKIEAVSSLAADFRLRDTQLFW-----------

Query:  --------FGELL-------------------------PPVLGSRVFAAKI-------------------------------------------------
                   LL                         PPVL    F+ KI                                                 
Subjt:  --------FGELL-------------------------PPVLGSRVFAAKI-------------------------------------------------

Query:  --------WRHYPYG--ERIQMFTDHQSLKYLFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKTSHSAALLSSQPRLRREFERAEVAV
                WRHY     E  ++ TDH++L    T +    N R  RW   ++D++ EI Y PG AN +ADALSR       ++     + ++ E   +  
Subjt:  --------WRHYPYG--ERIQMFTDHQSLKYLFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKTSHSAALLSSQPRLRREFERAEVAV

Query:  LLGEITARVPQLTVQPSIRQRIIDSQLSDPALKRLFDRVSAEGDFDFVFSPDSGLAYRGRLCVPDVAGLRAELLSEAHSSPFAWHPGGTKMYQDLKQTFW
                V Q+++    + +++    +D  L  L +      + +        +  + ++ +P+   L   ++ + H      HPG   +   + + F 
Subjt:  LLGEITARVPQLTVQPSIRQRIIDSQLSDPALKRLFDRVSAEGDFDFVFSPDSGLAYRGRLCVPDVAGLRAELLSEAHSSPFAWHPGGTKMYQDLKQTFW

Query:  WMGMKRNVAEFVSRCLTCQQVKAPRQRPAGLLQPLSVPQWKWEEISMDFISGLPRTPRKYSQIWVIVDRLTKSAHFIPGRDTYTVEQWAQLFLEQIIRYH
        W G+++ + E+V  C TCQ  K+   +P G LQP+   +  WE +SMDFI+ LP +   Y+ ++V+VDR +K A  +P   + T EQ A++F +++I Y 
Subjt:  WMGMKRNVAEFVSRCLTCQQVKAPRQRPAGLLQPLSVPQWKWEEISMDFISGLPRTPRKYSQIWVIVDRLTKSAHFIPGRDTYTVEQWAQLFLEQIIRYH

Query:  GVPVRIVSDQDSRFTSRFWR
        G P  I++D D  FTS+ W+
Subjt:  GVPVRIVSDQDSRFTSRFWR

P0CT36 Transposon Tf2-3 polyprotein8.3e-3923.65Show/hide
Query:  DSFVIVFIDDILIYSRSKEDHAEYLRRVLAALRENQLYAKFSKCDFWLEQVAFLGHVVSRDGIAVDPAKIEAVSSLAADFRLRDTQLFW-----------
        +S V+ ++DDILI+S+S+ +H ++++ VL  L+   L    +KC+F   QV F+G+ +S  G       I+ V         ++ + F            
Subjt:  DSFVIVFIDDILIYSRSKEDHAEYLRRVLAALRENQLYAKFSKCDFWLEQVAFLGHVVSRDGIAVDPAKIEAVSSLAADFRLRDTQLFW-----------

Query:  --------FGELL-------------------------PPVLGSRVFAAKI-------------------------------------------------
                   LL                         PPVL    F+ KI                                                 
Subjt:  --------FGELL-------------------------PPVLGSRVFAAKI-------------------------------------------------

Query:  --------WRHYPYG--ERIQMFTDHQSLKYLFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKTSHSAALLSSQPRLRREFERAEVAV
                WRHY     E  ++ TDH++L    T +    N R  RW   ++D++ EI Y PG AN +ADALSR       ++     + ++ E   +  
Subjt:  --------WRHYPYG--ERIQMFTDHQSLKYLFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKTSHSAALLSSQPRLRREFERAEVAV

Query:  LLGEITARVPQLTVQPSIRQRIIDSQLSDPALKRLFDRVSAEGDFDFVFSPDSGLAYRGRLCVPDVAGLRAELLSEAHSSPFAWHPGGTKMYQDLKQTFW
                V Q+++    + +++    +D  L  L +      + +        +  + ++ +P+   L   ++ + H      HPG   +   + + F 
Subjt:  LLGEITARVPQLTVQPSIRQRIIDSQLSDPALKRLFDRVSAEGDFDFVFSPDSGLAYRGRLCVPDVAGLRAELLSEAHSSPFAWHPGGTKMYQDLKQTFW

Query:  WMGMKRNVAEFVSRCLTCQQVKAPRQRPAGLLQPLSVPQWKWEEISMDFISGLPRTPRKYSQIWVIVDRLTKSAHFIPGRDTYTVEQWAQLFLEQIIRYH
        W G+++ + E+V  C TCQ  K+   +P G LQP+   +  WE +SMDFI+ LP +   Y+ ++V+VDR +K A  +P   + T EQ A++F +++I Y 
Subjt:  WMGMKRNVAEFVSRCLTCQQVKAPRQRPAGLLQPLSVPQWKWEEISMDFISGLPRTPRKYSQIWVIVDRLTKSAHFIPGRDTYTVEQWAQLFLEQIIRYH

Query:  GVPVRIVSDQDSRFTSRFWR
        G P  I++D D  FTS+ W+
Subjt:  GVPVRIVSDQDSRFTSRFWR

P0CT37 Transposon Tf2-4 polyprotein8.3e-3923.65Show/hide
Query:  DSFVIVFIDDILIYSRSKEDHAEYLRRVLAALRENQLYAKFSKCDFWLEQVAFLGHVVSRDGIAVDPAKIEAVSSLAADFRLRDTQLFW-----------
        +S V+ ++DDILI+S+S+ +H ++++ VL  L+   L    +KC+F   QV F+G+ +S  G       I+ V         ++ + F            
Subjt:  DSFVIVFIDDILIYSRSKEDHAEYLRRVLAALRENQLYAKFSKCDFWLEQVAFLGHVVSRDGIAVDPAKIEAVSSLAADFRLRDTQLFW-----------

Query:  --------FGELL-------------------------PPVLGSRVFAAKI-------------------------------------------------
                   LL                         PPVL    F+ KI                                                 
Subjt:  --------FGELL-------------------------PPVLGSRVFAAKI-------------------------------------------------

Query:  --------WRHYPYG--ERIQMFTDHQSLKYLFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKTSHSAALLSSQPRLRREFERAEVAV
                WRHY     E  ++ TDH++L    T +    N R  RW   ++D++ EI Y PG AN +ADALSR       ++     + ++ E   +  
Subjt:  --------WRHYPYG--ERIQMFTDHQSLKYLFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKTSHSAALLSSQPRLRREFERAEVAV

Query:  LLGEITARVPQLTVQPSIRQRIIDSQLSDPALKRLFDRVSAEGDFDFVFSPDSGLAYRGRLCVPDVAGLRAELLSEAHSSPFAWHPGGTKMYQDLKQTFW
                V Q+++    + +++    +D  L  L +      + +        +  + ++ +P+   L   ++ + H      HPG   +   + + F 
Subjt:  LLGEITARVPQLTVQPSIRQRIIDSQLSDPALKRLFDRVSAEGDFDFVFSPDSGLAYRGRLCVPDVAGLRAELLSEAHSSPFAWHPGGTKMYQDLKQTFW

Query:  WMGMKRNVAEFVSRCLTCQQVKAPRQRPAGLLQPLSVPQWKWEEISMDFISGLPRTPRKYSQIWVIVDRLTKSAHFIPGRDTYTVEQWAQLFLEQIIRYH
        W G+++ + E+V  C TCQ  K+   +P G LQP+   +  WE +SMDFI+ LP +   Y+ ++V+VDR +K A  +P   + T EQ A++F +++I Y 
Subjt:  WMGMKRNVAEFVSRCLTCQQVKAPRQRPAGLLQPLSVPQWKWEEISMDFISGLPRTPRKYSQIWVIVDRLTKSAHFIPGRDTYTVEQWAQLFLEQIIRYH

Query:  GVPVRIVSDQDSRFTSRFWR
        G P  I++D D  FTS+ W+
Subjt:  GVPVRIVSDQDSRFTSRFWR

P0CT41 Transposon Tf2-12 polyprotein8.3e-3923.65Show/hide
Query:  DSFVIVFIDDILIYSRSKEDHAEYLRRVLAALRENQLYAKFSKCDFWLEQVAFLGHVVSRDGIAVDPAKIEAVSSLAADFRLRDTQLFW-----------
        +S V+ ++DDILI+S+S+ +H ++++ VL  L+   L    +KC+F   QV F+G+ +S  G       I+ V         ++ + F            
Subjt:  DSFVIVFIDDILIYSRSKEDHAEYLRRVLAALRENQLYAKFSKCDFWLEQVAFLGHVVSRDGIAVDPAKIEAVSSLAADFRLRDTQLFW-----------

Query:  --------FGELL-------------------------PPVLGSRVFAAKI-------------------------------------------------
                   LL                         PPVL    F+ KI                                                 
Subjt:  --------FGELL-------------------------PPVLGSRVFAAKI-------------------------------------------------

Query:  --------WRHYPYG--ERIQMFTDHQSLKYLFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKTSHSAALLSSQPRLRREFERAEVAV
                WRHY     E  ++ TDH++L    T +    N R  RW   ++D++ EI Y PG AN +ADALSR       ++     + ++ E   +  
Subjt:  --------WRHYPYG--ERIQMFTDHQSLKYLFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKTSHSAALLSSQPRLRREFERAEVAV

Query:  LLGEITARVPQLTVQPSIRQRIIDSQLSDPALKRLFDRVSAEGDFDFVFSPDSGLAYRGRLCVPDVAGLRAELLSEAHSSPFAWHPGGTKMYQDLKQTFW
                V Q+++    + +++    +D  L  L +      + +        +  + ++ +P+   L   ++ + H      HPG   +   + + F 
Subjt:  LLGEITARVPQLTVQPSIRQRIIDSQLSDPALKRLFDRVSAEGDFDFVFSPDSGLAYRGRLCVPDVAGLRAELLSEAHSSPFAWHPGGTKMYQDLKQTFW

Query:  WMGMKRNVAEFVSRCLTCQQVKAPRQRPAGLLQPLSVPQWKWEEISMDFISGLPRTPRKYSQIWVIVDRLTKSAHFIPGRDTYTVEQWAQLFLEQIIRYH
        W G+++ + E+V  C TCQ  K+   +P G LQP+   +  WE +SMDFI+ LP +   Y+ ++V+VDR +K A  +P   + T EQ A++F +++I Y 
Subjt:  WMGMKRNVAEFVSRCLTCQQVKAPRQRPAGLLQPLSVPQWKWEEISMDFISGLPRTPRKYSQIWVIVDRLTKSAHFIPGRDTYTVEQWAQLFLEQIIRYH

Query:  GVPVRIVSDQDSRFTSRFWR
        G P  I++D D  FTS+ W+
Subjt:  GVPVRIVSDQDSRFTSRFWR

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACATGATGGATAGGGTGTTCAAATCATTTTTGGACAGCTTCGTCATCGTGTTCATTGACGATATTCTGATTTATTCGAGGTCTAAGGAGGACCATGCAGAGTATTT
GAGGCGAGTATTGGCAGCTTTAAGGGAAAACCAGCTTTATGCGAAGTTTTCCAAATGTGATTTCTGGTTGGAGCAGGTAGCCTTTCTAGGTCACGTAGTTTCGAGAGATG
GTATAGCCGTGGACCCCGCAAAGATTGAGGCCGTGAGTTCGTTGGCTGCCGACTTCAGGCTGCGAGATACGCAGCTTTTCTGGTTTGGCGAGTTACTACCACCGGTTCTT
GGCAGCAGAGTGTTCGCGGCCAAGATTTGGAGGCATTACCCGTATGGTGAGCGGATTCAGATGTTTACCGATCACCAGAGCCTCAAGTATCTGTTTACTCAGAAGGAGTT
GAATATGAGGCAGAGGAGATGGTTAGAGTTGGTAAAGGATTATGACTGTGAGATTTTGTACCATCCGGGTAAAGCCAATGTAGTGGCTGATGCTTTGAGTAGGAAGACGT
CACACTCCGCGGCGTTGTTGAGTTCCCAGCCCAGATTGCGGAGAGAGTTCGAGCGGGCCGAAGTTGCAGTGTTGTTGGGTGAGATTACCGCCAGAGTACCTCAATTGACA
GTTCAGCCGTCGATTAGGCAGCGGATTATTGATAGCCAGCTGAGTGACCCCGCACTTAAGAGATTGTTTGACAGAGTTAGTGCTGAAGGAGATTTTGATTTCGTATTCTC
GCCAGACAGTGGGTTGGCTTATAGAGGTCGGTTGTGTGTTCCGGATGTTGCAGGATTGCGAGCAGAGTTGCTCTCAGAGGCCCACAGTTCTCCGTTTGCCTGGCACCCCG
GGGGTACTAAAATGTATCAGGACCTTAAGCAGACATTCTGGTGGATGGGTATGAAAAGAAATGTTGCAGAGTTTGTTAGTCGGTGCCTCACTTGTCAGCAAGTGAAGGCG
CCCAGGCAGAGGCCAGCCGGTTTGTTGCAGCCACTGAGTGTGCCTCAGTGGAAATGGGAGGAGATCTCCATGGACTTTATTTCTGGGTTGCCGAGGACTCCGAGGAAGTA
CTCTCAGATCTGGGTGATAGTCGACCGTTTGACTAAGAGCGCACACTTCATTCCCGGTAGAGACACTTATACCGTGGAACAGTGGGCGCAGTTGTTTTTGGAGCAGATCA
TTCGTTATCATGGTGTACCTGTGAGGATTGTATCTGATCAGGACTCGCGGTTCACTTCTCGTTTCTGGAGAGCACACCAGAAGGGTTTAGGCACAAAGTGCGCTTTCAAA
GATGACTTTTTCACCCTCGCATGGATGGACAGATCGAGAGGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGACATGATGGATAGGGTGTTCAAATCATTTTTGGACAGCTTCGTCATCGTGTTCATTGACGATATTCTGATTTATTCGAGGTCTAAGGAGGACCATGCAGAGTATTT
GAGGCGAGTATTGGCAGCTTTAAGGGAAAACCAGCTTTATGCGAAGTTTTCCAAATGTGATTTCTGGTTGGAGCAGGTAGCCTTTCTAGGTCACGTAGTTTCGAGAGATG
GTATAGCCGTGGACCCCGCAAAGATTGAGGCCGTGAGTTCGTTGGCTGCCGACTTCAGGCTGCGAGATACGCAGCTTTTCTGGTTTGGCGAGTTACTACCACCGGTTCTT
GGCAGCAGAGTGTTCGCGGCCAAGATTTGGAGGCATTACCCGTATGGTGAGCGGATTCAGATGTTTACCGATCACCAGAGCCTCAAGTATCTGTTTACTCAGAAGGAGTT
GAATATGAGGCAGAGGAGATGGTTAGAGTTGGTAAAGGATTATGACTGTGAGATTTTGTACCATCCGGGTAAAGCCAATGTAGTGGCTGATGCTTTGAGTAGGAAGACGT
CACACTCCGCGGCGTTGTTGAGTTCCCAGCCCAGATTGCGGAGAGAGTTCGAGCGGGCCGAAGTTGCAGTGTTGTTGGGTGAGATTACCGCCAGAGTACCTCAATTGACA
GTTCAGCCGTCGATTAGGCAGCGGATTATTGATAGCCAGCTGAGTGACCCCGCACTTAAGAGATTGTTTGACAGAGTTAGTGCTGAAGGAGATTTTGATTTCGTATTCTC
GCCAGACAGTGGGTTGGCTTATAGAGGTCGGTTGTGTGTTCCGGATGTTGCAGGATTGCGAGCAGAGTTGCTCTCAGAGGCCCACAGTTCTCCGTTTGCCTGGCACCCCG
GGGGTACTAAAATGTATCAGGACCTTAAGCAGACATTCTGGTGGATGGGTATGAAAAGAAATGTTGCAGAGTTTGTTAGTCGGTGCCTCACTTGTCAGCAAGTGAAGGCG
CCCAGGCAGAGGCCAGCCGGTTTGTTGCAGCCACTGAGTGTGCCTCAGTGGAAATGGGAGGAGATCTCCATGGACTTTATTTCTGGGTTGCCGAGGACTCCGAGGAAGTA
CTCTCAGATCTGGGTGATAGTCGACCGTTTGACTAAGAGCGCACACTTCATTCCCGGTAGAGACACTTATACCGTGGAACAGTGGGCGCAGTTGTTTTTGGAGCAGATCA
TTCGTTATCATGGTGTACCTGTGAGGATTGTATCTGATCAGGACTCGCGGTTCACTTCTCGTTTCTGGAGAGCACACCAGAAGGGTTTAGGCACAAAGTGCGCTTTCAAA
GATGACTTTTTCACCCTCGCATGGATGGACAGATCGAGAGGGTGA
Protein sequenceShow/hide protein sequence
MDMMDRVFKSFLDSFVIVFIDDILIYSRSKEDHAEYLRRVLAALRENQLYAKFSKCDFWLEQVAFLGHVVSRDGIAVDPAKIEAVSSLAADFRLRDTQLFWFGELLPPVL
GSRVFAAKIWRHYPYGERIQMFTDHQSLKYLFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKTSHSAALLSSQPRLRREFERAEVAVLLGEITARVPQLT
VQPSIRQRIIDSQLSDPALKRLFDRVSAEGDFDFVFSPDSGLAYRGRLCVPDVAGLRAELLSEAHSSPFAWHPGGTKMYQDLKQTFWWMGMKRNVAEFVSRCLTCQQVKA
PRQRPAGLLQPLSVPQWKWEEISMDFISGLPRTPRKYSQIWVIVDRLTKSAHFIPGRDTYTVEQWAQLFLEQIIRYHGVPVRIVSDQDSRFTSRFWRAHQKGLGTKCAFK
DDFFTLAWMDRSRG