; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0000135 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0000135
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr4:325894..340427
RNA-Seq ExpressionLag0000135
SyntenyLag0000135
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR001878 - Zinc finger, CCHC-type
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022155286.1 uncharacterized protein LOC111022423 [Momordica charantia]5.3e-12735.76Show/hide
Query:  IEEKKWSWRFTGIYENPVRGLHHETWSLMKRLSDQSEMPWVLGGGFNEITNHFEKSGGSDRLDSTIKGFREAMDSCGVLDVGFCGPNYTWCNKHFDHDLI
        ++E +++WRFTGIY + V+    ETW L+ RL    ++PW+LGG FNEI  + EK  G  R  S ++ F++ +D CG+LD GF G  +TWC+ H     I
Subjt:  IEEKKWSWRFTGIYENPVRGLHHETWSLMKRLSDQSEMPWVLGGGFNEITNHFEKSGGSDRLDSTIKGFREAMDSCGVLDVGFCGPNYTWCNKHFDHDLI

Query:  WERLDRFLINQDMQSRCGSFKVFHLALLASDHRPLLAEW---KEEPPDQMQSVLNHLRRFEEIWTKYDECKDIVKQVWQEHGRRSFRNLTEKTKECLTRL
        WERLDRFLIN  +     + ++ HL  LASDHRP+LAEW    E    + +    H  RFEE W  + ECK+IV++VW   G         K   CL  L
Subjt:  WERLDRFLINQDMQSRCGSFKVFHLALLASDHRPLLAEW---KEEPPDQMQSVLNHLRRFEEIWTKYDECKDIVKQVWQEHGRRSFRNLTEKTKECLTRL

Query:  GRWSRSHYDRSIKGAIARKEKENQNSLSSGVLCNDEEMSKLEKELENLLEDDEIYWKQRSHEDWLQWRDHNTKWFHMRASTGRKTNRVRGLFDGIGDWFD
         +W+      S++GAI RKE E Q  +  G     + + + +++LE LLE++E YW+Q                                          
Subjt:  GRWSRSHYDRSIKGAIARKEKENQNSLSSGVLCNDEEMSKLEKELENLLEDDEIYWKQRSHEDWLQWRDHNTKWFHMRASTGRKTNRVRGLFDGIGDWFD

Query:  EDAEMARVANLYFQDLFQSSSPHNEAIERILEATPVCISEDQNHMLTATFTREEIFCVIRSMHPTRAPGSDGIQAIFNQKYWDIVEKNLDQINNTYIALI
                            SP N+ +  +L++    ++E+  + L   F  EE+  V+  M P++AP                   ++ Q+N T+I LI
Subjt:  EDAEMARVANLYFQDLFQSSSPHNEAIERILEATPVCISEDQNHMLTATFTREEIFCVIRSMHPTRAPGSDGIQAIFNQKYWDIVEKNLDQINNTYIALI

Query:  PKVNDPKSMKDFRPISL----RSMLYKVIAKTLANRLKRVLNNIIFPSQSAFIPRRLITDNAILGFECIHAVKNKRQGKDGVIALKLDMSIAYDRVEWVY
         K  D K MKDF  ISL     +++YK+I+K LANRLK+VLN +I P+QSAF+P RLITDNAI+GFECIH++  + +GK G +ALKLDM  AYDRVEW Y
Subjt:  PKVNDPKSMKDFRPISL----RSMLYKVIAKTLANRLKRVLNNIIFPSQSAFIPRRLITDNAILGFECIHAVKNKRQGKDGVIALKLDMSIAYDRVEWVY

Query:  IR-------------------------GIMEHMGFSSRWIDLIMRCVESVSFQV--LLNGLPSSFEWDSKRFWWGSVSSENKIHWKSWQKLCTHKAHGEM
        +R                         G +  MG     I  + + + + +     L   + +       RFWWGS S + KIHW+SW+ LC  K  G M
Subjt:  IR-------------------------GIMEHMGFSSRWIDLIMRCVESVSFQV--LLNGLPSSFEWDSKRFWWGSVSSENKIHWKSWQKLCTHKAHGEM

Query:  DFRDLSVFNQALLAKQSWLPIRYPHSLVARVLRGRYYKTGSFLKAGFGHNPSYIWRSIIWGRELFRK---------------------------------
         FRD+S+FNQA+LAKQSW  +R+P SL+A+ LRG+Y+KTGSFL+A  G  PSY WRSI+WGR+LF+K                                 
Subjt:  DFRDLSVFNQALLAKQSWLPIRYPHSLVARVLRGRYYKTGSFLKAGFGHNPSYIWRSIIWGRELFRK---------------------------------

Query:  ----------------------RDIFLEEDAIVILHIPIKTPHRMDEIIWNGDSKGVFTVKSAYRLGMQ
                              R+ F+  +A +IL  P+ +  + DEIIW  D  G+F+V+SAY LG+Q
Subjt:  ----------------------RDIFLEEDAIVILHIPIKTPHRMDEIIWNGDSKGVFTVKSAYRLGMQ

XP_023878301.1 uncharacterized protein LOC111990748 [Quercus suber]7.9e-12330.79Show/hide
Query:  HIDATIEEKK--WSWRFTGIYENPVRGLHHETWSLMKRLSDQSEMPWVLGGGFNEITNHFEKSGGSDRLDSTIKGFREAMDSCGVLDVGFCGPNYTWCNK
        HIDA I E    + WR TG Y +P     +++W L+  L+ Q ++PW+  G FNEI +  EK GG++R    + GFR+ ++ CG  D+G+CGP+YTW N 
Subjt:  HIDATIEEKK--WSWRFTGIYENPVRGLHHETWSLMKRLSDQSEMPWVLGGGFNEITNHFEKSGGSDRLDSTIKGFREAMDSCGVLDVGFCGPNYTWCNK

Query:  HFDHDLIWERLDRFLINQDMQSRCGSFKVFHLALLASDHRPLLAEWKEEPPDQMQSVLNHLRR-----FEEIWTKYDECKDIVKQVWQEHGRRSF-RNLT
            + I  RLDR L   D  ++    KV HL     DH  LL            + + H  R     FE  WTK ++CK I++  W      S    ++
Subjt:  HFDHDLIWERLDRFLINQDMQSRCGSFKVFHLALLASDHRPLLAEWKEEPPDQMQSVLNHLRR-----FEEIWTKYDECKDIVKQVWQEHGRRSF-RNLT

Query:  EKTKECLTRLGRWSRSHYDRSIKGAIARKEKENQNSLSSGVLCN-DEEMS----KLEKELENLLEDDEIYWKQRSHEDWLQWRDHNTKWFHMRASTGRKT
        E  + C   L +WS + Y     G I +K ++ ++ L++  +   DE++S    +L +E+  LL+D+E YW QR+   WL+  D NTK+FH +AS  RK 
Subjt:  EKTKECLTRLGRWSRSHYDRSIKGAIARKEKENQNSLSSGVLCN-DEEMS----KLEKELENLLEDDEIYWKQRSHEDWLQWRDHNTKWFHMRASTGRKT

Query:  NRVRGLFDGIGDWFDEDAEMARVANLYFQDLFQSSSPHNEAIERILEATPVCISEDQNHMLTATFTREEIFCVIRSMHPTRAPGSDGIQAIFNQKYWDIV
        N + G++D  G W D +  +A+ A  YF +++ SS  H   IE + EA P  ++E+ N  L   FT+EE+   ++ +HP +APG DG+ A+F QKYW IV
Subjt:  NRVRGLFDGIGDWFDEDAEMARVANLYFQDLFQSSSPHNEAIERILEATPVCISEDQNHMLTATFTREEIFCVIRSMHPTRAPGSDGIQAIFNQKYWDIV

Query:  EKN--------------LDQINNTYIALIPKVNDPKSMKDFRPISLRSMLYKVIAKTLANRLKRVLNNIIFPSQSAFIPRRLITDNAILGFECIHAVKNK
          N              + ++N T I+LIPK N+PK M DFRPISL +++YK+I+K LANRLK +L +II  +QSAF   RLITDN ++ FE +H + +K
Subjt:  EKN--------------LDQINNTYIALIPKVNDPKSMKDFRPISLRSMLYKVIAKTLANRLKRVLNNIIFPSQSAFIPRRLITDNAILGFECIHAVKNK

Query:  RQGKDGVIALKLDMSIAYDRVEWVYIRGIMEHMGFSSRWIDLIMRCVESVSFQVLLNGL------------------PSSF-------------------
          GK+G +A+KLDMS A+DRVEW +I  +ME MGF +RW DL+M+C+ SVS+ +L+NG+                  PS F                   
Subjt:  RQGKDGVIALKLDMSIAYDRVEWVYIRGIMEHMGFSSRWIDLIMRCVESVSFQVLLNGL------------------PSSF-------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------------------------------------EWDSKRFWWGSVSSENKIHWKSWQKLCTHKAHGEMDFRDLSVFNQ
                                                               E   K FWWG  + E K+ W SW+++C  KA G + FR+L  FN 
Subjt:  -------------------------------------------------------EWDSKRFWWGSVSSENKIHWKSWQKLCTHKAHGEMDFRDLSVFNQ

Query:  ALLAKQSWLPIRYPHSLVARVLRGRYYKTGSFLKAGFGHNPSYIWRSIIWGRELFRK-------------------------------------------
        A+LAKQ+W  +  P+SLV RVL+ RY+ TG  L A  G +PSY WRSI    E+ R+                                           
Subjt:  ALLAKQSWLPIRYPHSLVARVLRGRYYKTGSFLKAGFGHNPSYIWRSIIWGRELFRK-------------------------------------------

Query:  --------------RDIFLEEDAIVILHIPIKTPHRMDEIIWNGDSKGVFTVKSAYRL
                      R IFL  +   IL IP+      D++IW G+ KG F+VKSAY +
Subjt:  --------------RDIFLEEDAIVILHIPIKTPHRMDEIIWNGDSKGVFTVKSAYRL

XP_030495126.1 uncharacterized protein LOC115710915 [Cannabis sativa]3.3e-12134.86Show/hide
Query:  HIDATI-EEKKWSWRFTGIYENPVRGLHHETWSLMKRLSDQSEMPWVLGGGFNEITNHFEKSGGSDRLDSTIKGFREAMDSCGVLDVGFCGPNYTWCNKH
        HIDA I +E+  +WRFTG Y +P       +W L+KR++     PW+ GG FNEI    EK GG  +    +K F  A+D+C + ++ + G  +TWCN  
Subjt:  HIDATI-EEKKWSWRFTGIYENPVRGLHHETWSLMKRLSDQSEMPWVLGGGFNEITNHFEKSGGSDRLDSTIKGFREAMDSCGVLDVGFCGPNYTWCNKH

Query:  FDHDLIWERLDRFLINQDMQSRCGSFKVFHLALLASDHRPLLAEWKEEP-PDQMQSVLNHLRRFEEIWTKYDECKDIVKQVWQE-HGRRSFRNLTEKTKE
           ++I+ERLDR ++N +        KV HL+   SDH PLL  +   P  DQ +    +   +E+ W   +EC+ I++  W+E +   S  +L E    
Subjt:  FDHDLIWERLDRFLINQDMQSRCGSFKVFHLALLASDHRPLLAEWKEEP-PDQMQSVLNHLRRFEEIWTKYDECKDIVKQVWQE-HGRRSFRNLTEKTKE

Query:  CLTRLGRWSRSHYDRS---IKGAIARKEKENQNSLSSGVLCNDEEMSKLEKELENLLEDDEIYWKQRSHEDWLQWRDHNTKWFHMRASTGRKTNRVRGLF
        C T L +W++     +   IK      EK + +  + G      ++  LEK+L   L  +E++WKQRS   WL   D NT++FH +A++ RK NR+ GLF
Subjt:  CLTRLGRWSRSHYDRS---IKGAIARKEKENQNSLSSGVLCNDEEMSKLEKELENLLEDDEIYWKQRSHEDWLQWRDHNTKWFHMRASTGRKTNRVRGLF

Query:  DGIGDWFDEDAEMARVANLYFQDLFQSSSPHNEAIERILEATPVCISEDQNHMLTATFTREEIFCVIRSMHPTRAPGSDGIQAIFNQKYWDIV-------
        D    W     ++      +FQDLF ++   +   E +    P+  S  QN  L   FT  +I  V+  ++  +AP  DG+  +F + +W+++       
Subjt:  DGIGDWFDEDAEMARVANLYFQDLFQSSSPHNEAIERILEATPVCISEDQNHMLTATFTREEIFCVIRSMHPTRAPGSDGIQAIFNQKYWDIV-------

Query:  -------EKNLDQINNTYIALIPKVNDPKSMKDFRPISLRSMLYKVIAKTLANRLKRVLNNIIFPSQSAFIPRRLITDNAILGFECIHAVKNKRQGKDGV
                K+  Q+N T + LIPK+  PK + D+RPISL ++ YK+IAK LANR+K  L  +I  +QSA I  RLI DNAILGFE +H +K  R G    
Subjt:  -------EKNLDQINNTYIALIPKVNDPKSMKDFRPISLRSMLYKVIAKTLANRLKRVLNNIIFPSQSAFIPRRLITDNAILGFECIHAVKNKRQGKDGV

Query:  IALKLDMSIAYDRVEWVYIRGIMEHMGFSSRWIDLIMRCVESVSFQVLLN---------------------------GLPSS--------FE--------
        +ALKLDMS AYDRVEW  +  +M  +G+  RW+D IM C++S+SF +LLN                           G+P+S        FE        
Subjt:  IALKLDMSIAYDRVEWVYIRGIMEHMGFSSRWIDLIMRCVESVSFQVLLN---------------------------GLPSS--------FE--------

Query:  ----WDSK-----------------------------------------RFWWGSVSSENKIHWKSWQKLCTHKAHGEMDFRDLSVFNQALLAKQSWLPI
            W +                                          RFWWGS  ++ KIHW +W KLC  K  G M F++L +FNQ+LLAKQ W  I
Subjt:  ----WDSK-----------------------------------------RFWWGSVSSENKIHWKSWQKLCTHKAHGEMDFRDLSVFNQALLAKQSWLPI

Query:  RYPHSLVARVLRGRYYKTGSFLKA---GFGHNPSYIWRSIIWGRELFRK---------RDIFLEED
          PHS++ARVL+  YY   +FL+A   GFG   SY+WRSI+WGR++  K         RDI + ED
Subjt:  RYPHSLVARVLRGRYYKTGSFLKA---GFGHNPSYIWRSIIWGRELFRK---------RDIFLEED

XP_035546285.1 uncharacterized protein LOC108996706 [Juglans regia]6.9e-11931.19Show/hide
Query:  RHIDATI--EEKKWSWRFTGIYENPVRGLHHETWSLMKRLSDQSEMPWVLGGGFNEITNHFEKSGGSDRLDSTIKGFREAMDSCGVLDVGFCGPNYTWCN
        RHI+A I  E+    W  TG Y  P   L  E W L+  L    E+ W + G FNEI  H EK GG+ RL+  +  FREA++  G+ D+G+ G  YTW N
Subjt:  RHIDATI--EEKKWSWRFTGIYENPVRGLHHETWSLMKRLSDQSEMPWVLGGGFNEITNHFEKSGGSDRLDSTIKGFREAMDSCGVLDVGFCGPNYTWCN

Query:  KHFDHDLIWERLDRFLINQDMQSRCGSFKVFHLALLASDHRPLLAEWKEEPPDQMQSVLNHLRRFEEIWTKYDECKDIVKQVWQEHGRRSFRNLTEKTKE
        KH D     ERLDR + N   +       V  LA   SDHRP+L    +    + +     L R+E  W+  ++C++++K+VWQ    R+     EK  +
Subjt:  KHFDHDLIWERLDRFLINQDMQSRCGSFKVFHLALLASDHRPLLAEWKEEPPDQMQSVLNHLRRFEEIWTKYDECKDIVKQVWQEHGRRSFRNLTEKTKE

Query:  CLTRLGRWSRSHYDRSI---KGAIARKEKENQNSLSSGV-LCNDEEMSKLEKELENLLEDDEIYWKQRSHEDWLQWRDHNTKWFHMRASTGRKTNRVRGL
         L    R +   + R I   KG + +++ E    L       N  E+ K+++E+   LE +++ W+QR+  DW    D NTK+FH  A+  RK N +  +
Subjt:  CLTRLGRWSRSHYDRSI---KGAIARKEKENQNSLSSGV-LCNDEEMSKLEKELENLLEDDEIYWKQRSHEDWLQWRDHNTKWFHMRASTGRKTNRVRGL

Query:  FDGIGDWFDEDAEMARVANLYFQDLFQSSSPHNEAIERILEATPVCISEDQNHMLTATFTREEIFCVIRSMHPTRAPGSDGIQAIFNQKYWDIVEKNL--
        +D       +  ++  V N YFQ+LF S++P    IE  L     C+++D N  LT TFTR E+   ++ M P ++PG DG    F QK+W +V   +  
Subjt:  FDGIGDWFDEDAEMARVANLYFQDLFQSSSPHNEAIERILEATPVCISEDQNHMLTATFTREEIFCVIRSMHPTRAPGSDGIQAIFNQKYWDIVEKNL--

Query:  ------------DQINNTYIALIPKVNDPKSMKDFRPISLRSMLYKVIAKTLANRLKRVLNNIIFPSQSAFIPRRLITDNAILGFECIHAVKNKRQGKDG
                      +N TY+ALIPKV +P+   DFRPISL ++ YK+++K LANRLK+ LN +I P+QSAFIP +LITDN ++ +E +H++K +++GK G
Subjt:  ------------DQINNTYIALIPKVNDPKSMKDFRPISLRSMLYKVIAKTLANRLKRVLNNIIFPSQSAFIPRRLITDNAILGFECIHAVKNKRQGKDG

Query:  VIALKLDMSIAYDRVEWVYIRGIMEHMGFSSRWIDLIMRCVESVSFQVLLNGLPSS--------------------------------------------
         +A+KLDMS AYDR+EW Y+  +M  +GF  +WI+LIM+CV +VS+ VL+NG P S                                            
Subjt:  VIALKLDMSIAYDRVEWVYIRGIMEHMGFSSRWIDLIMRCVESVSFQVLLNGLPSS--------------------------------------------

Query:  ------------------------FEW---------------------------------DSKR------------------------------------
                                 EW                                 + KR                                    
Subjt:  ------------------------FEW---------------------------------DSKR------------------------------------

Query:  -----FWWGSVSSENKIHWKSWQKLCTHKAHGEMDFRDLSVFNQALLAKQSWLPIRYPHSLVARVLRGRYYKTGSFLKAGFGHNPSYIWRSI--------
             FWWG+ SS N IHW SW+KL   K  G + FRDL  FN ALLAKQ W  ++ P S+ A++ + +Y+ + S  +A  G+ PS+IWRS+        
Subjt:  -----FWWGSVSSENKIHWKSWQKLCTHKAHGEMDFRDLSVFNQALLAKQSWLPIRYPHSLVARVLRGRYYKTGSFLKAGFGHNPSYIWRSI--------

Query:  --------------IWGRELFRK---------------------------------RDIFLEEDAIVILHIPIKTPHRMDEIIWNGDSKGVFTVKSAYRL
                      IWG++  +                                    IF +E+   I  IPI   +  D++IW G  KG F+V SAY++
Subjt:  --------------IWGRELFRK---------------------------------RDIFLEEDAIVILHIPIKTPHRMDEIIWNGDSKGVFTVKSAYRL

Query:  GMQK
         M K
Subjt:  GMQK

XP_035546588.1 uncharacterized protein LOC118348634 [Juglans regia]7.6e-11830.29Show/hide
Query:  HIDATIEEKKWS--WRFTGIYENPVRGLHHETWSLMKRLSDQSEMPWVLGGGFNEITNHFEKSGGSDRLDSTIKGFREAMDSCGVLDVGFCGPNYTWCNK
        HI+A I+E + +  W  TG Y  P   L  ETWSL+     Q +  W + G FNEI  H EKSGG  + +  +  FRE ++  G+ D+G+ G  YTW NK
Subjt:  HIDATIEEKKWS--WRFTGIYENPVRGLHHETWSLMKRLSDQSEMPWVLGGGFNEITNHFEKSGGSDRLDSTIKGFREAMDSCGVLDVGFCGPNYTWCNK

Query:  HFDHDLIWERLDRFLINQDMQSRCGSFKVFHLALLASDHRPLLAEWKEEPPDQMQSVLNHLRRFEEIWTKYDECKDIVKQVWQE--HGRRSFRNLTEKTK
        H D     ERLDR + N   +       V  L    SDHRP+L    +E   Q +     L RFE  W+  + C++++  VW+E   GR     L  K  
Subjt:  HFDHDLIWERLDRFLINQDMQSRCGSFKVFHLALLASDHRPLLAEWKEEPPDQMQSVLNHLRRFEEIWTKYDECKDIVKQVWQE--HGRRSFRNLTEKTK

Query:  ECLTRLGRWSRSHYDRSIKGAIARKE--KENQNSLSSGVLCNDEEMSKLEKELENLLEDDEIYWKQRSHEDWLQWRDHNTKWFHMRASTGRKTNRVRGLF
        E    L RWS+       K    + E  KE Q +  S    N   +  L+KE+  LLE +++ W+QR+  +W Q  D NTK+FH  A+  RK NR++ + 
Subjt:  ECLTRLGRWSRSHYDRSIKGAIARKE--KENQNSLSSGVLCNDEEMSKLEKELENLLEDDEIYWKQRSHEDWLQWRDHNTKWFHMRASTGRKTNRVRGLF

Query:  DGIGDWFDEDAEMARVANLYFQDLFQSSSPHNEAIERILEATPVCISEDQNHMLTATFTREEIFCVIRSMHPTRAPGSDGIQAIFNQKYWDIV-------
        D     +     +  V   +F+ LF SS P +  IE  L     C++   N  L+ TFTREEI   ++SM P ++PG DG  A F QK+W+ +       
Subjt:  DGIGDWFDEDAEMARVANLYFQDLFQSSSPHNEAIERILEATPVCISEDQNHMLTATFTREEIFCVIRSMHPTRAPGSDGIQAIFNQKYWDIV-------

Query:  ------EKNLDQINN-TYIALIPKVNDPKSMKDFRPISLRSMLYKVIAKTLANRLKRVLNNIIFPSQSAFIPRRLITDNAILGFECIHAVKNKRQGKDGV
              E +L    N TY+ALIPK++ P+   DFRPISL +++YK++AK LANRLK VLN II P+QSAFIP RLI+DN ++ +E +H++K +++GK G 
Subjt:  ------EKNLDQINN-TYIALIPKVNDPKSMKDFRPISLRSMLYKVIAKTLANRLKRVLNNIIFPSQSAFIPRRLITDNAILGFECIHAVKNKRQGKDGV

Query:  IALKLDMSIAYDRVEWVYIRGIMEHMGFSSRWIDLIMRCVESVSFQVLLNGLPSSF--------------------------------------------
        +A+KLDMS AYDR+EW Y+R ++  MGF  +WI+LIM CV SVS+ VL+NG PS                                              
Subjt:  IALKLDMSIAYDRVEWVYIRGIMEHMGFSSRWIDLIMRCVESVSFQVLLNGLPSSF--------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------EWDSK-----------------------------------------RFWWGSVSSENKIHWKSWQKLCTHKAHGEMDFRDLSVFNQALLAKQS
                W +                                          RFWWG+  S   +HW SW+++   K  G M +RDL  FN ALLAKQ 
Subjt:  -------EWDSK-----------------------------------------RFWWGSVSSENKIHWKSWQKLCTHKAHGEMDFRDLSVFNQALLAKQS

Query:  WLPIRYPHSLVARVLRGRYYKTGSFLKAGFGHNPSYIWRSI----------------------IWGR-------------------------ELFRK---
        W  ++  +S+ A++L+ +YYK  S L A  GH PS++WRS+                      IWG                          EL +    
Subjt:  WLPIRYPHSLVARVLRGRYYKTGSFLKAGFGHNPSYIWRSI----------------------IWGR-------------------------ELFRK---

Query:  ------RDIFLEEDAIVILHIPIKTPHRMDEIIWNGDSKGVFTVKSAYRLGMQK
              R+IF +E+A  I  IP+ +    D ++W    KG+F+V+SAY L M++
Subjt:  ------RDIFLEEDAIVILHIPIKTPHRMDEIIWNGDSKGVFTVKSAYRLGMQK

TrEMBL top hitse value%identityAlignment
A0A2N9GY38 Reverse transcriptase domain-containing protein7.7e-12431.73Show/hide
Query:  HIDATIEEK-KWSWRFTGIYENPVRGLHHETWSLMKRLSDQSEMPWVLGGGFNEITNHFEKSGGSDRLDSTIKGFREAMDSCGVLDVGFCGPNYTWCNKH
        HIDA +++    +WRFTG Y  P      E+W+L++RL+ QS +PW   G FNE+    EK G  +R +  ++ FR+ +D CG +D+GF GP +TW N  
Subjt:  HIDATIEEK-KWSWRFTGIYENPVRGLHHETWSLMKRLSDQSEMPWVLGGGFNEITNHFEKSGGSDRLDSTIKGFREAMDSCGVLDVGFCGPNYTWCNKH

Query:  FDHDLIWERLDRFLINQDMQSRCGSFKVFHLALLASDHRPLLAEWKEEPPDQMQSVLNHLRRFEEIWTKYDECKDIVKQVWQE--HGRRSFRNLTEKTKE
           D+ WERLDR +   +      S +V+HL +  SDH+PL   W    P  M+  +    RFEE+WT    C++ V   W++  +G   F ++ EK   
Subjt:  FDHDLIWERLDRFLINQDMQSRCGSFKVFHLALLASDHRPLLAEWKEEPPDQMQSVLNHLRRFEEIWTKYDECKDIVKQVWQE--HGRRSFRNLTEKTKE

Query:  CLTRLGRWSRSHYDRSIKGAIARKEKENQNSLSSGVLCN-----DEEMSKLEKELENLLEDDEIYWKQRSHEDWLQWRDHNTKWFHMRASTGRKTNRVRG
        C   L  WS+ ++     G I  K KE ++ L      +       ++  L +EL +LL  +E  W+QRS  +WL+  D NT++FH RA+  ++ N V  
Subjt:  CLTRLGRWSRSHYDRSIKGAIARKEKENQNSLSSGVLCN-----DEEMSKLEKELENLLEDDEIYWKQRSHEDWLQWRDHNTKWFHMRASTGRKTNRVRG

Query:  LFDGIGDWFDEDAEMARVANLYFQDLFQSSSPHNEAIERILEATPVCISEDQNHMLTATFTREEIFCVIRSMHPTRAPGSDGIQAIFNQKYWDIV-----
        L +  G W    A+   +   ++  LFQ+  P  + I+++ E     ++E+ N  L   FT  ++   ++ M P +APG DG+  +F QKYW ++     
Subjt:  LFDGIGDWFDEDAEMARVANLYFQDLFQSSSPHNEAIERILEATPVCISEDQNHMLTATFTREEIFCVIRSMHPTRAPGSDGIQAIFNQKYWDIV-----

Query:  ---------EKNLDQINNTYIALIPKVNDPKSMKDFRPISLRSMLYKVIAKTLANRLKRVLNNIIFPSQSAFIPRRLITDNAILGFECIHAVKNKRQGKD
                  K L  IN+TYI LIPK+ +P+++ DFRPISL +++YK+I+K LANRLK +L  I+  SQSAF+P RLITDN ++ FE +H ++++++G+ 
Subjt:  ---------EKNLDQINNTYIALIPKVNDPKSMKDFRPISLRSMLYKVIAKTLANRLKRVLNNIIFPSQSAFIPRRLITDNAILGFECIHAVKNKRQGKD

Query:  GVIALKLDMSIAYDRVEWVYIRGIMEHMGFSSRWIDLIMRCVESVSFQVLLNGLPSSFEWDS--------------------------------------
        G +ALKLDMS AYDRVEW Y++ +ME MGFSS+W+ ++M C+ +VS+ +L+NG P  F   S                                      
Subjt:  GVIALKLDMSIAYDRVEWVYIRGIMEHMGFSSRWIDLIMRCVESVSFQVLLNGLPSSFEWDS--------------------------------------

Query:  -------------------------------------------------------KRFWWGSVSSENKIHWKSWQKLCTHKAHGEMDFRDLSVFNQALLA
                                                               +RFWWG    + K+HW SW  LC  KA G + FR+L  FN+ALLA
Subjt:  -------------------------------------------------------KRFWWGSVSSENKIHWKSWQKLCTHKAHGEMDFRDLSVFNQALLA

Query:  KQSWLPIRYPHSLVARVLRGRYYKTGSFLKAGFGHNPSYIWRSIIWGRELFRK-----------------------------------------------
        KQ W  +  P SL  +V + +Y+   S L+A      SY W+SI+  R+L +K                                               
Subjt:  KQSWLPIRYPHSLVARVLRGRYYKTGSFLKAGFGHNPSYIWRSIIWGRELFRK-----------------------------------------------

Query:  ----------RDIFLEEDAIVILHIPIKTPHRMDEIIWNGDSKGVFTVKSAYRL
                  R+ FL  DA  I+ IP+ +  + D ++W G   G + V+S Y L
Subjt:  ----------RDIFLEEDAIVILHIPIKTPHRMDEIIWNGDSKGVFTVKSAYRL

A0A2N9I2P8 Reverse transcriptase domain-containing protein4.5e-12431.17Show/hide
Query:  HIDATIEEKK--WSWRFTGIYENPVRGLHHETWSLMKRLSDQSEMPWVLGGGFNEITNHFEKSGGSDRLDSTIKGFREAMDSCGVLDVGFCGPNYTWCNK
        HIDA + +K+   S+R TG Y NP      E+W+L+K LS  +  PW+  G FNEI ++ E+ G   R    I+ FREA+    + D+GF G  +TW NK
Subjt:  HIDATIEEKK--WSWRFTGIYENPVRGLHHETWSLMKRLSDQSEMPWVLGGGFNEITNHFEKSGGSDRLDSTIKGFREAMDSCGVLDVGFCGPNYTWCNK

Query:  HFDHDLIWERLDRFLINQDMQSRCGSFKVFHLALLASDHRPLLAEWKEEPPDQMQSVLNHLRRFEEIWTKYDECKDIVKQVWQEHGRRSFR--NLTEKTK
              +  RLDR L +    +   S  V+HL +  SDH PLL    + P   +      + RFE +WTK ++C+ ++ + W E  R   R   +TEK K
Subjt:  HFDHDLIWERLDRFLINQDMQSRCGSFKVFHLALLASDHRPLLAEWKEEPPDQMQSVLNHLRRFEEIWTKYDECKDIVKQVWQEHGRRSFR--NLTEKTK

Query:  ECLTRLGRWSRSHYDRSIKGAIARKEKENQNSLSSGVLCNDEEMSKLEKELENLLEDDEIYWKQRSHEDWLQWRDHNTKWFHMRASTGRKTNRVRGLFDG
        +C   L  WS+  +  S+  +I  K ++ Q+  +  +      + +L+ EL  LLE +EI+W+QRS   W+   D NTK+FH   +  R+TN +RGL+D 
Subjt:  ECLTRLGRWSRSHYDRSIKGAIARKEKENQNSLSSGVLCNDEEMSKLEKELENLLEDDEIYWKQRSHEDWLQWRDHNTKWFHMRASTGRKTNRVRGLFDG

Query:  IGDWFDEDAEMARVANLYFQDLFQSSSPHNEAIERILEATPVCISEDQNHMLTATFTREEIFCVIRSMHPTRAPGSDGIQAIFNQKYWDIVEKN------
           W  E  ++A +A  YFQ++F SS P  + I   LE     ++ D N  L A FT EE+F  ++ M+PT+APG DG+ AIF Q YW++V         
Subjt:  IGDWFDEDAEMARVANLYFQDLFQSSSPHNEAIERILEATPVCISEDQNHMLTATFTREEIFCVIRSMHPTRAPGSDGIQAIFNQKYWDIVEKN------

Query:  --------LDQINNTYIALIPKVNDPKSMKDFRPISLRSMLYKVIAKTLANRLKRVLNNIIFPSQSAFIPRRLITDNAILGFECIHAVKNKRQGKDGVIA
                L +IN T+IAL+PK+   + + DFRPI+L +++YK+I+K LANRLK++L  I+  SQSAF+P RLITDN ++ FE +H++  KR G+ G +A
Subjt:  --------LDQINNTYIALIPKVNDPKSMKDFRPISLRSMLYKVIAKTLANRLKRVLNNIIFPSQSAFIPRRLITDNAILGFECIHAVKNKRQGKDGVIA

Query:  LKLDMSIAYDRVEWVYIRGIMEHMGFSSRWIDLIMRCVESVSFQVLLN----------------------------------------------------
        LKLDMS AYDRVEW ++  IM  +GF+  WI LIM C++SVS+ VL+N                                                    
Subjt:  LKLDMSIAYDRVEWVYIRGIMEHMGFSSRWIDLIMRCVESVSFQVLLN----------------------------------------------------

Query:  -------------------------------------------------------------------------------GLPSSF---------------
                                                                                       GLPS                 
Subjt:  -------------------------------------------------------------------------------GLPSSF---------------

Query:  -----EWDSK-----------------------------------------RFWWGSVSSENKIHWKSWQKLCTHKAHGEMDFRDLSVFNQALLAKQSWL
              W  K                                          FWWG      K HW  W K+C  K  G + FRD+ +FN+ALLAKQ W 
Subjt:  -----EWDSK-----------------------------------------RFWWGSVSSENKIHWKSWQKLCTHKAHGEMDFRDLSVFNQALLAKQSWL

Query:  PIRYPHSLVARVLRGRYYKTGSFLKAGFGHNPSYIWRSIIWGRELFRK-------------------RDIFLEEDAIVILHIPIKTPHRMDEIIWNGDSK
         +++ +SL++RV + +Y+   SFL+A   H PSY WRS+I  R++                      R +F E +  VI  IP+    + D + W     
Subjt:  PIRYPHSLVARVLRGRYYKTGSFLKAGFGHNPSYIWRSIIWGRELFRK-------------------RDIFLEEDAIVILHIPIKTPHRMDEIIWNGDSK

Query:  GVFTVKSAYRL
        G+FTV+SAY +
Subjt:  GVFTVKSAYRL

A0A2N9IPS8 Reverse transcriptase domain-containing protein2.0e-12431.16Show/hide
Query:  HIDATI--EEKKWSWRFTGIYENPVRGLHHETWSLMKRLSDQSEMPWVLGGGFNEITNHFEKSGGSDRLDSTIKGFREAMDSCGVLDVGFCGPNYTWCNK
        HIDA I  +EK   +R TG Y NP      E+W+L+K LS  S  PW+  G FNEI ++ E+ G   R +  I+ FREA+  CG+ D+G+ G +YTW  K
Subjt:  HIDATI--EEKKWSWRFTGIYENPVRGLHHETWSLMKRLSDQSEMPWVLGGGFNEITNHFEKSGGSDRLDSTIKGFREAMDSCGVLDVGFCGPNYTWCNK

Query:  HFDHDLIWERLDRFLINQDMQSRCGSFKVFHLALLASDHRPLLAEWKEEPPDQMQSVLNHLRRFEEIWTKYDECKDIVKQVWQE---HGRRSFRNLTEKT
           + L+  RLDR + +    +      V HLA+  SDH P+L +  + P   +      L RFE +W K ++C++++   W +    G   F  + EK 
Subjt:  HFDHDLIWERLDRFLINQDMQSRCGSFKVFHLALLASDHRPLLAEWKEEPPDQMQSVLNHLRRFEEIWTKYDECKDIVKQVWQE---HGRRSFRNLTEKT

Query:  KECLTRLGRWSRSHYDRSIKGAIARKEKENQNSLSSGVLCNDEEMSKLEKELENLLEDDEIYWKQRSHEDWLQWRDHNTKWFHMRASTGRKTNRVRGLFD
        K C T L  WSR  +  S+  +I RK ++ Q+ ++         + +L+ +L  LLE +EI+W+QRS   W+   D NTK+FH + +  R+TN + GL D
Subjt:  KECLTRLGRWSRSHYDRSIKGAIARKEKENQNSLSSGVLCNDEEMSKLEKELENLLEDDEIYWKQRSHEDWLQWRDHNTKWFHMRASTGRKTNRVRGLFD

Query:  GIGDWFDEDAEMARVANLYFQDLFQSSSPHNEAIERILEATPVCISEDQNHMLTATFTREEIFCVIRSMHPTRAPGSDGIQAIFNQKYWDIVEKNLDQ--
          G W  E  ++A +A  YFQ +F SS+P  E+I  +L+     ++   N  L A FT++E+   ++ M+PT+APG DG+ AIF Q YWDIV   + Q  
Subjt:  GIGDWFDEDAEMARVANLYFQDLFQSSSPHNEAIERILEATPVCISEDQNHMLTATFTREEIFCVIRSMHPTRAPGSDGIQAIFNQKYWDIVEKNLDQ--

Query:  ------------INNTYIALIPKVNDPKSMKDFRPISLRSMLYKVIAKTLANRLKRVLNNIIFPSQSAFIPRRLITDNAILGFECIHAVKNKRQGKDGVI
                    IN T+IALIPKV +P+++ DFRPISL +++YK+++K LANRLK+VL  +I  +QSAF+P RLITDN ++ FE +H++  KR+GK G +
Subjt:  ------------INNTYIALIPKVNDPKSMKDFRPISLRSMLYKVIAKTLANRLKRVLNNIIFPSQSAFIPRRLITDNAILGFECIHAVKNKRQGKDGVI

Query:  ALKLDMSIAYDRVEWVYIRGIMEHMGFSSRWIDLIMRCVESVSFQVLLN---------------------------------------------------
        ALKLDMS AYDRVEWV++  IM  MGF+  WI L+M C+ SVS+ VL+N                                                   
Subjt:  ALKLDMSIAYDRVEWVYIRGIMEHMGFSSRWIDLIMRCVESVSFQVLLN---------------------------------------------------

Query:  --------------------------------------------------------------------------------GLPS--------SF------
                                                                                        GLPS        SF      
Subjt:  --------------------------------------------------------------------------------GLPS--------SF------

Query:  ------EWDSK-----------------------------------------RFWWGSVSSENKIHWKSWQKLCTHKAHGEMDFRDLSVFNQALLAKQSW
               W  K                                          FWWG      K HW  W KLC  KA G + FRDL  FN ALLAKQ W
Subjt:  ------EWDSK-----------------------------------------RFWWGSVSSENKIHWKSWQKLCTHKAHGEMDFRDLSVFNQALLAKQSW

Query:  LPIRYPHSLVARVLRGRYYKTGSFLKAGFGHNPSYIWRSIIWGRELFRK---------------------------------------------------
          +++ +SLV RV + +Y+  G F+ A  G+ PSY WRSI   R++ R                                                    
Subjt:  LPIRYPHSLVARVLRGRYYKTGSFLKAGFGHNPSYIWRSIIWGRELFRK---------------------------------------------------

Query:  ------RDIFLEEDAIVILHIPIKTPHRMDEIIWNGDSKGVFTVKSAYRLGMQ
                +F   +A +I  IP+    + D + WN    G+FTVKSAY L ++
Subjt:  ------RDIFLEEDAIVILHIPIKTPHRMDEIIWNGDSKGVFTVKSAYRLGMQ

A0A6J1DRA0 uncharacterized protein LOC1110224232.6e-12735.76Show/hide
Query:  IEEKKWSWRFTGIYENPVRGLHHETWSLMKRLSDQSEMPWVLGGGFNEITNHFEKSGGSDRLDSTIKGFREAMDSCGVLDVGFCGPNYTWCNKHFDHDLI
        ++E +++WRFTGIY + V+    ETW L+ RL    ++PW+LGG FNEI  + EK  G  R  S ++ F++ +D CG+LD GF G  +TWC+ H     I
Subjt:  IEEKKWSWRFTGIYENPVRGLHHETWSLMKRLSDQSEMPWVLGGGFNEITNHFEKSGGSDRLDSTIKGFREAMDSCGVLDVGFCGPNYTWCNKHFDHDLI

Query:  WERLDRFLINQDMQSRCGSFKVFHLALLASDHRPLLAEW---KEEPPDQMQSVLNHLRRFEEIWTKYDECKDIVKQVWQEHGRRSFRNLTEKTKECLTRL
        WERLDRFLIN  +     + ++ HL  LASDHRP+LAEW    E    + +    H  RFEE W  + ECK+IV++VW   G         K   CL  L
Subjt:  WERLDRFLINQDMQSRCGSFKVFHLALLASDHRPLLAEW---KEEPPDQMQSVLNHLRRFEEIWTKYDECKDIVKQVWQEHGRRSFRNLTEKTKECLTRL

Query:  GRWSRSHYDRSIKGAIARKEKENQNSLSSGVLCNDEEMSKLEKELENLLEDDEIYWKQRSHEDWLQWRDHNTKWFHMRASTGRKTNRVRGLFDGIGDWFD
         +W+      S++GAI RKE E Q  +  G     + + + +++LE LLE++E YW+Q                                          
Subjt:  GRWSRSHYDRSIKGAIARKEKENQNSLSSGVLCNDEEMSKLEKELENLLEDDEIYWKQRSHEDWLQWRDHNTKWFHMRASTGRKTNRVRGLFDGIGDWFD

Query:  EDAEMARVANLYFQDLFQSSSPHNEAIERILEATPVCISEDQNHMLTATFTREEIFCVIRSMHPTRAPGSDGIQAIFNQKYWDIVEKNLDQINNTYIALI
                            SP N+ +  +L++    ++E+  + L   F  EE+  V+  M P++AP                   ++ Q+N T+I LI
Subjt:  EDAEMARVANLYFQDLFQSSSPHNEAIERILEATPVCISEDQNHMLTATFTREEIFCVIRSMHPTRAPGSDGIQAIFNQKYWDIVEKNLDQINNTYIALI

Query:  PKVNDPKSMKDFRPISL----RSMLYKVIAKTLANRLKRVLNNIIFPSQSAFIPRRLITDNAILGFECIHAVKNKRQGKDGVIALKLDMSIAYDRVEWVY
         K  D K MKDF  ISL     +++YK+I+K LANRLK+VLN +I P+QSAF+P RLITDNAI+GFECIH++  + +GK G +ALKLDM  AYDRVEW Y
Subjt:  PKVNDPKSMKDFRPISL----RSMLYKVIAKTLANRLKRVLNNIIFPSQSAFIPRRLITDNAILGFECIHAVKNKRQGKDGVIALKLDMSIAYDRVEWVY

Query:  IR-------------------------GIMEHMGFSSRWIDLIMRCVESVSFQV--LLNGLPSSFEWDSKRFWWGSVSSENKIHWKSWQKLCTHKAHGEM
        +R                         G +  MG     I  + + + + +     L   + +       RFWWGS S + KIHW+SW+ LC  K  G M
Subjt:  IR-------------------------GIMEHMGFSSRWIDLIMRCVESVSFQV--LLNGLPSSFEWDSKRFWWGSVSSENKIHWKSWQKLCTHKAHGEM

Query:  DFRDLSVFNQALLAKQSWLPIRYPHSLVARVLRGRYYKTGSFLKAGFGHNPSYIWRSIIWGRELFRK---------------------------------
         FRD+S+FNQA+LAKQSW  +R+P SL+A+ LRG+Y+KTGSFL+A  G  PSY WRSI+WGR+LF+K                                 
Subjt:  DFRDLSVFNQALLAKQSWLPIRYPHSLVARVLRGRYYKTGSFLKAGFGHNPSYIWRSIIWGRELFRK---------------------------------

Query:  ----------------------RDIFLEEDAIVILHIPIKTPHRMDEIIWNGDSKGVFTVKSAYRLGMQ
                              R+ F+  +A +IL  P+ +  + DEIIW  D  G+F+V+SAY LG+Q
Subjt:  ----------------------RDIFLEEDAIVILHIPIKTPHRMDEIIWNGDSKGVFTVKSAYRLGMQ

A0A7N2LIH6 Uncharacterized protein2.4e-12530.23Show/hide
Query:  DSGLKHSEARHIDATIE--EKKWSWRFTGIYENPVRGLHHETWSLMKRLSDQSEMPWVLGGGFNEITNHFEKSGGSDRLDSTIKGFREAMDSCGVLDVGF
        D   K     HID  +        WR TG Y +P  G  + +W L++ L+ Q EMPW++ G FNEI +  EK G  DR  + +  FRE +  CG++D+GF
Subjt:  DSGLKHSEARHIDATIE--EKKWSWRFTGIYENPVRGLHHETWSLMKRLSDQSEMPWVLGGGFNEITNHFEKSGGSDRLDSTIKGFREAMDSCGVLDVGF

Query:  CGPNYTWCNKHFDHDLIWERLDRFLINQDMQSRCGSFKVFHLALLASDHRPLLAEWKEEPPDQMQSVLNHLRRFEEIWTKYDECKDIVKQVWQEHGRRSF
         GP +TWCN  F       RLDR + N+         KV H+++ ASDH  LLA +  +  +Q +        FEE+WT+ +ECK+IV+  W  +   S 
Subjt:  CGPNYTWCNKHFDHDLIWERLDRFLINQDMQSRCGSFKVFHLALLASDHRPLLAEWKEEPPDQMQSVLNHLRRFEEIWTKYDECKDIVKQVWQEHGRRSF

Query:  RNLTEKTKECLTRLGRWSRSHYDRSIKGAIARKEKENQNSLSSGVLCNDEEMSKLEKELENLLEDDEIYWKQRSHEDWLQWRDHNTKWFHMRASTGRKTN
          + E+ + C   L +W+++ +    KG   +K +  Q    + +    EE+  L+KE+  L   +E+ WKQRS   WLQ+ D N+K+FH  AS  R+ N
Subjt:  RNLTEKTKECLTRLGRWSRSHYDRSIKGAIARKEKENQNSLSSGVLCNDEEMSKLEKELENLLEDDEIYWKQRSHEDWLQWRDHNTKWFHMRASTGRKTN

Query:  RVRGLFDGIGDWFDEDAEMARVANLYFQDLFQSSSPHNEAIERILEATPVCISEDQNHMLTATFTREEIFCVIRSMHPTRAPGSDGIQAIFNQKYWDIVE
        R+ GL D +G W ++     ++   YF+D++ S+ P   + +  LEA    ++ + N  L   F   E++  ++ MHPT+APG DG+  IF QKYWDIV 
Subjt:  RVRGLFDGIGDWFDEDAEMARVANLYFQDLFQSSSPHNEAIERILEATPVCISEDQNHMLTATFTREEIFCVIRSMHPTRAPGSDGIQAIFNQKYWDIVE

Query:  KNL--------------DQINNTYIALIPKVNDPKSMKDFRPISLRSMLYKVIAKTLANRLKRVLNNIIFPSQSAFIPRRLITDNAILGFECIHAVKNKR
         ++                IN TYI LIPK  +P+ + +FRPISL +++YK+I+K LANRLK+VL+ +I  +QSAF+P R+ITDN I+ FE +H++  +R
Subjt:  KNL--------------DQINNTYIALIPKVNDPKSMKDFRPISLRSMLYKVIAKTLANRLKRVLNNIIFPSQSAFIPRRLITDNAILGFECIHAVKNKR

Query:  QGKDGVIALKLDMSIAYDRVEWVYIRGIMEHMGFSSRWIDLIMRCVESVSFQVLLNGLP-SSFE------------------------------------
        +GK+G++A+KLDMS AYDRVEW Y+  +M+ MGF  RWI LIM CV SVSF VL+NG P  SF                                     
Subjt:  QGKDGVIALKLDMSIAYDRVEWVYIRGIMEHMGFSSRWIDLIMRCVESVSFQVLLNGLP-SSFE------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------WDSK-----------------------------------------RFWWGSVSSENKIHWKSWQKLCTHKAHGEMDFRDLSVFNQA
                      W  K                                          FWWG    E K+ W SW+ LC  K  G M F+DL  FN A
Subjt:  --------------WDSK-----------------------------------------RFWWGSVSSENKIHWKSWQKLCTHKAHGEMDFRDLSVFNQA

Query:  LLAKQSWLPIRYPHSLVARVLRGRYYKTGSFLKAGFGHNPSYIWRSIIWGRELFRK--------------------------------------------
        LLAKQ W   + P+SL  RVL+ +Y+   SF++A  G  PSYIWRSI+  + + ++                                            
Subjt:  LLAKQSWLPIRYPHSLVARVLRGRYYKTGSFLKAGFGHNPSYIWRSIIWGRELFRK--------------------------------------------

Query:  -------------RDIFLEEDAIVILHIPIKTPHRMDEIIWNGDSKGVFTVKSAYR
                     +  F+  +A  IL IP+ + +  D ++W     G FTVKSAYR
Subjt:  -------------RDIFLEEDAIVILHIPIKTPHRMDEIIWNGDSKGVFTVKSAYR

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein8.7e-1622.3Show/hide
Query:  WERLDRFLINQDMQSRCGSFKVFHLALLASDHRPLLAEWKEEPPDQMQSV---LNHLRRFEEIWTKYDECKDIVKQVWQ--EHGRRSFRNLTEKTKEC--
        + ++D  + ++ + S+C   ++  +    SDH  +  E + +   Q +S    LN+L    + W  ++E K  +K  ++  E+   +++NL +  K    
Subjt:  WERLDRFLINQDMQSRCGSFKVFHLALLASDHRPLLAEWKEEPPDQMQSV---LNHLRRFEEIWTKYDECKDIVKQVWQ--EHGRRSFRNLTEKTKEC--

Query:  --LTRLGRWSRSHYDRSIKGAIAR-KEKENQNSLSSGVLCNDEEMSKLEKELENLLEDDEIYWKQRSHEDWLQWRDHN-TKWFHMRASTGRKTNRVRGLF
             L  + R      I    ++ KE E Q    S      +E++K+  EL+  +E  +   K      W   R +   +         R+ N++  + 
Subjt:  --LTRLGRWSRSHYDRSIKGAIAR-KEKENQNSLSSGVLCNDEEMSKLEKELENLLEDDEIYWKQRSHEDWLQWRDHN-TKWFHMRASTGRKTNRVRGLF

Query:  DGIGDWFDEDAEMARVANLYFQDLFQSSSPHNEAIERILEA-TPVCISEDQNHMLTATFTREEIFCVIRSMHPTRAPGSDGIQAIFNQKY----------
        +  GD   +  E+      Y++ L+ +   + E ++  L+  T   +++++   L    T  EI  +I S+   ++PG DG  A F Q+Y          
Subjt:  DGIGDWFDEDAEMARVANLYFQDLFQSSSPHNEAIERILEA-TPVCISEDQNHMLTATFTREEIFCVIRSMHPTRAPGSDGIQAIFNQKY----------

Query:  -WDIVEKNLDQINNTY---IALIPKV-NDPKSMKDFRPISLRSMLYKVIAKTLANRLKRVLNNIIFPSQSAFIPRRLITDNAILGFECIHAVKNKRQGKD
         +  +EK     N+ Y   I LIPK   D    ++FRPISL ++  K++ K LANR+++ +  +I   Q  FIP      N       I  + N+ + K+
Subjt:  -WDIVEKNLDQINNTY---IALIPKV-NDPKSMKDFRPISLRSMLYKVIAKTLANRLKRVLNNIIFPSQSAFIPRRLITDNAILGFECIHAVKNKRQGKD

Query:  GVIALKLDMSIAYDRVEWVYIRGIMEHMGFSSRWIDLIMRCVESVSFQVLLNG
         VI + +D   A+D+++  ++   +  +G    ++ +I    +  +  ++LNG
Subjt:  GVIALKLDMSIAYDRVEWVYIRGIMEHMGFSSRWIDLIMRCVESVSFQVLLNG

P11369 LINE-1 retrotransposable element ORF2 protein1.8e-1627.2Show/hide
Query:  GDWFDEDAEMARVANLYFQDLFQSSSPHNEAIERILEATPV-CISEDQNHMLTATFTREEIFCVIRSMHPTRAPGSDGIQAIFNQKYWD----IVEKNLD
        GD   +  E+      +++ L+ +   + + +++ L+   V  +++DQ   L +  + +EI  VI S+   ++PG DG  A F Q + +    I+ K   
Subjt:  GDWFDEDAEMARVANLYFQDLFQSSSPHNEAIERILEATPV-CISEDQNHMLTATFTREEIFCVIRSMHPTRAPGSDGIQAIFNQKYWD----IVEKNLD

Query:  QI-------NNTY---IALIPK-VNDPKSMKDFRPISLRSMLYKVIAKTLANRLKRVLNNIIFPSQSAFIPRRLITDNAILGFECIHAVKNKRQGKDGVI
        +I       N+ Y   I LIPK   DP  +++FRPISL ++  K++ K LANR++  +  II P Q  FIP      N       IH + NK + K+ +I
Subjt:  QI-------NNTY---IALIPK-VNDPKSMKDFRPISLRSMLYKVIAKTLANRLKRVLNNIIFPSQSAFIPRRLITDNAILGFECIHAVKNKRQGKDGVI

Query:  ALKLDMSIAYDRVEWVYIRGIMEHMGFSSRWIDLIMRCVESVSFQVLLNG
         + LD   A+D+++  ++  ++E  G    ++++I          + +NG
Subjt:  ALKLDMSIAYDRVEWVYIRGIMEHMGFSSRWIDLIMRCVESVSFQVLLNG

P14381 Transposon TX1 uncharacterized 149 kDa protein4.3e-2321.99Show/hide
Query:  RHIDATIEEKKWSWRFTGIY---ENPVRGLHHETWSLMKRLSDQSEMPWVLGGGFNEITNHFEKSGGSDRLDSTIKGFREAMDSCGVLDV-GFCGPN---
        R +   + E   ++    +Y     P R    E+ S      D  E   ++GG FN   +  +++    R DS+    RE +    ++DV     P    
Subjt:  RHIDATIEEKKWSWRFTGIY---ENPVRGLHHETWSLMKRLSDQSEMPWVLGGGFNEITNHFEKSGGSDRLDSTIKGFREAMDSCGVLDV-GFCGPN---

Query:  YTWCNKHFDHDLIWERLDRFLINQDMQSRCGSFKV----------FHLALLASDHRPLLAEWKEEPPDQMQSVLNHLRRFEEIWTKYDECKD--IVKQVW
        +T+      H +   R+DR  I+  + SR  S  +            L +  +   P  A W     + +       +   + W  +   +D       W
Subjt:  YTWCNKHFDHDLIWERLDRFLINQDMQSRCGSFKV----------FHLALLASDHRPLLAEWKEEPPDQMQSVLNHLRRFEEIWTKYDECKD--IVKQVW

Query:  QEHGRRSFRNLTEKTKECLTRLGRWSRSHYDRSIKGAIARKEKENQNSLSSGVLCNDEEMSKLEKELENLLEDDEIYWKQRSHEDWLQWRDHNTKWFHMR
         + G+   + L ++     T+     R+    ++ G +   E+    S    + C   E  + ++ L N+ +        RS    L   D  +++F+  
Subjt:  QEHGRRSFRNLTEKTKECLTRLGRWSRSHYDRSIKGAIARKEKENQNSLSSGVLCNDEEMSKLEKELENLLEDDEIYWKQRSHEDWLQWRDHNTKWFHMR

Query:  ASTGRKTNRVRGLFDGIGDWFDEDAEMARVANLYFQDLFQSSSPHNEAIERILEATPVCISEDQNHMLTATFTREEIFCVIRSMHPTRAPGSDGIQAIFN
                ++  LF   G   ++   +   A  ++Q+LF       +A E + +  PV +SE +   L    T +E+   +R M   ++PG DG+   F 
Subjt:  ASTGRKTNRVRGLFDGIGDWFDEDAEMARVANLYFQDLFQSSSPHNEAIERILEATPVCISEDQNHMLTATFTREEIFCVIRSMHPTRAPGSDGIQAIFN

Query:  QKYWDIVEKNLDQI--------------NNTYIALIPKVNDPKSMKDFRPISLRSMLYKVIAKTLANRLKRVLNNIIFPSQSAFIPRRLITDNAILGFEC
        Q +WD +  +  ++                  ++L+PK  D + +K++RP+SL S  YK++AK ++ RLK VL  +I P QS  +P R I DN  L  + 
Subjt:  QKYWDIVEKNLDQI--------------NNTYIALIPKVNDPKSMKDFRPISLRSMLYKVIAKTLANRLKRVLNNIIFPSQSAFIPRRLITDNAILGFEC

Query:  IHAVKNKRQGKDGVIALKLDMSIAYDRVEWVYIRGIMEHMGFSSRWIDLIMRCVESVSFQVLLN
        +H    +R G   +  L LD   A+DRV+  Y+ G ++   F  +++  +     S    V +N
Subjt:  IHAVKNKRQGKDGVIALKLDMSIAYDRVEWVYIRGIMEHMGFSSRWIDLIMRCVESVSFQVLLN

P93295 Uncharacterized mitochondrial protein AtMg003108.2e-2252.58Show/hide
Query:  FWWGSVSSENKIHWKSWQKLCTHKA-HGEMDFRDLSVFNQALLAKQSWLPIRYPHSLVARVLRGRYYKTGSFLKAGFGHNPSYIWRSIIWGRELFRK
        FWW S  ++ KI W +WQKLC  K   G + FRDL  FNQALLAKQS+  I  PH+L++R+LR RY+   S ++   G  PSY WRSII GREL  +
Subjt:  FWWGSVSSENKIHWKSWQKLCTHKA-HGEMDFRDLSVFNQALLAKQSWLPIRYPHSLVARVLRGRYYKTGSFLKAGFGHNPSYIWRSIIWGRELFRK

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein4.8e-1723.45Show/hide
Query:  NKEEVQRSPNDTQFEKTQGNPQD--SGLKHSEARHIDATIEEKKWSWRFTGIYENPVRGLHHETW--SLMKRLSDQSEMPWVLGGGFNEI---TNHFEKS
        +K  + RS       K  G+P+   SG   SE+ +  A ++    SWR    Y     G     W  S+   +  +++   +L G F++I   ++H+   
Subjt:  NKEEVQRSPNDTQFEKTQGNPQD--SGLKHSEARHIDATIEEKKWSWRFTGIYENPVRGLHHETW--SLMKRLSDQSEMPWVLGGGFNEI---TNHFEKS

Query:  GGSDRLDSTIKGFREAMDSCGVLDVGFCGPNYTWCNKHFDHDLIWERLDRFLINQD-MQSRCGSFKVFHLALLASDHRPLLAEWKEEPPDQMQSVLNHLR
          S  +   ++ F+  +    ++D+   G +YTW N H D + I  +LDR + N D   S   +  VF L+ + SDH P +   +  P    +       
Subjt:  GGSDRLDSTIKGFREAMDSCGVLDVGFCGPNYTWCNKHFDHDLIWERLDRFLINQD-MQSRCGSFKVFHLALLASDHRPLLAEWKEEPPDQMQSVLNHLR

Query:  RFEEIWTKYDECKDIVKQVWQEH---GRRSFR--NLTEKTKECLTRLGRWSRSHYDRSIKGAIARKEKENQNSLSSGVLCN-DEEMSKLE----KELENL
        R+    + +      +   W+E    G   F      +  K+C   L R    +     K A+     ++  S+ S +L N  + + ++E    K+    
Subjt:  RFEEIWTKYDECKDIVKQVWQEH---GRRSFR--NLTEKTKECLTRLGRWSRSHYDRSIKGAIARKEKENQNSLSSGVLCN-DEEMSKLE----KELENL

Query:  LEDDEIYWKQRSHEDWLQWRDHNTKWFHMRASTGRKTNRVRGLFDGIGDWFDEDAEMARVANL------YFQDLFQSSSP--HNEAIERILEATPVCISE
            E +++Q+S   WLQ  D NT++FH      +  N ++ L        D+D  +  V  +      Y+  L  S S     ++++RI +  P   ++
Subjt:  LEDDEIYWKQRSHEDWLQWRDHNTKWFHMRASTGRKTNRVRGLFDGIGDWFDEDAEMARVANL------YFQDLFQSSSP--HNEAIERILEATPVCISE

Query:  DQNHMLTATFTREEIFCVIRSMHPTRAPGSDGIQAIFNQKYWDIVEKN--------------LDQINNTYIALIPKVNDPKSMKDFRPISLRSMLYKVI
             L+A  + +EI   + +M   +APG D   A F  + W +V+ +              L + N T I LIPKV     +  FRP+S  +++YK+I
Subjt:  DQNHMLTATFTREEIFCVIRSMHPTRAPGSDGIQAIFNQKYWDIVEKN--------------LDQINNTYIALIPKVNDPKSMKDFRPISLRSMLYKVI

AT4G20520.1 RNA binding;RNA-directed DNA polymerases3.5e-1238.64Show/hide
Query:  LANRLKRVLNNIIFPSQSAFIPRRLITDNAILGFECIHAVKNKRQGKDGVIALKLDMSIAYDRVEWVYIRGIMEHMGFSSRWIDLIMR
        +  RLK ++ N+I P+Q++FIP R+ TDN +   E +H+++ K+ G  G + LKLD+  AYDR+ W Y+   +   GF   W+  I R
Subjt:  LANRLKRVLNNIIFPSQSAFIPRRLITDNAILGFECIHAVKNKRQGKDGVIALKLDMSIAYDRVEWVYIRGIMEHMGFSSRWIDLIMR

AT4G29090.1 Ribonuclease H-like superfamily protein7.3e-1839.58Show/hide
Query:  FWWGSVSSENKIHWKSWQKLCTHKAHGEMDFRDLSVFNQALLAKQSWLPIRYPHSLVARVLRGRYYKTGSFLKAGFGHNPSYIWRSIIWGRELFRK
        FWW +      +HWK+W  L  +KA G + F+D+  FN ALL KQ W  +  P SL+A+V + RY+     L A  G  PS++W+SI   +E+ R+
Subjt:  FWWGSVSSENKIHWKSWQKLCTHKAHGEMDFRDLSVFNQALLAKQSWLPIRYPHSLVARVLRGRYYKTGSFLKAGFGHNPSYIWRSIIWGRELFRK

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein5.8e-2352.58Show/hide
Query:  FWWGSVSSENKIHWKSWQKLCTHKA-HGEMDFRDLSVFNQALLAKQSWLPIRYPHSLVARVLRGRYYKTGSFLKAGFGHNPSYIWRSIIWGRELFRK
        FWW S  ++ KI W +WQKLC  K   G + FRDL  FNQALLAKQS+  I  PH+L++R+LR RY+   S ++   G  PSY WRSII GREL  +
Subjt:  FWWGSVSSENKIHWKSWQKLCTHKA-HGEMDFRDLSVFNQALLAKQSWLPIRYPHSLVARVLRGRYYKTGSFLKAGFGHNPSYIWRSIIWGRELFRK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGAGACCCAATTGTGCTGAAGCGGGCTTGAATTATGACACCAAAAGCCAAAGATTGAAGCCCCAAACCAAGAAGAAGCGAGACCAAGCCGAGAAAAGGAGTAGGAA
GTCGAGTCAAGAGGTTGGCGTCTCGACGCCGATCTTGGCGTTGATGTGTAGAAATTGGTTGAGGAGCTGCGAAATAGGTGCTGGAAGGTTGTCCAGTCAGTTTCGGGGTC
AAAACAACCGGTTAGGTGCGAAGAAGAATCGTCAACAACGACCCGCGTGCGACTCGGAACGGCCTCTGTTTTTCCAGAAATTCTGTCGCATTTCATTTTTGCCTTCCCAG
AAATGCGGTCGCATTTTTGGGGAATTCCAGCTTCTCCTCTGGGATATTCTTGAAGTCTGGGGTAAGGATGAAGACAGATGGATTTCAATTTCGTATGAAAAGCTTCCAAA
TTTTTGCTATGGATGTGGTCGTCTAGGTCATATCATAAATGATTGTGAGGAGCAATGTGGAGCTACTGAGGAAGAGCTCCCTTATGGACCTTGGTTACGTGAACCAGTTA
AATTAAAAATCAGTGATGGCATTCACTCCCCCAGGCCTACTATGTGTGGTAATCCTTCCCTGGAATCATCTAATGCTCCAGCCATCCCCAAACCGTCGATGTCCACTCAA
CCGCCGGGGTCAGTGAGGTCCGTTGCGGCGGAGAACACAGAGATGGAAAGAGAGTCAGTTTCCAAAATTAATGGGAGCGGTTATGGTAAAGAATCTGGAATGGAAACTGG
CGGTGATTTAGCCCTGATACAGGAAGGCGATGATTTTGTGGTTAATTGTCTCAACGGTCATATTCTAGATAACATCTTAATGGAGGTGGACCAGGCTTTTGTTGGGTTTT
ATGCCCTAAAACTCGTAGATAGTGGATCCTCTCCCATTCTTCTTCTTCGTTGGTACCCGATCGACAGCTTCCTCGTGCCCCATCTCCGGCGACGTTCTTTAGCACGCTCC
GACCAACAACAGGCCTCACGACGAGCTCCACGGCTGCATGCATTCCCTCCGACGAGTCTCGGCGGTGCAGCGGCATTCGCAGTTGGTGTTTTCGTCCGAGTTCTAGTGCA
GCAGCAACGTGCTCGTCGTGGCAGCAGCTTCGTCTTCGCTGTGAGTGTTGGCAGCAGCTACTCCGGCGAACTCACGACGGCGCAGCTTCAGTTTAGCGACGGTGGCGTCT
TCCTCGGCGAGTCAGTGAGCTTCAACGTGCGGCTGCAACAAGCGGGGTCTGCGACTTCCATCGATTTTTTTTCTCCTCAGTTTAGCAACTCGTGGCGTCGTTCAACGACG
ATTTTGGTGAGTTCTCGAATCAGTAGCAGAGTGTTTCCAGCACCTTCTCCGGCGAGCGGCGGCGCAAATAATAGTGTTTTGGGTCGTTTCTCGGCGTTTTTGACGTGTTT
GAGTTGTGGGTCTTTTAGTTCCTTGATTTCCAACAATTTTAGCAATTGGGTAAGTGATTTTCAGCAAGGTCTTGAAGCTAGGGAATTATTGAGATTTTGTAAGATCATTA
AGCCGAGAGATGCTTACGCAAGGCTTAGGGACATGAAATTGATGTACCTAAGTCGAGCTTTTGATGGGACAAATAAAGAGGAAGTGCAACGAAGCCCAAATGACACACAG
TTTGAGAAAACACAAGGCAATCCTCAAGACAGTGGGCTGAAACACAGTGAGGCTAGGCACATTGATGCTACTATCGAGGAGAAAAAGTGGTCTTGGAGATTCACGGGAAT
CTACGAGAACCCTGTCAGAGGATTACACCATGAGACCTGGTCCTTGATGAAAAGATTGAGTGACCAATCCGAGATGCCATGGGTCCTAGGAGGGGGCTTTAATGAAATTA
CAAATCATTTTGAGAAATCGGGTGGGTCGGACAGGCTGGACTCAACTATTAAGGGGTTTCGGGAGGCAATGGATAGTTGTGGAGTTCTTGATGTGGGCTTTTGTGGTCCA
AATTACACTTGGTGTAATAAACATTTTGATCATGATTTGATTTGGGAAAGGTTAGATAGGTTTTTGATTAATCAAGATATGCAGAGTAGATGTGGTAGTTTCAAGGTTTT
TCACTTGGCTCTTCTTGCATCAGATCATAGACCTCTTCTGGCTGAGTGGAAAGAAGAGCCTCCAGACCAAATGCAGTCAGTGTTGAATCATCTTAGGAGATTTGAAGAGA
TCTGGACAAAGTATGATGAGTGCAAAGATATAGTGAAGCAGGTTTGGCAGGAGCATGGTCGGCGGAGTTTTAGGAATTTGACTGAGAAAACCAAGGAGTGCCTTACTCGT
TTAGGTCGGTGGAGCAGATCTCATTATGACAGGTCCATTAAGGGAGCTATTGCAAGGAAAGAAAAAGAAAATCAAAATTCCCTAAGCAGTGGGGTGTTGTGCAATGATGA
GGAAATGAGCAAACTTGAAAAAGAACTTGAAAACCTTCTGGAGGATGATGAGATCTACTGGAAGCAGCGTTCTCATGAAGACTGGCTTCAATGGAGAGATCACAACACGA
AATGGTTCCATATGAGAGCTAGCACTGGAAGGAAGACTAATAGAGTTCGAGGTCTGTTCGATGGGATTGGTGATTGGTTTGATGAGGACGCTGAGATGGCTAGGGTGGCT
AATCTGTACTTCCAGGACCTTTTCCAGTCATCTAGCCCTCATAATGAAGCTATTGAGAGGATTTTGGAGGCTACGCCTGTTTGCATCTCTGAAGACCAAAATCACATGCT
CACAGCAACATTCACAAGGGAGGAGATTTTTTGTGTCATTAGGAGTATGCATCCTACTAGAGCTCCTGGTTCTGATGGAATTCAAGCAATCTTCAACCAAAAATACTGGG
ACATAGTTGAGAAGAATCTGGATCAGATCAACAACACTTATATTGCACTGATCCCAAAGGTTAATGACCCGAAATCCATGAAGGATTTTAGGCCAATTAGTCTACGTTCA
ATGCTCTATAAAGTCATTGCTAAAACTCTTGCTAACAGGTTAAAGAGAGTTTTGAACAACATCATTTTTCCTAGCCAATCTGCATTTATTCCTAGGAGACTCATCACTGA
TAATGCCATCCTTGGGTTCGAATGCATTCACGCTGTAAAAAACAAAAGGCAGGGGAAGGATGGTGTGATAGCTCTTAAGTTAGACATGAGTATAGCTTATGACCGTGTGG
AATGGGTGTATATTAGGGGAATCATGGAGCATATGGGTTTCAGTAGCCGTTGGATTGATCTTATTATGCGATGCGTGGAATCGGTAAGCTTCCAAGTTCTTTTGAATGGG
CTTCCAAGCTCTTTTGAATGGGATTCGAAGAGGTTTTGGTGGGGCTCGGTGTCTTCTGAAAACAAGATCCATTGGAAAAGTTGGCAAAAATTGTGCACTCACAAAGCTCA
TGGAGAGATGGATTTTAGAGATCTCAGTGTTTTTAACCAAGCATTATTAGCAAAACAGAGCTGGCTGCCTATTCGATATCCTCATAGCCTCGTCGCTCGAGTGCTAAGGG
GCCGCTATTATAAGACGGGATCTTTTCTCAAAGCTGGCTTTGGTCATAACCCTTCATATATCTGGCGGAGCATTATTTGGGGGAGGGAGCTTTTCAGGAAAAGAGACATC
TTTCTGGAGGAGGATGCAATTGTTATCTTACATATCCCAATTAAAACTCCTCACCGAATGGACGAGATTATTTGGAATGGGGATTCCAAGGGAGTATTCACGGTTAAAAG
TGCATATCGGTTGGGGATGCAAAAGTGGGATGGACAACTCCTGACTATTGCGAAGGAGGAAGGGGAAGAGACTAACCTGAGCATCGCGCCAAGTCTGATTCAAGAGAATA
GGGAAGCTGCCTCTACTGGAATCTCGGCGAATGTCTGGCTACCTCCACCGTCGGATAAGCTGGTTGGAAGCCCTAGCGGTGGTCGAAGATCTGAGGATGATTTCGCGCCG
CCAAGGTCTTCATCGAGCTTGACTCCATCCAGGCCACGTATTTCGGCTTTGACCGTCCGATCTGATTCCTATCTTGCTCCGTCTACCCTTATACGTTATACCTCATCCCC
AAGGCCGCGTTTCGATTTCTCTATTCTCTCAGCCCAGCCGCCGAACATCCCCTCTGCATTTGGCGTTTTCATCTCAGTCATCTCCACCTCTCTCTCGTTTTCGCGCGTAG
CAGACCCACACCCATCGACTTCTTCGTCTTCTTTCGTCGGCTCTCTTGCTCCCCTCTCGTCCCAGTCGCGCGACGCCGCCCCGCGAACAAGCCCCGACTGTAGGACCTTC
GGCGCGATTTCGGCAAGCATCGGGGTAGAAAACCCACGCAATTTCCTTCCTTGTTCGTTTGCAGATCGGCCCACGCGAGCTCGACAAGGAATCACATCGAGTCGCATCTC
TCCAGCCTTCATTGCGCTCGATTTGGGTTTCCAGCCAGCTGCAAGTTCGTTTTCTTCTTCTTTTCGGCTCTTTCGACCACTAAACAGCAGGGATCCGAGTCTGTTCAACG
AGTTTCGGTTCACGACGATACCAGGGGATTCGAATAAGTTCAAGGGTGGTTCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAGAGACCCAATTGTGCTGAAGCGGGCTTGAATTATGACACCAAAAGCCAAAGATTGAAGCCCCAAACCAAGAAGAAGCGAGACCAAGCCGAGAAAAGGAGTAGGAA
GTCGAGTCAAGAGGTTGGCGTCTCGACGCCGATCTTGGCGTTGATGTGTAGAAATTGGTTGAGGAGCTGCGAAATAGGTGCTGGAAGGTTGTCCAGTCAGTTTCGGGGTC
AAAACAACCGGTTAGGTGCGAAGAAGAATCGTCAACAACGACCCGCGTGCGACTCGGAACGGCCTCTGTTTTTCCAGAAATTCTGTCGCATTTCATTTTTGCCTTCCCAG
AAATGCGGTCGCATTTTTGGGGAATTCCAGCTTCTCCTCTGGGATATTCTTGAAGTCTGGGGTAAGGATGAAGACAGATGGATTTCAATTTCGTATGAAAAGCTTCCAAA
TTTTTGCTATGGATGTGGTCGTCTAGGTCATATCATAAATGATTGTGAGGAGCAATGTGGAGCTACTGAGGAAGAGCTCCCTTATGGACCTTGGTTACGTGAACCAGTTA
AATTAAAAATCAGTGATGGCATTCACTCCCCCAGGCCTACTATGTGTGGTAATCCTTCCCTGGAATCATCTAATGCTCCAGCCATCCCCAAACCGTCGATGTCCACTCAA
CCGCCGGGGTCAGTGAGGTCCGTTGCGGCGGAGAACACAGAGATGGAAAGAGAGTCAGTTTCCAAAATTAATGGGAGCGGTTATGGTAAAGAATCTGGAATGGAAACTGG
CGGTGATTTAGCCCTGATACAGGAAGGCGATGATTTTGTGGTTAATTGTCTCAACGGTCATATTCTAGATAACATCTTAATGGAGGTGGACCAGGCTTTTGTTGGGTTTT
ATGCCCTAAAACTCGTAGATAGTGGATCCTCTCCCATTCTTCTTCTTCGTTGGTACCCGATCGACAGCTTCCTCGTGCCCCATCTCCGGCGACGTTCTTTAGCACGCTCC
GACCAACAACAGGCCTCACGACGAGCTCCACGGCTGCATGCATTCCCTCCGACGAGTCTCGGCGGTGCAGCGGCATTCGCAGTTGGTGTTTTCGTCCGAGTTCTAGTGCA
GCAGCAACGTGCTCGTCGTGGCAGCAGCTTCGTCTTCGCTGTGAGTGTTGGCAGCAGCTACTCCGGCGAACTCACGACGGCGCAGCTTCAGTTTAGCGACGGTGGCGTCT
TCCTCGGCGAGTCAGTGAGCTTCAACGTGCGGCTGCAACAAGCGGGGTCTGCGACTTCCATCGATTTTTTTTCTCCTCAGTTTAGCAACTCGTGGCGTCGTTCAACGACG
ATTTTGGTGAGTTCTCGAATCAGTAGCAGAGTGTTTCCAGCACCTTCTCCGGCGAGCGGCGGCGCAAATAATAGTGTTTTGGGTCGTTTCTCGGCGTTTTTGACGTGTTT
GAGTTGTGGGTCTTTTAGTTCCTTGATTTCCAACAATTTTAGCAATTGGGTAAGTGATTTTCAGCAAGGTCTTGAAGCTAGGGAATTATTGAGATTTTGTAAGATCATTA
AGCCGAGAGATGCTTACGCAAGGCTTAGGGACATGAAATTGATGTACCTAAGTCGAGCTTTTGATGGGACAAATAAAGAGGAAGTGCAACGAAGCCCAAATGACACACAG
TTTGAGAAAACACAAGGCAATCCTCAAGACAGTGGGCTGAAACACAGTGAGGCTAGGCACATTGATGCTACTATCGAGGAGAAAAAGTGGTCTTGGAGATTCACGGGAAT
CTACGAGAACCCTGTCAGAGGATTACACCATGAGACCTGGTCCTTGATGAAAAGATTGAGTGACCAATCCGAGATGCCATGGGTCCTAGGAGGGGGCTTTAATGAAATTA
CAAATCATTTTGAGAAATCGGGTGGGTCGGACAGGCTGGACTCAACTATTAAGGGGTTTCGGGAGGCAATGGATAGTTGTGGAGTTCTTGATGTGGGCTTTTGTGGTCCA
AATTACACTTGGTGTAATAAACATTTTGATCATGATTTGATTTGGGAAAGGTTAGATAGGTTTTTGATTAATCAAGATATGCAGAGTAGATGTGGTAGTTTCAAGGTTTT
TCACTTGGCTCTTCTTGCATCAGATCATAGACCTCTTCTGGCTGAGTGGAAAGAAGAGCCTCCAGACCAAATGCAGTCAGTGTTGAATCATCTTAGGAGATTTGAAGAGA
TCTGGACAAAGTATGATGAGTGCAAAGATATAGTGAAGCAGGTTTGGCAGGAGCATGGTCGGCGGAGTTTTAGGAATTTGACTGAGAAAACCAAGGAGTGCCTTACTCGT
TTAGGTCGGTGGAGCAGATCTCATTATGACAGGTCCATTAAGGGAGCTATTGCAAGGAAAGAAAAAGAAAATCAAAATTCCCTAAGCAGTGGGGTGTTGTGCAATGATGA
GGAAATGAGCAAACTTGAAAAAGAACTTGAAAACCTTCTGGAGGATGATGAGATCTACTGGAAGCAGCGTTCTCATGAAGACTGGCTTCAATGGAGAGATCACAACACGA
AATGGTTCCATATGAGAGCTAGCACTGGAAGGAAGACTAATAGAGTTCGAGGTCTGTTCGATGGGATTGGTGATTGGTTTGATGAGGACGCTGAGATGGCTAGGGTGGCT
AATCTGTACTTCCAGGACCTTTTCCAGTCATCTAGCCCTCATAATGAAGCTATTGAGAGGATTTTGGAGGCTACGCCTGTTTGCATCTCTGAAGACCAAAATCACATGCT
CACAGCAACATTCACAAGGGAGGAGATTTTTTGTGTCATTAGGAGTATGCATCCTACTAGAGCTCCTGGTTCTGATGGAATTCAAGCAATCTTCAACCAAAAATACTGGG
ACATAGTTGAGAAGAATCTGGATCAGATCAACAACACTTATATTGCACTGATCCCAAAGGTTAATGACCCGAAATCCATGAAGGATTTTAGGCCAATTAGTCTACGTTCA
ATGCTCTATAAAGTCATTGCTAAAACTCTTGCTAACAGGTTAAAGAGAGTTTTGAACAACATCATTTTTCCTAGCCAATCTGCATTTATTCCTAGGAGACTCATCACTGA
TAATGCCATCCTTGGGTTCGAATGCATTCACGCTGTAAAAAACAAAAGGCAGGGGAAGGATGGTGTGATAGCTCTTAAGTTAGACATGAGTATAGCTTATGACCGTGTGG
AATGGGTGTATATTAGGGGAATCATGGAGCATATGGGTTTCAGTAGCCGTTGGATTGATCTTATTATGCGATGCGTGGAATCGGTAAGCTTCCAAGTTCTTTTGAATGGG
CTTCCAAGCTCTTTTGAATGGGATTCGAAGAGGTTTTGGTGGGGCTCGGTGTCTTCTGAAAACAAGATCCATTGGAAAAGTTGGCAAAAATTGTGCACTCACAAAGCTCA
TGGAGAGATGGATTTTAGAGATCTCAGTGTTTTTAACCAAGCATTATTAGCAAAACAGAGCTGGCTGCCTATTCGATATCCTCATAGCCTCGTCGCTCGAGTGCTAAGGG
GCCGCTATTATAAGACGGGATCTTTTCTCAAAGCTGGCTTTGGTCATAACCCTTCATATATCTGGCGGAGCATTATTTGGGGGAGGGAGCTTTTCAGGAAAAGAGACATC
TTTCTGGAGGAGGATGCAATTGTTATCTTACATATCCCAATTAAAACTCCTCACCGAATGGACGAGATTATTTGGAATGGGGATTCCAAGGGAGTATTCACGGTTAAAAG
TGCATATCGGTTGGGGATGCAAAAGTGGGATGGACAACTCCTGACTATTGCGAAGGAGGAAGGGGAAGAGACTAACCTGAGCATCGCGCCAAGTCTGATTCAAGAGAATA
GGGAAGCTGCCTCTACTGGAATCTCGGCGAATGTCTGGCTACCTCCACCGTCGGATAAGCTGGTTGGAAGCCCTAGCGGTGGTCGAAGATCTGAGGATGATTTCGCGCCG
CCAAGGTCTTCATCGAGCTTGACTCCATCCAGGCCACGTATTTCGGCTTTGACCGTCCGATCTGATTCCTATCTTGCTCCGTCTACCCTTATACGTTATACCTCATCCCC
AAGGCCGCGTTTCGATTTCTCTATTCTCTCAGCCCAGCCGCCGAACATCCCCTCTGCATTTGGCGTTTTCATCTCAGTCATCTCCACCTCTCTCTCGTTTTCGCGCGTAG
CAGACCCACACCCATCGACTTCTTCGTCTTCTTTCGTCGGCTCTCTTGCTCCCCTCTCGTCCCAGTCGCGCGACGCCGCCCCGCGAACAAGCCCCGACTGTAGGACCTTC
GGCGCGATTTCGGCAAGCATCGGGGTAGAAAACCCACGCAATTTCCTTCCTTGTTCGTTTGCAGATCGGCCCACGCGAGCTCGACAAGGAATCACATCGAGTCGCATCTC
TCCAGCCTTCATTGCGCTCGATTTGGGTTTCCAGCCAGCTGCAAGTTCGTTTTCTTCTTCTTTTCGGCTCTTTCGACCACTAAACAGCAGGGATCCGAGTCTGTTCAACG
AGTTTCGGTTCACGACGATACCAGGGGATTCGAATAAGTTCAAGGGTGGTTCTTGA
Protein sequenceShow/hide protein sequence
MKRPNCAEAGLNYDTKSQRLKPQTKKKRDQAEKRSRKSSQEVGVSTPILALMCRNWLRSCEIGAGRLSSQFRGQNNRLGAKKNRQQRPACDSERPLFFQKFCRISFLPSQ
KCGRIFGEFQLLLWDILEVWGKDEDRWISISYEKLPNFCYGCGRLGHIINDCEEQCGATEEELPYGPWLREPVKLKISDGIHSPRPTMCGNPSLESSNAPAIPKPSMSTQ
PPGSVRSVAAENTEMERESVSKINGSGYGKESGMETGGDLALIQEGDDFVVNCLNGHILDNILMEVDQAFVGFYALKLVDSGSSPILLLRWYPIDSFLVPHLRRRSLARS
DQQQASRRAPRLHAFPPTSLGGAAAFAVGVFVRVLVQQQRARRGSSFVFAVSVGSSYSGELTTAQLQFSDGGVFLGESVSFNVRLQQAGSATSIDFFSPQFSNSWRRSTT
ILVSSRISSRVFPAPSPASGGANNSVLGRFSAFLTCLSCGSFSSLISNNFSNWVSDFQQGLEARELLRFCKIIKPRDAYARLRDMKLMYLSRAFDGTNKEEVQRSPNDTQ
FEKTQGNPQDSGLKHSEARHIDATIEEKKWSWRFTGIYENPVRGLHHETWSLMKRLSDQSEMPWVLGGGFNEITNHFEKSGGSDRLDSTIKGFREAMDSCGVLDVGFCGP
NYTWCNKHFDHDLIWERLDRFLINQDMQSRCGSFKVFHLALLASDHRPLLAEWKEEPPDQMQSVLNHLRRFEEIWTKYDECKDIVKQVWQEHGRRSFRNLTEKTKECLTR
LGRWSRSHYDRSIKGAIARKEKENQNSLSSGVLCNDEEMSKLEKELENLLEDDEIYWKQRSHEDWLQWRDHNTKWFHMRASTGRKTNRVRGLFDGIGDWFDEDAEMARVA
NLYFQDLFQSSSPHNEAIERILEATPVCISEDQNHMLTATFTREEIFCVIRSMHPTRAPGSDGIQAIFNQKYWDIVEKNLDQINNTYIALIPKVNDPKSMKDFRPISLRS
MLYKVIAKTLANRLKRVLNNIIFPSQSAFIPRRLITDNAILGFECIHAVKNKRQGKDGVIALKLDMSIAYDRVEWVYIRGIMEHMGFSSRWIDLIMRCVESVSFQVLLNG
LPSSFEWDSKRFWWGSVSSENKIHWKSWQKLCTHKAHGEMDFRDLSVFNQALLAKQSWLPIRYPHSLVARVLRGRYYKTGSFLKAGFGHNPSYIWRSIIWGRELFRKRDI
FLEEDAIVILHIPIKTPHRMDEIIWNGDSKGVFTVKSAYRLGMQKWDGQLLTIAKEEGEETNLSIAPSLIQENREAASTGISANVWLPPPSDKLVGSPSGGRRSEDDFAP
PRSSSSLTPSRPRISALTVRSDSYLAPSTLIRYTSSPRPRFDFSILSAQPPNIPSAFGVFISVISTSLSFSRVADPHPSTSSSSFVGSLAPLSSQSRDAAPRTSPDCRTF
GAISASIGVENPRNFLPCSFADRPTRARQGITSSRISPAFIALDLGFQPAASSFSSSFRLFRPLNSRDPSLFNEFRFTTIPGDSNKFKGGS