; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0031132 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0031132
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr11:5143203..5153940
RNA-Seq ExpressionLag0031132
SyntenyLag0031132
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_023878301.1 uncharacterized protein LOC111990748 [Quercus suber]3.7e-17736.27Show/hide
Query:  TWSLLKHLRGNLGTPWLVGGDFNAILSHQEKEGGMAKAEAELSG----------------GAAFTWCNGRPGEEVVWERIDRCLSNVAWQELFPVHELTH
        +W LL  L      PWL  GDFN ILS  EK GG  +++ ++ G                G  +TW N + GE  +  R+DR L+   W   F   ++ H
Subjt:  TWSLLKHLRGNLGTPWLVGGDFNAILSHQEKEGGMAKAEAELSG----------------GAAFTWCNGRPGEEVVWERIDRCLSNVAWQELFPVHELTH

Query:  LDFSRSDHRPLLLSMSDSARVVRSAGQRIQRFEETWLQSTGFGEVVSAGWRSEVPVRSPIELASATSRCMSHLSSWGRRKNGNLGQRIQAATANVQQAMG
        L  S  DH  LL  ++D++   R   +R   FE  W +      ++ A W   V + +P  ++     C   LS W     G + ++IQ   + +     
Subjt:  LDFSRSDHRPLLLSMSDSARVVRSAGQRIQRFEETWLQSTGFGEVVSAGWRSEVPVRSPIELASATSRCMSHLSSWGRRKNGNLGQRIQAATANVQQAMG

Query:  RLGSLDSRADLQRAEGQLESLLVEEEVYWRQ-SNGVWQQEPDR-----------------VLGLIE-----------------GYFESIFSTSAPSEGEI
        R    D   ++ R   ++ +LL +EE YW Q +   W +E DR                 ++G+ +                  YF +I+S+S PS  +I
Subjt:  RLGSLDSRADLQRAEGQLESLLVEEEVYWRQ-SNGVWQQEPDR-----------------VLGLIE-----------------GYFESIFSTSAPSEGEI

Query:  DQVTSRVQPSVSDEMNASLVRPFQREEVLRALHQIHPNKAPGPDGLSGAFYQHSWSIVGADVVNCCLNILNNRASLAPLNETMIVLIPKKKNPKCITEYM
        ++VT  +   V++EMN SL+R F +EEV  AL QIHPNKAPGPDG+S  F+Q  WSIVG +V +  LN+LN+   +  LN+T I LIPK  NPK +T++ 
Subjt:  DQVTSRVQPSVSDEMNASLVRPFQREEVLRALHQIHPNKAPGPDGLSGAFYQHSWSIVGADVVNCCLNILNNRASLAPLNETMIVLIPKKKNPKCITEYM

Query:  PISLCNVSYKIVSKVLVNRMKGILNELISHNQSAFIPGRCVVDNAILGYECIHALKARKVGRSGWVSLKLDMSKAYDRVEWVFLERMMLRMGFAVEWVEL
        PISLCNV YK++SK+L NR+K +L  +IS NQSAF   R + DN ++ +E +H L  +  G+ G++++KLDMSKA+DRVEW F+ ++M +MGF   W +L
Subjt:  PISLCNVSYKIVSKVLVNRMKGILNELISHNQSAFIPGRCVVDNAILGYECIHALKARKVGRSGWVSLKLDMSKAYDRVEWVFLERMMLRMGFAVEWVEL

Query:  VLRCISSVRYSFNVNGVRCGDVTPSRGLRQGDPLSPYLFLLCAEGLSSMLHDAEGRRAITGLRIARGCPPISHLFFADDSLLFFRAKEGEAQAVNSILQH
        V++CI+SV YS  +NGV  G++ PSRGLRQGDPLSP LFLLCAEGLS++++ A   + ITG+ I RGCP ++HLFFADDS+LF +A   E   + SIL  
Subjt:  VLRCISSVRYSFNVNGVRCGDVTPSRGLRQGDPLSPYLFLLCAEGLSSMLHDAEGRRAITGLRIARGCPPISHLFFADDSLLFFRAKEGEAQAVNSILQH

Query:  YERASGKTINFDKSIISFSPNTAVGVQSQVSQFFQVQVSACHRQYLGLPSFMPRNRMSTLNFIKERVWRQIQGWKGKLFSVGGRE---------------
        YE ASG+ IN DKS I FSPNTA   + ++        ++ H +YLGLPS + R++      +KE+V  ++ GWKGKL S+GG+E               
Subjt:  YERASGKTINFDKSIISFSPNTAVGVQSQVSQFFQVQVSACHRQYLGLPSFMPRNRMSTLNFIKERVWRQIQGWKGKLFSVGGRE---------------

Query:  ----------------------NEDTEDRRIHWVSWKTLCKPKCLSGMGFRDLETFNQALLAKQCWRIVQHPTSFLSRVLKERYFPHGDFLEA-------
                               +  ++ ++ W+SWK +C  K   G+GFR+L+ FN A+LAKQ WRI+ +P S + RVLK RYFP GD L A       
Subjt:  ----------------------NEDTEDRRIHWVSWKTLCKPKCLSGMGFRDLETFNQALLAKQCWRIVQHPTSFLSRVLKERYFPHGDFLEA-------

Query:  -----------------------------------------------------------------GWNEELLRHHFSPGEVRSILTIPLRQVSAEDKIIW
                                                                          W  E LR  F P EV +IL IPL     EDK+IW
Subjt:  -----------------------------------------------------------------GWNEELLRHHFSPGEVRSILTIPLRQVSAEDKIIW

Query:  HFEKCGIYTVKSGYRLGQVAL-LAQVPSASSSEALSSWWKGCWKMEIPSKIKVFLWRLCLERLPTVDNLSVRGIDVLNVCVHCGWHGESCMHVFWQCK
           K G ++VKS Y +    +   +    S+ +     WK  W + +P KIK+F WR C++ LPT DN+S RGI   + C  CG   E   H    C+
Subjt:  HFEKCGIYTVKSGYRLGQVAL-LAQVPSASSSEALSSWWKGCWKMEIPSKIKVFLWRLCLERLPTVDNLSVRGIDVLNVCVHCGWHGESCMHVFWQCK

XP_023881891.1 uncharacterized protein LOC111994244 [Quercus suber]2.4e-17633.02Show/hide
Query:  NGRPGEEVVWERIDRCLSNVAWQELFPVHELTHLDFSRSDHRPLLLSMSDSARVVRSAGQRIQRFEETWLQSTGFGEVVSAGWRSEVPVRSPIELASATS
        N R  E  V+ R+DR L+   W + +   ++ HL  S SDH  LL+  +D+  V + A +R  +FE  W +     +++   W S   V SP  +A+   
Subjt:  NGRPGEEVVWERIDRCLSNVAWQELFPVHELTHLDFSRSDHRPLLLSMSDSARVVRSAGQRIQRFEETWLQSTGFGEVVSAGWRSEVPVRSPIELASATS

Query:  RCMSHLSSWGRRKNGNLGQRIQ--AATANVQQAMGRLGSLDSRADLQRAEGQLESLLVEEEVYWRQ----------------------------------
         C  +LS W +   GN+ ++IQ    T N      R GSL    ++ R E  +  LL  EE+ W+Q                                  
Subjt:  RCMSHLSSWGRRKNGNLGQRIQ--AATANVQQAMGRLGSLDSRADLQRAEGQLESLLVEEEVYWRQ----------------------------------

Query:  -SNGVWQQEPDRVLGLIEGYFESIFSTSAPSEGEIDQVTSRVQPSVSDEMNASLVRPFQREEVLRALHQIHPNKAPGPDGLSGAFYQHSWSIVGADVVNC
          NG WQ   + +  +   YF++I+S+S P+   I +V   +  +V++EMN SL++ F REE+  AL+Q+HP KAPGPDG+S  F+Q  W+IVG D+V  
Subjt:  -SNGVWQQEPDRVLGLIEGYFESIFSTSAPSEGEIDQVTSRVQPSVSDEMNASLVRPFQREEVLRALHQIHPNKAPGPDGLSGAFYQHSWSIVGADVVNC

Query:  CLNILNNRASLAPLNETMIVLIPKKKNPKCITEYMPISLCNVSYKIVSKVLVNRMKGILNELISHNQSAFIPGRCVVDNAILGYECIHALKARKVGRSGW
         L++LN+  S+  +N+T I L+PK KNP  ++++ PISLCNV YK++SKVL NR+K IL ++IS NQSAF+ GR + DN ++ +E +H L+ +K G+ G+
Subjt:  CLNILNNRASLAPLNETMIVLIPKKKNPKCITEYMPISLCNVSYKIVSKVLVNRMKGILNELISHNQSAFIPGRCVVDNAILGYECIHALKARKVGRSGW

Query:  VSLKLDMSKAYDRVEWVFLERMMLRMGFAVEWVELVLRCISSVRYSFNVNGVRCGDVTPSRGLRQGDPLSPYLFLLCAEGLSSMLHDAEGRRAITGLRIA
         ++KLDMSKAYDRVEW F++++M +MGF  +W++LV+ CI+SV YS  VNG   G +TP+RGLRQGDP+SPY+FLLCA+G SS+L+D   +  I+G+ I 
Subjt:  VSLKLDMSKAYDRVEWVFLERMMLRMGFAVEWVELVLRCISSVRYSFNVNGVRCGDVTPSRGLRQGDPLSPYLFLLCAEGLSSMLHDAEGRRAITGLRIA

Query:  RGCPPISHLFFADDSLLFFRAKEGEAQAVNSILQHYERASGKTINFDKSIISFSPNTAVGVQSQVSQFFQVQVSACHRQYLGLPSFMPRNRMSTLNFIKE
        RGCP I+HLFFADDSLLF +A   E Q +  ILQ YE ASG+ IN DKS + FS NT    + +V +         H++YLGLPS + ++++     +KE
Subjt:  RGCPPISHLFFADDSLLFFRAKEGEAQAVNSILQHYERASGKTINFDKSIISFSPNTAVGVQSQVSQFFQVQVSACHRQYLGLPSFMPRNRMSTLNFIKE

Query:  RVWRQIQGWKGKLFSVGGRE-------------------------------------NEDTEDRRIHWVSWKTLCKPKCLSGMGFRDLETFNQALLAKQC
        RV R++ GWK KL SVGGRE                                      +  ++ +I WVSWK LCK K   GMGFR+L+ FN A+LAKQ 
Subjt:  RVWRQIQGWKGKLFSVGGRE-------------------------------------NEDTEDRRIHWVSWKTLCKPKCLSGMGFRDLETFNQALLAKQC

Query:  WRIVQHPTSFLSRVLKERYFPHGDFLEA------------------------------------------------------------------------
        WR++ +P S ++++ K RY+PHGD  +A                                                                        
Subjt:  WRIVQHPTSFLSRVLKERYFPHGDFLEA------------------------------------------------------------------------

Query:  GWNEELLRHHFSPGEVRSILTIPLRQVSAEDKIIWHFEKCGIYTVKSGYRLG-QVALLAQVPSASSSEALSSWWKGCWKMEIPSKIKVFLWRLCLERLPT
         W ++++R  F P E R+IL+IPL     ED+IIW   + G ++VKS Y +   V    +V  +SS ++ S  W+  W + IP K+++F W++C+  LPT
Subjt:  GWNEELLRHHFSPGEVRSILTIPLRQVSAEDKIIWHFEKCGIYTVKSGYRLG-QVALLAQVPSASSSEALSSWWKGCWKMEIPSKIKVFLWRLCLERLPT

Query:  VDNLSVRGIDVLNVCVHCGWHGESCMHVFWQCKFFRDALMGSEWEVLLQNVQANSMLNLLRDVKDKKFR-------------------------------
          NL  +G+++ +VC  CG   ES +H+F +C+  +       W   L N     ++N+  D+ D   +                               
Subjt:  VDNLSVRGIDVLNVCVHCGWHGESCMHVFWQCKFFRDALMGSEWEVLLQNVQANSMLNLLRDVKDKKFR-------------------------------

Query:  -GHVPSAGLVDWATNYLAVFRGASRACCE-----------DTRGV-------------PRTGAGIVVRDEMGRVMLSAAVSHDHVGNSDLVEGLAVVDGL
           VP   +  +A  Y+  F+ AS   C+              GV               +  G+++RD  G V  +         + + VE LA+  GL
Subjt:  -GHVPSAGLVDWATNYLAVFRGASRACCE-----------DTRGV-------------PRTGAGIVVRDEMGRVMLSAAVSHDHVGNSDLVEGLAVVDGL

Query:  RLVVEMGL-----APDA----AMVDLSEFGVLVSEARRGVPAHFQ----LRVSVTRREGNRVAHELGRLALRERD
         L  E  L       DA    + V+ +E    +    +G+ +        +++  +RE N+ AHEL + A  + D
Subjt:  RLVVEMGL-----APDA----AMVDLSEFGVLVSEARRGVPAHFQ----LRVSVTRREGNRVAHELGRLALRERD

XP_023914298.1 uncharacterized protein LOC112025844 [Quercus suber]1.4e-17331.63Show/hide
Query:  KGRTWSLLKHLRGNLGTPWLVGGDFNAILSHQEKEGGMAKAEAELSG----------------GAAFTWCNGRPGEEVVWERIDRCLSNVAWQELFPVHE
        K  TW+L++ L   +  PWL  GDFN IL   EK G   + E+ +                  G  FTW   R G  +V ER+DR +++ AW  LFP  +
Subjt:  KGRTWSLLKHLRGNLGTPWLVGGDFNAILSHQEKEGGMAKAEAELSG----------------GAAFTWCNGRPGEEVVWERIDRCLSNVAWQELFPVHE

Query:  LTHLDFSRSDHRPLLLSMSDSARVVRSAGQRIQRFEETWLQSTGFGEVVSAGWRSEV-PVRSPIELASATSRCMSHLSSWGRRKNGNLGQRIQA-----A
        + HL+   SDH+ +++++      +     R  +FE+ WL+  G  E + + W S   P   P+ +A    +C   L+ W +   G + + I++     +
Subjt:  LTHLDFSRSDHRPLLLSMSDSARVVRSAGQRIQRFEETWLQSTGFGEVVSAGWRSEV-PVRSPIELASATSRCMSHLSSWGRRKNGNLGQRIQA-----A

Query:  TANVQQAMGRLGSLDSRADLQRAEGQLESLLVEEEVYWRQ-----------------------------------SNGVWQQEPDRVLGLIEGYFESIFS
         A    AMG    L    ++ + + +L  LL +E + W+Q                                   S   W  +  +V+ +   YF S+F+
Subjt:  TANVQQAMGRLGSLDSRADLQRAEGQLESLLVEEEVYWRQ-----------------------------------SNGVWQQEPDRVLGLIEGYFESIFS

Query:  TSAPSEGEIDQVTSRVQPSVSDEMNASLVRPFQREEVLRALHQIHPNKAPGPDGLSGAFYQHSWSIVGADVVNCCLNILNNRASLAPLNETMIVLIPKKK
        TS PS  E+  V   V+PSV+ EMNA L+ PF +EEV  AL+Q+    APGPDG+   FY   W+++G +V +  L+ LNN    + +N T I LIPK K
Subjt:  TSAPSEGEIDQVTSRVQPSVSDEMNASLVRPFQREEVLRALHQIHPNKAPGPDGLSGAFYQHSWSIVGADVVNCCLNILNNRASLAPLNETMIVLIPKKK

Query:  NPKCITEYMPISLCNVSYKIVSKVLVNRMKGILNELISHNQSAFIPGRCVVDNAILGYECIHALKARKVGRSGWVSLKLDMSKAYDRVEWVFLERMMLRM
        +P+ +++Y PISLCNV YK+VSKVL NR K +L  +IS NQSAF  GR + DN ++ YE +H +K  + G+SG+++LKLDMSKAYDRVEWVF+E MM ++
Subjt:  NPKCITEYMPISLCNVSYKIVSKVLVNRMKGILNELISHNQSAFIPGRCVVDNAILGYECIHALKARKVGRSGWVSLKLDMSKAYDRVEWVFLERMMLRM

Query:  GFAVEWVELVLRCISSVRYSFNVNGVRCGDVTPSRGLRQGDPLSPYLFLLCAEGLSSMLHDAEGRRAITGLRIARGCPPISHLFFADDSLLFFRAKEGEA
        GF   W+ L+L CIS+V YS  +NGV    + PSRGLRQGDPLSPYLFL+C+EGL  ++  A   R I G+ I +  P ++HLFFADDSL+F RA   E 
Subjt:  GFAVEWVELVLRCISSVRYSFNVNGVRCGDVTPSRGLRQGDPLSPYLFLLCAEGLSSMLHDAEGRRAITGLRIARGCPPISHLFFADDSLLFFRAKEGEA

Query:  QAVNSILQHYERASGKTINFDKSIISFSPNTAVGVQSQVSQFFQVQVSACHRQYLGLPSFMPRNRMSTLNFIKERVWRQIQGWKGKLFSVGGRE------
        + + ++L  YE+ASG+ +N +K+ + FS +T    Q+Q+     V V   + +YLGLPSF+ +N+ ++L FIKERV  ++QGWK +L S  GRE      
Subjt:  QAVNSILQHYERASGKTINFDKSIISFSPNTAVGVQSQVSQFFQVQVSACHRQYLGLPSFMPRNRMSTLNFIKERVWRQIQGWKGKLFSVGGRE------

Query:  -------------------------------NEDTEDRRIHWVSWKTLCKPKCLSGMGFRDLETFNQALLAKQCWRIVQHPTSFLSRVLKERYFPHGDFL
                                        +    R+IHW  W +LC PK   GMGF++L+ FN A+LAKQ WR++++  S   +  K ++FP+G  L
Subjt:  -------------------------------NEDTEDRRIHWVSWKTLCKPKCLSGMGFRDLETFNQALLAKQCWRIVQHPTSFLSRVLKERYFPHGDFL

Query:  EAG-----------------------------------------------------------------------WNEELLRHHFSPGEVRSILTIPLRQV
        +A                                                                        WNE ++ + F P +   I  IPL   
Subjt:  EAG-----------------------------------------------------------------------WNEELLRHHFSPGEVRSILTIPLRQV

Query:  SAEDKIIWHFEKCGIYTVKSGYRLGQVALLAQVPSASSSEALSSWWKGCWKMEIPSKIKVFLWRLCLERLPTVDNLSVRGIDVLNVCVHCGWHGESCMHV
        + +D + W     GI++VKSGY+L   + L    + S        WKG W + IP+++K  +WR  L+ LPT  NL  R +   + C HC  + ES +H 
Subjt:  SAEDKIIWHFEKCGIYTVKSGYRLGQVALLAQVPSASSSEALSSWWKGCWKMEIPSKIKVFLWRLCLERLPTVDNLSVRGIDVLNVCVHCGWHGESCMHV

Query:  FWQC-------KFFRDALMGSEW------EVLLQNVQANSMLNLLRDVKD---------KKFRGHVPSAGLVDWATNYLAVFRGASRACCEDTRGVP---
         W C       K   + L+   W      +V    ++++ +L+L   +           +      P   +   A   L  FR AS      T  V    
Subjt:  FWQC-------KFFRDALMGSEW------EVLLQNVQANSMLNLLRDVKD---------KKFRGHVPSAGLVDWATNYLAVFRGASRACCEDTRGVP---

Query:  ----------------------RTGAGIVVRDEMGRVMLSAAVSHDHVGNSDLVEGLAVVDGLRLVVEMGLAPDAAMVD----------------LSEFG
                                G G V+R+E G VM +   +     + ++VE LA    + L  E+    D  +V+                 S FG
Subjt:  ----------------------RTGAGIVVRDEMGRVMLSAAVSHDHVGNSDLVEGLAVVDGLRLVVEMGLAPDAAMVD----------------LSEFG

Query:  VLVSEARRGVPAHFQLRVSVTRREGNRVAHELGRLA
         ++ +      A   +  S TRR GN++AH L R A
Subjt:  VLVSEARRGVPAHFQLRVSVTRREGNRVAHELGRLA

XP_030923330.1 uncharacterized protein LOC115950239 [Quercus lobata]1.2e-17532.45Show/hide
Query:  KGRTWSLLKHLRGNLGTPWLVGGDFNAILSHQEKEGGMAKAEAELSG----------------GAAFTWCNGRPGEEVVWERIDRCLSNVAWQELFPVHE
        K  +W LLKHL+  +  PW+V GDFNA L   EK        +++                  G  +TW N RPGE     R+DR ++N  W + F +  
Subjt:  KGRTWSLLKHLRGNLGTPWLVGGDFNAILSHQEKEGGMAKAEAELSG----------------GAAFTWCNGRPGEEVVWERIDRCLSNVAWQELFPVHE

Query:  LTHLDFSRSDHRPLLLSMSDSARVVRSAGQRIQRFEETWLQSTGFGEVVSAGWRSEVPVRSPI-ELASATSRCMSHLSSWGRRKNGNLGQRIQAATANVQ
        + HL    SDH PLLL +   ++  +  G R  +FEE+WL       V+   W +    R  +  +      C   L +WG     ++      A   +Q
Subjt:  LTHLDFSRSDHRPLLLSMSDSARVVRSAGQRIQRFEETWLQSTGFGEVVSAGWRSEVPVRSPI-ELASATSRCMSHLSSWGRRKNGNLGQRIQAATANVQ

Query:  QAMGRLGSLD----SRADLQRAEGQLESLLVEEEVYW-----------------------------------RQSNGVWQQEPDRVLGLIEGYFESIFST
        + + RL   +    S+A+      +++ LL ++E+YW                                   R S G W +  + V  +   YF+++F  
Subjt:  QAMGRLGSLD----SRADLQRAEGQLESLLVEEEVYW-----------------------------------RQSNGVWQQEPDRVLGLIEGYFESIFST

Query:  SAPSEGEIDQVTSRVQPSVSDEMNASLVRPFQREEVLRALHQIHPNKAPGPDGLSGAFYQHSWSIVGADVVNCCLNILNNRASLAPLNETMIVLIPKKKN
         A    ++++    V   V+++M   L   F  EEV  AL Q+ P KAPGPDG++  FYQ  W IVG  VV+  L+ LNN   L  +N T IVLIPK +N
Subjt:  SAPSEGEIDQVTSRVQPSVSDEMNASLVRPFQREEVLRALHQIHPNKAPGPDGLSGAFYQHSWSIVGADVVNCCLNILNNRASLAPLNETMIVLIPKKKN

Query:  PKCITEYMPISLCNVSYKIVSKVLVNRMKGILNELISHNQSAFIPGRCVVDNAILGYECIHALKARKVGRSGWVSLKLDMSKAYDRVEWVFLERMMLRMG
        P+ ++E+ PISLCNV YKI+SKVL NR+K +L ++IS  QSAF+PGR + DN ++ YE +H + ARK G+ G V+LKLD+SKAYDRVEW FL+ +M +MG
Subjt:  PKCITEYMPISLCNVSYKIVSKVLVNRMKGILNELISHNQSAFIPGRCVVDNAILGYECIHALKARKVGRSGWVSLKLDMSKAYDRVEWVFLERMMLRMG

Query:  FAVEWVELVLRCISSVRYSFNVNGVRCGDVTPSRGLRQGDPLSPYLFLLCAEGLSSMLHDAEGRRAITGLRIARGCPPISHLFFADDSLLFFRAKEGEAQ
        F   W+E V+ C+++  +S  VNG     + PSRG+RQGDP+SPYLFLLCAEGL+++L+ AE    ITG+ I RG P I++L FADDSLLF +A   E +
Subjt:  FAVEWVELVLRCISSVRYSFNVNGVRCGDVTPSRGLRQGDPLSPYLFLLCAEGLSSMLHDAEGRRAITGLRIARGCPPISHLFFADDSLLFFRAKEGEAQ

Query:  AVNSILQHYERASGKTINFDKSIISFSPNTAVGVQSQVSQFFQVQVSACHRQYLGLPSFMPRNRMSTLNFIKERVWRQIQGWKGKLFSVGGRE-------
         +  ILQ YERASG++IN +KS   FS NT+ G + Q+ +   V+      +YLGLP+ + R + +T + +K+RVW+++QGWKG L S  G+E       
Subjt:  AVNSILQHYERASGKTINFDKSIISFSPNTAVGVQSQVSQFFQVQVSACHRQYLGLPSFMPRNRMSTLNFIKERVWRQIQGWKGKLFSVGGRE-------

Query:  ------------------------------NEDTEDRRIHWVSWKTLCKPKCLSGMGFRDLETFNQALLAKQCWRIVQHPTSFLSRVLKERYFPHGDFLE
                                       +   +R+IHW SW  L  PK   GMGFRDL  FN A+LAKQ WR+VQ   S L R  K RYFP   FLE
Subjt:  ------------------------------NEDTEDRRIHWVSWKTLCKPKCLSGMGFRDLETFNQALLAKQCWRIVQHPTSFLSRVLKERYFPHGDFLE

Query:  AG------------------------------------------------------------------------WNEELLRHHFSPGEVRSILTIPLRQV
        A                                                                         WN E +R  F   E  +I  IPL + 
Subjt:  AG------------------------------------------------------------------------WNEELLRHHFSPGEVRSILTIPLRQV

Query:  SAEDKIIWHFEKCGIYTVKSGYRLGQVALL-AQVPSASSSEALSSWWKGCWKMEIPSKIKVFLWRLCLERLPTVDNLSVRGIDVLNVCVHCGWHGESCMH
           D I W +   G+++VKS Y + +  L  A     S      + W   WK+ +P+K+KVF WR C E LPT  NL+ R I   + C  C    ES +H
Subjt:  SAEDKIIWHFEKCGIYTVKSGYRLGQVALL-AQVPSASSSEALSSWWKGCWKMEIPSKIKVFLWRLCLERLPTVDNLSVRGIDVLNVCVHCGWHGESCMH

Query:  VFWQCKFFRDALMGSEWEVLLQNVQANSMLNLLRDVKDKKFRGHVPSAGLVDW------------------------ATNYLAVFRGA-SRACCEDTR--
          W C   +D   GS  ++         M+ L+ ++ ++  +  +       W                        A  Y+  FR A +R   + T+  
Subjt:  VFWQCKFFRDALMGSEWEVLLQNVQANSMLNLLRDVKDKKFRGHVPSAGLVDW------------------------ATNYLAVFRGA-SRACCEDTR--

Query:  -----------------------GVPRTGAGIVVRDEMGRVMLSAAVSHDHVGNSDLVEGLAVVDGLRLVVEMGL--------------APDAAMVDLSE
                                + RTG G ++R+E G VM + + S   V NSD  E LA    L   V+ G               A  +++V+ S 
Subjt:  -----------------------GVPRTGAGIVVRDEMGRVMLSAAVSHDHVGNSDLVEGLAVVDGLRLVVEMGL--------------APDAAMVDLSE

Query:  FGVLVSEARRGVPAHFQLRVSVTRREGNRVAHELGRLALR--ERDCGW
        FG ++ +    + +  ++ V  TRR GN+VAH L + A    E D  W
Subjt:  FGVLVSEARRGVPAHFQLRVSVTRREGNRVAHELGRLALR--ERDCGW

XP_030946812.1 uncharacterized protein LOC115971195 [Quercus lobata]7.6e-17532.33Show/hide
Query:  TWSLLKHLRGNLGTPWLVGGDFNAILSHQEKEGGMAKAEAELSG----------------GAAFTWCNGRPGEEVVWERIDRCLSNVAWQELFPVHELTH
        +W+LLKHL   +  PW+  GDFN I   +EK+GG  + E+++                  G  FTWCN +   EV W R+DR ++  +W +LFP   + H
Subjt:  TWSLLKHLRGNLGTPWLVGGDFNAILSHQEKEGGMAKAEAELSG----------------GAAFTWCNGRPGEEVVWERIDRCLSNVAWQELFPVHELTH

Query:  LDFSRSDHRPLLLSMSDSARVVRSAGQRIQRFEETWLQSTGFGEVVSAGWRSEVPVRSPIELASATSRCMSHLSSWGRRKNGNLGQRIQAATANVQQAMG
        +  + SDH PL L  SD   V      R  RFE  WL+      V+   W +        +L      C S L +W R   GN+ + +      +  A  
Subjt:  LDFSRSDHRPLLLSMSDSARVVRSAGQRIQRFEETWLQSTGFGEVVSAGWRSEVPVRSPIELASATSRCMSHLSSWGRRKNGNLGQRIQAATANVQQAMG

Query:  RLGSLDSRADLQRAEGQLESLLVEEEVYWRQSNGV-WQQEPD-------------------RVLGLIEG---------------YFESIFSTSAPSEGEI
           + D    ++    ++  L+V+EE  W Q + V W +  D                     L L +G               YF+ IF+++ PS    
Subjt:  RLGSLDSRADLQRAEGQLESLLVEEEVYWRQSNGV-WQQEPD-------------------RVLGLIEG---------------YFESIFSTSAPSEGEI

Query:  DQVTSRVQPSVSDEMNASLVRPFQREEVLRALHQIHPNKAPGPDGLSGAFYQHSWSIVGADVVNCCLNILNNRASLAPLNETMIVLIPKKKNPKCITEYM
        DQ+   +   V+  MNA L R F  +EV  AL Q+ P  APGPDG+S  FY+  W+ +G DV++  L ILN+    A LN T I LIPK K+P+  T++ 
Subjt:  DQVTSRVQPSVSDEMNASLVRPFQREEVLRALHQIHPNKAPGPDGLSGAFYQHSWSIVGADVVNCCLNILNNRASLAPLNETMIVLIPKKKNPKCITEYM

Query:  PISLCNVSYKIVSKVLVNRMKGILNELISHNQSAFIPGRCVVDNAILGYECIHALKARKVGRSGWVSLKLDMSKAYDRVEWVFLERMMLRMGFAVEWVEL
        PISLCNV YKIVSK + NR+K +L +L+S +QSAF+  R + DN ++ +E +H LK +  G++G++++KLDMSKAYDRVEW FLE++M ++GF   W+ L
Subjt:  PISLCNVSYKIVSKVLVNRMKGILNELISHNQSAFIPGRCVVDNAILGYECIHALKARKVGRSGWVSLKLDMSKAYDRVEWVFLERMMLRMGFAVEWVEL

Query:  VLRCISSVRYSFNVNGVRCGDVTPSRGLRQGDPLSPYLFLLCAEGLSSMLHDAEGRRAITGLRIARGCPPISHLFFADDSLLFFRAKEGEAQAVNSILQH
        V  CI SV +S  VNG   G+ TP+RGLRQGDPLSPYLFLLCAEGL S++   E   +I G+ +    P +SHLFFADDSLLF RA   E  ++  IL+ 
Subjt:  VLRCISSVRYSFNVNGVRCGDVTPSRGLRQGDPLSPYLFLLCAEGLSSMLHDAEGRRAITGLRIARGCPPISHLFFADDSLLFFRAKEGEAQAVNSILQH

Query:  YERASGKTINFDKSIISFSPNTAVGVQSQVSQFFQVQVSACHRQYLGLPSFMPRNRMSTLNFIKERVWRQIQGWKGKLFSVGGRE---------------
        YE ASG+ IN +K+ + FSPNT   VQ ++     V  +  + +YLGLPSF+ R +  +  +I+ERVW+++QGWK +L S GGRE               
Subjt:  YERASGKTINFDKSIISFSPNTAVGVQSQVSQFFQVQVSACHRQYLGLPSFMPRNRMSTLNFIKERVWRQIQGWKGKLFSVGGRE---------------

Query:  ----------------------NEDTEDRRIHWVSWKTLCKPKCLSGMGFRDLETFNQALLAKQCWRIVQHPTSFLSRVLKERYFPHGDFLEAG------
                                  E R+IHWV WK LCK K   G+GF+D+E FN A+L KQ WR++ +  S   +V K +YFP+   L+ G      
Subjt:  ----------------------NEDTEDRRIHWVSWKTLCKPKCLSGMGFRDLETFNQALLAKQCWRIVQHPTSFLSRVLKERYFPHGDFLEAG------

Query:  ------------------------------------------------------------------WNEELLRHHFSPGEVRSILTIPLRQVSAEDKIIW
                                                                          W E+ +R  F P E  +IL++PL     ED++IW
Subjt:  ------------------------------------------------------------------WNEELLRHHFSPGEVRSILTIPLRQVSAEDKIIW

Query:  HFEKCGIYTVKSGYRLGQVALLAQVPSASSSEALSSWWKGCWKMEIPSKIKVFLWRLCLERLPTVDNLSVRGIDVLNVCVHCGWHGESCMHVFWQCKF--
             G YT KS YRL   A  A  PS+S+S A   +W+  W + +P+KI+ FLWR   + LP   NL  R I    +C  CG   E  +H  W C+   
Subjt:  HFEKCGIYTVKSGYRLGQVALLAQVPSASSSEALSSWWKGCWKMEIPSKIKVFLWRLCLERLPTVDNLSVRGIDVLNVCVHCGWHGESCMHVFWQCKF--

Query:  -------------------FRDALMG------------------SEW----------------EVLLQNVQANSMLNLLRDVKDKKFRGHVPSAGLVDWA
                           F D L G                  S W                ++    V+     + +R+ +  +   H P+  L    
Subjt:  -------------------FRDALMG------------------SEW----------------EVLLQNVQANSMLNLLRDVKDKKFRGHVPSAGLVDWA

Query:  TNYLAVFRGASRACCEDTRGVPRTGAGIVVRDEMGRVMLSAAVSHDHVGNSDLVEGLAVVDGLRLVVEMGLAP--------------DAAMVDLSEFGVL
        + Y   F GA+         +   G G+VVRD  G V+ + +           +E LA    +    E+GL                 A    +S FG +
Subjt:  TNYLAVFRGASRACCEDTRGVPRTGAGIVVRDEMGRVMLSAAVSHDHVGNSDLVEGLAVVDGLRLVVEMGLAP--------------DAAMVDLSEFGVL

Query:  VSEARRGVPAHFQLRVSVTRREGNRVAHELGRLA
        + E+R    +      + T+R+GN VA +L +LA
Subjt:  VSEARRGVPAHFQLRVSVTRREGNRVAHELGRLA

TrEMBL top hitse value%identityAlignment
A0A2N9F6L9 Reverse transcriptase domain-containing protein2.3e-18532.9Show/hide
Query:  SPELKGR--TWSLLKHLRGNLGTPWLVGGDFNAILSHQEKEGGMAKAEAELS----------------GGAAFTWCNGRPGEEVVWERIDRCLSNVAWQE
        +PE+  R  TW L++ L G     W   GDFN I+   E  G  ++ + ++                  G  FTWCN R      W R+DR + N+ W E
Subjt:  SPELKGR--TWSLLKHLRGNLGTPWLVGGDFNAILSHQEKEGGMAKAEAELS----------------GGAAFTWCNGRPGEEVVWERIDRCLSNVAWQE

Query:  LFPVHELTHLDFSRSDHRPLLLSMSDSARVVRSAGQRIQRFEETWLQSTGFGEVVSAGWRSEVPVRSPIELASATSRCMSHLSSWGRRKNGNLGQRIQAA
         FP   + HLD  +SDH+ L L+    +   R   ++  RFEE W+  +G  + +   W S+    +  +++     C   L SW R   GN+G++I+A 
Subjt:  LFPVHELTHLDFSRSDHRPLLLSMSDSARVVRSAGQRIQRFEETWLQSTGFGEVVSAGWRSEVPVRSPIELASATSRCMSHLSSWGRRKNGNLGQRIQAA

Query:  TANVQQAMGRLGSLDSRADLQRAEGQLESLLVEEEVYWRQ-SNGVWQQEPD-----------------RVLGL----------IEG-------YFESIFS
           ++QA        S  +LQ    +L SL  +EE  WRQ S  +W    D                 R+LGL           EG       Y+ S+F+
Subjt:  TANVQQAMGRLGSLDSRADLQRAEGQLESLLVEEEVYWRQ-SNGVWQQEPD-----------------RVLGL----------IEG-------YFESIFS

Query:  TSAPSEGEIDQVTSRVQPSVSDEMNASLVRPFQREEVLRALHQIHPNKAPGPDGLSGAFYQHSWSIVGADVVNCCLNILNNRASLAPLNETMIVLIPKKK
        T  P   +I+ V ++V   V+++MN +L+R F   EV  AL Q+ P KAPGPDG+   FYQ  W +VG+DV    L+ LN+   L  +N T I LIPK K
Subjt:  TSAPSEGEIDQVTSRVQPSVSDEMNASLVRPFQREEVLRALHQIHPNKAPGPDGLSGAFYQHSWSIVGADVVNCCLNILNNRASLAPLNETMIVLIPKKK

Query:  NPKCITEYMPISLCNVSYKIVSKVLVNRMKGILNELISHNQSAFIPGRCVVDNAILGYECIHALKARKVGRSGWVSLKLDMSKAYDRVEWVFLERMMLRM
        NP+ +TE+ PISLCNV YK++SKVL NR+K IL +++S +QSAF+PGR + DN ++ +E +H +   K+GR G ++LKLDMSKAYDRVEW+FLE++M ++
Subjt:  NPKCITEYMPISLCNVSYKIVSKVLVNRMKGILNELISHNQSAFIPGRCVVDNAILGYECIHALKARKVGRSGWVSLKLDMSKAYDRVEWVFLERMMLRM

Query:  GFAVEWVELVLRCISSVRYSFNVNGVRCGDVTPSRGLRQGDPLSPYLFLLCAEGLSSMLHDAEGRRAITGLRIARGCPPISHLFFADDSLLFFRAKEGEA
        GF  +W+ L+  CIS+V YS  VNG   G + PSRGLRQGDPLSPYLFLLCAEGL S++  A     I G+ + R  P I+HLFFADDSLLF +A     
Subjt:  GFAVEWVELVLRCISSVRYSFNVNGVRCGDVTPSRGLRQGDPLSPYLFLLCAEGLSSMLHDAEGRRAITGLRIARGCPPISHLFFADDSLLFFRAKEGEA

Query:  QAVNSILQHYERASGKTINFDKSIISFSPNTAVGVQSQVSQFFQVQVSACHRQYLGLPSFMPRNRMSTLNFIKERVWRQIQGWKGKLFSVGGRE------
          +  IL  YE+ASG+ +N DK+ I FS  T    Q+ +    +V +   + +YLGLPS + RNR  + + IKERVW++++GWK KL S  GRE      
Subjt:  QAVNSILQHYERASGKTINFDKSIISFSPNTAVGVQSQVSQFFQVQVSACHRQYLGLPSFMPRNRMSTLNFIKERVWRQIQGWKGKLFSVGGRE------

Query:  -------------------------------NEDTEDRRIHWVSWKTLCKPKCLSGMGFRDLETFNQALLAKQCWRIVQHPTSFLSRVLKERYFPHGDFL
                                       + ++E R+IHWVSW+ LC+ K   G+GFRDL  FN ALLAKQ WR++    S   RV K ++FPHG  +
Subjt:  -------------------------------NEDTEDRRIHWVSWKTLCKPKCLSGMGFRDLETFNQALLAKQCWRIVQHPTSFLSRVLKERYFPHGDFL

Query:  EAG------------------------------------------------------------------------WNEELLRHHFSPGEVRSILTIPLRQ
        +                                                                          W+  L+   F+P +  +I  IPL  
Subjt:  EAG------------------------------------------------------------------------WNEELLRHHFSPGEVRSILTIPLRQ

Query:  VSAEDKIIWHFEKCGIYTVKSGYRLGQVALLAQVPSASSSEALSSWWKGCWKMEIPSKIKVFLWRLCLERLPTVDNLSVRGIDVLNVCVHCGWHGESCMH
            DK+IW     G+Y+V+SGYRL        +P  S+   L + W+  W + IP K ++F W+   E LPT  NL+ R I +   C  CG   E  +H
Subjt:  VSAEDKIIWHFEKCGIYTVKSGYRLGQVALLAQVPSASSSEALSSWWKGCWKMEIPSKIKVFLWRLCLERLPTVDNLSVRGIDVLNVCVHCGWHGESCMH

Query:  VFWQCKFFRDALMGSEWEVLLQNVQANSMLNLLRDV---------------------KDKKFRGH--VPSAGLV-DWATNYLAVFRGASRACCEDTRGVP
          W CK  +       W   L++ Q+    +LL  V                     +  K R H  V S   V   A  YL  +  AS     +++ +P
Subjt:  VFWQCKFFRDALMGSEWEVLLQNVQANSMLNLLRDV---------------------KDKKFRGH--VPSAGLV-DWATNYLAVFRGASRACCEDTRGVP

Query:  -------------------------RTGAGIVVRDEMGRVMLSAAVSHDHVGNSDLVEGLAVVDGLRLVVEMGLAP-----DAAMV---------DLSEF
                                   G G++VRD  G V+ S         +   +E  A    ++  +E+GL       D+ +V          L+ F
Subjt:  -------------------------RTGAGIVVRDEMGRVMLSAAVSHDHVGNSDLVEGLAVVDGLRLVVEMGLAP-----DAAMV---------DLSEF

Query:  GVLVSEARRGVPAHFQLRVSVTRREGNRVAHELGRLA
        G+L+++A+       +   +  +R+GNR+AH L   A
Subjt:  GVLVSEARRGVPAHFQLRVSVTRREGNRVAHELGRLA

A0A2N9GIC4 Reverse transcriptase domain-containing protein1.9e-18733.98Show/hide
Query:  SPELKGR--TWSLLKHLRGNLGTPWLVGGDFNAILSHQEKEGGMAKAEAELS----------------GGAAFTWCNGRPGEEVVWERIDRCLSNVAWQE
        +PE   R  TW L++ L G    PW   GDFN ++  +E  G   + + ++                  G  FTWCN R      W R+DR ++   W  
Subjt:  SPELKGR--TWSLLKHLRGNLGTPWLVGGDFNAILSHQEKEGGMAKAEAELS----------------GGAAFTWCNGRPGEEVVWERIDRCLSNVAWQE

Query:  LFPVHELTHLDFSRSDHRPLLLSMSDSARVVRSAGQRIQRFEETWLQSTGFGEVVSAGWRSEVPVRSPIELASATSRCMSHLSSWGRRKNGNLGQRIQAA
         +P   + H+D  +SDH+ L L      +  R+  +R  RFEE WL  +G  + +   W +  P  +  ++      C   L SW R K GN+G++I+  
Subjt:  LFPVHELTHLDFSRSDHRPLLLSMSDSARVVRSAGQRIQRFEETWLQSTGFGEVVSAGWRSEVPVRSPIELASATSRCMSHLSSWGRRKNGNLGQRIQAA

Query:  TANVQQAMGRLGSLDSRADLQRAEGQLESLLVEEEVYWRQ-------SNG----------------------------VWQQEPDRVLGLIEGYFESIFS
         + ++QA        +   LQ    +L SL  +EE  WRQ       +NG                             W    D +  L+  Y+ S+FS
Subjt:  TANVQQAMGRLGSLDSRADLQRAEGQLESLLVEEEVYWRQ-------SNG----------------------------VWQQEPDRVLGLIEGYFESIFS

Query:  TSAPSEGEIDQVTSRVQPSVSDEMNASLVRPFQREEVLRALHQIHPNKAPGPDGLSGAFYQHSWSIVGADVVNCCLNILNNRASLAPLNETMIVLIPKKK
        TS P   +I +V + V   V+++MN +L+R F   EV  AL Q+ P KAPGPDG+   FYQ  W +VG DV    L+ LN+   L  +N T I LIPK K
Subjt:  TSAPSEGEIDQVTSRVQPSVSDEMNASLVRPFQREEVLRALHQIHPNKAPGPDGLSGAFYQHSWSIVGADVVNCCLNILNNRASLAPLNETMIVLIPKKK

Query:  NPKCITEYMPISLCNVSYKIVSKVLVNRMKGILNELISHNQSAFIPGRCVVDNAILGYECIHALKARKVGRSGWVSLKLDMSKAYDRVEWVFLERMMLRM
        NP+ +TE+ PISLCNV YK++SKVL NR+K IL +++S +QSAF+PGR + DN ++ +E +H +   K+GR G ++LKLDMSKAYDRVEW FLE++M +M
Subjt:  NPKCITEYMPISLCNVSYKIVSKVLVNRMKGILNELISHNQSAFIPGRCVVDNAILGYECIHALKARKVGRSGWVSLKLDMSKAYDRVEWVFLERMMLRM

Query:  GFAVEWVELVLRCISSVRYSFNVNGVRCGDVTPSRGLRQGDPLSPYLFLLCAEGLSSMLHDAEGRRAITGLRIARGCPPISHLFFADDSLLFFRAKEGEA
        GF  +W+ +++ CIS+V YS  VNG   G + PSRGLRQGDPLSPYLFLLCAEGL S++  A     I G+ + R  P ISHLFFADDSLLF +A     
Subjt:  GFAVEWVELVLRCISSVRYSFNVNGVRCGDVTPSRGLRQGDPLSPYLFLLCAEGLSSMLHDAEGRRAITGLRIARGCPPISHLFFADDSLLFFRAKEGEA

Query:  QAVNSILQHYERASGKTINFDKSIISFSPNTAVGVQSQVSQFFQVQVSACHRQYLGLPSFMPRNRMSTLNFIKERVWRQIQGWKGKLFSVGGRE------
        + +  IL  YE+ASG+ +N DK+ I FS NT    Q  + +  +V +   + +YLGLPS + RNR ++ + IKERVW++++GWK KL S  GRE      
Subjt:  QAVNSILQHYERASGKTINFDKSIISFSPNTAVGVQSQVSQFFQVQVSACHRQYLGLPSFMPRNRMSTLNFIKERVWRQIQGWKGKLFSVGGRE------

Query:  -------------------------------NEDTEDRRIHWVSWKTLCKPKCLSGMGFRDLETFNQALLAKQCWRIVQHPTSFLSRVLKERYFPHGDFL
                                       + + E R+IHWV+W+ LC+PK   GMGFRD+  FN ALLAKQ WR++   +S   RV K ++FPHG  L
Subjt:  -------------------------------NEDTEDRRIHWVSWKTLCKPKCLSGMGFRDLETFNQALLAKQCWRIVQHPTSFLSRVLKERYFPHGDFL

Query:  EA------------------------------------------------------------------------GWNEELLRHHFSPGEVRSILTIPLRQ
        +                                                                          W+  L+ + F   +  +I  IPL  
Subjt:  EA------------------------------------------------------------------------GWNEELLRHHFSPGEVRSILTIPLRQ

Query:  VSAEDKIIWHFEKCGIYTVKSGYRLGQVALLAQVPSASSSEALSSWWKGCWKMEIPSKIKVFLWRLCLERLPTVDNLSVRGIDVLNVCVHCGWHGESCMH
            DKIIW     G YTV+SGYR         +P +S    L   WK  W ++IP K ++F W+   E LPT  NL  R I V   C  CG   E  +H
Subjt:  VSAEDKIIWHFEKCGIYTVKSGYRLGQVALLAQVPSASSSEALSSWWKGCWKMEIPSKIKVFLWRLCLERLPTVDNLSVRGIDVLNVCVHCGWHGESCMH

Query:  VFWQCKFFRDALMGSEWEVLLQNVQANSMLNLLRDV--KDKKFRGHVPSAGLVDWATNYLAVFR-GASRACCEDTRGVPRTGAGIVVRDEMGRVMLSAAV
          W CK  +     S W   +QN        +LR +  +++    +   A  V W  +    ++     A  ++T      G G++VRD    VM S   
Subjt:  VFWQCKFFRDALMGSEWEVLLQNVQANSMLNLLRDV--KDKKFRGHVPSAGLVDWATNYLAVFR-GASRACCEDTRGVPRTGAGIVVRDEMGRVMLSAAV

Query:  SHDHVGNSDLVEGLAVVDGLRLVVEMGLAPD----------AAMVD----LSEFGVLVSEARRGVPAHFQLRVSVTRREGNRVAHELGRLAL
              +   +E  AV   ++ V+E+GL             AA+ D    L+ FG+L+++A+           S  +R+GN++AH L R AL
Subjt:  SHDHVGNSDLVEGLAVVDGLRLVVEMGLAPD----------AAMVD----LSEFGVLVSEARRGVPAHFQLRVSVTRREGNRVAHELGRLAL

A0A2N9GLU2 Reverse transcriptase domain-containing protein1.1e-18232.71Show/hide
Query:  SPELKGR--TWSLLKHLRGNLGTPWLVGGDFNAILSHQEKEGGMAKAEAELS----------------GGAAFTWCNGRPGEEVVWERIDRCLSNVAWQE
        SP ++G+   WS+L+ LR +   PWL  GDFN +LS  EK G   + E +++                 G A+TWCN + G+  V ER+DR L+   W  
Subjt:  SPELKGR--TWSLLKHLRGNLGTPWLVGGDFNAILSHQEKEGGMAKAEAELS----------------GGAAFTWCNGRPGEEVVWERIDRCLSNVAWQE

Query:  LFPVHELTHLDFSRSDHRPLLLSMSDSARVVRSAGQRIQRFEETWLQSTGFGEVVSAGWRSEVPVRSPIELASATSRCMSHLSSWGRRKNGNLGQRIQAA
         FP   + HL    SDH  L   ++ S R  R   +R  RFEE W       + +   W +E        ++         L  W ++  G++   I+  
Subjt:  LFPVHELTHLDFSRSDHRPLLLSMSDSARVVRSAGQRIQRFEETWLQSTGFGEVVSAGWRSEVPVRSPIELASATSRCMSHLSSWGRRKNGNLGQRIQAA

Query:  TANVQQAMGRLGSLDSRADLQRAEGQLESLLVEEEVYWRQ-----------------------------------SNGVWQQEPDRVLGLIEGYFESIFS
        T  +++    + +L +   ++    +L SL  +EE  W+Q                                    +G+WQQE D++   I  Y++S+F+
Subjt:  TANVQQAMGRLGSLDSRADLQRAEGQLESLLVEEEVYWRQ-----------------------------------SNGVWQQEPDRVLGLIEGYFESIFS

Query:  TSAPSEGEIDQVTSRVQPSVSDEMNASLVRPFQREEVLRALHQIHPNKAPGPDGLSGAFYQHSWSIVGADVVNCCLNILNNRASLAPLNETMIVLIPKKK
        ++ P  G++D+V + V   VS+EMN  L+R F   EV +AL Q+ P KAPGPDG+S  FYQ  W IVG DV    L+ L +   L  +N T I LIPK +
Subjt:  TSAPSEGEIDQVTSRVQPSVSDEMNASLVRPFQREEVLRALHQIHPNKAPGPDGLSGAFYQHSWSIVGADVVNCCLNILNNRASLAPLNETMIVLIPKKK

Query:  NPKCITEYMPISLCNVSYKIVSKVLVNRMKGILNELISHNQSAFIPGRCVVDNAILGYECIHALKARKVGRSGWVSLKLDMSKAYDRVEWVFLERMMLRM
        NP    ++ PISLCNV YKIV+KVL NR+K +L  +IS  QSAF+PGR + DN ++ +E +H +   + G+ G+++LKLDMSKAYDRVEWVFLE++M  M
Subjt:  NPKCITEYMPISLCNVSYKIVSKVLVNRMKGILNELISHNQSAFIPGRCVVDNAILGYECIHALKARKVGRSGWVSLKLDMSKAYDRVEWVFLERMMLRM

Query:  GFAVEWVELVLRCISSVRYSFNVNGVRCGDVTPSRGLRQGDPLSPYLFLLCAEGLSSMLHDAEGRRAITGLRIARGCPPISHLFFADDSLLFFRAKEGEA
        GF  +WV L++ C+ SV YS  +NG   G   PSRGLRQGDP+SPYLFLLCAEGL ++L  A   R + GL I+RG P ++HLFFADDS+LF RA   E 
Subjt:  GFAVEWVELVLRCISSVRYSFNVNGVRCGDVTPSRGLRQGDPLSPYLFLLCAEGLSSMLHDAEGRRAITGLRIARGCPPISHLFFADDSLLFFRAKEGEA

Query:  QAVNSILQHYERASGKTINFDKSIISFSPNTAVGVQSQVSQFFQVQVSACHRQYLGLPSFMPRNRMSTLNFIKERVWRQIQGWKGKLFSVGGRE------
          +  IL  YERASG+ IN DK+ + FS +T    + ++ Q  Q+ V   +  YLGLPS + R++ ++ + +KE +WR++QGWK KL +  G+E      
Subjt:  QAVNSILQHYERASGKTINFDKSIISFSPNTAVGVQSQVSQFFQVQVSACHRQYLGLPSFMPRNRMSTLNFIKERVWRQIQGWKGKLFSVGGRE------

Query:  ------------------NEDTED-------------RRIHWVSWKTLCKPKCLSGMGFRDLETFNQALLAKQCWRIVQHPTSFLSRVLKERYFPHGDFL
                            D E              R++HW+ W +LC+PKC  GMGFR+L  FN+ALLAKQ WR++ +  S   +V K ++FP+G  +
Subjt:  ------------------NEDTED-------------RRIHWVSWKTLCKPKCLSGMGFRDLETFNQALLAKQCWRIVQHPTSFLSRVLKERYFPHGDFL

Query:  EAG------------------------------------------------------------------------WNEELLRHHFSPGEVRSILTIPLRQ
        EA                                                                         WN  L+   FSPG+ + I  + L  
Subjt:  EAG------------------------------------------------------------------------WNEELLRHHFSPGEVRSILTIPLRQ

Query:  VSAEDKIIWHFEKCGIYTVKSGYRLGQVALLAQVPSASSSEALSSWWKGCWKMEIPSKIKVFLWRLCLERLPTVDNLSVRGIDVLNVCVHCGWHGESCMH
           EDK+IW  EKCGIY+V+S YRL   A+ A  P    S     +WK  W +++P K++ FL R C E LPT+ N+  R I     C  C    E   H
Subjt:  VSAEDKIIWHFEKCGIYTVKSGYRLGQVALLAQVPSASSSEALSSWWKGCWKMEIPSKIKVFLWRLCLERLPTVDNLSVRGIDVLNVCVHCGWHGESCMH

Query:  VFWQCKFFRDALMGSEWEVLLQNVQANSMLNLLRDV-----------------------KDKKFRGHVPSAGLVD-----WATNYLA--VFRGASRACCE
        V W C               L   +  S L++L D+                           +R  V S  L+       ++ YL+  V    S  C  
Subjt:  VFWQCKFFRDALMGSEWEVLLQNVQANSMLNLLRDV-----------------------KDKKFRGHVPSAGLVD-----WATNYLA--VFRGASRACCE

Query:  DTRGVPR-------------------TGAGIVVRDEMGRVMLSAAVSHDHVGNSDLVEGLAVVDGLRLVVEMGLAP-----DAAMV---------DLSEF
          R  P                    TG G+++RD  G  + + +     + + D  E +A  + L+   E+G+       D+  +           + F
Subjt:  DTRGVPR-------------------TGAGIVVRDEMGRVMLSAAVSHDHVGNSDLVEGLAVVDGLRLVVEMGLAP-----DAAMV---------DLSEF

Query:  GVLVSEARRGVPAHFQLRVSVTRREGNRVAHELGRLAL
        G ++ EA     +  +   S  RREGNRVAH L R A+
Subjt:  GVLVSEARRGVPAHFQLRVSVTRREGNRVAHELGRLAL

A0A2N9I4C9 Uncharacterized protein6.3e-18336.1Show/hide
Query:  KGRTWSLLKHLRGNLGTPWLVGGDFNAILSHQEKEGGMAKAEAELSG----------------GAAFTWCNGRPGEEVVWERIDRCLSNVAWQELFPVHE
        +G +W+LLK L+     PWLV GDFN IL   EK G + +   ++                  G  FTW NGR G++ V+ER+DR + + AW  LFP  +
Subjt:  KGRTWSLLKHLRGNLGTPWLVGGDFNAILSHQEKEGGMAKAEAELSG----------------GAAFTWCNGRPGEEVVWERIDRCLSNVAWQELFPVHE

Query:  LTHLDFSRSDHRPLLLSMSDSARVVRSAGQRIQRFEETWLQSTGFGEVVSAGWRSEVPVRSPIELASATSRCMSHLSSWG-------RRKNGNLGQRIQA
        + H+ F+ SDH  +L+ + +S     S   ++ RFE  W+Q  G   ++   W++        ++      C   L  W        +++   L +    
Subjt:  LTHLDFSRSDHRPLLLSMSDSARVVRSAGQRIQRFEETWLQSTGFGEVVSAGWRSEVPVRSPIELASATSRCMSHLSSWG-------RRKNGNLGQRIQA

Query:  ATANVQQAMGRLGSLDSRADLQRAEGQLESLLVEEEVYWRQSNGV-WQQEPDRVL-------GLIEGYFESIFSTSAPSEGEIDQVTSRVQPSVSDEMNA
             Q   G    +++R+    A  QL  LL +EE YW Q + V W ++   ++        ++E YF  IF +  P    I+QV  +V+P +S EMN 
Subjt:  ATANVQQAMGRLGSLDSRADLQRAEGQLESLLVEEEVYWRQSNGV-WQQEPDRVL-------GLIEGYFESIFSTSAPSEGEIDQVTSRVQPSVSDEMNA

Query:  SLVRPFQREEVLRALHQIHPNKAPGPDGLSGAFYQHSWSIVGADVVNCCLNILNNRASLAPLNETMIVLIPKKKNPKCITEYMPISLCNVSYKIVSKVLV
         L+ PF   E+  AL Q+HP+KA GPDG++  FYQ  W I+G DV    L  L++   L  +N T I LIPK K P  +T++ PISLCNV YKI+SKVL 
Subjt:  SLVRPFQREEVLRALHQIHPNKAPGPDGLSGAFYQHSWSIVGADVVNCCLNILNNRASLAPLNETMIVLIPKKKNPKCITEYMPISLCNVSYKIVSKVLV

Query:  NRMKGILNELISHNQSAFIPGRCVVDNAILGYECIHALKARKVGRSGWVSLKLDMSKAYDRVEWVFLERMMLRMGFAVEWVELVLRCISSVRYSFNVNGV
        NR+K +LN +IS NQSAF+PGR + DN ++ +E +H LK+++ GR+  +++KLDMSKAYDRVEW F+ +MML++GF   WV L+++CI SV YS  +NG 
Subjt:  NRMKGILNELISHNQSAFIPGRCVVDNAILGYECIHALKARKVGRSGWVSLKLDMSKAYDRVEWVFLERMMLRMGFAVEWVELVLRCISSVRYSFNVNGV

Query:  RCGDVTPSRGLRQGDPLSPYLFLLCAEGLSSMLHDAEGRRAITGLRIARGCPPISHLFFADDSLLFFRAKEGEAQAVNSILQHYERASGKTINFDKSIIS
          G + P+RG+ QGDPLSPYLFL+CAEGL+++L+ A   R+++GL + RG P ISHLFFADDSLLF RA   E   +NS+L  YE+ASG+ +N++K+ I 
Subjt:  RCGDVTPSRGLRQGDPLSPYLFLLCAEGLSSMLHDAEGRRAITGLRIARGCPPISHLFFADDSLLFFRAKEGEAQAVNSILQHYERASGKTINFDKSIIS

Query:  FSPNTAVGVQSQVSQFFQVQVSACHRQYLGLPSFMPRNRMSTLNFIKERVWRQIQGWKGKLFSVGGRE--------------------------------
        FS NT    +  +    +   S    +YLGLP  + R +    + IK++V R + GWKGK+ S+ G+E                                
Subjt:  FSPNTAVGVQSQVSQFFQVQVSACHRQYLGLPSFMPRNRMSTLNFIKERVWRQIQGWKGKLFSVGGRE--------------------------------

Query:  -----NEDTEDRRIHWVSWKTLCKPKCLSGMGFRDLETFNQALLAKQCWRIVQHPTSFLSRVLKERYFPHGDFLEA------------------------
              +   +R+IHW +W  LC+ K   GMGFRDL  FNQALLAKQ WR++Q+P + L RVLK +YFP   F+EA                        
Subjt:  -----NEDTEDRRIHWVSWKTLCKPKCLSGMGFRDLETFNQALLAKQCWRIVQHPTSFLSRVLKERYFPHGDFLEA------------------------

Query:  -------------------------GWNEELLRHHFSPGEVRSILTIPLRQVSAEDKIIWHFEKCGIYTVKSGYRLGQVALLAQVPSASSSEALSSWWKG
                                  W E L+   F P E   I  IPL   S +D ++W     GI+T +S Y +          S+S+   L ++WK 
Subjt:  -------------------------GWNEELLRHHFSPGEVRSILTIPLRQVSAEDKIIWHFEKCGIYTVKSGYRLGQVALLAQVPSASSSEALSSWWKG

Query:  CWKMEIPSKIKVFLWRLCLERLPTVDNLSVRGIDVLNVCVHCGWHGESCMHVFWQCKFFRDALMGSEWEVLLQNVQANSMLNLLRDVKDKKFRGHVPSAG
         W + +P+KI+VF+WR C   LPT  N   RGI   + C +C    E+ +H  W+C + ++  + S    L  +V  +S  +L+         GHV S G
Subjt:  CWKMEIPSKIKVFLWRLCLERLPTVDNLSVRGIDVLNVCVHCGWHGESCMHVFWQCKFFRDALMGSEWEVLLQNVQANSMLNLLRDVKDKKFRGHVPSAG

A0A2N9J3U0 Reverse transcriptase domain-containing protein3.1e-18236.73Show/hide
Query:  TWSLLKHLRGNLGTPWLVGGDFNAILSHQEKEGGMAK------------AEAELSG----GAAFTWCNGRPGEEVVWERIDRCLSNVAWQELFPVHELTH
        +W+LL+ L+     PWLV GDFN +L   EK G +A+            ++ EL      G  FTW NGR G + V+ER+DR + +  W  LFP  ++ H
Subjt:  TWSLLKHLRGNLGTPWLVGGDFNAILSHQEKEGGMAK------------AEAELSG----GAAFTWCNGRPGEEVVWERIDRCLSNVAWQELFPVHELTH

Query:  LDFSRSDHRPLLLSMSDSARVVRSAGQRIQRFEETWLQSTGFGEVVSAGWRSEVPVRSPIELASATSRCMSHLSSWGRRKNGNLGQRI-QAATANVQQAM
        + FS SDH  L++ +S      +   QR  RFE  W+Q+ G  EVV   W+         +++     C   L  W R  N     R+ QA TA      
Subjt:  LDFSRSDHRPLLLSMSDSARVVRSAGQRIQRFEETWLQSTGFGEVVSAGWRSEVPVRSPIELASATSRCMSHLSSWGRRKNGNLGQRI-QAATANVQQAM

Query:  GRLGSLDSRADLQR---AEGQLESLLVEEEVYWRQSNGV-WQQEPDR-----------------VLGL-----------------IEGYFESIFSTSAPS
        G   + D+R    R   A   L  +L +EE YW+Q + V W ++ DR                 +LGL                 +E YF +IF TS PS
Subjt:  GRLGSLDSRADLQR---AEGQLESLLVEEEVYWRQSNGV-WQQEPDR-----------------VLGL-----------------IEGYFESIFSTSAPS

Query:  EGEIDQVTSRVQPSVSDEMNASLVRPFQREEVLRALHQIHPNKAPGPDGLSGAFYQHSWSIVGADVVNCCLNILNNRASLAPLNETMIVLIPKKKNPKCI
           I QV   V  +V+  MN  L+ PF  EE+  AL Q+HP KAPGPDG++  FYQ  W IVG DV N  L  L++   L  +N T I LIPK  +P+ +
Subjt:  EGEIDQVTSRVQPSVSDEMNASLVRPFQREEVLRALHQIHPNKAPGPDGLSGAFYQHSWSIVGADVVNCCLNILNNRASLAPLNETMIVLIPKKKNPKCI

Query:  TEYMPISLCNVSYKIVSKVLVNRMKGILNELISHNQSAFIPGRCVVDNAILGYECIHALKARKVGRSGWVSLKLDMSKAYDRVEWVFLERMMLRMGFAVE
        T++ PISLCNV YKI+SKVL NR+K +L+ +IS NQSAF+PGR + DN ++ +E +H +K ++ GRS  +++KLDMSKAYDRVEW FLE MM+++GF   
Subjt:  TEYMPISLCNVSYKIVSKVLVNRMKGILNELISHNQSAFIPGRCVVDNAILGYECIHALKARKVGRSGWVSLKLDMSKAYDRVEWVFLERMMLRMGFAVE

Query:  WVELVLRCISSVRYSFNVNGVRCGDVTPSRGLRQGDPLSPYLFLLCAEGLSSMLHDAEGRRAITGLRIARGCPPISHLFFADDSLLFFRAKEGEAQAVNS
        WV L+++C++SV YS  +NG   G + P+RG+RQGDPLSPYLFL+CAEGL+++L  AE    + GL I RG P ISHLFFADDSLLF RA   E Q + +
Subjt:  WVELVLRCISSVRYSFNVNGVRCGDVTPSRGLRQGDPLSPYLFLLCAEGLSSMLHDAEGRRAITGLRIARGCPPISHLFFADDSLLFFRAKEGEAQAVNS

Query:  ILQHYERASGKTINFDKSIISFSPNTAVGVQSQVSQFFQVQVSACHRQYLGLPSFMPRNRMSTLNFIKERVWRQIQGWKGKLFSVGGRE-----------
        IL  YE+ASG+ +N +K+ + FS NT   ++  +    +   +    +YLGLP  + R +      IK+++ +++ GWKGKL S  GRE           
Subjt:  ILQHYERASGKTINFDKSIISFSPNTAVGVQSQVSQFFQVQVSACHRQYLGLPSFMPRNRMSTLNFIKERVWRQIQGWKGKLFSVGGRE-----------

Query:  --------------------------NEDTEDRRIHWVSWKTLCKPKCLSGMGFRDLETFNQALLAKQCWRIVQHPTSFLSRVLKERYFPHGDFLEAG--
                                   +  E+++IHW  W  +C+ K   GMGFRDL  FNQALLAKQ WR++QHP + L R+LK +YFP+  F+EA   
Subjt:  --------------------------NEDTEDRRIHWVSWKTLCKPKCLSGMGFRDLETFNQALLAKQCWRIVQHPTSFLSRVLKERYFPHGDFLEAG--

Query:  ----------------------------------------------------------------------WNEELLRHHFSPGEVRSILTIPLRQVSAED
                                                                              WN  L+   F P E   I  IPLR +   D
Subjt:  ----------------------------------------------------------------------WNEELLRHHFSPGEVRSILTIPLRQVSAED

Query:  KIIWHFEKCGIYTVKSGYRLGQVALLAQVPSASSSEA-LSSWWKGCWKMEIPSKIKVFLWRLCLERLPTVDNLSVRGIDVLNVCVHCGWHGESCMHVFWQ
         ++W     G +T +S Y L Q+    Q+  +SS ++ L ++WK  W++ +PSKIK F+WR C   LPT  NL  RG+     C  C  H E+ +H  W 
Subjt:  KIIWHFEKCGIYTVKSGYRLGQVALLAQVPSASSSEA-LSSWWKGCWKMEIPSKIKVFLWRLCLERLPTVDNLSVRGIDVLNVCVHCGWHGESCMHVFWQ

Query:  CKFFRDALMGSEWEVLLQNVQANSMLNLL
        C++ + A + S    LL  V+ +S  +L+
Subjt:  CKFFRDALMGSEWEVLLQNVQANSMLNLL

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein1.7e-2824.76Show/hide
Query:  EPDRVLGLIEGYFESIFSTSAPSEGEIDQ-VTSRVQPSVSDEMNASLVRPFQREEVLRALHQIHPNKAPGPDGLSGAFYQHSWSIVGADVVNCCLNILNN
        +P  +   I  Y++ +++    +  E+D  + +   P ++ E   SL RP    E++  ++ +   K+PGPDG +  FYQ     +   ++    +I   
Subjt:  EPDRVLGLIEGYFESIFSTSAPSEGEIDQ-VTSRVQPSVSDEMNASLVRPFQREEVLRALHQIHPNKAPGPDGLSGAFYQHSWSIVGADVVNCCLNILNN

Query:  RASLAPLNETMIVLIPKKKNPKCITE-YMPISLCNVSYKIVSKVLVNRMKGILNELISHNQSAFIPGRCVVDNAILGYECIHALKARKVGRSGWVSLKLD
                E  I+LIPK        E + PISL N+  KI++K+L NR++  + +LI H+Q  FIPG     N       I  +   K      V + +D
Subjt:  RASLAPLNETMIVLIPKKKNPKCITE-YMPISLCNVSYKIVSKVLVNRMKGILNELISHNQSAFIPGRCVVDNAILGYECIHALKARKVGRSGWVSLKLD

Query:  MSKAYDRVEWVFLERMMLRMGFAVEWVELVLRCISSVRYSFNVNGVRCGDVTPSRGLRQGDPLSPYLFLLCAEGLSSMLHDAEGRRAITGLRIARGCPPI
          KA+D+++  F+ + + ++G    +++++         +  +NG +        G RQG PLSP LF +  E L+  +      + I G+++  G   +
Subjt:  MSKAYDRVEWVFLERMMLRMGFAVEWVELVLRCISSVRYSFNVNGVRCGDVTPSRGLRQGDPLSPYLFLLCAEGLSSMLHDAEGRRAITGLRIARGCPPI

Query:  SHLFFADDSLLFFRAKEGEAQAVNSILQHYERASGKTINFDKSIISFSPNTAVGVQSQVSQFFQVQVSACHRQYLGL------PSFMPRNRMSTLNFIKE
            FADD +++       AQ +  ++ ++ + SG  IN  KS  +F  N     +SQ+       +++   +YLG+            N    L  IKE
Subjt:  SHLFFADDSLLFFRAKEGEAQAVNSILQHYERASGKTINFDKSIISFSPNTAVGVQSQVSQFFQVQVSACHRQYLGL------PSFMPRNRMSTLNFIKE

Query:  --RVWRQIQ-GWKGKL
            W+ I   W G++
Subjt:  --RVWRQIQ-GWKGKL

P08548 LINE-1 reverse transcriptase homolog6.8e-3326.51Show/hide
Query:  RQSNGVWQQEPDRVLGLIEGYFESIFSTSAPSEGEIDQVTSRVQ-PSVSDEMNASLVRPFQREEVLRALHQIHPNKAPGPDGLSGAFYQHSWSIVGADVV
        R  N     +P  +  ++  Y++ ++S    +  EIDQ       P +S +    L RP    E+   +  +   K+PGPDG +  FYQ        ++V
Subjt:  RQSNGVWQQEPDRVLGLIEGYFESIFSTSAPSEGEIDQVTSRVQ-PSVSDEMNASLVRPFQREEVLRALHQIHPNKAPGPDGLSGAFYQHSWSIVGADVV

Query:  NCCLNILNN--RASLAP--LNETMIVLIPKK-KNPKCITEYMPISLCNVSYKIVSKVLVNRMKGILNELISHNQSAFIPGRCVVDNAILGYECIHALKAR
           LN+  N  +  + P    E  I LIPK  K+P     Y PISL N+  KI++K+L NR++  + ++I H+Q  FIPG     N       I  +   
Subjt:  NCCLNILNN--RASLAP--LNETMIVLIPKK-KNPKCITEYMPISLCNVSYKIVSKVLVNRMKGILNELISHNQSAFIPGRCVVDNAILGYECIHALKAR

Query:  KVGRSGWVSLKLDMSKAYDRVEWVFLERMMLRMGFAVEWVELVLRCISSVRYSFNVNGVRCGDVTPSRGLRQGDPLSPYLFLLCAEGLSSMLHDAEGRRA
        K+     + L +D  KA+D ++  F+ R + ++G    +++L+    S    +  +NGV+        G RQG PLSP LF +  E L+  + +    +A
Subjt:  KVGRSGWVSLKLDMSKAYDRVEWVFLERMMLRMGFAVEWVELVLRCISSVRYSFNVNGVRCGDVTPSRGLRQGDPLSPYLFLLCAEGLSSMLHDAEGRRA

Query:  ITGLRIARGCPPISHLFFADDSLLFFRAKEGEAQAVNSILQHYERASGKTINFDKSIISFSPNTAVGVQSQVSQFFQVQVSACHRQYLGLPSFMPRNRMS
        I G+ I  G   I    FADD +++          +  +++ Y   SG  IN  KS+     N     +  V       V     +YLG+        + 
Subjt:  ITGLRIARGCPPISHLFFADDSLLFFRAKEGEAQAVNSILQHYERASGKTINFDKSIISFSPNTAVGVQSQVSQFFQVQVSACHRQYLGLPSFMPRNRMS

Query:  TLNF--IKERVWRQIQGWKGKLFSVGGREN
          N+  +++ +   +  WK    S  GR N
Subjt:  TLNF--IKERVWRQIQGWKGKLFSVGGREN

P11369 LINE-1 retrotransposable element ORF2 protein5.7e-3225.06Show/hide
Query:  RQSNGVWQQEPDRVLGLIEGYFESIFSTSAPSEGEIDQVTSRVQ-PSVSDEMNASLVRPFQREEVLRALHQIHPNKAPGPDGLSGAFYQHSWSIVGADVV
        R   G    +P+ +   I  +++ ++ST   +  E+D+   R Q P ++ +    L  P   +E+   ++ +   K+PGPDG S  FYQ ++      ++
Subjt:  RQSNGVWQQEPDRVLGLIEGYFESIFSTSAPSEGEIDQVTSRVQ-PSVSDEMNASLVRPFQREEVLRALHQIHPNKAPGPDGLSGAFYQHSWSIVGADVV

Query:  NCCLNILNNRASLA-PLNETMIVLIPK-KKNPKCITEYMPISLCNVSYKIVSKVLVNRMKGILNELISHNQSAFIPGRCVVDNAILGYECIHALKARKVG
        +   + +    +L     E  I LIPK +K+P  I  + PISL N+  KI++K+L NR++  +  +I  +Q  FIPG     N       IH +   K+ 
Subjt:  NCCLNILNNRASLA-PLNETMIVLIPK-KKNPKCITEYMPISLCNVSYKIVSKVLVNRMKGILNELISHNQSAFIPGRCVVDNAILGYECIHALKARKVG

Query:  RSGWVSLKLDMSKAYDRVEWVFLERMMLRMGFAVEWVELVLRCISSVRYSFNVNGVRCGDVTPSRGLRQGDPLSPYLFLLCAEGLSSMLHDAEGRRAITG
            + + LD  KA+D+++  F+ +++ R G    ++ ++    S    +  VNG +   +    G RQG PLSPYLF +  E L+  +     ++ I G
Subjt:  RSGWVSLKLDMSKAYDRVEWVFLERMMLRMGFAVEWVELVLRCISSVRYSFNVNGVRCGDVTPSRGLRQGDPLSPYLFLLCAEGLSSMLHDAEGRRAITG

Query:  LRIARGCPPISHLFFADDSLLFFRAKEGEAQAVNSILQHYERASGKTINFDKSIISFSPNTAVGVQSQVSQFFQVQVSACHRQYLGLPSFMPRNRMSTLN
        ++I +    IS L  ADD +++    +   + + +++  +    G  IN +KS ++F        + ++ +     +   + +YLG+        +   N
Subjt:  LRIARGCPPISHLFFADDSLLFFRAKEGEAQAVNSILQHYERASGKTINFDKSIISFSPNTAVGVQSQVSQFFQVQVSACHRQYLGLPSFMPRNRMSTLN

Query:  F--IKERVWRQIQGWKGKLFSVGGREN
        F  +K+ +   ++ WK    S  GR N
Subjt:  F--IKERVWRQIQGWKGKLFSVGGREN

P14381 Transposon TX1 uncharacterized 149 kDa protein2.4e-3026.34Show/hide
Query:  NLGQRIQAATANVQQAMGRLGSLDSRA----DLQRAEG---QLESLLVEEEVYWRQSNGVWQQEPDRVLGLIEGYFESIFSTSAPSEGEIDQVTSRVQPS
        N+ QR QA  A V+  M  L  +D  +     L++ +G   Q+  L  E+        G   ++P+ +      +++++FS    S    +++   + P 
Subjt:  NLGQRIQAATANVQQAMGRLGSLDSRA----DLQRAEG---QLESLLVEEEVYWRQSNGVWQQEPDRVLGLIEGYFESIFSTSAPSEGEIDQVTSRVQPS

Query:  VSDEMNASLVRPFQREEVLRALHQIHPNKAPGPDGLSGAFYQHSWSIVGADVVNCCLNILNNRASLAPLNETMIVLIPKKKNPKCITEYMPISLCNVSYK
        VS+     L  P   +E+ +AL  +  NK+PG DGL+  F+Q  W  +G D                     ++ L+PKK + + I  + P+SL +  YK
Subjt:  VSDEMNASLVRPFQREEVLRALHQIHPNKAPGPDGLSGAFYQHSWSIVGADVVNCCLNILNNRASLAPLNETMIVLIPKKKNPKCITEYMPISLCNVSYK

Query:  IVSKVLVNRMKGILNELISHNQSAFIPGRCVVDNAILGYECIHALKARKVGRSGWVSLKLDMSKAYDRVEWVFLERMMLRMGFAVEWVELVLRCISSVRY
        IV+K +  R+K +L E+I  +QS  +PGR + DN  L  + +H   AR+ G S    L LD  KA+DRV+  +L   +    F  ++V  +    +S   
Subjt:  IVSKVLVNRMKGILNELISHNQSAFIPGRCVVDNAILGYECIHALKARKVGRSGWVSLKLDMSKAYDRVEWVFLERMMLRMGFAVEWVELVLRCISSVRY

Query:  SFNVNGVRCGDVTPSRGLRQGDPLSPYLFLLCAEGLSSMLHDAEGRRAITGLRIARGCPPISHLFFADDSLLFFRAKEGEAQAVNSILQHYERASGKTIN
           +N      +   RG+RQG PLS  L+ L  E    +L     R+ +TGL +      +    +ADD +L  +    + +      + Y  AS   IN
Subjt:  SFNVNGVRCGDVTPSRGLRQGDPLSPYLFLLCAEGLSSMLHDAEGRRAITGLRIARGCPPISHLFFADDSLLFFRAKEGEAQAVNSILQHYERASGKTIN

Query:  FDKSI--------ISFSPNT--AVGVQSQVSQFFQVQVSACHRQYLGLPSFMPRNRMSTLNFIKERVWRQIQGWKG--KLFSVGGR
        + KS         + F P     +  +S++ ++  V +SA   +Y    +F+          ++E V  ++  WKG  K+ S+ GR
Subjt:  FDKSI--------ISFSPNT--AVGVQSQVSQFFQVQVSACHRQYLGLPSFMPRNRMSTLNFIKERVWRQIQGWKG--KLFSVGGR

P92555 Uncharacterized mitochondrial protein AtMg012503.7e-1555.07Show/hide
Query:  FNVNGVRCGDVTPSRGLRQGDPLSPYLFLLCAEGLSSMLHDAEGRRAITGLRIARGCPPISHLFFADDS
        F +NG   G VTPSRGLRQGDPLSPYLF+LC E LS +   A+ +  + G+R++   P I+HL FADD+
Subjt:  FNVNGVRCGDVTPSRGLRQGDPLSPYLFLLCAEGLSSMLHDAEGRRAITGLRIARGCPPISHLFFADDS

Arabidopsis top hitse value%identityAlignment
AT3G09510.1 Ribonuclease H-like superfamily protein2.1e-1326.23Show/hide
Query:  WNEELLRHHFSPGEVRSILTIPLRQVSAEDKIIWHFEKCGIYTVKSGYRLGQVALLAQVPSASSSEALSSWWKGCWKMEIPSKIKVFLWRLCLERLPTVD
        W++  +       +   I  I L +    DKIIW++   G YTV+SGY L        +P+ +            W + I  K+K FLWR   + L T +
Subjt:  WNEELLRHHFSPGEVRSILTIPLRQVSAEDKIIWHFEKCGIYTVKSGYRLGQVALLAQVPSASSSEALSSWWKGCWKMEIPSKIKVFLWRLCLERLPTVD

Query:  NLSVRGIDVLNVCVHCGWHGESCMHVFWQCKF------------FRDALMGSEWEVLLQNVQANSMLNLLRDVKDKKFRGHVP
         L+ RG+ +   C  C    ES  H  + C F             R+ LM +++E  + N+     LN ++D     F   +P
Subjt:  NLSVRGIDVLNVCVHCGWHGESCMHVFWQCKF------------FRDALMGSEWEVLLQNVQANSMLNLLRDVKDKKFRGHVP

AT4G20520.1 RNA binding;RNA-directed DNA polymerases1.9e-1439.36Show/hide
Query:  LVNRMKGILNELISHNQSAFIPGRCVVDNAILGYECIHALKARKVGRSGWVSLKLDMSKAYDRVEWVFLERMMLRMGFAVEWVELVLRCISSVR
        +V R+K ++  LI   Q++FIPGR   DN +   E +H+++ RK G  GW+ LKLD+ KAYDR+ W +LE  ++  GF   W+  + R     R
Subjt:  LVNRMKGILNELISHNQSAFIPGRCVVDNAILGYECIHALKARKVGRSGWVSLKLDMSKAYDRVEWVFLERMMLRMGFAVEWVELVLRCISSVR

AT4G29090.1 Ribonuclease H-like superfamily protein3.0e-1238.46Show/hide
Query:  EDRRIHWVSWKTLCKPKCLSGMGFRDLETFNQALLAKQCWRIVQHPTSFLSRVLKERYFPHGDFLEAGWNEELLRHHFSPGEVRSILTIPLRQV--SAED
        E + +HW +W  L   K   G+GF+D+E FN ALL KQ WR++  P S +++V K RYF   D L A           S    + IL    R V  + ED
Subjt:  EDRRIHWVSWKTLCKPKCLSGMGFRDLETFNQALLAKQCWRIVQHPTSFLSRVLKERYFPHGDFLEAGWNEELLRHHFSPGEVRSILTIPLRQV--SAED

Query:  KIIW
         IIW
Subjt:  KIIW

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein5.0e-1556.92Show/hide
Query:  RRIHWVSWKTLCKPK-CLSGMGFRDLETFNQALLAKQCWRIVQHPTSFLSRVLKERYFPHGDFLE
        R+I WV+W+ LCK K    G+GFRDL  FNQALLAKQ +RI+  P + LSR+L+ RYFPH   +E
Subjt:  RRIHWVSWKTLCKPK-CLSGMGFRDLETFNQALLAKQCWRIVQHPTSFLSRVLKERYFPHGDFLE

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)2.6e-1655.07Show/hide
Query:  FNVNGVRCGDVTPSRGLRQGDPLSPYLFLLCAEGLSSMLHDAEGRRAITGLRIARGCPPISHLFFADDS
        F +NG   G VTPSRGLRQGDPLSPYLF+LC E LS +   A+ +  + G+R++   P I+HL FADD+
Subjt:  FNVNGVRCGDVTPSRGLRQGDPLSPYLFLLCAEGLSSMLHDAEGRRAITGLRIARGCPPISHLFFADDS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATCCACGATTTCTTCAAATCTCCAGTCGGCTACGGAAGAGTGTCATTAACAAACCCCATCTTATTCTTGAAAGTGAGTCCAGAACTGAAAGGGAGGACTTGGTCTCT
TTTGAAGCACTTGCGTGGGAATCTAGGGACACCGTGGTTAGTGGGTGGTGACTTTAATGCCATCTTGTCCCATCAGGAGAAGGAAGGTGGCATGGCCAAGGCAGAGGCAG
AATTGTCAGGGGGGGCTGCCTTTACCTGGTGCAATGGGAGGCCGGGGGAGGAAGTAGTTTGGGAGAGGATTGACAGATGTCTGAGCAATGTTGCTTGGCAGGAGTTGTTC
CCAGTGCATGAGTTGACACATCTGGACTTTAGTCGGTCGGACCACAGGCCTCTATTGCTCTCGATGTCAGATAGTGCCCGGGTGGTTCGTAGCGCAGGTCAGAGGATTCA
GAGGTTTGAGGAAACTTGGCTCCAGTCCACTGGCTTTGGGGAGGTAGTGTCAGCAGGTTGGAGGTCTGAGGTTCCCGTTAGGTCGCCTATAGAGTTGGCATCAGCAACGT
CCCGGTGTATGTCACATCTGAGCAGTTGGGGAAGGCGGAAGAATGGTAATCTTGGTCAGCGTATTCAGGCGGCTACAGCAAATGTTCAACAAGCAATGGGTAGGCTGGGC
TCTTTGGATTCCCGAGCAGATCTACAGAGGGCAGAAGGGCAGTTGGAATCTCTTCTTGTCGAGGAGGAGGTGTACTGGAGACAAAGTAACGGTGTGTGGCAACAGGAGCC
AGACAGGGTGTTGGGCCTGATAGAGGGGTATTTTGAGAGCATCTTCTCGACATCCGCCCCTTCTGAGGGTGAGATTGATCAGGTGACATCCCGAGTTCAACCTTCAGTTT
CGGATGAGATGAATGCTAGTCTTGTGCGGCCTTTCCAGCGCGAGGAAGTCCTTCGAGCTTTGCATCAGATCCACCCGAATAAAGCCCCGGGGCCAGATGGGCTGTCTGGA
GCTTTTTACCAGCATTCCTGGTCGATAGTTGGGGCTGATGTGGTTAACTGTTGTCTGAACATCCTGAATAACCGAGCCTCTCTGGCGCCTCTGAATGAAACGATGATTGT
GTTGATCCCTAAGAAGAAGAATCCCAAGTGTATAACGGAGTACATGCCAATCTCTTTGTGTAATGTTTCATACAAAATTGTGTCAAAGGTTTTGGTGAACCGTATGAAGG
GCATACTTAATGAGCTGATTTCTCACAACCAAAGCGCTTTCATTCCAGGGCGTTGCGTCGTTGACAATGCCATCCTGGGGTATGAATGCATTCATGCTTTGAAGGCAAGG
AAGGTGGGGAGATCAGGGTGGGTCTCGCTTAAGTTGGATATGAGCAAAGCCTATGACAGGGTAGAGTGGGTCTTCCTGGAAAGGATGATGCTGAGAATGGGGTTTGCAGT
GGAGTGGGTGGAATTGGTATTACGGTGCATATCATCGGTTCGGTACTCGTTTAATGTGAACGGTGTGAGATGTGGGGATGTCACCCCTAGCAGGGGTCTTCGCCAGGGAG
ATCCGCTCTCCCCGTATCTATTTTTATTGTGTGCTGAAGGGCTCTCCAGTATGTTGCATGATGCAGAAGGGAGGAGGGCTATCACGGGGTTGAGGATAGCGCGTGGGTGC
CCTCCTATCTCCCATTTGTTTTTCGCTGACGACAGTCTCCTCTTCTTTCGGGCTAAAGAGGGTGAAGCCCAGGCGGTGAACAGTATCCTCCAGCATTACGAGCGAGCCTC
CGGGAAAACGATTAATTTTGATAAGTCCATTATCTCATTTAGTCCGAATACTGCAGTGGGGGTTCAATCTCAGGTGAGTCAGTTTTTTCAGGTCCAAGTATCTGCCTGCC
ATCGACAGTATTTAGGTCTGCCGTCTTTTATGCCTCGTAACAGAATGAGCACTCTAAACTTCATTAAGGAGCGGGTGTGGCGTCAGATTCAGGGTTGGAAGGGGAAACTT
TTCTCTGTTGGGGGCAGGGAGAATGAGGATACGGAAGATAGGAGAATCCATTGGGTGAGTTGGAAGACACTGTGTAAGCCAAAATGCTTGAGTGGAATGGGATTCAGAGA
TTTGGAAACGTTCAACCAAGCTCTTTTGGCCAAACAGTGTTGGAGGATTGTTCAACATCCTACCTCTTTTCTATCCCGTGTGTTGAAGGAGCGGTATTTTCCTCATGGAG
ATTTCCTGGAGGCAGGGTGGAATGAGGAGCTTCTTCGACACCACTTCAGCCCTGGCGAGGTACGATCTATCCTTACTATTCCGTTACGACAAGTTTCGGCGGAGGACAAA
ATTATATGGCATTTTGAGAAGTGCGGGATCTACACGGTTAAGAGTGGGTATCGACTTGGCCAGGTGGCCTTGCTTGCCCAAGTCCCGTCGGCGTCCTCGAGTGAGGCGTT
GTCCAGTTGGTGGAAGGGGTGCTGGAAAATGGAGATTCCGAGCAAAATTAAGGTTTTCCTTTGGAGGCTTTGCCTGGAGCGTCTTCCTACTGTTGACAATTTGAGTGTCA
GGGGCATTGATGTACTGAATGTATGTGTGCATTGTGGTTGGCATGGTGAGTCATGCATGCATGTGTTCTGGCAGTGCAAATTTTTTCGTGATGCATTGATGGGATCTGAA
TGGGAGGTTTTGCTACAGAATGTTCAAGCAAATTCTATGCTAAATCTGCTAAGGGATGTGAAGGATAAGAAGTTTAGGGGGCATGTCCCTTCTGCCGGGCTTGTGGATTG
GGCAACGAATTATCTTGCTGTTTTCCGAGGGGCTTCCAGGGCCTGTTGTGAGGACACTAGGGGGGTGCCTCGGACGGGTGCTGGGATTGTTGTTCGGGATGAGATGGGGC
GTGTCATGTTGTCAGCTGCTGTCAGTCATGATCATGTGGGGAATTCAGACTTGGTAGAGGGTCTGGCGGTGGTGGACGGATTGAGACTTGTGGTGGAAATGGGTTTAGCG
CCGGATGCTGCGATGGTGGATTTATCTGAGTTCGGTGTATTGGTGTCTGAGGCTCGGAGGGGGGTACCTGCGCATTTTCAGCTTAGAGTCAGCGTTACAAGGAGGGAAGG
AAACCGTGTTGCCCATGAGTTAGGCCGCCTTGCATTGAGAGAAAGAGATTGTGGGTGGCAGTCGAGGTTGGGAGTTTCGAGAATAGGAGTCATTTCTAAAGAACCAGCTA
CATTGGGAAGGTTGATGGGGGGCCTTTGTATGGCTGTGGATCCTAGGCTGTGGCTGGTTGGTTTAAGCATCAGGTCTCCTGTGGTTGCATGTGTAGGTTGCTTGGTGATC
TGTGTCTGGTTTGCATGCTTAGTGGATCCTAGGTTGAGAACTAGATGTACCCGAGCATCAGCCTCTCTTATGTATGTTTTCGTTGTGTTGCCTGTTATGTTTGGTTGGAG
GGCTTTGCGTCTAGGCAGTGGGCACATGTTTCTGGTGGGGCAATGCCCAAATCTATCTGAACTTGAGATTGTAAAGCTTCCTTTGAGGTGTTCTTGCGTGTGTGAGCGTT
GGGTTTCCCTTAGTTGTCTCGAAAGGCTAGGGCGGTTTTATGAGGCCCATAAGGCTGTTGTCACCTCGATAGGATCAAGCCTCCCACAGGTTAAGGGCTTAGCATGTGTA
GTTCTTGGACTTGTGCATGATAAATGTCGATACATGATGCGGGATCAGTCATGCTGTGGGGCGATGAGCTTGATTAGGGTACAGGGTAGTTGTTGGGTAGAGTCTTGTCT
TGCTAGCGCTTCCTTAACTCCCTTTGGTTTATGTGCAGATATGCTTGGGATGGAAGACGCGAGGAGTTTGGAAGCTAGTTGGATTGTGGATAAAGTCATATGGGTGTCTT
CATGGATCGGGGCGTGTGCTTCATGCGGGGAAGAGTTTGCACAATGGGTGGTGATTTTGACTTATCTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGATCCACGATTTCTTCAAATCTCCAGTCGGCTACGGAAGAGTGTCATTAACAAACCCCATCTTATTCTTGAAAGTGAGTCCAGAACTGAAAGGGAGGACTTGGTCTCT
TTTGAAGCACTTGCGTGGGAATCTAGGGACACCGTGGTTAGTGGGTGGTGACTTTAATGCCATCTTGTCCCATCAGGAGAAGGAAGGTGGCATGGCCAAGGCAGAGGCAG
AATTGTCAGGGGGGGCTGCCTTTACCTGGTGCAATGGGAGGCCGGGGGAGGAAGTAGTTTGGGAGAGGATTGACAGATGTCTGAGCAATGTTGCTTGGCAGGAGTTGTTC
CCAGTGCATGAGTTGACACATCTGGACTTTAGTCGGTCGGACCACAGGCCTCTATTGCTCTCGATGTCAGATAGTGCCCGGGTGGTTCGTAGCGCAGGTCAGAGGATTCA
GAGGTTTGAGGAAACTTGGCTCCAGTCCACTGGCTTTGGGGAGGTAGTGTCAGCAGGTTGGAGGTCTGAGGTTCCCGTTAGGTCGCCTATAGAGTTGGCATCAGCAACGT
CCCGGTGTATGTCACATCTGAGCAGTTGGGGAAGGCGGAAGAATGGTAATCTTGGTCAGCGTATTCAGGCGGCTACAGCAAATGTTCAACAAGCAATGGGTAGGCTGGGC
TCTTTGGATTCCCGAGCAGATCTACAGAGGGCAGAAGGGCAGTTGGAATCTCTTCTTGTCGAGGAGGAGGTGTACTGGAGACAAAGTAACGGTGTGTGGCAACAGGAGCC
AGACAGGGTGTTGGGCCTGATAGAGGGGTATTTTGAGAGCATCTTCTCGACATCCGCCCCTTCTGAGGGTGAGATTGATCAGGTGACATCCCGAGTTCAACCTTCAGTTT
CGGATGAGATGAATGCTAGTCTTGTGCGGCCTTTCCAGCGCGAGGAAGTCCTTCGAGCTTTGCATCAGATCCACCCGAATAAAGCCCCGGGGCCAGATGGGCTGTCTGGA
GCTTTTTACCAGCATTCCTGGTCGATAGTTGGGGCTGATGTGGTTAACTGTTGTCTGAACATCCTGAATAACCGAGCCTCTCTGGCGCCTCTGAATGAAACGATGATTGT
GTTGATCCCTAAGAAGAAGAATCCCAAGTGTATAACGGAGTACATGCCAATCTCTTTGTGTAATGTTTCATACAAAATTGTGTCAAAGGTTTTGGTGAACCGTATGAAGG
GCATACTTAATGAGCTGATTTCTCACAACCAAAGCGCTTTCATTCCAGGGCGTTGCGTCGTTGACAATGCCATCCTGGGGTATGAATGCATTCATGCTTTGAAGGCAAGG
AAGGTGGGGAGATCAGGGTGGGTCTCGCTTAAGTTGGATATGAGCAAAGCCTATGACAGGGTAGAGTGGGTCTTCCTGGAAAGGATGATGCTGAGAATGGGGTTTGCAGT
GGAGTGGGTGGAATTGGTATTACGGTGCATATCATCGGTTCGGTACTCGTTTAATGTGAACGGTGTGAGATGTGGGGATGTCACCCCTAGCAGGGGTCTTCGCCAGGGAG
ATCCGCTCTCCCCGTATCTATTTTTATTGTGTGCTGAAGGGCTCTCCAGTATGTTGCATGATGCAGAAGGGAGGAGGGCTATCACGGGGTTGAGGATAGCGCGTGGGTGC
CCTCCTATCTCCCATTTGTTTTTCGCTGACGACAGTCTCCTCTTCTTTCGGGCTAAAGAGGGTGAAGCCCAGGCGGTGAACAGTATCCTCCAGCATTACGAGCGAGCCTC
CGGGAAAACGATTAATTTTGATAAGTCCATTATCTCATTTAGTCCGAATACTGCAGTGGGGGTTCAATCTCAGGTGAGTCAGTTTTTTCAGGTCCAAGTATCTGCCTGCC
ATCGACAGTATTTAGGTCTGCCGTCTTTTATGCCTCGTAACAGAATGAGCACTCTAAACTTCATTAAGGAGCGGGTGTGGCGTCAGATTCAGGGTTGGAAGGGGAAACTT
TTCTCTGTTGGGGGCAGGGAGAATGAGGATACGGAAGATAGGAGAATCCATTGGGTGAGTTGGAAGACACTGTGTAAGCCAAAATGCTTGAGTGGAATGGGATTCAGAGA
TTTGGAAACGTTCAACCAAGCTCTTTTGGCCAAACAGTGTTGGAGGATTGTTCAACATCCTACCTCTTTTCTATCCCGTGTGTTGAAGGAGCGGTATTTTCCTCATGGAG
ATTTCCTGGAGGCAGGGTGGAATGAGGAGCTTCTTCGACACCACTTCAGCCCTGGCGAGGTACGATCTATCCTTACTATTCCGTTACGACAAGTTTCGGCGGAGGACAAA
ATTATATGGCATTTTGAGAAGTGCGGGATCTACACGGTTAAGAGTGGGTATCGACTTGGCCAGGTGGCCTTGCTTGCCCAAGTCCCGTCGGCGTCCTCGAGTGAGGCGTT
GTCCAGTTGGTGGAAGGGGTGCTGGAAAATGGAGATTCCGAGCAAAATTAAGGTTTTCCTTTGGAGGCTTTGCCTGGAGCGTCTTCCTACTGTTGACAATTTGAGTGTCA
GGGGCATTGATGTACTGAATGTATGTGTGCATTGTGGTTGGCATGGTGAGTCATGCATGCATGTGTTCTGGCAGTGCAAATTTTTTCGTGATGCATTGATGGGATCTGAA
TGGGAGGTTTTGCTACAGAATGTTCAAGCAAATTCTATGCTAAATCTGCTAAGGGATGTGAAGGATAAGAAGTTTAGGGGGCATGTCCCTTCTGCCGGGCTTGTGGATTG
GGCAACGAATTATCTTGCTGTTTTCCGAGGGGCTTCCAGGGCCTGTTGTGAGGACACTAGGGGGGTGCCTCGGACGGGTGCTGGGATTGTTGTTCGGGATGAGATGGGGC
GTGTCATGTTGTCAGCTGCTGTCAGTCATGATCATGTGGGGAATTCAGACTTGGTAGAGGGTCTGGCGGTGGTGGACGGATTGAGACTTGTGGTGGAAATGGGTTTAGCG
CCGGATGCTGCGATGGTGGATTTATCTGAGTTCGGTGTATTGGTGTCTGAGGCTCGGAGGGGGGTACCTGCGCATTTTCAGCTTAGAGTCAGCGTTACAAGGAGGGAAGG
AAACCGTGTTGCCCATGAGTTAGGCCGCCTTGCATTGAGAGAAAGAGATTGTGGGTGGCAGTCGAGGTTGGGAGTTTCGAGAATAGGAGTCATTTCTAAAGAACCAGCTA
CATTGGGAAGGTTGATGGGGGGCCTTTGTATGGCTGTGGATCCTAGGCTGTGGCTGGTTGGTTTAAGCATCAGGTCTCCTGTGGTTGCATGTGTAGGTTGCTTGGTGATC
TGTGTCTGGTTTGCATGCTTAGTGGATCCTAGGTTGAGAACTAGATGTACCCGAGCATCAGCCTCTCTTATGTATGTTTTCGTTGTGTTGCCTGTTATGTTTGGTTGGAG
GGCTTTGCGTCTAGGCAGTGGGCACATGTTTCTGGTGGGGCAATGCCCAAATCTATCTGAACTTGAGATTGTAAAGCTTCCTTTGAGGTGTTCTTGCGTGTGTGAGCGTT
GGGTTTCCCTTAGTTGTCTCGAAAGGCTAGGGCGGTTTTATGAGGCCCATAAGGCTGTTGTCACCTCGATAGGATCAAGCCTCCCACAGGTTAAGGGCTTAGCATGTGTA
GTTCTTGGACTTGTGCATGATAAATGTCGATACATGATGCGGGATCAGTCATGCTGTGGGGCGATGAGCTTGATTAGGGTACAGGGTAGTTGTTGGGTAGAGTCTTGTCT
TGCTAGCGCTTCCTTAACTCCCTTTGGTTTATGTGCAGATATGCTTGGGATGGAAGACGCGAGGAGTTTGGAAGCTAGTTGGATTGTGGATAAAGTCATATGGGTGTCTT
CATGGATCGGGGCGTGTGCTTCATGCGGGGAAGAGTTTGCACAATGGGTGGTGATTTTGACTTATCTTTGA
Protein sequenceShow/hide protein sequence
MIHDFFKSPVGYGRVSLTNPILFLKVSPELKGRTWSLLKHLRGNLGTPWLVGGDFNAILSHQEKEGGMAKAEAELSGGAAFTWCNGRPGEEVVWERIDRCLSNVAWQELF
PVHELTHLDFSRSDHRPLLLSMSDSARVVRSAGQRIQRFEETWLQSTGFGEVVSAGWRSEVPVRSPIELASATSRCMSHLSSWGRRKNGNLGQRIQAATANVQQAMGRLG
SLDSRADLQRAEGQLESLLVEEEVYWRQSNGVWQQEPDRVLGLIEGYFESIFSTSAPSEGEIDQVTSRVQPSVSDEMNASLVRPFQREEVLRALHQIHPNKAPGPDGLSG
AFYQHSWSIVGADVVNCCLNILNNRASLAPLNETMIVLIPKKKNPKCITEYMPISLCNVSYKIVSKVLVNRMKGILNELISHNQSAFIPGRCVVDNAILGYECIHALKAR
KVGRSGWVSLKLDMSKAYDRVEWVFLERMMLRMGFAVEWVELVLRCISSVRYSFNVNGVRCGDVTPSRGLRQGDPLSPYLFLLCAEGLSSMLHDAEGRRAITGLRIARGC
PPISHLFFADDSLLFFRAKEGEAQAVNSILQHYERASGKTINFDKSIISFSPNTAVGVQSQVSQFFQVQVSACHRQYLGLPSFMPRNRMSTLNFIKERVWRQIQGWKGKL
FSVGGRENEDTEDRRIHWVSWKTLCKPKCLSGMGFRDLETFNQALLAKQCWRIVQHPTSFLSRVLKERYFPHGDFLEAGWNEELLRHHFSPGEVRSILTIPLRQVSAEDK
IIWHFEKCGIYTVKSGYRLGQVALLAQVPSASSSEALSSWWKGCWKMEIPSKIKVFLWRLCLERLPTVDNLSVRGIDVLNVCVHCGWHGESCMHVFWQCKFFRDALMGSE
WEVLLQNVQANSMLNLLRDVKDKKFRGHVPSAGLVDWATNYLAVFRGASRACCEDTRGVPRTGAGIVVRDEMGRVMLSAAVSHDHVGNSDLVEGLAVVDGLRLVVEMGLA
PDAAMVDLSEFGVLVSEARRGVPAHFQLRVSVTRREGNRVAHELGRLALRERDCGWQSRLGVSRIGVISKEPATLGRLMGGLCMAVDPRLWLVGLSIRSPVVACVGCLVI
CVWFACLVDPRLRTRCTRASASLMYVFVVLPVMFGWRALRLGSGHMFLVGQCPNLSELEIVKLPLRCSCVCERWVSLSCLERLGRFYEAHKAVVTSIGSSLPQVKGLACV
VLGLVHDKCRYMMRDQSCCGAMSLIRVQGSCWVESCLASASLTPFGLCADMLGMEDARSLEASWIVDKVIWVSSWIGACASCGEEFAQWVVILTYL