CuGenDBv2

Lag0036163 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0036163
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr3:40764973..40768307
RNA-Seq Expression: Lag0036163
Synteny: Lag0036163
Gene Ontology terms: NA
InterPro domains:
    IPR000477 - Reverse transcriptase domain
    IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
    IPR043502 - DNA/RNA polymerase superfamily


Homology

GenBank top hits (e value; %identity; alignment)
CAA7055450.1 unnamed protein product [Microthlaspi erraticum] (e value: 1.3e-116; %identity: 36.55)
Query:  GEIFTWHGNRRGIQVWERLDRFPCNSELDDLFNSMGVSNLDWYHSDHRPIEVRMENCSFKKSGRRFKSFKFEEYWTNYEECANIIARNGDRSGYPSSLTS
        G +FTW G R    +  RLDR   N E   +F +     LD   SDHRP+     N  F                                         
Subjt:  GEIFTWHGNRRGIQVWERLDRFPCNSELDDLFNSMGVSNLDWYHSDHRPIEVRMENCSFKKSGRRFKSFKFEEYWTNYEECANIIARNGDRSGYPSSLTS

Query:  LDSNGEWSEDDSIIENMFISDFENLFSSSHLEPSSIAKALDGLQPKVDQIMNNRLTAPFSKSEVEYAVKQMFPTKAPTKMG-------------------
        LD NG   + ++    +    F+ LFSSS+  P++  +     +P++ + MN+ L    + +E++ AV  +  + AP   G                   
Subjt:  LDSNGEWSEDDSIIENMFISDFENLFSSSHLEPSSIAKALDGLQPKVDQIMNNRLTAPFSKSEVEYAVKQMFPTKAPTKMG-------------------

Query:  ---RLISENILIGLESINAIKNSKYTDIDMAALKVDLSKAYDWVEWSFLKEIMLKLGFDHRWVEFIMKCISSAEFSVLINGDPKGKFSSSRGLRQGDPLS
           RLIS+NI+I  E I+ ++       +  ALK D+SKAYD VEWS+L+ ++  LGF  RWVE++M C+SS  FSVLIN  P G  +  RGLR+GDPLS
Subjt:  ---RLISENILIGLESINAIKNSKYTDIDMAALKVDLSKAYDWVEWSFLKEIMLKLGFDHRWVEFIMKCISSAEFSVLINGDPKGKFSSSRGLRQGDPLS

Query:  PYMFLLVVEGLSHLISMANRLGHLTRLSCS-NGPLVSHLLFADDSLIFCRAKELELVNLKNLLKTYELASGECINYSKSAILF-SKVNNDRQRFLSSILG
        P++F+L  EGL+HL++ A RLG +  +  S  GP + HLLFADDSL  C+A + +   LK  LK Y  A+G+ +N SKS+I F +K+  D++  + + LG
Subjt:  PYMFLLVVEGLSHLISMANRLGHLTRLSCS-NGPLVSHLLFADDSLIFCRAKELELVNLKNLLKTYELASGECINYSKSAILF-SKVNNDRQRFLSSILG

Query:  VNTVTDFGKYLGVPSVLSRHKSKDLGYLMDKVWKSVQGWKTSLFSIAGKETLIKSIGQAIPTYVMSVFKYPKSICDEIAKSFARFWWGSNNNKRKMHWCK
        ++     G YLG+P   S  K   L Y+ D V K +  W + + S  GKE L+KS+  A+P Y MS FK PK +C ++  + A FWW S  +KRK HW  
Subjt:  VNTVTDFGKYLGVPSVLSRHKSKDLGYLMDKVWKSVQGWKTSLFSIAGKETLIKSIGQAIPTYVMSVFKYPKSICDEIAKSFARFWWGSNNNKRKMHWCK

Query:  WEKLCLPKSLGGLNFRDIEGFNQALIAKQVWCIFSRPDSLLSRFFKSIYFSSSKILSAGLGSNPTYLWRSLLWGRNLLIKGLRNRVGNGLSVAAFISP--
        WE++CLPK +GG+ FRDIE FNQAL+AKQ W +   P+SLL +F KS YFSS+  L + +GS P+Y WRS++WGR LL+KGLR RVGNG ++  +  P  
Subjt:  WEKLCLPKSLGGLNFRDIEGFNQALIAKQVWCIFSRPDSLLSRFFKSIYFSSSKILSAGLGSNPTYLWRSLLWGRNLLIKGLRNRVGNGLSVAAFISP--

Query:  ----------------------------SGDWNLERLKMAVSRDDLETIRRVPINGILEDKIVWHYDGTENYMVKSGYKLFRNI
                                    S  W+L +L      +D+  +         ED  VW ++ +  Y VKSGY L   I
Subjt:  ----------------------------SGDWNLERLKMAVSRDDLETIRRVPINGILEDKIVWHYDGTENYMVKSGYKLFRNI

XP_006491472.1 uncharacterized protein LOC102626455 [Citrus sinensis] (e value: 7.1e-118; %identity: 31.81)
Query:  DSNGEWSEDDSIIENMFISDFENLFSSSHLEPSSIAKALDGLQPKVDQIMNNRLTAPFSKSEVEYAVKQMFPTKAPTKMG--------------------
        D  G W +D   IE  F   F+ LF+SS+   + I++AL GL PKV Q MN  L  PF+  ++  A+ +M PTKAP   G                    
Subjt:  DSNGEWSEDDSIIENMFISDFENLFSSSHLEPSSIAKALDGLQPKVDQIMNNRLTAPFSKSEVEYAVKQMFPTKAPTKMG--------------------

Query:  -------------------------------------------------------------------------RLISENILIGLESINAIKNSKYTDIDM
                                                                                 RLI++N++IG E ++ I+ SK     +
Subjt:  -------------------------------------------------------------------------RLISENILIGLESINAIKNSKYTDIDM

Query:  AALKVDLSKAYDWVEWSFLKEIMLKLGFDHRWVEFIMKCISSAEFSVLINGDPKGKFSSSRGLRQGDPLSPYMFLLVVEGLSHLISMANRLGHLTRLSCS
         ALK+D+SKAYD VEW+FL++ M  LGF  +W+  IM CI++  FSVLING+P G     RGLRQG PLSPY+F+L  E  S+L++ A R   +  L  +
Subjt:  AALKVDLSKAYDWVEWSFLKEIMLKLGFDHRWVEFIMKCISSAEFSVLINGDPKGKFSSSRGLRQGDPLSPYMFLLVVEGLSHLISMANRLGHLTRLSCS

Query:  NGPLVSHLLFADDSLIFCRAKELELVNLKNLLKTYELASGECINYSKSAILFS-KVNNDRQRFLSSILGVNTVTDFGKYLGVPSVLSRHKSKDLGYLMDK
            ++HLLFADDSL+F +A   +   LK +   Y  ASG+  N+ KS++ FS K ++++   + SI  +  V  + KYLG+P +L R+K      +  K
Subjt:  NGPLVSHLLFADDSLIFCRAKELELVNLKNLLKTYELASGECINYSKSAILFS-KVNNDRQRFLSSILGVNTVTDFGKYLGVPSVLSRHKSKDLGYLMDK

Query:  VWKSVQGWKTSLFSIAGKETLIKSIGQAIPTYVMSVFKYPKSICDEIAKSFARFWWGSNNNKRKMHWCKWEKLCLPKSLGGLNFRDIEGFNQALIAKQVW
        V   +  W   LFS  GKE LIK++ QA+P Y MSVFK PK +C++I K  ARFWWG+  +K  +HW +W+ +   K  GGL FRD+  FNQAL+AKQ W
Subjt:  VWKSVQGWKTSLFSIAGKETLIKSIGQAIPTYVMSVFKYPKSICDEIAKSFARFWWGSNNNKRKMHWCKWEKLCLPKSLGGLNFRDIEGFNQALIAKQVW

Query:  CIFSRPDSLLSRFFKSIYFSSSKILSAGLGSNPTYLWRSLLWGRNLLIKGLRNRVGNGLSVAAF-------------ISP---------------SGDWN
         +   P+SL++R  K+ Y+ +S   +A +GSNP+++WRS+LWG  ++ KG+R R+G+G  V  +             ISP                  W 
Subjt:  CIFSRPDSLLSRFFKSIYFSSSKILSAGLGSNPTYLWRSLLWGRNLLIKGLRNRVGNGLSVAAF-------------ISP---------------SGDWN

Query:  LERLKMAVSRDDLETIRRVPI-NGILEDKIVWHYDGTENYMVKSGYKLF--RNIKIDGISSGSSSMHQEI--------------WRRIFN----------
        ++RL+    ++D+E I ++ + +G  ED+++WH+D    Y VKSGY+L   +N   +  SS SSS   +I              WR + N          
Subjt:  LERLKMAVSRDDLETIRRVPI-NGILEDKIVWHYDGTENYMVKSGYKLF--RNIKIDGISSGSSSMHQEI--------------WRRIFN----------

Query:  --------------------RVFLD------------------EEFNGSFVDRWLKIDSNSSLAEMELVATTCWSIWNDRNRLIHGDQIPDVNFKCQWIS
                             V ++                  ++ N  F     ++ S SS AE EL+   CW IW+ RN+ I   +  D  F      
Subjt:  --------------------RVFLD------------------EEFNGSFVDRWLKIDSNSSLAEMELVATTCWSIWNDRNRLIHGDQIPDVNFKCQWIS

Query:  NYLEKYLQ----ANLKNKSSINLDPQKVRSQIPDRRNKWMPPPENCWKTNVD
        + L+ Y +     N+       +D Q           KW PP +N  K NVD
Subjt:  NYLEKYLQ----ANLKNKSSINLDPQKVRSQIPDRRNKWMPPPENCWKTNVD

XP_021737156.1 uncharacterized protein LOC110703671 [Chenopodium quinoa] (e value: 6.9e-121; %identity: 42.41)
Query:  ENLFSSSHLEPSSIAKALDGLQPKVDQIMNNRLTAPFSKSEVEYAVKQMFPTKAPTKM-----GRLISENILIGLESINAIKNSKYTDIDMAALKVDLSK
        + LF+SS   PS   +AL G+  +V   MN+RL AP++  EV  A+KQM P K P +M     GRLI++NIL+  E    +KNS  ++    ALK+D++K
Subjt:  ENLFSSSHLEPSSIAKALDGLQPKVDQIMNNRLTAPFSKSEVEYAVKQMFPTKAPTKM-----GRLISENILIGLESINAIKNSKYTDIDMAALKVDLSK

Query:  AYDWVEWSFLKEIMLKLGFDHRWVEFIMKCISSAEFSVLINGDPKGKFSSSRGLRQGDPLSPYMFLLVVEGLSHLISMANRLGHLTRLSCSN-GPLVSHL
        AYD VEWSFL+ +M+K GF  +WV+ ++ CIS+  FSVLING P G+F+ SRGLRQGDPLSPY+F+L  E LS L++ A  LGHL  +  +   P +SHL
Subjt:  AYDWVEWSFLKEIMLKLGFDHRWVEFIMKCISSAEFSVLINGDPKGKFSSSRGLRQGDPLSPYMFLLVVEGLSHLISMANRLGHLTRLSCSN-GPLVSHL

Query:  LFADDSLIFCRAKELELVNLKNLLKTYELASGECINYSKSAILFSK-VNNDRQRFLSSILGVNTVTDFGKYLGVPSVLSRHKSKDLGYLMDKVWKSVQGW
         FADDS+IF +A   E+  ++ +L TYE+ASG+ +N+ K+ + FSK V  DR+  L++ L VN V    KYLG+P+V+ R K      + +K+WK +QGW
Subjt:  LFADDSLIFCRAKELELVNLKNLLKTYELASGECINYSKSAILFSK-VNNDRQRFLSSILGVNTVTDFGKYLGVPSVLSRHKSKDLGYLMDKVWKSVQGW

Query:  KTSLFSIAGKETLIKSIGQAIPTYVMSVFKYPKSICDEIAKSFARFWWGSNNNKRKMHWCKWEKLCLPKSLGGLNFRDIEGFNQALIAKQVWCIFSRPDS
        K      AG++ LIK++ Q+IPTY MSVFK+P S CDE+    A FWW   N +RK+HW  W+KLC PK+ GGL FRD + FN AL+ KQ W +   PDS
Subjt:  KTSLFSIAGKETLIKSIGQAIPTYVMSVFKYPKSICDEIAKSFARFWWGSNNNKRKMHWCKWEKLCLPKSLGGLNFRDIEGFNQALIAKQVWCIFSRPDS

Query:  LLSRFFKSIYFSSSKILSAGLGSNPTYLWRSLLWGRNLLIKGLRNRVGNGLSVAAF-------------ISPSG-----------------DWNLERLKM
        L++R  K+ YF+    L A LG+ P+Y WR +   R ++ +GLR RVG+G S+  +             I+P G                  WN+ ++K 
Subjt:  LLSRFFKSIYFSSSKILSAGLGSNPTYLWRSLLWGRNLLIKGLRNRVGNGLSVAAF-------------ISPSG-----------------DWNLERLKM

Query:  AVSRDDLETIRRVPIN-GILEDKIVWHYDGTENYMVKSGYK-LFRNIKIDGISSGSSSMHQEIWRRIFNRVFL
             + E I  +P++  +L+D I W  +   +Y V+S Y+ +F + + D  SSGSSS+  + W +++  V L
Subjt:  AVSRDDLETIRRVPIN-GILEDKIVWHYDGTENYMVKSGYK-LFRNIKIDGISSGSSSMHQEIWRRIFNRVFL

XP_024044510.1 uncharacterized protein LOC112100177 [Citrus clementina] (e value: 7.9e-117; %identity: 34.72)
Query:  GEIFTWHGNRRGIQ-VWERLDRFPCNSELDDLFNSMGVSNLDWYHSDHRPIEVRMENCSFKKSGRRFKSFKFEEYWTNYEECANIIARNGDRSGYPSSLT
        G  FTW   R G   + E+LDRF CN +    +     SNL  + SDH PI + +    + K G R   +        +   A+   +     G      
Subjt:  GEIFTWHGNRRGIQ-VWERLDRFPCNSELDDLFNSMGVSNLDWYHSDHRPIEVRMENCSFKKSGRRFKSFKFEEYWTNYEECANIIARNGDRSGYPSSLT

Query:  SLDSNGEWSEDDSIIENMFISDFENLFSSSHLEPSSIAKALDGLQPKVDQIMNNRLTAPFSKSEVEYAVKQMFPTKAPTKMG------------------
          D NG W+ED  ++E +F   F N+F++S   P+ +  AL+ +  KV   MNN+L   F+K E+  A+ QM PTKAP   G                  
Subjt:  SLDSNGEWSEDDSIIENMFISDFENLFSSSHLEPSSIAKALDGLQPKVDQIMNNRLTAPFSKSEVEYAVKQMFPTKAPTKMG------------------

Query:  ---------------------------------------------------------------------------RLISENILIGLESINAIKNSKYTDI
                                                                                   RLI++NI+IG E +N I+ S+ +  
Subjt:  ---------------------------------------------------------------------------RLISENILIGLESINAIKNSKYTDI

Query:  DMAALKVDLSKAYDWVEWSFLKEIMLKLGFDHRWVEFIMKCISSAEFSVLINGDPKGKFSSSRGLRQGDPLSPYMFLLVVEGLSHLISMANRLGHLTRLS
         + A+K+D+SKAYD VEWSF+K  M KLGF  R V+ +M CIS+A FSVLING  KG     RGLRQG P SPY+F++  +  S L+  A        L 
Subjt:  DMAALKVDLSKAYDWVEWSFLKEIMLKLGFDHRWVEFIMKCISSAEFSVLINGDPKGKFSSSRGLRQGDPLSPYMFLLVVEGLSHLISMANRLGHLTRLS

Query:  CSNGPLVSHLLFADDSLIFCRAKELELVNLKNLLKTYELASGECINYSKSAILFSKVNNDR-QRFLSSILGVNTVTDFGKYLGVPSVLSRHKSKDLGYLM
         S    V+H LFADDSL+F RA   E   LK +   Y  ASG+  NY KS++ FS   + +    +S+I  +  V+   KYLG+PS++ R K      + 
Subjt:  CSNGPLVSHLLFADDSLIFCRAKELELVNLKNLLKTYELASGECINYSKSAILFSKVNNDR-QRFLSSILGVNTVTDFGKYLGVPSVLSRHKSKDLGYLM

Query:  DKVWKSVQGWKTSLFSIAGKETLIKSIGQAIPTYVMSVFKYPKSICDEIAKSFARFWWGSNNNKRKMHWCKWEKLCLPKSLGGLNFRDIEGFNQALIAKQ
         +V   +  W++  FS  GKE LIK++ QA+P Y MSVFK P+++C ++ K+ ARFWWGS   K+ +HW  W +LC  K+ GGL FRD+  FNQALIAKQ
Subjt:  DKVWKSVQGWKTSLFSIAGKETLIKSIGQAIPTYVMSVFKYPKSICDEIAKSFARFWWGSNNNKRKMHWCKWEKLCLPKSLGGLNFRDIEGFNQALIAKQ

Query:  VWCIFSRPDSLLSRFFKSIYFSSSKILSAGLGSNPTYLWRSLLWGRNLLIKGLRNRVGNGLSVAAFISPSGDWNLERLKMAVSRDDLETIRRVPINGILE
         W I  +PDSL+++  K+ YF  +  + A LGSNP+++WRS+LWGR+LL +GL   +     VA  I     WN ER+  + S +    I R+P+  I +
Subjt:  VWCIFSRPDSLLSRFFKSIYFSSSKILSAGLGSNPTYLWRSLLWGRNLLIKGLRNRVGNGLSVAAFISPSGDWNLERLKMAVSRDDLETIRRVPINGILE

Query:  -DKIVWHYDGTENYMVKSGYKLFRNIKI---DGISSGSSSMHQEIW
         D ++W +D    Y  KSGY++    K       S  S S    +W
Subjt:  -DKIVWHYDGTENYMVKSGYKLFRNIKI---DGISSGSSSMHQEIW

XP_024956542.1 uncharacterized protein LOC112498908 [Citrus sinensis] (e value: 3.4e-120; %identity: 34.81)
Query:  WERLDRFPCNSELDDLFNSMGVSNLDWYHSDHRPI--EVRMENCSFKKSGRRFKSFKFEEYWTNYEECANIIAR--------------------------
        W  L R    +E  ++F     SNLD + SDH P+  EV+  N      GR      +++ W++Y+EC  II                            
Subjt:  WERLDRFPCNSELDDLFNSMGVSNLDWYHSDHRPI--EVRMENCSFKKSGRRFKSFKFEEYWTNYEECANIIAR--------------------------

Query:  ------NGDRSGYPSSLTSL-------------------------DSNGEWSEDDSIIENMFISDFENLFSSSHLEPSSIAKALDGLQPKVDQIMNNRLT
               G+       L  L                         D++G W+E    IE  F S F NLF+++      I+ A++G+  +V + MN  L 
Subjt:  ------NGDRSGYPSSLTSL-------------------------DSNGEWSEDDSIIENMFISDFENLFSSSHLEPSSIAKALDGLQPKVDQIMNNRLT

Query:  APFSKSEVEYAVKQMFPTK--------------APTKMG----RLISENILIGLESINAIKNSKYTDIDMAALKVDLSKAYDWVEWSFLKEIMLKLGFDH
         PFS  EV  A+  M   K              +PT+      RLI++NI++G E ++ I++ K     + ALK+D+SKAYD +EWSFL++IM +LGF H
Subjt:  APFSKSEVEYAVKQMFPTK--------------APTKMG----RLISENILIGLESINAIKNSKYTDIDMAALKVDLSKAYDWVEWSFLKEIMLKLGFDH

Query:  RWVEFIMKCISSAEFSVLINGDPKGKFSSSRGLRQGDPLSPYMFLLVVEGLSHLISMANRLGHLTRLSCSNGPLVSHLLFADDSLIFCRAKELELVNLKN
        +W+  IM+CISS  FSV+ING  KG     RGLRQG P+SPY+F+   E  S L+  A +   +  LS      +SHLLFADDSLIFCRA   +   LK 
Subjt:  RWVEFIMKCISSAEFSVLINGDPKGKFSSSRGLRQGDPLSPYMFLLVVEGLSHLISMANRLGHLTRLSCSNGPLVSHLLFADDSLIFCRAKELELVNLKN

Query:  LLKTYELASGECINYSKSAILFS-KVNNDRQRFLSSILGVNTVTDFGKYLGVPSVLSRHKSKDLGYLMDKVWKSVQGWKTSLFSIAGKETLIKSIGQAIP
        +L  Y LASG+  N+ KS++  S  V   +   + +I  +N V+ +  YLG+P+++ R +S     +  KV   +  W+   FS  GKE LIK+  QAIP
Subjt:  LLKTYELASGECINYSKSAILFS-KVNNDRQRFLSSILGVNTVTDFGKYLGVPSVLSRHKSKDLGYLMDKVWKSVQGWKTSLFSIAGKETLIKSIGQAIP

Query:  TYVMSVFKYPKSICDEIAKSFARFWWGSNNNKRKMHWCKWEKLCLPKSLGGLNFRDIEGFNQALIAKQVWCIFSRPDSLLSRFFKSIYFSSSKILSAGLG
         + MSVFK P  IC++I +    FWWGS+  +R +HW KWEK+   K  GG+ FRD   FNQAL+AKQ W I   PDSL++R  ++ YF SS  L+A +G
Subjt:  TYVMSVFKYPKSICDEIAKSFARFWWGSNNNKRKMHWCKWEKLCLPKSLGGLNFRDIEGFNQALIAKQVWCIFSRPDSLLSRFFKSIYFSSSKILSAGLG

Query:  SNPTYLWRSLLWGRNLLIKGLRNRVGNGLS----------------------------VAAFISPSGDWNLERLKMAVSRDDLETIRRVPI-NGILEDKI
        SNP+Y+WRS+LWGR ++  G R R+GNG                              V+  I+    W+ E +     + D + I ++P+   + ED++
Subjt:  SNPTYLWRSLLWGRNLLIKGLRNRVGNGLS----------------------------VAAFISPSGDWNLERLKMAVSRDDLETIRRVPI-NGILEDKI

Query:  VWHYDGTENYMVKSGYKLFRNIKIDGISSGSSSMHQEIWRRIFN
        +WH+  +  Y VKSGY+    I+   + S S S   E W  I++
Subjt:  VWHYDGTENYMVKSGYKLFRNIKIDGISSGSSSMHQEIWRRIFN

TrEMBL top hits (e value; %identity; alignment)
A0A2N9GW67 Uncharacterized protein (e value: 1.3e-120; %identity: 35.68)
Query:  DSNGEWSEDDSIIENMFISDFENLFSSSHLEPSSIAKALDGLQPKVDQIMNNRLTAPFSKSEVEYAVKQMFPTKAPTK--------MGRLISENILIGLE
        DS G W  + + I N+ +  F NLF+SS+  P  I + +D +   V   MN+ L+  F+  E+ +A+ QM P+KAP           GRLIS+N+++  E
Subjt:  DSNGEWSEDDSIIENMFISDFENLFSSSHLEPSSIAKALDGLQPKVDQIMNNRLTAPFSKSEVEYAVKQMFPTKAPTK--------MGRLISENILIGLE

Query:  SINAIKNSKYTDIDMAALKVDLSKAYDWVEWSFLKEIMLKLGFDHRWVEFIMKCISSAEFSVLINGDPKGKFSSSRGLRQGDPLSPYMFLLVVEGLSHLI
         ++ +KN         A K+D+SKAYD VEW+FL+ I+LKLGF  RWV  +M C+SS  F+V++NG P G    SRGLRQGDPLSPY+FLL  EGLS LI
Subjt:  SINAIKNSKYTDIDMAALKVDLSKAYDWVEWSFLKEIMLKLGFDHRWVEFIMKCISSAEFSVLINGDPKGKFSSSRGLRQGDPLSPYMFLLVVEGLSHLI

Query:  SMANRLGHLTRLS-CSNGPLVSHLLFADDSLIFCRAKELELVNLKNLLKTYELASGECINYSKSAILFSKVNNDRQRF-LSSILGVNTVTDFGKYLGVPS
          A R   +  ++ C  GP +SHL FADDS+IFCRA + E   L+ +LK YE ASG+ IN  K+A  FSK      R  + S+ G + V+ F KYLG+P 
Subjt:  SMANRLGHLTRLS-CSNGPLVSHLLFADDSLIFCRAKELELVNLKNLLKTYELASGECINYSKSAILFSKVNNDRQRF-LSSILGVNTVTDFGKYLGVPS

Query:  VLSRHKSKDLGYLMDKVWKSVQGWKTSLFSIAGKETLIKSIGQAIPTYVMSVFKYPKSICDEIAKSFARFWWGSNNNKRKMHWCKWEKLCLPKSLGGLNF
        +L R K +    + D++WK +QGWK +L S AG+E LIK++ QA+P Y M  FK P  +C+EI     +FWWG    +RK+HW   +KL  PK  GG+ F
Subjt:  VLSRHKSKDLGYLMDKVWKSVQGWKTSLFSIAGKETLIKSIGQAIPTYVMSVFKYPKSICDEIAKSFARFWWGSNNNKRKMHWCKWEKLCLPKSLGGLNF

Query:  RDIEGFNQALIAKQVWCIFSRPDSLLSRFFKSIYFSSSKILSAGLGSNPTYLWRSLLWGRNLLIKGLRNRVGNGLSVAAF-------------ISPSG--
        RD++ FN+AL+A+Q W +   P SL+ RF KS YF  +  L A L  N +Y+WRS+   R +L  GLR RVGNG  +  +             ISP+   
Subjt:  RDIEGFNQALIAKQVWCIFSRPDSLLSRFFKSIYFSSSKILSAGLGSNPTYLWRSLLWGRNLLIKGLRNRVGNGLSVAAF-------------ISPSG--

Query:  ---------------DWNLERLKMAVSRDDLETIRRVPIN-GILEDKIVWHYDGTENYMVKSGYKLFRNIKIDGISSGSSSMHQ---EIWRRIFN-----
                        WNL++L+  +   D+E I+++P++     DK++W    + N+ VKS YKL  N     ++SGSSS       +W RI++     
Subjt:  ---------------DWNLERLKMAVSRDDLETIRRVPIN-GILEDKIVWHYDGTENYMVKSGYKLFRNIKIDGISSGSSSMHQ---EIWRRIFN-----

Query:  --RVFL---------------DEEFNGSFVDRWLKIDSNSSL-------------------------------------------AEMELVATTCWSIWN
          R+F+               D+    S    W + +  SS                                            A+ E++ TT W IWN
Subjt:  --RVFL---------------DEEFNGSFVDRWLKIDSNSSL-------------------------------------------AEMELVATTCWSIWN

Query:  DRNRLIHGDQIPDVNFKCQWISNYLEKYLQANLKNKSSINLDPQKVRSQIPDRRNKWMPPPENCWKTN
         RNR    +    VN   Q  +     + +A L+N++            +    ++W PP +  +K N
Subjt:  DRNRLIHGDQIPDVNFKCQWISNYLEKYLQANLKNKSSINLDPQKVRSQIPDRRNKWMPPPENCWKTN

A0A2N9J3G2 Fe2OG dioxygenase domain-containing protein (e value: 6.1e-123; %identity: 31.74)
Query:  FTWHGNRRG-IQVWERLDRFPCNSELDDLFNSMGVSNLDWYHSDHRPIEVRMENCSFKKSGRRFKSFKFEEYWTNYEECANIIAR---------------
        FTW  NRRG    W RLDRF   +E    F+S  V +++   SDH+PI +        +   R K F+FE+ W  + +C  ++ +               
Subjt:  FTWHGNRRG-IQVWERLDRFPCNSELDDLFNSMGVSNLDWYHSDHRPIEVRMENCSFKKSGRRFKSFKFEEYWTNYEECANIIAR---------------

Query:  ------------------------------------------NGDRSGYPSSLTSLDSNGEWSEDDSIIENMFISDFENLFSSSHLEPSSIAKALDGLQP
                                                     R    S  T +  +GE       I   F   ++ LF+++ LE   +   LDG+QP
Subjt:  ------------------------------------------NGDRSGYPSSLTSLDSNGEWSEDDSIIENMFISDFENLFSSSHLEPSSIAKALDGLQP

Query:  KVDQIMNNRLTAPFSKSEVEYAVKQMFPTKAPTKMGR----LISENILIGLESINAIKNSKYT-DIDMAALKVDLSKAYDWVEWSFLKEIMLKLGFDHRW
         V Q MN  L + F++ EV  A+KQM P KAP   G       S   ++G +  +A+    ++  +   ALK+D+SKAYD VEW FLK++M+++GF  RW
Subjt:  KVDQIMNNRLTAPFSKSEVEYAVKQMFPTKAPTKMGR----LISENILIGLESINAIKNSKYT-DIDMAALKVDLSKAYDWVEWSFLKEIMLKLGFDHRW

Query:  VEFIMKCISSAEFSVLINGDPKGKFSSSRGLRQGDPLSPYMFLLVVEGLSHLISMANRLGHLTRLS-CSNGPLVSHLLFADDSLIFCRAKELELVNLKNL
        +  IM+CIS+  +S+LING+P G  + +RGLRQGDP+SPY+FLL  EGL+ LI  A+  G +  +S C  GP +++L FADDSL+FCRA   E   ++ +
Subjt:  VEFIMKCISSAEFSVLINGDPKGKFSSSRGLRQGDPLSPYMFLLVVEGLSHLISMANRLGHLTRLS-CSNGPLVSHLLFADDSLIFCRAKELELVNLKNL

Query:  LKTYELASGECINYSKSAILFSK-VNNDRQRFLSSILGVNTVTDFGKYLGVPSVLSRHKSKDLGYLMDKVWKSVQGWKTSLFSIAGKETLIKSIGQAIPT
        L  YE ASG+ +N +K+ + FSK      Q  L  ILGV ++  + KYLG+PS++ + K      + ++VW  V+GWK  L S AG+E LIK++ QAIPT
Subjt:  LKTYELASGECINYSKSAILFSK-VNNDRQRFLSSILGVNTVTDFGKYLGVPSVLSRHKSKDLGYLMDKVWKSVQGWKTSLFSIAGKETLIKSIGQAIPT

Query:  YVMSVFKYPKSICDEIAKSFARFWWGSNNNKRKMHWCKWEKLCLPKSLGGLNFRDIEGFNQALIAKQVWCIFSRPDSLLSRFFKSIYFSSSKILSAGLGS
        Y M+ FK P ++C EI     RFWWG N   RK+HW KWEKLC PK  GGL FRD++ FN AL+AKQ W +    +SLL + F + +F    I+ A   +
Subjt:  YVMSVFKYPKSICDEIAKSFARFWWGSNNNKRKMHWCKWEKLCLPKSLGGLNFRDIEGFNQALIAKQVWCIFSRPDSLLSRFFKSIYFSSSKILSAGLGS

Query:  NPTYLWRSLLWGRNLLIKGLRNRVGNGLSV-------------AAFISPSGD-----------------WNLERLKMAVSRDDLETIRRVPIN-GILEDK
          ++ WRS+L  + L+  G+  RVG+G+ +                +SP  D                 WN+ +++      D E I ++P++  + EDK
Subjt:  NPTYLWRSLLWGRNLLIKGLRNRVGNGLSV-------------AAFISPSGD-----------------WNLERLKMAVSRDDLETIRRVPIN-GILEDK

Query:  IVWHYDGTENYMVKSGYKLFRNIKIDGISSGSSSMHQE---IWRRI---------------------------FNRVFL---------------------
        I W       Y V+SGYKL   +K   +    SS H +   +W+RI                           F R  L                     
Subjt:  IVWHYDGTENYMVKSGYKLFRNIKIDGISSGSSSMHQE---IWRRI---------------------------FNRVFL---------------------

Query:  ----DEEFNG-------------SFVDRWLKIDSNSSLAEMELVATTCWSIWNDRNRLIHGDQIPDVNFKCQW--ISNYLEKYLQANLKNKSSINLDPQK
             + +NG             SF D    +    S   +E +A TCW IWN RN   H    P   +   W    + L +YL  N + K+     PQ 
Subjt:  ----DEEFNG-------------SFVDRWLKIDSNSSLAEMELVATTCWSIWNDRNRLIHGDQIPDVNFKCQW--ISNYLEKYLQANLKNKSSINLDPQK

Query:  VRSQIPDRRNKWMPPPENCWKTNVDDEGMYETHVEGIRI
                  +W  P  + +K N D     +++  GI +
Subjt:  VRSQIPDRRNKWMPPPENCWKTNVDDEGMYETHVEGIRI

A0A7N2LPF9 Uncharacterized protein (e value: 7.9e-123; %identity: 33.64)
Query:  GEIFTWHGNRR-GIQVWERLDRFPCNSELDDLFNSMGVSNLDWYHSDHRPIEVRMENCSFKKSGRRFKS-FKFEEYWTNYEECANIIA------------
        G  FTW   RR G Q+ ERLDR    S+   LF +  + +     SDH P+ +++ +   KK  ++FK  F+FE  W   E C +I+             
Subjt:  GEIFTWHGNRR-GIQVWERLDRFPCNSELDDLFNSMGVSNLDWYHSDHRPIEVRMENCSFKKSGRRFKS-FKFEEYWTNYEECANIIA------------

Query:  --------------RNGDR----------SGYPSSLTS--LDSNGEWSEDDSIIENMFISDFENLFSSSHLEPSSIAKALDGLQPKVDQIMNNRLTAPFS
                        GDR          S +  +L     D++  W  D   +E +FI  + +LF+SS   PS  A+ ++ +QPKV Q MN  L   F 
Subjt:  --------------RNGDR----------SGYPSSLTS--LDSNGEWSEDDSIIENMFISDFENLFSSSHLEPSSIAKALDGLQPKVDQIMNNRLTAPFS

Query:  KSEVEYAVKQMFPTKAPTKMGRLISENILIGLESINAIKNSKYTDIDMAALKVDLSKAYDWVEWSFLKEIMLKLGFDHRWVEFIMKCISSAEFSVLINGD
         SEV  A+KQ+  +++    GRLI +N+L+  E+++ I   K   +   ALK+D+SKAYD VEW+ L +IM KLGF+ +W   +M+CISS  ++V ING 
Subjt:  KSEVEYAVKQMFPTKAPTKMGRLISENILIGLESINAIKNSKYTDIDMAALKVDLSKAYDWVEWSFLKEIMLKLGFDHRWVEFIMKCISSAEFSVLINGD

Query:  PKGKFSSSRGLRQGDPLSPYMFLLVVEGLSHLISMANRLGHLTRLSCS-NGPLVSHLLFADDSLIFCRAKELELVNLKNLLKTYELASGECINYSKSAIL
        P G  + +RGLRQGDPLSPY+FLL  E LS LI  A   G L  +S S  GP +SHL FADDSLIFC+A   E   L+ +L TYE ASG+ +N SK+++ 
Subjt:  PKGKFSSSRGLRQGDPLSPYMFLLVVEGLSHLISMANRLGHLTRLSCS-NGPLVSHLLFADDSLIFCRAKELELVNLKNLLKTYELASGECINYSKSAIL

Query:  FS-KVNNDRQRFLSSILGVNTVTDFGKYLGVPSVLSRHKSKDLGYLMDKVWKSVQGWKTSLFSIAGKETLIKSIGQAIPTYVMSVFKYPKSICDEIAKSF
        FS     + Q  +    G   +    KYLG+PS++ R+K      + +K+ K + GWK  L S AGKE LIK++ QAIPTY MS FK   S+CDE+    
Subjt:  FS-KVNNDRQRFLSSILGVNTVTDFGKYLGVPSVLSRHKSKDLGYLMDKVWKSVQGWKTSLFSIAGKETLIKSIGQAIPTYVMSVFKYPKSICDEIAKSF

Query:  ARFWWGSNNNKRKMHWCKWEKLCLPKSLGGLNFRDIEGFNQALIAKQVWCIFSRPDSLLSRFFKSIYFSSSKILSAGLGSNPTYLWRSLLWGRNLLIKGL
          FWWG    +RK+ W  WEKLC PK+ GG+ F+ ++ FN A++ KQ W + +  +SL+ R FKS YF + + + A L  NP++ WRS++  ++++ KG 
Subjt:  ARFWWGSNNNKRKMHWCKWEKLCLPKSLGGLNFRDIEGFNQALIAKQVWCIFSRPDSLLSRFFKSIYFSSSKILSAGLGSNPTYLWRSLLWGRNLLIKGL

Query:  RNRVGNGLSVAAF-------------ISP-----------------SGDWNLERLKMAVSRDDLETIRRVPI-NGILEDKIVWHYDGTENYMVKSGYKLF
        R +VGNG S+  +             +SP                  G W ++ +       + + IR + +   + EDK VW       + V+S YKL 
Subjt:  RNRVGNGLSVAAF-------------ISP-----------------SGDWNLERLKMAVSRDDLETIRRVPI-NGILEDKIVWHYDGTENYMVKSGYKLF

Query:  RNIKID-GISSGSSSMH-QEIWRRIFN---------------RVFLDEEFN------------GSFVDR--WLKIDSNSSLAEMELVATTCWSIWNDRNR
          ++ D  + SGS   H +  WR I++               R  L  + N            GSF+D   ++ + +    +++E +    W++W++RN 
Subjt:  RNIKID-GISSGSSSMH-QEIWRRIFN---------------RVFLDEEFN------------GSFVDR--WLKIDSNSSLAEMELVATTCWSIWNDRNR

Query:  LIHGDQIPDVNFKCQWISNYLEKYLQANLKNKSSINLDPQKVRSQIPDRRNKWMPPPENCWKTNVDDEGMYETHVEGIRI
             +       CQ +     +YL A       +    Q      P    KW PP  + +K NVD     E  + G+ I
Subjt:  LIHGDQIPDVNFKCQWISNYLEKYLQANLKNKSSINLDPQKVRSQIPDRRNKWMPPPENCWKTNVDDEGMYETHVEGIRI

A0A803QAN3 Uncharacterized protein (e value: 2.2e-120; %identity: 29.3)
Query:  TGEIFTWHGNRRGIQ-VWERLDRFPCNSELDDLFNSMGVSNLDWYHSDHRPI--EVRMENCSFKKSGRRFKSF------KFEEYW-----TNYEECAN--
        TG+ FTW   R  +  + ERLD    N   +  F  +  S+LD+Y SDHR I  +V + N +  K  +  ++       + EEYW      ++ +C +  
Subjt:  TGEIFTWHGNRRGIQ-VWERLDRFPCNSELDDLFNSMGVSNLDWYHSDHRPI--EVRMENCSFKKSGRRFKSF------KFEEYW-----TNYEECAN--

Query:  ---IIARNGDRSGYPSSLTSLDSNGEWSEDDSIIENMFISDFENLFSSSHLEPSSIAKALDGLQPKVDQIMNNRLTAPFSKSEVEYAVKQMFPTKAPTKM
             A    R    +  + ++  G        +  +  S + +LF++  ++P S+   L+ +   +   MN  LTAPF+  EV  A+K M P K+P   
Subjt:  ---IIARNGDRSGYPSSLTSLDSNGEWSEDDSIIENMFISDFENLFSSSHLEPSSIAKALDGLQPKVDQIMNNRLTAPFSKSEVEYAVKQMFPTKAPTKM

Query:  G---------------------------------------------------------------------------------------------RLISEN
        G                                                                                             RLI++N
Subjt:  G---------------------------------------------------------------------------------------------RLISEN

Query:  ILIGLESINAIKNSKYTDIDMAALKVDLSKAYDWVEWSFLKEIMLKLGFDHRWVEFIMKCISSAEFSVLINGDPKGKFSSSRGLRQGDPLSPYMFLLVVE
        IL+  E I+ +++        +ALK+D+SKA+D VEW +L+ +MLK+GF  +WV  IM CI ++ FS  +NG+  G    SRGLRQGDPLSPY+FL+  E
Subjt:  ILIGLESINAIKNSKYTDIDMAALKVDLSKAYDWVEWSFLKEIMLKLGFDHRWVEFIMKCISSAEFSVLINGDPKGKFSSSRGLRQGDPLSPYMFLLVVE

Query:  GLSHLISMANRLGHLTRLSCS-NGPLVSHLLFADDSLIFCRAKELELVNLKNLLKTYELASGECINYSKSAILFS-KVNNDRQRFLSSILGVNTVTDFGK
        GLS L+     +G+L  L  + + P VSHLLFADDSL+FCRA       ++  L TY  ASG+ +N +KS + FS    +D + F +  L +       +
Subjt:  GLSHLISMANRLGHLTRLSCS-NGPLVSHLLFADDSLIFCRAKELELVNLKNLLKTYELASGECINYSKSAILFS-KVNNDRQRFLSSILGVNTVTDFGK

Query:  YLGVPSVLSRHKSKDLGYLMDKVWKSVQGWKTSLFSIAGKETLIKSIGQAIPTYVMSVFKYPKSICDEIAKSFARFWWGSNNNKRKMHWCKWEKLCLPKS
        YLG+PS   R K +   ++ ++VWK +  W   +FSI GKE L+K++ Q+IPTY MS FK  K  C ++    A FWWG+N N  K+HW +W  LC  K 
Subjt:  YLGVPSVLSRHKSKDLGYLMDKVWKSVQGWKTSLFSIAGKETLIKSIGQAIPTYVMSVFKYPKSICDEIAKSFARFWWGSNNNKRKMHWCKWEKLCLPKS

Query:  LGGLNFRDIEGFNQALIAKQVWCIFSRPDSLLSRFFKSIYFSSSKILSAGLGSNPTYLWRSLLWGRNLLIKGLRNRVGNGLSVA----------------
         GG+ FR    FNQAL+AKQ W IF  P+SLLSR  K  YFS++    A LG +P+Y W+S+ WGR+LL++G+R ++G G+++A                
Subjt:  LGGLNFRDIEGFNQALIAKQVWCIFSRPDSLLSRFFKSIYFSSSKILSAGLGSNPTYLWRSLLWGRNLLIKGLRNRVGNGLSVA----------------

Query:  -----------AFISPSGDWNLERLKMAVSRDDLETIRRVPINGI-LEDKIVWHYDGTENYMVKSGYKLFRNIKIDGISSGSSSMHQEIWRRIFN-----
                    FI+   +WN++ L       D++ I  +P++    +D+++WH+  +  Y VKSG+ L  +++   ISS +S  H++ WR  +N     
Subjt:  -----------AFISPSGDWNLERLKMAVSRDDLETIRRVPINGI-LEDKIVWHYDGTENYMVKSGYKLFRNIKIDGISSGSSSMHQEIWRRIFN-----

Query:  --RVF-------------------------------------------------------------LDEEFNGSFVDRWLKIDSNSSLAEMELVATTCWS
          R+F                                                                 FNG ++   L + S  +  + EL+    W 
Subjt:  --RVF-------------------------------------------------------------LDEEFNGSFVDRWLKIDSNSSLAEMELVATTCWS

Query:  IWNDRNRLIHGDQIPDVNFKCQWISNYLEKYLQANLKNKSSINLDPQKVRSQIPDRRNK---WMPPPENCWKTNVD
        IW+DRNRL HG Q    +    +++N+ + +++AN       + +  +  +   +   K   W PP  N +K NVD
Subjt:  IWNDRNRLIHGDQIPDVNFKCQWISNYLEKYLQANLKNKSSINLDPQKVRSQIPDRRNK---WMPPPENCWKTNVD

A0A803QJV0 Uncharacterized protein (e value: 4.3e-121; %identity: 30.58)
Query:  FTWHGNRRGIQVWERLDRFPCNSELDDLFNSMGVSNLDWYHSDHRPIEVRMENCSFKKSGRRFK---SFKFEEYWTNYEECANIIA---RNGDRSGYPSS
        FTW       QV ERLDR  CN E    F    +  LDW+ SDHR + V +      + G R K    F FEE W + +EC  II    R+ D SG  +S
Subjt:  FTWHGNRRGIQVWERLDRFPCNSELDDLFNSMGVSNLDWYHSDHRPIEVRMENCSFKKSGRRFK---SFKFEEYWTNYEECANIIA---RNGDRSGYPSS

Query:  LTSLDSNGEWSEDDSIIENMFISDFENLFS--SSHLEPSSIAK-ALDGLQPKVDQIMNNRLTAPFSKSEVEYAVKQMFPTKAPTK---------------
                         + + +     L++    +   SSI K  L  +QPKV   MN  L  PF + EV  A+K+M PTKAP +               
Subjt:  LTSLDSNGEWSEDDSIIENMFISDFENLFS--SSHLEPSSIAK-ALDGLQPKVDQIMNNRLTAPFSKSEVEYAVKQMFPTKAPTK---------------

Query:  ------------------------------------------------------------------------------MGRLISENILIGLESINAIKNS
                                                                                       GRLI +N ++G ES++ ++ +
Subjt:  ------------------------------------------------------------------------------MGRLISENILIGLESINAIKNS

Query:  KYTDIDMAALKVDLSKAYDWVEWSFLKEIMLKLGFDHRWVEFIMKCISSAEFSVLINGDPKGKFSSSRGLRQGDPLSPYMFLLVVEGLSHLISMANRLGH
        ++ +    AL++D++KAYD VEW FL+E+ML+LG+  RWV  IM C++S  FS LING+ +GK    RG+RQGDPL P++FL   E  S L+        
Subjt:  KYTDIDMAALKVDLSKAYDWVEWSFLKEIMLKLGFDHRWVEFIMKCISSAEFSVLINGDPKGKFSSSRGLRQGDPLSPYMFLLVVEGLSHLISMANRLGH

Query:  LTRLSCS-NGPLVSHLLFADDSLIFCRAKELELVNLKNLLKTYELASGECINYSKSAILFSK-VNNDRQRFLSSILGVNTVTDFGKYLGVPSVLSRHKSK
        L  +     G  VSHL FADDS+IF  A          LL  Y  ASG+ +N+ KS + F K V    +  L+  LGV  V + GKYLG+ S++ R+K +
Subjt:  LTRLSCS-NGPLVSHLLFADDSLIFCRAKELELVNLKNLLKTYELASGECINYSKSAILFSK-VNNDRQRFLSSILGVNTVTDFGKYLGVPSVLSRHKSK

Query:  DLGYLMDKVWKSVQGWKTSLFSIAGKETLIKSIGQAIPTYVMSVFKYPKSICDEIAKSFARFWWGSNNNKRKMHWCKWEKLCLPKSLGGLNFRDIEGFNQ
            + ++VW S++GWK  +FS+ G E LIK+I QAIP Y MS ++  KS    I +  ARFWWGS   K+K+HWCKW+ LC PK  GGL FRD+E FNQ
Subjt:  DLGYLMDKVWKSVQGWKTSLFSIAGKETLIKSIGQAIPTYVMSVFKYPKSICDEIAKSFARFWWGSNNNKRKMHWCKWEKLCLPKSLGGLNFRDIEGFNQ

Query:  ALIAKQVWCIFSRPDSLLSRFFKSIYFSSSKILSAGLGSNPTYLWRSLLWGRNLLIKGLRNRVGNG----------------------------LSVAAF
        AL+AKQ+W    +P SL ++  K+ YF +  +L+A  G++ +++WRSL+WG+ +++KG R RVGNG                            L V   
Subjt:  ALIAKQVWCIFSRPDSLLSRFFKSIYFSSSKILSAGLGSNPTYLWRSLLWGRNLLIKGLRNRVGNG----------------------------LSVAAF

Query:  ISPSGDWNLERLKMAVSRDDLETIRRV-PINGILEDKIVWHYDGTENYMVKSGYKLFRNIKIDGISSGSSSMHQEIWRRIFNRVFLD--EEFNGSFVDRW
          PSG W+   ++   + +D E I R+ P++  LEDK++WHY     Y V+SGY++   ++    +S    M ++ W +++        + F     + W
Subjt:  ISPSGDWNLERLKMAVSRDDLETIRRV-PINGILEDKIVWHYDGTENYMVKSGYKLFRNIKIDGISSGSSSMHQEIWRRIFNRVFLD--EEFNGSFVDRW

Query:  LKIDSNSSLAEM----------------------------------------------------------------ELVATTCWSIWNDRNRLIHGDQIP
        L   SN  + ++                                                                E     CW +W  RN   HG ++P
Subjt:  LKIDSNSSLAEM----------------------------------------------------------------ELVATTCWSIWNDRNRLIHGDQIP

Query:  DVNFKCQWISNYLEKYLQANLKNKSSINLDPQKVRSQIPDRRNKWMPPPENCWKTNVD
               W + Y+ +Y Q  L+         Q+ R Q    +++W+PP +   K NVD
Subjt:  DVNFKCQWISNYLEKYLQANLKNKSSINLDPQKVRSQIPDRRNKWMPPPENCWKTNVD

SwissProt top hits (e value; %identity; alignment)
O00370 LINE-1 retrotransposable element ORF2 protein (e value: 7.9e-19; %identity: 25.38)
Query:  ESINAIKN-SKYTDIDMAALKVDLSKAYDWVEWSFLKEIMLKLGFDHRWVEFIMKCISSAEFSVLINGDPKGKFSSSRGLRQGDPLSPYMFLLVVEGLSH
        +SIN I++ ++  D +   + +D  KA+D ++  F+ + + KLG D  +++ I         ++++NG     F    G RQG PLSP +F +V+E L+ 
Subjt:  ESINAIKN-SKYTDIDMAALKVDLSKAYDWVEWSFLKEIMLKLGFDHRWVEFIMKCISSAEFSVLINGDPKGKFSSSRGLRQGDPLSPYMFLLVVEGLSH

Query:  LISMANRLGHLTRLSCSNGPLVSHLLFADDSLIFCRAKELELVNLKNLLKTYELASGECINYSKS-AILFSKVNNDRQRFLSSILGVNTVTDFGKYLGVP
         I     +  +          V   LFADD +++     +   NL  L+  +   SG  IN  KS A L+   NN+RQ   S I+G    T   K +   
Subjt:  LISMANRLGHLTRLSCSNGPLVSHLLFADDSLIFCRAKELELVNLKNLLKTYELASGECINYSKS-AILFSKVNNDRQRFLSSILGVNTVTDFGKYLGVP

Query:  SVLSRHKSKDL---GY--LMDKVWKSVQGWKTSLFSIAGKETLIKS--IGQAIPTYVMSVFKYPKSICDEIAKSFARFWWGSNNNKRKMHWCKWEKLCLP
         +      KDL    Y  L+ ++ +    WK    S  G+  ++K   + + I  +     K P +   E+ K+  +F W    N+++    K   L   
Subjt:  SVLSRHKSKDL---GY--LMDKVWKSVQGWKTSLFSIAGKETLIKS--IGQAIPTYVMSVFKYPKSICDEIAKSFARFWWGSNNNKRKMHWCKWEKLCLP

Query:  KSLGGLNFRDIEGFNQALIAKQVWCIFSRPD
           GG+   D + + +A + K  W  +   D
Subjt:  KSLGGLNFRDIEGFNQALIAKQVWCIFSRPD

P08548 LINE-1 reverse transcriptase homolog (e value: 1.7e-18; %identity: 24.92)
Query:  ESINAIKN-SKYTDIDMAALKVDLSKAYDWVEWSFLKEIMLKLGFDHRWVEFIMKCISSAEFSVLINGDPKGKFSSSRGLRQGDPLSPYMFLLVVEGLSH
        +SIN I++ +K  + D   L +D  KA+D ++  F+   + K+G +  +++ I    S    ++++NG     F    G RQG PLSP +F +V+E    
Subjt:  ESINAIKN-SKYTDIDMAALKVDLSKAYDWVEWSFLKEIMLKLGFDHRWVEFIMKCISSAEFSVLINGDPKGKFSSSRGLRQGDPLSPYMFLLVVEGLSH

Query:  LISMANRLGHLTRLSCSNGPLVSHLLFADDSLIFCRAKELELVNLKNLLKTYELASGECINYSKSAILFSKVNNDRQRFLSSILGVNTVTDFGKYLGV--
        ++++A R     +        +   LFADD +++          L  ++K Y   SG  IN  KS       NN  ++ +   +    V    KYLGV  
Subjt:  LISMANRLGHLTRLSCSNGPLVSHLLFADDSLIFCRAKELELVNLKNLLKTYELASGECINYSKSAILFSKVNNDRQRFLSSILGVNTVTDFGKYLGV--

Query:  -PSVLSRHKSKDLGYLMDKVWKSVQGWKTSLFSIAGKETLIKS--IGQAIPTYVMSVFKYPKSICDEIAKSFARFWWGSNNNKRKMHWCKWEKLCLPKSL
           V   +K ++   L  ++ + V  WK    S  G+  ++K   + +AI  +     K P S   ++ K    F W    N++K    K   L      
Subjt:  -PSVLSRHKSKDLGYLMDKVWKSVQGWKTSLFSIAGKETLIKS--IGQAIPTYVMSVFKYPKSICDEIAKSFARFWWGSNNNKRKMHWCKWEKLCLPKSL

Query:  GGLNFRDIEGFNQALIAKQVW
        GG+   D+  + ++++ K  W
Subjt:  GGLNFRDIEGFNQALIAKQVW

P0C2F6 Putative ribonuclease H protein At1g65750 (e value: 6.6e-18; %identity: 30.98)
Query:  VLSRHKSKD-LGYLMDKVWKSVQGWKTSLFSIAGKETLIKSIGQAIPTYVMSVFKYPKSICDEIAKSFARFWWGSNNNKRKMHWCKWEKLCLPKSLGGLN
        VL +  +KD  G ++++V   + GW+    S AG+ TL K++  ++P + MS    P+SI + + +    F WGS   K+K H  KW K+C PK  GGL 
Subjt:  VLSRHKSKD-LGYLMDKVWKSVQGWKTSLFSIAGKETLIKSIGQAIPTYVMSVFKYPKSICDEIAKSFARFWWGSNNNKRKMHWCKWEKLCLPKSLGGLN

Query:  FRDIEGFNQALIAKQVWCIFSRPDSLLSRFFKSIYF-----SSSKILSAGLGSNPTYLWRSLLWG-RNLLIKGLRNRVGNGLSV
         R  +  N+ALI+K  W +    +SL +   +  Y       S  ++  G  S+    WRS+  G R+++  G+    G+G  +
Subjt:  FRDIEGFNQALIAKQVWCIFSRPDSLLSRFFKSIYF-----SSSKILSAGLGSNPTYLWRSLLWG-RNLLIKGLRNRVGNGLSV

P11369 LINE-1 retrotransposable element ORF2 protein (e value: 4.3e-17; %identity: 24.23)
Query:  ESINAIKN-SKYTDIDMAALKVDLSKAYDWVEWSFLKEIMLKLGFDHRWVEFIMKCISSAEFSVLINGDPKGKFSSSRGLRQGDPLSPYMFLLVVEGLSH
        +SIN I   +K  D +   + +D  KA+D ++  F+ +++ + G    ++  I    S    ++ +NG+         G RQG PLSPY+F +V+E L+ 
Subjt:  ESINAIKN-SKYTDIDMAALKVDLSKAYDWVEWSFLKEIMLKLGFDHRWVEFIMKCISSAEFSVLINGDPKGKFSSSRGLRQGDPLSPYMFLLVVEGLSH

Query:  LISMANRLGHLTRLSCSNGPLVSHLLFADDSLIFCRAKELELVNLKNLLKTYELASGECINYSKSAILFSKVNNDRQRFLSSILGVNTVTDFGKYLGVPS
         I     +  +          V   L ADD +++    +     L NL+ ++    G  IN +KS       N   ++ +      + VT+  KYLGV  
Subjt:  LISMANRLGHLTRLSCSNGPLVSHLLFADDSLIFCRAKELELVNLKNLLKTYELASGECINYSKSAILFSKVNNDRQRFLSSILGVNTVTDFGKYLGVPS

Query:  VLSRHKSKDLGYLMDKVWKS--------VQGWKTSLFSIAGKETLIKS--IGQAIPTYVMSVFKYPKSICDEIAKSFARFWWGSNNNKRKMHWCKWEKLC
              +K++  L DK +KS        ++ WK    S  G+  ++K   + +AI  +     K P    +E+  +  +F W  NN K ++       L 
Subjt:  VLSRHKSKDLGYLMDKVWKS--------VQGWKTSLFSIAGKETLIKS--IGQAIPTYVMSVFKYPKSICDEIAKSFARFWWGSNNNKRKMHWCKWEKLC

Query:  LPKSLGGLNFRDIEGFNQALIAKQVW
          ++ GG+   D++ + +A++ K  W
Subjt:  LPKSLGGLNFRDIEGFNQALIAKQVW

P93295 Uncharacterized mitochondrial protein AtMg00310 | e value: 4.4e-30 | %identity: 43.17
Query:  AIPTYVMSVFKYPKSICDEIAKSFARFWWGSNNNKRKMHWCKWEKLCLPK-SLGGLNFRDIEGFNQALIAKQVWCIFSRPDSLLSRFFKSIYFSSSKILS
        A+P Y MS F+  K +C ++  +   FWW S  NKRK+ W  W+KLC  K   GGL FRD+  FNQAL+AKQ + I  +P +LLSR  +S YF  S ++ 
Subjt:  AIPTYVMSVFKYPKSICDEIAKSFARFWWGSNNNKRKMHWCKWEKLCLPK-SLGGLNFRDIEGFNQALIAKQVWCIFSRPDSLLSRFFKSIYFSSSKILS

Query:  AGLGSNPTYLWRSLLWGRNLLIKGLRNRVGNGLSVAAFI
          +G+ P+Y WRS++ GR LL +GL   +G+G+    ++
Subjt:  AGLGSNPTYLWRSLLWGRNLLIKGLRNRVGNGLSVAAFI

Arabidopsis top hits | e value | %identity | Alignment
AT3G09510.1 Ribonuclease H-like superfamily protein | e value: 9.2e-07 | %identity: 26.54
Query:  KSIYFSSSKILSAGLGSNPTYLWRSLLWGRNLLIKGLRNRVGNGLSVAAFISPSGD------------------------------WNLERLKMAVSRDD
        K+ YF    IL A +    +Y W SLL G  LL KG R+ +G+G ++   +    D                              W+  ++   V + D
Subjt:  KSIYFSSSKILSAGLGSNPTYLWRSLLWGRNLLIKGLRNRVGNGLSVAAFISPSGD------------------------------WNLERLKMAVSRDD

Query:  LETIRRVPI-NGILEDKIVWHYDGTENYMVKSGYKLF-----RNIKIDGISSGSSSMHQEIW
           I R+ +      DKI+W+Y+ T  Y V+SGY L       NI       GS  +   IW
Subjt:  LETIRRVPI-NGILEDKIVWHYDGTENYMVKSGYKLF-----RNIKIDGISSGSSSMHQEIW

AT4G20520.1 RNA binding; RNA-directed DNA polymerases | e value: 1.5e-04 | %identity: 29.73
Query:  PTKAPTKMGRLISENILIGLESINAIKNSKYTDIDMAALKVDLSKAYDWVEWSFLKEIMLKLGFDHRWVEFIMK
        P +A    GR+ ++NI+   E++++++  K        LK+DL KAYD + W +L++ ++  GF   W+  I +
Subjt:  PTKAPTKMGRLISENILIGLESINAIKNSKYTDIDMAALKVDLSKAYDWVEWSFLKEIMLKLGFDHRWVEFIMK

AT4G29090.1 Ribonuclease H-like superfamily protein | e value: 5.2e-34 | %identity: 32.43
Query:  AIPTYVMSVFKYPKSICDEIAKSFARFWWGSNNNKRKMHWCKWEKLCLPKSLGGLNFRDIEGFNQALIAKQVWCIFSRPDSLLSRFFKSIYFSSSKILSA
        A+PTY M+ F  PK++C +I    A FWW +    + MHW  W+ L   K+ GG+ F+DIE FN AL+ KQ+W + SRP+SL+++ FKS YF  S  L+A
Subjt:  AIPTYVMSVFKYPKSICDEIAKSFARFWWGSNNNKRKMHWCKWEKLCLPKSLGGLNFRDIEGFNQALIAKQVWCIFSRPDSLLSRFFKSIYFSSSKILSA

Query:  GLGSNPTYLWRSLLWGRNLLIKGLRNRVGNG------------------------------------LSVAAFISPSG-DWNLERLKMAVSRDDLETIRR
         LGS P+++W+S+   + +L +G R  VGNG                                    L V+  I  SG +W  + ++M     + + I  
Subjt:  GLGSNPTYLWRSLLWGRNLLIKGLRNRVGNG------------------------------------LSVAAFISPSG-DWNLERLKMAVSRDDLETIRR

Query:  V-PINGILEDKIVWHYDGTENYMVKSGYKLFRNIKIDGISS-------GSSSMHQEIWR
        + P    + D   W Y  + +Y VKSGY +   I I+  SS         + ++Q+IW+
Subjt:  V-PINGILEDKIVWHYDGTENYMVKSGYKLFRNIKIDGISS-------GSSSMHQEIWR

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | e value: 3.2e-31 | %identity: 43.17
Query:  AIPTYVMSVFKYPKSICDEIAKSFARFWWGSNNNKRKMHWCKWEKLCLPK-SLGGLNFRDIEGFNQALIAKQVWCIFSRPDSLLSRFFKSIYFSSSKILS
        A+P Y MS F+  K +C ++  +   FWW S  NKRK+ W  W+KLC  K   GGL FRD+  FNQAL+AKQ + I  +P +LLSR  +S YF  S ++ 
Subjt:  AIPTYVMSVFKYPKSICDEIAKSFARFWWGSNNNKRKMHWCKWEKLCLPK-SLGGLNFRDIEGFNQALIAKQVWCIFSRPDSLLSRFFKSIYFSSSKILS

Query:  AGLGSNPTYLWRSLLWGRNLLIKGLRNRVGNGLSVAAFI
          +G+ P+Y WRS++ GR LL +GL   +G+G+    ++
Subjt:  AGLGSNPTYLWRSLLWGRNLLIKGLRNRVGNGLSVAAFI

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase) | e value: 1.1e-12 | %identity: 54.41
Query:  LINGDPKGKFSSSRGLRQGDPLSPYMFLLVVEGLSHLISMANRLGHLTRLSCS-NGPLVSHLLFADDS
        +ING P+G  + SRGLRQGDPLSPY+F+L  E LS L   A   G L  +  S N P ++HLLFADD+
Subjt:  LINGDPKGKFSSSRGLRQGDPLSPYMFLLVVEGLSHLISMANRLGHLTRLSCS-NGPLVSHLLFADDS


Sequences
CDS sequence
ATGAAACCGACTGGAGAGATTTTCACTTGGCATGGAAACAGAAGGGGCATCCAAGTTTGGGAGAGGCTAGACCGGTTTCCGTGCAATTCTGAGTTGGATGATCTTTTTAA
TTCAATGGGAGTGTCAAATCTTGACTGGTATCACTCAGATCACAGACCTATAGAAGTTAGAATGGAGAACTGCTCTTTCAAAAAATCTGGAAGAAGGTTCAAATCCTTTA
AATTTGAGGAATATTGGACTAACTATGAGGAATGTGCCAATATCATTGCCAGGAACGGAGACCGGTCAGGTTATCCTTCTTCCCTTACTTCTTTAGATTCTAATGGGGAG
TGGTCGGAGGATGATTCCATTATAGAAAACATGTTCATCTCCGACTTTGAAAACCTATTTTCCTCATCTCATCTTGAACCGAGTTCCATAGCCAAAGCCTTAGATGGTTT
GCAACCAAAAGTAGATCAGATTATGAATAATAGACTTACAGCCCCCTTTTCTAAAAGTGAGGTAGAATATGCAGTAAAACAGATGTTTCCCACAAAAGCTCCAACCAAGA
TGGGTAGGCTTATCTCGGAAAATATTTTAATTGGTCTTGAAAGTATAAATGCAATCAAGAATAGTAAATATACTGATATAGATATGGCTGCCTTAAAAGTTGACCTTAGC
AAAGCTTATGACTGGGTAGAATGGTCTTTTCTTAAGGAAATTATGCTTAAATTGGGCTTTGACCATAGATGGGTTGAGTTTATTATGAAATGCATATCTTCGGCTGAGTT
TTCTGTTTTAATTAATGGTGATCCAAAAGGTAAATTTTCTTCTAGTAGAGGTCTCCGCCAAGGGGATCCACTCTCCCCATACATGTTCCTTTTAGTGGTAGAAGGATTAT
CACATTTAATATCCATGGCGAATAGATTAGGGCATCTTACAAGGTTATCATGCTCGAATGGTCCTTTAGTTTCTCATCTTTTATTTGCTGATGATAGTCTCATCTTTTGT
AGGGCTAAGGAGTTAGAACTGGTAAACCTTAAAAACCTACTGAAAACCTATGAATTAGCCTCAGGAGAATGTATCAACTACTCCAAATCTGCCATTCTTTTTTCCAAAGT
CAATAATGATAGACAAAGGTTCCTAAGTAGTATTCTGGGAGTAAATACAGTAACTGATTTTGGCAAATATCTAGGAGTACCATCGGTTCTATCTAGGCATAAATCAAAGG
ACCTTGGATACCTTATGGACAAGGTTTGGAAATCAGTTCAAGGCTGGAAAACTTCTTTATTTTCTATTGCAGGGAAAGAAACCTTAATTAAAAGTATAGGTCAAGCAATT
CCAACCTATGTCATGAGTGTTTTTAAATACCCTAAGTCTATTTGTGATGAGATAGCCAAAAGTTTTGCTAGATTTTGGTGGGGTTCAAATAATAACAAGAGGAAAATGCA
TTGGTGCAAATGGGAGAAGCTTTGTTTGCCTAAAAGTTTAGGGGGTCTAAATTTTAGGGATATTGAAGGTTTTAACCAAGCCTTAATTGCAAAACAAGTGTGGTGTATTT
TCTCAAGGCCTGATTCCCTATTATCTAGATTTTTTAAGAGTATTTACTTTAGTTCCTCGAAAATTCTATCTGCTGGTTTGGGTAGTAATCCAACATATCTCTGGAGAAGT
CTGTTGTGGGGAAGAAACCTGTTAATAAAAGGGTTACGGAATAGAGTAGGGAATGGGCTTTCGGTGGCCGCTTTCATATCTCCTTCTGGGGATTGGAATTTAGAGAGGCT
TAAAATGGCAGTGTCTAGGGATGATCTAGAAACTATTAGAAGAGTTCCAATCAATGGTATATTAGAAGATAAAATAGTGTGGCACTATGATGGAACCGAAAATTACATGG
TCAAAAGTGGCTACAAACTTTTTAGGAACATAAAAATTGATGGAATTTCTTCTGGTTCATCTTCTATGCATCAGGAAATCTGGAGAAGGATTTTCAACCGGGTGTTCCTG
GATGAAGAGTTTAATGGGAGCTTCGTGGATCGCTGGCTGAAAATTGACTCAAATTCGTCTTTGGCAGAGATGGAGTTGGTAGCCACAACTTGCTGGTCAATCTGGAACGA
TAGAAACAGATTGATTCACGGTGATCAAATTCCTGATGTAAATTTCAAATGTCAGTGGATATCCAACTATTTGGAGAAATACTTACAAGCCAATTTGAAGAACAAGTCGA
GCATTAACTTGGATCCCCAAAAAGTACGTTCTCAAATTCCAGATAGAAGAAATAAATGGATGCCTCCCCCTGAGAATTGTTGGAAAACTAATGTTGATGATGAAGGAATG
TACGAGACACATGTGGAAGGAATTCGGATCGAAGCACTCAAAATTCTTTTAATTTTTTTTTTTTTGATAGGAAACGAAAAGAGATATGTATTAATAACCCACAAAAGAGA
ACAACCTAGGGCCGAGGGAGGAGATCGCCCTCGCCCAAGAACTAATACAAAAGAGCTTGTCAATCTTTAA
mRNA sequence
ATGAAACCGACTGGAGAGATTTTCACTTGGCATGGAAACAGAAGGGGCATCCAAGTTTGGGAGAGGCTAGACCGGTTTCCGTGCAATTCTGAGTTGGATGATCTTTTTAA
TTCAATGGGAGTGTCAAATCTTGACTGGTATCACTCAGATCACAGACCTATAGAAGTTAGAATGGAGAACTGCTCTTTCAAAAAATCTGGAAGAAGGTTCAAATCCTTTA
AATTTGAGGAATATTGGACTAACTATGAGGAATGTGCCAATATCATTGCCAGGAACGGAGACCGGTCAGGTTATCCTTCTTCCCTTACTTCTTTAGATTCTAATGGGGAG
TGGTCGGAGGATGATTCCATTATAGAAAACATGTTCATCTCCGACTTTGAAAACCTATTTTCCTCATCTCATCTTGAACCGAGTTCCATAGCCAAAGCCTTAGATGGTTT
GCAACCAAAAGTAGATCAGATTATGAATAATAGACTTACAGCCCCCTTTTCTAAAAGTGAGGTAGAATATGCAGTAAAACAGATGTTTCCCACAAAAGCTCCAACCAAGA
TGGGTAGGCTTATCTCGGAAAATATTTTAATTGGTCTTGAAAGTATAAATGCAATCAAGAATAGTAAATATACTGATATAGATATGGCTGCCTTAAAAGTTGACCTTAGC
AAAGCTTATGACTGGGTAGAATGGTCTTTTCTTAAGGAAATTATGCTTAAATTGGGCTTTGACCATAGATGGGTTGAGTTTATTATGAAATGCATATCTTCGGCTGAGTT
TTCTGTTTTAATTAATGGTGATCCAAAAGGTAAATTTTCTTCTAGTAGAGGTCTCCGCCAAGGGGATCCACTCTCCCCATACATGTTCCTTTTAGTGGTAGAAGGATTAT
CACATTTAATATCCATGGCGAATAGATTAGGGCATCTTACAAGGTTATCATGCTCGAATGGTCCTTTAGTTTCTCATCTTTTATTTGCTGATGATAGTCTCATCTTTTGT
AGGGCTAAGGAGTTAGAACTGGTAAACCTTAAAAACCTACTGAAAACCTATGAATTAGCCTCAGGAGAATGTATCAACTACTCCAAATCTGCCATTCTTTTTTCCAAAGT
CAATAATGATAGACAAAGGTTCCTAAGTAGTATTCTGGGAGTAAATACAGTAACTGATTTTGGCAAATATCTAGGAGTACCATCGGTTCTATCTAGGCATAAATCAAAGG
ACCTTGGATACCTTATGGACAAGGTTTGGAAATCAGTTCAAGGCTGGAAAACTTCTTTATTTTCTATTGCAGGGAAAGAAACCTTAATTAAAAGTATAGGTCAAGCAATT
CCAACCTATGTCATGAGTGTTTTTAAATACCCTAAGTCTATTTGTGATGAGATAGCCAAAAGTTTTGCTAGATTTTGGTGGGGTTCAAATAATAACAAGAGGAAAATGCA
TTGGTGCAAATGGGAGAAGCTTTGTTTGCCTAAAAGTTTAGGGGGTCTAAATTTTAGGGATATTGAAGGTTTTAACCAAGCCTTAATTGCAAAACAAGTGTGGTGTATTT
TCTCAAGGCCTGATTCCCTATTATCTAGATTTTTTAAGAGTATTTACTTTAGTTCCTCGAAAATTCTATCTGCTGGTTTGGGTAGTAATCCAACATATCTCTGGAGAAGT
CTGTTGTGGGGAAGAAACCTGTTAATAAAAGGGTTACGGAATAGAGTAGGGAATGGGCTTTCGGTGGCCGCTTTCATATCTCCTTCTGGGGATTGGAATTTAGAGAGGCT
TAAAATGGCAGTGTCTAGGGATGATCTAGAAACTATTAGAAGAGTTCCAATCAATGGTATATTAGAAGATAAAATAGTGTGGCACTATGATGGAACCGAAAATTACATGG
TCAAAAGTGGCTACAAACTTTTTAGGAACATAAAAATTGATGGAATTTCTTCTGGTTCATCTTCTATGCATCAGGAAATCTGGAGAAGGATTTTCAACCGGGTGTTCCTG
GATGAAGAGTTTAATGGGAGCTTCGTGGATCGCTGGCTGAAAATTGACTCAAATTCGTCTTTGGCAGAGATGGAGTTGGTAGCCACAACTTGCTGGTCAATCTGGAACGA
TAGAAACAGATTGATTCACGGTGATCAAATTCCTGATGTAAATTTCAAATGTCAGTGGATATCCAACTATTTGGAGAAATACTTACAAGCCAATTTGAAGAACAAGTCGA
GCATTAACTTGGATCCCCAAAAAGTACGTTCTCAAATTCCAGATAGAAGAAATAAATGGATGCCTCCCCCTGAGAATTGTTGGAAAACTAATGTTGATGATGAAGGAATG
TACGAGACACATGTGGAAGGAATTCGGATCGAAGCACTCAAAATTCTTTTAATTTTTTTTTTTTTGATAGGAAACGAAAAGAGATATGTATTAATAACCCACAAAAGAGA
ACAACCTAGGGCCGAGGGAGGAGATCGCCCTCGCCCAAGAACTAATACAAAAGAGCTTGTCAATCTTTAA
Protein sequence
MKPTGEIFTWHGNRRGIQVWERLDRFPCNSELDDLFNSMGVSNLDWYHSDHRPIEVRMENCSFKKSGRRFKSFKFEEYWTNYEECANIIARNGDRSGYPSSLTSLDSNGE
WSEDDSIIENMFISDFENLFSSSHLEPSSIAKALDGLQPKVDQIMNNRLTAPFSKSEVEYAVKQMFPTKAPTKMGRLISENILIGLESINAIKNSKYTDIDMAALKVDLS
KAYDWVEWSFLKEIMLKLGFDHRWVEFIMKCISSAEFSVLINGDPKGKFSSSRGLRQGDPLSPYMFLLVVEGLSHLISMANRLGHLTRLSCSNGPLVSHLLFADDSLIFC
RAKELELVNLKNLLKTYELASGECINYSKSAILFSKVNNDRQRFLSSILGVNTVTDFGKYLGVPSVLSRHKSKDLGYLMDKVWKSVQGWKTSLFSIAGKETLIKSIGQAI
PTYVMSVFKYPKSICDEIAKSFARFWWGSNNNKRKMHWCKWEKLCLPKSLGGLNFRDIEGFNQALIAKQVWCIFSRPDSLLSRFFKSIYFSSSKILSAGLGSNPTYLWRS
LLWGRNLLIKGLRNRVGNGLSVAAFISPSGDWNLERLKMAVSRDDLETIRRVPINGILEDKIVWHYDGTENYMVKSGYKLFRNIKIDGISSGSSSMHQEIWRRIFNRVFL
DEEFNGSFVDRWLKIDSNSSLAEMELVATTCWSIWNDRNRLIHGDQIPDVNFKCQWISNYLEKYLQANLKNKSSINLDPQKVRSQIPDRRNKWMPPPENCWKTNVDDEGM
YETHVEGIRIEALKILLIFFFLIGNEKRYVLITHKREQPRAEGGDRPRPRTNTKELVNL
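
The protein sequence above corresponds to the CDS sequence read in frame under the standard genetic code. As a minimal sketch of that relationship (not the database's own tooling), the Python below translates a CDS string into a protein string. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced only for this illustration, and the sketch assumes the standard nuclear code with translation stopping at the first stop codon.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (first base varies slowest). "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are ignored, so the wrapped sequence
    lines from the record can be pasted in directly. Codons containing
    characters other than A/C/G/T are rendered as "X".
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 18 nt of the Lag0036163 CDS listed above.
cds_start = "ATGAAACCGACTGGAGAGATTTTCACTTGG"
cds_end = "GAGCTTGTCAATCTTTAA"
print(translate(cds_start))
print(translate(cds_end))
```

Pasting the full CDS sequence into `translate` and comparing the result with the protein sequence is a quick consistency check on the record.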