; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0008919 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0008919
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr9:32386085..32391598
RNA-Seq ExpressionLag0008919
SyntenyLag0008919
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR001878 - Zinc finger, CCHC-type
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR036397 - Ribonuclease H superfamily
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR043502 - DNA/RNA polymerase superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_012847426.1 PREDICTED: uncharacterized protein LOC105967373 [Erythranthe guttata]1.1e-17932.92Show/hide
Query:  WKEMTDNLDHCIIGKLLSSRIISPIAIKNALKGAWKTRKNFNVETTGRNIFNFKFESQGDRDWVLNNGPWTFDKYLIVLELPTANQRSVEMGFKKANFWI
        W  M  N D  +IG++L+ + I+  A++  +   W       V+  G   F F F+ + DR   +  GPW FDK LIVL+   A +    +     +F++
Subjt:  WKEMTDNLDHCIIGKLLSSRIISPIAIKNALKGAWKTRKNFNVETTGRNIFNFKFESQGDRDWVLNNGPWTFDKYLIVLELPTANQRSVEMGFKKANFWI

Query:  RLINLPTGYRNRKVIKKIGNSIGEFLEVEEGREEIS-WGKSIRIRVKIDITTPLLRGFMLKAPEVQGNCWVTIRYERLPEFCFLCGRIGHIAKEC--ECK
         ++ LP   RNR +   IG+ IG   +V    +++  +G  +R+R  +++  PL R   L+  E      V ++YERLP FC+ CG + HI+  C  +  
Subjt:  RLINLPTGYRNRKVIKKIGNSIGEFLEVEEGREEIS-WGKSIRIRVKIDITTPLLRGFMLKAPEVQGNCWVTIRYERLPEFCFLCGRIGHIAKEC--ECK

Query:  RGEEEKNVDNMEFGNWLRFQSYFRPNKKT-------ESPRKPNSPSMDRGNEFEDELGDKVASENRSRKGKEKVVVDHSPDEDIVESRDKRNEGTSSGTY
           EE+N DN  +G WL+  +   P+K T        S     SP      E     G  +AS + +  GKE  +  H   + +        +   +G +
Subjt:  RGEEEKNVDNMEFGNWLRFQSYFRPNKKT-------ESPRKPNSPSMDRGNEFEDELGDKVASENRSRKGKEKVVVDHSPDEDIVESRDKRNEGTSSGTY

Query:  VLNTI-------------------ASMD------TDLLVIPKSDKDLILNQTDENQ---VDGG-----------EQSIRSSTILKDYSVD----------
         +N I                    SM       T++ + P + +  + N    N    +DG             Q    + I    S+           
Subjt:  VLNTI-------------------ASMD------TDLLVIPKSDKDLILNQTDENQ---VDGG-----------EQSIRSSTILKDYSVD----------

Query:  ---NTNEGNMEEEET---HSGNSKATKRRASWKRKERAGVTGRIQKEEPSNLKRKAVEDNNIQNKKFKSDDSGAMDVSGDDINSRGAKGGICLLWKDNNL
             ++  + ++ T     G+SK + +R   K K++  V   +   E    K++ V  ++   +KF             +  + G  GG+ LLW+ + L
Subjt:  ---NTNEGNMEEEET---HSGNSKATKRRASWKRKERAGVTGRIQKEEPSNLKRKAVEDNNIQNKKFKSDDSGAMDVSGDDINSRGAKGGICLLWKDNNL

Query:  VSINSYSDNHIDSTV--NWEGITWRFTGLYGFPEASKKHLTWDLIRRLDSLLDLLWLIGGDLNEIMEDSEKLGGPRRDRKYLEDFRSVIDDCNLRDLKPN
        VS+N++S NHID+ +  N    TWRFTG YG P  + +H +W+L+R+L  L +  WL  GD N ++ +SEK G      K +++F   + D  L DL   
Subjt:  VSINSYSDNHIDSTV--NWEGITWRFTGLYGFPEASKKHLTWDLIRRLDSLLDLLWLIGGDLNEIMEDSEKLGGPRRDRKYLEDFRSVIDDCNLRDLKPN

Query:  GELFTWHSNRRGVHVW-ERLDRFLCNNSFDSLFNFAGPKNLDWLFSDHRPIEVSLNNHQTTTTSRKGRSFKFEELWTNYEACEELIASHGEWEGSNLSLT
        G  FTW +NR+  H   ERLDR   NN +  LF     ++LD L+SDH P+ +   +        + R FKFE +W   E CE++I  +  W  +    T
Subjt:  GELFTWHSNRRGVHVW-ERLDRFLCNNSFDSLFNFAGPKNLDWLFSDHRPIEVSLNNHQTTTTSRKGRSFKFEELWTNYEACEELIASHGEWEGSNLSLT

Query:  SLPM--DLKNCSIALSKWGRETHNRRKNKIKECKNSLREAYKNLHLIDFDA-IHYIEFELDKLLEEEEIYWKQRSREEWLKWGDKNSKWFHKKASIRKKT
        SL    +L++C + L +W R +    +++I++ K  + +  K +   +  + IH +  ELD+LL++EE+ W+QR++  W++ GDKN+K+FH KAS R++ 
Subjt:  SLPM--DLKNCSIALSKWGRETHNRRKNKIKECKNSLREAYKNLHLIDFDA-IHYIEFELDKLLEEEEIYWKQRSREEWLKWGDKNSKWFHKKASIRKKT

Query:  NEITGIHNSEGTWVDDPKEIEDVFYSYFSNLFTSSQPDPNNINLALNGLNPRVSSEMNNKLLAPFTKEEIELSIKQMFPTKAPGPDGYPAIFYQKYWHLV
        N I G+ NSEG W +   +IE +   YFS++FTS     + +   L+ + PRVS  +N  LL  +T +E++ ++  M P K+PGPDG+P +F+Q++W +V
Subjt:  NEITGIHNSEGTWVDDPKEIEDVFYSYFSNLFTSSQPDPNNINLALNGLNPRVSSEMNNKLLAPFTKEEIELSIKQMFPTKAPGPDGYPAIFYQKYWHLV

Query:  GKKTIQECLEILNGDKSLECWNKTNIVLIPKTTCPKEVSEYRPISLCNVNYKIITKTVANRLKWVLKDIISEAQSAFIQGRLITDNVIIGHECINSIKNS
        G    +  L +LN  +     N T+IVLIPK   P+ ++++RPISL NV YKI +K + NRLK  +  IIS++QSAF+  RLI+DN++I +E ++ +K S
Subjt:  GKKTIQECLEILNGDKSLECWNKTNIVLIPKTTCPKEVSEYRPISLCNVNYKIITKTVANRLKWVLKDIISEAQSAFIQGRLITDNVIIGHECINSIKNS

Query:  RHTKMDMAAFKIDLSKAYDRVEWAFLKGIMLKLGFDYRWVNLVLKCISSPSFSILLNGEPKGNFNSSRGLRQGDPLSPYLFLLVAEGLSHLISSANEEGR
            M   A K+D+SKAYDR+EW+FL+G+M +LGF   +++LV+ C+S+ ++S +LNG   G     RGLRQGDP+SPYLFL  AE LS LI      G 
Subjt:  RHTKMDMAAFKIDLSKAYDRVEWAFLKGIMLKLGFDYRWVNLVLKCISSPSFSILLNGEPKGNFNSSRGLRQGDPLSPYLFLLVAEGLSHLISSANEEGR

Query:  VTGL-VCANGPRISHLLFADDSIIFCKAAEQELVVLKNLLKSYELASGECINLAKSAILLPNKVNLDRKAFLSSILGVNHVEDFGKYLGFPSVFSRSKSK
        + GL VC   P ISHLLFADD+IIFC A       +K +L+ YE ASG+ +N  KS+I+       +    + S L +  V++  +YLG PS   +SK +
Subjt:  VTGL-VCANGPRISHLLFADDSIIFCKAAEQELVVLKNLLKSYELASGECINLAKSAILLPNKVNLDRKAFLSSILGVNHVEDFGKYLGFPSVFSRSKSK

Query:  DLAYVLDKIWKATQGWKKVFL
          A + D++ +  +GWK+ +L
Subjt:  DLAYVLDKIWKATQGWKKVFL

XP_021602686.1 uncharacterized protein LOC110607833 [Manihot esculenta]2.8e-17834Show/hide
Query:  HCIIGKLLSSRIISPIAIKNALKGAWKTRKNFNVETTGRNIFNFKFESQGDRDWVLNNGPWTFDKYLIVLE--LPTANQRSVEMGFKKANFWIRLINLPT
        +C++G++L+ R I+  + ++ L  AWK  K  +V+  G N+F F+F    D D VL   PW + +  I+L    P  N R VE+   K   W+++ ++P 
Subjt:  HCIIGKLLSSRIISPIAIKNALKGAWKTRKNFNVETTGRNIFNFKFESQGDRDWVLNNGPWTFDKYLIVLE--LPTANQRSVEMGFKKANFWIRLINLPT

Query:  GYRNRKVIKKIGNSIGEFLEVEEGREEISWGKSIRIRVKIDITTPLLRGFMLKAPEVQGNCWVTIRYERLPEFCFLCGRIGHIAKECECKRGEEEKNVDN
        G+ + K  + IGN IG F+  +       W + +RIRV +D+  PL R   ++ P+     WV  +YE+L  FC+ CG +GH  + C           D 
Subjt:  GYRNRKVIKKIGNSIGEFLEVEEGREEISWGKSIRIRVKIDITTPLLRGFMLKAPEVQGNCWVTIRYERLPEFCFLCGRIGHIAKECECKRGEEEKNVDN

Query:  MEFGNWLRFQSYFRPNKKTESPRKPNSPSMDRGNEFEDELGDKVASENRSRKGKEKVVVDHSPDEDIVESRDKRNEGTSSGTYVLNTIASMDTDLLVIPK
          +G WLR      P    +  R  N   M+     +   G  + +E    K ++   VD  P + ++ +  KR                          
Subjt:  MEFGNWLRFQSYFRPNKKTESPRKPNSPSMDRGNEFEDELGDKVASENRSRKGKEKVVVDHSPDEDIVESRDKRNEGTSSGTYVLNTIASMDTDLLVIPK

Query:  SDKDLILNQTDENQVDGGEQSIRSSTILKDYSVDNTNEGNMEEEETHSGNSKATKRRASWKRKERAGVTGRIQKEEPSNLKRKAVEDNNIQNKKFKSDDS
                + +E + D  ++ + S T+                                              + +P N++++           FK    
Subjt:  SDKDLILNQTDENQVDGGEQSIRSSTILKDYSVDNTNEGNMEEEETHSGNSKATKRRASWKRKERAGVTGRIQKEEPSNLKRKAVEDNNIQNKKFKSDDS

Query:  GAMDVSGDDINSRGAKGGICLLWKDNNLVSINSYSDNHIDSTVNWEGI-TWRFTGLYGFPEASKKHLTWDLIRRLDSLLDLLWLIGGDLNEIMEDSEKLG
                 ++  G  GG+ L WKD  +  + SYS +HID  V  EG+ +WR TG YGFP  +++   WDL+R L S   L W + GD N+I+  SEK G
Subjt:  GAMDVSGDDINSRGAKGGICLLWKDNNLVSINSYSDNHIDSTVNWEGI-TWRFTGLYGFPEASKKHLTWDLIRRLDSLLDLLWLIGGDLNEIMEDSEKLG

Query:  GPRRDRKYLEDFRSVIDDCNLRDLKPNGELFTWHSNRRGVHVWERLDRFLCNNSFDSLFNFAGPKNLDWLFSDHRPIEVSLNNHQTTTTSRKGRSFKFEE
             R  LE FR  +DDC L DL   G  FTW + R    V ERLDR +  + + S+FN A    LD   SDH P+ ++L            + F+FE 
Subjt:  GPRRDRKYLEDFRSVIDDCNLRDLKPNGELFTWHSNRRGVHVWERLDRFLCNNSFDSLFNFAGPKNLDWLFSDHRPIEVSLNNHQTTTTSRKGRSFKFEE

Query:  LWTNYEACEELIASHGEWEGSNLSLTSLPMDLKNCSIALSKWGRETHNRRKNKIKECKNSLREAYKNLHLIDFDAIHYIEFELDKLLEEEEIYWKQRSRE
         W+    C  ++     +EG N +++SLP  + +C+  L +W    H   +  +   K  L   ++     DF  +  +E +  K+L + E+YWKQR+++
Subjt:  LWTNYEACEELIASHGEWEGSNLSLTSLPMDLKNCSIALSKWGRETHNRRKNKIKECKNSLREAYKNLHLIDFDAIHYIEFELDKLLEEEEIYWKQRSRE

Query:  EWLKWGDKNSKWFHKKASIRKKTNEITGIHNSEGTWVDDPKEIEDVFYSYFSNLFTSSQPDPNNINLALNGLNPRVSSEMNNKLLAPFTKEEIELSIKQM
         WL  GD N+K+FH  AS R + N IT + N+EG WVD    ++ V   ++  LFTS+   P ++   L+ +  +V+ + N  L+APF+ EE+ +++  M
Subjt:  EWLKWGDKNSKWFHKKASIRKKTNEITGIHNSEGTWVDDPKEIEDVFYSYFSNLFTSSQPDPNNINLALNGLNPRVSSEMNNKLLAPFTKEEIELSIKQM

Query:  FPTKAPGPDGYPAIFYQKYWHLVGKKTIQECLEILNGDKSLECWNKTNIVLIPKTTCPKEVSEYRPISLCNVNYKIITKTVANRLKWVLKDIISEAQSAF
         P K  GPDG    F+QKYW +VG      CL  LNGD  +   N+TNI+LIPK + P+ V++ RPISL NV  +I+ K +ANRLK VL  +ISEAQSAF
Subjt:  FPTKAPGPDGYPAIFYQKYWHLVGKKTIQECLEILNGDKSLECWNKTNIVLIPKTTCPKEVSEYRPISLCNVNYKIITKTVANRLKWVLKDIISEAQSAF

Query:  IQGRLITDNVIIGHECINSIKNSRHTKMDMAAFKIDLSKAYDRVEWAFLKGIMLKLGFDYRWVNLVLKCISSPSFSILLNGEPKGNFNSSRGLRQGDPLS
        I  RLITDN+I+ +E  +S+K  +  K  + A K D+SKAYDR+EW +LK IMLKLGF   WV+ ++K ++S S+  +   +  G     RGLRQGDPLS
Subjt:  IQGRLITDNVIIGHECINSIKNSRHTKMDMAAFKIDLSKAYDRVEWAFLKGIMLKLGFDYRWVNLVLKCISSPSFSILLNGEPKGNFNSSRGLRQGDPLS

Query:  PYLFLLVAEGLSHLISSANEEGRVTG-LVCANGPRISHLLFADDSIIFCKAAEQELVVLKNLLKSYELASGECINLAKSAILLPNKVNLDRKAFLSSILG
        PYLF++ AEGLS ++S    +GR+ G  V      +SHL FADDS+ F +A+++E   LK  L  YE ASG+ IN  KSA++     +   +A +   LG
Subjt:  PYLFLLVAEGLSHLISSANEEGRVTG-LVCANGPRISHLLFADDSIIFCKAAEQELVVLKNLLKSYELASGECINLAKSAILLPNKVNLDRKAFLSSILG

Query:  VNHVEDFGKYLGFPSVFSRSKSKDLAYVLDKIWKATQGWKKVFL
        V  VE+ G YLG PSV  R K     YV+DK+ K  QGWK  FL
Subjt:  VNHVEDFGKYLGFPSVFSRSKSKDLAYVLDKIWKATQGWKKVFL

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]8.3e-18329.36Show/hide
Query:  ELVNEWKIFNLMEKEKENHIFLNEDDWKEMTDNLDHCIIGKLLSSRIISPIAIKNALKGAWKTRKN-FNVETTGRNIFNFKFESQGDRDWVLNNGPWTFD
        +L+ EWK F L  +E+E  I ++          L+  ++GKL   R I+   +KN ++ AWK   N F V++ G N+F F F    DR+ +  +GPWTFD
Subjt:  ELVNEWKIFNLMEKEKENHIFLNEDDWKEMTDNLDHCIIGKLLSSRIISPIAIKNALKGAWKTRKN-FNVETTGRNIFNFKFESQGDRDWVLNNGPWTFD

Query:  KYLIVLELPTANQRSVEMGFKKANFWIRLINLPTGYRNRKVIKKIGNSIGEFLEVEEGREEISWGKSIRIRVKIDITTPLLRGFMLKAPEVQGNCWVTIR
        + L+++  P A     E+ F K   W+R  +LP G   R +  ++GN++G F E +       WG ++R+RV +DI+ PL RG  L      G  W+ I+
Subjt:  KYLIVLELPTANQRSVEMGFKKANFWIRLINLPTGYRNRKVIKKIGNSIGEFLEVEEGREEISWGKSIRIRVKIDITTPLLRGFMLKAPEVQGNCWVTIR

Query:  YERLPEFCFLCGRIGHIAKECECKRGEEEKNVDNMEFGNWLRFQSYFRPNKKTESPRKPNSPSMDRGNE-FEDELGDKVASENRSRKGKEKVVVDHSPDE
        YERLP+FC+ CG      K                ++G+WLR+Q             KP  P M +  E   D+ G+   S + S  G     V  +P  
Subjt:  YERLPEFCFLCGRIGHIAKECECKRGEEEKNVDNMEFGNWLRFQSYFRPNKKTESPRKPNSPSMDRGNE-FEDELGDKVASENRSRKGKEKVVVDHSPDE

Query:  DIVESRDKRNEGTSSGTYVLNTIASMDTDLLVIPKSDKDLILNQTDENQVDGGEQSIRSSTILKDYSVDNTNEGNMEEEETHSGNSKATKRRASWKRKER
                     ++G   +     M++ +   PK   +          +D GEQ                                             
Subjt:  DIVESRDKRNEGTSSGTYVLNTIASMDTDLLVIPKSDKDLILNQTDENQVDGGEQSIRSSTILKDYSVDNTNEGNMEEEETHSGNSKATKRRASWKRKER

Query:  AGVTGRIQKEEPSNLKRKAVEDNNIQNKKFKSDDSGAMDVSGDDINSRGAKGGICLLWKDNNLVSINSYSDNHIDSTVNWEGITWRFTGLYGFPEASKKH
             RI  +E SNL           N   K   SGA  +                           SYSD+     ++  G + RFTG YG P A K+H
Subjt:  AGVTGRIQKEEPSNLKRKAVEDNNIQNKKFKSDDSGAMDVSGDDINSRGAKGGICLLWKDNNLVSINSYSDNHIDSTVNWEGITWRFTGLYGFPEASKKH

Query:  LTWDLIRRLDSLLDLLWLIGGDLNEIMEDSEKLGGPRRDRKYLEDFRSVIDDCNLRDLKPNGELFTWHSNR-RGVHVWERLDRFLCNNSFDSLFNFAGPK
        LTW+L+RR+ +L    WLIGGD+N I+ + E       D   +E FR+++D C+L D+   G +FTW +NR  G  +W+RLDRFLCN++F+ +F  A   
Subjt:  LTWDLIRRLDSLLDLLWLIGGDLNEIMEDSEKLGGPRRDRKYLEDFRSVIDDCNLRDLKPNGELFTWHSNR-RGVHVWERLDRFLCNNSFDSLFNFAGPK

Query:  NLDWLFSDHRPIEVSLNNHQTTTTSRKGRSFKFEELWTNYEACEELIASHGEWEGSNLSLTSLPMDLKNCSIALSKWGRETHNRRKNKIKECKNSLREAY
                        + H +  T              NY   + + AS                     S AL  WGR        +IK  K ++ +AY
Subjt:  NLDWLFSDHRPIEVSLNNHQTTTTSRKGRSFKFEELWTNYEACEELIASHGEWEGSNLSLTSLPMDLKNCSIALSKWGRETHNRRKNKIKECKNSLREAY

Query:  KNLHLIDFDAIHYIEFELDKLLEEEEIYWKQRSREEWLKWGDKNSKWFHKKASIRKKTNEITGIHNSEGTWVDDPKEIEDVFYSYFSNLFTSSQPDPNNI
             +DF  IH +E +L  LLE EEI+WKQRSRE+WLKWG                      I N+                               +I
Subjt:  KNLHLIDFDAIHYIEFELDKLLEEEEIYWKQRSREEWLKWGDKNSKWFHKKASIRKKTNEITGIHNSEGTWVDDPKEIEDVFYSYFSNLFTSSQPDPNNI

Query:  NLALNGLNPRVSSEMNNKLLAPFTKEEIELSIKQMFPTKAPGPDGYPAIFYQKYWHLVGKKTIQECLEILNGDKSLECWNKTNIVLIPKTTCPKEVSEYR
           +N +  R++SE+N +LLAP+TKEEIEL+I+QMFPTKA GPDG+PA+FYQ YWH+VG KT++ CL  LN    ++ WN T I LIPK   P+ +S++R
Subjt:  NLALNGLNPRVSSEMNNKLLAPFTKEEIELSIKQMFPTKAPGPDGYPAIFYQKYWHLVGKKTIQECLEILNGDKSLECWNKTNIVLIPKTTCPKEVSEYR

Query:  PISLCNVNYKIITKTVANRLKWVLKDIISEAQSAFIQGRLITDNVIIGHECINSIKNSRHTKMDMAAFKIDLSKAYDRVEWAFLKGIMLKLGFDYRWVNL
        PISLCNV+YKII+K++ NRLK V+  +IS+AQSAF+  R I+DNVIIGHEC+++I + +   + MAA K+DLSKA+DRVEW +L+ IM K+GF+  W+  
Subjt:  PISLCNVNYKIITKTVANRLKWVLKDIISEAQSAFIQGRLITDNVIIGHECINSIKNSRHTKMDMAAFKIDLSKAYDRVEWAFLKGIMLKLGFDYRWVNL

Query:  VLKCISSPSFSILLNGEPKGNFNSSRGLRQGDPLSPYLFLLVAEGLSHLISSANEEGRVTGL-VCANGPRISHLLFADDSIIFCKAAEQELVVLKNLLKS
        +++CIS+  FSI LNG P G F  SRG+RQGDPLSPYLFLL AEGLS LI+  N  GR+TG+    N   I+HLLFADDS+IF ++ E E + L+ LL S
Subjt:  VLKCISSPSFSILLNGEPKGNFNSSRGLRQGDPLSPYLFLLVAEGLSHLISSANEEGRVTGL-VCANGPRISHLLFADDSIIFCKAAEQELVVLKNLLKS

Query:  YELASGECINLAKSAILLPNKVNLDRKAFLSSILGVNHVEDFGKYLGFPSVFSRSKS------------------------KDL-----AYVLDKIW---
        Y  ASG+CIN +KSA+L    V+ +R+ +L  IL V  V  FG YLG PS F+R +                         +DL     A V   +W   
Subjt:  YELASGECINLAKSAILLPNKVNLDRKAFLSSILGVNHVEDFGKYLGFPSVFSRSKS------------------------KDL-----AYVLDKIW---

Query:  ----------------------------KATQGWK-----------------------------------------------------------------
                                    K++  WK                                                                 
Subjt:  ----------------------------KATQGWK-----------------------------------------------------------------

Query:  -----------------------------------------------------------------------------KVFLLYS----------------
                                                                                     K+F+  S                
Subjt:  -----------------------------------------------------------------------------KVFLLYS----------------

Query:  -----------------------------W---------------------WMNINSTCSNEELGLVAVACWAIWSDRNKIIHEEAIPPVNKRSVWIQEY
                                     W                     W ++      ++L L A+  W IW+DRN +IH + + PV  +  W+  +
Subjt:  -----------------------------W---------------------WMNINSTCSNEELGLVAVACWAIWSDRNKIIHEEAIPPVNKRSVWIQEY

Query:  LAKYIEA-----SSRREAMHASVTRDSSPQSNRNWIPPPSDVWKFNVDAAWSDSLPETGIGVVCRGSNEKLIEASCNFLDIAMDAPGAELRAIMEGVIMA
        L  + +A     S R ++ H  V         + W P  S   K N DAA   +   T  G + R S+  L+ A+   +   +    AE+R I+EG+  A
Subjt:  LAKYIEA-----SSRREAMHASVTRDSSPQSNRNWIPPPSDVWKFNVDAAWSDSLPETGIGVVCRGSNEKLIEASCNFLDIAMDAPGAELRAIMEGVIMA

Query:  KSMGCKRVIIESDCQVAVNFLNKKSKTWCIVEGLVEAIWALGLCFDDIRFQYAQRDRNKVADRLAKWARVRKCNNIWVEADVSPEWLLLLVENDL-SFFA
         +     + +ESD  +A+  +  +  T    +  V  I AL  CF  I F ++ R  N+ A  LAKW  +   +  +      P WLL LV+ D  S FA
Subjt:  KSMGCKRVIIESDCQVAVNFLNKKSKTWCIVEGLVEAIWALGLCFDDIRFQYAQRDRNKVADRLAKWARVRKCNNIWVEADVSPEWLLLLVENDL-SFFA

Query:  HVA
        HVA
Subjt:  HVA

XP_023886153.1 uncharacterized protein LOC111998282 [Quercus suber]1.4e-17433.23Show/hide
Query:  IEELVNEWKIFNLMEKEKENHIFLNEDDWKEMTDNLDHCIIGKLLSSRIISPIAIKNALKGAWKTRKNFNVETTGRNIFNFKFESQGDRDWVLNNGPWTF
        +EE++N+WK F+L+E E+ + I L+ED+     D  +  + GK ++ R +S  A+   LK  WKTR  F +   G ++  F F  + +   VL   PW +
Subjt:  IEELVNEWKIFNLMEKEKENHIFLNEDDWKEMTDNLDHCIIGKLLSSRIISPIAIKNALKGAWKTRKNFNVETTGRNIFNFKFESQGDRDWVLNNGPWTF

Query:  DKYLIVLELPTANQRSVEMGFKKANFWIRLINLPTGYRNRKVIKKIGNSIGEFLEVEEGREEISWGKSIRIRVKIDITTPLLRGFMLKAPEVQGNCWVTI
        DK+LI+L     +     + F   NFW+++  LP    N +    IG SIG+  +  +  E I  G  +R+RV I++T PL +G  +   + +   WV  
Subjt:  DKYLIVLELPTANQRSVEMGFKKANFWIRLINLPTGYRNRKVIKKIGNSIGEFLEVEEGREEISWGKSIRIRVKIDITTPLLRGFMLKAPEVQGNCWVTI

Query:  RYERLPEFCFLCGRIGHIAKECECKRGEEEKNVDNMEF-------GNWLRFQSYFRPNKKTESPRKPNSPSMDRGNEFEDELGDKVASENRSRKGKEKVV
        +YE+LP FC+ CG + H AKE       +  +V    F       G++    ++ R  ++ E  R+  +P  +      D L     ++N    GK    
Subjt:  RYERLPEFCFLCGRIGHIAKECECKRGEEEKNVDNMEF-------GNWLRFQSYFRPNKKTESPRKPNSPSMDRGNEFEDELGDKVASENRSRKGKEKVV

Query:  VDHSPDEDIVESRDKRNEGTSSGTYVLNTIASMDTDLLVIPK--SDKDLILNQTDE------NQVDGGEQSIRSSTILKDYSVDNTNEGNMEEEETHSGN
        V H+      ++        SS T  +   + M T+L    +   + D+ +N+ D       N     E S   S+ ++ +  D T      E    +  
Subjt:  VDHSPDEDIVESRDKRNEGTSSGTYVLNTIASMDTDLLVIPK--SDKDLILNQTDE------NQVDGGEQSIRSSTILKDYSVDNTNEGNMEEEETHSGN

Query:  SKATKRR----ASWKRKERAGVTGRIQKEEPSNLKRKAVEDNNIQ----NKKFKSDDSGAMDVSGDDINSR-----------------------------
         +  +       +WKR  R   +       P+  K  +V D + +     KK +      M   GD I ++                             
Subjt:  SKATKRR----ASWKRKERAGVTGRIQKEEPSNLKRKAVEDNNIQ----NKKFKSDDSGAMDVSGDDINSR-----------------------------

Query:  ------GAKGGICLLWKDNNLVSINSYSDNHIDSTV--NWEGITWRFTGLYGFPEASKKHLTWDLIRRLDSLLDLLWLIGGDLNEIMEDSEKLGGPRRDR
                 GG+ L WK+N  + + + S NHID+ V    EG  WRFTG YG P   K+  +W+L+R L+S + L W+  GD NEI   SEKLGG  R  
Subjt:  ------GAKGGICLLWKDNNLVSINSYSDNHIDSTV--NWEGITWRFTGLYGFPEASKKHLTWDLIRRLDSLLDLLWLIGGDLNEIMEDSEKLGGPRRDR

Query:  KYLEDFRSVIDDCNLRDLKPNGELFTWHSNRRGVH-VWERLDRFLCNNSFDSLFNFAGP--KNLDWLFSDHRPIEVSLNNHQTTTTSRKGRSFKFEELWT
          ++ FR  ID+C   DL   G  FTW  +    H VWERLDR L N+ +  L  FAG    +L    SDH PI +   N       R  + F FEE+W 
Subjt:  KYLEDFRSVIDDCNLRDLKPNGELFTWHSNRRGVH-VWERLDRFLCNNSFDSLFNFAGP--KNLDWLFSDHRPIEVSLNNHQTTTTSRKGRSFKFEELWT

Query:  NYEACEELIASHGEWEGSNLSLTSLPMDLKNCSIALSKWGRETHNRRKNKIKECKNSLREAYKN-LHLIDFDAIHYIEFELDKLLEEEEIYWKQRSREEW
            C + I +      S    T +   ++ C + L KW        + ++++ K  + +A K  +       +  +  EL+ L ++E   W QRS+ +W
Subjt:  NYEACEELIASHGEWEGSNLSLTSLPMDLKNCSIALSKWGRETHNRRKNKIKECKNSLREAYKN-LHLIDFDAIHYIEFELDKLLEEEEIYWKQRSREEW

Query:  LKWGDKNSKWFHKKASIRKKTNEITGIHNSEGTWVDDPKEIEDVFYSYFSNLFTSSQPDPNNINLALNGLNPRVSSEMNNKLLAPFTKEEIELSIKQMFP
         K GDKNSK+FH +A+ R + N I+ +   +GTW  D  E+ +   +YF +LF S+  +P N+      + P +S EMN +L+A F   E++ +IKQM P
Subjt:  LKWGDKNSKWFHKKASIRKKTNEITGIHNSEGTWVDDPKEIEDVFYSYFSNLFTSSQPDPNNINLALNGLNPRVSSEMNNKLLAPFTKEEIELSIKQMFP

Query:  TKAPGPDGYPAIFYQKYWHLVGKKTIQECLEILNGDKSLECWNKTNIVLIPKTTCPKEVSEYRPISLCNVNYKIITKTVANRLKWVLKDIISEAQSAFIQ
         KAPGPDG P +FYQ YW L+G+      L  LN        N T I LIPK   P+ V E+RPISLCNV YKI +K +ANRLK +L  II+E QSAF +
Subjt:  TKAPGPDGYPAIFYQKYWHLVGKKTIQECLEILNGDKSLECWNKTNIVLIPKTTCPKEVSEYRPISLCNVNYKIITKTVANRLKWVLKDIISEAQSAFIQ

Query:  GRLITDNVIIGHECINSIKNSRHT-KMDMAAFKIDLSKAYDRVEWAFLKGIMLKLGFDYRWVNLVLKCISSPSFSILLNGEPKGNFNSSRGLRQGDPLSP
         RLI+DN+++  E + S++  RHT K +  A K+D+SK+YDRVEW +L+ +M ++GFD +W+ L++  I++ S+SIL+N EP      SRG+RQGDPLSP
Subjt:  GRLITDNVIIGHECINSIKNSRHT-KMDMAAFKIDLSKAYDRVEWAFLKGIMLKLGFDYRWVNLVLKCISSPSFSILLNGEPKGNFNSSRGLRQGDPLSP

Query:  YLFLLVAEGLSHLISSANEEGRVTGLVCA-NGPRISHLLFADDSIIFCKAAEQELVVLKNLLKSYELASGECINLAKSAILLPNKVNLDRKAFLSSILGV
        +LFLL  EGL+ LIS A   G + G   + N PR++HLLFADDS++FC+A  QE   +  +L+ Y   SG+ IN  K+ I      + + +  +   LGV
Subjt:  YLFLLVAEGLSHLISSANEEGRVTGLVCA-NGPRISHLLFADDSIIFCKAAEQELVVLKNLLKSYELASGECINLAKSAILLPNKVNLDRKAFLSSILGV

Query:  NHVEDFGKYLGFPSVFSRSKSKDLAYVLDKIWKATQGWKKVFL
          ++ + KYLG PS   R K     ++ +K+W+  QGW++  L
Subjt:  NHVEDFGKYLGFPSVFSRSKSKDLAYVLDKIWKATQGWKKVFL

XP_023886153.1 uncharacterized protein LOC111998282 [Quercus suber]8.5e-1024.54Show/hide
Query:  LVAVACWAIWSDRNKIIHEEAIPPVNKRSVWIQEYLAKYIEASSRREAMHASVTRDSSPQSNR-NWIPPPSDVWKFNVDAAWSDSLPETGIGVVCRGSNE
        L A   W IW  RN++       P+++        L ++       +   A+  +  SP  +R  W PPP    K N D A      E G+GVV R S+ 
Subjt:  LVAVACWAIWSDRNKIIHEEAIPPVNKRSVWIQEYLAKYIEASSRREAMHASVTRDSSPQSNR-NWIPPPSDVWKFNVDAAWSDSLPETGIGVVCRGSNE

Query:  KLIEASCNFLDIAMDAPGAELRAIMEGVIMAKSMGCKRVIIESDCQVAVNFLNKKSKTWCIVEGLVEAIWALGLCFDDIRFQYAQRDRNKVADRLAKWAR
        K++ +    + +   +   E  A +  + +AK +     I+E D +V ++ L K+ +++     L+ +I    +  + + F + +R  N +A  LAK AR
Subjt:  KLIEASCNFLDIAMDAPGAELRAIMEGVIMAKSMGCKRVIIESDCQVAVNFLNKKSKTWCIVEGLVEAIWALGLCFDDIRFQYAQRDRNKVADRLAKWAR

Query:  VRKCNNIWVEADVSPE
             ++W++ D+ P+
Subjt:  VRKCNNIWVEADVSPE

XP_030923330.1 uncharacterized protein LOC115950239 [Quercus lobata]1.8e-18533.61Show/hide
Query:  EKENHIFLNEDDWKEMTDNLDHCIIGKLLSSRIISPIAIKNALKGAWKTRKNFNVETTGRNIFNFKFESQGDRDWVLNNGPWTFDKYLIVLELPTANQRS
        E+E  I +++D   E  ++ +  +IGK L+ +  + +A KN ++ AW    +  +   G N+F FKF+S+ D + ++  GPW+FD  L++L+        
Subjt:  EKENHIFLNEDDWKEMTDNLDHCIIGKLLSSRIISPIAIKNALKGAWKTRKNFNVETTGRNIFNFKFESQGDRDWVLNNGPWTFDKYLIVLELPTANQRS

Query:  VEMGFKKANFWIRLINLPTGYRNRKVIKKIGNSIGEFLEVEEGREEISWGKSIRIRVKIDITTPLLRGFMLKAPEVQGNCWVTIRYERLPEFCFLCGRIG
          + F+ A+ WI++ + P    +  V +++G+ +G+  EVE  + +      +R+RV + I  P+ RG  +   +     WV+ +YERLP FC  CG +G
Subjt:  VEMGFKKANFWIRLINLPTGYRNRKVIKKIGNSIGEFLEVEEGREEISWGKSIRIRVKIDITTPLLRGFMLKAPEVQGNCWVTIRYERLPEFCFLCGRIG

Query:  HIAKECECKRGEEEKNVD-NMEFGNWLRFQSYFRPNKKTESPRKPNSPSMDRGNEFEDELGDKVASENRSRKGKEK---------VVVDHSPDEDIVES-
        H  + C      E+K      ++G++LR               KP S      +   +E+ D  A+E RS+ G +K         V +D   +   VE+ 
Subjt:  HIAKECECKRGEEEKNVD-NMEFGNWLRFQSYFRPNKKTESPRKPNSPSMDRGNEFEDELGDKVASENRSRKGKEK---------VVVDHSPDEDIVES-

Query:  -RDKRNEGTSSGT---YVLNTI-----------ASMDTDLLVIPKSDKDLILNQTDENQVDGGEQSIRSSTILKDYSVDNTNEGNMEEEETHSGNSKA-T
         R   +E  + GT     LN I           A +  +       DK  +  + +      G     +   L D  +    +G      +   N  A  
Subjt:  -RDKRNEGTSSGT---YVLNTI-----------ASMDTDLLVIPKSDKDLILNQTDENQVDGGEQSIRSSTILKDYSVDNTNEGNMEEEETHSGNSKA-T

Query:  KRRASWKRKER-----AGVTGRIQKEEPSNLKRKAVEDNNIQNKKFKSDDSGAMDVSGDDINSRGAKGGICLLWKDNNLVSINSYSDNHIDSTVNWE-GI
        K + SW R  R        +  +          +   +   + + FK  D+G               GG+  LWK++  + + +++ NH+ + V  E G 
Subjt:  KRRASWKRKER-----AGVTGRIQKEEPSNLKRKAVEDNNIQNKKFKSDDSGAMDVSGDDINSRGAKGGICLLWKDNNLVSINSYSDNHIDSTVNWE-GI

Query:  TWRFTGLYGFPEASKKHLTWDLIRRLDSLLDLLWLIGGDLNEIMEDSEKLGGPRRDRKYLEDFRSVIDDCNLRDLKPNGELFTWHSNRRG-VHVWERLDR
         W  TG YG+P A +K  +W L++ L + +   W++ GD N  +  SEK+   +     +E FR  +  C L DL   G  +TW + R G  +   RLDR
Subjt:  TWRFTGLYGFPEASKKHLTWDLIRRLDSLLDLLWLIGGDLNEIMEDSEKLGGPRRDRKYLEDFRSVIDDCNLRDLKPNGELFTWHSNRRG-VHVWERLDR

Query:  FLCNNSFDSLFNFAGPKNLDWLFSDHRPIEVSLNNHQTTTTSRKGRSFKFEELWTNYEACEELI-ASHGEWEGSNLSLTSLPMDLKNCSIALSKWGRETH
         + N  +   F  +   +L    SDH P+ + + +  +      GRSFKFEE W   + C  +I  + G  +G+   L ++   +K C + L  WG    
Subjt:  FLCNNSFDSLFNFAGPKNLDWLFSDHRPIEVSLNNHQTTTTSRKGRSFKFEELWTNYEACEELI-ASHGEWEGSNLSLTSLPMDLKNCSIALSKWGRETH

Query:  NRRKNKIKECKNSLREAYKNLHLIDFDAIHYIEF--ELDKLLEEEEIYWKQRSREEWLKWGDKNSKWFHKKASIRKKTNEITGIHNSEGTWVDDPKEIED
        +     IKE +  L +      L +     ++    ++D LL+++EIYW QRSR  WL+ GD+N+K+FH KAS R++ N I GI NS+G WV++ +E+  
Subjt:  NRRKNKIKECKNSLREAYKNLHLIDFDAIHYIEF--ELDKLLEEEEIYWKQRSREEWLKWGDKNSKWFHKKASIRKKTNEITGIHNSEGTWVDDPKEIED

Query:  VFYSYFSNLFTSSQPDPNNINLALNGLNPRVSSEMNNKLLAPFTKEEIELSIKQMFPTKAPGPDGYPAIFYQKYWHLVGKKTIQECLEILNGDKSLECWN
        V   YF NLF +   D   +   L+ ++ +V+ +M   L   FT EE++ ++ QM PTKAPGPDG  A+FYQK+WH+VG   +   L+ LN    L   N
Subjt:  VFYSYFSNLFTSSQPDPNNINLALNGLNPRVSSEMNNKLLAPFTKEEIELSIKQMFPTKAPGPDGYPAIFYQKYWHLVGKKTIQECLEILNGDKSLECWN

Query:  KTNIVLIPKTTCPKEVSEYRPISLCNVNYKIITKTVANRLKWVLKDIISEAQSAFIQGRLITDNVIIGHECINSIKNSRHTKMDMAAFKIDLSKAYDRVE
         TNIVLIPK   P+ +SE+RPISLCNV YKII+K +ANRLK VL  IIS  QSAF+ GRLITDNV++ +E ++++   +  K    A K+D+SKAYDRVE
Subjt:  KTNIVLIPKTTCPKEVSEYRPISLCNVNYKIITKTVANRLKWVLKDIISEAQSAFIQGRLITDNVIIGHECINSIKNSRHTKMDMAAFKIDLSKAYDRVE

Query:  WAFLKGIMLKLGFDYRWVNLVLKCISSPSFSILLNGEPKGNFNSSRGLRQGDPLSPYLFLLVAEGLSHLISSANEEGRVTGL-VCANGPRISHLLFADDS
        W FL+ IM K+GF   W+  V+ C+++PSFSIL+NG+P      SRG+RQGDP+SPYLFLL AEGL+ L++ A   G +TG+ +C   P+I++L+FADDS
Subjt:  WAFLKGIMLKLGFDYRWVNLVLKCISSPSFSILLNGEPKGNFNSSRGLRQGDPLSPYLFLLVAEGLSHLISSANEEGRVTGL-VCANGPRISHLLFADDS

Query:  IIFCKAAEQELVVLKNLLKSYELASGECINLAKSAILLPNKVNLDRKAFLSSILGVNHVEDFGKYLGFPSVFSRSKSKDLAYVLDKIWKATQGWKKVFL
        ++FC+A   E   +  +L+ YE ASG+ INL KS+       +  +K  +  ILGV  V+ F KYLG P++  R+K    + + D++WK  QGWK + L
Subjt:  IIFCKAAEQELVVLKNLLKSYELASGECINLAKSAILLPNKVNLDRKAFLSSILGVNHVEDFGKYLGFPSVFSRSKSKDLAYVLDKIWKATQGWKKVFL

TrEMBL top hitse value%identityAlignment
A0A2N9ELB0 Uncharacterized protein9.3e-19635.98Show/hide
Query:  DIEELVNEWKIFNLMEKEKENHIFLNEDDWKEMTDNLDHCIIGKLLSSRIISPIAIKNALKGAWKTRKNFNVETTGRNIFNFKFESQGDRDWVLNNGPWT
        ++EEL    K    +   +++HI L  +          + I+ KLL+ R  +  A+KN+++  W +     V     N+F   F S+ D + +    PWT
Subjt:  DIEELVNEWKIFNLMEKEKENHIFLNEDDWKEMTDNLDHCIIGKLLSSRIISPIAIKNALKGAWKTRKNFNVETTGRNIFNFKFESQGDRDWVLNNGPWT

Query:  FDKYLIVLELPTANQRSVEMGFKKANFWIRLINLPTGYRNRKVIKKIGNSIGEFLEVE-EGREEISWGKSIRIRVKIDITTPLLRGFMLKAPEVQGNCWV
        FDK LI +     + +   + F  A FWIR++NLP     R+V + IG ++G  ++V+   +  I+WG+ +RIRV++D+T PL+RG +++  E     WV
Subjt:  FDKYLIVLELPTANQRSVEMGFKKANFWIRLINLPTGYRNRKVIKKIGNSIGEFLEVE-EGREEISWGKSIRIRVKIDITTPLLRGFMLKAPEVQGNCWV

Query:  TIRYERLPEFCFLCGRIGHIAKECECKRGEEEKNV-DNMEFGNWLRFQSYFRPNKKTESPRKPNSPSMDRGNEFEDELGDKVASENRS-RKGKEKVVVDH
          RYE LP FC+ CG IGH   EC   RG     V D  ++G+WLR             P +P+  S  R  E  D  G   A   +S  + ++ V  + 
Subjt:  TIRYERLPEFCFLCGRIGHIAKECECKRGEEEKNV-DNMEFGNWLRFQSYFRPNKKTESPRKPNSPSMDRGNEFEDELGDKVASENRS-RKGKEKVVVDH

Query:  SPDEDIVESRDKRNEGTSS---GTYVLNT---------IASMDTDLLVIPK-------------SDKDLILNQTDENQVDGGEQSIRSSTILKDYSVDNT
        +P + +       NE  SS   G  + N          +A    D+L +P               D   +     E  +D    +  +  I  D S+   
Subjt:  SPDEDIVESRDKRNEGTSS---GTYVLNT---------IASMDTDLLVIPK-------------SDKDLILNQTDENQVDGGEQSIRSSTILKDYSVDNT

Query:  NEGN-----MEEEETHSG----------------NSKATKRRASWK----------RKERAGVTGRIQKEEPSNLKRKAVEDNNIQNKKFKSDDSGAMDV
         E N      EE     G                N+++       K            E  G    + ++E  +L         ++N +F     G    
Subjt:  NEGN-----MEEEETHSG----------------NSKATKRRASWK----------RKERAGVTGRIQKEEPSNLKRKAVEDNNIQNKKFKSDDSGAMDV

Query:  SGDDINSRGAKGGICLLWKDNNLVSINSYSDNHIDSTVNWE-GITWRFTGLYGFPEASKKHLTWDLIRRLDSLLDLLWLIGGDLNEIMEDSEKLGGPRRD
         G  +   G  GG+ LLW  +  V I SYS++HID+ V +E G+ WR TG YG PE + +  +W L+R L     L WL+ GD NEI    EK G   R 
Subjt:  SGDDINSRGAKGGICLLWKDNNLVSINSYSDNHIDSTVNWE-GITWRFTGLYGFPEASKKHLTWDLIRRLDSLLDLLWLIGGDLNEIMEDSEKLGGPRRD

Query:  RKYLEDFRSVIDDCNLRDLKPNGELFTWHSNRR--GVHVWERLDRFLCNNSFDSLFNFAGPKNLDWLFSDHRPIEVSLNNHQTTTTSRKGRSFKFEELWT
           +  FR  + DC+LRDL   G  FTW SNRR  G  V  RLDR + N+++ SLF  A   ++    SDH  + +  N  Q     RK + F+FE +W 
Subjt:  RKYLEDFRSVIDDCNLRDLKPNGELFTWHSNRR--GVHVWERLDRFLCNNSFDSLFNFAGPKNLDWLFSDHRPIEVSLNNHQTTTTSRKGRSFKFEELWT

Query:  NYEACEELIASHGEWEGSNLSLTSLPM-----DLKNCSIALSKWGRETHNRRKNKIKECKNSLREAYKNLHLIDFDA--IHYIEFELDKLLEEEEIYWKQ
            CEE I     W   N S T  PM      +K C I L +W   T  R   +I E K S     +N  + ++++  ++ +  EL+ L E+EE++W+Q
Subjt:  NYEACEELIASHGEWEGSNLSLTSLPM-----DLKNCSIALSKWGRETHNRRKNKIKECKNSLREAYKNLHLIDFDA--IHYIEFELDKLLEEEEIYWKQ

Query:  RSREEWLKWGDKNSKWFHKKASIRKKTNEITGIHNSEGTWVDDPKEIEDVFYSYFSNLFTSSQPDPNNINLALNGLNPRVSSEMNNKLLAPFTKEEIELS
        RSR  WLK GD+N+++FHK AS R+KTN I G+ +  G W  D   I  +   YF NLFTSS P+  +I+  +  + P VS +MN+ L+ P++ EEI  +
Subjt:  RSREEWLKWGDKNSKWFHKKASIRKKTNEITGIHNSEGTWVDDPKEIEDVFYSYFSNLFTSSQPDPNNINLALNGLNPRVSSEMNNKLLAPFTKEEIELS

Query:  IKQMFPTKAPGPDGYPAIFYQKYWHLVGKKTIQECLEILNGDKSLECWNKTNIVLIPKTTCPKEVSEYRPISLCNVNYKIITKTVANRLKWVLKDIISEA
        + Q+ P+KAPGPDG  A+F+QKYWH+VG       L+ LN  + L   N TNI LIPK   P+ ++ +RPISLCNV YKI++K + NR+K +L  +IS++
Subjt:  IKQMFPTKAPGPDGYPAIFYQKYWHLVGKKTIQECLEILNGDKSLECWNKTNIVLIPKTTCPKEVSEYRPISLCNVNYKIITKTVANRLKWVLKDIISEA

Query:  QSAFIQGRLITDNVIIGHECINSIKNSRHTKMDMAAFKIDLSKAYDRVEWAFLKGIMLKLGFDYRWVNLVLKCISSPSFSILLNGEPKGNFNSSRGLRQG
        QSAF+ GRLITDNVI+  E ++ +KN R       A K+D+SKAYDRVEW +L+ I+LKLGF+  WV L++ C++S S+S+L+NGE +G    SRGLRQG
Subjt:  QSAFIQGRLITDNVIIGHECINSIKNSRHTKMDMAAFKIDLSKAYDRVEWAFLKGIMLKLGFDYRWVNLVLKCISSPSFSILLNGEPKGNFNSSRGLRQG

Query:  DPLSPYLFLLVAEGLSHLISSANEEGRVTGL-VCANGPRISHLLFADDSIIFCKAAEQELVVLKNLLKSYELASGECINLAKSAILLPNKVNLDRKAFLS
        DPLSPYLFL+ AEGLS LI  A  +    G+ +C  GPRISHL FADDS+IFC+A+  +  V++NLL  YE ASG+ +N  K+A+         +K  + 
Subjt:  DPLSPYLFLLVAEGLSHLISSANEEGRVTGL-VCANGPRISHLLFADDSIIFCKAAEQELVVLKNLLKSYELASGECINLAKSAILLPNKVNLDRKAFLS

Query:  SILGVNHVEDFGKYLGFPSVFSRSKSKDLAYVLDKIWKATQGWKKVFL
        S+ G +    F KYLG P +  RSK      + ++IWK  QGWK+  L
Subjt:  SILGVNHVEDFGKYLGFPSVFSRSKSKDLAYVLDKIWKATQGWKKVFL

A0A2N9FCL2 Reverse transcriptase domain-containing protein1.2e-19535.16Show/hide
Query:  IEELVNEWKIFNLMEKEKENHIFLNEDDWKEMTDNLDHCIIGKLLSSRIISPIAIKNALKGAWKTRKNFNVETTGRNIFNFKFESQGDRDWVLNNGPWTF
        +E +   WK F+L +KE        + D    T   +H +  K L+SR+++  A+    K  WKTR++F V+  G N   F FE   D + VL N PWT+
Subjt:  IEELVNEWKIFNLMEKEKENHIFLNEDDWKEMTDNLDHCIIGKLLSSRIISPIAIKNALKGAWKTRKNFNVETTGRNIFNFKFESQGDRDWVLNNGPWTF

Query:  DKYLIVLELPTANQRSVEMGFKKANFWIRLINLPTGYRNRKVIKKIGNSIGEFLEVEEGREEISWGKSIRIRVKIDITTPLLRGFMLKAPEVQGNCWVTI
        DK+L+V +    +    +  F   +FW++L NLP   R     + IG SIG   +V    +E      +R+RV++++ +PL RG ++K  E Q   WV  
Subjt:  DKYLIVLELPTANQRSVEMGFKKANFWIRLINLPTGYRNRKVIKKIGNSIGEFLEVEEGREEISWGKSIRIRVKIDITTPLLRGFMLKAPEVQGNCWVTI

Query:  RYERLPEFCFLCGRIGHIAKECECKRGEEEKNVDNME---FGNWLRFQSYFRPNKKTESPRKPNSPSMDRGNEFEDELGDKVASENRSRKGKEKVVVDHS
        RYERLP FC+ CG + H  K+C+   G +++N    E   FG WLR  S  RP +KT      N P   +G E       +   +  + +   K  V +S
Subjt:  RYERLPEFCFLCGRIGHIAKECECKRGEEEKNVDNME---FGNWLRFQSYFRPNKKTESPRKPNSPSMDRGNEFEDELGDKVASENRSRKGKEKVVVDHS

Query:  PDEDIVESRDKRNEGTSSGTYVLNTIASMDTDLLVIPKSDKDLILNQTDE-----NQVDGGEQ---SIRSSTILKDYSV--DNTNEGNMEEE--------
         + +  +  D           +       ++D+  + KS+ ++  +Q  E     N +  GEQ    +  S  ++++S+  D T  G   +         
Subjt:  PDEDIVESRDKRNEGTSSGTYVLNTIASMDTDLLVIPKSDKDLILNQTDE-----NQVDGGEQ---SIRSSTILKDYSV--DNTNEGNMEEE--------

Query:  -----ETHSGNSKATKRRAS---WKRKERAGVTGRIQKEEPSNLKRK--AVEDNNIQNKKFKSDDSGAMDVSGDDI--NSRGAKGGICLLWKDNNLVSIN
             +  +G +   +RR S   WK+  RA            + KR    VE+N    K++++    ++  +   I  N RG  G    + +  NLV I 
Subjt:  -----ETHSGNSKATKRRAS---WKRKERAGVTGRIQKEEPSNLKRK--AVEDNNIQNKKFKSDDSGAMDVSGDDI--NSRGAKGGICLLWKDNNLVSIN

Query:  SYS----------DNHIDSTVNWEGITWRFTGLYGFPEASKKHLTWDLIRRLDSLLDLLWLIGGDLNEIMEDSEKLGGPRRDRKYLEDFRSVIDDCNLRD
          S          ++ ++   N E   WR T  YG PE   +  +W+L+R L+    L W   GD NEI+ +SEK G   +  + ++ FR+VIDDC   D
Subjt:  SYS----------DNHIDSTVNWEGITWRFTGLYGFPEASKKHLTWDLIRRLDSLLDLLWLIGGDLNEIMEDSEKLGGPRRDRKYLEDFRSVIDDCNLRD

Query:  LKPNGELFTWHSNRRG-VHVWERLDRFLCNNSFDSLFNFAGPKNLDWLFSDHRPIEVSLNNHQTTTTSRKGRSFKFEELWTNYEACEELIASHGEWEGSN
        L   G  FTW +NRRG    W RLDRF+  N +   FN A   +L+   SDH+PI   LN H       + R F+FE++W +  +CE +I      + S 
Subjt:  LKPNGELFTWHSNRRG-VHVWERLDRFLCNNSFDSLFNFAGPKNLDWLFSDHRPIEVSLNNHQTTTTSRKGRSFKFEELWTNYEACEELIASHGEWEGSN

Query:  LSLTSLPMDLKNCSIALSKWGRETHNRRKNKIKECKNSLREAYKNLHL-IDFDAIHYIEFELDKLLEEEEIYWKQRSREEWLKWGDKNSKWFHKKASIRK
             +   ++ C  AL++W R         +KE    L+ A  +  L +  + +  I  E++ LL +EE  WKQRSR+ WLK GD+N+K+FH +AS R+
Subjt:  LSLTSLPMDLKNCSIALSKWGRETHNRRKNKIKECKNSLREAYKNLHL-IDFDAIHYIEFELDKLLEEEEIYWKQRSREEWLKWGDKNSKWFHKKASIRK

Query:  KTNEITGIHNSEGTWVDDPKEIEDVFYSYFSNLFTSSQPDPNNINLALNGLNPRVSSEMNNKLLAPFTKEEIELSIKQMFPTKAPGPDGYPAIFYQKYWH
        + N I  +       + + + I + F  Y+  LFT++  +  ++ L L+G+ P +++EMN  L+  FT+EE++ ++KQM P KAPGPDG P +FYQ YWH
Subjt:  KTNEITGIHNSEGTWVDDPKEIEDVFYSYFSNLFTSSQPDPNNINLALNGLNPRVSSEMNNKLLAPFTKEEIELSIKQMFPTKAPGPDGYPAIFYQKYWH

Query:  LVGKKTIQECLEILNGDKSLECWNKTNIVLIPKTTCPKEVSEYRPISLCNVNYKIITKTVANRLKWVLKDIISEAQSAFIQGRLITDNVIIGHECINSIK
        +VG       L  L     L   N T + LIPKT  P+ V+EYRPISLCNV YK+I+K +ANRLK +L  IISE+QSAF+ GRLITDN++I  E ++ + 
Subjt:  LVGKKTIQECLEILNGDKSLECWNKTNIVLIPKTTCPKEVSEYRPISLCNVNYKIITKTVANRLKWVLKDIISEAQSAFIQGRLITDNVIIGHECINSIK

Query:  NSRHTKMDMAAFKIDLSKAYDRVEWAFLKGIMLKLGFDYRWVNLVLKCISSPSFSILLNGEPKGNFNSSRGLRQGDPLSPYLFLLVAEGLSHLISSANEE
        N R  K+   A K+D+SKAYDRVEW FLK +M+ +GF  RW++L+L+CIS+ S+S+L+NG P G+   +RGLRQGDP+SPYLFLL AEGL+ LI  A  +
Subjt:  NSRHTKMDMAAFKIDLSKAYDRVEWAFLKGIMLKLGFDYRWVNLVLKCISSPSFSILLNGEPKGNFNSSRGLRQGDPLSPYLFLLVAEGLSHLISSANEE

Query:  GRVTGL-VCANGPRISHLLFADDSIIFCKAAEQELVVLKNLLKSYELASGECINLAKSAILLPNKVNLDRKAFLSSILGVNHVEDFGKYLGFPSVFSRSK
        G + G+ +C  GP+I++L FADDS++FC+A+  E   ++ +L +YE ASG+ +N AK+ +        D +A L  ILGV  ++ + KYLG PS+  + K
Subjt:  GRVTGL-VCANGPRISHLLFADDSIIFCKAAEQELVVLKNLLKSYELASGECINLAKSAILLPNKVNLDRKAFLSSILGVNHVEDFGKYLGFPSVFSRSK

Query:  SKDLAYVLDKIWKATQGWKKVFL
            + + +++W   +GWK+  L
Subjt:  SKDLAYVLDKIWKATQGWKKVFL

A0A2N9FFZ2 Reverse transcriptase domain-containing protein1.5e-19834.13Show/hide
Query:  IEELVNEWKIFNLMEKEKENHIFLNEDDWKEMTDNLDHCIIGKLLSSRIISPIAIKNALKGAWKTRKNFNVETTGRNIFNFKFESQGDRDWVLNNGPWTF
        +E L   W+ F+L E E +      + D    T      +  K L+ R ++  A+    K  W+T   F++     N   F F+ + DR+ V+   PW +
Subjt:  IEELVNEWKIFNLMEKEKENHIFLNEDDWKEMTDNLDHCIIGKLLSSRIISPIAIKNALKGAWKTRKNFNVETTGRNIFNFKFESQGDRDWVLNNGPWTF

Query:  DKYLIVLELPTANQRSVEMGFKKANFWIRLINLPTGYRNRKVIKKIGNSIGEFLEVEEGREEISWGKSIRIRVKIDITTPLLRGFMLKAPEVQGNCWVTI
        DKYL++L+    ++   E+ F   +FW++L  +P    N +V   +G+S+G+   V  G   +  G+++RIRV IDIT PL RG   K  E     W++ 
Subjt:  DKYLIVLELPTANQRSVEMGFKKANFWIRLINLPTGYRNRKVIKKIGNSIGEFLEVEEGREEISWGKSIRIRVKIDITTPLLRGFMLKAPEVQGNCWVTI

Query:  RYERLPEFCFLCGRIGHIAKEC-ECKRGEEEKNVDNMEFGNWLRFQSYFRPNKKTESPRKPNSPSMDRGNEFEDELGDKVASENRSRKGKEKVVVDHSPD
        +YERLP FC+ CG + H  K+C    R +E   +++ +FG+WLR                                    ++E   RK + K+     P 
Subjt:  RYERLPEFCFLCGRIGHIAKEC-ECKRGEEEKNVDNMEFGNWLRFQSYFRPNKKTESPRKPNSPSMDRGNEFEDELGDKVASENRSRKGKEKVVVDHSPD

Query:  EDIVESRDKRNEGTSSGTYVLNTIASMDTDLLVIPKSDKDLILNQTDENQVDGGEQSIRSSTILKDYSVDNTNEGNMEEEETHSGNSKATKRRASWKRKE
             ++   +  +   T+  +      T+ ++ P +     L+    N       +  +   +   +  +      ++  TH    +  +   +W    
Subjt:  EDIVESRDKRNEGTSSGTYVLNTIASMDTDLLVIPKSDKDLILNQTDENQVDGGEQSIRSSTILKDYSVDNTNEGNMEEEETHSGNSKATKRRASWKRKE

Query:  RAGVTGRIQKEEPSNLKRKAVEDNNIQNKKFKSDDSGAMDVSGDDINSRGAKGGICLLWKDNNLVSINSYSDNHIDSTVN-WEGITWRFTGLYGFPEASK
                Q E P    R  ++ ++    KF              +N R   GG+CL WK +  +S+ S+S +HID+ VN  +  TWRFTG YG PE  K
Subjt:  RAGVTGRIQKEEPSNLKRKAVEDNNIQNKKFKSDDSGAMDVSGDDINSRGAKGGICLLWKDNNLVSINSYSDNHIDSTVN-WEGITWRFTGLYGFPEASK

Query:  KHLTWDLIRRLDSLLDLLWLIGGDLNEIMEDSEKLGGPRRDRKYLEDFRSVIDDCNLRDLKPNGELFTWHSNRRGVHVWERLDRFLCNNSFDSLFNFAGP
        +  +WDL+RRL++ L L W   GD NE++   EK G   R    ++ FR V+D+C   DL   G  FTW +NR G   WERLDR +    +   F  A  
Subjt:  KHLTWDLIRRLDSLLDLLWLIGGDLNEIMEDSEKLGGPRRDRKYLEDFRSVIDDCNLRDLKPNGELFTWHSNRRGVHVWERLDRFLCNNSFDSLFNFAGP

Query:  KNLDWLFSDHRPIEVSLNNHQTTTTSRKGRSFKFEELWTNYEACEELIASHGEWEGSNLSLTSLPMDLKNCSIALSKWGRETHNRRKNKIKECKNSLREA
         +L+  +SDH+PI VS      T    K + F+FEE+WT+ + CE +I    + + + + + ++   +  C   L  W R T     ++IKE +  L+ A
Subjt:  KNLDWLFSDHRPIEVSLNNHQTTTTSRKGRSFKFEELWTNYEACEELIASHGEWEGSNLSLTSLPMDLKNCSIALSKWGRETHNRRKNKIKECKNSLREA

Query:  YKN-LHLIDFDAIHYIEFELDKLLEEEEIYWKQRSREEWLKWGDKNSKWFHKKASIRKKTNEITGIHNSEGTWVDDPKEIEDVFYSYFSNLFTSSQPDPN
         +N +   D   ++ ++ EL  LL +EE  W+QRSR EWL  GD+N+++FH +A+ RK+ N +T +   +G W     ++  +F  Y+ +LF ++ PD  
Subjt:  YKN-LHLIDFDAIHYIEFELDKLLEEEEIYWKQRSREEWLKWGDKNSKWFHKKASIRKKTNEITGIHNSEGTWVDDPKEIEDVFYSYFSNLFTSSQPDPN

Query:  NINLALNGLNPRVSSEMNNKLLAPFTKEEIELSIKQMFPTKAPGPDGYPAIFYQKYWHLVGKKTIQECLEILNGDKSLECWNKTNIVLIPKTTCPKEVSE
         +   +  +   V++EMN++L++ FT  E+E+++KQM P KAPGPD  P IFYQKYWHL+G+      L  LN  + L+  N T+I LIPK   P+EV E
Subjt:  NINLALNGLNPRVSSEMNNKLLAPFTKEEIELSIKQMFPTKAPGPDGYPAIFYQKYWHLVGKKTIQECLEILNGDKSLECWNKTNIVLIPKTTCPKEVSE

Query:  YRPISLCNVNYKIITKTVANRLKWVLKDIISEAQSAFIQGRLITDNVIIGHECINSIKNSRHTKMDMAAFKIDLSKAYDRVEWAFLKGIMLKLGFDYRWV
        +RPISLCNV YK+I+K +ANRLK +L  I+ E+QSAFI GRLITDN+++  E ++ +++ +  K    A K+D+SKAYDRVEW +LKG+M K+GF  +WV
Subjt:  YRPISLCNVNYKIITKTVANRLKWVLKDIISEAQSAFIQGRLITDNVIIGHECINSIKNSRHTKMDMAAFKIDLSKAYDRVEWAFLKGIMLKLGFDYRWV

Query:  NLVLKCISSPSFSILLNGEPKGNFNSSRGLRQGDPLSPYLFLLVAEGLSHLISSANEEGRVTGL-VCANGPRISHLLFADDSIIFCKAAEQELVVLKNLL
         L+++CIS+ S+SIL+NGEP G    SRGLRQGDPLSPYLFLL AEGL  LI      G + G+ +  +GP+I+HL FADDS++FCKA   +++ ++ +L
Subjt:  NLVLKCISSPSFSILLNGEPKGNFNSSRGLRQGDPLSPYLFLLVAEGLSHLISSANEEGRVTGL-VCANGPRISHLLFADDSIIFCKAAEQELVVLKNLL

Query:  KSYELASGECINLAKSAILLPNKVNLDRKAFLSSILGVNHVEDFGKYLGFPSVFSRSKSKDLAYVLDKIWKATQGWKK
          YE ASG+ +N  K+ +       L  +  + ++LGV  ++ + +YLG PS   R+K    A + +++W   +GWK+
Subjt:  KSYELASGECINLAKSAILLPNKVNLDRKAFLSSILGVNHVEDFGKYLGFPSVFSRSKSKDLAYVLDKIWKATQGWKK

A0A2N9HB41 Uncharacterized protein9.3e-19634.8Show/hide
Query:  EELVNEWKIFNLMEKEKENHIFLNEDDWKEMTDNLDHCIIGKLLSSRIISPIAIKNALKGAWKTRKNFNVETTGRNIFNFKFESQGDRDWVLNNGPWTFD
        EEL    +   L E EK N I L     ++        ++ +LL+SR  +  A K  +K  W +     V     N+F   F    D + V+   PWTFD
Subjt:  EELVNEWKIFNLMEKEKENHIFLNEDDWKEMTDNLDHCIIGKLLSSRIISPIAIKNALKGAWKTRKNFNVETTGRNIFNFKFESQGDRDWVLNNGPWTFD

Query:  KYLIVLELPTANQRSVEMGFKKANFWIRLINLPTGYRNRKVIKKIGNSIGEFLEVEEGREEISWGKSIRIRVKIDITTPLLRGFMLKAPEVQGNCWVTIR
        K LI +    A+ +   + F  + FWIR+ NLP     ++V + IGNSIG  +EV+     I WG+ +RIRV+ID+T PLLRG +L++ E     WV  R
Subjt:  KYLIVLELPTANQRSVEMGFKKANFWIRLINLPTGYRNRKVIKKIGNSIGEFLEVEEGREEISWGKSIRIRVKIDITTPLLRGFMLKAPEVQGNCWVTIR

Query:  YERLPEFCFLCGRIGHIAKEC-ECKRGEEEKNVDNMEFGNWLRFQSY-----FRPNKKTESPRK--PNSPSMDRGNEFED--ELGDK--VASEN------
        YE LP FC+ CGRIGH   EC E +R   +++V +  FG+WLR          R N+ T    +     PS     EFE   + G+   V+SEN      
Subjt:  YERLPEFCFLCGRIGHIAKEC-ECKRGEEEKNVDNMEFGNWLRFQSY-----FRPNKKTESPRK--PNSPSMDRGNEFED--ELGDK--VASEN------

Query:  ------------RSRKGKEKVVVDHSPD-EDIVESRDKRNE----GTSSGTYVLNTIASMDTDLLVIPKSDKDLILNQT------DENQV----DGGEQS
                      R   + + V  +P+   ++E+ +  N+       +G    N +++   +L+   +   ++ L Q        +NQV    D   Q 
Subjt:  ------------RSRKGKEKVVVDHSPD-EDIVESRDKRNE----GTSSGTYVLNTIASMDTDLLVIPKSDKDLILNQT------DENQV----DGGEQS

Query:  IRSSTILKDYSVDNT--------NEGNMEEEETHSGNSKATKRRAS-------------WKRKERAG------------------------VTGRIQKE-
        I +   +     DNT        N  N+ +      N+K  K   S             WK++ R                          + G ++++ 
Subjt:  IRSSTILKDYSVDNT--------NEGNMEEEETHSGNSKATKRRAS-------------WKRKERAG------------------------VTGRIQKE-

Query:  EPSNLKRKAVEDNNIQNKKFKSDDSGAMDVSGD----------------------------DINSRGAKGGICLLWKDNNLVSINSYSDNHIDS-TVNWE
         P ++K  ++    + N+   ++  G + +                                ++ R   GG+ L W  +  + + SYS  HID+  +  +
Subjt:  EPSNLKRKAVEDNNIQNKKFKSDDSGAMDVSGD----------------------------DINSRGAKGGICLLWKDNNLVSINSYSDNHIDS-TVNWE

Query:  GITWRFTGLYGFPEASKKHLTWDLIRRLDSLLDLLWLIGGDLNEIMEDSEKLGGPRRDRKYLEDFRSVIDDCNLRDLKPNGELFTWHSNRRGVH-VWERL
        G+ WRFTG YG PE + +H +W L+RRL  + +L WL+ GD NEI+   EK G   R  + +  FR  + DC+L DL   G  FTW +NR     V  RL
Subjt:  GITWRFTGLYGFPEASKKHLTWDLIRRLDSLLDLLWLIGGDLNEIMEDSEKLGGPRRDRKYLEDFRSVIDDCNLRDLKPNGELFTWHSNRRGVH-VWERL

Query:  DRFLCNNSFDSLFNFAGPKNLDWLFSDHRPIEVSLNNHQTTTTSRKGRSFKFEELWTNYEACEELIASHGEWEGSNLSLTSLPMDLKNCSIALSKWGRET
        DR + +  + +LF  A  K++    SDH  + + L         +K R F+F+  W + E CEE+IA       S   +  L   +K C I L KW +  
Subjt:  DRFLCNNSFDSLFNFAGPKNLDWLFSDHRPIEVSLNNHQTTTTSRKGRSFKFEELWTNYEACEELIASHGEWEGSNLSLTSLPMDLKNCSIALSKWGRET

Query:  HNRRKNKIKECKNSLRE-AYKNLHLIDFDAIHYIEFELDKLLEEEEIYWKQRSREEWLKWGDKNSKWFHKKASIRKKTNEITGIHNSEGTWVDDPKEIED
               I+  K  L+E   ++        ++ +  EL  L+ +EEI+W+QRSR  WLK GD NS++FH+ AS R+KTN + G+H+S G W  DP  +E 
Subjt:  HNRRKNKIKECKNSLRE-AYKNLHLIDFDAIHYIEFELDKLLEEEEIYWKQRSREEWLKWGDKNSKWFHKKASIRKKTNEITGIHNSEGTWVDDPKEIED

Query:  VFYSYFSNLFTSSQPDPNNINLALNGLNPRVSSEMNNKLLAPFTKEEIELSIKQMFPTKAPGPDGYPAIFYQKYWHLVGKKTIQECLEILNGDKSLECWN
        +   YF NLF SS   P+ IN     ++ RVS EMN  LLAPF+ EE+  ++ Q+ P+KAPGPDG  A+F+Q+YW++VG       L+ LN  + L   N
Subjt:  VFYSYFSNLFTSSQPDPNNINLALNGLNPRVSSEMNNKLLAPFTKEEIELSIKQMFPTKAPGPDGYPAIFYQKYWHLVGKKTIQECLEILNGDKSLECWN

Query:  KTNIVLIPKTTCPKEVSEYRPISLCNVNYKIITKTVANRLKWVLKDIISEAQSAFIQGRLITDNVIIGHECINSIKNSRHTKMDMAAFKIDLSKAYDRVE
         TNIVLIPK   P  +S +RPISLCNV YKII+K + NR+K +L  IIS+ QSAF+ GR+ITDN+II  E ++ +KN +  ++   A K+D+SKAYDRVE
Subjt:  KTNIVLIPKTTCPKEVSEYRPISLCNVNYKIITKTVANRLKWVLKDIISEAQSAFIQGRLITDNVIIGHECINSIKNSRHTKMDMAAFKIDLSKAYDRVE

Query:  WAFLKGIMLKLGFDYRWVNLVLKCISSPSFSILLNGEPKGNFNSSRGLRQGDPLSPYLFLLVAEGLSHLISSANEEGRVTGL-VCANGPRISHLLFADDS
        W +L+ I+LKLGF  RWV+L++ C+SS ++S+++NGE KG     RGLRQGDPLSPYLFL+ AEGLS LI  A  +  + G+ +C  GPR+SHL FADDS
Subjt:  WAFLKGIMLKLGFDYRWVNLVLKCISSPSFSILLNGEPKGNFNSSRGLRQGDPLSPYLFLLVAEGLSHLISSANEEGRVTGL-VCANGPRISHLLFADDS

Query:  IIFCKAAEQELVVLKNLLKSYELASGECINLAKSAILLPNKVNLDRKAFLSSILGVNHVEDFGKYLGFPSVFSRSKSKDLAYVLDKIWKATQGWKKVFL
        IIFC A   E   L  LL +YE ASG+ IN +K+A+   +    D +  +  + G      F KYLG P V  R+K +    + D++ +  QGWK+  L
Subjt:  IIFCKAAEQELVVLKNLLKSYELASGECINLAKSAILLPNKVNLDRKAFLSSILGVNHVEDFGKYLGFPSVFSRSKSKDLAYVLDKIWKATQGWKKVFL

A0A2N9HYE3 Reverse transcriptase domain-containing protein2.4e-19633.15Show/hide
Query:  IEELVNEWKIFNLMEKEKENHIFLNEDDWKEMTDNLDHCIIGKLLSSRIISPIAIKNALKGAWKTRKNFNVETTGRNIFNFKFESQGDRDWVLNNGPWTF
        +E L   W+ F+L E E +      + D    T      +  K L+ R ++  A+    K  W+T   F++     N   F F+ + DR+ V+   PW +
Subjt:  IEELVNEWKIFNLMEKEKENHIFLNEDDWKEMTDNLDHCIIGKLLSSRIISPIAIKNALKGAWKTRKNFNVETTGRNIFNFKFESQGDRDWVLNNGPWTF

Query:  DKYLIVLELPTANQRSVEMGFKKANFWIRLINLPTGYRNRKVIKKIGNSIGEFLEVEEGREEISWGKSIRIRVKIDITTPLLRGFMLKAPEVQGNCWVTI
        DKYL++L+    ++   E+ F   +FW++L  +P    N +V + +G+S+G+   V  G   +  G+++RIRV IDIT PL RG   K  E     W++ 
Subjt:  DKYLIVLELPTANQRSVEMGFKKANFWIRLINLPTGYRNRKVIKKIGNSIGEFLEVEEGREEISWGKSIRIRVKIDITTPLLRGFMLKAPEVQGNCWVTI

Query:  RYERLPEFCFLCGRIGHIAKEC-ECKRGEEEKNVDNMEFGNWLRFQSYFRPNKKTE----------SPRKPNSPSMDRGNEFEDELGDKVASENRSRKGK
        +YERLP FC+ CG + H  K+C    R +E   +++ +FG+WLR  S  RP +K E           P++P  P     +           +++ +    
Subjt:  RYERLPEFCFLCGRIGHIAKEC-ECKRGEEEKNVDNMEFGNWLRFQSYFRPNKKTE----------SPRKPNSPSMDRGNEFEDELGDKVASENRSRKGK

Query:  EKVVVDHS-----PDEDIVESRDKRNEGTSSGTYVLNTIA-SMDT---------------DLLVIPK-----SDKDLILNQTDENQVDGGEQSIRS----
           ++  S     P      +    N+ T+     +  +A S  T               ++ + P+      D +  L   D N ++    S+ S    
Subjt:  EKVVVDHS-----PDEDIVESRDKRNEGTSSGTYVLNTIA-SMDT---------------DLLVIPK-----SDKDLILNQTDENQVDGGEQSIRS----

Query:  -----STILKDYSVDNTNEGNM-EEEETHSGNSKATKRRASWKRKERAGVTGRIQKEEPSNL--------------------------------KRKAVE
             ST+           G++  +   H G+ +    + +WK+  R          EP  +                                +    +
Subjt:  -----STILKDYSVDNTNEGNM-EEEETHSGNSKATKRRASWKRKERAGVTGRIQKEEPSNL--------------------------------KRKAVE

Query:  DNNIQNKKFKSDDSGAMDV--------SGDDINSRGAKGGICLLWKDNNLVSINSYSDNHIDSTVN-WEGITWRFTGLYGFPEASKKHLTWDLIRRLDSL
        D ++        D G ++         S   +N R   GG+CL WK +  +S+ S+S +HID+ VN  +  TWRFTG YG PE  K+  +WDL+RRL++ 
Subjt:  DNNIQNKKFKSDDSGAMDV--------SGDDINSRGAKGGICLLWKDNNLVSINSYSDNHIDSTVN-WEGITWRFTGLYGFPEASKKHLTWDLIRRLDSL

Query:  LDLLWLIGGDLNEIMEDSEKLGGPRRDRKYLEDFRSVIDDCNLRDLKPNGELFTWHSNRRGVHVWERLDRFLCNNSFDSLFNFAGPKNLDWLFSDHRPIE
        L L W   GD NE++   EK G   R    ++ FR V+D+C   DL   G  FTW +NR G   WERLDR +    +   F  A   +L+  +SDH+PI 
Subjt:  LDLLWLIGGDLNEIMEDSEKLGGPRRDRKYLEDFRSVIDDCNLRDLKPNGELFTWHSNRRGVHVWERLDRFLCNNSFDSLFNFAGPKNLDWLFSDHRPIE

Query:  VSLNNHQTTTTSRKGRSFKFEELWTNYEACEELIASHGEWEGSNLSLTSLPMDLKNCSIALSKWGRETHNRRKNKIKECKNSLREAYKN-LHLIDFDAIH
        VS      T    K + F+FEE+WT+ + CE +I    + + + + + ++   +  C   L  W R T     ++IKE +  L+ A +N +   D   ++
Subjt:  VSLNNHQTTTTSRKGRSFKFEELWTNYEACEELIASHGEWEGSNLSLTSLPMDLKNCSIALSKWGRETHNRRKNKIKECKNSLREAYKN-LHLIDFDAIH

Query:  YIEFELDKLLEEEEIYWKQRSREEWLKWGDKNSKWFHKKASIRKKTNEITGIHNSEGTWVDDPKEIEDVFYSYFSNLFTSSQPDPNNINLALNGLNPRVS
         ++ EL  LL +EE  W+QRSR EWL  GD+N+++FH +A+ RK+ N +T +   +G W     ++  +F  Y+ +LF ++ PD   +   +  +   V+
Subjt:  YIEFELDKLLEEEEIYWKQRSREEWLKWGDKNSKWFHKKASIRKKTNEITGIHNSEGTWVDDPKEIEDVFYSYFSNLFTSSQPDPNNINLALNGLNPRVS

Query:  SEMNNKLLAPFTKEEIELSIKQMFPTKAPGPDGYPAIFYQKYWHLVGKKTIQECLEILNGDKSLECWNKTNIVLIPKTTCPKEVSEYRPISLCNVNYKII
        +EMN++L++ FT  E+E+++KQM P KAPGPD  P IFYQKYWHL+G+      L  LN  + L+  N T+I LIPK   P+EV E+RPISLCNV YK+I
Subjt:  SEMNNKLLAPFTKEEIELSIKQMFPTKAPGPDGYPAIFYQKYWHLVGKKTIQECLEILNGDKSLECWNKTNIVLIPKTTCPKEVSEYRPISLCNVNYKII

Query:  TKTVANRLKWVLKDIISEAQSAFIQGRLITDNVIIGHECINSIKNSRHTKMDMAAFKIDLSKAYDRVEWAFLKGIMLKLGFDYRWVNLVLKCISSPSFSI
        +K +ANRLK +L  I+ E+QSAFI GRLITDN+++  E ++ +++ +  K    A K+D+SKAYDRVEW +LKG+M K+GF  +WV L+++CIS+ S+SI
Subjt:  TKTVANRLKWVLKDIISEAQSAFIQGRLITDNVIIGHECINSIKNSRHTKMDMAAFKIDLSKAYDRVEWAFLKGIMLKLGFDYRWVNLVLKCISSPSFSI

Query:  LLNGEPKGNFNSSRGLRQGDPLSPYLFLLVAEGLSHLISSANEEGRVTGL-VCANGPRISHLLFADDSIIFCKAAEQELVVLKNLLKSYELASGECINLA
        L+NGEP G    SRGLRQGDPLSPYLFLL AEGL  LI      G + G+ +  +GP+I+HL FADDS++FCKA   +++ ++ +L  YE ASG+ +N  
Subjt:  LLNGEPKGNFNSSRGLRQGDPLSPYLFLLVAEGLSHLISSANEEGRVTGL-VCANGPRISHLLFADDSIIFCKAAEQELVVLKNLLKSYELASGECINLA

Query:  KSAILLPNKVNLDRKAFLSSILGVNHVEDFGKYLGFPSVFSRSKSKDLAYVLDKIWKATQGWKK
        K+ +       L  +  + ++LGV  ++ + +YLG PS   R+K    A + +++W   +GWK+
Subjt:  KSAILLPNKVNLDRKAFLSSILGVNHVEDFGKYLGFPSVFSRSKSKDLAYVLDKIWKATQGWKK

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein6.2e-4025.83Show/hide
Query:  NSRGAKGGICLLWKDN---NLVSINSYSDNH---IDSTVNWEGITWRFTGLY----GFPEASKKHLTWDLIRRLDSLLDLLWLIGGDLN---EIMEDSEK
        N +  K G+ +L  D        I    + H   +  ++  E +T     +Y    G P   K+ L+ DL R LDS      LI GD N    I++ S +
Subjt:  NSRGAKGGICLLWKDN---NLVSINSYSDNH---IDSTVNWEGITWRFTGLY----GFPEASKKHLTWDLIRRLDSLLDLLWLIGGDLN---EIMEDSEK

Query:  LGGPRRDRKYLEDFRSVIDDCNL----RDLKPNGELFTWHSNRRGVHVWERLDRFLCNNSFDSLFNFAGPKNLDWLFSDHRPIEVSL---NNHQTTTTSR
            ++  K  ++  S +   +L    R L P    +T+ S     H + ++D  + + +   L      + +    SDH  I++ L   N  Q+ +T+ 
Subjt:  LGGPRRDRKYLEDFRSVIDDCNL----RDLKPNGELFTWHSNRRGVHVWERLDRFLCNNSFDSLFNFAGPKNLDWLFSDHRPIEVSL---NNHQTTTTSR

Query:  KGRSFKFEELWTNYEACEELIASHGEWEGSNLSLTSLPMDLKNCS----IALSKWGRETHNRRKNKIKECKNSLREAYK----NLHLIDFDAIHYIEFEL
        K  +    + W + E   E+       E  + +  +L    K       IAL+ + R+   + ++KI    + L+E  K    +        I  I  EL
Subjt:  KGRSFKFEELWTNYEACEELIASHGEWEGSNLSLTSLPMDLKNCS----IALSKWGRETHNRRKNKIKECKNSLREAYK----NLHLIDFDAIHYIEFEL

Query:  DKLLEEEEIYWKQRSREEWLKWGDKNSKWFHKKASIRKKTNEITGIHNSEGTWVDDPKEIEDVFYSYFSNLFTSSQPDPNNINLALNGLN-PRVSSEMNN
         ++  ++ +     SR  + +  +K  +   +    +++ N+I  I N +G    DP EI+     Y+ +L+ +   +   ++  L+    PR++ E   
Subjt:  DKLLEEEEIYWKQRSREEWLKWGDKNSKWFHKKASIRKKTNEITGIHNSEGTWVDDPKEIEDVFYSYFSNLFTSSQPDPNNINLALNGLN-PRVSSEMNN

Query:  KLLAPFTKEEIELSIKQMFPTKAPGPDGYPAIFYQKYWHLVGKKTIQECLEILNGDKSLECWNKTNIVLIPK---TTCPKEVSEYRPISLCNVNYKIITK
         L  P T  EI   I  +   K+PGPDG+ A FYQ+Y   +    ++    I         + + +I+LIPK    T  KE   +RPISL N++ KI+ K
Subjt:  KLLAPFTKEEIELSIKQMFPTKAPGPDGYPAIFYQKYWHLVGKKTIQECLEILNGDKSLECWNKTNIVLIPK---TTCPKEVSEYRPISLCNVNYKIITK

Query:  TVANRLKWVLKDIISEAQSAFIQGRLITDNV-----IIGHECINSIKNSRHTKMDMAAFKIDLSKAYDRVEWAFLKGIMLKLGFDYRWVNLVLKCISSPS
         +ANR++  +K +I   Q  FI G     N+     +I H  IN  K+  H         ID  KA+D+++  F+   + KLG D  ++ ++      P+
Subjt:  TVANRLKWVLKDIISEAQSAFIQGRLITDNV-----IIGHECINSIKNSRHTKMDMAAFKIDLSKAYDRVEWAFLKGIMLKLGFDYRWVNLVLKCISSPS

Query:  FSILLNGEPKGNFNSSRGLRQGDPLSPYLFLLVAEGLSHLISSANEEGRVTGLVCANGPRISHLLFADDSIIFCKAAEQELVVLKNLLK---SYELASGE
         +I+LNG+    F    G RQG PLSP LF +V E L+  I    +E  + G+       +   LFADD I++    E  +V  +NLLK   ++   SG 
Subjt:  FSILLNGEPKGNFNSSRGLRQGDPLSPYLFLLVAEGLSHLISSANEEGRVTGLVCANGPRISHLLFADDSIIFCKAAEQELVVLKNLLK---SYELASGE

Query:  CINLAKSAILLPNKVNLDRKAFLSSILGVNHVEDFGKYLGFPSVFSRSKSKDL-----AYVLDKIWKATQGWKKVFLLYSWWMNIN
         IN+ KS   L N    + +   S I+G        K + +  +      KDL       +L +I + T  WK +    SW   IN
Subjt:  CINLAKSAILLPNKVNLDRKAFLSSILGVNHVEDFGKYLGFPSVFSRSKSKDL-----AYVLDKIWKATQGWKKVFLLYSWWMNIN

P08548 LINE-1 reverse transcriptase homolog2.9e-4526.15Show/hide
Query:  NSRGAKGGICLLWKDN---NLVSINSYSDNH---IDSTVNWEGITWRFTGLYGFPEASKKHLTWDLIRRLDSLLDLLWLIGGDLN---EIMEDSEKLGGP
        N +  K GI +L+ D        I    D H   +     ++ I+     +Y  P  +      + +  + +L+    ++ GD N    +++ S K    
Subjt:  NSRGAKGGICLLWKDN---NLVSINSYSDNH---IDSTVNWEGITWRFTGLYGFPEASKKHLTWDLIRRLDSLLDLLWLIGGDLN---EIMEDSEKLGGP

Query:  RRDRKYLEDFRSVIDDCNLRDL----KPNGELFTWHSNRRGVHVWERLDRFLCNNSFDSLFNFAGPKNLDWLFSDHRPIEVSLNNHQ---TTTTSRKGRS
        ++  K + D  S I   +L D+     PN   +T+ S+  G   + ++D  L + S  +L  F   + +  +FSDH  I+V LNN++   T T + K  +
Subjt:  RRDRKYLEDFRSVIDDCNLRDL----KPNGELFTWHSNRRGVHVWERLDRFLCNNSFDSLFNFAGPKNLDWLFSDHRPIEVSLNNHQ---TTTTSRKGRS

Query:  FKFEELWTNYEACEELIA-----SHGEWEGSNLSLTSLPMDLKNCSIALSKWGRETHNRRKNKIK-ECKNSLREAYKNLHLIDFDAIHYIEFELDKLLEE
           ++ W   E  +E+       ++ +    NL  T+  + L+   IAL  + ++T     N +    K   +E + N        I  I  EL+++  +
Subjt:  FKFEELWTNYEACEELIA-----SHGEWEGSNLSLTSLPMDLKNCSIALSKWGRETHNRRKNKIK-ECKNSLREAYKNLHLIDFDAIHYIEFELDKLLEE

Query:  EEIYWKQRSREEWLKWGDKNSKWFHKKASIRKKTNEITGIHNSEGTWVDDPKEIEDVFYSYFSNLFTSSQPDPNNINLALNGLN-PRVSSEMNNKLLAPF
          I    +S+  + +  +K  K        ++  + I+ I N       DP EI+ +   Y+  L++    +   I+  L   + PR+S +    L  P 
Subjt:  EEIYWKQRSREEWLKWGDKNSKWFHKKASIRKKTNEITGIHNSEGTWVDDPKEIEDVFYSYFSNLFTSSQPDPNNINLALNGLN-PRVSSEMNNKLLAPF

Query:  TKEEIELSIKQMFPTKAPGPDGYPAIFYQKYWHLVGKKTIQECLEILNGDKSLECWNKTNIVLIPKT-TCPKEVSEYRPISLCNVNYKIITKTVANRLKW
        +  EI  +I+ +   K+PGPDG+ + FYQ +   +    +     I         + + NI LIPK    P     YRPISL N++ KI+ K + NR++ 
Subjt:  TKEEIELSIKQMFPTKAPGPDGYPAIFYQKYWHLVGKKTIQECLEILNGDKSLECWNKTNIVLIPKT-TCPKEVSEYRPISLCNVNYKIITKTVANRLKW

Query:  VLKDIISEAQSAFIQGRLITDNV-----IIGHECINSIKNSRHTKMDMAAFKIDLSKAYDRVEWAFLKGIMLKLGFDYRWVNLVLKCISSPSFSILLNGE
         +K II   Q  FI G     N+     +I H  IN +KN  H         ID  KA+D ++  F+   + K+G +  ++ L+    S P+ +I+LNG 
Subjt:  VLKDIISEAQSAFIQGRLITDNV-----IIGHECINSIKNSRHTKMDMAAFKIDLSKAYDRVEWAFLKGIMLKLGFDYRWVNLVLKCISSPSFSILLNGE

Query:  PKGNFNSSRGLRQGDPLSPYLFLLVAEGLSHLISSANEEGRVTGLVCANGPRISHLLFADDSIIFCKAAEQELVVLKNLLKSYELASGECINLAKSAILL
           +F    G RQG PLSP LF +V E L+  I    EE  + G +      I   LFADD I++ +        L  ++K Y   SG  IN  KS   +
Subjt:  PKGNFNSSRGLRQGDPLSPYLFLLVAEGLSHLISSANEEGRVTGLVCANGPRISHLLFADDSIIFCKAAEQELVVLKNLLKSYELASGECINLAKSAILL

Query:  PNKVNLDRKAFLSSILGVNHVEDFGKYLGFPSVFSRSKSKDL
            N   K    SI     V    KYLG   V+     KDL
Subjt:  PNKVNLDRKAFLSSILGVNHVEDFGKYLGFPSVFSRSKSKDL

P11369 LINE-1 retrotransposable element ORF2 protein1.9e-3328.33Show/hide
Query:  ITGIHNSEGTWVDDPKEIEDVFYSYFSNLFTSSQPDPNNINLALNGLN-PRVSSEMNNKLLAPFTKEEIELSIKQMFPTKAPGPDGYPAIFYQKYWHLVG
        I  I N +G    DP+EI++   S++  L+++   + + ++  L+    P+++ +  + L +P + +EIE  I  +   K+PGPDG+ A FYQ +     
Subjt:  ITGIHNSEGTWVDDPKEIEDVFYSYFSNLFTSSQPDPNNINLALNGLN-PRVSSEMNNKLLAPFTKEEIELSIKQMFPTKAPGPDGYPAIFYQKYWHLVG

Query:  KKTIQECLEILNGDKSLE-----CWNKTNIVLIPK-TTCPKEVSEYRPISLCNVNYKIITKTVANRLKWVLKDIISEAQSAFIQGRL----ITDNVIIGH
        K+ +   L  L     +E      + +  I LIPK    P ++  +RPISL N++ KI+ K +ANR++  +K II   Q  FI G      I  ++ + H
Subjt:  KKTIQECLEILNGDKSLE-----CWNKTNIVLIPK-TTCPKEVSEYRPISLCNVNYKIITKTVANRLKWVLKDIISEAQSAFIQGRL----ITDNVIIGH

Query:  ECINSIKNSRHTKMDMAAFKIDLSKAYDRVEWAFLKGIMLKLGFDYRWVNLVLKCISSPSFSILLNGEPKGNFNSSRGLRQGDPLSPYLFLLVAEGLSHL
          IN +K+  H         +D  KA+D+++  F+  ++ + G    ++N++    S P  +I +NGE         G RQG PLSPYLF +V E L+  
Subjt:  ECINSIKNSRHTKMDMAAFKIDLSKAYDRVEWAFLKGIMLKLGFDYRWVNLVLKCISSPSFSILLNGEPKGNFNSSRGLRQGDPLSPYLFLLVAEGLSHL

Query:  ISSANEEGRVTGLVCANGPRISHLLFADDSIIFCKAAEQELVVLKNLLKSYELASGECINLAKSAILLPNKVNLDRKAFLSSILGVNHVEDFGKYLGFPS
        I    E   + G+       +   L ADD I++    +     L NL+ S+    G  IN  KS   L  K N   +  +      + V +  KYLG   
Subjt:  ISSANEEGRVTGLVCANGPRISHLLFADDSIIFCKAAEQELVVLKNLLKSYELASGECINLAKSAILLPNKVNLDRKAFLSSILGVNHVEDFGKYLGFPS

Query:  VFSRSKSKDLAYVLDKIWKA
            + +K++  + DK +K+
Subjt:  VFSRSKSKDLAYVLDKIWKA

P14381 Transposon TX1 uncharacterized 149 kDa protein5.8e-3824.68Show/hide
Query:  LIGGDLNEIMEDSEKLGGPRRD------RKYLEDFRSVIDDCNLRDLKPNGELFTWHSNRRGVHVWERLDRFLCNNSFDSLFNFAGPKNLDWLFSDHRPI
        +IGGD N  ++  ++    +RD      R+ +  F S++D    R+  P    FT+   R G     R+DR   ++   S    +  +     FSDH  +
Subjt:  LIGGDLNEIMEDSEKLGGPRRD------RKYLEDFRSVIDDCNLRDLKPNGELFTWHSNRRGVHVWERLDRFLCNNSFDSLFNFAGPKNLDWLFSDHRPI

Query:  EVSLN-----------NHQTTTTSRKGRSFKFEELWTNYEACEELIASHGEWEGSNLSLTSLPMDLKNCSIALSKWGRETHNRRKNKIKECKNSLREAYK
         + ++           +   +    +G +    + W  + A ++  A+  +W           +   +  +   ++ +    +R  +I+     + +  +
Subjt:  EVSLN-----------NHQTTTTSRKGRSFKFEELWTNYEACEELIASHGEWEGSNLSLTSLPMDLKNCSIALSKWGRETHNRRKNKIKECKNSLREAYK

Query:  NLHLIDFDAIHYIEFELDKLLEEEEIYWKQ----RSREEWLKWGDKNSKWFHKKASIRKKTNEITGIHNSEGTWVDDPKEIEDVFYSYFSNLFTSSQPDP
         L   +  A+     E  + L   E    +    RSR + L   D+ S++F+     +    +IT +   +GT ++DP+ I D   S++ NLF+     P
Subjt:  NLHLIDFDAIHYIEFELDKLLEEEEIYWKQ----RSREEWLKWGDKNSKWFHKKASIRKKTNEITGIHNSEGTWVDDPKEIEDVFYSYFSNLFTSSQPDP

Query:  NNINLALNGLNPRVSSEMNNKLLAPFTKEEIELSIKQMFPTKAPGPDGYPAIFYQKYWHLVGKKTIQECLEIL-NGDKSLECWNKTNIVLIPKTTCPKEV
        +      +GL P VS     +L  P T +E+  +++ M   K+PG DG    F+Q +W  +G    +   E    G+  L C  +  + L+PK    + +
Subjt:  NNINLALNGLNPRVSSEMNNKLLAPFTKEEIELSIKQMFPTKAPGPDGYPAIFYQKYWHLVGKKTIQECLEIL-NGDKSLECWNKTNIVLIPKTTCPKEV

Query:  SEYRPISLCNVNYKIITKTVANRLKWVLKDIISEAQSAFIQGRLITDNVIIGHECINSIKNSRHTKMDMAAFKIDLSKAYDRVEWAFLKGIMLKLGFDYR
          +RP+SL + +YKI+ K ++ RLK VL ++I   QS  + GR I DNV +  + ++    +R T + +A   +D  KA+DRV+  +L G +    F  +
Subjt:  SEYRPISLCNVNYKIITKTVANRLKWVLKDIISEAQSAFIQGRLITDNVIIGHECINSIKNSRHTKMDMAAFKIDLSKAYDRVEWAFLKGIMLKLGFDYR

Query:  WVNLVLKCISSPSFSILLNGEPKGNFNSSRGLRQGDPLSPYLFLLVAEGLSHLISSANEEGRVTGLVCAN-GPRISHLLFADDSIIFCKAAEQELVVLKN
        +V  +    +S    + +N          RG+RQG PLS  L+ L  E    L+       R+TGLV      R+    +ADD I+      Q+LV L+ 
Subjt:  WVNLVLKCISSPSFSILLNGEPKGNFNSSRGLRQGDPLSPYLFLLVAEGLSHLISSANEEGRVTGLVCAN-GPRISHLLFADDSIIFCKAAEQELVVLKN

Query:  LLKS---YELASGECINLAKSAILLPNKVNLD
          +    Y  AS   IN +KS+ LL   + +D
Subjt:  LLKS---YELASGECINLAKSAILLPNKVNLD

P92555 Uncharacterized mitochondrial protein AtMg012509.1e-1560.29Show/hide
Query:  LLNGEPKGNFNSSRGLRQGDPLSPYLFLLVAEGLSHLISSANEEGRVTGL-VCANGPRISHLLFADDS
        ++NG P+G    SRGLRQGDPLSPYLF+L  E LS L   A E+GR+ G+ V  N PRI+HLLFADD+
Subjt:  LLNGEPKGNFNSSRGLRQGDPLSPYLFLLVAEGLSHLISSANEEGRVTGL-VCANGPRISHLLFADDS

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein2.4e-3125.53Show/hide
Query:  LVSINSYSDNHIDSTVNWEGITWRFTGLYGFPEASKKHLTWD------LIRRLDSLLDLLWLIGGDLNEIMEDSEKLGGPRRD--RKYLEDFRSVIDDCN
        LVS  S S+++  + ++    +WR    Y   E  +  + WD      + ++ D L+ L+    GD ++I   S+     +     + LE+F++ + D +
Subjt:  LVSINSYSDNHIDSTVNWEGITWRFTGLYGFPEASKKHLTWD------LIRRLDSLLDLLWLIGGDLNEIMEDSEKLGGPRRD--RKYLEDFRSVIDDCN

Query:  LRDLKPNGELFTWHSNRRGVHVWERLDRFLCNNSFDSLFNFAGPKNLDWLFSDHRPIEVSLNNHQTTTTSRKGRSFKFEELWTNYEACEELIASHGEWE-
        L D+   G  +TW +++    +  +LDR + N  + S F  A         SDH P  + L N       R  + F++    + +     L++    WE 
Subjt:  LRDLKPNGELFTWHSNRRGVHVWERLDRFLCNNSFDSLFNFAGPKNLDWLFSDHRPIEVSLNNHQTTTTSRKGRSFKFEELWTNYEACEELIASHGEWE-

Query:  ----GSNLSLTSLPMDLKNCSIALSKWGRETHNRRKNKIKECKNSLREAYKNLHLIDFDAIHYIEFELDK----LLEEEEIYWKQRSREEWLKWGDKNSK
            GS+  + SL   LK          R+     ++K KE  +SL      L     D++  +E    K         E +++Q+SR +WL+ GD N++
Subjt:  ----GSNLSLTSLPMDLKNCSIALSKWGRETHNRRKNKIKECKNSLREAYKNLHLIDFDAIHYIEFELDK----LLEEEEIYWKQRSREEWLKWGDKNSK

Query:  WFHKKASIRKKTNEITGIHNSEGTWVDDPKEIEDVFYSYFSNLFTSSQP--DPNNINLALNGLNP-RVSSEMNNKLLAPFTKEEIELSIKQMFPTKAPGP
        +FHK     +  N I  +   +   V++  +++++  +Y+++L  S      P+++   +  ++P R +  + ++L A  + +EI  ++  M   KAPGP
Subjt:  WFHKKASIRKKTNEITGIHNSEGTWVDDPKEIEDVFYSYFSNLFTSSQP--DPNNINLALNGLNP-RVSSEMNNKLLAPFTKEEIELSIKQMFPTKAPGP

Query:  DGYPAIFYQKYWHLVGKKTIQECLEILNGDKSLECWNKTNIVLIPKTTCPKEVSEYRPISLCNVNYKIIT
        D + A F+ + W +V   TI    E       L+ +N T I LIPK T   ++S +RP+S C V YKIIT
Subjt:  DGYPAIFYQKYWHLVGKKTIQECLEILNGDKSLECWNKTNIVLIPKTTCPKEVSEYRPISLCNVNYKIIT

AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein2.6e-1731.53Show/hide
Query:  LVAVACWAIWSDRNKII---HEEAIPPVNKRSVWIQEYLAKYIEASSRREAMHASVTRDSSPQSNRN----WIPPPSDVWKFNVDAAWSDSLPETGIGVV
        LV    W +W  RN+++    E   P V +R+      +  + E S+RRE       + S PQ  RN    W  PP    K N DA W    P  GIG +
Subjt:  LVAVACWAIWSDRNKII---HEEAIPPVNKRSVWIQEYLAKYIEASSRREAMHASVTRDSSPQSNRN----WIPPPSDVWKFNVDAAWSDSLPETGIGVV

Query:  CRGSNEKLIEASCNFLDIAMDAPGAELRAIMEGVIMAKSMGCKRVIIESDCQVAVNFLNKKSKTWCIVEGLVEAIWALGLCFDDIRFQYAQRDRNKVADR
         R  +  ++      L    +   AEL A+   V+       KR+I ESD Q  VN LN     W  ++  +E I  L   F++++F++  R  NKVADR
Subjt:  CRGSNEKLIEASCNFLDIAMDAPGAELRAIMEGVIMAKSMGCKRVIIESDCQVAVNFLNKKSKTWCIVEGLVEAIWALGLCFDDIRFQYAQRDRNKVADR

Query:  LAK
        +A+
Subjt:  LAK

AT3G31430.1 unknown protein3.5e-1430.19Show/hide
Query:  FNFKFESQGDRDWVLNNGPWTFDKYLIVLELPTANQRSVEMGFKKANFWIRLINLPTGYRNRKVIKKIGNSIGEFLEVEEGREEISWGKSIRIRVKIDIT
        F+F F  +   + VL  GPW F+ ++I+L+     +  + + F    FW+++  +P  + NR V++ IG ++G+ L+ +   E ++     R+ +  DIT
Subjt:  FNFKFESQGDRDWVLNNGPWTFDKYLIVLELPTANQRSVEMGFKKANFWIRLINLPTGYRNRKVIKKIGNSIGEFLEVEEGREEISWGKSIRIRVKIDIT

Query:  TPL--LRGFMLKAPEVQGNCWVTIRYERLPEFCFLCGRIGHIAKECECKRGEEEKNVDN
         PL   R F   A     N  +  RYERL  FC +CG + H    C  + G EE+  D+
Subjt:  TPL--LRGFMLKAPEVQGNCWVTIRYERLPEFCFLCGRIGHIAKECECKRGEEEKNVDN

AT3G42140.1 zinc ion binding;nucleic acid binding1.3e-1126.88Show/hide
Query:  KNFNVETTGRNI----FNFKFESQGDRDWVLNNGPWTFDKYLIVLELPTANQRSVEMGFKKANFWIRLINLPTGYRNRKVIKKIGNSIGEFLEVEEGREE
        KN + E  GR +      F F+S+     +L  GPW+F+ ++ V++  T      E  FK+  FWI++  +P  +   ++I  IG  +G FLE   GR  
Subjt:  KNFNVETTGRNI----FNFKFESQGDRDWVLNNGPWTFDKYLIVLELPTANQRSVEMGFKKANFWIRLINLPTGYRNRKVIKKIGNSIGEFLEVEEGREE

Query:  ISWGKSIRIRVKIDITTPLLRGFMLKAPEVQGNCWVTIRYERLPEFCFLCGRIGHIAKEC
                     D++                   +  +YE+L  FC  CG + H A EC
Subjt:  ISWGKSIRIRVKIDITTPLLRGFMLKAPEVQGNCWVTIRYERLPEFCFLCGRIGHIAKEC

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)6.5e-1660.29Show/hide
Query:  LLNGEPKGNFNSSRGLRQGDPLSPYLFLLVAEGLSHLISSANEEGRVTGL-VCANGPRISHLLFADDS
        ++NG P+G    SRGLRQGDPLSPYLF+L  E LS L   A E+GR+ G+ V  N PRI+HLLFADD+
Subjt:  LLNGEPKGNFNSSRGLRQGDPLSPYLFLLVAEGLSHLISSANEEGRVTGL-VCANGPRISHLLFADDS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACATTGAAGAACTGGTTAATGAATGGAAAATTTTCAACCTAATGGAGAAGGAGAAAGAGAACCATATTTTCTTAAACGAGGATGACTGGAAAGAAATGACGGATAA
CCTGGATCACTGCATAATTGGAAAACTCCTTTCAAGCCGCATAATTTCTCCGATCGCCATAAAGAATGCTCTGAAGGGCGCCTGGAAAACAAGAAAAAATTTTAATGTCG
AAACAACAGGTAGAAACATTTTTAACTTCAAATTCGAAAGTCAAGGAGATCGAGATTGGGTCCTGAATAATGGACCTTGGACTTTTGACAAGTACCTTATTGTTCTGGAA
CTTCCAACAGCAAATCAAAGATCTGTGGAGATGGGATTCAAGAAGGCAAATTTCTGGATAAGACTTATCAATTTGCCGACAGGATATAGAAACAGAAAGGTGATCAAGAA
GATTGGAAATAGCATTGGAGAGTTCTTGGAAGTAGAAGAAGGAAGGGAGGAAATTAGTTGGGGAAAGAGCATTCGAATTCGAGTCAAGATTGACATAACGACACCCCTGT
TGAGAGGGTTCATGCTAAAGGCTCCAGAAGTTCAAGGAAATTGTTGGGTGACAATCCGCTATGAAAGACTCCCAGAATTTTGTTTCTTGTGTGGTAGAATTGGGCATATA
GCTAAAGAATGCGAGTGCAAAAGGGGCGAAGAAGAGAAAAACGTCGACAATATGGAATTTGGGAACTGGTTAAGATTTCAAAGTTATTTTAGGCCAAATAAGAAGACAGA
GTCACCAAGGAAACCGAACAGCCCCAGTATGGATCGTGGTAATGAATTCGAGGATGAATTGGGAGATAAAGTGGCAAGTGAAAACAGGAGCAGAAAAGGTAAGGAAAAAG
TGGTTGTAGATCATTCACCTGATGAGGATATTGTGGAGAGTCGAGATAAAAGAAATGAGGGTACTTCCTCTGGGACCTATGTTTTGAACACAATAGCAAGTATGGATACA
GATCTTCTTGTGATACCGAAAAGTGATAAAGATTTAATTCTTAATCAAACAGATGAGAATCAGGTAGATGGAGGGGAGCAAAGCATAAGGTCCTCAACAATATTAAAGGA
CTATTCAGTTGACAATACAAATGAAGGCAATATGGAGGAAGAGGAAACACATTCTGGGAACTCAAAAGCAACCAAGAGAAGAGCTTCTTGGAAAAGAAAAGAAAGAGCAG
GGGTAACGGGCAGAATACAGAAAGAGGAACCATCCAATTTAAAGAGGAAAGCAGTAGAAGACAATAATATTCAGAACAAAAAATTTAAATCAGATGATTCTGGTGCAATG
GATGTGAGTGGCGATGATATTAACAGCAGAGGAGCCAAAGGTGGAATTTGCCTTCTGTGGAAAGACAACAACCTCGTGAGTATTAACTCATATTCAGACAATCACATTGA
CTCTACAGTTAACTGGGAAGGAATTACATGGAGATTCACAGGCCTTTATGGCTTCCCTGAAGCAAGTAAAAAACATCTAACATGGGATTTAATTAGAAGATTGGACAGCC
TTCTTGATTTGCTGTGGCTTATTGGAGGGGATCTAAACGAGATTATGGAGGACTCAGAAAAACTGGGGGGCCCAAGGAGAGACAGAAAGTATTTAGAAGATTTCAGAAGT
GTGATTGATGACTGCAATCTAAGAGACCTAAAGCCGAATGGAGAGTTATTCACCTGGCATAGTAACCGAAGAGGAGTCCATGTGTGGGAAAGATTGGATAGATTTTTGTG
TAATAATAGTTTTGATTCTTTGTTTAATTTTGCAGGACCAAAAAACTTGGATTGGCTATTTTCAGATCATAGGCCAATTGAAGTTAGTCTCAACAATCATCAGACTACTA
CTACATCAAGAAAGGGAAGATCTTTCAAATTTGAAGAGCTCTGGACCAATTATGAGGCATGCGAAGAGCTGATAGCTAGTCATGGAGAATGGGAGGGGAGTAATCTTTCT
TTAACTTCTTTACCTATGGATTTAAAAAATTGCTCTATTGCTTTATCTAAATGGGGTAGGGAGACTCATAATAGGAGGAAAAACAAAATTAAAGAATGCAAAAATTCCCT
AAGAGAAGCCTATAAAAATCTCCATTTAATTGACTTTGATGCTATCCACTATATTGAATTTGAACTCGATAAGCTCTTGGAAGAAGAAGAAATATATTGGAAACAAAGAT
CTAGGGAAGAGTGGCTTAAATGGGGGGACAAGAATTCTAAGTGGTTTCATAAAAAAGCCTCGATAAGGAAGAAAACCAATGAAATAACTGGAATTCATAATTCAGAAGGG
ACTTGGGTAGATGATCCTAAAGAAATAGAGGACGTATTTTATTCTTACTTTTCAAACCTGTTTACATCCTCCCAACCAGACCCAAACAATATTAATTTGGCTTTGAATGG
TTTAAATCCTAGAGTTAGCTCGGAGATGAATAACAAGTTATTGGCCCCGTTTACTAAAGAAGAAATTGAGCTGTCTATTAAACAAATGTTCCCTACAAAAGCTCCCGGTC
CGGATGGTTATCCAGCCATTTTTTACCAAAAATATTGGCACTTAGTGGGTAAGAAAACTATTCAGGAATGCTTAGAAATTTTAAATGGGGATAAAAGCTTAGAATGTTGG
AATAAAACTAATATAGTCTTAATCCCAAAAACCACTTGCCCTAAGGAGGTTAGCGAATATAGACCTATTAGCCTCTGTAACGTTAACTATAAGATAATTACAAAAACAGT
GGCTAATAGGCTTAAATGGGTTTTAAAAGACATCATATCCGAAGCCCAATCTGCTTTTATTCAAGGTAGATTAATAACGGATAACGTTATAATTGGCCATGAATGCATTA
ATTCTATTAAAAACAGTAGGCATACTAAAATGGATATGGCAGCTTTTAAAATCGATCTTAGCAAAGCCTATGATAGGGTAGAGTGGGCTTTCTTAAAAGGAATCATGTTA
AAGTTGGGGTTTGATTATAGATGGGTCAACCTAGTTTTAAAATGTATCTCTTCTCCTAGTTTTTCTATCCTTCTTAATGGAGAGCCTAAAGGTAATTTTAACTCCAGTAG
GGGCCTTAGACAAGGAGATCCCCTCTCCCCATACTTATTTCTTCTTGTAGCTGAAGGACTATCTCACTTAATATCTTCTGCTAACGAAGAGGGCAGGGTAACAGGTTTGG
TTTGTGCTAATGGTCCTAGAATATCACATCTCCTATTTGCAGATGATAGCATTATTTTTTGTAAGGCTGCAGAACAAGAGCTGGTTGTCCTTAAAAACCTATTAAAATCA
TATGAGCTTGCATCAGGCGAGTGTATTAACTTAGCTAAGTCTGCTATCCTTCTCCCTAACAAAGTTAATTTGGACAGGAAAGCTTTCTTAAGCTCTATTCTAGGTGTTAA
TCATGTTGAGGATTTTGGTAAATATCTAGGTTTCCCCTCAGTCTTTTCTAGAAGTAAGTCCAAAGATTTAGCCTATGTCTTGGATAAAATTTGGAAGGCGACTCAGGGCT
GGAAAAAGGTCTTTCTTCTCTATAGCTGGTGGATGAATATCAACTCGACTTGTTCGAATGAGGAGCTTGGTCTTGTAGCAGTAGCTTGCTGGGCGATCTGGTCAGATAGA
AACAAAATTATTCACGAGGAAGCTATTCCCCCAGTTAACAAAAGAAGTGTTTGGATTCAGGAATACCTTGCGAAATACATAGAAGCTTCCTCAAGGAGAGAGGCGATGCA
TGCAAGTGTTACAAGGGATTCCTCTCCTCAGTCAAATAGAAACTGGATTCCCCCGCCTTCGGATGTTTGGAAATTCAATGTTGATGCGGCTTGGTCCGATTCTCTTCCGG
AAACAGGAATTGGAGTAGTTTGTAGAGGTTCGAATGAGAAATTAATTGAAGCTTCTTGTAATTTTTTGGATATTGCGATGGATGCTCCTGGAGCTGAACTGAGGGCAATA
ATGGAGGGTGTTATAATGGCAAAATCAATGGGTTGCAAAAGGGTTATCATTGAGTCAGATTGTCAAGTGGCAGTAAATTTCCTAAACAAGAAGTCCAAAACATGGTGCAT
TGTGGAGGGCTTGGTGGAGGCGATTTGGGCTTTAGGCCTTTGTTTTGACGATATTCGATTTCAATATGCCCAGAGAGATAGAAACAAGGTGGCTGACAGACTAGCTAAAT
GGGCCAGAGTGAGGAAATGTAATAATATTTGGGTTGAAGCTGATGTAAGCCCAGAATGGCTGTTGTTATTGGTCGAGAATGACCTTTCGTTTTTTGCCCATGTGGCGTTT
TAA
mRNA sequenceShow/hide mRNA sequence
ATGGACATTGAAGAACTGGTTAATGAATGGAAAATTTTCAACCTAATGGAGAAGGAGAAAGAGAACCATATTTTCTTAAACGAGGATGACTGGAAAGAAATGACGGATAA
CCTGGATCACTGCATAATTGGAAAACTCCTTTCAAGCCGCATAATTTCTCCGATCGCCATAAAGAATGCTCTGAAGGGCGCCTGGAAAACAAGAAAAAATTTTAATGTCG
AAACAACAGGTAGAAACATTTTTAACTTCAAATTCGAAAGTCAAGGAGATCGAGATTGGGTCCTGAATAATGGACCTTGGACTTTTGACAAGTACCTTATTGTTCTGGAA
CTTCCAACAGCAAATCAAAGATCTGTGGAGATGGGATTCAAGAAGGCAAATTTCTGGATAAGACTTATCAATTTGCCGACAGGATATAGAAACAGAAAGGTGATCAAGAA
GATTGGAAATAGCATTGGAGAGTTCTTGGAAGTAGAAGAAGGAAGGGAGGAAATTAGTTGGGGAAAGAGCATTCGAATTCGAGTCAAGATTGACATAACGACACCCCTGT
TGAGAGGGTTCATGCTAAAGGCTCCAGAAGTTCAAGGAAATTGTTGGGTGACAATCCGCTATGAAAGACTCCCAGAATTTTGTTTCTTGTGTGGTAGAATTGGGCATATA
GCTAAAGAATGCGAGTGCAAAAGGGGCGAAGAAGAGAAAAACGTCGACAATATGGAATTTGGGAACTGGTTAAGATTTCAAAGTTATTTTAGGCCAAATAAGAAGACAGA
GTCACCAAGGAAACCGAACAGCCCCAGTATGGATCGTGGTAATGAATTCGAGGATGAATTGGGAGATAAAGTGGCAAGTGAAAACAGGAGCAGAAAAGGTAAGGAAAAAG
TGGTTGTAGATCATTCACCTGATGAGGATATTGTGGAGAGTCGAGATAAAAGAAATGAGGGTACTTCCTCTGGGACCTATGTTTTGAACACAATAGCAAGTATGGATACA
GATCTTCTTGTGATACCGAAAAGTGATAAAGATTTAATTCTTAATCAAACAGATGAGAATCAGGTAGATGGAGGGGAGCAAAGCATAAGGTCCTCAACAATATTAAAGGA
CTATTCAGTTGACAATACAAATGAAGGCAATATGGAGGAAGAGGAAACACATTCTGGGAACTCAAAAGCAACCAAGAGAAGAGCTTCTTGGAAAAGAAAAGAAAGAGCAG
GGGTAACGGGCAGAATACAGAAAGAGGAACCATCCAATTTAAAGAGGAAAGCAGTAGAAGACAATAATATTCAGAACAAAAAATTTAAATCAGATGATTCTGGTGCAATG
GATGTGAGTGGCGATGATATTAACAGCAGAGGAGCCAAAGGTGGAATTTGCCTTCTGTGGAAAGACAACAACCTCGTGAGTATTAACTCATATTCAGACAATCACATTGA
CTCTACAGTTAACTGGGAAGGAATTACATGGAGATTCACAGGCCTTTATGGCTTCCCTGAAGCAAGTAAAAAACATCTAACATGGGATTTAATTAGAAGATTGGACAGCC
TTCTTGATTTGCTGTGGCTTATTGGAGGGGATCTAAACGAGATTATGGAGGACTCAGAAAAACTGGGGGGCCCAAGGAGAGACAGAAAGTATTTAGAAGATTTCAGAAGT
GTGATTGATGACTGCAATCTAAGAGACCTAAAGCCGAATGGAGAGTTATTCACCTGGCATAGTAACCGAAGAGGAGTCCATGTGTGGGAAAGATTGGATAGATTTTTGTG
TAATAATAGTTTTGATTCTTTGTTTAATTTTGCAGGACCAAAAAACTTGGATTGGCTATTTTCAGATCATAGGCCAATTGAAGTTAGTCTCAACAATCATCAGACTACTA
CTACATCAAGAAAGGGAAGATCTTTCAAATTTGAAGAGCTCTGGACCAATTATGAGGCATGCGAAGAGCTGATAGCTAGTCATGGAGAATGGGAGGGGAGTAATCTTTCT
TTAACTTCTTTACCTATGGATTTAAAAAATTGCTCTATTGCTTTATCTAAATGGGGTAGGGAGACTCATAATAGGAGGAAAAACAAAATTAAAGAATGCAAAAATTCCCT
AAGAGAAGCCTATAAAAATCTCCATTTAATTGACTTTGATGCTATCCACTATATTGAATTTGAACTCGATAAGCTCTTGGAAGAAGAAGAAATATATTGGAAACAAAGAT
CTAGGGAAGAGTGGCTTAAATGGGGGGACAAGAATTCTAAGTGGTTTCATAAAAAAGCCTCGATAAGGAAGAAAACCAATGAAATAACTGGAATTCATAATTCAGAAGGG
ACTTGGGTAGATGATCCTAAAGAAATAGAGGACGTATTTTATTCTTACTTTTCAAACCTGTTTACATCCTCCCAACCAGACCCAAACAATATTAATTTGGCTTTGAATGG
TTTAAATCCTAGAGTTAGCTCGGAGATGAATAACAAGTTATTGGCCCCGTTTACTAAAGAAGAAATTGAGCTGTCTATTAAACAAATGTTCCCTACAAAAGCTCCCGGTC
CGGATGGTTATCCAGCCATTTTTTACCAAAAATATTGGCACTTAGTGGGTAAGAAAACTATTCAGGAATGCTTAGAAATTTTAAATGGGGATAAAAGCTTAGAATGTTGG
AATAAAACTAATATAGTCTTAATCCCAAAAACCACTTGCCCTAAGGAGGTTAGCGAATATAGACCTATTAGCCTCTGTAACGTTAACTATAAGATAATTACAAAAACAGT
GGCTAATAGGCTTAAATGGGTTTTAAAAGACATCATATCCGAAGCCCAATCTGCTTTTATTCAAGGTAGATTAATAACGGATAACGTTATAATTGGCCATGAATGCATTA
ATTCTATTAAAAACAGTAGGCATACTAAAATGGATATGGCAGCTTTTAAAATCGATCTTAGCAAAGCCTATGATAGGGTAGAGTGGGCTTTCTTAAAAGGAATCATGTTA
AAGTTGGGGTTTGATTATAGATGGGTCAACCTAGTTTTAAAATGTATCTCTTCTCCTAGTTTTTCTATCCTTCTTAATGGAGAGCCTAAAGGTAATTTTAACTCCAGTAG
GGGCCTTAGACAAGGAGATCCCCTCTCCCCATACTTATTTCTTCTTGTAGCTGAAGGACTATCTCACTTAATATCTTCTGCTAACGAAGAGGGCAGGGTAACAGGTTTGG
TTTGTGCTAATGGTCCTAGAATATCACATCTCCTATTTGCAGATGATAGCATTATTTTTTGTAAGGCTGCAGAACAAGAGCTGGTTGTCCTTAAAAACCTATTAAAATCA
TATGAGCTTGCATCAGGCGAGTGTATTAACTTAGCTAAGTCTGCTATCCTTCTCCCTAACAAAGTTAATTTGGACAGGAAAGCTTTCTTAAGCTCTATTCTAGGTGTTAA
TCATGTTGAGGATTTTGGTAAATATCTAGGTTTCCCCTCAGTCTTTTCTAGAAGTAAGTCCAAAGATTTAGCCTATGTCTTGGATAAAATTTGGAAGGCGACTCAGGGCT
GGAAAAAGGTCTTTCTTCTCTATAGCTGGTGGATGAATATCAACTCGACTTGTTCGAATGAGGAGCTTGGTCTTGTAGCAGTAGCTTGCTGGGCGATCTGGTCAGATAGA
AACAAAATTATTCACGAGGAAGCTATTCCCCCAGTTAACAAAAGAAGTGTTTGGATTCAGGAATACCTTGCGAAATACATAGAAGCTTCCTCAAGGAGAGAGGCGATGCA
TGCAAGTGTTACAAGGGATTCCTCTCCTCAGTCAAATAGAAACTGGATTCCCCCGCCTTCGGATGTTTGGAAATTCAATGTTGATGCGGCTTGGTCCGATTCTCTTCCGG
AAACAGGAATTGGAGTAGTTTGTAGAGGTTCGAATGAGAAATTAATTGAAGCTTCTTGTAATTTTTTGGATATTGCGATGGATGCTCCTGGAGCTGAACTGAGGGCAATA
ATGGAGGGTGTTATAATGGCAAAATCAATGGGTTGCAAAAGGGTTATCATTGAGTCAGATTGTCAAGTGGCAGTAAATTTCCTAAACAAGAAGTCCAAAACATGGTGCAT
TGTGGAGGGCTTGGTGGAGGCGATTTGGGCTTTAGGCCTTTGTTTTGACGATATTCGATTTCAATATGCCCAGAGAGATAGAAACAAGGTGGCTGACAGACTAGCTAAAT
GGGCCAGAGTGAGGAAATGTAATAATATTTGGGTTGAAGCTGATGTAAGCCCAGAATGGCTGTTGTTATTGGTCGAGAATGACCTTTCGTTTTTTGCCCATGTGGCGTTT
TAA
Protein sequenceShow/hide protein sequence
MDIEELVNEWKIFNLMEKEKENHIFLNEDDWKEMTDNLDHCIIGKLLSSRIISPIAIKNALKGAWKTRKNFNVETTGRNIFNFKFESQGDRDWVLNNGPWTFDKYLIVLE
LPTANQRSVEMGFKKANFWIRLINLPTGYRNRKVIKKIGNSIGEFLEVEEGREEISWGKSIRIRVKIDITTPLLRGFMLKAPEVQGNCWVTIRYERLPEFCFLCGRIGHI
AKECECKRGEEEKNVDNMEFGNWLRFQSYFRPNKKTESPRKPNSPSMDRGNEFEDELGDKVASENRSRKGKEKVVVDHSPDEDIVESRDKRNEGTSSGTYVLNTIASMDT
DLLVIPKSDKDLILNQTDENQVDGGEQSIRSSTILKDYSVDNTNEGNMEEEETHSGNSKATKRRASWKRKERAGVTGRIQKEEPSNLKRKAVEDNNIQNKKFKSDDSGAM
DVSGDDINSRGAKGGICLLWKDNNLVSINSYSDNHIDSTVNWEGITWRFTGLYGFPEASKKHLTWDLIRRLDSLLDLLWLIGGDLNEIMEDSEKLGGPRRDRKYLEDFRS
VIDDCNLRDLKPNGELFTWHSNRRGVHVWERLDRFLCNNSFDSLFNFAGPKNLDWLFSDHRPIEVSLNNHQTTTTSRKGRSFKFEELWTNYEACEELIASHGEWEGSNLS
LTSLPMDLKNCSIALSKWGRETHNRRKNKIKECKNSLREAYKNLHLIDFDAIHYIEFELDKLLEEEEIYWKQRSREEWLKWGDKNSKWFHKKASIRKKTNEITGIHNSEG
TWVDDPKEIEDVFYSYFSNLFTSSQPDPNNINLALNGLNPRVSSEMNNKLLAPFTKEEIELSIKQMFPTKAPGPDGYPAIFYQKYWHLVGKKTIQECLEILNGDKSLECW
NKTNIVLIPKTTCPKEVSEYRPISLCNVNYKIITKTVANRLKWVLKDIISEAQSAFIQGRLITDNVIIGHECINSIKNSRHTKMDMAAFKIDLSKAYDRVEWAFLKGIML
KLGFDYRWVNLVLKCISSPSFSILLNGEPKGNFNSSRGLRQGDPLSPYLFLLVAEGLSHLISSANEEGRVTGLVCANGPRISHLLFADDSIIFCKAAEQELVVLKNLLKS
YELASGECINLAKSAILLPNKVNLDRKAFLSSILGVNHVEDFGKYLGFPSVFSRSKSKDLAYVLDKIWKATQGWKKVFLLYSWWMNINSTCSNEELGLVAVACWAIWSDR
NKIIHEEAIPPVNKRSVWIQEYLAKYIEASSRREAMHASVTRDSSPQSNRNWIPPPSDVWKFNVDAAWSDSLPETGIGVVCRGSNEKLIEASCNFLDIAMDAPGAELRAI
MEGVIMAKSMGCKRVIIESDCQVAVNFLNKKSKTWCIVEGLVEAIWALGLCFDDIRFQYAQRDRNKVADRLAKWARVRKCNNIWVEADVSPEWLLLLVENDLSFFAHVAF