; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg005924 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg005924
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationscaffold11:124772..130555
RNA-Seq ExpressionSpg005924
SyntenySpg005924
Gene Ontology termsGO:0050789 - regulation of biological process (biological process)
InterPro domainsIPR025558 - Domain of unknown function DUF4283
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN61757.1 hypothetical protein VITISV_030741 [Vitis vinifera]2.3e-7430.68Show/hide
Query:  WVSGVYGSCSYRERSQFLQELYDLQGLCQGAWCIGGDFNMIRWAKERLATSNSSRSMRKFNRFVRICALSDPPMVNGDFSWSRM------------GIRV
        W+S VYG  +   R  F  EL D+ GL    WC+GGDFN+IR + E+L  S  + +M+ F+ F+R C L D P+ +  F+WS               + V
Subjt:  WVSGVYGSCSYRERSQFLQELYDLQGLCQGAWCIGGDFNMIRWAKERLATSNSSRSMRKFNRFVRICALSDPPMVNGDFSWSRM------------GIRV

Query:  AATRIDR-----FLLSQSWVQ------SFGDGKVESWWKELNPTGWAGYRIMEKLKGLKMKIREWSR--------------------ENLSKAG------
          T   +     F     W+Q      SFG     SWW+E    GW G++ M KL+ +K K++EW++                    + L + G      
Subjt:  AATRIDR-----FLLSQSWVQ------SFGDGKVESWWKELNPTGWAGYRIMEKLKGLKMKIREWSR--------------------ENLSKAG------

Query:  ---RKLLKVNLMEIIIDEQRCLNQNCKIKWLQEGDENTSFFHRWASTIKNRTFISVMESEEGRILAKNDEIAEEVTGFFSHLYQKDEGQS----------
           R L K  L E+I+ E+    Q  ++KW++EGD N+ FFH+ A+   NR  I  +E+E G +L  ++ I EE+  +F  LY    G+           
Subjt:  ---RKLLKVNLMEIIIDEQRCLNQNCKIKWLQEGDENTSFFHRWASTIKNRTFISVMESEEGRILAKNDEIAEEVTGFFSHLYQKDEGQS----------

Query:  -----------------------YLGFPLGGNYHRRIFWDPLLERVEKKIDKWRHFPMSKGGRVTLAQSILSSLPLYHFSLLKAPKSIIKATEKLIRNFV
                               YLG P+GGN     FWDP++ER+ +++D W+   +S GGR+TL QS L+ +P Y  SL K   S+    E+L R+F+
Subjt:  -----------------------YLGFPLGGNYHRRIFWDPLLERVEKKIDKWRHFPMSKGGRVTLAQSILSSLPLYHFSLLKAPKSIIKATEKLIRNFV

Query:  WNGGHSKLSSNLVKWEWSSLPTNLGGLGVGSLAQRNVALMSKWLWRFSKEDDLLWRKIIVSIYGLSPCGRKTTQFRGKKGNRIWVDIAANYYTFDPCVKL
        W+G       +L+                   + RNVAL+ KWLWR+ +E   LW ++I+SIYG    G              W  IA  +  F    + 
Subjt:  WNGGHSKLSSNLVKWEWSSLPTNLGGLGVGSLAQRNVALMSKWLWRFSKEDDLLWRKIIVSIYGLSPCGRKTTQFRGKKGNRIWVDIAANYYTFDPCVKL

Query:  TVNNGRCIKFWEDGWCDDQSLKLLYPDLYIISNKKGATVAD-CWESESLTWDLDFRKGLFEREI---PSWLALLDRI
         V +G  I+FW+D W  DQ L   YP L  +   K A ++     S   +W+ +FR+ L + EI    S +  LDR+
Subjt:  TVNNGRCIKFWEDGWCDDQSLKLLYPDLYIISNKKGATVAD-CWESESLTWDLDFRKGLFEREI---PSWLALLDRI

KAA0057507.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]2.4e-7622.47Show/hide
Query:  EVVVITKRDLHDDWGRILEILHQQWEEPRVINPFQPYKALLKCPSSKMADLLSMNK---GWVSFGPAILKTEKWNIKHHSKITCVPSYGGWIKLRNLPLH
        + V+IT+R  HDDW RI+  L +Q E      PFQ  KA+L   +   A LL  NK   GW + G   +K E W+   HS  + +PSYGGW++ R +PLH
Subjt:  EVVVITKRDLHDDWGRILEILHQQWEEPRVINPFQPYKALLKCPSSKMADLLSMNK---GWVSFGPAILKTEKWNIKHHSKITCVPSYGGWIKLRNLPLH

Query:  LWCLSIFKAIGDCLGGFIEYEESHSLLIDCVEVKMKVRDNYCGFLPAGIRIVDGE-DQFHVQVVTYQEGTMLIERVAGIHGSFSSEAAHAFHRGP-----
        LW  + F+ IG   GGF++  +    +   ++ K+KVR NY GF+PA I I D + + F V  V   E   L+ER   +HGSF ++AA  F +       
Subjt:  LWCLSIFKAIGDCLGGFIEYEESHSLLIDCVEVKMKVRDNYCGFLPAGIRIVDGE-DQFHVQVVTYQEGTMLIERVAGIHGSFSSEAAHAFHRGP-----

Query:  ---NDPSFCPADIWRIENGWDFPMVNIQELIEEETDSGQQLFSIERKFELSRQLEIGERSKAAGPSFVPESPAHPESSGQAHNGRSHSENSPSVTVYKRK
           N     P +  R     D+ + N  +         ++  S E +++   Q ++ +R K  G + +              N + H   S        +
Subjt:  ---NDPSFCPADIWRIENGWDFPMVNIQELIEEETDSGQQLFSIERKFELSRQLEIGERSKAAGPSFVPESPAHPESSGQAHNGRSHSENSPSVTVYKRK

Query:  GVSFANNNQVTTCKRGDIQHTDKNGSPPRAE-DTSNDWMRDTGLDSDPSMSSPGSRVSKDVDEESPNLYEDNEDSVLVDYQRLFEADKELDRGTQKRQLA
         VSF +   +    + +  +T+ N      E  T ND           + +    R+ KD  E +    ED++ S+           KE   G+++  L+
Subjt:  GVSFANNNQVTTCKRGDIQHTDKNGSPPRAE-DTSNDWMRDTGLDSDPSMSSPGSRVSKDVDEESPNLYEDNEDSVLVDYQRLFEADKELDRGTQKRQLA

Query:  SKKFQSSLLPPPVSSAENRHETEAAGHPNCESKLQGYGSLE------EVIEETPKALAIIQSTREEHFDKINEKDRGYIEGFSISKEIVLTLRKNNLCIR
              S L   + S EN H  +   +   +   +   S E       V E   +  +  +ST E +           I+  +  +++V+ L++N L + 
Subjt:  SKKFQSSLLPPPVSSAENRHETEAAGHPNCESKLQGYGSLE------EVIEETPKALAIIQSTREEHFDKINEKDRGYIEGFSISKEIVLTLRKNNLCIR

Query:  P----------------------IAGSS--SKKGQI------------------------------EGWVSGVYGSCSYRERSQFLQELYDLQGLCQGAW
        P                      IAG      KG I                                W++ VYG   Y +R++   EL  LQ LC   W
Subjt:  P----------------------IAGSS--SKKGQI------------------------------EGWVSGVYGSCSYRERSQFLQELYDLQGLCQGAW

Query:  CIGGDFNMIRWAKERLATSNSSRSMRKFNRFVRICALSDPPMVNGDFSWSRMGIRVAATRIDRFLLSQSWVQSFG-------------------------
         I GDFN++RW +E  A S   R+M  FN F+ +  L DPP +N +F+WS + +    +R+DRFLLS+ W  +FG                         
Subjt:  CIGGDFNMIRWAKERLATSNSSRSMRKFNRFVRICALSDPPMVNGDFSWSRMGIRVAATRIDRFLLSQSWVQSFG-------------------------

Query:  --------------DGKVE----SWWKELNPTGWAGYRIMEKLKGLKMKIREW--SRENLSKAGRKL---------------------------LKVNLM
                      D + +    +WW      G+ GY  ++ L  L   I+EW  ++ NL  A +K                            LK +L+
Subjt:  --------------DGKVE----SWWKELNPTGWAGYRIMEKLKGLKMKIREW--SRENLSKAGRKL---------------------------LKVNLM

Query:  EIIIDEQRCLNQNCKIKWLQEGDENTSFFHR------------------------------------------------------W--------------
         I  ++ +  +Q  + +W   GDEN S+FHR                                                      W              
Subjt:  EIIIDEQRCLNQNCKIKWLQEGDENTSFFHR------------------------------------------------------W--------------

Query:  ------ASTIK----------------------------------------------NRTFISVMESEEG-----------------RILAK--------
               STI                                               N TFI+++  +E                  +I+AK        
Subjt:  ------ASTIK----------------------------------------------NRTFISVMESEEG-----------------RILAK--------

Query:  --NDEIAE----------------------------EVTGF--------------------------FSH------------------------------
           D IAE                            ++ GF                          F H                              
Subjt:  --NDEIAE----------------------------EVTGF--------------------------FSH------------------------------

Query:  -------------------------------------------------------------------------LYQKDEG--------------------
                                                                                 L++K  G                    
Subjt:  -------------------------------------------------------------------------LYQKDEG--------------------

Query:  ---------------QSYLGFPLGGNYHRRIFWDPLLERVEKKIDKWRHFPMSKGGRVTLAQSILSSLPLYHFSLLKAPKSIIKATEKLIRNFVWNGGHS
                        +YLG PLGGN   R FWD  +E + KK++ W++  +SKGGR+TL ++ LSSLP Y  S  KAP S+ K  EK  R+F+W G   
Subjt:  ---------------QSYLGFPLGGNYHRRIFWDPLLERVEKKIDKWRHFPMSKGGRVTLAQSILSSLPLYHFSLLKAPKSIIKATEKLIRNFVWNGGHS

Query:  KLSSNLVKWEWSSLPTNLGGLGVGSLAQRNVALMSKWLWRFSKEDDLLWRKIIVSIYGLSPCGRKTTQFRGKKGNRIWVDIAANYYTFDPCVKLTVNNGR
        K +++L+ W   + P  LGGLG+  L   N AL+ KWLWR+  E + LW+K I + Y  +  G      R    N  W  I      ++  +    N+G 
Subjt:  KLSSNLVKWEWSSLPTNLGGLGVGSLAQRNVALMSKWLWRFSKEDDLLWRKIIVSIYGLSPCGRKTTQFRGKKGNRIWVDIAANYYTFDPCVKLTVNNGR

Query:  CIKFWEDGWCDDQSLKLLYPDLYIISNKKGATVADCWESESLTWDLDFRKGLFEREIPSW
         + FW   W ++  L L  P LY +SN + ATV + W+  S  W++  R+ L ERE  +W
Subjt:  CIKFWEDGWCDDQSLKLLYPDLYIISNKKGATVADCWESESLTWDLDFRKGLFEREIPSW

TYJ99315.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]6.9e-8723.29Show/hide
Query:  RSFAD-IVRSKPRAPSNFQGKESSREWKDSQERKGCKEVVRKIDWYEVVVITKRDLHDDWGRILEILHQQWEEPRVINPFQPYKALLKCPSSKMADLLSM
        RS+A  +   +P A S+      S +   S     C      +     VVI +R  HDDW +IL+ L +Q EE    N F   KAL+   S+  A+LL  
Subjt:  RSFAD-IVRSKPRAPSNFQGKESSREWKDSQERKGCKEVVRKIDWYEVVVITKRDLHDDWGRILEILHQQWEEPRVINPFQPYKALLKCPSSKMADLLSM

Query:  NKGWVSFGPAILKTEKWNIKHHSKITCVPSYGGWIKLRNLPLHLWCLSIFKAIGDCLGGFIEYEESHSLLIDCVEVKMKVRDNYCGFLPAGIRIVDGE-D
        NKGW + G   ++ EKW+  +H+    +PSYGGW   R +PLHLW +  F+ IG    G I+  E      + +E ++KVR NY GFLPA +RI D E +
Subjt:  NKGWVSFGPAILKTEKWNIKHHSKITCVPSYGGWIKLRNLPLHLWCLSIFKAIGDCLGGFIEYEESHSLLIDCVEVKMKVRDNYCGFLPAGIRIVDGE-D

Query:  QFHVQVVTYQEGTMLIERVAGIHGSFSSEAAHAF-------------------------------HRGPNDPSFCPADIWRIENGWDFPMVNIQELIEEE
        +F VQVVT+ EG  LIER   +HG+F  +AA +F                                  P+ PS   + I + +     P    +EL+ + 
Subjt:  QFHVQVVTYQEGTMLIERVAGIHGSFSSEAAHAF-------------------------------HRGPNDPSFCPADIWRIENGWDFPMVNIQELIEEE

Query:  T----------------------DSGQQLFSIERKFELSRQLEIGERSKA-AGPS-----FVPES-PAH-------PESSGQAHNGRS----HSENSPSV
                               D G+Q   I+ +   +  L+  +R  +   PS     F P+S PA+       PE   +    RS     S   P+ 
Subjt:  T----------------------DSGQQLFSIERKFELSRQLEIGERSKA-AGPS-----FVPES-PAH-------PESSGQAHNGRS----HSENSPSV

Query:  TVYKRKGVSFANNNQVT-------------TCKRGDIQHTDKNGSPPRAEDTSN----DWMRDTGLDSDPSMSSP---GSRVSKDVDEESPNLYEDNEDS
           + KGV      Q+              T   GD+   D N S     ++ N    D      +   P M  P    S  S + +   P      +  
Subjt:  TVYKRKGVSFANNNQVT-------------TCKRGDIQHTDKNGSPPRAEDTSN----DWMRDTGLDSDPSMSSP---GSRVSKDVDEESPNLYEDNEDS

Query:  VLVDYQRLFEADKELDRGTQKRQLAS--KKFQSSLLPPPVSSAENRHETEAAGHPNCESKLQGYGSLEEV-----IEETPKALAIIQSTREEHFDKINEK
            Y++  E +K+ D    K+QL S  KK    L     SS             N   K+     ++ +     I    K  +         +D  N  
Subjt:  VLVDYQRLFEADKELDRGTQKRQLAS--KKFQSSLLPPPVSSAENRHETEAAGHPNCESKLQGYGSLEEV-----IEETPKALAIIQSTREEHFDKINEK

Query:  DRGYIEG-FSISKEIVLTLRKNNLCIRPIAGSSSKKGQIEGWVSGVYGSCSYRERSQFLQELYDLQGLCQGAWCIGGDFNMIRWAKERLATSNSSRSMRK
             EG FS+S   +L    NN                  W++G+YG    RER  F  EL++LQ L    W +GGD N+IR  +E  +  +SS + R 
Subjt:  DRGYIEG-FSISKEIVLTLRKNNLCIRPIAGSSSKKGQIEGWVSGVYGSCSYRERSQFLQELYDLQGLCQGAWCIGGDFNMIRWAKERLATSNSSRSMRK

Query:  FNRFVRICALSDPPMVNGDFSWSRMGIRVAATRIDRFLLSQSWVQSFGDGKVES---------------------------------------------W
         N F+    L DPP+ N  F+WS +      +RIDRFL + SW   F      +                                             W
Subjt:  FNRFVRICALSDPPMVNGDFSWSRMGIRVAATRIDRFLLSQSWVQSFGDGKVES---------------------------------------------W

Query:  WKELNPTGWAGYRIMEKLKGLKMKIREWSRENL----------------------------SKAGRKL-LKVNLMEIIIDEQRCLNQNCKIKWLQEGDEN
        W+     G+ G+  +++LK L   I+ W +E L                             ++ R+L LK +L E+ + E +   Q  K  WL+EGDEN
Subjt:  WKELNPTGWAGYRIMEKLKGLKMKIREWSRENL----------------------------SKAGRKL-LKVNLMEIIIDEQRCLNQNCKIKWLQEGDEN

Query:  TSFFHRWASTIKNRTFISVMESEEGRILAKNDEIAEEVTGFFSHLYQKD-----------------------------EGQ-------------------
        +SFFHR  S+ + R+FI  ++ EEG I   N+ I+     FFS +Y+                               EG+                   
Subjt:  TSFFHRWASTIKNRTFISVMESEEGRILAKNDEIAEEVTGFFSHLYQKD-----------------------------EGQ-------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------SYLGFPLGGNYHRRIFWDPLLERVEKKIDKWRHFPMSKGGRVTLAQSILSSLPLYHFSLLKAPKSIIKATEK
                                    SYLG PLGGN    +FW  + ++++KK++ W++  +SKGGR+TL +S LSSLP+Y  S+ +AP    K  EK
Subjt:  ----------------------------SYLGFPLGGNYHRRIFWDPLLERVEKKIDKWRHFPMSKGGRVTLAQSILSSLPLYHFSLLKAPKSIIKATEK

Query:  LIRNFVWNGGHSKLSSNLVKWEWSSLPTNLGGLGVGSLAQRNVALMSKWLWRFSKEDDLLWRKIIVSIYGLSPCGRKTTQFRGKKGNRIWVDIAANYYTF
        L R F+W G +    S+L+ W   S     GGLG+  L   N AL+SKWLWR+  E + LWR++I   Y     G   +          W  I  +   F
Subjt:  LIRNFVWNGGHSKLSSNLVKWEWSSLPTNLGGLGVGSLAQRNVALMSKWLWRFSKEDDLLWRKIIVSIYGLSPCGRKTTQFRGKKGNRIWVDIAANYYTF

Query:  DPCVKLTVNNGRCIKFWEDGWCDDQSLKLLYPDLYIISNKKGATVADCWESESLTWDLDFRKGLFEREIPSWLALLD
               +NNG  I FW   W  +  L   YP L+ ++  K  +V D W +    W++ FR+ L +RE  +W  +L+
Subjt:  DPCVKLTVNNGRCIKFWEDGWCDDQSLKLLYPDLYIISNKKGATVADCWESESLTWDLDFRKGLFEREIPSWLALLD

TYK00493.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]8.4e-8522.66Show/hide
Query:  VVITKRDLHDDWGRILEILHQQWEEPRVINPFQPYKALLKCPSSKMADLLSMNKGWVSFGPAILKTEKWNIKHHSKITCVPSYGGWIKLRNLPLHLWCLS
        VV+ +R  HDDW +IL+ L +Q EE    N F   K L+   S+  A+LL  NKGW + G   ++ EKW    H+    +PSYGGW   R +PLHLW + 
Subjt:  VVITKRDLHDDWGRILEILHQQWEEPRVINPFQPYKALLKCPSSKMADLLSMNKGWVSFGPAILKTEKWNIKHHSKITCVPSYGGWIKLRNLPLHLWCLS

Query:  IFKAIGDCLGGFIEYEESHSLLIDCVEVKMKVRDNYCGFLPAGIRIVDGE-DQFHVQVVTYQEGTMLIERVAGIHGSFSSEAAHAFHRGPNDPSFCPADI
         F+ IG   GG I+  E      + +E K+K+R NY GFLPA ++I D E ++F VQVVT+ EG  L+ER   +HG+F  +AA +F        F P   
Subjt:  IFKAIGDCLGGFIEYEESHSLLIDCVEVKMKVRDNYCGFLPAGIRIVDGE-DQFHVQVVTYQEGTMLIERVAGIHGSFSSEAAHAFHRGPNDPSFCPADI

Query:  WRIENGWDFPMVNIQELIEEETDSGQQLFSIERKFELSRQLEIGERSKAAGPSFVPESPAHPES-------------SGQAHNGRSH---------SENS
          + +G +    ++   I     SG +      +    + + I     A  P+ + E   +  S             SG +++G            S+ +
Subjt:  WRIENGWDFPMVNIQELIEEETDSGQQLFSIERKFELSRQLEIGERSKAAGPSFVPESPAHPES-------------SGQAHNGRSH---------SENS

Query:  PSVTVYK-RKGVSFANNNQVTTCKRGDIQHTDKNGSPPRAEDTSNDWMRDTGLDSDPSMSSPGSRVSKDVDEESPNLYEDNEDSVLVDYQRLFEADKELD
         +   YK ++ VSF + +  TT    D      N SP + +  S +                   V K      P L  +     L+  Q L     +LD
Subjt:  PSVTVYK-RKGVSFANNNQVTTCKRGDIQHTDKNGSPPRAEDTSNDWMRDTGLDSDPSMSSPGSRVSKDVDEESPNLYEDNEDSVLVDYQRLFEADKELD

Query:  RGTQKRQLASKKFQSSLLPPPVSSAENRHETEAAGHPNCESKLQGYGSLEEVIEETPKALAI----------------IQSTREEHFDKINEKDRGYIEG
           +   L        +L P   S E+ H ++ A   +  +         EV+ ETP+                      S R  H+ +  E        
Subjt:  RGTQKRQLASKKFQSSLLPPPVSSAENRHETEAAGHPNCESKLQGYGSLEEVIEETPKALAI----------------IQSTREEHFDKINEKDRGYIEG

Query:  FSISKEIVLTLRKNNLCIRPIAGSS-----------------------------SKKGQIEG---------------WVSGVYGSCSYRERSQFLQELYD
         +   ++V  L++N L +     SS                             S   Q EG               W++G+YG    RER    ++L++
Subjt:  FSISKEIVLTLRKNNLCIRPIAGSS-----------------------------SKKGQIEG---------------WVSGVYGSCSYRERSQFLQELYD

Query:  LQGLCQGAWCIGGDFNMIRWAKERLATSNSSRSMRKFNRFVRICALSDPPMVNGDFSWSRMGIRVAATRIDRFLLSQSWVQSFG----------------
        L  L    W IGGD N++R  +E  A + SS S    N F+    L DPP+ N  ++WS +      +R+DRFL +  W   F                 
Subjt:  LQGLCQGAWCIGGDFNMIRWAKERLATSNSSRSMRKFNRFVRICALSDPPMVNGDFSWSRMGIRVAATRIDRFLLSQSWVQSFG----------------

Query:  -----------------------------DGKVESWWKELNPTGWAGYRIMEKLKGLKMKIREWSRE----------------------------NLSKA
                                        +E WW+     G  G+  +++LK L   I+ W +E                            +L ++
Subjt:  -----------------------------DGKVESWWKELNPTGWAGYRIMEKLKGLKMKIREWSRE----------------------------NLSKA

Query:  GRKL-LKVNLMEIIIDEQRCLNQNCKIKWLQEGDENTSFFHRWASTIKNRTFISVMESEEGRILAKNDEIAEEVTGFFSHLYQ-----------------
         R+L LK  L ++ + E +   Q  K  WL+EGDEN++FFHR  S+ + R  I  ++ EEG I   N+ I+      FS +Y+                 
Subjt:  GRKL-LKVNLMEIIIDEQRCLNQNCKIKWLQEGDENTSFFHRWASTIKNRTFISVMESEEGRILAKNDEIAEEVTGFFSHLYQ-----------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---KDEGQ-------------------------------------------------SYLGFPLGGNYHRRIFWDPLLERVEKKIDKWRHFPMSKGGRVT
           K +G+                                                 +YLG PLGGN    +FW  + +R++KK+  W++  +SKGGR+T
Subjt:  ---KDEGQ-------------------------------------------------SYLGFPLGGNYHRRIFWDPLLERVEKKIDKWRHFPMSKGGRVT

Query:  LAQSILSSLPLYHFSLLKAPKSIIKATEKLIRNFVWNGGHSKLSSNLVKWEWSSLPTNLGGLGVGSLAQRNVALMSKWLWRFSKEDDLLWRKIIVSIYGL
        L +S LSSLP+Y  S+ +AP S  K  EKL RNF+W G      S+L+ W   + P   GGLG+  L   N AL+SKWLWR+  E + LWR++I   Y  
Subjt:  LAQSILSSLPLYHFSLLKAPKSIIKATEKLIRNFVWNGGHSKLSSNLVKWEWSSLPTNLGGLGVGSLAQRNVALMSKWLWRFSKEDDLLWRKIIVSIYGL

Query:  SPCGRKTTQFRGKKGNRIWVDIAANYYTFDPCVKLTVNNGRCIKFWEDGWCDDQSLKLLYPDLYIISNKKGATVADCWESESLTWDLDFRKGLFEREIPS
           G   +          W  I  N   F       +NNG  I FW   W  +  L   YP L+ +S  K +++ D W S +  W++ FR+ L +RE+ +
Subjt:  SPCGRKTTQFRGKKGNRIWVDIAANYYTFDPCVKLTVNNGRCIKFWEDGWCDDQSLKLLYPDLYIISNKKGATVADCWESESLTWDLDFRKGLFEREIPS

Query:  WLALLDRIKDVR
        W  +L+ +  +R
Subjt:  WLALLDRIKDVR

TYK08190.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]2.4e-7622.47Show/hide
Query:  EVVVITKRDLHDDWGRILEILHQQWEEPRVINPFQPYKALLKCPSSKMADLLSMNK---GWVSFGPAILKTEKWNIKHHSKITCVPSYGGWIKLRNLPLH
        + V+IT+R  HDDW RI+  L +Q E      PFQ  KA+L   +   A LL  NK   GW + G   +K E W+   HS  + +PSYGGW++ R +PLH
Subjt:  EVVVITKRDLHDDWGRILEILHQQWEEPRVINPFQPYKALLKCPSSKMADLLSMNK---GWVSFGPAILKTEKWNIKHHSKITCVPSYGGWIKLRNLPLH

Query:  LWCLSIFKAIGDCLGGFIEYEESHSLLIDCVEVKMKVRDNYCGFLPAGIRIVDGE-DQFHVQVVTYQEGTMLIERVAGIHGSFSSEAAHAFHRGP-----
        LW  + F+ IG   GGF++  +    +   ++ K+KVR NY GF+PA I I D + + F V  V   E   L+ER   +HGSF ++AA  F +       
Subjt:  LWCLSIFKAIGDCLGGFIEYEESHSLLIDCVEVKMKVRDNYCGFLPAGIRIVDGE-DQFHVQVVTYQEGTMLIERVAGIHGSFSSEAAHAFHRGP-----

Query:  ---NDPSFCPADIWRIENGWDFPMVNIQELIEEETDSGQQLFSIERKFELSRQLEIGERSKAAGPSFVPESPAHPESSGQAHNGRSHSENSPSVTVYKRK
           N     P +  R     D+ + N  +         ++  S E +++   Q ++ +R K  G + +              N + H   S        +
Subjt:  ---NDPSFCPADIWRIENGWDFPMVNIQELIEEETDSGQQLFSIERKFELSRQLEIGERSKAAGPSFVPESPAHPESSGQAHNGRSHSENSPSVTVYKRK

Query:  GVSFANNNQVTTCKRGDIQHTDKNGSPPRAE-DTSNDWMRDTGLDSDPSMSSPGSRVSKDVDEESPNLYEDNEDSVLVDYQRLFEADKELDRGTQKRQLA
         VSF +   +    + +  +T+ N      E  T ND           + +    R+ KD  E +    ED++ S+           KE   G+++  L+
Subjt:  GVSFANNNQVTTCKRGDIQHTDKNGSPPRAE-DTSNDWMRDTGLDSDPSMSSPGSRVSKDVDEESPNLYEDNEDSVLVDYQRLFEADKELDRGTQKRQLA

Query:  SKKFQSSLLPPPVSSAENRHETEAAGHPNCESKLQGYGSLE------EVIEETPKALAIIQSTREEHFDKINEKDRGYIEGFSISKEIVLTLRKNNLCIR
              S L   + S EN H  +   +   +   +   S E       V E   +  +  +ST E +           I+  +  +++V+ L++N L + 
Subjt:  SKKFQSSLLPPPVSSAENRHETEAAGHPNCESKLQGYGSLE------EVIEETPKALAIIQSTREEHFDKINEKDRGYIEGFSISKEIVLTLRKNNLCIR

Query:  P----------------------IAGSS--SKKGQI------------------------------EGWVSGVYGSCSYRERSQFLQELYDLQGLCQGAW
        P                      IAG      KG I                                W++ VYG   Y +R++   EL  LQ LC   W
Subjt:  P----------------------IAGSS--SKKGQI------------------------------EGWVSGVYGSCSYRERSQFLQELYDLQGLCQGAW

Query:  CIGGDFNMIRWAKERLATSNSSRSMRKFNRFVRICALSDPPMVNGDFSWSRMGIRVAATRIDRFLLSQSWVQSFG-------------------------
         I GDFN++RW +E  A S   R+M  FN F+ +  L DPP +N +F+WS + +    +R+DRFLLS+ W  +FG                         
Subjt:  CIGGDFNMIRWAKERLATSNSSRSMRKFNRFVRICALSDPPMVNGDFSWSRMGIRVAATRIDRFLLSQSWVQSFG-------------------------

Query:  --------------DGKVE----SWWKELNPTGWAGYRIMEKLKGLKMKIREW--SRENLSKAGRKL---------------------------LKVNLM
                      D + +    +WW      G+ GY  ++ L  L   I+EW  ++ NL  A +K                            LK +L+
Subjt:  --------------DGKVE----SWWKELNPTGWAGYRIMEKLKGLKMKIREW--SRENLSKAGRKL---------------------------LKVNLM

Query:  EIIIDEQRCLNQNCKIKWLQEGDENTSFFHR------------------------------------------------------W--------------
         I  ++ +  +Q  + +W   GDEN S+FHR                                                      W              
Subjt:  EIIIDEQRCLNQNCKIKWLQEGDENTSFFHR------------------------------------------------------W--------------

Query:  ------ASTIK----------------------------------------------NRTFISVMESEEG-----------------RILAK--------
               STI                                               N TFI+++  +E                  +I+AK        
Subjt:  ------ASTIK----------------------------------------------NRTFISVMESEEG-----------------RILAK--------

Query:  --NDEIAE----------------------------EVTGF--------------------------FSH------------------------------
           D IAE                            ++ GF                          F H                              
Subjt:  --NDEIAE----------------------------EVTGF--------------------------FSH------------------------------

Query:  -------------------------------------------------------------------------LYQKDEG--------------------
                                                                                 L++K  G                    
Subjt:  -------------------------------------------------------------------------LYQKDEG--------------------

Query:  ---------------QSYLGFPLGGNYHRRIFWDPLLERVEKKIDKWRHFPMSKGGRVTLAQSILSSLPLYHFSLLKAPKSIIKATEKLIRNFVWNGGHS
                        +YLG PLGGN   R FWD  +E + KK++ W++  +SKGGR+TL ++ LSSLP Y  S  KAP S+ K  EK  R+F+W G   
Subjt:  ---------------QSYLGFPLGGNYHRRIFWDPLLERVEKKIDKWRHFPMSKGGRVTLAQSILSSLPLYHFSLLKAPKSIIKATEKLIRNFVWNGGHS

Query:  KLSSNLVKWEWSSLPTNLGGLGVGSLAQRNVALMSKWLWRFSKEDDLLWRKIIVSIYGLSPCGRKTTQFRGKKGNRIWVDIAANYYTFDPCVKLTVNNGR
        K +++L+ W   + P  LGGLG+  L   N AL+ KWLWR+  E + LW+K I + Y  +  G      R    N  W  I      ++  +    N+G 
Subjt:  KLSSNLVKWEWSSLPTNLGGLGVGSLAQRNVALMSKWLWRFSKEDDLLWRKIIVSIYGLSPCGRKTTQFRGKKGNRIWVDIAANYYTFDPCVKLTVNNGR

Query:  CIKFWEDGWCDDQSLKLLYPDLYIISNKKGATVADCWESESLTWDLDFRKGLFEREIPSW
         + FW   W ++  L L  P LY +SN + ATV + W+  S  W++  R+ L ERE  +W
Subjt:  CIKFWEDGWCDDQSLKLLYPDLYIISNKKGATVADCWESESLTWDLDFRKGLFEREIPSW

TrEMBL top hitse value%identityAlignment
A0A5D3BL61 LINE-1 retrotransposable element ORF2 protein4.1e-8522.66Show/hide
Query:  VVITKRDLHDDWGRILEILHQQWEEPRVINPFQPYKALLKCPSSKMADLLSMNKGWVSFGPAILKTEKWNIKHHSKITCVPSYGGWIKLRNLPLHLWCLS
        VV+ +R  HDDW +IL+ L +Q EE    N F   K L+   S+  A+LL  NKGW + G   ++ EKW    H+    +PSYGGW   R +PLHLW + 
Subjt:  VVITKRDLHDDWGRILEILHQQWEEPRVINPFQPYKALLKCPSSKMADLLSMNKGWVSFGPAILKTEKWNIKHHSKITCVPSYGGWIKLRNLPLHLWCLS

Query:  IFKAIGDCLGGFIEYEESHSLLIDCVEVKMKVRDNYCGFLPAGIRIVDGE-DQFHVQVVTYQEGTMLIERVAGIHGSFSSEAAHAFHRGPNDPSFCPADI
         F+ IG   GG I+  E      + +E K+K+R NY GFLPA ++I D E ++F VQVVT+ EG  L+ER   +HG+F  +AA +F        F P   
Subjt:  IFKAIGDCLGGFIEYEESHSLLIDCVEVKMKVRDNYCGFLPAGIRIVDGE-DQFHVQVVTYQEGTMLIERVAGIHGSFSSEAAHAFHRGPNDPSFCPADI

Query:  WRIENGWDFPMVNIQELIEEETDSGQQLFSIERKFELSRQLEIGERSKAAGPSFVPESPAHPES-------------SGQAHNGRSH---------SENS
          + +G +    ++   I     SG +      +    + + I     A  P+ + E   +  S             SG +++G            S+ +
Subjt:  WRIENGWDFPMVNIQELIEEETDSGQQLFSIERKFELSRQLEIGERSKAAGPSFVPESPAHPES-------------SGQAHNGRSH---------SENS

Query:  PSVTVYK-RKGVSFANNNQVTTCKRGDIQHTDKNGSPPRAEDTSNDWMRDTGLDSDPSMSSPGSRVSKDVDEESPNLYEDNEDSVLVDYQRLFEADKELD
         +   YK ++ VSF + +  TT    D      N SP + +  S +                   V K      P L  +     L+  Q L     +LD
Subjt:  PSVTVYK-RKGVSFANNNQVTTCKRGDIQHTDKNGSPPRAEDTSNDWMRDTGLDSDPSMSSPGSRVSKDVDEESPNLYEDNEDSVLVDYQRLFEADKELD

Query:  RGTQKRQLASKKFQSSLLPPPVSSAENRHETEAAGHPNCESKLQGYGSLEEVIEETPKALAI----------------IQSTREEHFDKINEKDRGYIEG
           +   L        +L P   S E+ H ++ A   +  +         EV+ ETP+                      S R  H+ +  E        
Subjt:  RGTQKRQLASKKFQSSLLPPPVSSAENRHETEAAGHPNCESKLQGYGSLEEVIEETPKALAI----------------IQSTREEHFDKINEKDRGYIEG

Query:  FSISKEIVLTLRKNNLCIRPIAGSS-----------------------------SKKGQIEG---------------WVSGVYGSCSYRERSQFLQELYD
         +   ++V  L++N L +     SS                             S   Q EG               W++G+YG    RER    ++L++
Subjt:  FSISKEIVLTLRKNNLCIRPIAGSS-----------------------------SKKGQIEG---------------WVSGVYGSCSYRERSQFLQELYD

Query:  LQGLCQGAWCIGGDFNMIRWAKERLATSNSSRSMRKFNRFVRICALSDPPMVNGDFSWSRMGIRVAATRIDRFLLSQSWVQSFG----------------
        L  L    W IGGD N++R  +E  A + SS S    N F+    L DPP+ N  ++WS +      +R+DRFL +  W   F                 
Subjt:  LQGLCQGAWCIGGDFNMIRWAKERLATSNSSRSMRKFNRFVRICALSDPPMVNGDFSWSRMGIRVAATRIDRFLLSQSWVQSFG----------------

Query:  -----------------------------DGKVESWWKELNPTGWAGYRIMEKLKGLKMKIREWSRE----------------------------NLSKA
                                        +E WW+     G  G+  +++LK L   I+ W +E                            +L ++
Subjt:  -----------------------------DGKVESWWKELNPTGWAGYRIMEKLKGLKMKIREWSRE----------------------------NLSKA

Query:  GRKL-LKVNLMEIIIDEQRCLNQNCKIKWLQEGDENTSFFHRWASTIKNRTFISVMESEEGRILAKNDEIAEEVTGFFSHLYQ-----------------
         R+L LK  L ++ + E +   Q  K  WL+EGDEN++FFHR  S+ + R  I  ++ EEG I   N+ I+      FS +Y+                 
Subjt:  GRKL-LKVNLMEIIIDEQRCLNQNCKIKWLQEGDENTSFFHRWASTIKNRTFISVMESEEGRILAKNDEIAEEVTGFFSHLYQ-----------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---KDEGQ-------------------------------------------------SYLGFPLGGNYHRRIFWDPLLERVEKKIDKWRHFPMSKGGRVT
           K +G+                                                 +YLG PLGGN    +FW  + +R++KK+  W++  +SKGGR+T
Subjt:  ---KDEGQ-------------------------------------------------SYLGFPLGGNYHRRIFWDPLLERVEKKIDKWRHFPMSKGGRVT

Query:  LAQSILSSLPLYHFSLLKAPKSIIKATEKLIRNFVWNGGHSKLSSNLVKWEWSSLPTNLGGLGVGSLAQRNVALMSKWLWRFSKEDDLLWRKIIVSIYGL
        L +S LSSLP+Y  S+ +AP S  K  EKL RNF+W G      S+L+ W   + P   GGLG+  L   N AL+SKWLWR+  E + LWR++I   Y  
Subjt:  LAQSILSSLPLYHFSLLKAPKSIIKATEKLIRNFVWNGGHSKLSSNLVKWEWSSLPTNLGGLGVGSLAQRNVALMSKWLWRFSKEDDLLWRKIIVSIYGL

Query:  SPCGRKTTQFRGKKGNRIWVDIAANYYTFDPCVKLTVNNGRCIKFWEDGWCDDQSLKLLYPDLYIISNKKGATVADCWESESLTWDLDFRKGLFEREIPS
           G   +          W  I  N   F       +NNG  I FW   W  +  L   YP L+ +S  K +++ D W S +  W++ FR+ L +RE+ +
Subjt:  SPCGRKTTQFRGKKGNRIWVDIAANYYTFDPCVKLTVNNGRCIKFWEDGWCDDQSLKLLYPDLYIISNKKGATVADCWESESLTWDLDFRKGLFEREIPS

Query:  WLALLDRIKDVR
        W  +L+ +  +R
Subjt:  WLALLDRIKDVR

A0A5D3BLV7 LINE-1 retrotransposable element ORF2 protein3.3e-8723.29Show/hide
Query:  RSFAD-IVRSKPRAPSNFQGKESSREWKDSQERKGCKEVVRKIDWYEVVVITKRDLHDDWGRILEILHQQWEEPRVINPFQPYKALLKCPSSKMADLLSM
        RS+A  +   +P A S+      S +   S     C      +     VVI +R  HDDW +IL+ L +Q EE    N F   KAL+   S+  A+LL  
Subjt:  RSFAD-IVRSKPRAPSNFQGKESSREWKDSQERKGCKEVVRKIDWYEVVVITKRDLHDDWGRILEILHQQWEEPRVINPFQPYKALLKCPSSKMADLLSM

Query:  NKGWVSFGPAILKTEKWNIKHHSKITCVPSYGGWIKLRNLPLHLWCLSIFKAIGDCLGGFIEYEESHSLLIDCVEVKMKVRDNYCGFLPAGIRIVDGE-D
        NKGW + G   ++ EKW+  +H+    +PSYGGW   R +PLHLW +  F+ IG    G I+  E      + +E ++KVR NY GFLPA +RI D E +
Subjt:  NKGWVSFGPAILKTEKWNIKHHSKITCVPSYGGWIKLRNLPLHLWCLSIFKAIGDCLGGFIEYEESHSLLIDCVEVKMKVRDNYCGFLPAGIRIVDGE-D

Query:  QFHVQVVTYQEGTMLIERVAGIHGSFSSEAAHAF-------------------------------HRGPNDPSFCPADIWRIENGWDFPMVNIQELIEEE
        +F VQVVT+ EG  LIER   +HG+F  +AA +F                                  P+ PS   + I + +     P    +EL+ + 
Subjt:  QFHVQVVTYQEGTMLIERVAGIHGSFSSEAAHAF-------------------------------HRGPNDPSFCPADIWRIENGWDFPMVNIQELIEEE

Query:  T----------------------DSGQQLFSIERKFELSRQLEIGERSKA-AGPS-----FVPES-PAH-------PESSGQAHNGRS----HSENSPSV
                               D G+Q   I+ +   +  L+  +R  +   PS     F P+S PA+       PE   +    RS     S   P+ 
Subjt:  T----------------------DSGQQLFSIERKFELSRQLEIGERSKA-AGPS-----FVPES-PAH-------PESSGQAHNGRS----HSENSPSV

Query:  TVYKRKGVSFANNNQVT-------------TCKRGDIQHTDKNGSPPRAEDTSN----DWMRDTGLDSDPSMSSP---GSRVSKDVDEESPNLYEDNEDS
           + KGV      Q+              T   GD+   D N S     ++ N    D      +   P M  P    S  S + +   P      +  
Subjt:  TVYKRKGVSFANNNQVT-------------TCKRGDIQHTDKNGSPPRAEDTSN----DWMRDTGLDSDPSMSSP---GSRVSKDVDEESPNLYEDNEDS

Query:  VLVDYQRLFEADKELDRGTQKRQLAS--KKFQSSLLPPPVSSAENRHETEAAGHPNCESKLQGYGSLEEV-----IEETPKALAIIQSTREEHFDKINEK
            Y++  E +K+ D    K+QL S  KK    L     SS             N   K+     ++ +     I    K  +         +D  N  
Subjt:  VLVDYQRLFEADKELDRGTQKRQLAS--KKFQSSLLPPPVSSAENRHETEAAGHPNCESKLQGYGSLEEV-----IEETPKALAIIQSTREEHFDKINEK

Query:  DRGYIEG-FSISKEIVLTLRKNNLCIRPIAGSSSKKGQIEGWVSGVYGSCSYRERSQFLQELYDLQGLCQGAWCIGGDFNMIRWAKERLATSNSSRSMRK
             EG FS+S   +L    NN                  W++G+YG    RER  F  EL++LQ L    W +GGD N+IR  +E  +  +SS + R 
Subjt:  DRGYIEG-FSISKEIVLTLRKNNLCIRPIAGSSSKKGQIEGWVSGVYGSCSYRERSQFLQELYDLQGLCQGAWCIGGDFNMIRWAKERLATSNSSRSMRK

Query:  FNRFVRICALSDPPMVNGDFSWSRMGIRVAATRIDRFLLSQSWVQSFGDGKVES---------------------------------------------W
         N F+    L DPP+ N  F+WS +      +RIDRFL + SW   F      +                                             W
Subjt:  FNRFVRICALSDPPMVNGDFSWSRMGIRVAATRIDRFLLSQSWVQSFGDGKVES---------------------------------------------W

Query:  WKELNPTGWAGYRIMEKLKGLKMKIREWSRENL----------------------------SKAGRKL-LKVNLMEIIIDEQRCLNQNCKIKWLQEGDEN
        W+     G+ G+  +++LK L   I+ W +E L                             ++ R+L LK +L E+ + E +   Q  K  WL+EGDEN
Subjt:  WKELNPTGWAGYRIMEKLKGLKMKIREWSRENL----------------------------SKAGRKL-LKVNLMEIIIDEQRCLNQNCKIKWLQEGDEN

Query:  TSFFHRWASTIKNRTFISVMESEEGRILAKNDEIAEEVTGFFSHLYQKD-----------------------------EGQ-------------------
        +SFFHR  S+ + R+FI  ++ EEG I   N+ I+     FFS +Y+                               EG+                   
Subjt:  TSFFHRWASTIKNRTFISVMESEEGRILAKNDEIAEEVTGFFSHLYQKD-----------------------------EGQ-------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------SYLGFPLGGNYHRRIFWDPLLERVEKKIDKWRHFPMSKGGRVTLAQSILSSLPLYHFSLLKAPKSIIKATEK
                                    SYLG PLGGN    +FW  + ++++KK++ W++  +SKGGR+TL +S LSSLP+Y  S+ +AP    K  EK
Subjt:  ----------------------------SYLGFPLGGNYHRRIFWDPLLERVEKKIDKWRHFPMSKGGRVTLAQSILSSLPLYHFSLLKAPKSIIKATEK

Query:  LIRNFVWNGGHSKLSSNLVKWEWSSLPTNLGGLGVGSLAQRNVALMSKWLWRFSKEDDLLWRKIIVSIYGLSPCGRKTTQFRGKKGNRIWVDIAANYYTF
        L R F+W G +    S+L+ W   S     GGLG+  L   N AL+SKWLWR+  E + LWR++I   Y     G   +          W  I  +   F
Subjt:  LIRNFVWNGGHSKLSSNLVKWEWSSLPTNLGGLGVGSLAQRNVALMSKWLWRFSKEDDLLWRKIIVSIYGLSPCGRKTTQFRGKKGNRIWVDIAANYYTF

Query:  DPCVKLTVNNGRCIKFWEDGWCDDQSLKLLYPDLYIISNKKGATVADCWESESLTWDLDFRKGLFEREIPSWLALLD
               +NNG  I FW   W  +  L   YP L+ ++  K  +V D W +    W++ FR+ L +RE  +W  +L+
Subjt:  DPCVKLTVNNGRCIKFWEDGWCDDQSLKLLYPDLYIISNKKGATVADCWESESLTWDLDFRKGLFEREIPSWLALLD

A0A6J1D6X4 uncharacterized protein LOC1110181861.3e-6748.47Show/hide
Query:  RSFADIVRSKPRAPSNFQGKESSREWKDSQERKGC-KEVVRKIDWYEVVVITKRDLHDDWGRILEILHQQWEEPRVINPFQPYKALLKCPSSKMADLLSM
        +S A++V++  R       KE     ++S  RKG   E VR+++W E +VIT+RD HDDW RIL  + +Q E   +INPFQ  KAL+KCPS  +A LL  
Subjt:  RSFADIVRSKPRAPSNFQGKESSREWKDSQERKGC-KEVVRKIDWYEVVVITKRDLHDDWGRILEILHQQWEEPRVINPFQPYKALLKCPSSKMADLLSM

Query:  NKGWVSFGPAILKTEKWNIKHHSKITCVPSYGGWIKLRNLPLHLWCLSIFKAIGDCLGGFIEYEESHSLLIDCVEVKMKVRDNYCGFLPAGIRIVDGEDQ
        NKGWV+FGP  +K E WN   H +    PSYG W+K+RN+PLHLW L+ FKAIG+ LGGFI+Y++++S  I+C +V +KV+ NYCGF+PA I  +DG   
Subjt:  NKGWVSFGPAILKTEKWNIKHHSKITCVPSYGGWIKLRNLPLHLWCLSIFKAIGDCLGGFIEYEESHSLLIDCVEVKMKVRDNYCGFLPAGIRIVDGEDQ

Query:  FHVQVVTYQEGTMLIERVAGIHGSFSSEAAHAFHRGPNDPSFCPADIWRIENGWDFPMVNIQ
        F  +VV++++   L  +  GIHG FSSEAA +FH+G  + S    D WR+ENG ++P VNIQ
Subjt:  FHVQVVTYQEGTMLIERVAGIHGSFSSEAAHAFHRGPNDPSFCPADIWRIENGWDFPMVNIQ

A5BTG1 zf-RVT domain-containing protein1.1e-7430.68Show/hide
Query:  WVSGVYGSCSYRERSQFLQELYDLQGLCQGAWCIGGDFNMIRWAKERLATSNSSRSMRKFNRFVRICALSDPPMVNGDFSWSRM------------GIRV
        W+S VYG  +   R  F  EL D+ GL    WC+GGDFN+IR + E+L  S  + +M+ F+ F+R C L D P+ +  F+WS               + V
Subjt:  WVSGVYGSCSYRERSQFLQELYDLQGLCQGAWCIGGDFNMIRWAKERLATSNSSRSMRKFNRFVRICALSDPPMVNGDFSWSRM------------GIRV

Query:  AATRIDR-----FLLSQSWVQ------SFGDGKVESWWKELNPTGWAGYRIMEKLKGLKMKIREWSR--------------------ENLSKAG------
          T   +     F     W+Q      SFG     SWW+E    GW G++ M KL+ +K K++EW++                    + L + G      
Subjt:  AATRIDR-----FLLSQSWVQ------SFGDGKVESWWKELNPTGWAGYRIMEKLKGLKMKIREWSR--------------------ENLSKAG------

Query:  ---RKLLKVNLMEIIIDEQRCLNQNCKIKWLQEGDENTSFFHRWASTIKNRTFISVMESEEGRILAKNDEIAEEVTGFFSHLYQKDEGQS----------
           R L K  L E+I+ E+    Q  ++KW++EGD N+ FFH+ A+   NR  I  +E+E G +L  ++ I EE+  +F  LY    G+           
Subjt:  ---RKLLKVNLMEIIIDEQRCLNQNCKIKWLQEGDENTSFFHRWASTIKNRTFISVMESEEGRILAKNDEIAEEVTGFFSHLYQKDEGQS----------

Query:  -----------------------YLGFPLGGNYHRRIFWDPLLERVEKKIDKWRHFPMSKGGRVTLAQSILSSLPLYHFSLLKAPKSIIKATEKLIRNFV
                               YLG P+GGN     FWDP++ER+ +++D W+   +S GGR+TL QS L+ +P Y  SL K   S+    E+L R+F+
Subjt:  -----------------------YLGFPLGGNYHRRIFWDPLLERVEKKIDKWRHFPMSKGGRVTLAQSILSSLPLYHFSLLKAPKSIIKATEKLIRNFV

Query:  WNGGHSKLSSNLVKWEWSSLPTNLGGLGVGSLAQRNVALMSKWLWRFSKEDDLLWRKIIVSIYGLSPCGRKTTQFRGKKGNRIWVDIAANYYTFDPCVKL
        W+G       +L+                   + RNVAL+ KWLWR+ +E   LW ++I+SIYG    G              W  IA  +  F    + 
Subjt:  WNGGHSKLSSNLVKWEWSSLPTNLGGLGVGSLAQRNVALMSKWLWRFSKEDDLLWRKIIVSIYGLSPCGRKTTQFRGKKGNRIWVDIAANYYTFDPCVKL

Query:  TVNNGRCIKFWEDGWCDDQSLKLLYPDLYIISNKKGATVAD-CWESESLTWDLDFRKGLFEREI---PSWLALLDRI
         V +G  I+FW+D W  DQ L   YP L  +   K A ++     S   +W+ +FR+ L + EI    S +  LDR+
Subjt:  TVNNGRCIKFWEDGWCDDQSLKLLYPDLYIISNKKGATVAD-CWESESLTWDLDFRKGLFEREI---PSWLALLDRI

M5WPQ5 Reverse transcriptase domain-containing protein4.5e-6822.83Show/hide
Query:  KFQSSLLPPPVSSAENRHETEAAGHPNCESKLQGYGSLEEVIEETPKALAIIQSTREEHFDK------INEKDRGYIEGFSISKE---IVLTLRKNNLCI
        K   SLLP   + A  RH        N + +    GS +E +      + I+  T++E  D+         + + ++   S+ +     VL   ++   I
Subjt:  KFQSSLLPPPVSSAENRHETEAAGHPNCESKLQGYGSLEEVIEETPKALAIIQSTREEHFDK------INEKDRGYIEGFSISKE---IVLTLRKNNLCI

Query:  RPIAGSSSKKGQI------EGWVSGVYGSCSYRERSQFLQELYDLQGLCQGAWCIGGDFNMIRWAKERLATSNSSRSMRKFNRFVRICALSDPPMVNGDF
          + G  S   +I      + W+SG+YG C  RER+ F +EL DL G C   WC+GGDFN++R++ E+      ++SMR FN F++   L DP ++N  F
Subjt:  RPIAGSSSKKGQI------EGWVSGVYGSCSYRERSQFLQELYDLQGLCQGAWCIGGDFNMIRWAKERLATSNSSRSMRKFNRFVRICALSDPPMVNGDF

Query:  SWSRMGIRVAATRIDRFLLSQSWVQSFG-------------------------------------------DGKVESWWKELNPTGWAGYRIMEKLKGLK
        +WS +       R+DRFL+S SW + F                                              K++ WW E    GW GY+ M +LK LK
Subjt:  SWSRMGIRVAATRIDRFLLSQSWVQSFG-------------------------------------------DGKVESWWKELNPTGWAGYRIMEKLKGLK

Query:  MKIREWSRENLSKAGRKLLKVNLMEIIID-----------------------------EQRCLNQNCKIKWLQEGDENTSFFHRWASTIKNRTFISVMES
         K++ WS+E      R L +     +++D                             E+    Q  K+KW +EGD NT FFHR A+  + R +I  +E 
Subjt:  MKIREWSRENLSKAGRKLLKVNLMEIIID-----------------------------EQRCLNQNCKIKWLQEGDENTSFFHRWASTIKNRTFISVMES

Query:  EEGRILAKNDEIAEEVTGFFSHLY----------------------------------------------------------------------------
        E+  ++  +  I  EV  FF  LY                                                                            
Subjt:  EEGRILAKNDEIAEEVTGFFSHLY----------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------QKDEG-------
                                                                                                QK +G       
Subjt:  ----------------------------------------------------------------------------------------QKDEG-------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------------------------QSYLGFPLGGNYHRRIFWDPLLERVEKKIDKWRHFPMSKGGRVTLAQSILSSLPLYH
                                                     YLG PLGGN     FW+P++E+VEK++ KW+   +SKGGR+TL Q++LSS+P Y+
Subjt:  -------------------------------------------QSYLGFPLGGNYHRRIFWDPLLERVEKKIDKWRHFPMSKGGRVTLAQSILSSLPLYH

Query:  FSLLKAPKSIIKATEKLIRNFVWNGGHSKLSSNLVKWEWSSLPTNLGGLGVGSLAQRNVALMSKWLWRFSKEDDLLWRKIIVSIYGLSPCGRKTTQFRGK
         SL K P  +    E+L+RNF+W G       +LV+WE  +     GGLG+GSL +RN AL +KWLWRF  E + LW +II S YG+   G  T +    
Subjt:  FSLLKAPKSIIKATEKLIRNFVWNGGHSKLSSNLVKWEWSSLPTNLGGLGVGSLAQRNVALMSKWLWRFSKEDDLLWRKIIVSIYGLSPCGRKTTQFRGK

Query:  KGNRIWVDIAANYYTFDPCVKLTVNNGRCIKFWEDGWCDDQSLKLLYPDLYIISNKKGATVADCWESE---SLTWDLDFRKGLFEREIPSWLALLDRIKD
             W +I+  Y +F  C + +V NG  I+FWED W  +  LK L+P L  +S +K  ++A C+ +     L WD DFR+ L E E+   + LLD + +
Subjt:  KGNRIWVDIAANYYTFDPCVKLTVNNGRCIKFWEDGWCDDQSLKLLYPDLYIISNKKGATVADCWESE---SLTWDLDFRKGLFEREIPSWLALLDRIKD

Query:  VRL
        VRL
Subjt:  VRL

SwissProt top hitse value%identityAlignment
P08548 LINE-1 reverse transcriptase homolog1.5e-0725Show/hide
Query:  WDPLLERVEKKIDKWRHFPMSKGGRVTLAQSILSSLPLYHFSL--LKAPKSIIKATEKLIRNFVWNGGHSKLSSNLVKWEWSSLPTNLGGLGVGSLA--Q
        ++ L + + + ++KW++ P S  GR+ + +  +    +Y+F+   +KAP S  K  EK+I +F+WN    +++  L+     S     GG+ +  L    
Subjt:  WDPLLERVEKKIDKWRHFPMSKGGRVTLAQSILSSLPLYHFSL--LKAPKSIIKATEKLIRNFVWNGGHSKLSSNLVKWEWSSLPTNLGGLGVGSLA--Q

Query:  RNVALMSKWLWRFSKEDDLLWRKI
        +++ + + W W  ++E D +W +I
Subjt:  RNVALMSKWLWRFSKEDDLLWRKI

P0C2F6 Putative ribonuclease H protein At1g657508.9e-2130.94Show/hide
Query:  LLERVEKKIDKWRHFPMSKGGRVTLAQSILSSLPLYHFSLLKAPKSIIKATEKLIRNFVWNGGHSKLSSNLVKWEWSSLPTNLGGLGVGSLAQRNVALMS
        +LERV  ++  WR   +S  GR+TL +++LSS+P++  S +  P+SI+   ++L R F+W     K   +LVKW     P   GGLGV +    N AL+S
Subjt:  LLERVEKKIDKWRHFPMSKGGRVTLAQSILSSLPLYHFSLLKAPKSIIKATEKLIRNFVWNGGHSKLSSNLVKWEWSSLPTNLGGLGVGSLAQRNVALMS

Query:  KWLWRFSKEDDLLWRKIIVSIYGLSPCGRKTTQFRGKKGNRIWVDIAANYY-TFDPCVKLTVNNGRCIKFWEDGWCDDQSL
        K  WR  +E + LW  ++   Y +               +  W  IA          V     +G+ I+FW D W   + L
Subjt:  KWLWRFSKEDDLLWRKIIVSIYGLSPCGRKTTQFRGKKGNRIWVDIAANYY-TFDPCVKLTVNNGRCIKFWEDGWCDDQSL

P11369 LINE-1 retrotransposable element ORF2 protein9.9e-0423.14Show/hide
Query:  LLERVEKKIDKWRHFPMSKGGRVTLAQSILSSLPLYHFSL--LKAPKSIIKATEKLIRNFVWNGGHSKLSSNLVKWEWSSLPTNLGGLGVGSLA--QRNV
        L + +++ + +W+  P S  GR+ + +  +    +Y F+   +K P       E  I  FVWN    +++ +L+K + +S     GG+ +  L    R +
Subjt:  LLERVEKKIDKWRHFPMSKGGRVTLAQSILSSLPLYHFSL--LKAPKSIIKATEKLIRNFVWNGGHSKLSSNLVKWEWSSLPTNLGGLGVGSLA--QRNV

Query:  ALMSKWLWRFSKEDDLLWRKI
         + + W W   ++ D  W +I
Subjt:  ALMSKWLWRFSKEDDLLWRKI

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATCTCTCAGATGACCTCCGATGAGTTCTCCGACTGGGTTCTGAGCGTTTTGCGAGGGAGAGTAGGGTGTTTTTGGGATCTGGTAGACTTTCCAGAGGGGAGGGTTGG
GTTGGGTGTTTTCTTTGGTTATCATCGGAGGGCACTAGAGATTATGAATGATCTTCTTCTCTCACTGGCTACTCAAAAGTTCTTTCGAAAAACAGATTGCACAAATGGGT
TCATATGGATTCAGAAAATTTCGAACAAGCGTGGAAGTTTCCTTGAGATTAGAAAGTTATTAACTCAGACTAACCTGGCCACACCAAATAGAAGGAGAGATAAGGGAAGA
TCCTTCGCTGATATTGTGAGAAGCAAGCCACGAGCGCCAAGTAATTTTCAGGGGAAAGAGTCCTCCAGAGAATGGAAAGATTCCCAAGAGCGCAAAGGATGTAAAGAAGT
AGTGAGGAAAATTGATTGGTACGAAGTTGTGGTGATCACCAAGCGGGACCTCCACGATGACTGGGGGCGCATATTAGAGATTTTGCATCAGCAATGGGAAGAACCTCGTG
TTATAAACCCCTTTCAACCTTATAAGGCCCTTCTTAAATGCCCCTCATCCAAAATGGCAGACTTGCTTTCAATGAACAAAGGTTGGGTCAGCTTTGGACCAGCCATCTTA
AAAACTGAAAAATGGAACATCAAGCATCATTCCAAAATCACATGTGTCCCTAGCTATGGGGGCTGGATTAAACTAAGGAACCTTCCTCTGCATTTATGGTGCCTAAGTAT
TTTTAAGGCGATTGGCGACTGTTTAGGTGGGTTCATTGAATACGAAGAAAGCCACTCCTTGCTCATTGACTGCGTGGAAGTGAAGATGAAAGTTAGAGATAATTACTGTG
GTTTTCTACCGGCGGGAATAAGGATTGTAGATGGTGAAGATCAGTTTCACGTCCAAGTGGTAACTTACCAAGAGGGTACCATGTTGATAGAAAGGGTCGCCGGAATACAC
GGGAGCTTCTCGTCGGAGGCGGCTCACGCCTTTCACAGAGGGCCGAATGATCCTTCCTTTTGTCCAGCAGATATATGGAGAATTGAAAATGGTTGGGATTTTCCAATGGT
TAATATCCAAGAGTTAATTGAGGAGGAGACTGATAGTGGGCAACAACTGTTCAGCATTGAAAGAAAATTTGAATTGTCCCGCCAATTGGAAATAGGGGAACGTAGTAAAG
CTGCTGGGCCGTCTTTTGTCCCCGAGTCTCCAGCCCATCCTGAAAGTTCAGGACAAGCCCATAATGGGAGGAGTCATTCGGAAAATAGCCCATCGGTCACAGTGTACAAG
AGAAAAGGGGTTTCCTTCGCCAATAATAATCAAGTTACCACTTGTAAACGAGGGGACATTCAACACACAGATAAAAACGGAAGCCCACCAAGAGCAGAGGACACGTCGAA
CGATTGGATGAGGGACACAGGTCTGGATTCGGATCCGTCTATGTCCAGCCCAGGAAGTAGAGTTTCGAAAGATGTGGATGAAGAAAGCCCAAACCTCTATGAAGATAACG
AGGACTCAGTTCTGGTAGATTACCAACGACTTTTTGAGGCCGACAAGGAACTTGACAGAGGCACCCAAAAGCGGCAACTCGCAAGTAAGAAATTTCAATCTTCCCTTCTT
CCCCCTCCTGTCAGTAGTGCTGAAAATAGACATGAAACGGAAGCGGCTGGACATCCAAATTGTGAGTCTAAGTTGCAAGGATATGGCAGTCTTGAAGAAGTGATTGAAGA
AACTCCAAAGGCCTTGGCTATCATACAATCGACTAGGGAAGAGCATTTTGACAAGATTAATGAGAAGGACAGAGGTTATATTGAGGGCTTTTCGATCAGTAAAGAAATAG
TATTAACGCTGAGGAAAAATAACCTGTGTATTAGGCCGATAGCTGGTTCTAGTAGCAAAAAGGGGCAGATTGAGGGGTGGGTCTCCGGGGTTTATGGGTCATGCTCCTAC
AGAGAAAGGAGTCAATTTTTACAAGAGCTCTATGATCTTCAAGGGTTGTGCCAGGGGGCTTGGTGCATTGGTGGCGACTTTAATATGATTAGATGGGCTAAGGAAAGATT
AGCTACTTCAAATTCCTCGAGAAGCATGAGGAAGTTTAACCGCTTTGTGAGAATCTGTGCCCTCAGCGACCCGCCCATGGTTAATGGAGATTTCTCGTGGTCTAGAATGG
GGATTAGAGTTGCAGCTACACGTATTGATAGATTCCTCTTGTCTCAGTCTTGGGTTCAGTCGTTTGGGGATGGCAAAGTTGAATCCTGGTGGAAGGAGCTAAACCCGACC
GGCTGGGCTGGGTATAGAATCATGGAGAAGTTGAAAGGATTGAAAATGAAAATTAGAGAGTGGAGTAGGGAGAATCTCTCCAAAGCAGGGAGGAAATTGCTTAAGGTGAA
TCTGATGGAGATTATCATTGATGAGCAAAGGTGCCTCAACCAAAATTGCAAGATTAAATGGCTGCAAGAGGGTGATGAAAACACTTCTTTCTTCCACAGATGGGCCTCGA
CCATAAAAAATAGAACCTTCATCTCTGTTATGGAATCTGAGGAGGGTAGAATCCTAGCCAAAAATGACGAGATTGCAGAGGAAGTCACTGGTTTTTTTAGCCATTTGTAC
CAGAAGGATGAGGGGCAGAGTTATTTGGGATTTCCTCTCGGTGGGAATTATCATAGAAGGATATTCTGGGATCCGCTTTTGGAGAGGGTTGAGAAGAAAATCGACAAATG
GAGACACTTCCCTATGTCCAAAGGAGGTAGGGTTACCCTCGCCCAATCAATTTTAAGTAGTCTCCCTTTATATCATTTCTCTTTGCTCAAAGCTCCTAAATCGATTATCA
AGGCTACGGAAAAATTGATCAGAAATTTTGTGTGGAATGGTGGCCATTCTAAACTGAGCTCTAACCTTGTCAAGTGGGAATGGTCCTCGTTACCTACCAATCTAGGGGGC
TTGGGTGTTGGCTCCCTAGCTCAAAGGAATGTGGCACTTATGTCTAAATGGTTGTGGAGATTTTCTAAGGAAGATGACTTGCTTTGGAGGAAGATCATTGTGAGTATTTA
TGGGCTCTCTCCTTGTGGGCGAAAAACCACCCAATTTAGAGGTAAAAAAGGTAATCGGATTTGGGTAGATATTGCTGCCAACTACTATACTTTTGACCCTTGTGTCAAGC
TCACGGTCAACAACGGTAGATGCATAAAATTCTGGGAGGATGGTTGGTGTGATGACCAATCCCTCAAGCTTCTATATCCGGACCTATACATCATCTCTAACAAAAAGGGG
GCGACCGTGGCTGATTGCTGGGAGTCTGAGTCACTTACCTGGGACTTGGATTTTAGAAAGGGCCTGTTCGAAAGAGAGATCCCTAGTTGGTTGGCCTTGCTGGACAGAAT
CAAAGATGTGAGGCTGATGGACGAGCAAGATTAG
mRNA sequenceShow/hide mRNA sequence
ATGATCTCTCAGATGACCTCCGATGAGTTCTCCGACTGGGTTCTGAGCGTTTTGCGAGGGAGAGTAGGGTGTTTTTGGGATCTGGTAGACTTTCCAGAGGGGAGGGTTGG
GTTGGGTGTTTTCTTTGGTTATCATCGGAGGGCACTAGAGATTATGAATGATCTTCTTCTCTCACTGGCTACTCAAAAGTTCTTTCGAAAAACAGATTGCACAAATGGGT
TCATATGGATTCAGAAAATTTCGAACAAGCGTGGAAGTTTCCTTGAGATTAGAAAGTTATTAACTCAGACTAACCTGGCCACACCAAATAGAAGGAGAGATAAGGGAAGA
TCCTTCGCTGATATTGTGAGAAGCAAGCCACGAGCGCCAAGTAATTTTCAGGGGAAAGAGTCCTCCAGAGAATGGAAAGATTCCCAAGAGCGCAAAGGATGTAAAGAAGT
AGTGAGGAAAATTGATTGGTACGAAGTTGTGGTGATCACCAAGCGGGACCTCCACGATGACTGGGGGCGCATATTAGAGATTTTGCATCAGCAATGGGAAGAACCTCGTG
TTATAAACCCCTTTCAACCTTATAAGGCCCTTCTTAAATGCCCCTCATCCAAAATGGCAGACTTGCTTTCAATGAACAAAGGTTGGGTCAGCTTTGGACCAGCCATCTTA
AAAACTGAAAAATGGAACATCAAGCATCATTCCAAAATCACATGTGTCCCTAGCTATGGGGGCTGGATTAAACTAAGGAACCTTCCTCTGCATTTATGGTGCCTAAGTAT
TTTTAAGGCGATTGGCGACTGTTTAGGTGGGTTCATTGAATACGAAGAAAGCCACTCCTTGCTCATTGACTGCGTGGAAGTGAAGATGAAAGTTAGAGATAATTACTGTG
GTTTTCTACCGGCGGGAATAAGGATTGTAGATGGTGAAGATCAGTTTCACGTCCAAGTGGTAACTTACCAAGAGGGTACCATGTTGATAGAAAGGGTCGCCGGAATACAC
GGGAGCTTCTCGTCGGAGGCGGCTCACGCCTTTCACAGAGGGCCGAATGATCCTTCCTTTTGTCCAGCAGATATATGGAGAATTGAAAATGGTTGGGATTTTCCAATGGT
TAATATCCAAGAGTTAATTGAGGAGGAGACTGATAGTGGGCAACAACTGTTCAGCATTGAAAGAAAATTTGAATTGTCCCGCCAATTGGAAATAGGGGAACGTAGTAAAG
CTGCTGGGCCGTCTTTTGTCCCCGAGTCTCCAGCCCATCCTGAAAGTTCAGGACAAGCCCATAATGGGAGGAGTCATTCGGAAAATAGCCCATCGGTCACAGTGTACAAG
AGAAAAGGGGTTTCCTTCGCCAATAATAATCAAGTTACCACTTGTAAACGAGGGGACATTCAACACACAGATAAAAACGGAAGCCCACCAAGAGCAGAGGACACGTCGAA
CGATTGGATGAGGGACACAGGTCTGGATTCGGATCCGTCTATGTCCAGCCCAGGAAGTAGAGTTTCGAAAGATGTGGATGAAGAAAGCCCAAACCTCTATGAAGATAACG
AGGACTCAGTTCTGGTAGATTACCAACGACTTTTTGAGGCCGACAAGGAACTTGACAGAGGCACCCAAAAGCGGCAACTCGCAAGTAAGAAATTTCAATCTTCCCTTCTT
CCCCCTCCTGTCAGTAGTGCTGAAAATAGACATGAAACGGAAGCGGCTGGACATCCAAATTGTGAGTCTAAGTTGCAAGGATATGGCAGTCTTGAAGAAGTGATTGAAGA
AACTCCAAAGGCCTTGGCTATCATACAATCGACTAGGGAAGAGCATTTTGACAAGATTAATGAGAAGGACAGAGGTTATATTGAGGGCTTTTCGATCAGTAAAGAAATAG
TATTAACGCTGAGGAAAAATAACCTGTGTATTAGGCCGATAGCTGGTTCTAGTAGCAAAAAGGGGCAGATTGAGGGGTGGGTCTCCGGGGTTTATGGGTCATGCTCCTAC
AGAGAAAGGAGTCAATTTTTACAAGAGCTCTATGATCTTCAAGGGTTGTGCCAGGGGGCTTGGTGCATTGGTGGCGACTTTAATATGATTAGATGGGCTAAGGAAAGATT
AGCTACTTCAAATTCCTCGAGAAGCATGAGGAAGTTTAACCGCTTTGTGAGAATCTGTGCCCTCAGCGACCCGCCCATGGTTAATGGAGATTTCTCGTGGTCTAGAATGG
GGATTAGAGTTGCAGCTACACGTATTGATAGATTCCTCTTGTCTCAGTCTTGGGTTCAGTCGTTTGGGGATGGCAAAGTTGAATCCTGGTGGAAGGAGCTAAACCCGACC
GGCTGGGCTGGGTATAGAATCATGGAGAAGTTGAAAGGATTGAAAATGAAAATTAGAGAGTGGAGTAGGGAGAATCTCTCCAAAGCAGGGAGGAAATTGCTTAAGGTGAA
TCTGATGGAGATTATCATTGATGAGCAAAGGTGCCTCAACCAAAATTGCAAGATTAAATGGCTGCAAGAGGGTGATGAAAACACTTCTTTCTTCCACAGATGGGCCTCGA
CCATAAAAAATAGAACCTTCATCTCTGTTATGGAATCTGAGGAGGGTAGAATCCTAGCCAAAAATGACGAGATTGCAGAGGAAGTCACTGGTTTTTTTAGCCATTTGTAC
CAGAAGGATGAGGGGCAGAGTTATTTGGGATTTCCTCTCGGTGGGAATTATCATAGAAGGATATTCTGGGATCCGCTTTTGGAGAGGGTTGAGAAGAAAATCGACAAATG
GAGACACTTCCCTATGTCCAAAGGAGGTAGGGTTACCCTCGCCCAATCAATTTTAAGTAGTCTCCCTTTATATCATTTCTCTTTGCTCAAAGCTCCTAAATCGATTATCA
AGGCTACGGAAAAATTGATCAGAAATTTTGTGTGGAATGGTGGCCATTCTAAACTGAGCTCTAACCTTGTCAAGTGGGAATGGTCCTCGTTACCTACCAATCTAGGGGGC
TTGGGTGTTGGCTCCCTAGCTCAAAGGAATGTGGCACTTATGTCTAAATGGTTGTGGAGATTTTCTAAGGAAGATGACTTGCTTTGGAGGAAGATCATTGTGAGTATTTA
TGGGCTCTCTCCTTGTGGGCGAAAAACCACCCAATTTAGAGGTAAAAAAGGTAATCGGATTTGGGTAGATATTGCTGCCAACTACTATACTTTTGACCCTTGTGTCAAGC
TCACGGTCAACAACGGTAGATGCATAAAATTCTGGGAGGATGGTTGGTGTGATGACCAATCCCTCAAGCTTCTATATCCGGACCTATACATCATCTCTAACAAAAAGGGG
GCGACCGTGGCTGATTGCTGGGAGTCTGAGTCACTTACCTGGGACTTGGATTTTAGAAAGGGCCTGTTCGAAAGAGAGATCCCTAGTTGGTTGGCCTTGCTGGACAGAAT
CAAAGATGTGAGGCTGATGGACGAGCAAGATTAG
Protein sequenceShow/hide protein sequence
MISQMTSDEFSDWVLSVLRGRVGCFWDLVDFPEGRVGLGVFFGYHRRALEIMNDLLLSLATQKFFRKTDCTNGFIWIQKISNKRGSFLEIRKLLTQTNLATPNRRRDKGR
SFADIVRSKPRAPSNFQGKESSREWKDSQERKGCKEVVRKIDWYEVVVITKRDLHDDWGRILEILHQQWEEPRVINPFQPYKALLKCPSSKMADLLSMNKGWVSFGPAIL
KTEKWNIKHHSKITCVPSYGGWIKLRNLPLHLWCLSIFKAIGDCLGGFIEYEESHSLLIDCVEVKMKVRDNYCGFLPAGIRIVDGEDQFHVQVVTYQEGTMLIERVAGIH
GSFSSEAAHAFHRGPNDPSFCPADIWRIENGWDFPMVNIQELIEEETDSGQQLFSIERKFELSRQLEIGERSKAAGPSFVPESPAHPESSGQAHNGRSHSENSPSVTVYK
RKGVSFANNNQVTTCKRGDIQHTDKNGSPPRAEDTSNDWMRDTGLDSDPSMSSPGSRVSKDVDEESPNLYEDNEDSVLVDYQRLFEADKELDRGTQKRQLASKKFQSSLL
PPPVSSAENRHETEAAGHPNCESKLQGYGSLEEVIEETPKALAIIQSTREEHFDKINEKDRGYIEGFSISKEIVLTLRKNNLCIRPIAGSSSKKGQIEGWVSGVYGSCSY
RERSQFLQELYDLQGLCQGAWCIGGDFNMIRWAKERLATSNSSRSMRKFNRFVRICALSDPPMVNGDFSWSRMGIRVAATRIDRFLLSQSWVQSFGDGKVESWWKELNPT
GWAGYRIMEKLKGLKMKIREWSRENLSKAGRKLLKVNLMEIIIDEQRCLNQNCKIKWLQEGDENTSFFHRWASTIKNRTFISVMESEEGRILAKNDEIAEEVTGFFSHLY
QKDEGQSYLGFPLGGNYHRRIFWDPLLERVEKKIDKWRHFPMSKGGRVTLAQSILSSLPLYHFSLLKAPKSIIKATEKLIRNFVWNGGHSKLSSNLVKWEWSSLPTNLGG
LGVGSLAQRNVALMSKWLWRFSKEDDLLWRKIIVSIYGLSPCGRKTTQFRGKKGNRIWVDIAANYYTFDPCVKLTVNNGRCIKFWEDGWCDDQSLKLLYPDLYIISNKKG
ATVADCWESESLTWDLDFRKGLFEREIPSWLALLDRIKDVRLMDEQD