; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg002190 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg002190
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationscaffold1:30518643..30524245
RNA-Seq ExpressionSpg002190
SyntenySpg002190
Gene Ontology termsGO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR005135 - Endonuclease/exonuclease/phosphatase
IPR025558 - Domain of unknown function DUF4283
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN74312.1 hypothetical protein VITISV_037520 [Vitis vinifera]2.7e-12227.12Show/hide
Query:  KFLTWNVRGLGSWKKRALIKKTIQQQNPGIVLIQETKKSLICSRIIKSLWSSSHIGWTSLDSVGASGGILIMWSEPEFSVKETIQGLFSLSIHIVLADNF
        K L+WN RGLGS KKR  +++ +  QNP +V+ QETK+ +   R++ S+W    + W +L + GASGGI+I+W   +F+  E + G FS+++ +   +  
Subjt:  KFLTWNVRGLGSWKKRALIKKTIQQQNPGIVLIQETKKSLICSRIIKSLWSSSHIGWTSLDSVGASGGILIMWSEPEFSVKETIQGLFSLSIHIVLADNF

Query:  SFWLSTIYGPSRHADRSEFWNELHDLAGLGGDNWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIADYHLID-TPLQNGCYTWSSCGENHYCSLIDRFL
        SFWLS++YGP++   R +FW EL DL GL    W +GGDFNV R   EK     +T +MR F+++I +  L+D  PL+N  +TWS+   +  C  +DRFL
Subjt:  SFWLSTIYGPSRHADRSEFWNELHDLAGLGGDNWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIADYHLID-TPLQNGCYTWSSCGENHYCSLIDRFL

Query:  MTDTCLNKFGVARFLRLDRVTSDHYPCTLSFGDLSWGPCPFRFENSWLKKDSFRCLMENWWSQNTIQGWPGHGFMMKLKGLKSEIRKWN-----------
              + F       L   TSDH P  L      WGP PFRFEN WL    F+    +WW + T++GW GH FM KLK +KS++++WN           
Subjt:  MTDTCLNKFGVARFLRLDRVTSDHYPCTLSFGDLSWGPCPFRFENSWLKKDSFRCLMENWWSQNTIQGWPGHGFMMKLKGLKSEIRKWN-----------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------------LSQRSS-----------------------------------------------------------------------
                               +S+ S+                                                                       
Subjt:  -----------------------LSQRSS-----------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------ADQLPSL---------------------VSQLKLLDDT-----EDMDTLANMFDIVKIFELASGLNINYSKSEVLGIHLEESELD
                       AD L  L                     VS L+  DDT       +D L N+  I+ +F   SGL IN  KS + GI+  +  L 
Subjt:  ---------------ADQLPSL---------------------VSQLKLLDDT-----EDMDTLANMFDIVKIFELASGLNINYSKSEVLGIHLEESELD

Query:  WLTSTFGCKQGFWPSTYLGLPLGGNSKTIPFWQPVIERIQHKLHSWKYSYISKGGRHTLTQAVLSSMPIYYLSLFKLSGQIAKTLDKLLRDFFWEGSRGD
         L     C+   WP +YLGLPLGGN KTI FW PV+ERI  +L  WK +Y+S GGR TL Q+ LS +P Y+LSLFK+   IA  ++K+ RDF W G+   
Subjt:  WLTSTFGCKQGFWPSTYLGLPLGGNSKTIPFWQPVIERIQHKLHSWKYSYISKGGRHTLTQAVLSSMPIYYLSLFKLSGQIAKTLDKLLRDFFWEGSRGD

Query:  GGMHNINWATVQLPHLMGGIGIGNFQNRNLALLAKWIWRFLHEENSLWHKLIVAKYYNSELPSLW-PSIIQKSSHKSPWRFITSTIDLVSSRVKRRLGNG
           H I W  V  P  MGG+G G    RN ALL KW+WRF  E + LWHK+I + Y     P+ W  +++ + SH+ PW+ I       S  V+  +GNG
Subjt:  GGMHNINWATVQLPHLMGGIGIGNFQNRNLALLAKWIWRFLHEENSLWHKLIVAKYYNSELPSLW-PSIIQKSSHKSPWRFITSTIDLVSSRVKRRLGNG

Query:  LATSFWHDSWLSCGVLATNFPRLYRLTD-RPRSLVGETWIATQSAWDLNLRRNLNDVETEEWMDLSLILSSIRLQ-NRNDSWIWPLESSNIFSVKSLMED
            FW D W     L   F  LYR++  R  ++      +   +W+ N RRNL D E +    L   L S+ L  +  DS  W L SS  FSVKS    
Subjt:  LATSFWHDSWLSCGVLATNFPRLYRLTD-RPRSLVGETWIATQSAWDLNLRRNLNDVETEEWMDLSLILSSIRLQ-NRNDSWIWPLESSNIFSVKSLMED

Query:  LVDRPNMANDL-YKVIWSDFYPKKIKIFLWELSHGAINTADRLQRRMPHFHLSPSWCIMCAASSEHPRHLFVHCTFASRYWSEILDAFGWSIVFPNCIKD
        L    N    L  K +WS   P K+K   W ++HG +NT D+LQ R P+  L P WCI+C  + E   HLF+HC      W  + +  G   V P  I+D
Subjt:  LVDRPNMANDL-YKVIWSDFYPKKIKIFLWELSHGAINTADRLQRRMPHFHLSPSWCIMCAASSEHPRHLFVHCTFASRYWSEILDAFGWSIVFPNCIKD

Query:  VLTLIF---------------------------------VDHPFRG--------EKKILWLALNRVFFWFLWGERNSRIF
        +L + F                                 V     G        ++K++WL      FW +W  RNS  F
Subjt:  VLTLIF---------------------------------VDHPFRG--------EKKILWLALNRVFFWFLWGERNSRIF

RVW16209.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera]2.6e-12526.75Show/hide
Query:  KFLTWNVRGLGSWKKRALIKKTIQQQNPGIVLIQETKKSLICSRIIKSLWSSSHIGWTSLDSVGASGGILIMWSEPEFSVKETIQGLFSLSIHIVLADNF
        K ++WNVRGLGS  KR +IK  ++ +NP +V+IQETKK     R + S+W+  +  W +L + GASGGILI+W     S +E + G FS+S+   L    
Subjt:  KFLTWNVRGLGSWKKRALIKKTIQQQNPGIVLIQETKKSLICSRIIKSLWSSSHIGWTSLDSVGASGGILIMWSEPEFSVKETIQGLFSLSIHIVLADNF

Query:  SFWLSTIYGPSRHADRSEFWNELHDLAGLGGDNWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIADYHLIDTPLQNGCYTWSSCGENHYCSLIDRFLM
          W+S +YGP+  + R +FW EL D+ GL    W +GGDFNV R S EK  G  +T SMR F+ +I++  L+D PL+N  +TWS+  E+  C  +DRFL 
Subjt:  SFWLSTIYGPSRHADRSEFWNELHDLAGLGGDNWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIADYHLIDTPLQNGCYTWSSCGENHYCSLIDRFLM

Query:  TDTCLNKFGVARFLRLDRVTSDHYPCTLSFGDLSWGPCPFRFENSWLKKDSFRCLMENWWSQNTIQGWPGHGFMMKLKGLKSEIRKWN------------
        ++     F       L R TSDH+P  +      WGP PFRFEN WL+  +F+    +WWS     GW GH FM +L+ +K+++++WN            
Subjt:  TDTCLNKFGVARFLRLDRVTSDHYPCTLSFGDLSWGPCPFRFENSWLKKDSFRCLMENWWSQNTIQGWPGHGFMMKLKGLKSEIRKWN------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------------------------------------------------------LSQRSS---------------------
                                                                                 LS+R S                     
Subjt:  -------------------------------------------------------------------------LSQRSS---------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------ADQLPSL---------------------VSQLKLLDDT--------EDMDTLANMFDIVKIFELASGLNINYSKSEVLG
                             AD L  +                     VS L+  DD         E++ TL +   ++ +F    GL +N +KS + G
Subjt:  ---------------------ADQLPSL---------------------VSQLKLLDDT--------EDMDTLANMFDIVKIFELASGLNINYSKSEVLG

Query:  IHLEESELDWLTSTFGCKQGFWPSTYLGLPLGGNSKTIPFWQPVIERIQHKLHSWKYSYISKGGRHTLTQAVLSSMPIYYLSLFKLSGQIAKTLDKLLRD
        I+L+++ L  L     CK   WP  YLGLPLGGN K+  FW PV+ERI  +L  W+ +Y+S GGR TL Q+ L+ +P Y+LSLFK+   +A  +++L RD
Subjt:  IHLEESELDWLTSTFGCKQGFWPSTYLGLPLGGNSKTIPFWQPVIERIQHKLHSWKYSYISKGGRHTLTQAVLSSMPIYYLSLFKLSGQIAKTLDKLLRD

Query:  FFWEGSRGDGGMHNINWATVQLPHLMGGIGIGNFQNRNLALLAKWIWRFLHEENSLWHKLIVAKYYNSELPSLWPSIIQKSSHKSPWRFITSTIDLVSSR
        F W G       H + W  V  P  +GG+G+GN   RNLALL KW+WR+  E ++LWH++I++  Y S       + + + SH+ PW+ I       S  
Subjt:  FFWEGSRGDGGMHNINWATVQLPHLMGGIGIGNFQNRNLALLAKWIWRFLHEENSLWHKLIVAKYYNSELPSLWPSIIQKSSHKSPWRFITSTIDLVSSR

Query:  VKRRLGNGLATSFWHDSWLSCGVLATNFPRLYR-LTDRPRSLVGETWIATQSAWDLNLRRNLNDVETEEWMDLSLILSSIRLQ-NRNDSWIWPLESSNIF
         +   GNG    FW D W     L T +PRL+R + D+  S+      +    W+LN RRNL+D E E+   L   L  + L  +  D+ +WPL SS +F
Subjt:  VKRRLGNGLATSFWHDSWLSCGVLATNFPRLYR-LTDRPRSLVGETWIATQSAWDLNLRRNLNDVETEEWMDLSLILSSIRLQ-NRNDSWIWPLESSNIF

Query:  SVKSLMEDLVDRPNMANDL-YKVIWSDFYPKKIKIFLWELSHGAINTADRLQRRMPHFHLSPSWCIMCAASSEHPRHLFVHCTFASRYWSEILDAFGWSI
        SVKS    L      + +   K +W+   P K+K F+W ++H  +NT D LQ R P+  LSP  CI+C    E   HLF+HC+     W  +        
Subjt:  SVKSLMEDLVDRPNMANDL-YKVIWSDFYPKKIKIFLWELSHGAINTADRLQRRMPHFHLSPSWCIMCAASSEHPRHLFVHCTFASRYWSEILDAFGWSI

Query:  VFPNCIKDVLTLIFVDHPFRGEKKILWLALNRVFFWFLWGERNSRIFRDSFSSFDKFMELILFHALYWCKCKHPFSDYSLSFLISNWKA
        V P  I D++++ F          +LW A +      +W ERN+RIF D   + +   + I+F A  W  C   F    L+ +  +W A
Subjt:  VFPNCIKDVLTLIFVDHPFRGEKKILWLALNRVFFWFLWGERNSRIFRDSFSSFDKFMELILFHALYWCKCKHPFSDYSLSFLISNWKA

RVW53010.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]1.3e-12127.19Show/hide
Query:  GLGSWKKRALIKKTIQQQNPGIVLIQETKKSLICSRIIKSLWSSSHIGWTSLDSVGASGGILIMWSEPEFSVKETIQGLFSLSIHIVLADNFSFWLSTIY
        GLGS KKR ++K  +  + P +V+IQETKK     R++ S+WS  +  W +L + GASGGILI+W   +   +E + G FS+SI   +    S WLS +Y
Subjt:  GLGSWKKRALIKKTIQQQNPGIVLIQETKKSLICSRIIKSLWSSSHIGWTSLDSVGASGGILIMWSEPEFSVKETIQGLFSLSIHIVLADNFSFWLSTIY

Query:  GPSRHADRSEFWNELHDLAGLGGDNWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIADYHLIDTPLQNGCYTWSSCGENHYCSLIDRFLMTDTCLNKF
        GP+  A R +FW EL D+AGL    W +GGDFNV R S EK  G  +T  M+ F+++I D  LID+PL++  YTWS+  EN  C  +DRFL ++     F
Subjt:  GPSRHADRSEFWNELHDLAGLGGDNWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIADYHLIDTPLQNGCYTWSSCGENHYCSLIDRFLMTDTCLNKF

Query:  GVARFLRLDRVTSDHYPCTLSFGDLSWGPCPFRFENSWLKKDSFRCLMENWWSQNTIQGWPGHGFMMKLKGLKSEIRKWN--------------------
          +    L R TSDH+P  L      WGP PFRFEN WL+  SF+     WWS+    GW GH FM KL+ +K+++++WN                    
Subjt:  GVARFLRLDRVTSDHYPCTLSFGDLSWGPCPFRFENSWLKKDSFRCLMENWWSQNTIQGWPGHGFMMKLKGLKSEIRKWN--------------------

Query:  ---------LSQR------------------------------------------------SSADQLPS-------------------------------
                 LSQ                                                  SA +L S                               
Subjt:  ---------LSQR------------------------------------------------SSADQLPS-------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------LVSQLKLLDDT--------EDMDTLANMFDIVKIFELASGLNINYSKSEVLGIHLEESELDWLTSTFGCKQGFWPSTYLGLPLGGN
                       VS L+  DDT        ED+ TL +   ++ +F   SGL +N  KS + GI++E++ L  L     CK   WP  YLGLPLGGN
Subjt:  --------------LVSQLKLLDDT--------EDMDTLANMFDIVKIFELASGLNINYSKSEVLGIHLEESELDWLTSTFGCKQGFWPSTYLGLPLGGN

Query:  SKTIPFWQPVIERIQHKLHSWKYSYISKGGRHTLTQAVLSSMPIYYLSLFKLSGQIAKTLDKLLRDFFWEGSRGDGGMHNINWATVQLPHLMGGIGIGNF
         K   FW PVIERI  +L  W+ +Y+S GGR TL Q+ L+ MP Y+LSLF++   +A  ++++ R+F W G       H +NW  V  P   GG+G G  
Subjt:  SKTIPFWQPVIERIQHKLHSWKYSYISKGGRHTLTQAVLSSMPIYYLSLFKLSGQIAKTLDKLLRDFFWEGSRGDGGMHNINWATVQLPHLMGGIGIGNF

Query:  QNRNLALLAKWIWRFLHEENSLWHKLIVAKYYNSELPSLWPSIIQKSSHKSPWRFITSTIDLVSSRVKRRLGNGLATSFWHDSWLSCGVLATNFPRLYR-
          RN+ALL KW+WR+  E ++LWH++I++  Y S       +   + SH+ PW+ I       S   +  +G+G    FW D W     L T +PRL   
Subjt:  QNRNLALLAKWIWRFLHEENSLWHKLIVAKYYNSELPSLWPSIIQKSSHKSPWRFITSTIDLVSSRVKRRLGNGLATSFWHDSWLSCGVLATNFPRLYR-

Query:  LTDRPRSLVGETWIATQSAWDLNLRRNLNDVETEEWMDLSLILSSIRLQ-NRNDSWIWPLESSNIFSVKSLMEDLVDRPNMANDL-YKVIWSDFYPKKIK
        +TD+   +      +   +W+ N RRNL D E E+   L   L  + +  +  D   W +  S +F+VKS    L            K +W+   P K+K
Subjt:  LTDRPRSLVGETWIATQSAWDLNLRRNLNDVETEEWMDLSLILSSIRLQ-NRNDSWIWPLESSNIFSVKSLMEDLVDRPNMANDL-YKVIWSDFYPKKIK

Query:  IFLWELSHGAINTADRLQRRMPHFHLSPSWCIMCAASSEHPRHLFVHCTFASRYWSEILDAFGWSIVFPNCIKDVLTLIFVDHPFRGEKKILWLALNRVF
         F+W ++H  +NT D LQ R PH  LSP+ C +C    E   HLF+HC+     W  +        V P  I D+    F          +LW       
Subjt:  IFLWELSHGAINTADRLQRRMPHFHLSPSWCIMCAASSEHPRHLFVHCTFASRYWSEILDAFGWSIVFPNCIKDVLTLIFVDHPFRGEKKILWLALNRVF

Query:  FWFLWGERNSRIFRDSFSSFDKFMELILFHALYWCKCKHPFSDYSLSFLISNWKA
         W +W ERN+RIF D   +     + I F A  W  C   F    L+ L  +W A
Subjt:  FWFLWGERNSRIFRDSFSSFDKFMELILFHALYWCKCKHPFSDYSLSFLISNWKA

RVW64408.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]7.7e-13026.95Show/hide
Query:  KSKMISSPAVMPKISVPVHISPSP-PKISVPEHISPPPLSSDHLNKGKLPL-EAPFPGPESSIIQITEPTNLRCGNIGSTSKPNLIEQPNHNNPPKPLES
        +S ++S  +  PKI+V   + PS    +S PE ++  PL + + N  + PL +     PE   +             G    PN    P H  P  PLE 
Subjt:  KSKMISSPAVMPKISVPVHISPSP-PKISVPEHISPPPLSSDHLNKGKLPL-EAPFPGPESSIIQITEPTNLRCGNIGSTSKPNLIEQPNHNNPPKPLES

Query:  LR---LSPHP-----------SPRH-LGFNGGGRE---YYKKFLTWNVRGLGSWKKRALIKKTIQQQNPGIVLIQETKKSLICSRIIKSLWSSSHIGWTS
         +   L+P              PR  LG   G       Y K L+WN RGLGS KKR ++++ +  QNP IV++QETK+     R + S+W    + W +
Subjt:  LR---LSPHP-----------SPRH-LGFNGGGRE---YYKKFLTWNVRGLGSWKKRALIKKTIQQQNPGIVLIQETKKSLICSRIIKSLWSSSHIGWTS

Query:  LDSVGASGGILIMWSEPEFSVKETIQGLFSLSIHIVLADNFSFWLSTIYGPSRHADRSEFWNELHDLAGLGGDNWILGGDFNVTRWSWEKSHGRPVTRSM
        L + GASGGI+I+W   +    E + G FS+++     +  SFWL+++YGP     R +FW EL DL GL    W +GGDFNV R   EK     +T +M
Subjt:  LDSVGASGGILIMWSEPEFSVKETIQGLFSLSIHIVLADNFSFWLSTIYGPSRHADRSEFWNELHDLAGLGGDNWILGGDFNVTRWSWEKSHGRPVTRSM

Query:  RIFNQWIADYHLIDTPLQNGCYTWSSCGENHYCSLIDRFLMTDTCLNKFGVARFLRLDRVTSDHYPCTLSFGDLSWGPCPFRFENSWLKKDSFRCLMENW
        R F+++I +  LID PL+N  +TWS+   +  C  +DRFL +      F  +    L R TSDH P  L    L WGP PFRFEN WL    F+     W
Subjt:  RIFNQWIADYHLIDTPLQNGCYTWSSCGENHYCSLIDRFLMTDTCLNKFGVARFLRLDRVTSDHYPCTLSFGDLSWGPCPFRFENSWLKKDSFRCLMENW

Query:  WSQNTIQGWPGHGFMMKLKGLKSEIRKWNL----------------------------------------------------------------------
        W + T +GW GH FM KLK +KS++++WN+                                                                      
Subjt:  WSQNTIQGWPGHGFMMKLKGLKSEIRKWNL----------------------------------------------------------------------

Query:  --------SQRSSADQLPSLVSQ-----------------------------------------------------------------------------
                + R S   + SL+S+                                                                             
Subjt:  --------SQRSSADQLPSLVSQ-----------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------------LKLLDDT-----EDMDTLANMFDIVKIFELASGLNINYSKSEVLGIHLEESELDWLTSTFGCKQGFWPSTYLGLPLG
                               L+  DDT       M+ L N+  I+ +F   SGL IN  KS + GI+  +  L  L S F C+   WP +YLGLPLG
Subjt:  -----------------------LKLLDDT-----EDMDTLANMFDIVKIFELASGLNINYSKSEVLGIHLEESELDWLTSTFGCKQGFWPSTYLGLPLG

Query:  GNSKTIPFWQPVIERIQHKLHSWKYSYISKGGRHTLTQAVLSSMPIYYLSLFKLSGQIAKTLDKLLRDFFWEGSRGDGGMHNINWATVQLPHLMGGIGIG
        GN KTI FW PV+ERI  +L  WK +Y+S GGR TL Q+ LS +P Y+LSLFK+   IA  ++K+ R+F W G+      H + W  V  P  +GG+G G
Subjt:  GNSKTIPFWQPVIERIQHKLHSWKYSYISKGGRHTLTQAVLSSMPIYYLSLFKLSGQIAKTLDKLLRDFFWEGSRGDGGMHNINWATVQLPHLMGGIGIG

Query:  NFQNRNLALLAKWIWRFLHEENSLWHKLIVAKYYNSELPSLW-PSIIQKSSHKSPWRFITSTIDLVSSRVKRRLGNGLATSFWHDSWLSCGVLATNFPRL
            RN+ALL KW+WRF  E + LW+K+I + Y     P+ W  +++ + SH+ PW+ I       S  V+  +GNG    FW D W     L + F  L
Subjt:  NFQNRNLALLAKWIWRFLHEENSLWHKLIVAKYYNSELPSLW-PSIIQKSSHKSPWRFITSTIDLVSSRVKRRLGNGLATSFWHDSWLSCGVLATNFPRL

Query:  YRLTDRPRSLVGET-WIATQSAWDLNLRRNLNDVETEEWMDLSLILSSIR-LQNRNDSWIWPLESSNIFSVKSLMEDLVDRPNMANDL-YKVIWSDFYPK
        YR+       V      +   AW+LN RRNL D E +    L   LSS+R   +  DS  W L SS +F+VKS    L    N    L  K +WS   P 
Subjt:  YRLTDRPRSLVGET-WIATQSAWDLNLRRNLNDVETEEWMDLSLILSSIR-LQNRNDSWIWPLESSNIFSVKSLMEDLVDRPNMANDL-YKVIWSDFYPK

Query:  KIKIFLWELSHGAINTADRLQRRMPHFHLSPSWCIMCAASSEHPRHLFVHCTFASRYWSEILDAFGWSIVFPNCIKDVLTLIF--VDHPFRGEKKILWLA
        K+K   W ++HG +NT D+LQ R P+  L P WCI+C  + E   HLF+HC      W+++    G   V P   +D+L + F  + +  RG  K LW  
Subjt:  KIKIFLWELSHGAINTADRLQRRMPHFHLSPSWCIMCAASSEHPRHLFVHCTFASRYWSEILDAFGWSIVFPNCIKDVLTLIF--VDHPFRGEKKILWLA

Query:  LNRVFFWFLWGERNSRIFRDSFSSFDKFMELILFHALYWCKCKHPFSDYSLSFLISNW
              W +W ERN RIF D   S +   +LILF++  W  C   F    L+ +  NW
Subjt:  LNRVFFWFLWGERNSRIFRDSFSSFDKFMELILFHALYWCKCKHPFSDYSLSFLISNW

RVX11537.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]4.5e-12227.78Show/hide
Query:  LESLRLSPHPSPRHLGFNGGGRE-------YYKKFLTWNVRGLGSWKKRALIKKTIQQQNPGIVLIQETKKSLICSRIIKSLWSSSHIGWTSLDSVGASG
        LESLR+      R +  NG G E       +  K L+WN RGLGS KKR ++++ +  QNP IV++QETK+     R + S+W    + W +L + GA G
Subjt:  LESLRLSPHPSPRHLGFNGGGRE-------YYKKFLTWNVRGLGSWKKRALIKKTIQQQNPGIVLIQETKKSLICSRIIKSLWSSSHIGWTSLDSVGASG

Query:  GILIMWSEPEFSVKETIQGLFSLSIHIVLADNFSFWLSTIYGPSRHADRSEFWNELHDLAGLGGDNWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIA
        GI+I+W   +F   E + G FS+++     +  S WL+++YGP     R +FW EL DL GL    W +GGDFNV R   EK     +T +MR F+++I 
Subjt:  GILIMWSEPEFSVKETIQGLFSLSIHIVLADNFSFWLSTIYGPSRHADRSEFWNELHDLAGLGGDNWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIA

Query:  DYHLIDTPLQNGCYTWSSCGENHYCSLIDRFLMTDTCLNKFGVARFLRLDRVTSDHYPCTLSFGDLSWGPCPFRFENSWLKKDSFRCLMENWWSQNTIQG
        +  L+D PL+N  +TWS+   +  C  +DRFL +      F  +    L R TSDH P  L    L WGP PFRFEN WL    F+     WW + T +G
Subjt:  DYHLIDTPLQNGCYTWSSCGENHYCSLIDRFLMTDTCLNKFGVARFLRLDRVTSDHYPCTLSFGDLSWGPCPFRFENSWLKKDSFRCLMENWWSQNTIQG

Query:  WPGHGFMMKLKGLKSEIRKWNL------------------------------------------------------------------------------
        W GH FM KLK +KS++++WN+                                                                              
Subjt:  WPGHGFMMKLKGLKSEIRKWNL------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------SQRSS-------------------------------------
                                                                   +RS                                      
Subjt:  ----------------------------------------------------------SQRSS-------------------------------------

Query:  -------------------------------------------ADQLPSL---------------------VSQLKLLDDT-----EDMDTLANMFDIVK
                                                   AD L  +                     VS L+  DDT       M+ L N+  I+ 
Subjt:  -------------------------------------------ADQLPSL---------------------VSQLKLLDDT-----EDMDTLANMFDIVK

Query:  IFELASGLNINYSKSEVLGIHLEESELDWLTSTFGCKQGFWPSTYLGLPLGGNSKTIPFWQPVIERIQHKLHSWKYSYISKGGRHTLTQAVLSSMPIYYL
        +F   SGL IN  KS + GI+  +  L  L S F C+   WP +YLGLPLGGN KTI FW PV+ERI  +L  WK +Y+S GGR TL Q+ LS +P Y+L
Subjt:  IFELASGLNINYSKSEVLGIHLEESELDWLTSTFGCKQGFWPSTYLGLPLGGNSKTIPFWQPVIERIQHKLHSWKYSYISKGGRHTLTQAVLSSMPIYYL

Query:  SLFKLSGQIAKTLDKLLRDFFWEGSRGDGGMHNINWATVQLPHLMGGIGIGNFQNRNLALLAKWIWRFLHEENSLWHKLIVAKYYNSELPSLW-PSIIQK
        SLFK+   IA  ++K+ R+F W  +      H + W  V  P  +GG+G G    RN+ALL KW+WRF  E + LW+K+I + Y     P+ W  +++ +
Subjt:  SLFKLSGQIAKTLDKLLRDFFWEGSRGDGGMHNINWATVQLPHLMGGIGIGNFQNRNLALLAKWIWRFLHEENSLWHKLIVAKYYNSELPSLW-PSIIQK

Query:  SSHKSPWRFITSTIDLVSSRVKRRLGNGLATSFWHDSWLSCGVLATNFPRLYRLTDRPRSLVGET-WIATQSAWDLNLRRNLNDVETEEWMDLSLILSSI
         SH+ PW+ I       S  V+  +GNG    FW D W     L + F  LYR+       V      +   AW+LN RRNL D E +    L   LSS+
Subjt:  SSHKSPWRFITSTIDLVSSRVKRRLGNGLATSFWHDSWLSCGVLATNFPRLYRLTDRPRSLVGET-WIATQSAWDLNLRRNLNDVETEEWMDLSLILSSI

Query:  RLQ-NRNDSWIWPLESSNIFSVKSLMEDLVDRPNMANDL-YKVIWSDFYPKKIKIFLWELSHGAINTADRLQRRMPHFHLSPSWCIMCAASSEHPRHLFV
        R   +  DS  W L SS +F+VKS    L    N    L  K +WS   P K+K   W ++HG +NT D+LQ R P+  L P WCI+C  + E   HLF+
Subjt:  RLQ-NRNDSWIWPLESSNIFSVKSLMEDLVDRPNMANDL-YKVIWSDFYPKKIKIFLWELSHGAINTADRLQRRMPHFHLSPSWCIMCAASSEHPRHLFV

Query:  HCTFASRYWSEILDAFGWSIVFPNCIKDVLTLIF
        HC      W+++    G   V P   +D+L + F
Subjt:  HCTFASRYWSEILDAFGWSIVFPNCIKDVLTLIF

TrEMBL top hitse value%identityAlignment
A0A438BYX6 Transposon TX1 uncharacterized 149 kDa protein1.2e-12526.75Show/hide
Query:  KFLTWNVRGLGSWKKRALIKKTIQQQNPGIVLIQETKKSLICSRIIKSLWSSSHIGWTSLDSVGASGGILIMWSEPEFSVKETIQGLFSLSIHIVLADNF
        K ++WNVRGLGS  KR +IK  ++ +NP +V+IQETKK     R + S+W+  +  W +L + GASGGILI+W     S +E + G FS+S+   L    
Subjt:  KFLTWNVRGLGSWKKRALIKKTIQQQNPGIVLIQETKKSLICSRIIKSLWSSSHIGWTSLDSVGASGGILIMWSEPEFSVKETIQGLFSLSIHIVLADNF

Query:  SFWLSTIYGPSRHADRSEFWNELHDLAGLGGDNWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIADYHLIDTPLQNGCYTWSSCGENHYCSLIDRFLM
          W+S +YGP+  + R +FW EL D+ GL    W +GGDFNV R S EK  G  +T SMR F+ +I++  L+D PL+N  +TWS+  E+  C  +DRFL 
Subjt:  SFWLSTIYGPSRHADRSEFWNELHDLAGLGGDNWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIADYHLIDTPLQNGCYTWSSCGENHYCSLIDRFLM

Query:  TDTCLNKFGVARFLRLDRVTSDHYPCTLSFGDLSWGPCPFRFENSWLKKDSFRCLMENWWSQNTIQGWPGHGFMMKLKGLKSEIRKWN------------
        ++     F       L R TSDH+P  +      WGP PFRFEN WL+  +F+    +WWS     GW GH FM +L+ +K+++++WN            
Subjt:  TDTCLNKFGVARFLRLDRVTSDHYPCTLSFGDLSWGPCPFRFENSWLKKDSFRCLMENWWSQNTIQGWPGHGFMMKLKGLKSEIRKWN------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------------------------------------------------------LSQRSS---------------------
                                                                                 LS+R S                     
Subjt:  -------------------------------------------------------------------------LSQRSS---------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------ADQLPSL---------------------VSQLKLLDDT--------EDMDTLANMFDIVKIFELASGLNINYSKSEVLG
                             AD L  +                     VS L+  DD         E++ TL +   ++ +F    GL +N +KS + G
Subjt:  ---------------------ADQLPSL---------------------VSQLKLLDDT--------EDMDTLANMFDIVKIFELASGLNINYSKSEVLG

Query:  IHLEESELDWLTSTFGCKQGFWPSTYLGLPLGGNSKTIPFWQPVIERIQHKLHSWKYSYISKGGRHTLTQAVLSSMPIYYLSLFKLSGQIAKTLDKLLRD
        I+L+++ L  L     CK   WP  YLGLPLGGN K+  FW PV+ERI  +L  W+ +Y+S GGR TL Q+ L+ +P Y+LSLFK+   +A  +++L RD
Subjt:  IHLEESELDWLTSTFGCKQGFWPSTYLGLPLGGNSKTIPFWQPVIERIQHKLHSWKYSYISKGGRHTLTQAVLSSMPIYYLSLFKLSGQIAKTLDKLLRD

Query:  FFWEGSRGDGGMHNINWATVQLPHLMGGIGIGNFQNRNLALLAKWIWRFLHEENSLWHKLIVAKYYNSELPSLWPSIIQKSSHKSPWRFITSTIDLVSSR
        F W G       H + W  V  P  +GG+G+GN   RNLALL KW+WR+  E ++LWH++I++  Y S       + + + SH+ PW+ I       S  
Subjt:  FFWEGSRGDGGMHNINWATVQLPHLMGGIGIGNFQNRNLALLAKWIWRFLHEENSLWHKLIVAKYYNSELPSLWPSIIQKSSHKSPWRFITSTIDLVSSR

Query:  VKRRLGNGLATSFWHDSWLSCGVLATNFPRLYR-LTDRPRSLVGETWIATQSAWDLNLRRNLNDVETEEWMDLSLILSSIRLQ-NRNDSWIWPLESSNIF
         +   GNG    FW D W     L T +PRL+R + D+  S+      +    W+LN RRNL+D E E+   L   L  + L  +  D+ +WPL SS +F
Subjt:  VKRRLGNGLATSFWHDSWLSCGVLATNFPRLYR-LTDRPRSLVGETWIATQSAWDLNLRRNLNDVETEEWMDLSLILSSIRLQ-NRNDSWIWPLESSNIF

Query:  SVKSLMEDLVDRPNMANDL-YKVIWSDFYPKKIKIFLWELSHGAINTADRLQRRMPHFHLSPSWCIMCAASSEHPRHLFVHCTFASRYWSEILDAFGWSI
        SVKS    L      + +   K +W+   P K+K F+W ++H  +NT D LQ R P+  LSP  CI+C    E   HLF+HC+     W  +        
Subjt:  SVKSLMEDLVDRPNMANDL-YKVIWSDFYPKKIKIFLWELSHGAINTADRLQRRMPHFHLSPSWCIMCAASSEHPRHLFVHCTFASRYWSEILDAFGWSI

Query:  VFPNCIKDVLTLIFVDHPFRGEKKILWLALNRVFFWFLWGERNSRIFRDSFSSFDKFMELILFHALYWCKCKHPFSDYSLSFLISNWKA
        V P  I D++++ F          +LW A +      +W ERN+RIF D   + +   + I+F A  W  C   F    L+ +  +W A
Subjt:  VFPNCIKDVLTLIFVDHPFRGEKKILWLALNRVFFWFLWGERNSRIFRDSFSSFDKFMELILFHALYWCKCKHPFSDYSLSFLISNWKA

A0A438FWU5 LINE-1 retrotransposable element ORF2 protein3.7e-13026.95Show/hide
Query:  KSKMISSPAVMPKISVPVHISPSP-PKISVPEHISPPPLSSDHLNKGKLPL-EAPFPGPESSIIQITEPTNLRCGNIGSTSKPNLIEQPNHNNPPKPLES
        +S ++S  +  PKI+V   + PS    +S PE ++  PL + + N  + PL +     PE   +             G    PN    P H  P  PLE 
Subjt:  KSKMISSPAVMPKISVPVHISPSP-PKISVPEHISPPPLSSDHLNKGKLPL-EAPFPGPESSIIQITEPTNLRCGNIGSTSKPNLIEQPNHNNPPKPLES

Query:  LR---LSPHP-----------SPRH-LGFNGGGRE---YYKKFLTWNVRGLGSWKKRALIKKTIQQQNPGIVLIQETKKSLICSRIIKSLWSSSHIGWTS
         +   L+P              PR  LG   G       Y K L+WN RGLGS KKR ++++ +  QNP IV++QETK+     R + S+W    + W +
Subjt:  LR---LSPHP-----------SPRH-LGFNGGGRE---YYKKFLTWNVRGLGSWKKRALIKKTIQQQNPGIVLIQETKKSLICSRIIKSLWSSSHIGWTS

Query:  LDSVGASGGILIMWSEPEFSVKETIQGLFSLSIHIVLADNFSFWLSTIYGPSRHADRSEFWNELHDLAGLGGDNWILGGDFNVTRWSWEKSHGRPVTRSM
        L + GASGGI+I+W   +    E + G FS+++     +  SFWL+++YGP     R +FW EL DL GL    W +GGDFNV R   EK     +T +M
Subjt:  LDSVGASGGILIMWSEPEFSVKETIQGLFSLSIHIVLADNFSFWLSTIYGPSRHADRSEFWNELHDLAGLGGDNWILGGDFNVTRWSWEKSHGRPVTRSM

Query:  RIFNQWIADYHLIDTPLQNGCYTWSSCGENHYCSLIDRFLMTDTCLNKFGVARFLRLDRVTSDHYPCTLSFGDLSWGPCPFRFENSWLKKDSFRCLMENW
        R F+++I +  LID PL+N  +TWS+   +  C  +DRFL +      F  +    L R TSDH P  L    L WGP PFRFEN WL    F+     W
Subjt:  RIFNQWIADYHLIDTPLQNGCYTWSSCGENHYCSLIDRFLMTDTCLNKFGVARFLRLDRVTSDHYPCTLSFGDLSWGPCPFRFENSWLKKDSFRCLMENW

Query:  WSQNTIQGWPGHGFMMKLKGLKSEIRKWNL----------------------------------------------------------------------
        W + T +GW GH FM KLK +KS++++WN+                                                                      
Subjt:  WSQNTIQGWPGHGFMMKLKGLKSEIRKWNL----------------------------------------------------------------------

Query:  --------SQRSSADQLPSLVSQ-----------------------------------------------------------------------------
                + R S   + SL+S+                                                                             
Subjt:  --------SQRSSADQLPSLVSQ-----------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------------LKLLDDT-----EDMDTLANMFDIVKIFELASGLNINYSKSEVLGIHLEESELDWLTSTFGCKQGFWPSTYLGLPLG
                               L+  DDT       M+ L N+  I+ +F   SGL IN  KS + GI+  +  L  L S F C+   WP +YLGLPLG
Subjt:  -----------------------LKLLDDT-----EDMDTLANMFDIVKIFELASGLNINYSKSEVLGIHLEESELDWLTSTFGCKQGFWPSTYLGLPLG

Query:  GNSKTIPFWQPVIERIQHKLHSWKYSYISKGGRHTLTQAVLSSMPIYYLSLFKLSGQIAKTLDKLLRDFFWEGSRGDGGMHNINWATVQLPHLMGGIGIG
        GN KTI FW PV+ERI  +L  WK +Y+S GGR TL Q+ LS +P Y+LSLFK+   IA  ++K+ R+F W G+      H + W  V  P  +GG+G G
Subjt:  GNSKTIPFWQPVIERIQHKLHSWKYSYISKGGRHTLTQAVLSSMPIYYLSLFKLSGQIAKTLDKLLRDFFWEGSRGDGGMHNINWATVQLPHLMGGIGIG

Query:  NFQNRNLALLAKWIWRFLHEENSLWHKLIVAKYYNSELPSLW-PSIIQKSSHKSPWRFITSTIDLVSSRVKRRLGNGLATSFWHDSWLSCGVLATNFPRL
            RN+ALL KW+WRF  E + LW+K+I + Y     P+ W  +++ + SH+ PW+ I       S  V+  +GNG    FW D W     L + F  L
Subjt:  NFQNRNLALLAKWIWRFLHEENSLWHKLIVAKYYNSELPSLW-PSIIQKSSHKSPWRFITSTIDLVSSRVKRRLGNGLATSFWHDSWLSCGVLATNFPRL

Query:  YRLTDRPRSLVGET-WIATQSAWDLNLRRNLNDVETEEWMDLSLILSSIR-LQNRNDSWIWPLESSNIFSVKSLMEDLVDRPNMANDL-YKVIWSDFYPK
        YR+       V      +   AW+LN RRNL D E +    L   LSS+R   +  DS  W L SS +F+VKS    L    N    L  K +WS   P 
Subjt:  YRLTDRPRSLVGET-WIATQSAWDLNLRRNLNDVETEEWMDLSLILSSIR-LQNRNDSWIWPLESSNIFSVKSLMEDLVDRPNMANDL-YKVIWSDFYPK

Query:  KIKIFLWELSHGAINTADRLQRRMPHFHLSPSWCIMCAASSEHPRHLFVHCTFASRYWSEILDAFGWSIVFPNCIKDVLTLIF--VDHPFRGEKKILWLA
        K+K   W ++HG +NT D+LQ R P+  L P WCI+C  + E   HLF+HC      W+++    G   V P   +D+L + F  + +  RG  K LW  
Subjt:  KIKIFLWELSHGAINTADRLQRRMPHFHLSPSWCIMCAASSEHPRHLFVHCTFASRYWSEILDAFGWSIVFPNCIKDVLTLIF--VDHPFRGEKKILWLA

Query:  LNRVFFWFLWGERNSRIFRDSFSSFDKFMELILFHALYWCKCKHPFSDYSLSFLISNW
              W +W ERN RIF D   S +   +LILF++  W  C   F    L+ +  NW
Subjt:  LNRVFFWFLWGERNSRIFRDSFSSFDKFMELILFHALYWCKCKHPFSDYSLSFLISNW

A0A438JRF4 LINE-1 retrotransposable element ORF2 protein2.2e-12227.78Show/hide
Query:  LESLRLSPHPSPRHLGFNGGGRE-------YYKKFLTWNVRGLGSWKKRALIKKTIQQQNPGIVLIQETKKSLICSRIIKSLWSSSHIGWTSLDSVGASG
        LESLR+      R +  NG G E       +  K L+WN RGLGS KKR ++++ +  QNP IV++QETK+     R + S+W    + W +L + GA G
Subjt:  LESLRLSPHPSPRHLGFNGGGRE-------YYKKFLTWNVRGLGSWKKRALIKKTIQQQNPGIVLIQETKKSLICSRIIKSLWSSSHIGWTSLDSVGASG

Query:  GILIMWSEPEFSVKETIQGLFSLSIHIVLADNFSFWLSTIYGPSRHADRSEFWNELHDLAGLGGDNWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIA
        GI+I+W   +F   E + G FS+++     +  S WL+++YGP     R +FW EL DL GL    W +GGDFNV R   EK     +T +MR F+++I 
Subjt:  GILIMWSEPEFSVKETIQGLFSLSIHIVLADNFSFWLSTIYGPSRHADRSEFWNELHDLAGLGGDNWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIA

Query:  DYHLIDTPLQNGCYTWSSCGENHYCSLIDRFLMTDTCLNKFGVARFLRLDRVTSDHYPCTLSFGDLSWGPCPFRFENSWLKKDSFRCLMENWWSQNTIQG
        +  L+D PL+N  +TWS+   +  C  +DRFL +      F  +    L R TSDH P  L    L WGP PFRFEN WL    F+     WW + T +G
Subjt:  DYHLIDTPLQNGCYTWSSCGENHYCSLIDRFLMTDTCLNKFGVARFLRLDRVTSDHYPCTLSFGDLSWGPCPFRFENSWLKKDSFRCLMENWWSQNTIQG

Query:  WPGHGFMMKLKGLKSEIRKWNL------------------------------------------------------------------------------
        W GH FM KLK +KS++++WN+                                                                              
Subjt:  WPGHGFMMKLKGLKSEIRKWNL------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------SQRSS-------------------------------------
                                                                   +RS                                      
Subjt:  ----------------------------------------------------------SQRSS-------------------------------------

Query:  -------------------------------------------ADQLPSL---------------------VSQLKLLDDT-----EDMDTLANMFDIVK
                                                   AD L  +                     VS L+  DDT       M+ L N+  I+ 
Subjt:  -------------------------------------------ADQLPSL---------------------VSQLKLLDDT-----EDMDTLANMFDIVK

Query:  IFELASGLNINYSKSEVLGIHLEESELDWLTSTFGCKQGFWPSTYLGLPLGGNSKTIPFWQPVIERIQHKLHSWKYSYISKGGRHTLTQAVLSSMPIYYL
        +F   SGL IN  KS + GI+  +  L  L S F C+   WP +YLGLPLGGN KTI FW PV+ERI  +L  WK +Y+S GGR TL Q+ LS +P Y+L
Subjt:  IFELASGLNINYSKSEVLGIHLEESELDWLTSTFGCKQGFWPSTYLGLPLGGNSKTIPFWQPVIERIQHKLHSWKYSYISKGGRHTLTQAVLSSMPIYYL

Query:  SLFKLSGQIAKTLDKLLRDFFWEGSRGDGGMHNINWATVQLPHLMGGIGIGNFQNRNLALLAKWIWRFLHEENSLWHKLIVAKYYNSELPSLW-PSIIQK
        SLFK+   IA  ++K+ R+F W  +      H + W  V  P  +GG+G G    RN+ALL KW+WRF  E + LW+K+I + Y     P+ W  +++ +
Subjt:  SLFKLSGQIAKTLDKLLRDFFWEGSRGDGGMHNINWATVQLPHLMGGIGIGNFQNRNLALLAKWIWRFLHEENSLWHKLIVAKYYNSELPSLW-PSIIQK

Query:  SSHKSPWRFITSTIDLVSSRVKRRLGNGLATSFWHDSWLSCGVLATNFPRLYRLTDRPRSLVGET-WIATQSAWDLNLRRNLNDVETEEWMDLSLILSSI
         SH+ PW+ I       S  V+  +GNG    FW D W     L + F  LYR+       V      +   AW+LN RRNL D E +    L   LSS+
Subjt:  SSHKSPWRFITSTIDLVSSRVKRRLGNGLATSFWHDSWLSCGVLATNFPRLYRLTDRPRSLVGET-WIATQSAWDLNLRRNLNDVETEEWMDLSLILSSI

Query:  RLQ-NRNDSWIWPLESSNIFSVKSLMEDLVDRPNMANDL-YKVIWSDFYPKKIKIFLWELSHGAINTADRLQRRMPHFHLSPSWCIMCAASSEHPRHLFV
        R   +  DS  W L SS +F+VKS    L    N    L  K +WS   P K+K   W ++HG +NT D+LQ R P+  L P WCI+C  + E   HLF+
Subjt:  RLQ-NRNDSWIWPLESSNIFSVKSLMEDLVDRPNMANDL-YKVIWSDFYPKKIKIFLWELSHGAINTADRLQRRMPHFHLSPSWCIMCAASSEHPRHLFV

Query:  HCTFASRYWSEILDAFGWSIVFPNCIKDVLTLIF
        HC      W+++    G   V P   +D+L + F
Subjt:  HCTFASRYWSEILDAFGWSIVFPNCIKDVLTLIF

A5B978 Reverse transcriptase domain-containing protein1.3e-12227.12Show/hide
Query:  KFLTWNVRGLGSWKKRALIKKTIQQQNPGIVLIQETKKSLICSRIIKSLWSSSHIGWTSLDSVGASGGILIMWSEPEFSVKETIQGLFSLSIHIVLADNF
        K L+WN RGLGS KKR  +++ +  QNP +V+ QETK+ +   R++ S+W    + W +L + GASGGI+I+W   +F+  E + G FS+++ +   +  
Subjt:  KFLTWNVRGLGSWKKRALIKKTIQQQNPGIVLIQETKKSLICSRIIKSLWSSSHIGWTSLDSVGASGGILIMWSEPEFSVKETIQGLFSLSIHIVLADNF

Query:  SFWLSTIYGPSRHADRSEFWNELHDLAGLGGDNWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIADYHLID-TPLQNGCYTWSSCGENHYCSLIDRFL
        SFWLS++YGP++   R +FW EL DL GL    W +GGDFNV R   EK     +T +MR F+++I +  L+D  PL+N  +TWS+   +  C  +DRFL
Subjt:  SFWLSTIYGPSRHADRSEFWNELHDLAGLGGDNWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIADYHLID-TPLQNGCYTWSSCGENHYCSLIDRFL

Query:  MTDTCLNKFGVARFLRLDRVTSDHYPCTLSFGDLSWGPCPFRFENSWLKKDSFRCLMENWWSQNTIQGWPGHGFMMKLKGLKSEIRKWN-----------
              + F       L   TSDH P  L      WGP PFRFEN WL    F+    +WW + T++GW GH FM KLK +KS++++WN           
Subjt:  MTDTCLNKFGVARFLRLDRVTSDHYPCTLSFGDLSWGPCPFRFENSWLKKDSFRCLMENWWSQNTIQGWPGHGFMMKLKGLKSEIRKWN-----------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------------LSQRSS-----------------------------------------------------------------------
                               +S+ S+                                                                       
Subjt:  -----------------------LSQRSS-----------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------ADQLPSL---------------------VSQLKLLDDT-----EDMDTLANMFDIVKIFELASGLNINYSKSEVLGIHLEESELD
                       AD L  L                     VS L+  DDT       +D L N+  I+ +F   SGL IN  KS + GI+  +  L 
Subjt:  ---------------ADQLPSL---------------------VSQLKLLDDT-----EDMDTLANMFDIVKIFELASGLNINYSKSEVLGIHLEESELD

Query:  WLTSTFGCKQGFWPSTYLGLPLGGNSKTIPFWQPVIERIQHKLHSWKYSYISKGGRHTLTQAVLSSMPIYYLSLFKLSGQIAKTLDKLLRDFFWEGSRGD
         L     C+   WP +YLGLPLGGN KTI FW PV+ERI  +L  WK +Y+S GGR TL Q+ LS +P Y+LSLFK+   IA  ++K+ RDF W G+   
Subjt:  WLTSTFGCKQGFWPSTYLGLPLGGNSKTIPFWQPVIERIQHKLHSWKYSYISKGGRHTLTQAVLSSMPIYYLSLFKLSGQIAKTLDKLLRDFFWEGSRGD

Query:  GGMHNINWATVQLPHLMGGIGIGNFQNRNLALLAKWIWRFLHEENSLWHKLIVAKYYNSELPSLW-PSIIQKSSHKSPWRFITSTIDLVSSRVKRRLGNG
           H I W  V  P  MGG+G G    RN ALL KW+WRF  E + LWHK+I + Y     P+ W  +++ + SH+ PW+ I       S  V+  +GNG
Subjt:  GGMHNINWATVQLPHLMGGIGIGNFQNRNLALLAKWIWRFLHEENSLWHKLIVAKYYNSELPSLW-PSIIQKSSHKSPWRFITSTIDLVSSRVKRRLGNG

Query:  LATSFWHDSWLSCGVLATNFPRLYRLTD-RPRSLVGETWIATQSAWDLNLRRNLNDVETEEWMDLSLILSSIRLQ-NRNDSWIWPLESSNIFSVKSLMED
            FW D W     L   F  LYR++  R  ++      +   +W+ N RRNL D E +    L   L S+ L  +  DS  W L SS  FSVKS    
Subjt:  LATSFWHDSWLSCGVLATNFPRLYRLTD-RPRSLVGETWIATQSAWDLNLRRNLNDVETEEWMDLSLILSSIRLQ-NRNDSWIWPLESSNIFSVKSLMED

Query:  LVDRPNMANDL-YKVIWSDFYPKKIKIFLWELSHGAINTADRLQRRMPHFHLSPSWCIMCAASSEHPRHLFVHCTFASRYWSEILDAFGWSIVFPNCIKD
        L    N    L  K +WS   P K+K   W ++HG +NT D+LQ R P+  L P WCI+C  + E   HLF+HC      W  + +  G   V P  I+D
Subjt:  LVDRPNMANDL-YKVIWSDFYPKKIKIFLWELSHGAINTADRLQRRMPHFHLSPSWCIMCAASSEHPRHLFVHCTFASRYWSEILDAFGWSIVFPNCIKD

Query:  VLTLIF---------------------------------VDHPFRG--------EKKILWLALNRVFFWFLWGERNSRIF
        +L + F                                 V     G        ++K++WL      FW +W  RNS  F
Subjt:  VLTLIF---------------------------------VDHPFRG--------EKKILWLALNRVFFWFLWGERNSRIF

M5WJ76 Reverse transcriptase domain-containing protein (Fragment)3.4e-13127.62Show/hide
Query:  KFLTWNVRGLGSWKKRALIKKTIQQQNPGIVLIQETKKSLICSRIIKSLWSSSHIGWTSLDSVGASGGILIMWSEPEFSVKETIQGLFSLSIHIVLADNF
        K ++WN+RGLGS +KR L+K+ +++  P IV++ ETKK ++  +++  +W S    W    S+G SGGI ++W+    SV +++ G FS+SI I      
Subjt:  KFLTWNVRGLGSWKKRALIKKTIQQQNPGIVLIQETKKSLICSRIIKSLWSSSHIGWTSLDSVGASGGILIMWSEPEFSVKETIQGLFSLSIHIVLADNF

Query:  SFWLSTIYGPSRHADRSEFWNELHDLAGLGGDNWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIADYHLIDTPLQNGCYTWSSCGENHYCSLIDRFLM
         +WLS IYGP R  +R+ FW EL DL G  GD W LGGDFNV R+S EKS+   VT+SMR FN +I + +L D  L N  +TWS+  EN  C  +DRFL+
Subjt:  SFWLSTIYGPSRHADRSEFWNELHDLAGLGGDNWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIADYHLIDTPLQNGCYTWSSCGENHYCSLIDRFLM

Query:  TDTCLNKFGVARFLRLDRVTSDHYPCTLSFGDLSWGPCPFRFENSWLKKDSFRCLMENWWSQNTIQGWPGHGFMMKLKGLKSEIRKWN------------
        + +    F   R   L R+TSDH P  L    + WGP PFRFEN WL    F+  ++ WW ++ I GW G+ FM +LK LKS+++ W+            
Subjt:  TDTCLNKFGVARFLRLDRVTSDHYPCTLSFGDLSWGPCPFRFENSWLKKDSFRCLMENWWSQNTIQGWPGHGFMMKLKGLKSEIRKWN------------

Query:  --------LSQRSSADQLPSL-------------------------------------------------------------------------------
                L QR   + L  L                                                                               
Subjt:  --------LSQRSSADQLPSL-------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------VSQLKLLDDTEDM-----DTLANMFDIVKIFELASGLNINYSKSEVLGIHLE
                                                        VS L+  DDT  +     +   N+  ++K+F   SG+ IN +KS +LGI+  
Subjt:  ------------------------------------------------VSQLKLLDDTEDM-----DTLANMFDIVKIFELASGLNINYSKSEVLGIHLE

Query:  ESELDWLTSTFGCKQGFWPSTYLGLPLGGNSKTIPFWQPVIERIQHKLHSWKYSYISKGGRHTLTQAVLSSMPIYYLSLFKLSGQIAKTLDKLLRDFFWE
           L+ +  ++GC+ G WP  YLGLPLGGN + + FW PV+E+++ +L  WK + +SKGGR TL QAVLSS+P YY+SLFK+   +A  +++L+R+F WE
Subjt:  ESELDWLTSTFGCKQGFWPSTYLGLPLGGNSKTIPFWQPVIERIQHKLHSWKYSYISKGGRHTLTQAVLSSMPIYYLSLFKLSGQIAKTLDKLLRDFFWE

Query:  GSRGDGGMHNINWATVQLPHLMGGIGIGNFQNRNLALLAKWIWRFLHEENSLWHKLIVAKYYNSELPSLWPSIIQKSSHKSPWRFITSTIDLVSSRVKRR
        G       H + W  V      GG+GIG+ + R  AL AKW+WRF  E NSLWH++I +KY                S+ +PWR I+   +      +  
Subjt:  GSRGDGGMHNINWATVQLPHLMGGIGIGNFQNRNLALLAKWIWRFLHEENSLWHKLIVAKYYNSELPSLWPSIIQKSSHKSPWRFITSTIDLVSSRVKRR

Query:  LGNGLATSFWHDSWLSCGVLATNFPRLYRLTDRPRSLVG--ETWIATQSAWDLNLRRNLNDVETEEWMDLSLILSSIRLQ-NRNDSWIWPLESSNIFSVK
        +GNG    FW D WL  G+L   FPRL  L+ R    +            WD + RRNL++ E  E + L  IL ++RL  +R D   W +E    FS K
Subjt:  LGNGLATSFWHDSWLSCGVLATNFPRLYRLTDRPRSLVG--ETWIATQSAWDLNLRRNLNDVETEEWMDLSLILSSIRLQ-NRNDSWIWPLESSNIFSVK

Query:  SLMEDLVDRPNMANDLYKVIWSDFYPKKIKIFLWELSHGAINTADRLQRRMPHFHLSPSWCIMCAASSEHPRHLFVHCTFASRYWSEILDAFGWSIVFPN
        S    L+         +  IW    P KI+ F+W  ++G INT D +QRR P   LSPSWC++C  ++E+  HLF+HC+++ R W ++L A G   V P 
Subjt:  SLMEDLVDRPNMANDLYKVIWSDFYPKKIKIFLWELSHGAINTADRLQRRMPHFHLSPSWCIMCAASSEHPRHLFVHCTFASRYWSEILDAFGWSIVFPN

Query:  CIKDVLTLIFVDHPFRGEKK---ILWLALNRVFFWFLWGERNSRIFRDSFS-SFDKFMELILFHALYWCKCKHPFSDYSLSFLISNWKAFM
          K    L+ ++    G+ K   IL   L    FW +W ERN RIF+       ++  + I F A  W      F DY  S ++ +  A +
Subjt:  CIKDVLTLIFVDHPFRGEKK---ILWLALNRVFFWFLWGERNSRIFRDSFS-SFDKFMELILFHALYWCKCKHPFSDYSLSFLISNWKAFM

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657507.7e-4029.67Show/hide
Query:  VIERIQHKLHSWKYSYISKGGRHTLTQAVLSSMPIYYLSLFKLSGQIAKTLDKLLRDFFWEGSRGDGGMHNINWATVQLPHLMGGIGIGNFQNRNLALLA
        ++ER+  ++  W+   +S  GR TLT+AVLSSMP++ +S   L   I   LD+L R F W  +      H + W+ V  P   GG+G+   ++ N AL++
Subjt:  VIERIQHKLHSWKYSYISKGGRHTLTQAVLSSMPIYYLSLFKLSGQIAKTLDKLLRDFFWEGSRGDGGMHNINWATVQLPHLMGGIGIGNFQNRNLALLA

Query:  KWIWRFLHEENSLWHKLIVAKYYNSEL-PSLWPSIIQKSSHKSPWRFITSTI-DLVSSRVKRRLGNGLATSFWHDSWLSCGVLATNFPRLYRLTDRPRSL
        K  WR L E+NSLW  ++  KY+  E+  S W  +I K S  S WR I   + D+VS  V    G+G    FW D W+S G          R TD    +
Subjt:  KWIWRFLHEENSLWHKLIVAKYYNSEL-PSLWPSIIQKSSHKSPWRFITSTI-DLVSSRVKRRLGNGLATSFWHDSWLSCGVLATNFPRLYRLTDRPRSL

Query:  VGETWIATQSAWDLNLRRNLNDVETEEW-MDLSLILSSIRLQNRNDSWIWPLESSNIFSVKSLME----DLVDRPNMANDLYKVIWSDFYPKKIKIFLWE
          + WI  +  WD      ++   T    ++L  ++  + +    D   W       FSV+S  E    D V RPNMA+  +  +W    P+++K FLW 
Subjt:  VGETWIATQSAWDLNLRRNLNDVETEEW-MDLSLILSSIRLQNRNDSWIWPLESSNIFSVKSLME----DLVDRPNMANDLYKVIWSDFYPKKIKIFLWE

Query:  LSHGAINTADRLQRRMPHFHLSPS-WCIMCAASSEHPRHLFVHCTFASRYWSEILDAFGWSIVFPNCIKDVLTLIFVDHPFRGEKKILWLALNRVFFWFL
        + + A+ T +   RR    HLS S  C +C    E   H+   C      W  ++        F   + + L     D    G + I W  +  V  W+ 
Subjt:  LSHGAINTADRLQRRMPHFHLSPS-WCIMCAASSEHPRHLFVHCTFASRYWSEILDAFGWSIVFPNCIKDVLTLIFVDHPFRGEKKILWLALNRVFFWFL

Query:  WGERNSRIFRDSFSSFDK
        W  R   IF ++    D+
Subjt:  WGERNSRIFRDSFSSFDK

P93295 Uncharacterized mitochondrial protein AtMg003102.4e-0926.85Show/hide
Query:  SMPIYYLSLFKLSGQIAKTLDKLLRDFFWEGSRGDGGMHNINWATV-QLPHLMGGIGIGNFQNRNLALLAKWIWRFLHEENSLWHKLIVAKYYNSELPSL
        ++P+Y +S F+LS  + K L   + +F+W        +  + W  + +     GG+G  +    N ALLAK  +R +H+ ++L  +L+ ++Y+       
Subjt:  SMPIYYLSLFKLSGQIAKTLDKLLRDFFWEGSRGDGGMHNINWATV-QLPHLMGGIGIGNFQNRNLALLAKWIWRFLHEENSLWHKLIVAKYYNSELPSL

Query:  WPSIIQKSSHKSP---WRFITSTIDLVSSRVKRRLGNGLATSFWHDSWL
          S+++ S    P   WR I    +L+S  + R +G+G+ T  W D W+
Subjt:  WPSIIQKSSHKSP---WRFITSTIDLVSSRVKRRLGNGLATSFWHDSWL

Arabidopsis top hitse value%identityAlignment
AT4G29090.1 Ribonuclease H-like superfamily protein1.3e-2325.19Show/hide
Query:  SMPIYYLSLFKLSGQIAKTLDKLLRDFFWEGSRGDGGMHNINWATVQLPHLMGGIGIGNFQNRNLALLAKWIWRFLHEENSLWHKLIVAKYYNSELPSLW
        ++P Y ++ F L   + K +  +L DF+W   +   GMH   W  +      GGIG  + +  NLALL K +WR L    SL  K+  ++Y++   P   
Subjt:  SMPIYYLSLFKLSGQIAKTLDKLLRDFFWEGSRGDGGMHNINWATVQLPHLMGGIGIGNFQNRNLALLAKWIWRFLHEENSLWHKLIVAKYYNSELPSLW

Query:  PSIIQKSSHKSPWRFITSTIDLVSSRVKRRLGNGLATSFWHDSWLSCGVLATNFPRLYRLTDRPRSLVGETWIATQSAWDLNLRRNLNDVETEEWMDLS-
        P     S     W+ I ++ +++    +  +GNG     W   WL     A+   R+ R+  +  + V    +      D + R    DV    + ++  
Subjt:  PSIIQKSSHKSPWRFITSTIDLVSSRVKRRLGNGLATSFWHDSWLSCGVLATNFPRLYRLTDRPRSLVGETWIATQSAWDLNLRRNLNDVETEEWMDLS-

Query:  LILSSIRLQNRN--DSWIWPLESSNIFSVKS---LMEDLVDRPN--------MANDLYKVIWSDFYPKKIKIFLWELSHGAINTADRLQRRMPHFHLS-P
         ++  +R   R   DS+ W   SS  ++VKS   ++  ++++ +          N +Y+ IW      KI+ FLW+    ++  A  L  R    HLS  
Subjt:  LILSSIRLQNRN--DSWIWPLESSNIFSVKS---LMEDLVDRPN--------MANDLYKVIWSDFYPKKIKIFLWELSHGAINTADRLQRRMPHFHLS-P

Query:  SWCIMCAASSEHPRHLFVHCTFASRYWSEILDAFGWSIVFPNCIKDVLTLIFVDHPFRGEKKILWLALNRV---FFWFLWGERNSRIFR
        S CI C +  E   HL   CTFA   W+           + + I   L  +F      G     W   +++     W LW  RN  +FR
Subjt:  SWCIMCAASSEHPRHLFVHCTFASRYWSEILDAFGWSIVFPNCIKDVLTLIFVDHPFRGEKKILWLALNRV---FFWFLWGERNSRIFR

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.7e-1026.85Show/hide
Query:  SMPIYYLSLFKLSGQIAKTLDKLLRDFFWEGSRGDGGMHNINWATV-QLPHLMGGIGIGNFQNRNLALLAKWIWRFLHEENSLWHKLIVAKYYNSELPSL
        ++P+Y +S F+LS  + K L   + +F+W        +  + W  + +     GG+G  +    N ALLAK  +R +H+ ++L  +L+ ++Y+       
Subjt:  SMPIYYLSLFKLSGQIAKTLDKLLRDFFWEGSRGDGGMHNINWATV-QLPHLMGGIGIGNFQNRNLALLAKWIWRFLHEENSLWHKLIVAKYYNSELPSL

Query:  WPSIIQKSSHKSP---WRFITSTIDLVSSRVKRRLGNGLATSFWHDSWL
          S+++ S    P   WR I    +L+S  + R +G+G+ T  W D W+
Subjt:  WPSIIQKSSHKSP---WRFITSTIDLVSSRVKRRLGNGLATSFWHDSWL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAACCACAAAGATTCATCCATATCTGCCATGGAATCACTCTACTCGATCCATATCCATAGATAGGAAAACCTTCACAATAGCCTTTGATGAACACTTTAGAGGAAG
TAGAGCCAAGATAACCGAATACAGCAGATATTCATCTCATTCGATTTCCCTTTCTTGGAAATCTCTAAAATGGCTTGCCTTACCTTTCAACACAATTGTTCACTCACCAT
GTTCGCACAAATTCTTCTCGGATCTGAGGAGCGAAGAATACACTCTTTGGCTGGAAAAACTGAATAACAAGAATGGTTTTTATGTGGAAATTAACCAGGTGCAAAATTCT
GGTAGCCGACAAAGGATCCTTATCCCCTCGGAAAACAACAAACAAGGTTGGTTCTCTTTTTTTTCGCTCATCTCAGAATACCCTACTGAAGCTCATCGCCAGCCCACACA
ACCATCACCTCCATCATTCAAGGACATCCTTCAAACAAAACCACCAACAGCCGCCATTACTCCTTCCTTGAAAGGGCCCGTGAAGGAAGCTTCTGTCTCCACACATGCTG
AAGAATGGAAAGAAATTATTGTTCTCCAACGATGCAATCAACATGACGACTGGCCTAGTATCCATCAATCACTAATTAACGGGTTGTCTCTTCGATGTAGCATCAACCCT
TTCCACGCTAACAAAGCCATGCTCCATGTATATGATCAAGGCACTGCTACAAACTTGTGTTCTCACTCGGATTGGACCCATATTGGTAAGCATAAATTGAAGTTTTATCC
ATTAACCACTGCCTCTGCTCAACAGGATATTATGACACCATCTTATGGAGGTTGGATTGAGATTTCTCTTCTTCCCCCTACCTTATGGACTGAGCACATATTCCGTTTCA
TTGGAGATATTTGCGGCGGCTTTGTGGAAACATCTAACCTCACTAGTCGGATGATTGTTGCTACTGAAGCTAGGATAAAAGTTTGGCCAAATGTTACAGGTTTCATTCCT
GCAGCCGTCAAACTCTCACAGGACCTTGCCGGCGTTGACCTTACGGTTCATATTCGAGGAATTTCCGGCAGCCCACAGAGAATCGTTCACATTAATGATAAAATTAATGA
GGAAATTCCCAATATGGCATCTAAGGATATTGTTTTTAAGAAGAGAGAGGAATCAGAGAACGTGTGTTCGATTGCTAAATCGAAAATGATCTCCTCGCCAGCAGTTATGC
CTAAAATCTCGGTACCAGTTCATATCTCCCCTTCACCGCCTAAAATATCGGTACCAGAACATATTTCCCCTCCTCCGCTGTCATCTGATCATTTGAATAAAGGGAAGCTC
CCTCTCGAAGCGCCTTTCCCTGGGCCTGAATCATCGATTATACAAATCACAGAACCCACAAATCTTAGATGCGGCAATATTGGATCTACATCAAAGCCCAATTTGATCGA
GCAGCCAAACCATAATAATCCTCCCAAACCTCTAGAATCCTTACGGCTTTCTCCTCACCCTTCCCCACGCCACCTTGGCTTTAATGGAGGGGGCCGAGAATATTATAAGA
AATTTTTGACATGGAATGTGCGTGGATTGGGGTCTTGGAAGAAAAGAGCTTTAATTAAGAAAACTATTCAACAACAAAACCCGGGCATTGTTCTTATTCAGGAGACTAAA
AAATCATTGATTTGCAGTAGGATTATTAAATCCCTTTGGAGCTCCTCTCATATTGGTTGGACTTCTCTCGACTCAGTGGGCGCCTCTGGAGGCATTCTTATTATGTGGAG
TGAACCAGAATTTTCAGTAAAGGAGACTATTCAAGGTCTTTTCTCTCTCTCTATTCATATCGTTCTGGCTGATAATTTCTCTTTTTGGCTATCGACTATTTATGGCCCTT
CTAGACATGCTGATAGATCGGAATTCTGGAATGAACTACACGACTTGGCTGGTTTAGGTGGTGACAATTGGATTCTTGGAGGAGATTTTAATGTCACACGTTGGTCCTGG
GAAAAATCGCATGGTCGACCCGTGACTAGGAGTATGCGTATTTTTAACCAATGGATTGCTGACTACCATCTTATAGACACCCCTTTACAGAATGGCTGCTATACGTGGTC
CAGTTGTGGTGAAAATCATTATTGCTCATTGATTGATCGATTCTTAATGACGGATACCTGTCTCAATAAATTTGGTGTAGCTCGTTTTCTTCGCCTTGATAGGGTTACAT
CGGATCATTATCCTTGTACTCTATCTTTTGGGGATCTCTCTTGGGGCCCTTGCCCCTTTAGATTTGAGAATTCTTGGTTGAAAAAAGACTCTTTTCGTTGTCTTATGGAA
AATTGGTGGTCACAAAACACCATTCAAGGTTGGCCAGGCCATGGGTTTATGATGAAGCTTAAAGGATTGAAATCTGAAATCAGAAAATGGAATTTATCTCAGCGTTCATC
TGCTGATCAACTTCCATCTCTGGTCTCACAGTTGAAATTGTTGGATGATACAGAAGACATGGATACTTTGGCCAACATGTTTGATATTGTTAAAATTTTTGAGTTAGCTT
CTGGATTGAATATTAATTATTCCAAGAGTGAGGTTTTGGGAATTCATTTAGAGGAGTCAGAATTGGATTGGTTGACATCTACGTTTGGCTGTAAACAAGGATTTTGGCCT
TCTACTTACCTTGGTTTACCATTGGGAGGCAATTCTAAAACCATTCCTTTTTGGCAGCCTGTGATTGAAAGAATCCAACATAAACTTCATAGCTGGAAATATTCATATAT
TTCGAAAGGTGGTCGACATACTCTTACTCAAGCAGTTCTTTCCAGTATGCCAATATATTATCTATCATTATTCAAATTGTCGGGACAGATTGCAAAAACTCTTGATAAGT
TACTTCGTGATTTCTTTTGGGAAGGATCTAGAGGTGATGGTGGTATGCACAATATTAATTGGGCAACAGTTCAACTTCCACACTTGATGGGGGGTATTGGTATTGGCAAT
TTTCAAAATCGCAATCTTGCTCTTCTTGCAAAGTGGATCTGGAGATTTTTACATGAGGAAAACTCTCTATGGCATAAGCTGATTGTAGCTAAATATTATAACTCTGAGTT
GCCTAGTCTTTGGCCTAGCATTATTCAGAAAAGTTCTCACAAATCTCCTTGGCGATTTATTACTTCCACTATTGACCTTGTATCTTCACGTGTAAAAAGAAGATTGGGTA
ATGGTCTTGCTACATCATTTTGGCATGATTCGTGGTTAAGTTGTGGTGTTCTGGCTACAAATTTTCCTCGCCTTTATCGTTTAACAGATCGTCCGAGGAGTTTGGTTGGT
GAAACATGGATTGCTACTCAATCAGCATGGGATCTAAATCTTCGGCGTAATTTAAATGATGTAGAGACAGAGGAATGGATGGACTTATCACTTATTCTTTCCTCCATCAG
ATTACAGAACCGTAATGATTCCTGGATTTGGCCTTTGGAATCGTCCAATATTTTTTCTGTTAAATCTCTCATGGAAGACTTAGTAGACCGTCCGAATATGGCAAATGATC
TATATAAGGTCATTTGGTCAGATTTCTATCCAAAGAAGATCAAGATTTTTTTATGGGAACTCAGTCATGGTGCTATTAATACAGCTGATCGTCTTCAACGACGGATGCCT
CATTTTCATTTGTCTCCATCTTGGTGCATAATGTGTGCTGCTAGTTCAGAACATCCTAGGCATCTATTTGTCCATTGTACCTTCGCTTCCAGATATTGGTCAGAGATTCT
TGATGCTTTTGGATGGTCCATCGTTTTTCCAAATTGCATTAAGGATGTTCTTACTCTCATTTTTGTGGATCATCCCTTTCGTGGAGAAAAGAAGATCTTGTGGCTTGCTT
TGAACAGAGTTTTCTTCTGGTTTTTATGGGGCGAACGAAATTCTCGAATTTTCAGGGATTCTTTCTCTTCCTTTGATAAATTTATGGAGCTAATTCTTTTTCATGCATTG
TATTGGTGTAAATGTAAACACCCTTTCTCTGACTATAGTTTATCCTTTTTAATTTCCAATTGGAAAGCATTTATGTAA
mRNA sequenceShow/hide mRNA sequence
ATGAAAACCACAAAGATTCATCCATATCTGCCATGGAATCACTCTACTCGATCCATATCCATAGATAGGAAAACCTTCACAATAGCCTTTGATGAACACTTTAGAGGAAG
TAGAGCCAAGATAACCGAATACAGCAGATATTCATCTCATTCGATTTCCCTTTCTTGGAAATCTCTAAAATGGCTTGCCTTACCTTTCAACACAATTGTTCACTCACCAT
GTTCGCACAAATTCTTCTCGGATCTGAGGAGCGAAGAATACACTCTTTGGCTGGAAAAACTGAATAACAAGAATGGTTTTTATGTGGAAATTAACCAGGTGCAAAATTCT
GGTAGCCGACAAAGGATCCTTATCCCCTCGGAAAACAACAAACAAGGTTGGTTCTCTTTTTTTTCGCTCATCTCAGAATACCCTACTGAAGCTCATCGCCAGCCCACACA
ACCATCACCTCCATCATTCAAGGACATCCTTCAAACAAAACCACCAACAGCCGCCATTACTCCTTCCTTGAAAGGGCCCGTGAAGGAAGCTTCTGTCTCCACACATGCTG
AAGAATGGAAAGAAATTATTGTTCTCCAACGATGCAATCAACATGACGACTGGCCTAGTATCCATCAATCACTAATTAACGGGTTGTCTCTTCGATGTAGCATCAACCCT
TTCCACGCTAACAAAGCCATGCTCCATGTATATGATCAAGGCACTGCTACAAACTTGTGTTCTCACTCGGATTGGACCCATATTGGTAAGCATAAATTGAAGTTTTATCC
ATTAACCACTGCCTCTGCTCAACAGGATATTATGACACCATCTTATGGAGGTTGGATTGAGATTTCTCTTCTTCCCCCTACCTTATGGACTGAGCACATATTCCGTTTCA
TTGGAGATATTTGCGGCGGCTTTGTGGAAACATCTAACCTCACTAGTCGGATGATTGTTGCTACTGAAGCTAGGATAAAAGTTTGGCCAAATGTTACAGGTTTCATTCCT
GCAGCCGTCAAACTCTCACAGGACCTTGCCGGCGTTGACCTTACGGTTCATATTCGAGGAATTTCCGGCAGCCCACAGAGAATCGTTCACATTAATGATAAAATTAATGA
GGAAATTCCCAATATGGCATCTAAGGATATTGTTTTTAAGAAGAGAGAGGAATCAGAGAACGTGTGTTCGATTGCTAAATCGAAAATGATCTCCTCGCCAGCAGTTATGC
CTAAAATCTCGGTACCAGTTCATATCTCCCCTTCACCGCCTAAAATATCGGTACCAGAACATATTTCCCCTCCTCCGCTGTCATCTGATCATTTGAATAAAGGGAAGCTC
CCTCTCGAAGCGCCTTTCCCTGGGCCTGAATCATCGATTATACAAATCACAGAACCCACAAATCTTAGATGCGGCAATATTGGATCTACATCAAAGCCCAATTTGATCGA
GCAGCCAAACCATAATAATCCTCCCAAACCTCTAGAATCCTTACGGCTTTCTCCTCACCCTTCCCCACGCCACCTTGGCTTTAATGGAGGGGGCCGAGAATATTATAAGA
AATTTTTGACATGGAATGTGCGTGGATTGGGGTCTTGGAAGAAAAGAGCTTTAATTAAGAAAACTATTCAACAACAAAACCCGGGCATTGTTCTTATTCAGGAGACTAAA
AAATCATTGATTTGCAGTAGGATTATTAAATCCCTTTGGAGCTCCTCTCATATTGGTTGGACTTCTCTCGACTCAGTGGGCGCCTCTGGAGGCATTCTTATTATGTGGAG
TGAACCAGAATTTTCAGTAAAGGAGACTATTCAAGGTCTTTTCTCTCTCTCTATTCATATCGTTCTGGCTGATAATTTCTCTTTTTGGCTATCGACTATTTATGGCCCTT
CTAGACATGCTGATAGATCGGAATTCTGGAATGAACTACACGACTTGGCTGGTTTAGGTGGTGACAATTGGATTCTTGGAGGAGATTTTAATGTCACACGTTGGTCCTGG
GAAAAATCGCATGGTCGACCCGTGACTAGGAGTATGCGTATTTTTAACCAATGGATTGCTGACTACCATCTTATAGACACCCCTTTACAGAATGGCTGCTATACGTGGTC
CAGTTGTGGTGAAAATCATTATTGCTCATTGATTGATCGATTCTTAATGACGGATACCTGTCTCAATAAATTTGGTGTAGCTCGTTTTCTTCGCCTTGATAGGGTTACAT
CGGATCATTATCCTTGTACTCTATCTTTTGGGGATCTCTCTTGGGGCCCTTGCCCCTTTAGATTTGAGAATTCTTGGTTGAAAAAAGACTCTTTTCGTTGTCTTATGGAA
AATTGGTGGTCACAAAACACCATTCAAGGTTGGCCAGGCCATGGGTTTATGATGAAGCTTAAAGGATTGAAATCTGAAATCAGAAAATGGAATTTATCTCAGCGTTCATC
TGCTGATCAACTTCCATCTCTGGTCTCACAGTTGAAATTGTTGGATGATACAGAAGACATGGATACTTTGGCCAACATGTTTGATATTGTTAAAATTTTTGAGTTAGCTT
CTGGATTGAATATTAATTATTCCAAGAGTGAGGTTTTGGGAATTCATTTAGAGGAGTCAGAATTGGATTGGTTGACATCTACGTTTGGCTGTAAACAAGGATTTTGGCCT
TCTACTTACCTTGGTTTACCATTGGGAGGCAATTCTAAAACCATTCCTTTTTGGCAGCCTGTGATTGAAAGAATCCAACATAAACTTCATAGCTGGAAATATTCATATAT
TTCGAAAGGTGGTCGACATACTCTTACTCAAGCAGTTCTTTCCAGTATGCCAATATATTATCTATCATTATTCAAATTGTCGGGACAGATTGCAAAAACTCTTGATAAGT
TACTTCGTGATTTCTTTTGGGAAGGATCTAGAGGTGATGGTGGTATGCACAATATTAATTGGGCAACAGTTCAACTTCCACACTTGATGGGGGGTATTGGTATTGGCAAT
TTTCAAAATCGCAATCTTGCTCTTCTTGCAAAGTGGATCTGGAGATTTTTACATGAGGAAAACTCTCTATGGCATAAGCTGATTGTAGCTAAATATTATAACTCTGAGTT
GCCTAGTCTTTGGCCTAGCATTATTCAGAAAAGTTCTCACAAATCTCCTTGGCGATTTATTACTTCCACTATTGACCTTGTATCTTCACGTGTAAAAAGAAGATTGGGTA
ATGGTCTTGCTACATCATTTTGGCATGATTCGTGGTTAAGTTGTGGTGTTCTGGCTACAAATTTTCCTCGCCTTTATCGTTTAACAGATCGTCCGAGGAGTTTGGTTGGT
GAAACATGGATTGCTACTCAATCAGCATGGGATCTAAATCTTCGGCGTAATTTAAATGATGTAGAGACAGAGGAATGGATGGACTTATCACTTATTCTTTCCTCCATCAG
ATTACAGAACCGTAATGATTCCTGGATTTGGCCTTTGGAATCGTCCAATATTTTTTCTGTTAAATCTCTCATGGAAGACTTAGTAGACCGTCCGAATATGGCAAATGATC
TATATAAGGTCATTTGGTCAGATTTCTATCCAAAGAAGATCAAGATTTTTTTATGGGAACTCAGTCATGGTGCTATTAATACAGCTGATCGTCTTCAACGACGGATGCCT
CATTTTCATTTGTCTCCATCTTGGTGCATAATGTGTGCTGCTAGTTCAGAACATCCTAGGCATCTATTTGTCCATTGTACCTTCGCTTCCAGATATTGGTCAGAGATTCT
TGATGCTTTTGGATGGTCCATCGTTTTTCCAAATTGCATTAAGGATGTTCTTACTCTCATTTTTGTGGATCATCCCTTTCGTGGAGAAAAGAAGATCTTGTGGCTTGCTT
TGAACAGAGTTTTCTTCTGGTTTTTATGGGGCGAACGAAATTCTCGAATTTTCAGGGATTCTTTCTCTTCCTTTGATAAATTTATGGAGCTAATTCTTTTTCATGCATTG
TATTGGTGTAAATGTAAACACCCTTTCTCTGACTATAGTTTATCCTTTTTAATTTCCAATTGGAAAGCATTTATGTAA
Protein sequenceShow/hide protein sequence
MKTTKIHPYLPWNHSTRSISIDRKTFTIAFDEHFRGSRAKITEYSRYSSHSISLSWKSLKWLALPFNTIVHSPCSHKFFSDLRSEEYTLWLEKLNNKNGFYVEINQVQNS
GSRQRILIPSENNKQGWFSFFSLISEYPTEAHRQPTQPSPPSFKDILQTKPPTAAITPSLKGPVKEASVSTHAEEWKEIIVLQRCNQHDDWPSIHQSLINGLSLRCSINP
FHANKAMLHVYDQGTATNLCSHSDWTHIGKHKLKFYPLTTASAQQDIMTPSYGGWIEISLLPPTLWTEHIFRFIGDICGGFVETSNLTSRMIVATEARIKVWPNVTGFIP
AAVKLSQDLAGVDLTVHIRGISGSPQRIVHINDKINEEIPNMASKDIVFKKREESENVCSIAKSKMISSPAVMPKISVPVHISPSPPKISVPEHISPPPLSSDHLNKGKL
PLEAPFPGPESSIIQITEPTNLRCGNIGSTSKPNLIEQPNHNNPPKPLESLRLSPHPSPRHLGFNGGGREYYKKFLTWNVRGLGSWKKRALIKKTIQQQNPGIVLIQETK
KSLICSRIIKSLWSSSHIGWTSLDSVGASGGILIMWSEPEFSVKETIQGLFSLSIHIVLADNFSFWLSTIYGPSRHADRSEFWNELHDLAGLGGDNWILGGDFNVTRWSW
EKSHGRPVTRSMRIFNQWIADYHLIDTPLQNGCYTWSSCGENHYCSLIDRFLMTDTCLNKFGVARFLRLDRVTSDHYPCTLSFGDLSWGPCPFRFENSWLKKDSFRCLME
NWWSQNTIQGWPGHGFMMKLKGLKSEIRKWNLSQRSSADQLPSLVSQLKLLDDTEDMDTLANMFDIVKIFELASGLNINYSKSEVLGIHLEESELDWLTSTFGCKQGFWP
STYLGLPLGGNSKTIPFWQPVIERIQHKLHSWKYSYISKGGRHTLTQAVLSSMPIYYLSLFKLSGQIAKTLDKLLRDFFWEGSRGDGGMHNINWATVQLPHLMGGIGIGN
FQNRNLALLAKWIWRFLHEENSLWHKLIVAKYYNSELPSLWPSIIQKSSHKSPWRFITSTIDLVSSRVKRRLGNGLATSFWHDSWLSCGVLATNFPRLYRLTDRPRSLVG
ETWIATQSAWDLNLRRNLNDVETEEWMDLSLILSSIRLQNRNDSWIWPLESSNIFSVKSLMEDLVDRPNMANDLYKVIWSDFYPKKIKIFLWELSHGAINTADRLQRRMP
HFHLSPSWCIMCAASSEHPRHLFVHCTFASRYWSEILDAFGWSIVFPNCIKDVLTLIFVDHPFRGEKKILWLALNRVFFWFLWGERNSRIFRDSFSSFDKFMELILFHAL
YWCKCKHPFSDYSLSFLISNWKAFM