; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0012030 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0012030
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr1:36570762..36574727
RNA-Seq ExpressionLag0012030
SyntenyLag0012030
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
OMO59710.1 reverse transcriptase [Corchorus capsularis]2.5e-13336.02Show/hide
Query:  RPISLCNVSYKLISKVLVNRMKFVLNKVISQNQSAFIPGRCVVDNAILGFECIHGLKKRTRGSSRWAALKLDMSKAYDRVEWSFIQQIMLRLGFAQEWVD
        RPISLCNV YK+ISKVLVNR+K +L   IS++QSAF+PGR + DN ++ FE +H LK R  G   + ALKLDMSKAYDRVEW F++ IMLR+GF + WV+
Subjt:  RPISLCNVSYKLISKVLVNRMKFVLNKVISQNQSAFIPGRCVVDNAILGFECIHGLKKRTRGSSRWAALKLDMSKAYDRVEWSFIQQIMLRLGFAQEWVD

Query:  LIMRCVSSVSFSFNLNGEKLGQVVPTRGLQQGDSLSPYLFLLCAEDLSSLLRGVEQRSLISGFQLARSCPSISHLFFADDSLLFFRANAAESSCVRDLLL
        LIMRCV SVSFS  +NG+      P  GL+QGD LSPYLFL+C E LS+LL   +   L+SG  ++R+ P +SHLFFADDSLLF +AN+AES  V+D L 
Subjt:  LIMRCVSSVSFSFNLNGEKLGQVVPTRGLQQGDSLSPYLFLLCAEDLSSLLRGVEQRSLISGFQLARSCPSISHLFFADDSLLFFRANAAESSCVRDLLL

Query:  LYERASGQTINYEKSVVAFSPNTGDDCKQYVSLILSVVCKPCHSQYLGLPSFMPRSR------------------------SGGKEVLLKSIVQAIPYYT
        +YE  SGQ IN+EKSVV FS N     +  V  I +V  +    +YLGLP+F+ R++                         GG+EV++KS++QAIP Y 
Subjt:  LYERASGQTINYEKSVVAFSPNTGDDCKQYVSLILSVVCKPCHSQYLGLPSFMPRSR------------------------SGGKEVLLKSIVQAIPYYT

Query:  MNCFRLPKCLVSEIHRLMANFWWDNPDGGKRIHWLSWKSLCRPKCRGGLGFRDMELFNQSLLAKQCW---------------------------------
        MN F  P+ L ++I+ +++ FWW      + I+W+ W+ LC+ K  GG+GFRDME FNQ+LLAKQ W                                 
Subjt:  MNCFRLPKCLVSEIHRLMANFWWDNPDGGKRIHWLSWKSLCRPKCRGGLGFRDMELFNQSLLAKQCW---------------------------------

Query:  -LVWADNLHISEGACYG-----------GVIY--WLEAVVG-------------------------HWDGEKIRAHFTVADCDAILRISLGSLLSEDQLI
           W   L   +   YG            +++  W+  + G                          WDG+ IR+ F   + +AI++I L   L  D L+
Subjt:  -LVWADNLHISEGACYG-----------GVIY--WLEAVVG-------------------------HWDGEKIRAHFTVADCDAILRISLGSLLSEDQLI

Query:  WHFEKNGFFSVRSGYRLAHSLSIQDQASSSDSTIWQGWWSSLWKMNVPSKIKFFFWRLSHNRLPTKDNLLKRGMEISNMCVMCQSFLEDCLHVFWDCPMI
        WHF+  G +SV+SGYR+  +L  + + +  +    + ++  +W  +VP K++ F WRL    L   D+L  R M++   C  C+   E   H    CP  
Subjt:  WHFEKNGFFSVRSGYRLAHSLSIQDQASSSDSTIWQGWWSSLWKMNVPSKIKFFFWRLSHNRLPTKDNLLKRGMEISNMCVMCQSFLEDCLHVFWDCPMI

Query:  KSMWCCSKFSFLRQSCFGLSFD------------------SLL--WAI----RDRAFQLALGRDVQQSVALHQ-----SQAVEAAVL--------WVPPA
          +WC     FL    + L  D                  SLL  WAI        F+    R +  ++         S+  +AA +        W PPA
Subjt:  KSMWCCSKFSFLRQSCFGLSFD------------------SLL--WAI----RDRAFQLALGRDVQQSVALHQ-----SQAVEAAVL--------WVPPA

Query:  VNELKLNVDASVRSVSGEAYGGFVLRNERGEVLLTACEILPKCWSVDLAEGWAMLRGIQIAHQMGFSRFHVEIDSLRLSRALNDEVIDISEVGAIMDMVR
         N +K+N D +  S       G + RN  G VL      +       +AE +A +R +  + +MGFS   +E D+L + R +N   +D S VGA +   +
Subjt:  VNELKLNVDASVRSVSGEAYGGFVLRNERGEVLLTACEILPKCWSVDLAEGWAMLRGIQIAHQMGFSRFHVEIDSLRLSRALNDEVIDISEVGAIMDMVR

Query:  SLRQRDSSCRVLFTPRQGN
        SL+   SSC V    R GN
Subjt:  SLRQRDSSCRVLFTPRQGN

XP_030479133.1 uncharacterized protein LOC115696372 [Cannabis sativa]5.1e-13437.19Show/hide
Query:  RPISLCNVSYKLISKVLVNRMKFVLNKVISQNQSAFIPGRCVVDNAILGFECIHGLKKRTRGSSRWAALKLDMSKAYDRVEWSFIQQIMLRLGFAQEWVD
        RPISLCNV YK+ISK+L  R+K VL+ VIS+ QSAF+  R + DN ++ FE IH LK R RGS  +AALK DMSKA+DRVEWSFI  +M ++GF   W+ 
Subjt:  RPISLCNVSYKLISKVLVNRMKFVLNKVISQNQSAFIPGRCVVDNAILGFECIHGLKKRTRGSSRWAALKLDMSKAYDRVEWSFIQQIMLRLGFAQEWVD

Query:  LIMRCVSSVSFSFNLNGEKLGQVVPTRGLQQGDSLSPYLFLLCAEDLSSLLRGVEQRSLISGFQLARSCPSISHLFFADDSLLFFRANAAESSCVRDLLL
        LIM C+ +  FSFN+NGE +G V P RGL+QGD LSPYLFL+C+E LS LL+  EQ   + G  ++R  PSISHLFFADDSLLF +AN      ++  L 
Subjt:  LIMRCVSSVSFSFNLNGEKLGQVVPTRGLQQGDSLSPYLFLLCAEDLSSLLRGVEQRSLISGFQLARSCPSISHLFFADDSLLFFRANAAESSCVRDLLL

Query:  LYERASGQTINYEKSVVAFSPNTGDDCKQYVSLILSVVCKPCHSQYLGLPSFMPRSRS------------------------GGKEVLLKSIVQAIPYYT
        +Y RASGQ +N +KSV++FSPNT    +     IL +    CH  YLGLP++  R +S                        GGKEVLLK++VQ+IP Y 
Subjt:  LYERASGQTINYEKSVVAFSPNTGDDCKQYVSLILSVVCKPCHSQYLGLPSFMPRSRS------------------------GGKEVLLKSIVQAIPYYT

Query:  MNCFRLPKCLVSEIHRLMANFWWDNPDGGKRIHWLSWKSLCRPKCRGGLGFRDMELFNQSLLAKQCWLVWADNLHISEGACYG-----------------
        M+CFRLP  L +EI  +MA FWW +    K+IHW  W+ LC+ K  GG+GFR    FNQ+LLAKQ W ++ D   +      G                 
Subjt:  MNCFRLPKCLVSEIHRLMANFWWDNPDGGKRIHWLSWKSLCRPKCRGGLGFRDMELFNQSLLAKQCWLVWADNLHISEGACYG-----------------

Query:  -----GVIYWLEAVV----------------------GH-------------------------WDGEKIRAHFTVADCDAILRISLGSLLSEDQLIWHF
             G+++  E +V                      GH                         W+ E +++ F+  D D IL+I L  L   D+ IWH+
Subjt:  -----GVIYWLEAVV----------------------GH-------------------------WDGEKIRAHFTVADCDAILRISLGSLLSEDQLIWHF

Query:  EKNGFFSVRSGYRLAHSLSIQDQASSSDSTIWQGWWSSLWKMNVPSKIKFFFWRLSHNRLPTKDNLLKRGMEISNMCVMCQSFLEDCLHVFWDCPMIKSM
        E +G +SV SGY LA SL  +D +S S +   + WW S WK+N+PSK+K F W++  + +P   +L  R +  S  C +CQS  E   H  + C   K +
Subjt:  EKNGFFSVRSGYRLAHSLSIQDQASSSDSTIWQGWWSSLWKMNVPSKIKFFFWRLSHNRLPTKDNLLKRGMEISNMCVMCQSFLEDCLHVFWDCPMIKSM

Query:  WCCSKFSF-------LRQSCFGLSFDS------------LLWAIRDRAFQLALGRDVQ-------QSVA-LHQSQAVEAAVL--------------WVPP
        W  S FS        L+   + +   S            L+W I         G+ V+       QSVA + Q +++ +AV               W PP
Subjt:  WCCSKFSF-------LRQSCFGLSFDS------------LLWAIRDRAFQLALGRDVQ-------QSVA-LHQSQAVEAAVL--------------WVPP

Query:  AVNELKLNVDASVRSVSGEAYGGFVLRNERGEVLLTACEILPKCWSVDLAEGWAMLRGIQIAHQMGFSRFHVEIDSLRLSRALNDEVIDISEVGAIMDMV
          N  KLNVDA++ S   +   G ++RN  G+V           +     E  AM  G+  A         VE D L L  ALN    D+S+     D+V
Subjt:  AVNELKLNVDASVRSVSGEAYGGFVLRNERGEVLLTACEILPKCWSVDLAEGWAMLRGIQIAHQMGFSRFHVEIDSLRLSRALNDEVIDISEVGAIMDMV

Query:  RSLR
          ++
Subjt:  RSLR

XP_030483669.1 uncharacterized protein LOC115700241 [Cannabis sativa]3.3e-13335.27Show/hide
Query:  LCPQLSIRFYPQNGRLVESAMLATLTH-AD----QRIISTLNPT----------RPISLCNVSYKLISKVLVNRMKFVLNKVISQNQSAFIPGRCVVDNA
        +CP  S+      G LV  A+L  L + AD     + I TL P           RPISLC V YKLISK +V R++  L  VIS+ QSAF+  R + DN 
Subjt:  LCPQLSIRFYPQNGRLVESAMLATLTH-AD----QRIISTLNPT----------RPISLCNVSYKLISKVLVNRMKFVLNKVISQNQSAFIPGRCVVDNA

Query:  ILGFECIHGLKKRTRGSSRWAALKLDMSKAYDRVEWSFIQQIMLRLGFAQEWVDLIMRCVSSVSFSFNLNGEKLGQVVPTRGLQQGDSLSPYLFLLCAED
        ++ FE +H LK R RG+  +AA+KLDMSKA+DRVEW ++ Q+ML++GF    +DLI+RC+ SV++SF LNG+ LG++VPTRG++QGD LSPYLFL+CAE 
Subjt:  ILGFECIHGLKKRTRGSSRWAALKLDMSKAYDRVEWSFIQQIMLRLGFAQEWVDLIMRCVSSVSFSFNLNGEKLGQVVPTRGLQQGDSLSPYLFLLCAED

Query:  LSSLLRGVEQRSLISGFQLARSCPSISHLFFADDSLLFFRANAAESSCVRDLLLLYERASGQTINYEKSVVAFSPNTGDDCKQYVSLILSVVCKPCHSQY
        LS LL+  EQ   + G +++R+ PS+SHLFFADDS+LF RAN   +  ++ +L +Y RASGQ +N +K V++FSPNT    + +   +L++  +PCH +Y
Subjt:  LSSLLRGVEQRSLISGFQLARSCPSISHLFFADDSLLFFRANAAESSCVRDLLLLYERASGQTINYEKSVVAFSPNTGDDCKQYVSLILSVVCKPCHSQY

Query:  LGLPSFMPRSRS------------------------GGKEVLLKSIVQAIPYYTMNCFRLPKCLVSEIHRLMANFWWDNPDGGKRIHWLSWKSLCRPKCR
        LGLPSF  R ++                        GGKEVLLK++VQAIP Y M+CFRLP  L ++I  +M+NFWW +   G  IHW +W SLC+ K  
Subjt:  LGLPSFMPRSRS------------------------GGKEVLLKSIVQAIPYYTMNCFRLPKCLVSEIHRLMANFWWDNPDGGKRIHWLSWKSLCRPKCR

Query:  GGLGFRDMELFNQSLLAKQCWL----------------------VWADNLHISEGACYGGVIYWLEAVV----------------------GH-------
        GGLGFR+  LFNQ+LLAKQ W                       + A  L  +    +  +++  E +V                      GH       
Subjt:  GGLGFRDMELFNQSLLAKQCWL----------------------VWADNLHISEGACYGGVIYWLEAVV----------------------GH-------

Query:  -------------------WDGEKIRAHFTVADCDAILRISLGSLLSEDQLIWHFEKNGFFSVRSGYRLAHSLSIQDQASSSDSTIWQGWWSSLWKMNVP
                           WD   I A+F  AD D IL I L    +ED LIW+   +G ++V+SGY+ A  +S+ D   ++ ST    WW+  WK+ +P
Subjt:  -------------------WDGEKIRAHFTVADCDAILRISLGSLLSEDQLIWHFEKNGFFSVRSGYRLAHSLSIQDQASSSDSTIWQGWWSSLWKMNVP

Query:  SKIKFFFWRLSHNRLPTKDNLLKRGMEISNMCVMCQSFLEDCLHVFWDCPMIKSMWCCSKFSF-LRQSCFGLSFDSLLW---------------------
        SKI+ F W++ HN LP    L ++ +  +  C +C+   E   H  + C   K +W  S   F  + +    S D LL+                     
Subjt:  SKIKFFFWRLSHNRLPTKDNLLKRGMEISNMCVMCQSFLEDCLHVFWDCPMIKSMWCCSKFSF-LRQSCFGLSFDSLLW---------------------

Query:  ----------------AIRDRAFQLALGRDVQQSVALHQ-----SQAVEAAV-----------LWVPPAVNELKLNVDASVRSVSGEAYGGFVLRNERGE
                        A+ D A    L     Q+ + H      S A  A+V           LW  P + +LKLN DA+     G+   G V+R+  G 
Subjt:  ----------------AIRDRAFQLALGRDVQQSVALHQ-----SQAVEAAV-----------LWVPPAVNELKLNVDASVRSVSGEAYGGFVLRNERGE

Query:  VLLTACEILPKCWSVDLAEGWAMLRGIQIAHQMGFSRFHVEIDSL
        ++    + +  C+  +  E  A+   ++ A  +G S   +E DSL
Subjt:  VLLTACEILPKCWSVDLAEGWAMLRGIQIAHQMGFSRFHVEIDSL

XP_030509188.1 uncharacterized protein LOC115723863 [Cannabis sativa]6.0e-13536.44Show/hide
Query:  LSIRFYPQN----GRLVESAMLATLTH-ADQRIISTL---------NPT-----RPISLCNVSYKLISKVLVNRMKFVLNKVISQNQSAFIPGRCVVDNA
        LS+ FY       G+ V +A+L  L + AD    +T           PT     RPISLCNV YKL+SK +V R+K  L +VIS+ QSAFI  R + DN 
Subjt:  LSIRFYPQN----GRLVESAMLATLTH-ADQRIISTL---------NPT-----RPISLCNVSYKLISKVLVNRMKFVLNKVISQNQSAFIPGRCVVDNA

Query:  ILGFECIHGLKKRTRGSSRWAALKLDMSKAYDRVEWSFIQQIMLRLGFAQEWVDLIMRCVSSVSFSFNLNGEKLGQVVPTRGLQQGDSLSPYLFLLCAED
        ++ FE +H LK R RGS  +AA+KLDMSKA+DRVEW+F+ Q+ML++GF    VDLI+RC+++V++SF LNG   G V P+RG++QGD LSPYLFL+CAE 
Subjt:  ILGFECIHGLKKRTRGSSRWAALKLDMSKAYDRVEWSFIQQIMLRLGFAQEWVDLIMRCVSSVSFSFNLNGEKLGQVVPTRGLQQGDSLSPYLFLLCAED

Query:  LSSLLRGVEQRSLISGFQLARSCPSISHLFFADDSLLFFRANAAESSCVRDLLLLYERASGQTINYEKSVVAFSPNTGDDCKQYVSLILSVVCKPCHSQY
        LS LL+  E    + G +++R+ PS+SHLFFADDS+LF RAN   +  +   L  Y +ASGQ IN +K V++FS NT    + + + +L +  +PCH QY
Subjt:  LSSLLRGVEQRSLISGFQLARSCPSISHLFFADDSLLFFRANAAESSCVRDLLLLYERASGQTINYEKSVVAFSPNTGDDCKQYVSLILSVVCKPCHSQY

Query:  LGLPSFMPRSR------------------------SGGKEVLLKSIVQAIPYYTMNCFRLPKCLVSEIHRLMANFWWDNPDGGKRIHWLSWKSLCRPKCR
        LGLPSF  R +                        +GGKE+LLK++VQAIP Y M+CFRLP  L  +I  +MANFWW +   GK IHW +W  LC+ K +
Subjt:  LGLPSFMPRSR------------------------SGGKEVLLKSIVQAIPYYTMNCFRLPKCLVSEIHRLMANFWWDNPDGGKRIHWLSWKSLCRPKCR

Query:  GGLGFRDMELFNQSLLAKQCWLV----------------WADNLHISEGACYGGVIYW-------------LEAVVGHWDGEKIRAHFTVADCDAILRIS
        GGLGFR+   FNQ+LLAKQ W +                +++   +S G      + W             L      WD   +RA+F+  D D IL I 
Subjt:  GGLGFRDMELFNQSLLAKQCWLV----------------WADNLHISEGACYGGVIYW-------------LEAVVGHWDGEKIRAHFTVADCDAILRIS

Query:  LGSLLSEDQLIWHFEKNGFFSVRSGYRLAHSLSIQDQASSSDSTIWQGWWSSLWKMNVPSKIKFFFWRLSHNRLPTKDNLLKRGMEISNMCVMCQSFLED
        L     +D +IW     G ++V+SGY+LA S + QD  +SS S   + WWS+ WKM +P K++ F W++ H+ LP    L +R +  S  C +C S  E 
Subjt:  LGSLLSEDQLIWHFEKNGFFSVRSGYRLAHSLSIQDQASSSDSTIWQGWWSSLWKMNVPSKIKFFFWRLSHNRLPTKDNLLKRGMEISNMCVMCQSFLED

Query:  CLHVFWDCPMIKSMWCCSK----FSFLRQSCFGLSFDSLLWAIRDRAFQLAL---------------GRDVQQSVAL-----------------------
          H  +DCP  K++W  S     F  LRQS        L  A+    F+L L               G  V+ S A+                       
Subjt:  CLHVFWDCPMIKSMWCCSK----FSFLRQSCFGLSFDSLLWAIRDRAFQLAL---------------GRDVQQSVAL-----------------------

Query:  --------HQSQAVEAAVLWVPPAVNELKLNVDASVRSVSGEAYGGFVLRNERGEVLLTACEILPKCWSVDLAEGWAMLRGIQIAHQMGFSRFHVEIDSL
                  S     A  W  P    LKLN DA++     +   G  LRN  G ++    + L   +  +  E   +   +        S   +E DSL
Subjt:  --------HQSQAVEAAVLWVPPAVNELKLNVDASVRSVSGEAYGGFVLRNERGEVLLTACEILPKCWSVDLAEGWAMLRGIQIAHQMGFSRFHVEIDSL

Query:  RLSRALNDEVIDISEVGAIMDMVRSL
         + + L      +S   AI++ +  L
Subjt:  RLSRALNDEVIDISEVGAIMDMVRSL

XP_039834390.1 uncharacterized protein LOC120695147 [Panicum virgatum]4.1e-13636.36Show/hide
Query:  LNPTRPISLCNVSYKLISKVLVNRMKFVLNKVISQNQSAFIPGRCVVDNAILGFECIHGLKKRTRGSSRWAALKLDMSKAYDRVEWSFIQQIMLRLGFAQ
        L   RPISLCNV YKLISKVL NR+K VL ++IS +QSAF+PGR + DN +L +E  H L +R  G +  AA+KLDMSKAYDRVEW F++++ML+LGFA 
Subjt:  LNPTRPISLCNVSYKLISKVLVNRMKFVLNKVISQNQSAFIPGRCVVDNAILGFECIHGLKKRTRGSSRWAALKLDMSKAYDRVEWSFIQQIMLRLGFAQ

Query:  EWVDLIMRCVSSVSFSFNLNGEKLGQVVPTRGLQQGDSLSPYLFLLCAEDLSSLLRGVEQRSLISGFQLARSCPSISHLFFADDSLLFFRANAAESSCVR
        +WV+ +M+CVS+VS+   +NG+   ++ P RGL+QG+ LSPYLF+LCAE LS+LL+  E++  I G ++ R  P I+HLFFADDSL+  RAN  ++  ++
Subjt:  EWVDLIMRCVSSVSFSFNLNGEKLGQVVPTRGLQQGDSLSPYLFLLCAEDLSSLLRGVEQRSLISGFQLARSCPSISHLFFADDSLLFFRANAAESSCVR

Query:  DLLLLYERASGQTINYEKSVVAFSPNTGDDCKQYVSLILSVVCKPCHSQYLGLPSFMPRSR------------------------SGGKEVLLKSIVQAI
         +L +YE  SGQ IN +KS V FSPNT  D K+ V   L++V +  + +YLGLP  + +SR                          GKE+L+K++ QAI
Subjt:  DLLLLYERASGQTINYEKSVVAFSPNTGDDCKQYVSLILSVVCKPCHSQYLGLPSFMPRSR------------------------SGGKEVLLKSIVQAI

Query:  PYYTMNCFRLPKCLVSEIHRLMANFWWDNPDGGKRIHWLSWKSLCRPKCRGGLGFRDMELFNQSLLAKQCW--LVWADNL--------------------
        P Y M+CF L K L  E+  ++  +WW   +   +IHWL+W  L  PK +GGLGFRD+ LFNQ++LA+Q W  LV  D L                    
Subjt:  PYYTMNCFRLPKCLVSEIHRLMANFWWDNPDGGKRIHWLSWKSLCRPKCRGGLGFRDMELFNQSLLAKQCW--LVWADNL--------------------

Query:  ----------------HISEG---------------------------ACYGG------VIYWLEAVVGHWDGEKIRAHFTVADCDAILRISLGSLLSED
                         I EG                           A Y G      V+  L+ V G WD   ++  F+  D +AIL++ + +L  ED
Subjt:  ----------------HISEG---------------------------ACYGG------VIYWLEAVVGHWDGEKIRAHFTVADCDAILRISLGSLLSED

Query:  QLIWHFEKNGFFSVRSGYRLA---HSLSIQDQASSSDSTIW--QGW-WSSLWKMNVPSKIKFFFWRLSHNRLPTKDNLLKRGMEISNMCVMCQSFLEDCL
        +  WHF+K G FSV+S Y+LA       +   A+SS+S      G+ W  +W M VP+K+K F WRL HN L  + NLL+RGM++  +C MCQ   ED  
Subjt:  QLIWHFEKNGFFSVRSGYRLA---HSLSIQDQASSSDSTIW--QGW-WSSLWKMNVPSKIKFFFWRLSHNRLPTKDNLLKRGMEISNMCVMCQSFLEDCL

Query:  HVFWDCPMIKSMW-------------CCSKFSFLRQSCFGLSFDSLL---------WAIRDRA---FQLALGRDVQQSVALH--------QSQAVEAAV-
        H+F+ C  +K  W              C     + Q+ + L     L         W+ R++A    +   G +V   V  +        +S  +   V 
Subjt:  HVFWDCPMIKSMW-------------CCSKFSFLRQSCFGLSFDSLL---------WAIRDRA---FQLALGRDVQQSVALH--------QSQAVEAAV-

Query:  --LWVPPAVNELKLNVDASVRSVSGEAYGGFVLRNERGEVLLTACEILPKCWSVDLAEGWAMLRGIQIAHQMGFSRFHVEIDSLRLSRALNDEVIDISEV
           W PP  +  K+N D + R+V+ +   GFV+RN  G+ L   C  L +  S    E  A+L  +    Q+G SR  +E D+  L R L    +D S  
Subjt:  --LWVPPAVNELKLNVDASVRSVSGEAYGGFVLRNERGEVLLTACEILPKCWSVDLAEGWAMLRGIQIAHQMGFSRFHVEIDSLRLSRALNDEVIDISEV

Query:  GAIMDMVRS-LRQRDSSCRVLFTPRQGNMGCQKTAD
        G+++  +R  +      C +   PR     C K AD
Subjt:  GAIMDMVRS-LRQRDSSCRVLFTPRQGNMGCQKTAD

TrEMBL top hitse value%identityAlignment
A0A1R3GNW3 Reverse transcriptase1.2e-13336.02Show/hide
Query:  RPISLCNVSYKLISKVLVNRMKFVLNKVISQNQSAFIPGRCVVDNAILGFECIHGLKKRTRGSSRWAALKLDMSKAYDRVEWSFIQQIMLRLGFAQEWVD
        RPISLCNV YK+ISKVLVNR+K +L   IS++QSAF+PGR + DN ++ FE +H LK R  G   + ALKLDMSKAYDRVEW F++ IMLR+GF + WV+
Subjt:  RPISLCNVSYKLISKVLVNRMKFVLNKVISQNQSAFIPGRCVVDNAILGFECIHGLKKRTRGSSRWAALKLDMSKAYDRVEWSFIQQIMLRLGFAQEWVD

Query:  LIMRCVSSVSFSFNLNGEKLGQVVPTRGLQQGDSLSPYLFLLCAEDLSSLLRGVEQRSLISGFQLARSCPSISHLFFADDSLLFFRANAAESSCVRDLLL
        LIMRCV SVSFS  +NG+      P  GL+QGD LSPYLFL+C E LS+LL   +   L+SG  ++R+ P +SHLFFADDSLLF +AN+AES  V+D L 
Subjt:  LIMRCVSSVSFSFNLNGEKLGQVVPTRGLQQGDSLSPYLFLLCAEDLSSLLRGVEQRSLISGFQLARSCPSISHLFFADDSLLFFRANAAESSCVRDLLL

Query:  LYERASGQTINYEKSVVAFSPNTGDDCKQYVSLILSVVCKPCHSQYLGLPSFMPRSR------------------------SGGKEVLLKSIVQAIPYYT
        +YE  SGQ IN+EKSVV FS N     +  V  I +V  +    +YLGLP+F+ R++                         GG+EV++KS++QAIP Y 
Subjt:  LYERASGQTINYEKSVVAFSPNTGDDCKQYVSLILSVVCKPCHSQYLGLPSFMPRSR------------------------SGGKEVLLKSIVQAIPYYT

Query:  MNCFRLPKCLVSEIHRLMANFWWDNPDGGKRIHWLSWKSLCRPKCRGGLGFRDMELFNQSLLAKQCW---------------------------------
        MN F  P+ L ++I+ +++ FWW      + I+W+ W+ LC+ K  GG+GFRDME FNQ+LLAKQ W                                 
Subjt:  MNCFRLPKCLVSEIHRLMANFWWDNPDGGKRIHWLSWKSLCRPKCRGGLGFRDMELFNQSLLAKQCW---------------------------------

Query:  -LVWADNLHISEGACYG-----------GVIY--WLEAVVG-------------------------HWDGEKIRAHFTVADCDAILRISLGSLLSEDQLI
           W   L   +   YG            +++  W+  + G                          WDG+ IR+ F   + +AI++I L   L  D L+
Subjt:  -LVWADNLHISEGACYG-----------GVIY--WLEAVVG-------------------------HWDGEKIRAHFTVADCDAILRISLGSLLSEDQLI

Query:  WHFEKNGFFSVRSGYRLAHSLSIQDQASSSDSTIWQGWWSSLWKMNVPSKIKFFFWRLSHNRLPTKDNLLKRGMEISNMCVMCQSFLEDCLHVFWDCPMI
        WHF+  G +SV+SGYR+  +L  + + +  +    + ++  +W  +VP K++ F WRL    L   D+L  R M++   C  C+   E   H    CP  
Subjt:  WHFEKNGFFSVRSGYRLAHSLSIQDQASSSDSTIWQGWWSSLWKMNVPSKIKFFFWRLSHNRLPTKDNLLKRGMEISNMCVMCQSFLEDCLHVFWDCPMI

Query:  KSMWCCSKFSFLRQSCFGLSFD------------------SLL--WAI----RDRAFQLALGRDVQQSVALHQ-----SQAVEAAVL--------WVPPA
          +WC     FL    + L  D                  SLL  WAI        F+    R +  ++         S+  +AA +        W PPA
Subjt:  KSMWCCSKFSFLRQSCFGLSFD------------------SLL--WAI----RDRAFQLALGRDVQQSVALHQ-----SQAVEAAVL--------WVPPA

Query:  VNELKLNVDASVRSVSGEAYGGFVLRNERGEVLLTACEILPKCWSVDLAEGWAMLRGIQIAHQMGFSRFHVEIDSLRLSRALNDEVIDISEVGAIMDMVR
         N +K+N D +  S       G + RN  G VL      +       +AE +A +R +  + +MGFS   +E D+L + R +N   +D S VGA +   +
Subjt:  VNELKLNVDASVRSVSGEAYGGFVLRNERGEVLLTACEILPKCWSVDLAEGWAMLRGIQIAHQMGFSRFHVEIDSLRLSRALNDEVIDISEVGAIMDMVR

Query:  SLRQRDSSCRVLFTPRQGN
        SL+   SSC V    R GN
Subjt:  SLRQRDSSCRVLFTPRQGN

A0A2N9I9F4 Reverse transcriptase domain-containing protein4.5e-13637.25Show/hide
Query:  RPISLCNVSYKLISKVLVNRMKFVLNKVISQNQSAFIPGRCVVDNAILGFECIHGLKKRTRGSSRWAALKLDMSKAYDRVEWSFIQQIMLRLGFAQEWVD
        RPISLCNV YKLISKVL NR+K VL KVIS  QSAF+PGR + DN ++ FE +H +  +  G     ALKLDMSKAYDRVEW F++Q+M R+GF + W  
Subjt:  RPISLCNVSYKLISKVLVNRMKFVLNKVISQNQSAFIPGRCVVDNAILGFECIHGLKKRTRGSSRWAALKLDMSKAYDRVEWSFIQQIMLRLGFAQEWVD

Query:  LIMRCVSSVSFSFNLNGEKLGQVVPTRGLQQGDSLSPYLFLLCAEDLSSLLRGVEQRSLISGFQLARSCPSISHLFFADDSLLFFRANAAESSCVRDLLL
        +IM C+S+VS+S  +NGE  G + PTRGL+QGD +SPYLFLLCAE L+ LL+    +  I G  L R  P I++LFFADDSLLF RA   E   ++ +L 
Subjt:  LIMRCVSSVSFSFNLNGEKLGQVVPTRGLQQGDSLSPYLFLLCAEDLSSLLRGVEQRSLISGFQLARSCPSISHLFFADDSLLFFRANAAESSCVRDLLL

Query:  LYERASGQTINYEKSVVAFSPNTGDDCKQYVSLILSVVCKPCHSQYLGLPSFMPRSR------------------------SGGKEVLLKSIVQAIPYYT
        LYE+ASGQ +N EK+ + FS NT    ++ +  IL V     + +YLGLPS + + +                          G+E+L+K++VQAIP YT
Subjt:  LYERASGQTINYEKSVVAFSPNTGDDCKQYVSLILSVVCKPCHSQYLGLPSFMPRSR------------------------SGGKEVLLKSIVQAIPYYT

Query:  MNCFRLPKCLVSEIHRLMANFWWDNPDGGKRIHWLSWKSLCRPKCRGGLGFRDMELFNQSLLAKQCW---------------------------------
        MNCF+LP  L  EI  L+  FWW      ++IHWL W+ LC+PK  GGLGFR+++ FN +LLAKQ W                                 
Subjt:  MNCFRLPKCLVSEIHRLMANFWWDNPDGGKRIHWLSWKSLCRPKCRGGLGFRDMELFNQSLLAKQCW---------------------------------

Query:  -------------LVWADNLHISEGACYGGVIYWLEAVVGHWDGEKIRAHFTVADCDAILRISLGSLLSEDQLIWHFEKNGFFSVRSGYRLAHSLSIQDQ
                      +  D L I+       V   +      W+   I   F   D +AIL+I + +    D+L+WH  ++G FSVRSGY L         
Subjt:  -------------LVWADNLHISEGACYGGVIYWLEAVVGHWDGEKIRAHFTVADCDAILRISLGSLLSEDQLIWHFEKNGFFSVRSGYRLAHSLSIQDQ

Query:  ASSSDSTIWQGWWSSLWKMNVPSKIKFFFWRLSHNRLPTKDNLLKRGMEISNMCVMCQSFLEDCLHVFWDCPMIKSMW-CCSKFSFLRQSCFGLSFDSLL
           S +      W ++W   VP+KIK F WR     LPTK  L +R +     C  C + +ED LH  W CP++   W    +FS  RQ+ F  SF +L 
Subjt:  ASSSDSTIWQGWWSSLWKMNVPSKIKFFFWRLSHNRLPTKDNLLKRGMEISNMCVMCQSFLEDCLHVFWDCPMIKSMW-CCSKFSFLRQSCFGLSFDSLL

Query:  WAIRDRAFQLAL--------GRDVQQSVALHQSQAVEAAVLWVPPAVNELKLNVDASVRSVSGEAYGGFVLRNERGEVLLTACEILPKCWSVDLAEGWAM
          I      +A+         R   Q +  +Q +   A   W PPA N  K N D +    S     G V+R+++G V+ T  + +  C S ++ E  A 
Subjt:  WAIRDRAFQLAL--------GRDVQQSVALHQSQAVEAAVLWVPPAVNELKLNVDASVRSVSGEAYGGFVLRNERGEVLLTACEILPKCWSVDLAEGWAM

Query:  LRGIQIAHQMGFSRFHVEIDSLRLSRALNDEVIDISEVGAIMDMVRSLRQRDSSCRVLFTPRQGN
         R +  A ++G +    E D+  L R LN      +  G I+D ++++ Q      V  T R GN
Subjt:  LRGIQIAHQMGFSRFHVEIDSLRLSRALNDEVIDISEVGAIMDMVRSLRQRDSSCRVLFTPRQGN

A0A803NTN0 Uncharacterized protein2.9e-13535.58Show/hide
Query:  STLNPTRPISLCNVSYKLISKVLVNRMKFVLNKVISQNQSAFIPGRCVVDNAILGFECIHGLKKRTRGSSRWAALKLDMSKAYDRVEWSFIQQIMLRLGF
        S ++  RPISLCNV YKLISK +V R + VL  VIS+ QSAF+  R + DN ++ FE IH L+ +T+G   ++ALKLDMSKA+DRVEW F++ +ML++GF
Subjt:  STLNPTRPISLCNVSYKLISKVLVNRMKFVLNKVISQNQSAFIPGRCVVDNAILGFECIHGLKKRTRGSSRWAALKLDMSKAYDRVEWSFIQQIMLRLGF

Query:  AQEWVDLIMRCVSSVSFSFNLNGEKLGQVVPTRGLQQGDSLSPYLFLLCAEDLSSLLRGVEQRSLISGFQLARSCPSISHLFFADDSLLFFRANAAESSC
          +WV LIM C+++ SFSF+LNGE +G V P RGL+QGD LSPYLFL+C+E  S LL+  E    + G +L R+ PS+SHL FADDSLLF RA    ++ 
Subjt:  AQEWVDLIMRCVSSVSFSFNLNGEKLGQVVPTRGLQQGDSLSPYLFLLCAEDLSSLLRGVEQRSLISGFQLARSCPSISHLFFADDSLLFFRANAAESSC

Query:  VRDLLLLYERASGQTINYEKSVVAFSPNTGDDCKQYVSLILSVVCKPCHSQYLGLPSFMPRSR------------------------SGGKEVLLKSIVQ
        ++ +L  Y +ASGQ +N  KSV++FSPNT    + + +  L +    CH +YLGLPS+  R +                        +GGKEVLLK++VQ
Subjt:  VRDLLLLYERASGQTINYEKSVVAFSPNTGDDCKQYVSLILSVVCKPCHSQYLGLPSFMPRSR------------------------SGGKEVLLKSIVQ

Query:  AIPYYTMNCFRLPKCLVSEIHRLMANFWWDNPDGGKRIHWLSWKSLCRPKCRGGLGFRDMELFNQSLLAKQCWLV----------------WADN----L
        +IP Y M+CF+L K   S++  +MANFWW     G +IHW  WK+LC+ K  GG+GFR    FNQ+LLAKQ W +                +++N     
Subjt:  AIPYYTMNCFRLPKCLVSEIHRLMANFWWDNPDGGKRIHWLSWKSLCRPKCRGGLGFRDMELFNQSLLAKQCWLV----------------WADN----L

Query:  HIS-------EGACYG------GVIY--------------WLEAVVG----------------------HWDGEKIRAHFTVADCDAILRISLGSLLSED
        HI        +  C+G      GV +              W+ +                          W+ + + ++F   D + IL I L     +D
Subjt:  HIS-------EGACYG------GVIY--------------WLEAVVG----------------------HWDGEKIRAHFTVADCDAILRISLGSLLSED

Query:  QLIWHFEKNGFFSVRSGYRLAHSLSIQDQASSSDSTIWQGWWSSLWKMNVPSKIKFFFWRLSHNRLPTKDNLLKRGMEISNMCVMCQSFLEDCLHVFWDC
        +LIWH   +G ++V+SG+ LA +L  Q+ +S+SD    +GWW   W +N+P KI+ F W++ +N LP    L K+ +  S  C +C S  E   H  + C
Subjt:  QLIWHFEKNGFFSVRSGYRLAHSLSIQDQASSSDSTIWQGWWSSLWKMNVPSKIKFFFWRLSHNRLPTKDNLLKRGMEISNMCVMCQSFLEDCLHVFWDC

Query:  PMIKSMWCCSKFSFLRQSCFGLSFDSLLWAIRDRAFQLALGRDVQQSVALHQSQAVEAA------------------VLWVPPAVNELKLNVDASVRSVS
           +++W  SKFS        ++   L  +I   AF   L +D  ++    ++    A                   V W PP VN  KLNVDA+  S  
Subjt:  PMIKSMWCCSKFSFLRQSCFGLSFDSLLWAIRDRAFQLALGRDVQQSVALHQSQAVEAA------------------VLWVPPAVNELKLNVDASVRSVS

Query:  GEAYGGFVLRNERGEVLLTACEILPKCWSVDLAEGWAMLRGIQIAHQMGFSRFHVEIDSLRLSRALNDEVIDISEVGAIMDMVRSLRQRDSSCRVLFTPR
         +   G VLR+  G V+    ++    +  D  E  A+   +  A Q      H+E D+LR+S ALN   +D+S    ++  VR L     +  V    R
Subjt:  GEAYGGFVLRNERGEVLLTACEILPKCWSVDLAEGWAMLRGIQIAHQMGFSRFHVEIDSLRLSRALNDEVIDISEVGAIMDMVRSLRQRDSSCRVLFTPR

Query:  QGNMGCQKTADATVG
          N      A   +G
Subjt:  QGNMGCQKTADATVG

A0A803QGT2 Uncharacterized protein2.4e-13437.19Show/hide
Query:  RPISLCNVSYKLISKVLVNRMKFVLNKVISQNQSAFIPGRCVVDNAILGFECIHGLKKRTRGSSRWAALKLDMSKAYDRVEWSFIQQIMLRLGFAQEWVD
        RPISLCNV YK+ISK+L  R+K VL+ VIS+ QSAF+  R + DN ++ FE IH LK R RGS  +AALK DMSKA+DRVEWSFI  +M ++GF   W+ 
Subjt:  RPISLCNVSYKLISKVLVNRMKFVLNKVISQNQSAFIPGRCVVDNAILGFECIHGLKKRTRGSSRWAALKLDMSKAYDRVEWSFIQQIMLRLGFAQEWVD

Query:  LIMRCVSSVSFSFNLNGEKLGQVVPTRGLQQGDSLSPYLFLLCAEDLSSLLRGVEQRSLISGFQLARSCPSISHLFFADDSLLFFRANAAESSCVRDLLL
        LIM C+ +  FSFN+NGE +G V P RGL+QGD LSPYLFL+C+E LS LL+  EQ   + G  ++R  PSISHLFFADDSLLF +AN      ++  L 
Subjt:  LIMRCVSSVSFSFNLNGEKLGQVVPTRGLQQGDSLSPYLFLLCAEDLSSLLRGVEQRSLISGFQLARSCPSISHLFFADDSLLFFRANAAESSCVRDLLL

Query:  LYERASGQTINYEKSVVAFSPNTGDDCKQYVSLILSVVCKPCHSQYLGLPSFMPRSRS------------------------GGKEVLLKSIVQAIPYYT
        +Y RASGQ +N +KSV++FSPNT    +     IL +    CH  YLGLP++  R +S                        GGKEVLLK++VQ+IP Y 
Subjt:  LYERASGQTINYEKSVVAFSPNTGDDCKQYVSLILSVVCKPCHSQYLGLPSFMPRSRS------------------------GGKEVLLKSIVQAIPYYT

Query:  MNCFRLPKCLVSEIHRLMANFWWDNPDGGKRIHWLSWKSLCRPKCRGGLGFRDMELFNQSLLAKQCWLVWADNLHISEGACYG-----------------
        M+CFRLP  L +EI  +MA FWW +    K+IHW  W+ LC+ K  GG+GFR    FNQ+LLAKQ W ++ D   +      G                 
Subjt:  MNCFRLPKCLVSEIHRLMANFWWDNPDGGKRIHWLSWKSLCRPKCRGGLGFRDMELFNQSLLAKQCWLVWADNLHISEGACYG-----------------

Query:  -----GVIYWLEAVV----------------------GH-------------------------WDGEKIRAHFTVADCDAILRISLGSLLSEDQLIWHF
             G+++  E +V                      GH                         W+ E +++ F+  D D IL+I L  L   D+ IWH+
Subjt:  -----GVIYWLEAVV----------------------GH-------------------------WDGEKIRAHFTVADCDAILRISLGSLLSEDQLIWHF

Query:  EKNGFFSVRSGYRLAHSLSIQDQASSSDSTIWQGWWSSLWKMNVPSKIKFFFWRLSHNRLPTKDNLLKRGMEISNMCVMCQSFLEDCLHVFWDCPMIKSM
        E +G +SV SGY LA SL  +D +S S +   + WW S WK+N+PSK+K F W++  + +P   +L  R +  S  C +CQS  E   H  + C   K +
Subjt:  EKNGFFSVRSGYRLAHSLSIQDQASSSDSTIWQGWWSSLWKMNVPSKIKFFFWRLSHNRLPTKDNLLKRGMEISNMCVMCQSFLEDCLHVFWDCPMIKSM

Query:  WCCSKFSF-------LRQSCFGLSFDS------------LLWAIRDRAFQLALGRDVQ-------QSVA-LHQSQAVEAAVL--------------WVPP
        W  S FS        L+   + +   S            L+W I         G+ V+       QSVA + Q +++ +AV               W PP
Subjt:  WCCSKFSF-------LRQSCFGLSFDS------------LLWAIRDRAFQLALGRDVQ-------QSVA-LHQSQAVEAAVL--------------WVPP

Query:  AVNELKLNVDASVRSVSGEAYGGFVLRNERGEVLLTACEILPKCWSVDLAEGWAMLRGIQIAHQMGFSRFHVEIDSLRLSRALNDEVIDISEVGAIMDMV
          N  KLNVDA++ S   +   G ++RN  G+V           +     E  AM  G+  A         VE D L L  ALN    D+S+     D+V
Subjt:  AVNELKLNVDASVRSVSGEAYGGFVLRNERGEVLLTACEILPKCWSVDLAEGWAMLRGIQIAHQMGFSRFHVEIDSLRLSRALNDEVIDISEVGAIMDMV

Query:  RSLR
          ++
Subjt:  RSLR

A0A803QHU6 Uncharacterized protein4.9e-13535.3Show/hide
Query:  STLNPTRPISLCNVSYKLISKVLVNRMKFVLNKVISQNQSAFIPGRCVVDNAILGFECIHGLKKRTRGSSRWAALKLDMSKAYDRVEWSFIQQIMLRLGF
        S++   RPISLCNV YKLISK +V R K VL  VIS+ QSAF+  R + DN ++ FE IH L+ +T+G   ++ LKLDMSKA+DRVEW ++Q IML++GF
Subjt:  STLNPTRPISLCNVSYKLISKVLVNRMKFVLNKVISQNQSAFIPGRCVVDNAILGFECIHGLKKRTRGSSRWAALKLDMSKAYDRVEWSFIQQIMLRLGF

Query:  AQEWVDLIMRCVSSVSFSFNLNGEKLGQVVPTRGLQQGDSLSPYLFLLCAEDLSSLLRGVEQRSLISGFQLARSCPSISHLFFADDSLLFFRANAAESSC
           W  LIMRC+++ SFSF+LNGE +GQV P+RGL QGD LSPYLFL+C+E LS LL+  EQ   ++G +L R+ PS+SHL FADDSLLF ++N   +  
Subjt:  AQEWVDLIMRCVSSVSFSFNLNGEKLGQVVPTRGLQQGDSLSPYLFLLCAEDLSSLLRGVEQRSLISGFQLARSCPSISHLFFADDSLLFFRANAAESSC

Query:  VRDLLLLYERASGQTINYEKSVVAFSPNTGDDCKQYVSLILSVVCKPCHSQYLGLPSFMPRSRS------------------------GGKEVLLKSIVQ
        ++  L  Y +ASGQ +N +KSV++FSPNT    + + S  L +    CH +YLGLPS+  R +S                        GGKEVLLK++VQ
Subjt:  VRDLLLLYERASGQTINYEKSVVAFSPNTGDDCKQYVSLILSVVCKPCHSQYLGLPSFMPRSRS------------------------GGKEVLLKSIVQ

Query:  AIPYYTMNCFRLPKCLVSEIHRLMANFWWDNPDGGKRIHWLSWKSLCRPKCRGGLGFRDMELFNQSLLAKQCWLVW--ADNL------------------
        +IP Y M+CFRL K   +++  +MANFWW +   G +IHW  WKSLC+ K  GG+GFR    FNQ+LLAKQ W ++   D+L                  
Subjt:  AIPYYTMNCFRLPKCLVSEIHRLMANFWWDNPDGGKRIHWLSWKSLCRPKCRGGLGFRDMELFNQSLLAKQCWLVW--ADNL------------------

Query:  ---HIS----EGACYGG--VIYWLEAVVGH----------------------------------------WDGEKIRAHFTVADCDAILRISLGSLLSED
           H      +  C+G   ++  L   VG+                                        W+   +  +F   D D IL I L      D
Subjt:  ---HIS----EGACYGG--VIYWLEAVVGH----------------------------------------WDGEKIRAHFTVADCDAILRISLGSLLSED

Query:  QLIWHFEKNGFFSVRSGYRLAHSLSIQDQASSSDSTIWQGWWSSLWKMNVPSKIKFFFWRLSHNRLPTKDNLLKRGMEISNMCVMCQSFLEDCLHVFWDC
        +LIWHF  NG ++V+SG+ LA SL  ++Q  SS S   + WW   W +N+P KI+ F W++ HN LPT   L K+ +  S  C +C S  E   H  + C
Subjt:  QLIWHFEKNGFFSVRSGYRLAHSLSIQDQASSSDSTIWQGWWSSLWKMNVPSKIKFFFWRLSHNRLPTKDNLLKRGMEISNMCVMCQSFLEDCLHVFWDC

Query:  PMIKSMWCCSKF--------------------SFLRQSCFGLSFDSL--LWAIRDRAFQLALGR----DVQQSVALHQ----------------------
           K++W  S F                    + L Q  F L    L  +W  R++       R     +  ++  H+                      
Subjt:  PMIKSMWCCSKF--------------------SFLRQSCFGLSFDSL--LWAIRDRAFQLALGR----DVQQSVALHQ----------------------

Query:  --SQAVEAAVLWVPPAVNELKLNVDASVRSVSGEAYGGFVLRNERGEVLLTACEILPKCWSVDLAEGWAMLRGIQIAHQMGFSRFHVEIDSLRLSRALND
          S  ++    W PP  N  KLNVDA+      +   G ++R+  G V+    ++    +  D  E  A+   +  A Q   +  H+E D+LR+S ALN 
Subjt:  --SQAVEAAVLWVPPAVNELKLNVDASVRSVSGEAYGGFVLRNERGEVLLTACEILPKCWSVDLAEGWAMLRGIQIAHQMGFSRFHVEIDSLRLSRALND

Query:  EVIDISEVGAIMDMVRSLRQRDSSCRVLFTPRQGNMGCQKTADATVG
           D+S    I+  VR L        V    R  N      A  ++G
Subjt:  EVIDISEVGAIMDMVRSLRQRDSSCRVLFTPRQGNMGCQKTADATVG

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein6.2e-1827.31Show/hide
Query:  RPISLCNVSYKLISKVLVNRMKFVLNKVISQNQSAFIPGRCVVDNAILGFECIHGLKKRTRGSSRWAALKLDMSKAYDRVEWSFIQQIMLRLGFAQEWVD
        RPISL N+  K+++K+L NR++  + K+I  +Q  FIPG     N       I  + +          + +D  KA+D+++  F+ + + +LG    ++ 
Subjt:  RPISLCNVSYKLISKVLVNRMKFVLNKVISQNQSAFIPGRCVVDNAILGFECIHGLKKRTRGSSRWAALKLDMSKAYDRVEWSFIQQIMLRLGFAQEWVD

Query:  LIMRCVSSVSFSFNLNGEKLGQVVPTRGLQQGDSLSPYLFLLCAEDLSSLLRGVEQRSLISGFQLARSCPSISHLFFADDSLLFFRANAAESSCVRDLLL
        +I       + +  LNG+KL       G +QG  LSP LF +  E L+   R + Q   I G QL +    +S   FADD +++       +  +  L+ 
Subjt:  LIMRCVSSVSFSFNLNGEKLGQVVPTRGLQQGDSLSPYLFLLCAEDLSSLLRGVEQRSLISGFQLARSCPSISHLFFADDSLLFFRANAAESSCVRDLLL

Query:  LYERASGQTINYEKSVVAFSPNTGDDCKQYVSLILSVVCKPCHSQYLGL
         + + SG  IN +KS  AF  N     +  +   L         +YLG+
Subjt:  LYERASGQTINYEKSVVAFSPNTGDDCKQYVSLILSVVCKPCHSQYLGL

P08548 LINE-1 reverse transcriptase homolog3.9e-2023.82Show/hide
Query:  RPISLCNVSYKLISKVLVNRMKFVLNKVISQNQSAFIPGRCVVDNAILGFECIHGLKKRTRGSSRWAALKLDMSKAYDRVEWSFIQQIMLRLGFAQEWVD
        RPISL N+  K+++K+L NR++  + K+I  +Q  FIPG     N       I  + K    +     L +D  KA+D ++  F+ + + ++G    ++ 
Subjt:  RPISLCNVSYKLISKVLVNRMKFVLNKVISQNQSAFIPGRCVVDNAILGFECIHGLKKRTRGSSRWAALKLDMSKAYDRVEWSFIQQIMLRLGFAQEWVD

Query:  LIMRCVSSVSFSFNLNGEKLGQVVPTRGLQQGDSLSPYLFLLCAEDLSSLLRGVEQRSLISGFQLARSCPSISHLFFADDSLLFFRANAAESSCVRDLLL
        LI    S  + +  LNG KL       G +QG  LSP LF +  E L+  +R   +   I G  +      I    FADD +++       ++ + +++ 
Subjt:  LIMRCVSSVSFSFNLNGEKLGQVVPTRGLQQGDSLSPYLFLLCAEDLSSLLRGVEQRSLISGFQLARSCPSISHLFFADDSLLFFRANAAESSCVRDLLL

Query:  LYERASGQTINYEKSVVAFSPNTGDDCKQYVSLILSVVCKPCHSQYLGLP---------------------------SFMPRSRSGGKEVLLKSIV-QAI
         Y   SG  IN  KS VAF     +  ++ V   +     P   +YLG+                              +P S  G   ++  SI+ +AI
Subjt:  LYERASGQTINYEKSVVAFSPNTGDDCKQYVSLILSVVCKPCHSQYLGLP---------------------------SFMPRSRSGGKEVLLKSIV-QAI

Query:  PYYTMNCFRLPKCLVSEIHRLMANFWWD--NPDGGKRIHWLSWKSLCRPKCRGGLGFRDMELFNQSLLAKQCWLVWADNLHI
          +     + P     ++ +++ +F W+   P   K +       L      GG+   D+ L+ +S++ K  W  W  N  +
Subjt:  PYYTMNCFRLPKCLVSEIHRLMANFWWD--NPDGGKRIHWLSWKSLCRPKCRGGLGFRDMELFNQSLLAKQCWLVWADNLHI

P0C2F6 Putative ribonuclease H protein At1g657503.7e-3122.63Show/hide
Query:  GKEVLLKSIVQAIPYYTMNCFRLPKCLVSEIHRLMANFWWDNPDGGKRIHWLSWKSLCRPKCRGGLGFRDMELFNQSLLAKQCW-----------LVWAD
        G+  L K+++ ++P ++M+   LP+ +++ + +L   F W +    K+ H + W  +C PK  GGLG R  +  N++L++K  W           LV   
Subjt:  GKEVLLKSIVQAIPYYTMNCFRLPKCLVSEIHRLMANFWWDNPDGGKRIHWLSWKSLCRPKCRGGLGFRDMELFNQSLLAKQCW-----------LVWAD

Query:  NLHISE-------------GACYGGVIYWLEAVVGH---W---DGEKIRAHF----------------TVADCDAIL-----------------------
          H+ E              + +  +   L  VV H   W   DG++IR                      DCD ++                       
Subjt:  NLHISE-------------GACYGGVIYWLEAVVGH---W---DGEKIRAHF----------------TVADCDAIL-----------------------

Query:  -RISLGSLL------SEDQLIWHFEKNGFFSVRSGYRLAHSLSIQDQASSSDSTIWQGWWSSLWKMNVPSKIKFFFWRLSHNRLPTKDNLLKRGMEISNM
         R+ L +++      + D+L W F ++G FSVRS Y +   L++ +    + ++    +++ LWK+ VP ++K F W + +  + T++   +R +  SN+
Subjt:  -RISLGSLL------SEDQLIWHFEKNGFFSVRSGYRLAHSLSIQDQASSSDSTIWQGWWSSLWKMNVPSKIKFFFWRLSHNRLPTKDNLLKRGMEISNM

Query:  CVMCQSFLEDCLHVFWDCPMIKSMWC-----CSKFSFLRQSCFGLSFDSL------------------LW---------------AIRDR-------AFQ
        C +C+  +E  LHV  DCP    +W        +  F  +S F   +D+L                  +W                 RDR       A +
Subjt:  CVMCQSFLEDCLHVFWDCPMIKSMWC-----CSKFSFLRQSCFGLSFDSL------------------LW---------------AIRDR-------AFQ

Query:  LALGRDVQQSVALHQSQAVEAAVLWVPPAVNELKLNVDASVRSVSGEAYGGFVLRNERGEVLLTACEILPKCWSVDLAEGWAMLRGIQIAHQMGFSRFHV
        +         V + Q + VE  + WV P V  +K+N D + R   G A  G VLR+  G         + +C S   AE W +  G+  A +    R  +
Subjt:  LALGRDVQQSVALHQSQAVEAAVLWVPPAVNELKLNVDASVRSVSGEAYGGFVLRNERGEVLLTACEILPKCWSVDLAEGWAMLRGIQIAHQMGFSRFHV

Query:  EIDSLRLSRALNDEVIDISEVGAIMDMVRSLRQRDSSCRVLFTPRQGN
        E+DS  +   L   + D   +  ++ +     Q+D   R++   R+ N
Subjt:  EIDSLRLSRALNDEVIDISEVGAIMDMVRSLRQRDSSCRVLFTPRQGN

P11369 LINE-1 retrotransposable element ORF2 protein5.4e-2224.62Show/hide
Query:  TLTHADQRIISTLNPTRPISLCNVSYKLISKVLVNRMKFVLNKVISQNQSAFIPGRCVVDNAILGFECIHGLKKRTRGSSRWAALKLDMSKAYDRVEWSF
        TL    Q+  + +   RPISL N+  K+++K+L NR++  +  +I  +Q  FIPG     N       IH + K          + LD  KA+D+++  F
Subjt:  TLTHADQRIISTLNPTRPISLCNVSYKLISKVLVNRMKFVLNKVISQNQSAFIPGRCVVDNAILGFECIHGLKKRTRGSSRWAALKLDMSKAYDRVEWSF

Query:  IQQIMLRLGFAQEWVDLIMRCVSSVSFSFNLNGEKLGQVVPTRGLQQGDSLSPYLFLLCAEDLSSLLRGVEQRSLISGFQLARSCPSISHLFFADDSLLF
        + +++ R G    ++++I    S    +  +NGEKL  +    G +QG  LSPYLF +  E L+   R + Q+  I G Q+ +    IS L  ADD +++
Subjt:  IQQIMLRLGFAQEWVDLIMRCVSSVSFSFNLNGEKLGQVVPTRGLQQGDSLSPYLFLLCAEDLSSLLRGVEQRSLISGFQLARSCPSISHLFFADDSLLF

Query:  FRANAAESSCVRDLLLLYERASGQTINYEKSVV-AFSPNTGDDCKQYVSLILSVVCKPCHSQYLGLP---------------------------SFMPRS
               +  + +L+  +    G  IN  KS+   ++ N   + +   +   S+V    + +YLG+                              +P S
Subjt:  FRANAAESSCVRDLLLLYERASGQTINYEKSVV-AFSPNTGDDCKQYVSLILSVVCKPCHSQYLGLP---------------------------SFMPRS

Query:  RSGGKEVLLKSIV-QAIPYYTMNCFRLPKCLVSEIHRLMANFWWDNPDGGKRIHWLSWKSLCRPK-CRGGLGFRDMELFNQSLLAKQCWLVWAD
          G   ++  +I+ +AI  +     ++P    +E+   +  F W+N     RI     KSL + K   GG+   D++L+ ++++ K  W  + D
Subjt:  RSGGKEVLLKSIV-QAIPYYTMNCFRLPKCLVSEIHRLMANFWWDNPDGGKRIHWLSWKSLCRPK-CRGGLGFRDMELFNQSLLAKQCWLVWAD

P14381 Transposon TX1 uncharacterized 149 kDa protein1.3e-1825.07Show/hide
Query:  RPISLCNVSYKLISKVLVNRMKFVLNKVISQNQSAFIPGRCVVDNAILGFECIHGLKKRTRGSSRWAALKLDMSKAYDRVEWSFIQQIMLRLGFAQEWVD
        RP+SL +  YK+++K +  R+K VL +VI  +QS  +PGR + DN  L  + +H  +   R     A L LD  KA+DRV+  ++   +    F  ++V 
Subjt:  RPISLCNVSYKLISKVLVNRMKFVLNKVISQNQSAFIPGRCVVDNAILGFECIHGLKKRTRGSSRWAALKLDMSKAYDRVEWSFIQQIMLRLGFAQEWVD

Query:  LIMRCVSSVSFSFNLNGEKLGQVVPTRGLQQGDSLSPYLFLLCAEDLSSLLRGVEQRSLISGFQLARSCPSISHLFFADDSLLFFRANAAESSCVRDLLL
         +    +S      +N      +   RG++QG  LS  L+ L  E    LL     R  ++G  L      +    +ADD +L  + +  +    ++   
Subjt:  LIMRCVSSVSFSFNLNGEKLGQVVPTRGLQQGDSLSPYLFLLCAEDLSSLLRGVEQRSLISGFQLARSCPSISHLFFADDSLLFFRANAAESSCVRDLLL

Query:  LYERASGQTINYEKS--------VVAFSPNTGDDCK------QYVSLILSVVCKPCHSQYLGL-----------PSFMPRSRSGGKEVLLKSIVQAIPYY
        +Y  AS   IN+ KS         V F P    D        +Y+ + LS    P    ++ L             F       G+ +++  +V +  +Y
Subjt:  LYERASGQTINYEKS--------VVAFSPNTGDDCK------QYVSLILSVVCKPCHSQYLGL-----------PSFMPRSRSGGKEVLLKSIVQAIPYY

Query:  TMNCFRLPKCLVSEIHRLMANFWWDNPDGGKRIHWLSWKSLCRPKCRGGLG
         + C    +  +++I R + +F W     GK  HW+S      P   GG G
Subjt:  TMNCFRLPKCLVSEIHRLMANFWWDNPDGGKRIHWLSWKSLCRPKCRGGLG

Arabidopsis top hitse value%identityAlignment
AT3G09510.1 Ribonuclease H-like superfamily protein2.0e-1623.64Show/hide
Query:  WDGEKIRAHFTVADCDAILRISLGSLLSEDQLIWHFEKNGFFSVRSGY-RLAHSLS-----IQDQASSSDSTIWQGWWSSLWKMNVPSKIKFFFWRLSHN
        WD  KI      +D   I RI L      D++IW++   G ++VRSGY  L H  S     I     S D        + +W + +  K+K F WR    
Subjt:  WDGEKIRAHFTVADCDAILRISLGSLLSEDQLIWHFEKNGFFSVRSGY-RLAHSLS-----IQDQASSSDSTIWQGWWSSLWKMNVPSKIKFFFWRLSHN

Query:  RLPTKDNLLKRGMEISNMCVMCQSFLEDCLHVFWDCPMIKSMWCCSKFSFLRQSCFGLSFDS------------------------LLWAI---------
         L T + L  RGM I   C  C    E   H  + CP     W  S  S +R       F+                         L+W I         
Subjt:  RLPTKDNLLKRGMEISNMCVMCQSFLEDCLHVFWDCPMIKSMWCCSKFSFLRQSCFGLSFDS------------------------LLWAI---------

Query:  ---RDRAFQLALGRDVQQSVALHQSQA-----------VEAAVLWVPPAVNELKLNVDASVRSVSGEAYGGFVLRNERGEVLLTACEILPKCWSVDLAEG
           R+   +  L    +    L+ +Q+            E  + W  P    +K N DA       EA GG+++RN  G  +      L    +   AE 
Subjt:  ---RDRAFQLALGRDVQQSVALHQSQA-----------VEAAVLWVPPAVNELKLNVDASVRSVSGEAYGGFVLRNERGEVLLTACEILPKCWSVDLAEG

Query:  WAMLRGIQIAHQMGFSRFHVEIDSLRLSRALNDEVIDISEVGAIMDMVRSLRQRDSSCRVLFTPRQGN
         A+L  +Q     G+++  +E D   L   +N      S    + D +     + +S +  F  R+GN
Subjt:  WAMLRGIQIAHQMGFSRFHVEIDSLRLSRALNDEVIDISEVGAIMDMVRSLRQRDSSCRVLFTPRQGN

AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein3.0e-1522.29Show/hide
Query:  GKEVLLKSIVQAIPYYTMNCFRLPKCLVSEIHRLMANFWWDNPDGGKRIHWLSWKSLCRPKCRGGLGFRDMELFNQ----SLLAKQCWLVWADNLHISEG
        G+  L+ S++ ++  + M+ FRLP   + EI  + ++F W  P+   +   ++W  +C PK  GGLG R ++  N+    S+        W     +   
Subjt:  GKEVLLKSIVQAIPYYTMNCFRLPKCLVSEIHRLMANFWWDNPDGGKRIHWLSWKSLCRPKCRGGLGFRDMELFNQ----SLLAKQCWLVWADNLHISEG

Query:  ACYGGVI-----------YWLE---------AVVGHWD--GEKIRAHFTVADC-----------DAILRIS--------LGSLLSEDQLIWHFEKNGF--
        A   G +           +W +          V GH       I  H +VA+            D +LRI          G    ED + W    + F  
Subjt:  ACYGGVI-----------YWLE---------AVVGHWD--GEKIRAHFTVADC-----------DAILRIS--------LGSLLSEDQLIWHFEKNGF--

Query:  -FSVRSGYRLAHSLSIQDQASSSDSTIWQGWWSSLWKMNVPSKIKFFFWRLSHNRLPTKDNLLKRGMEISNMCVMCQSFLEDCLHVFWDCPMIKSMWCCS
         F+ +  +           A++ +  +   W+  +W  +   K     W    NRL T D +L       + CV+C   +E   H+F+ CP        +
Subjt:  -FSVRSGYRLAHSLSIQDQASSSDSTIWQGWWSSLWKMNVPSKIKFFFWRLSHNRLPTKDNLLKRGMEISNMCVMCQSFLEDCLHVFWDCPMIKSMWCCS

Query:  KFSFLRQSCFGLSFDSLLWAIRD
        +  FL +  F L+  S LW  R+
Subjt:  KFSFLRQSCFGLSFDSLLWAIRD

AT4G20520.1 RNA binding;RNA-directed DNA polymerases7.3e-1429.05Show/hide
Query:  LVNRMKFVLNKVISQNQSAFIPGRCVVDNAILGFECIHGLKKRTRGSSRWAALKLDMSKAYDRVEWSFIQQIMLRLGFAQEWVDLIMRCVSSVSFSFNLN
        +V R+K ++  +I   Q++FIPGR   DN +   E +H + +R +G   W  LKLD+ KAYDR+ W +++  ++  GF + W+  I R     +F     
Subjt:  LVNRMKFVLNKVISQNQSAFIPGRCVVDNAILGFECIHGLKKRTRGSSRWAALKLDMSKAYDRVEWSFIQQIMLRLGFAQEWVDLIMRCVSSVSFSFNLN

Query:  GEKLGQVVPTR---------GLQQGDSLSPYL--FLLCAEDLSSLLRG
          ++G+   ++         G +  D  +P+    + CAE L  + RG
Subjt:  GEKLGQVVPTR---------GLQQGDSLSPYL--FLLCAEDLSSLLRG

AT4G29090.1 Ribonuclease H-like superfamily protein5.9e-3225.27Show/hide
Query:  AIPYYTMNCFRLPKCLVSEIHRLMANFWWDNPDGGKRIHWLSWKSLCRPKCRGGLGFRDMELFNQSLLAKQCW---------------------------
        A+P YTM CF LPK +  +I  ++A+FWW N    K +HW +W  L   K  GG+GF+D+E FN +LL KQ W                           
Subjt:  AIPYYTMNCFRLPKCLVSEIHRLMANFWWDNPDGGKRIHWLSWKSLCRPKCRGGLGFRDMELFNQSLLAKQCW---------------------------

Query:  -------LVWADNLHISEGACYGGVIYWLEAVVGHWDGEKIRAH--FTVADCDAILRI---------SLGSLLSEDQLI------W----------HFEK
                VW  ++H S+     G      AVVG+ +   I  H         A LR+         S+ S+L    LI      W            E+
Subjt:  -------LVWADNLHISEGACYGGVIYWLEAVVGHWDGEKIRAH--FTVADCDAILRI---------SLGSLLSEDQLI------W----------HFEK

Query:  NGFFSVRSGYRLAHSLSIQDQASSSDSTIWQGWW------------------------SSLWKMNVPSKIKFFFWRLSHNRLPTKDNLLKRGMEISNMCV
             +R G R        D  SS D T+  G+W                          +WK     KI+ F W+   N LP    L  R +   + C+
Subjt:  NGFFSVRSGYRLAHSLSIQDQASSSDSTIWQGWW------------------------SSLWKMNVPSKIKFFFWRLSHNRLPTKDNLLKRGMEISNMCV

Query:  MCQSFLEDCLHVFWDCPMIKSMWCCSKFSF-LRQSCFGLSFDSLLWAIR--------DRAFQLA----------------LGRDVQQSVALHQSQ-----
         C S  E   H+ + C   +  W  S     L        + +L W           ++A QL                  GR+      L +++     
Subjt:  MCQSFLEDCLHVFWDCPMIKSMWCCSKFSF-LRQSCFGLSFDSLLWAIR--------DRAFQLA----------------LGRDVQQSVALHQSQ-----

Query:  ---AVEAAVL-------------WVPPAVNELKLNVDASVRSVSGEAYGGFVLRNERGEVLLTACEILPKCWSVDLAEGWAMLRGIQIAHQMGFSRFHVE
             EA                W PP    +K N DA+    +     G+VLRNE+GEV       LPK  SV  AE  AM   +    +  ++    E
Subjt:  ---AVEAAVL-------------WVPPAVNELKLNVDASVRSVSGEAYGGFVLRNERGEVLLTACEILPKCWSVDLAEGWAMLRGIQIAHQMGFSRFHVE

Query:  IDSLRLSRALNDEVIDISEVGAIMDMVRSLRQRDSSCRVLFTPRQGNMGCQKTA
         DS  L   LN++ I  S    I D+ R L Q  +  + +F PR+GN   ++ A
Subjt:  IDSLRLSRALNDEVIDISEVGAIMDMVRSLRQRDSSCRVLFTPRQGNMGCQKTA

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein3.5e-1646.05Show/hide
Query:  AIPYYTMNCFRLPKCLVSEIHRLMANFWWDNPDGGKRIHWLSWKSLCRPK-CRGGLGFRDMELFNQSLLAKQCWLV
        A+P Y M+CFRL K L  ++   M  FWW + +  ++I W++W+ LC+ K   GGLGFRD+  FNQ+LLAKQ + +
Subjt:  AIPYYTMNCFRLPKCLVSEIHRLMANFWWDNPDGGKRIHWLSWKSLCRPK-CRGGLGFRDMELFNQSLLAKQCWLV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTAGGAGTGAATTCCGTCTTGCGAAACTATGTCCCCAGCTATCTATTCGATTTTATCCCCAAAATGGTAGGCTTGTTGAGTCGGCGATGCTGGCCACTCTCACCCA
TGCAGATCAAAGGATAATCTCGACATTAAACCCAACAAGGCCTATATCTCTGTGTAATGTCAGCTACAAGTTGATTTCTAAGGTTCTGGTGAACCGTATGAAGTTTGTTT
TGAACAAGGTTATATCTCAAAATCAGAGTGCTTTCATACCAGGGCGATGTGTGGTTGACAACGCCATTTTGGGCTTTGAGTGTATTCATGGTCTGAAGAAACGGACAAGG
GGTTCTTCTAGATGGGCAGCCCTGAAACTGGATATGAGTAAGGCCTACGACAGAGTAGAGTGGTCATTCATTCAGCAGATCATGCTTCGTTTGGGGTTTGCTCAGGAGTG
GGTGGATCTGATCATGAGATGTGTGAGCTCAGTTTCTTTCTCGTTCAACCTAAATGGGGAGAAATTGGGTCAGGTGGTCCCAACTAGGGGTCTCCAACAGGGGGATTCTT
TGTCACCTTACCTTTTCCTTCTGTGTGCAGAAGATTTGTCCAGCTTGCTTCGTGGGGTAGAGCAAAGGTCTCTTATCTCAGGTTTTCAGCTTGCGCGATCCTGCCCATCA
ATTTCGCACCTGTTTTTCGCTGACGATAGTTTGCTTTTCTTCCGAGCAAATGCTGCAGAGAGTAGTTGTGTTCGGGATCTCCTGTTGCTATACGAGCGTGCCTCTGGTCA
GACTATAAATTATGAAAAGTCTGTTGTGGCTTTCAGCCCAAATACAGGGGATGATTGTAAGCAGTATGTTAGCCTAATCCTATCTGTTGTATGTAAGCCATGCCATAGCC
AGTACCTGGGCCTTCCCTCTTTTATGCCTCGTAGTCGATCAGGGGGTAAAGAGGTGCTGCTTAAGTCGATTGTGCAAGCTATTCCCTATTACACGATGAACTGCTTCAGA
CTCCCAAAATGCCTGGTTAGTGAGATTCATCGTCTCATGGCGAACTTTTGGTGGGATAATCCAGATGGGGGGAAGAGAATTCACTGGCTGAGTTGGAAGTCCCTTTGTCG
TCCAAAGTGTCGTGGTGGCTTGGGTTTCAGAGATATGGAATTGTTTAACCAGTCGCTCTTGGCAAAACAGTGTTGGCTGGTTTGGGCCGACAACCTTCATATATCTGAAG
GAGCTTGCTATGGGGGCGTGATTTACTGGCTAGAGGCTGTTGTTGGTCATTGGGATGGGGAGAAAATACGTGCCCATTTTACTGTGGCAGACTGTGATGCTATTTTGAGG
ATCTCGCTTGGGAGTCTCTTATCTGAAGATCAGTTGATATGGCATTTCGAAAAGAATGGGTTTTTCTCTGTTAGAAGTGGATACCGCTTGGCCCATAGTTTGAGTATACA
AGATCAGGCGTCATCGTCAGACTCGACTATATGGCAGGGGTGGTGGTCTAGCCTCTGGAAGATGAATGTCCCGAGCAAGATCAAATTCTTTTTTTGGAGGTTATCTCACA
ACCGGCTTCCCACCAAGGATAATCTTCTTAAAAGAGGTATGGAAATTTCTAATATGTGTGTGATGTGCCAGTCTTTTCTTGAGGATTGTTTGCATGTCTTTTGGGATTGT
CCTATGATCAAATCTATGTGGTGTTGCTCAAAATTCTCTTTTTTACGTCAATCTTGTTTTGGGTTGAGTTTTGACTCGCTGCTGTGGGCGATAAGAGATAGGGCGTTTCA
GTTGGCTTTGGGGAGAGATGTCCAGCAGTCGGTGGCGTTGCATCAATCGCAGGCGGTGGAGGCTGCTGTCTTGTGGGTTCCGCCTGCTGTAAATGAGCTTAAATTGAATG
TGGATGCGTCGGTGAGGTCGGTTTCGGGGGAGGCTTACGGTGGGTTTGTCCTGAGGAATGAGAGAGGGGAGGTCCTATTGACGGCGTGCGAAATTCTGCCAAAATGTTGG
AGTGTCGATTTGGCCGAGGGATGGGCAATGTTGAGAGGTATCCAAATAGCCCACCAAATGGGCTTTTCCAGGTTCCATGTTGAGATTGACTCGCTGAGGCTTTCAAGAGC
TTTAAATGATGAGGTGATTGATATTTCGGAAGTAGGAGCAATTATGGACATGGTTCGCAGCTTGCGCCAGCGTGATTCTTCATGTAGGGTGTTGTTTACTCCTAGGCAAG
GCAATATGGGTTGCCAGAAAACTGCCGATGCTACTGTAGGTAAGTCACCGTCGCCGTCGCCGTTGCCGGCATCGCTAATATTTGTTCTTGGCATTTTGTTGGCTGTAATC
GTCTTTGAAGCATAG
mRNA sequenceShow/hide mRNA sequence
ATGGGTAGGAGTGAATTCCGTCTTGCGAAACTATGTCCCCAGCTATCTATTCGATTTTATCCCCAAAATGGTAGGCTTGTTGAGTCGGCGATGCTGGCCACTCTCACCCA
TGCAGATCAAAGGATAATCTCGACATTAAACCCAACAAGGCCTATATCTCTGTGTAATGTCAGCTACAAGTTGATTTCTAAGGTTCTGGTGAACCGTATGAAGTTTGTTT
TGAACAAGGTTATATCTCAAAATCAGAGTGCTTTCATACCAGGGCGATGTGTGGTTGACAACGCCATTTTGGGCTTTGAGTGTATTCATGGTCTGAAGAAACGGACAAGG
GGTTCTTCTAGATGGGCAGCCCTGAAACTGGATATGAGTAAGGCCTACGACAGAGTAGAGTGGTCATTCATTCAGCAGATCATGCTTCGTTTGGGGTTTGCTCAGGAGTG
GGTGGATCTGATCATGAGATGTGTGAGCTCAGTTTCTTTCTCGTTCAACCTAAATGGGGAGAAATTGGGTCAGGTGGTCCCAACTAGGGGTCTCCAACAGGGGGATTCTT
TGTCACCTTACCTTTTCCTTCTGTGTGCAGAAGATTTGTCCAGCTTGCTTCGTGGGGTAGAGCAAAGGTCTCTTATCTCAGGTTTTCAGCTTGCGCGATCCTGCCCATCA
ATTTCGCACCTGTTTTTCGCTGACGATAGTTTGCTTTTCTTCCGAGCAAATGCTGCAGAGAGTAGTTGTGTTCGGGATCTCCTGTTGCTATACGAGCGTGCCTCTGGTCA
GACTATAAATTATGAAAAGTCTGTTGTGGCTTTCAGCCCAAATACAGGGGATGATTGTAAGCAGTATGTTAGCCTAATCCTATCTGTTGTATGTAAGCCATGCCATAGCC
AGTACCTGGGCCTTCCCTCTTTTATGCCTCGTAGTCGATCAGGGGGTAAAGAGGTGCTGCTTAAGTCGATTGTGCAAGCTATTCCCTATTACACGATGAACTGCTTCAGA
CTCCCAAAATGCCTGGTTAGTGAGATTCATCGTCTCATGGCGAACTTTTGGTGGGATAATCCAGATGGGGGGAAGAGAATTCACTGGCTGAGTTGGAAGTCCCTTTGTCG
TCCAAAGTGTCGTGGTGGCTTGGGTTTCAGAGATATGGAATTGTTTAACCAGTCGCTCTTGGCAAAACAGTGTTGGCTGGTTTGGGCCGACAACCTTCATATATCTGAAG
GAGCTTGCTATGGGGGCGTGATTTACTGGCTAGAGGCTGTTGTTGGTCATTGGGATGGGGAGAAAATACGTGCCCATTTTACTGTGGCAGACTGTGATGCTATTTTGAGG
ATCTCGCTTGGGAGTCTCTTATCTGAAGATCAGTTGATATGGCATTTCGAAAAGAATGGGTTTTTCTCTGTTAGAAGTGGATACCGCTTGGCCCATAGTTTGAGTATACA
AGATCAGGCGTCATCGTCAGACTCGACTATATGGCAGGGGTGGTGGTCTAGCCTCTGGAAGATGAATGTCCCGAGCAAGATCAAATTCTTTTTTTGGAGGTTATCTCACA
ACCGGCTTCCCACCAAGGATAATCTTCTTAAAAGAGGTATGGAAATTTCTAATATGTGTGTGATGTGCCAGTCTTTTCTTGAGGATTGTTTGCATGTCTTTTGGGATTGT
CCTATGATCAAATCTATGTGGTGTTGCTCAAAATTCTCTTTTTTACGTCAATCTTGTTTTGGGTTGAGTTTTGACTCGCTGCTGTGGGCGATAAGAGATAGGGCGTTTCA
GTTGGCTTTGGGGAGAGATGTCCAGCAGTCGGTGGCGTTGCATCAATCGCAGGCGGTGGAGGCTGCTGTCTTGTGGGTTCCGCCTGCTGTAAATGAGCTTAAATTGAATG
TGGATGCGTCGGTGAGGTCGGTTTCGGGGGAGGCTTACGGTGGGTTTGTCCTGAGGAATGAGAGAGGGGAGGTCCTATTGACGGCGTGCGAAATTCTGCCAAAATGTTGG
AGTGTCGATTTGGCCGAGGGATGGGCAATGTTGAGAGGTATCCAAATAGCCCACCAAATGGGCTTTTCCAGGTTCCATGTTGAGATTGACTCGCTGAGGCTTTCAAGAGC
TTTAAATGATGAGGTGATTGATATTTCGGAAGTAGGAGCAATTATGGACATGGTTCGCAGCTTGCGCCAGCGTGATTCTTCATGTAGGGTGTTGTTTACTCCTAGGCAAG
GCAATATGGGTTGCCAGAAAACTGCCGATGCTACTGTAGGTAAGTCACCGTCGCCGTCGCCGTTGCCGGCATCGCTAATATTTGTTCTTGGCATTTTGTTGGCTGTAATC
GTCTTTGAAGCATAG
Protein sequenceShow/hide protein sequence
MGRSEFRLAKLCPQLSIRFYPQNGRLVESAMLATLTHADQRIISTLNPTRPISLCNVSYKLISKVLVNRMKFVLNKVISQNQSAFIPGRCVVDNAILGFECIHGLKKRTR
GSSRWAALKLDMSKAYDRVEWSFIQQIMLRLGFAQEWVDLIMRCVSSVSFSFNLNGEKLGQVVPTRGLQQGDSLSPYLFLLCAEDLSSLLRGVEQRSLISGFQLARSCPS
ISHLFFADDSLLFFRANAAESSCVRDLLLLYERASGQTINYEKSVVAFSPNTGDDCKQYVSLILSVVCKPCHSQYLGLPSFMPRSRSGGKEVLLKSIVQAIPYYTMNCFR
LPKCLVSEIHRLMANFWWDNPDGGKRIHWLSWKSLCRPKCRGGLGFRDMELFNQSLLAKQCWLVWADNLHISEGACYGGVIYWLEAVVGHWDGEKIRAHFTVADCDAILR
ISLGSLLSEDQLIWHFEKNGFFSVRSGYRLAHSLSIQDQASSSDSTIWQGWWSSLWKMNVPSKIKFFFWRLSHNRLPTKDNLLKRGMEISNMCVMCQSFLEDCLHVFWDC
PMIKSMWCCSKFSFLRQSCFGLSFDSLLWAIRDRAFQLALGRDVQQSVALHQSQAVEAAVLWVPPAVNELKLNVDASVRSVSGEAYGGFVLRNERGEVLLTACEILPKCW
SVDLAEGWAMLRGIQIAHQMGFSRFHVEIDSLRLSRALNDEVIDISEVGAIMDMVRSLRQRDSSCRVLFTPRQGNMGCQKTADATVGKSPSPSPLPASLIFVLGILLAVI
VFEA