; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0000545 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0000545
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr4:9647416..9650136
RNA-Seq ExpressionLag0000545
SyntenyLag0000545
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR026960 - Reverse transcriptase zinc-binding domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
BAF22173.1 Os07g0613900 [Oryza sativa Japonica Group]1.1e-11734.35Show/hide
Query:  GDRNTRWFHQKASMCRRQNWIDGVEDSNGIWQTDKANIHEAFENYFKEIFSSSHTEQRDMDGVLSHVHQKVTDEMNQMLLNPFSREEVLAAIKSFPPTKA
        GDRNT +FH   S  +R N I  ++  +G W   +         YFK +F S+  +  +   +LS V +KVT+EMN+ L+  F  EE+  +I S    KA
Subjt:  GDRNTRWFHQKASMCRRQNWIDGVEDSNGIWQTDKANIHEAFENYFKEIFSSSHTEQRDMDGVLSHVHQKVTDEMNQMLLNPFSREEVLAAIKSFPPTKA

Query:  PRPDGFPAIFFQKYWDVVGDTTISNCLNILNSKASVKEWNHINIVLIPKSRQARLVSDYRPISLCNVSYKIVTEVLANRLKLILNEIIDESQSAFIYGRS
        P PDG PA+F++ +WDVVG    S  L+ILN       WN   IVLIPK +Q   + D RPISL NV YKIV++VLANR+K +L ++I +SQSAF+ G  
Subjt:  PRPDGFPAIFFQKYWDVVGDTTISNCLNILNSKASVKEWNHINIVLIPKSRQARLVSDYRPISLCNVSYKIVTEVLANRLKLILNEIIDESQSAFIYGRS

Query:  ITDNMILGHESLHFLNKKRKGKVGYAALKLDMSKAYDRV------------AAKEFRHFRNILKDYEKASGQSVNLTKSMLGMPSSFSRGKTCDF-----
        I+DN+++ +E  H L  KR G VG+AALKLDMSKAYDRV               +     +  K Y++ SGQ +N  KS +    + SR K  +F     
Subjt:  ITDNMILGHESLHFLNKKRKGKVGYAALKLDMSKAYDRV------------AAKEFRHFRNILKDYEKASGQSVNLTKSMLGMPSSFSRGKTCDF-----

Query:  ---KFILDKVWVVLQGWKSQFFSQGGREVLIKSIVQAIPTYAMRCFKI------------------------------------PKGVRGLNFRDLVAFN
           + + D++W  +QGWK +  S+ G+EVLIK++ QAIPT+AM CF +                                    PK   GL FRD+ AFN
Subjt:  ---KFILDKVWVVLQGWKSQFFSQGGREVLIKSIVQAIPTYAMRCFKI------------------------------------PKGVRGLNFRDLVAFN

Query:  QEMLAKQAWRVLTNPDLMVSKVLCGKYFPNSSVLTASITASSSFFWKGFVWGMDLLRCGIRKNLGNGQSISIFSDPWILRPSTFKVISPYDPHMNNTPIA
          MLAKQ WR++ NP+ + ++VL  KY+P+ ++ +A   A+ S+ W+    G+  LR G+   +GNG +I+I+SDPWI    T + I+P   ++  T + 
Subjt:  QEMLAKQAWRVLTNPDLMVSKVLCGKYFPNSSVLTASITASSSFFWKGFVWGMDLLRCGIRKNLGNGQSISIFSDPWILRPSTFKVISPYDPHMNNTPIA

Query:  EFITPSLQS-DVPKLNQFLVELDVDLIKRLPISGSAPNKWIWHYDGRGMYSVKSGYKLAMLKSQETSLSDVERQNAWWRKVWKMQVPSKVKNFVWKYFHN
        E I PS  S DV  L Q     DV++I+ +P+     +   WHYD +G++SVKS YK             V+R     R+                    
Subjt:  EFITPSLQS-DVPKLNQFLVELDVDLIKRLPISGSAPNKWIWHYDGRGMYSVKSGYKLAMLKSQETSLSDVERQNAWWRKVWKMQVPSKVKNFVWKYFHN

Query:  KWIWHYDGRGMYSVKSGYKLAMLKSQETSLSDVERQNAWWRKVWKMQVPSKVKNFVWKYFHNSIPKMVNLGHHHVPVSGNCPICNDEMETTDHAMFQCSR
                RG   + +G                ER+  +W+KVWKMQ+P KVK+F+W++ HN +   VNL    + +   C ICN   E   H  F+C +
Subjt:  KWIWHYDGRGMYSVKSGYKLAMLKSQETSLSDVERQNAWWRKVWKMQVPSKVKNFVWKYFHNSIPKMVNLGHHHVPVSGNCPICNDEMETTDHAMFQCSR

Query:  AREIWSLIHPPMMR-SLVDQMDIKDRWQGLVDEPMRVLERVSVGAWSIWNDRNK
         +++W   +  ++R  L ++   ++  + + + P + +   S+  W  W++RN+
Subjt:  AREIWSLIHPPMMR-SLVDQMDIKDRWQGLVDEPMRVLERVSVGAWSIWNDRNK

ONI01138.1 hypothetical protein PRUPE_6G123900 [Prunus persica]4.1e-10928.67Show/hide
Query:  GDRNTRWFHQKASMCRRQNWIDGVEDSNGIWQTDKANIHEAFENYFKEIFSSSHTEQRDMDGVLSHVHQKVTDEMNQMLLNPFSREEVLAAIKSFPPTKA
        GD+NT +FH +AS   ++N + G+ D+N  WQT++  I + F +YFK +FSSS  +Q  M+ +L+ V   +T  MN  LL  F+REE+   +    PTKA
Subjt:  GDRNTRWFHQKASMCRRQNWIDGVEDSNGIWQTDKANIHEAFENYFKEIFSSSHTEQRDMDGVLSHVHQKVTDEMNQMLLNPFSREEVLAAIKSFPPTKA

Query:  PRPDGFPAIFFQKYWDVVGDTTISNCLNILNSKASVKEWNHINIVLIPKSRQARLVSDYRPISLCNVSYKIVTEVLANRLKLILNEIIDESQSAFIYGRS
        P  DG PA+FFQKYW +VGD     CL ILN + SV+E+NH  I LIPK +   +VS++RPISLC   YK++ + +ANRLK +L+ +I E+QSAF+  R 
Subjt:  PRPDGFPAIFFQKYWDVVGDTTISNCLNILNSKASVKEWNHINIVLIPKSRQARLVSDYRPISLCNVSYKIVTEVLANRLKLILNEIIDESQSAFIYGRS

Query:  ITDNMILGHESLHFLNKKRKGKVGYAALKLDMSKAYDRV-------------------------------------------------------------
        I DN++   E ++ +   +KG+    ALKLDM+KAYDRV                                                             
Subjt:  ITDNMILGHESLHFLNKKRKGKVGYAALKLDMSKAYDRV-------------------------------------------------------------

Query:  ----------------------------------------------AAKEFRHFRNILKDYEKASGQSVNLTKS--------------------------
                                                        K+      + + YE+ +GQ +N +KS                          
Subjt:  ----------------------------------------------AAKEFRHFRNILKDYEKASGQSVNLTKS--------------------------

Query:  ----MLGMPSSFSRGKTCDFKFILDKVWVVLQGWKSQFFSQGGREVLIKSIVQAIPTYAMRCFKIPKGV-------------------------------
             LG+P+   +G+   F+ + DK+W  + GWK +  S+ G+E+LIK+++QAIPTY+M CF+IPKG+                               
Subjt:  ----MLGMPSSFSRGKTCDFKFILDKVWVVLQGWKSQFFSQGGREVLIKSIVQAIPTYAMRCFKIPKGV-------------------------------

Query:  -----RGLNFRDLVAFNQEMLAKQAWRVLTNPDLMVSKVLCGKYFPNSSVLTASITASSSFFWKGFVWGMDLLRCGIRKNLGNGQSISIFSDPWILRPST
              GL FRDL AFNQ +LAKQ WR+L  P+ +V+++   +Y P+   L A +  + SF W+   WG +LL  G+R  +G+G SI +++D W+  PS 
Subjt:  -----RGLNFRDLVAFNQEMLAKQAWRVLTNPDLMVSKVLCGKYFPNSSVLTASITASSSFFWKGFVWGMDLLRCGIRKNLGNGQSISIFSDPWILRPST

Query:  FKVISPYDPHMNNTPIAEFITPSLQSDVPKLNQFLVELDVDLIKRLPISGSAPNKWIWHYDGRGMYSVKSGYKLAMLKSQETSLSDVERQNAWWRKVWKM
        FK++SP    + +T + +  T S Q +VP L     + +VD I ++P++  A                                                
Subjt:  FKVISPYDPHMNNTPIAEFITPSLQSDVPKLNQFLVELDVDLIKRLPISGSAPNKWIWHYDGRGMYSVKSGYKLAMLKSQETSLSDVERQNAWWRKVWKM

Query:  QVPSKVKNFVWKYFHNKWIWHYDGRGMYSVKSGYKLAMLKSQETS---LSDVERQNAWWRKVWKMQVPSKVKNFVWKYFHNSIPKMVNLGHHHVPVSGNC
                      H+  IWHY+  GMYSVKSGY+LA L+  + S    + V+  + +W+K+W +++P+K+K F+W+   + +P    L +  +  +  C
Subjt:  QVPSKVKNFVWKYFHNKWIWHYDGRGMYSVKSGYKLAMLKSQETS---LSDVERQNAWWRKVWKMQVPSKVKNFVWKYFHNSIPKMVNLGHHHVPVSGNC

Query:  PICNDEMETTDHAMFQCSRAREIWSLIHPPMMRSLVDQMDIKDRWQGL-VDEPMRVLERVSVGAWSIWNDRNKSVTTNK
        P C+ + E+  HA++ C  A+E+W       +  +      ++ W  L +          +   W +WN RN  +   K
Subjt:  PICNDEMETTDHAMFQCSRAREIWSLIHPPMMRSLVDQMDIKDRWQGL-VDEPMRVLERVSVGAWSIWNDRNKSVTTNK

XP_023878301.1 uncharacterized protein LOC111990748 [Quercus suber]1.1e-10931.12Show/hide
Query:  GDRNTRWFHQKASMCRRQNWIDGVEDSNGIWQTDKANIHEAFENYFKEIFSSSHTEQRDMDGVLSHVHQKVTDEMNQMLLNPFSREEVLAAIKSFPPTKA
        GDRNT++FH +AS  R+QN I G+ D  G W  ++ +I +A  +YF  I+SSSH  Q  ++ V   +  KVT+EMN+ L+  F++EEV  A+K   P KA
Subjt:  GDRNTRWFHQKASMCRRQNWIDGVEDSNGIWQTDKANIHEAFENYFKEIFSSSHTEQRDMDGVLSHVHQKVTDEMNQMLLNPFSREEVLAAIKSFPPTKA

Query:  PRPDGFPAIFFQKYWDVVGDTTISNCLNILNSKASVKEWNHINIVLIPKSRQARLVSDYRPISLCNVSYKIVTEVLANRLKLILNEIIDESQSAFIYGRS
        P PDG  A+FFQKYW +VG+      LN+LN    + E N  NI LIPK+   + ++D+RPISLCNV YK+++++LANRLK +L  II E+QSAF   R 
Subjt:  PRPDGFPAIFFQKYWDVVGDTTISNCLNILNSKASVKEWNHINIVLIPKSRQARLVSDYRPISLCNVSYKIVTEVLANRLKLILNEIIDESQSAFIYGRS

Query:  ITDNMILGHESLHFLNKKRKGKVGYAALKLDMSKAYDRV-------------------------------------------------------------
        ITDN+++  E +H+L+ K  GK G+ A+KLDMSKA+DRV                                                             
Subjt:  ITDNMILGHESLHFLNKKRKGKVGYAALKLDMSKAYDRV-------------------------------------------------------------

Query:  ----------------------------------------------AAKEFRHFRNILKDYEKASGQSVNLTKS--------------------------
                                                      A +E    R+IL  YE+ASGQ +N  KS                          
Subjt:  ----------------------------------------------AAKEFRHFRNILKDYEKASGQSVNLTKS--------------------------

Query:  ----MLGMPSSFSRGKTCDFKFILDKVWVVLQGWKSQFFSQGGREVLIKSIVQAIPTYAMRCFKIPKGV-------------------------------
             LG+PS   R K+  F  + +KV   L GWK +  S GG+E+LIK++ QAIPTY M CF +P+G+                               
Subjt:  ----MLGMPSSFSRGKTCDFKFILDKVWVVLQGWKSQFFSQGGREVLIKSIVQAIPTYAMRCFKIPKGV-------------------------------

Query:  -----RGLNFRDLVAFNQEMLAKQAWRVLTNPDLMVSKVLCGKYFPNSSVLTASITASSSFFWKGFVWGMDLLRCGIRKNLGNGQSISIFSDPWILRPST
              GL FR+L AFN  MLAKQAWR+L NP+ +V +VL  +YFP   +L A + +S S+ W+     ++++R G R  +GNG+ I I+ D W+  PST
Subjt:  -----RGLNFRDLVAFNQEMLAKQAWRVLTNPDLMVSKVLCGKYFPNSSVLTASITASSSFFWKGFVWGMDLLRCGIRKNLGNGQSISIFSDPWILRPST

Query:  FKVISPYDPHMNNTPIAEFITPSLQ-SDVPKLNQFLVELDVDLIKRLPISGSAP-NKWIWHYDGRGMYSVKSGYKLAMLKSQETSLSDVERQNAWWRKVW
        +KVISP   +     ++  I P  +   V  L    +  +V+ I R+P+S + P +K IW  + +G +SVKS Y +A       S+ D            
Subjt:  FKVISPYDPHMNNTPIAEFITPSLQ-SDVPKLNQFLVELDVDLIKRLPISGSAP-NKWIWHYDGRGMYSVKSGYKLAMLKSQETSLSDVERQNAWWRKVW

Query:  KMQVPSKVKNFVWKYFHNKWIWHYDGRGMYSVKSGYKLAMLKSQETSLSDVERQNAWWRKVWKMQVPSKVKNFVWKYFHNSIPKMVNLGHHHVPVSGNCP
            P++                   RG  S    Y+L                   W+K+W + +P K+K F W+   + +P   N+    +  S  CP
Subjt:  KMQVPSKVKNFVWKYFHNKWIWHYDGRGMYSVKSGYKLAMLKSQETSLSDVERQNAWWRKVWKMQVPSKVKNFVWKYFHNSIPKMVNLGHHHVPVSGNCP

Query:  ICNDEMETTDHAMFQCSRAREIWSLIHPPMMRSLVDQMDIKDRWQGLV-DEPMRVLERVSVGAWSIWNDRNKSV
        IC    E  +HA+  C  A  +W                  D    L   +  +VLE   V +W+IW +RNK V
Subjt:  ICNDEMETTDHAMFQCSRAREIWSLIHPPMMRSLVDQMDIKDRWQGLV-DEPMRVLERVSVGAWSIWNDRNKSV

XP_023881891.1 uncharacterized protein LOC111994244 [Quercus suber]1.6e-10831.29Show/hide
Query:  MGDRNTRWFHQKASMCRRQNWIDGVEDSNGIWQTDKANIHEAFENYFKEIFSSSHTEQRDMDGVLSHVHQKVTDEMNQMLLNPFSREEVLAAIKSFPPTK
        +GDRNT++FH KAS  RR+N I+G+ D NG WQ     I +   +YF+ I+SSS   +  +  VL  +   VT+EMN  L+  F+REE+  A+    PTK
Subjt:  MGDRNTRWFHQKASMCRRQNWIDGVEDSNGIWQTDKANIHEAFENYFKEIFSSSHTEQRDMDGVLSHVHQKVTDEMNQMLLNPFSREEVLAAIKSFPPTK

Query:  APRPDGFPAIFFQKYWDVVGDTTISNCLNILNSKASVKEWNHINIVLIPKSRQARLVSDYRPISLCNVSYKIVTEVLANRLKLILNEIIDESQSAFIYGR
        AP PDG  AIFFQKYW++VG+  +   L++LNS  S+ E N  NI L+PK +    +SD+RPISLCNV YK++++VLANRLK IL +II E+QSAF+ GR
Subjt:  APRPDGFPAIFFQKYWDVVGDTTISNCLNILNSKASVKEWNHINIVLIPKSRQARLVSDYRPISLCNVSYKIVTEVLANRLKLILNEIIDESQSAFIYGR

Query:  SITDNMILGHESLHFLNKKRKGKVGYAALKLDMSKAYDRVA-----------------------------------------------------------
         ITDN+++  E +H+L  K++GK G+AA+KLDMSKAYDRV                                                            
Subjt:  SITDNMILGHESLHFLNKKRKGKVGYAALKLDMSKAYDRVA-----------------------------------------------------------

Query:  ------------------------------------------------AKEFRHFRNILKDYEKASGQSVNLTKS-------------------------
                                                        ++E +   +IL+ YE ASGQ +N+ KS                         
Subjt:  ------------------------------------------------AKEFRHFRNILKDYEKASGQSVNLTKS-------------------------

Query:  -----MLGMPSSFSRGKTCDFKFILDKVWVVLQGWKSQFFSQGGREVLIKSIVQAIPTYAMRCFKIPK---------------GVR--------------
              LG+PS   + K   F  + ++V   L GWK +  S GGRE+LIK++ QAIPTY M CF+IPK               G R              
Subjt:  -----MLGMPSSFSRGKTCDFKFILDKVWVVLQGWKSQFFSQGGREVLIKSIVQAIPTYAMRCFKIPK---------------GVR--------------

Query:  -------GLNFRDLVAFNQEMLAKQAWRVLTNPDLMVSKVLCGKYFPNSSVLTASITASSSFFWKGFVWGMDLLRCGIRKNLGNGQSISIFSDPWILRPS
               G+ FR+L AFN  MLAKQ WR+++NP+ +V+++   +Y+P+  V  A + AS S+ W+    G++++R G R  +GNG+ I I+ D W+  P 
Subjt:  -------GLNFRDLVAFNQEMLAKQAWRVLTNPDLMVSKVLCGKYFPNSSVLTASITASSSFFWKGFVWGMDLLRCGIRKNLGNGQSISIFSDPWILRPS

Query:  TFKVIS---PYDPHMNNTPIAEFITPSLQSDVPKLNQFLVELDVDLIKRLPISGSAP-NKWIWHYDGRGMYSVKSGYKLAMLKSQETSLSDVERQNAWWR
        T+KVIS   P+D +   + + +      + DV  +    +  +   I  +P+S + P ++ IW  + +G +SVKS Y +A+                   
Subjt:  TFKVIS---PYDPHMNNTPIAEFITPSLQSDVPKLNQFLVELDVDLIKRLPISGSAP-NKWIWHYDGRGMYSVKSGYKLAMLKSQETSLSDVERQNAWWR

Query:  KVWKMQVPSKVKNFVWKYFHNKWIWHYDGRGMYSVKSGYKLAMLKSQETSLSDVERQNAWWRKVWKMQVPSKVKNFVWKYFHNSIPKMVNLGHHHVPVSG
                  + N                              L+  E+S  D   ++  WRK+W + +P KV+ F WK   N++P  +NL    V +  
Subjt:  KVWKMQVPSKVKNFVWKYFHNKWIWHYDGRGMYSVKSGYKLAMLKSQETSLSDVERQNAWWRKVWKMQVPSKVKNFVWKYFHNSIPKMVNLGHHHVPVSG

Query:  NCPICNDEMETTDHAMFQCSRAREIWS--LIHPPMMRSLVDQMDIKDRWQGLVD-EPMRVLERVSVGAWSIWNDRNKSV
         CP C  E E+  H   +C  A+ +W   L +P  + ++   MDI D    ++D      LE   V AW+IW +RNK V
Subjt:  NCPICNDEMETTDHAMFQCSRAREIWS--LIHPPMMRSLVDQMDIKDRWQGLVD-EPMRVLERVSVGAWSIWNDRNKSV

XP_024950112.1 uncharacterized protein LOC112496847 [Citrus sinensis]5.2e-11231.75Show/hide
Query:  GDRNTRWFHQKASMCRRQNWIDGVEDSNGIWQTDKANIHEAFENYFKEIFSSSHTEQRDMDGVLSHVHQKVTDEMNQMLLNPFSREEVLAAIKSFPPTKA
        GD+NT++FH KAS  +R+N + G+E+  GIW  ++  I E   ++F+E+F++S      +   L  +  KVT EMN  L  PF+ E+V  A+ +  PTKA
Subjt:  GDRNTRWFHQKASMCRRQNWIDGVEDSNGIWQTDKANIHEAFENYFKEIFSSSHTEQRDMDGVLSHVHQKVTDEMNQMLLNPFSREEVLAAIKSFPPTKA

Query:  PRPDGFPAIFFQKYWDVVGDTTISNCLNILNSKASVKEWNHINIVLIPKSRQARLVSDYRPISLCNVSYKIVTEVLANRLKLILNEIIDESQSAFIYGRS
        P PDG PA FFQK+W  V +  IS CLN+LN + +    NH  I LIPK    R VSDYRPISLCNV Y++V + +ANR+K IL++II   QSAFI  R 
Subjt:  PRPDGFPAIFFQKYWDVVGDTTISNCLNILNSKASVKEWNHINIVLIPKSRQARLVSDYRPISLCNVSYKIVTEVLANRLKLILNEIIDESQSAFIYGRS

Query:  ITDNMILGHESLHFLNKKRKGKVGYAALKLDMSKAYDR--------------------------------------------------------------
        ITDN+I+G+E LH +   +  K G  ALKLD+SKAYDR                                                              
Subjt:  ITDNMILGHESLHFLNKKRKGKVGYAALKLDMSKAYDR--------------------------------------------------------------

Query:  ----------VAAKEFRHFRNI-------------------------------LKDYEKASGQSV-NLT-----KSMLGMPSSFSRGKTCDFKFILDKVW
                  V A++ +  R +                               + ++++A+ + + NL      +  LG+PS   R K   F  I  KV 
Subjt:  ----------VAAKEFRHFRNI-------------------------------LKDYEKASGQSV-NLT-----KSMLGMPSSFSRGKTCDFKFILDKVW

Query:  VVLQGWKSQFFSQGGREVLIKSIVQAIPTYAMRCFKIPKGV------------------------------------RGLNFRDLVAFNQEMLAKQAWRV
          + GW+ +F S GG+EVLIK+  QAIP YAM  FK+P+G                                      GL FR+   FNQ ++AKQAWR+
Subjt:  VVLQGWKSQFFSQGGREVLIKSIVQAIPTYAMRCFKIPKGV------------------------------------RGLNFRDLVAFNQEMLAKQAWRV

Query:  LTNPDLMVSKVLCGKYFPNSSVLTASITASSSFFWKGFVWGMDLLRCGIRKNLGNGQSISIFSDPWILRPSTFKVISPYDPHMNNTPIAEFITPSLQSDV
        L  P+ +VS+VL  +YF NSS L A   A++S+ W+  +WG  +++ G+R  +GNG+ I+IFSD W+ RP TF+ I P    +++  +A+ I    Q D 
Subjt:  LTNPDLMVSKVLCGKYFPNSSVLTASITASSSFFWKGFVWGMDLLRCGIRKNLGNGQSISIFSDPWILRPSTFKVISPYDPHMNNTPIAEFITPSLQSDV

Query:  PKLNQFLVELDVDLIKRLPISGSAPNKWIWHYDGRGMYSVKSGYKLAMLKSQETSLSDVERQNAWWRKVWKMQVPSKVKNFVWKYFHNKWIWHYDGRGMY
         KL Q  +++D   I ++P+                             K+++  L                                   WHYD RG Y
Subjt:  PKLNQFLVELDVDLIKRLPISGSAPNKWIWHYDGRGMYSVKSGYKLAMLKSQETSLSDVERQNAWWRKVWKMQVPSKVKNFVWKYFHNKWIWHYDGRGMY

Query:  SVKSGYKLAMLKSQETSLSDVERQNAWWRKVWKMQVPSKVKNFVWKYFHNSIPKMVNLGHHHVPVSGNCPICNDEMETTDHAMFQCSRAREIWSLIHPPM
        SVKSGY+LA+      S S  E  + +W  +W +++P K+K F+W+  +N +P   NL    V     C  C   +ET  HA+ +C  AR+IW       
Subjt:  SVKSGYKLAMLKSQETSLSDVERQNAWWRKVWKMQVPSKVKNFVWKYFHNSIPKMVNLGHHHVPVSGNCPICNDEMETTDHAMFQCSRAREIWSLIHPPM

Query:  MRSLVDQMDIKDRWQGLVDEPMRV-LERVSVGAWSIWNDRNKSV
         R   +  DI    Q +  E  +  LE +    WS W  RNK +
Subjt:  MRSLVDQMDIKDRWQGLVDEPMRV-LERVSVGAWSIWNDRNKSV

TrEMBL top hitse value%identityAlignment
A0A251NPF0 Reverse transcriptase domain-containing protein2.0e-10928.67Show/hide
Query:  GDRNTRWFHQKASMCRRQNWIDGVEDSNGIWQTDKANIHEAFENYFKEIFSSSHTEQRDMDGVLSHVHQKVTDEMNQMLLNPFSREEVLAAIKSFPPTKA
        GD+NT +FH +AS   ++N + G+ D+N  WQT++  I + F +YFK +FSSS  +Q  M+ +L+ V   +T  MN  LL  F+REE+   +    PTKA
Subjt:  GDRNTRWFHQKASMCRRQNWIDGVEDSNGIWQTDKANIHEAFENYFKEIFSSSHTEQRDMDGVLSHVHQKVTDEMNQMLLNPFSREEVLAAIKSFPPTKA

Query:  PRPDGFPAIFFQKYWDVVGDTTISNCLNILNSKASVKEWNHINIVLIPKSRQARLVSDYRPISLCNVSYKIVTEVLANRLKLILNEIIDESQSAFIYGRS
        P  DG PA+FFQKYW +VGD     CL ILN + SV+E+NH  I LIPK +   +VS++RPISLC   YK++ + +ANRLK +L+ +I E+QSAF+  R 
Subjt:  PRPDGFPAIFFQKYWDVVGDTTISNCLNILNSKASVKEWNHINIVLIPKSRQARLVSDYRPISLCNVSYKIVTEVLANRLKLILNEIIDESQSAFIYGRS

Query:  ITDNMILGHESLHFLNKKRKGKVGYAALKLDMSKAYDRV-------------------------------------------------------------
        I DN++   E ++ +   +KG+    ALKLDM+KAYDRV                                                             
Subjt:  ITDNMILGHESLHFLNKKRKGKVGYAALKLDMSKAYDRV-------------------------------------------------------------

Query:  ----------------------------------------------AAKEFRHFRNILKDYEKASGQSVNLTKS--------------------------
                                                        K+      + + YE+ +GQ +N +KS                          
Subjt:  ----------------------------------------------AAKEFRHFRNILKDYEKASGQSVNLTKS--------------------------

Query:  ----MLGMPSSFSRGKTCDFKFILDKVWVVLQGWKSQFFSQGGREVLIKSIVQAIPTYAMRCFKIPKGV-------------------------------
             LG+P+   +G+   F+ + DK+W  + GWK +  S+ G+E+LIK+++QAIPTY+M CF+IPKG+                               
Subjt:  ----MLGMPSSFSRGKTCDFKFILDKVWVVLQGWKSQFFSQGGREVLIKSIVQAIPTYAMRCFKIPKGV-------------------------------

Query:  -----RGLNFRDLVAFNQEMLAKQAWRVLTNPDLMVSKVLCGKYFPNSSVLTASITASSSFFWKGFVWGMDLLRCGIRKNLGNGQSISIFSDPWILRPST
              GL FRDL AFNQ +LAKQ WR+L  P+ +V+++   +Y P+   L A +  + SF W+   WG +LL  G+R  +G+G SI +++D W+  PS 
Subjt:  -----RGLNFRDLVAFNQEMLAKQAWRVLTNPDLMVSKVLCGKYFPNSSVLTASITASSSFFWKGFVWGMDLLRCGIRKNLGNGQSISIFSDPWILRPST

Query:  FKVISPYDPHMNNTPIAEFITPSLQSDVPKLNQFLVELDVDLIKRLPISGSAPNKWIWHYDGRGMYSVKSGYKLAMLKSQETSLSDVERQNAWWRKVWKM
        FK++SP    + +T + +  T S Q +VP L     + +VD I ++P++  A                                                
Subjt:  FKVISPYDPHMNNTPIAEFITPSLQSDVPKLNQFLVELDVDLIKRLPISGSAPNKWIWHYDGRGMYSVKSGYKLAMLKSQETSLSDVERQNAWWRKVWKM

Query:  QVPSKVKNFVWKYFHNKWIWHYDGRGMYSVKSGYKLAMLKSQETS---LSDVERQNAWWRKVWKMQVPSKVKNFVWKYFHNSIPKMVNLGHHHVPVSGNC
                      H+  IWHY+  GMYSVKSGY+LA L+  + S    + V+  + +W+K+W +++P+K+K F+W+   + +P    L +  +  +  C
Subjt:  QVPSKVKNFVWKYFHNKWIWHYDGRGMYSVKSGYKLAMLKSQETS---LSDVERQNAWWRKVWKMQVPSKVKNFVWKYFHNSIPKMVNLGHHHVPVSGNC

Query:  PICNDEMETTDHAMFQCSRAREIWSLIHPPMMRSLVDQMDIKDRWQGL-VDEPMRVLERVSVGAWSIWNDRNKSVTTNK
        P C+ + E+  HA++ C  A+E+W       +  +      ++ W  L +          +   W +WN RN  +   K
Subjt:  PICNDEMETTDHAMFQCSRAREIWSLIHPPMMRSLVDQMDIKDRWQGL-VDEPMRVLERVSVGAWSIWNDRNKSVTTNK

A0A2N9J109 Uncharacterized protein2.0e-10931.76Show/hide
Query:  GDRNTRWFHQKASMCRRQNWIDGVEDSNGIWQTDKANIHEAFENYFKEIFSSSHTEQRDMDGVLSHVHQKVTDEMNQMLLNPFSREEVLAAIKSFPPTKA
        GDRNT +FH +A+  +R+N I G+ D++G+W+T++  I     +YF+ IF +S+     +D VL  V   VTD MN+ L  P++  EV  A++   P  A
Subjt:  GDRNTRWFHQKASMCRRQNWIDGVEDSNGIWQTDKANIHEAFENYFKEIFSSSHTEQRDMDGVLSHVHQKVTDEMNQMLLNPFSREEVLAAIKSFPPTKA

Query:  PRPDGFPAIFFQKYWDVVGDTTISNCLNILNSKASVKEWNHINIVLIPKSRQARLVSDYRPISLCNVSYKIVTEVLANRLKLILNEIIDESQSAFIYGRS
        P PDG P +F+Q +W ++G+  I   L+ LNS   V+  NH NI LIPK +    VS++RPISLCNV YKI+++V+ANRLK+IL  +I E+QSAF+ GR 
Subjt:  PRPDGFPAIFFQKYWDVVGDTTISNCLNILNSKASVKEWNHINIVLIPKSRQARLVSDYRPISLCNVSYKIVTEVLANRLKLILNEIIDESQSAFIYGRS

Query:  ITDNMILGHESLHFLNKKRKGKVGYAALKLDMSKAYDRV-------------------------------------------------------------
        ITDN+++  E+LH +   R+GK G+ ALKLDMSKAYDRV                                                             
Subjt:  ITDNMILGHESLHFLNKKRKGKVGYAALKLDMSKAYDRV-------------------------------------------------------------

Query:  -------------AAKEFRHFRNILKDYEKASGQSVNLTKS------------------------------MLGMPSSFSRGKTCDFKFILDKVWVVLQG
                     +  E ++ + IL  YE ASGQ +N  K+                               LG+PS   R K   F  I ++VW  L+G
Subjt:  -------------AAKEFRHFRNILKDYEKASGQSVNLTKS------------------------------MLGMPSSFSRGKTCDFKFILDKVWVVLQG

Query:  WKSQFFSQGGREVLIKSIVQAIPTYAMRCFKIPKGVRGLNFRDLVAFNQEMLAKQAWRVLTNPDLMVSKVLCGKYFPNSSVLTASITASSSFFWKGFVWG
        WK +  SQ GRE LIK ++QAIP Y + CFK+P  + GL FRDL  FN  +LAKQ WR+L N + +   V   K+FP+ ++L+A  +   S+ W      
Subjt:  WKSQFFSQGGREVLIKSIVQAIPTYAMRCFKIPKGVRGLNFRDLVAFNQEMLAKQAWRVLTNPDLMVSKVLCGKYFPNSSVLTASITASSSFFWKGFVWG

Query:  MDLLRCGIRKNLGNGQSISIFSDPWILRPSTFKVISPYDPHMNNTPIAEFITPSLQS-DVPKLNQFLVELDVDLIKRLPI-SGSAPNKWIWHYDGRGMYS
          +++ G+   +G+G S+ I+ + W+      KVISP      NT ++  I P  +S ++  L Q  +  D   I+ +P+     P+  IW +   G YS
Subjt:  MDLLRCGIRKNLGNGQSISIFSDPWILRPSTFKVISPYDPHMNNTPIAEFITPSLQS-DVPKLNQFLVELDVDLIKRLPI-SGSAPNKWIWHYDGRGMYS

Query:  VKSGYKLAMLKSQETSLSDVERQNAWWRKVWKMQVPSKVKNFVWKYFHNKWIWHYDGRGMYSVKSGYKLAMLKSQETSLSDVERQNAWWRKVWKMQVPSK
        V+S Y L +   QE + S                                                          +S S+ +   A W+KVW ++ P K
Subjt:  VKSGYKLAMLKSQETSLSDVERQNAWWRKVWKMQVPSKVKNFVWKYFHNKWIWHYDGRGMYSVKSGYKLAMLKSQETSLSDVERQNAWWRKVWKMQVPSK

Query:  VKNFVWKYFHNSIPKMVNLGHHHVPVSGNCPICNDEMETTDHAMFQCSRAREIWSLIHPPMMRSLVDQM----DIKDRWQGLVDEPMRVLERVSVGAWSI
        VKNF+W+    S+P  +NL    V  +  C  C +E+E   HA++QC     +WS       + L        D+  R   L+DE   +  R +  AW +
Subjt:  VKNFVWKYFHNSIPKMVNLGHHHVPVSGNCPICNDEMETTDHAMFQCSRAREIWSLIHPPMMRSLVDQM----DIKDRWQGLVDEPMRVLERVSVGAWSI

Query:  WNDRNK
        W++RNK
Subjt:  WNDRNK

A0A803Q6Y1 Uncharacterized protein8.9e-11032.85Show/hide
Query:  GDRNTRWFHQKASMCRRQNWIDGVEDSNGIWQTDKANIHEAFENYFKEIFSSSHTEQRDMDGVLSHVHQKVTDEMNQMLLNPFSREEVLAAIKSFPPTKA
        GD NT++FH  AS  +  N I  + +  G+  + K  + E   N+F  +F+++ T    +  +L  +   V+D+MN  LL PF+  EVL A+++  P K+
Subjt:  GDRNTRWFHQKASMCRRQNWIDGVEDSNGIWQTDKANIHEAFENYFKEIFSSSHTEQRDMDGVLSHVHQKVTDEMNQMLLNPFSREEVLAAIKSFPPTKA

Query:  PRPDGFPAIFFQKYWDVVGDTTISNCLNILNSKASVKEWNHINIVLIPKSRQARLVSDYRPISLCNVSYKIVTEVLANRLKLILNEIIDESQSAFIYGRS
        P  DG  A+F+Q YWD +G    +  L +LN   ++   N   I LIPK  + + + DYRPISLCNV YK++++V+  R K +L  +I E+QSAF+  R 
Subjt:  PRPDGFPAIFFQKYWDVVGDTTISNCLNILNSKASVKEWNHINIVLIPKSRQARLVSDYRPISLCNVSYKIVTEVLANRLKLILNEIIDESQSAFIYGRS

Query:  ITDNMILGHESLHFLNKKRKGKVGYAALKLDMSKAYDRVAAKEFRHFRNILKDYEKASGQSVNLTKSM-----------LGMPSSFSRGKTCDFKFILDK
        ITDN+++  E +H L  K +G  GY+ALKLDMSKA+DRV             DY  A  + ++L K++           LG+P+  S  K   F  + ++
Subjt:  ITDNMILGHESLHFLNKKRKGKVGYAALKLDMSKAYDRVAAKEFRHFRNILKDYEKASGQSVNLTKSM-----------LGMPSSFSRGKTCDFKFILDK

Query:  VWVVLQGWKSQFFSQGGREVLIKSIVQAIPTYAMRCFKIP------------------------------------KGVRGLNFRDLVAFNQEMLAKQAW
        +W +L  W  + FS GG+EVL+K++VQ+IPTYAM CF++P                                    K   G+ FR  V FNQ MLAKQAW
Subjt:  VWVVLQGWKSQFFSQGGREVLIKSIVQAIPTYAMRCFKIP------------------------------------KGVRGLNFRDLVAFNQEMLAKQAW

Query:  RVLTNPDLMVSKVLCGKYFPNSSVLTASITASSSFFWKGFVWGMDLLRCGIRKNLGNGQSISIFSDPWILRPSTFKVIS-PYDPHMNNTPIAEFITPSLQ
        R+  NP+ ++ ++L  +YFP +S L A    S S  W+G  WG +LL  G+R  +GNG  +    +PWI     FK IS   DP   +TP++ +I  +++
Subjt:  RVLTNPDLMVSKVLCGKYFPNSSVLTASITASSSFFWKGFVWGMDLLRCGIRKNLGNGQSISIFSDPWILRPSTFKVIS-PYDPHMNNTPIAEFITPSLQ

Query:  SDVPKLNQFLVELDVDLIKRLPIS-GSAPNKWIWHYDGRGMYSVKSGYKLAMLKSQETSLSDVERQNAWWRKVWKMQVPSKVKNFVWKYFHNKWIWHYDG
         ++P+L+Q   ++D+D I  +P+S   + ++ IWH++  G YSVKSG+ LA      TSLS+                                      
Subjt:  SDVPKLNQFLVELDVDLIKRLPIS-GSAPNKWIWHYDGRGMYSVKSGYKLAMLKSQETSLSDVERQNAWWRKVWKMQVPSKVKNFVWKYFHNKWIWHYDG

Query:  RGMYSVKSGYKLAMLKSQETSLSDVERQNAWWRKVWKMQVPSKVKNFVWKYFHNSIPKMVNLGHHHVPVSGNCPICNDEMETTDHAMFQCSRAREIW---
                       K QE+S  D +    WW+  WK+ +P KVK F WK   N++P    L    V  S  C  C    E+  HA+F C  A+ IW   
Subjt:  RGMYSVKSGYKLAMLKSQETSLSDVERQNAWWRKVWKMQVPSKVKNFVWKYFHNSIPKMVNLGHHHVPVSGNCPICNDEMETTDHAMFQCSRAREIW---

Query:  ----------SLIHPPMMRSLVDQMDIKDRWQGLVDEPMRVLERVSVGAWSIWNDRNK
                   + +   +  L   MD K+ ++ L+              WSIWNDRNK
Subjt:  ----------SLIHPPMMRSLVDQMDIKDRWQGLVDEPMRVLERVSVGAWSIWNDRNK

M5W5F3 Reverse transcriptase domain-containing protein (Fragment)2.0e-10928.67Show/hide
Query:  GDRNTRWFHQKASMCRRQNWIDGVEDSNGIWQTDKANIHEAFENYFKEIFSSSHTEQRDMDGVLSHVHQKVTDEMNQMLLNPFSREEVLAAIKSFPPTKA
        GD+NT +FH +AS   ++N + G+ D+N  WQT++  I + F +YFK +FSSS  +Q  M+ +L+ V   +T  MN  LL  F+REE+   +    PTKA
Subjt:  GDRNTRWFHQKASMCRRQNWIDGVEDSNGIWQTDKANIHEAFENYFKEIFSSSHTEQRDMDGVLSHVHQKVTDEMNQMLLNPFSREEVLAAIKSFPPTKA

Query:  PRPDGFPAIFFQKYWDVVGDTTISNCLNILNSKASVKEWNHINIVLIPKSRQARLVSDYRPISLCNVSYKIVTEVLANRLKLILNEIIDESQSAFIYGRS
        P  DG PA+FFQKYW +VGD     CL ILN + SV+E+NH  I LIPK +   +VS++RPISLC   YK++ + +ANRLK +L+ +I E+QSAF+  R 
Subjt:  PRPDGFPAIFFQKYWDVVGDTTISNCLNILNSKASVKEWNHINIVLIPKSRQARLVSDYRPISLCNVSYKIVTEVLANRLKLILNEIIDESQSAFIYGRS

Query:  ITDNMILGHESLHFLNKKRKGKVGYAALKLDMSKAYDRV-------------------------------------------------------------
        I DN++   E ++ +   +KG+    ALKLDM+KAYDRV                                                             
Subjt:  ITDNMILGHESLHFLNKKRKGKVGYAALKLDMSKAYDRV-------------------------------------------------------------

Query:  ----------------------------------------------AAKEFRHFRNILKDYEKASGQSVNLTKS--------------------------
                                                        K+      + + YE+ +GQ +N +KS                          
Subjt:  ----------------------------------------------AAKEFRHFRNILKDYEKASGQSVNLTKS--------------------------

Query:  ----MLGMPSSFSRGKTCDFKFILDKVWVVLQGWKSQFFSQGGREVLIKSIVQAIPTYAMRCFKIPKGV-------------------------------
             LG+P+   +G+   F+ + DK+W  + GWK +  S+ G+E+LIK+++QAIPTY+M CF+IPKG+                               
Subjt:  ----MLGMPSSFSRGKTCDFKFILDKVWVVLQGWKSQFFSQGGREVLIKSIVQAIPTYAMRCFKIPKGV-------------------------------

Query:  -----RGLNFRDLVAFNQEMLAKQAWRVLTNPDLMVSKVLCGKYFPNSSVLTASITASSSFFWKGFVWGMDLLRCGIRKNLGNGQSISIFSDPWILRPST
              GL FRDL AFNQ +LAKQ WR+L  P+ +V+++   +Y P+   L A +  + SF W+   WG +LL  G+R  +G+G SI +++D W+  PS 
Subjt:  -----RGLNFRDLVAFNQEMLAKQAWRVLTNPDLMVSKVLCGKYFPNSSVLTASITASSSFFWKGFVWGMDLLRCGIRKNLGNGQSISIFSDPWILRPST

Query:  FKVISPYDPHMNNTPIAEFITPSLQSDVPKLNQFLVELDVDLIKRLPISGSAPNKWIWHYDGRGMYSVKSGYKLAMLKSQETSLSDVERQNAWWRKVWKM
        FK++SP    + +T + +  T S Q +VP L     + +VD I ++P++  A                                                
Subjt:  FKVISPYDPHMNNTPIAEFITPSLQSDVPKLNQFLVELDVDLIKRLPISGSAPNKWIWHYDGRGMYSVKSGYKLAMLKSQETSLSDVERQNAWWRKVWKM

Query:  QVPSKVKNFVWKYFHNKWIWHYDGRGMYSVKSGYKLAMLKSQETS---LSDVERQNAWWRKVWKMQVPSKVKNFVWKYFHNSIPKMVNLGHHHVPVSGNC
                      H+  IWHY+  GMYSVKSGY+LA L+  + S    + V+  + +W+K+W +++P+K+K F+W+   + +P    L +  +  +  C
Subjt:  QVPSKVKNFVWKYFHNKWIWHYDGRGMYSVKSGYKLAMLKSQETS---LSDVERQNAWWRKVWKMQVPSKVKNFVWKYFHNSIPKMVNLGHHHVPVSGNC

Query:  PICNDEMETTDHAMFQCSRAREIWSLIHPPMMRSLVDQMDIKDRWQGL-VDEPMRVLERVSVGAWSIWNDRNKSVTTNK
        P C+ + E+  HA++ C  A+E+W       +  +      ++ W  L +          +   W +WN RN  +   K
Subjt:  PICNDEMETTDHAMFQCSRAREIWSLIHPPMMRSLVDQMDIKDRWQGL-VDEPMRVLERVSVGAWSIWNDRNKSVTTNK

Q0D4Q0 Os07g0613900 protein5.2e-11834.35Show/hide
Query:  GDRNTRWFHQKASMCRRQNWIDGVEDSNGIWQTDKANIHEAFENYFKEIFSSSHTEQRDMDGVLSHVHQKVTDEMNQMLLNPFSREEVLAAIKSFPPTKA
        GDRNT +FH   S  +R N I  ++  +G W   +         YFK +F S+  +  +   +LS V +KVT+EMN+ L+  F  EE+  +I S    KA
Subjt:  GDRNTRWFHQKASMCRRQNWIDGVEDSNGIWQTDKANIHEAFENYFKEIFSSSHTEQRDMDGVLSHVHQKVTDEMNQMLLNPFSREEVLAAIKSFPPTKA

Query:  PRPDGFPAIFFQKYWDVVGDTTISNCLNILNSKASVKEWNHINIVLIPKSRQARLVSDYRPISLCNVSYKIVTEVLANRLKLILNEIIDESQSAFIYGRS
        P PDG PA+F++ +WDVVG    S  L+ILN       WN   IVLIPK +Q   + D RPISL NV YKIV++VLANR+K +L ++I +SQSAF+ G  
Subjt:  PRPDGFPAIFFQKYWDVVGDTTISNCLNILNSKASVKEWNHINIVLIPKSRQARLVSDYRPISLCNVSYKIVTEVLANRLKLILNEIIDESQSAFIYGRS

Query:  ITDNMILGHESLHFLNKKRKGKVGYAALKLDMSKAYDRV------------AAKEFRHFRNILKDYEKASGQSVNLTKSMLGMPSSFSRGKTCDF-----
        I+DN+++ +E  H L  KR G VG+AALKLDMSKAYDRV               +     +  K Y++ SGQ +N  KS +    + SR K  +F     
Subjt:  ITDNMILGHESLHFLNKKRKGKVGYAALKLDMSKAYDRV------------AAKEFRHFRNILKDYEKASGQSVNLTKSMLGMPSSFSRGKTCDF-----

Query:  ---KFILDKVWVVLQGWKSQFFSQGGREVLIKSIVQAIPTYAMRCFKI------------------------------------PKGVRGLNFRDLVAFN
           + + D++W  +QGWK +  S+ G+EVLIK++ QAIPT+AM CF +                                    PK   GL FRD+ AFN
Subjt:  ---KFILDKVWVVLQGWKSQFFSQGGREVLIKSIVQAIPTYAMRCFKI------------------------------------PKGVRGLNFRDLVAFN

Query:  QEMLAKQAWRVLTNPDLMVSKVLCGKYFPNSSVLTASITASSSFFWKGFVWGMDLLRCGIRKNLGNGQSISIFSDPWILRPSTFKVISPYDPHMNNTPIA
          MLAKQ WR++ NP+ + ++VL  KY+P+ ++ +A   A+ S+ W+    G+  LR G+   +GNG +I+I+SDPWI    T + I+P   ++  T + 
Subjt:  QEMLAKQAWRVLTNPDLMVSKVLCGKYFPNSSVLTASITASSSFFWKGFVWGMDLLRCGIRKNLGNGQSISIFSDPWILRPSTFKVISPYDPHMNNTPIA

Query:  EFITPSLQS-DVPKLNQFLVELDVDLIKRLPISGSAPNKWIWHYDGRGMYSVKSGYKLAMLKSQETSLSDVERQNAWWRKVWKMQVPSKVKNFVWKYFHN
        E I PS  S DV  L Q     DV++I+ +P+     +   WHYD +G++SVKS YK             V+R     R+                    
Subjt:  EFITPSLQS-DVPKLNQFLVELDVDLIKRLPISGSAPNKWIWHYDGRGMYSVKSGYKLAMLKSQETSLSDVERQNAWWRKVWKMQVPSKVKNFVWKYFHN

Query:  KWIWHYDGRGMYSVKSGYKLAMLKSQETSLSDVERQNAWWRKVWKMQVPSKVKNFVWKYFHNSIPKMVNLGHHHVPVSGNCPICNDEMETTDHAMFQCSR
                RG   + +G                ER+  +W+KVWKMQ+P KVK+F+W++ HN +   VNL    + +   C ICN   E   H  F+C +
Subjt:  KWIWHYDGRGMYSVKSGYKLAMLKSQETSLSDVERQNAWWRKVWKMQVPSKVKNFVWKYFHNSIPKMVNLGHHHVPVSGNCPICNDEMETTDHAMFQCSR

Query:  AREIWSLIHPPMMR-SLVDQMDIKDRWQGLVDEPMRVLERVSVGAWSIWNDRNK
         +++W   +  ++R  L ++   ++  + + + P + +   S+  W  W++RN+
Subjt:  AREIWSLIHPPMMR-SLVDQMDIKDRWQGLVDEPMRVLERVSVGAWSIWNDRNK

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein5.3e-1926.99Show/hide
Query:  RRQNWIDGVEDSNGIWQTDKANIHEAFENYFKEIFSSSHTEQRDMDGVL-SHVHQKVTDEMNQMLLNPFSREEVLAAIKSFPPTKAPRPDGFPAIFFQKY
        R +N ID +++  G   TD   I      Y+K ++++      +MD  L ++   ++  E  + L  P +  E++A I S P  K+P PDGF A F+Q+Y
Subjt:  RRQNWIDGVEDSNGIWQTDKANIHEAFENYFKEIFSSSHTEQRDMDGVL-SHVHQKVTDEMNQMLLNPFSREEVLAAIKSFPPTKAPRPDGFPAIFFQKY

Query:  WDVVGDTTISNCLNILNSKASVKEWNHINIVLIPK-SRQARLVSDYRPISLCNVSYKIVTEVLANRLKLILNEIIDESQSAFIYGRSITDNMILGHESLH
         + +    +    +I         +   +I+LIPK  R      ++RPISL N+  KI+ ++LANR++  + ++I   Q  FI G     N+      + 
Subjt:  WDVVGDTTISNCLNILNSKASVKEWNHINIVLIPK-SRQARLVSDYRPISLCNVSYKIVTEVLANRLKLILNEIIDESQSAFIYGRSITDNMILGHESLH

Query:  FLNKKRKGKVGYAALKLDMSKAYDRV
         +N+ +     +  + +D  KA+D++
Subjt:  FLNKKRKGKVGYAALKLDMSKAYDRV

P08548 LINE-1 reverse transcriptase homolog4.1e-1925.9Show/hide
Query:  DRNTRWFHQK--------ASMCRR---QNWIDGVEDSNGIWQTDKANIHEAFENYFKEIFSSSHTEQRDMDGVLSHVH-QKVTDEMNQMLLNPFSREEVL
        +++  WF +K        A++ R+   ++ I  + + N    TD + I +    Y+K+++S  +   +++D  L   H  +++ +  +ML  P S  E+ 
Subjt:  DRNTRWFHQK--------ASMCRR---QNWIDGVEDSNGIWQTDKANIHEAFENYFKEIFSSSHTEQRDMDGVLSHVH-QKVTDEMNQMLLNPFSREEVL

Query:  AAIKSFPPTKAPRPDGFPAIFFQKYWDVVGDTTISNCLNILNSKASVKEWNHINIVLIPK-SRQARLVSDYRPISLCNVSYKIVTEVLANRLKLILNEII
        + I++ P  K+P PDGF + F+Q + + +    ++   NI         +   NI LIPK  +      +YRPISL N+  KI+ ++L NR++  + +II
Subjt:  AAIKSFPPTKAPRPDGFPAIFFQKYWDVVGDTTISNCLNILNSKASVKEWNHINIVLIPK-SRQARLVSDYRPISLCNVSYKIVTEVLANRLKLILNEII

Query:  DESQSAFIYGRSITDNMILGHESLHFLNKKRKGKVGYAALKLDMSKAYDRV
           Q  FI G     N+      +  +NK +     +  L +D  KA+D +
Subjt:  DESQSAFIYGRSITDNMILGHESLHFLNKKRKGKVGYAALKLDMSKAYDRV

P11369 LINE-1 retrotransposable element ORF2 protein1.5e-1628.32Show/hide
Query:  IDGVEDSNGIWQTDKANIHEAFENYFKEIFSSSHTEQRDMDGVLSHVH-QKVTDEMNQMLLNPFSREEVLAAIKSFPPTKAPRPDGFPAIFFQKYWDVVG
        I+ + +  G   TD   I     +++K ++S+      +MD  L      K+  +    L +P S +E+ A I S P  K+P PDGF A F+Q + +   
Subjt:  IDGVEDSNGIWQTDKANIHEAFENYFKEIFSSSHTEQRDMDGVLSHVH-QKVTDEMNQMLLNPFSREEVLAAIKSFPPTKAPRPDGFPAIFFQKYWDVVG

Query:  DTTISNCLNILNSKASVK-----EWNHINIVLIPK-SRQARLVSDYRPISLCNVSYKIVTEVLANRLKLILNEIIDESQSAFIYGRSITDNMILGHESLH
           +   L+ L  K  V+      +    I LIPK  +    + ++RPISL N+  KI+ ++LANR++  +  II   Q  FI G     N+      +H
Subjt:  DTTISNCLNILNSKASVK-----EWNHINIVLIPK-SRQARLVSDYRPISLCNVSYKIVTEVLANRLKLILNEIIDESQSAFIYGRSITDNMILGHESLH

Query:  FLNKKRKGKVGYAALKLDMSKAYDRV
        ++NK +     +  + LD  KA+D++
Subjt:  FLNKKRKGKVGYAALKLDMSKAYDRV

P14381 Transposon TX1 uncharacterized 149 kDa protein2.4e-1928.51Show/hide
Query:  DRNTRWFHQKASMCRRQNWIDGVEDSNGIWQTDKANIHEAFENYFKEIFS----SSHTEQRDMDGVLSHVHQKVTDEMNQMLLNPFSREEVLAAIKSFPP
        DR +R+F+        +  I  +   +G    D   I +   ++++ +FS    S    +   DG+       V++   + L  P + +E+  A++  P 
Subjt:  DRNTRWFHQKASMCRRQNWIDGVEDSNGIWQTDKANIHEAFENYFKEIFS----SSHTEQRDMDGVLSHVHQKVTDEMNQMLLNPFSREEVLAAIKSFPP

Query:  TKAPRPDGFPAIFFQKYWDVVGDTTISNCLNILNSKASVKEWNHINIVLIPKSRQARLVSDYRPISLCNVSYKIVTEVLANRLKLILNEIIDESQSAFIY
         K+P  DG    FFQ +WD +G                        + L+PK    RL+ ++RP+SL +  YKIV + ++ RLK +L E+I   QS  + 
Subjt:  TKAPRPDGFPAIFFQKYWDVVGDTTISNCLNILNSKASVKEWNHINIVLIPKSRQARLVSDYRPISLCNVSYKIVTEVLANRLKLILNEIIDESQSAFIY

Query:  GRSITDNMILGHESLHFLNKKRKGKVGYAALKLDMSKAYDRV
        GR+I DN+ L  + LHF    R+  +  A L LD  KA+DRV
Subjt:  GRSITDNMILGHESLHFLNKKRKGKVGYAALKLDMSKAYDRV

P93295 Uncharacterized mitochondrial protein AtMg003102.3e-1437.78Show/hide
Query:  GLNFRDLVAFNQEMLAKQAWRVLTNPDLMVSKVLCGKYFPNSSVLTASITASSSFFWKGFVWGMDLLRCGIRKNLGNGQSISIFSDPWIL
        GL FRDL  FNQ +LAKQ++R++  P  ++S++L  +YFP+SS++  S+    S+ W+  + G +LL  G+ + +G+G    ++ D WI+
Subjt:  GLNFRDLVAFNQEMLAKQAWRVLTNPDLMVSKVLCGKYFPNSSVLTASITASSSFFWKGFVWGMDLLRCGIRKNLGNGQSISIFSDPWIL

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein6.2e-1530.68Show/hide
Query:  GDRNTRWFHQKASMCRRQNWIDGVEDSNGIWQTDKANIHEAFENYFKEIF-SSSHTEQRDMDGVLSHVHQ-KVTDEMNQMLLNPFSREEVLAAIKSFPPT
        GD NTR+FH+     + +N I  +   + +   +   + E    Y+  +  S S     D    +  +H  +  D +   L    S +E+ AA+ + P  
Subjt:  GDRNTRWFHQKASMCRRQNWIDGVEDSNGIWQTDKANIHEAFENYFKEIF-SSSHTEQRDMDGVLSHVHQ-KVTDEMNQMLLNPFSREEVLAAIKSFPPT

Query:  KAPRPDGFPAIFFQKYWDVVGDTTISNCLNILNSKASVKEWNHINIVLIPKSRQARLVSDYRPISLCNVSYKIVTE
        KAP PD F A FF + W VV D+TI+       +   +K +N   I LIPK      +S +RP+S C V YKI+T+
Subjt:  KAPRPDGFPAIFFQKYWDVVGDTTISNCLNILNSKASVKEWNHINIVLIPKSRQARLVSDYRPISLCNVSYKIVTE

AT3G09510.1 Ribonuclease H-like superfamily protein3.3e-1621.94Show/hide
Query:  KYFPNSSVLTASITASSSFFWKGFVWGMDLLRCGIRKNLGNGQSISIFSDPWILRPSTFKVISPYDPHMNNT--PIAEFITPSLQS--------DVPKLN
        +YF + S+L A +    S+ W   + G+ LL+ G R  +G+GQ+I I  D          ++  + P   NT     E    +L          D  K++
Subjt:  KYFPNSSVLTASITASSSFFWKGFVWGMDLLRCGIRKNLGNGQSISIFSDPWILRPSTFKVISPYDPHMNNT--PIAEFITPSLQS--------DVPKLN

Query:  QFLVELDVDLIKRLPISGS-APNKWIWHYDGRGMYSVKSGYKLAMLKSQETSLSDVERQNAWWRKVWKMQVPSKVKNFVWKYFHNKWIWHYDGRGMYSVK
        QF+ + D   I R+ ++ S  P+K IW+Y+  G Y+V+SGY L                                            + H     + ++ 
Subjt:  QFLVELDVDLIKRLPISGS-APNKWIWHYDGRGMYSVKSGYKLAMLKSQETSLSDVERQNAWWRKVWKMQVPSKVKNFVWKYFHNKWIWHYDGRGMYSVK

Query:  SGYKLAMLKSQETSLSDVERQNAWWRKVWKMQVPSKVKNFVWKYFHNSIPKMVNLGHHHVPVSGNCPICNDEMETTDHAMFQCSRAREIWSLIHPPMMRS
          +    LK+                ++W + +  K+K+F+W+    ++     L    + +  +CP C+ E E+ +HA+F C  A   W L    ++R+
Subjt:  SGYKLAMLKSQETSLSDVERQNAWWRKVWKMQVPSKVKNFVWKYFHNSIPKMVNLGHHHVPVSGNCPICNDEMETTDHAMFQCSRAREIWSLIHPPMMRS

Query:  LVDQMDIKDRWQGLV----DEPMRVLERVSVG--AWSIWNDRNKSVTTNKF
         +   D ++    ++    D  M    ++      W IW  RN +V  NKF
Subjt:  LVDQMDIKDRWQGLV----DEPMRVLERVSVG--AWSIWNDRNKSVTTNKF

AT3G25270.1 Ribonuclease H-like superfamily protein3.3e-0823.58Show/hide
Query:  KVWKMQVPSKVKNFVWKYFHNSIPKMVNLGHHHVPVSGNCPICNDEMETTDHAMFQCSRAREIWSLIHPPMMRSLVDQMDIKDRWQGLVDEPM-----RV
        K+WK++   K+K+F+WK    ++    NL   H+     C  C  E ET+ H  F C  A+++W     P        + ++ + + L+   +     ++
Subjt:  KVWKMQVPSKVKNFVWKYFHNSIPKMVNLGHHHVPVSGNCPICNDEMETTDHAMFQCSRAREIWSLIHPPMMRSLVDQMDIKDRWQGLVDEPM-----RV

Query:  LERVSVGAWSIWNDRNKSVTTNK
                W +W  RN+ V   K
Subjt:  LERVSVGAWSIWNDRNKSVTTNK

AT4G29090.1 Ribonuclease H-like superfamily protein2.6e-2924.37Show/hide
Query:  AIPTYAMRCFKIPKGV------------------------------------RGLNFRDLVAFNQEMLAKQAWRVLTNPDLMVSKVLCGKYFPNSSVLTA
        A+PTY M CF +PK V                                     G+ F+D+ AFN  +L KQ WR+L+ P+ +++KV   +YF  S  L A
Subjt:  AIPTYAMRCFKIPKGV------------------------------------RGLNFRDLVAFNQEMLAKQAWRVLTNPDLMVSKVLCGKYFPNSSVLTA

Query:  SITASSSFFWKGFVWGMDLLRCGIRKNLGNGQSISIFSDPWI-LRPSTFKVISPYDPHMNNTPIAEFITPSLQSDVPKLNQFLVELDVDLIKRLPISGSA
         + +  SF WK      ++LR G R  +GNG+ I I+   W+  +P++  +       M   P  E+                                 
Subjt:  SITASSSFFWKGFVWGMDLLRCGIRKNLGNGQSISIFSDPWI-LRPSTFKVISPYDPHMNNTPIAEFITPSLQSDVPKLNQFLVELDVDLIKRLPISGSA

Query:  PNKWIWHYDGRGMYSVKSGYKLAMLKSQETSLSDVERQNAWWRK-VWKMQVPSKVKNFV------WKYFHNKWIWHYDGRGMYSVKSGY-KLAMLKSQET
                      SV S  K++ L         ++     WRK V +M  P   +  +       +   + + W Y   G Y+VKSGY  L  + ++ +
Subjt:  PNKWIWHYDGRGMYSVKSGYKLAMLKSQETSLSDVERQNAWWRK-VWKMQVPSKVKNFV------WKYFHNKWIWHYDGRGMYSVKSGY-KLAMLKSQET

Query:  SLSDVERQ--NAWWRKVWKMQVPSKVKNFVWKYFHNSIPKMVNLGHHHVPVSGNCPICNDEMETTDHAMFQCSRAREIWSL--IHPPMMRSLVDQMDIKD
        S  +V     N  ++K+WK Q   K+++F+WK   NS+P    L + H+     C  C    ET +H +F+C+ AR  W++  I  P+     D + +  
Subjt:  SLSDVERQ--NAWWRKVWKMQVPSKVKNFVWKYFHNSIPKMVNLGHHHVPVSGNCPICNDEMETTDHAMFQCSRAREIWSL--IHPPMMRSLVDQMDIKD

Query:  RWQGLVDEPMRVLERVSVGA----WSIWNDRNKSV
         W   +       E+ S       W +W +RN+ V
Subjt:  RWQGLVDEPMRVLERVSVGA----WSIWNDRNKSV

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.6e-1537.78Show/hide
Query:  GLNFRDLVAFNQEMLAKQAWRVLTNPDLMVSKVLCGKYFPNSSVLTASITASSSFFWKGFVWGMDLLRCGIRKNLGNGQSISIFSDPWIL
        GL FRDL  FNQ +LAKQ++R++  P  ++S++L  +YFP+SS++  S+    S+ W+  + G +LL  G+ + +G+G    ++ D WI+
Subjt:  GLNFRDLVAFNQEMLAKQAWRVLTNPDLMVSKVLCGKYFPNSSVLTASITASSSFFWKGFVWGMDLLRCGIRKNLGNGQSISIFSDPWIL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGGATAGGAACACTCGTTGGTTCCATCAAAAGGCTTCCATGTGCAGAAGACAAAATTGGATAGATGGTGTTGAAGACTCAAATGGGATTTGGCAAACTGATAAGGC
AAATATCCATGAAGCTTTTGAAAACTATTTTAAGGAAATTTTCTCTTCTTCCCATACGGAACAAAGGGATATGGATGGTGTCCTCAGTCATGTTCATCAGAAAGTCACCG
ATGAGATGAATCAGATGCTTTTAAATCCATTCTCCCGAGAGGAGGTATTAGCAGCTATTAAGAGCTTCCCCCCGACTAAAGCTCCTAGACCCGATGGCTTCCCCGCTATC
TTTTTTCAGAAATATTGGGATGTCGTGGGAGATACGACAATTTCCAATTGTTTGAATATTTTGAACTCAAAGGCATCGGTGAAGGAGTGGAATCACATAAATATTGTGTT
AATACCAAAGAGTCGTCAGGCAAGGTTAGTCTCTGATTATCGCCCAATTAGCTTATGTAACGTATCTTACAAGATTGTTACTGAGGTTTTGGCTAATAGACTTAAACTTA
TTTTGAATGAGATCATTGATGAGAGCCAATCTGCTTTTATATATGGTAGATCAATAACTGATAATATGATTCTGGGCCATGAATCCTTGCACTTTCTTAACAAGAAGCGT
AAAGGTAAGGTGGGATATGCAGCACTTAAATTAGATATGAGTAAAGCATATGATAGGGTTGCGGCCAAGGAGTTCAGGCATTTTCGAAATATACTAAAGGATTACGAAAA
GGCATCTGGCCAATCGGTCAATCTTACAAAATCTATGCTTGGGATGCCTTCAAGTTTTAGTAGAGGTAAAACTTGTGATTTCAAATTCATTCTGGATAAGGTTTGGGTTG
TTCTGCAAGGGTGGAAAAGTCAATTCTTCTCACAGGGTGGAAGGGAGGTTCTGATAAAGAGTATTGTACAGGCTATCCCAACATATGCAATGAGGTGCTTCAAAATTCCA
AAAGGAGTTAGAGGTCTAAATTTTCGGGATCTGGTGGCCTTTAACCAAGAGATGTTGGCTAAGCAGGCATGGAGAGTTTTAACTAACCCGGATCTTATGGTGTCAAAAGT
TTTATGTGGTAAATATTTTCCCAACTCTTCAGTCTTAACTGCGTCTATTACAGCTTCCTCGTCTTTCTTTTGGAAAGGTTTTGTTTGGGGAATGGATCTATTGAGGTGTG
GCATAAGGAAAAACTTAGGGAATGGGCAGTCAATTTCTATATTCAGTGATCCATGGATCCTTCGACCTTCTACTTTTAAGGTCATATCACCTTATGATCCACATATGAAT
AATACGCCTATAGCAGAATTCATTACGCCATCTCTCCAATCGGATGTTCCAAAACTCAACCAATTTTTGGTCGAGTTAGATGTGGATTTGATAAAACGACTACCTATAAG
TGGCTCAGCTCCAAACAAGTGGATATGGCATTATGATGGTAGAGGGATGTACTCTGTTAAAAGTGGCTACAAGTTAGCAATGTTAAAGTCTCAGGAGACATCATTGTCAG
ACGTTGAGAGACAGAATGCCTGGTGGAGGAAGGTATGGAAGATGCAGGTGCCATCCAAAGTTAAAAATTTTGTATGGAAATATTTTCATAACAAGTGGATATGGCATTAT
GATGGTAGAGGGATGTACTCTGTTAAAAGTGGCTACAAGTTAGCAATGTTAAAGTCTCAGGAGACATCATTGTCAGACGTTGAGAGACAGAATGCCTGGTGGAGGAAGGT
ATGGAAGATGCAGGTGCCATCCAAAGTTAAAAATTTTGTATGGAAATATTTTCATAACTCCATCCCAAAAATGGTTAATCTAGGCCATCACCATGTTCCAGTAAGTGGGA
ATTGTCCGATTTGTAATGATGAAATGGAGACAACGGATCATGCCATGTTTCAGTGTTCAAGGGCTCGTGAGATATGGTCTTTGATTCATCCGCCAATGATGCGGTCTCTA
GTGGATCAAATGGATATCAAAGATCGGTGGCAAGGCTTGGTTGATGAACCAATGAGGGTTTTAGAGCGGGTTTCTGTGGGAGCATGGTCTATTTGGAATGATAGAAACAA
ATCGGTCACAACCAACAAATTCCTGATCCGGTGGTTCGAGGTGATTGGATTATCAACTACCTTGAAGCGTTCTGGATGGCTAATCCAAAAAGCGACGTCAATGCTCAAAC
GATAG
mRNA sequenceShow/hide mRNA sequence
ATGGGGGATAGGAACACTCGTTGGTTCCATCAAAAGGCTTCCATGTGCAGAAGACAAAATTGGATAGATGGTGTTGAAGACTCAAATGGGATTTGGCAAACTGATAAGGC
AAATATCCATGAAGCTTTTGAAAACTATTTTAAGGAAATTTTCTCTTCTTCCCATACGGAACAAAGGGATATGGATGGTGTCCTCAGTCATGTTCATCAGAAAGTCACCG
ATGAGATGAATCAGATGCTTTTAAATCCATTCTCCCGAGAGGAGGTATTAGCAGCTATTAAGAGCTTCCCCCCGACTAAAGCTCCTAGACCCGATGGCTTCCCCGCTATC
TTTTTTCAGAAATATTGGGATGTCGTGGGAGATACGACAATTTCCAATTGTTTGAATATTTTGAACTCAAAGGCATCGGTGAAGGAGTGGAATCACATAAATATTGTGTT
AATACCAAAGAGTCGTCAGGCAAGGTTAGTCTCTGATTATCGCCCAATTAGCTTATGTAACGTATCTTACAAGATTGTTACTGAGGTTTTGGCTAATAGACTTAAACTTA
TTTTGAATGAGATCATTGATGAGAGCCAATCTGCTTTTATATATGGTAGATCAATAACTGATAATATGATTCTGGGCCATGAATCCTTGCACTTTCTTAACAAGAAGCGT
AAAGGTAAGGTGGGATATGCAGCACTTAAATTAGATATGAGTAAAGCATATGATAGGGTTGCGGCCAAGGAGTTCAGGCATTTTCGAAATATACTAAAGGATTACGAAAA
GGCATCTGGCCAATCGGTCAATCTTACAAAATCTATGCTTGGGATGCCTTCAAGTTTTAGTAGAGGTAAAACTTGTGATTTCAAATTCATTCTGGATAAGGTTTGGGTTG
TTCTGCAAGGGTGGAAAAGTCAATTCTTCTCACAGGGTGGAAGGGAGGTTCTGATAAAGAGTATTGTACAGGCTATCCCAACATATGCAATGAGGTGCTTCAAAATTCCA
AAAGGAGTTAGAGGTCTAAATTTTCGGGATCTGGTGGCCTTTAACCAAGAGATGTTGGCTAAGCAGGCATGGAGAGTTTTAACTAACCCGGATCTTATGGTGTCAAAAGT
TTTATGTGGTAAATATTTTCCCAACTCTTCAGTCTTAACTGCGTCTATTACAGCTTCCTCGTCTTTCTTTTGGAAAGGTTTTGTTTGGGGAATGGATCTATTGAGGTGTG
GCATAAGGAAAAACTTAGGGAATGGGCAGTCAATTTCTATATTCAGTGATCCATGGATCCTTCGACCTTCTACTTTTAAGGTCATATCACCTTATGATCCACATATGAAT
AATACGCCTATAGCAGAATTCATTACGCCATCTCTCCAATCGGATGTTCCAAAACTCAACCAATTTTTGGTCGAGTTAGATGTGGATTTGATAAAACGACTACCTATAAG
TGGCTCAGCTCCAAACAAGTGGATATGGCATTATGATGGTAGAGGGATGTACTCTGTTAAAAGTGGCTACAAGTTAGCAATGTTAAAGTCTCAGGAGACATCATTGTCAG
ACGTTGAGAGACAGAATGCCTGGTGGAGGAAGGTATGGAAGATGCAGGTGCCATCCAAAGTTAAAAATTTTGTATGGAAATATTTTCATAACAAGTGGATATGGCATTAT
GATGGTAGAGGGATGTACTCTGTTAAAAGTGGCTACAAGTTAGCAATGTTAAAGTCTCAGGAGACATCATTGTCAGACGTTGAGAGACAGAATGCCTGGTGGAGGAAGGT
ATGGAAGATGCAGGTGCCATCCAAAGTTAAAAATTTTGTATGGAAATATTTTCATAACTCCATCCCAAAAATGGTTAATCTAGGCCATCACCATGTTCCAGTAAGTGGGA
ATTGTCCGATTTGTAATGATGAAATGGAGACAACGGATCATGCCATGTTTCAGTGTTCAAGGGCTCGTGAGATATGGTCTTTGATTCATCCGCCAATGATGCGGTCTCTA
GTGGATCAAATGGATATCAAAGATCGGTGGCAAGGCTTGGTTGATGAACCAATGAGGGTTTTAGAGCGGGTTTCTGTGGGAGCATGGTCTATTTGGAATGATAGAAACAA
ATCGGTCACAACCAACAAATTCCTGATCCGGTGGTTCGAGGTGATTGGATTATCAACTACCTTGAAGCGTTCTGGATGGCTAATCCAAAAAGCGACGTCAATGCTCAAAC
GATAG
Protein sequenceShow/hide protein sequence
MGDRNTRWFHQKASMCRRQNWIDGVEDSNGIWQTDKANIHEAFENYFKEIFSSSHTEQRDMDGVLSHVHQKVTDEMNQMLLNPFSREEVLAAIKSFPPTKAPRPDGFPAI
FFQKYWDVVGDTTISNCLNILNSKASVKEWNHINIVLIPKSRQARLVSDYRPISLCNVSYKIVTEVLANRLKLILNEIIDESQSAFIYGRSITDNMILGHESLHFLNKKR
KGKVGYAALKLDMSKAYDRVAAKEFRHFRNILKDYEKASGQSVNLTKSMLGMPSSFSRGKTCDFKFILDKVWVVLQGWKSQFFSQGGREVLIKSIVQAIPTYAMRCFKIP
KGVRGLNFRDLVAFNQEMLAKQAWRVLTNPDLMVSKVLCGKYFPNSSVLTASITASSSFFWKGFVWGMDLLRCGIRKNLGNGQSISIFSDPWILRPSTFKVISPYDPHMN
NTPIAEFITPSLQSDVPKLNQFLVELDVDLIKRLPISGSAPNKWIWHYDGRGMYSVKSGYKLAMLKSQETSLSDVERQNAWWRKVWKMQVPSKVKNFVWKYFHNKWIWHY
DGRGMYSVKSGYKLAMLKSQETSLSDVERQNAWWRKVWKMQVPSKVKNFVWKYFHNSIPKMVNLGHHHVPVSGNCPICNDEMETTDHAMFQCSRAREIWSLIHPPMMRSL
VDQMDIKDRWQGLVDEPMRVLERVSVGAWSIWNDRNKSVTTNKFLIRWFEVIGLSTTLKRSGWLIQKATSMLKR