CuGenDBv2

Lag0022494 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0022494
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase
Genome location: chr7:30679859..30684092
RNA-Seq Expression: Lag0022494
Synteny: Lag0022494
Gene Ontology terms: NA
InterPro domains:
IPR000477 - Reverse transcriptase domain
IPR005162 - Retrotransposon gag domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily
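
The genome location string in this record can be split into a chromosome name and two coordinates. The following is a minimal sketch of one way to do that; the helper names `parse_location` and `span_length` are hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which the page itself does not state.

```python
import re


def parse_location(text):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr7:30679859..30684092")
print(chrom, start, end, span_length(start, end))
```

Under that assumption the gene spans 4234 bp on chr7.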


Homology
GenBank top hits (e value, % identity, alignment)
KAA3473721.1 retroelement pol polyprotein-like [Gossypium australe] (e value: 2.4e-91; % identity: 34.31)
Query:  AWLDSHPPNSITSWNDLEEKFLEKFFPSNENAKYRAEIIAFTQSYNEPLDAAWERFQRLVRKCPHHGLPACIILEHFYSGLDQASKALVNTSANRSFLKK
        AWL+S PPNSI++W +L E+FL K+F  ++NAK R EI AF    +E L  AWERF+ L++KCPHHG+P CI LE FY+GL   ++ +V+ SAN + L K
Subjt:  AWLDSHPPNSITSWNDLEEKFLEKFFPSNENAKYRAEIIAFTQSYNEPLDAAWERFQRLVRKCPHHGLPACIILEHFYSGLDQASKALVNTSANRSFLKK

Query:  SANEALAILDTIATNNRHWGETEPTIILKNLAKAVETEANTTMQAQIKVIHNIMMGMTM--SNQVNIAPTNVISSPCCEICGEEHTTDNCPLNPASVFYV
        S NEA  I++ IA+NN  W  T      + +A   E +A T++ +Q+  I ++   +T   SN     P N   +     CGE H  + CP NP SV+Y+
Subjt:  SANEALAILDTIATNNRHWGETEPTIILKNLAKAVETEANTTMQAQIKVIHNIMMGMTM--SNQVNIAPTNVISSPCCEICGEEHTTDNCPLNPASVFYV

Query:  -------GQKGTRTDGIHTQLRQHPNFSW---------------------------------------------------------------EDKPAQIA
               G++G +++  ++  R H +FSW                                                               E++  Q+A
Subjt:  -------GQKGTRTDGIHTQLRQHPNFSW---------------------------------------------------------------EDKPAQIA

Query:  QEIKNRPQGTLPNKTENPHQQGKEKCKAVTLRS--------------------------------------------------------GLEYDGP----
         E++NR QG LP+ TENP   GKE CKA+TLRS                                                         LE + P    
Subjt:  QEIKNRPQGTLPNKTENPHQQGKEKCKAVTLRS--------------------------------------------------------GLEYDGP----

Query:  ------------------------------KYLVNQEMPTCVKFLKDILSKKRRLGEYETVALT---------ECSSALVKNEIPPNLRTQELG--IGEA
                                       + V++E+P  +   +  L+  R + + +   LT         + S      E PP L  + +G  I + 
Subjt:  ------------------------------KYLVNQEMPTCVKFLKDILSKKRRLGEYETVALT---------ECSSALVKNEIPPNLRTQELG--IGEA

Query:  RPTTVTL----------------------------------QLADRSLTQPI------GKIEDVLLKGGITVVENAKNELIPTRTVTRWRDCIDYRRLNK
        R  + ++                                  +  D  +  PI        ++ V  KGGITVVEN  NELIPTRTVT WR CIDY +LNK
Subjt:  RPTTVTL----------------------------------QLADRSLTQPI------GKIEDVLLKGGITVVENAKNELIPTRTVTRWRDCIDYRRLNK

Query:  VTRKDHFPLPFIDQLLDRHAGKDYYYFLDGYSGYNQITIAREDQEKTTFTCPYGTFSFRTMPFKLCNAPATFQRCMMAIFSDMVE
         TRKDHFPLPF+DQ+LDR AG+DYYYFLDGYSGYN IT+A +DQ K TFTCPYGTF+FR MPF LCNAPATFQRCMM+I +DMVE
Subjt:  VTRKDHFPLPFIDQLLDRHAGKDYYYFLDGYSGYNQITIAREDQEKTTFTCPYGTFSFRTMPFKLCNAPATFQRCMMAIFSDMVE

PIN17626.1 DNA-directed DNA polymerase [Handroanthus impetiginosus] (e value: 4.2e-88; % identity: 35.37)
Query:  IAFTQSYNEPLDAAWERFQRLVRKCPHHGLPACIILEHFYSGLDQASKALVNTSANRSFLKKSANEALAILDTIATNNRHWGETEPTIILKNLAKAVETE
        + F Q  +E +  AW RF++++R CP H +P  I +  FY GL    K  ++     SFL  +  E   +L+ +A N   + +          A+ +E +
Subjt:  IAFTQSYNEPLDAAWERFQRLVRKCPHHGLPACIILEHFYSGLDQASKALVNTSANRSFLKKSANEALAILDTIATNNRHWGETEPTIILKNLAKAVETE

Query:  ANTTMQAQIKVIHNIMMGMTMSNQVNIAPTNVISSPCCEICGEEHTTDNCPLNPASVFYVG-----QKGTRTDGIHTQLRQHPNFSW-------------
          T + A+I  +   M    + NQV   P        CE CGE H ++ CP +   + +V      Q    ++  +   RQHPNFSW             
Subjt:  ANTTMQAQIKVIHNIMMGMTMSNQVNIAPTNVISSPCCEICGEEHTTDNCPLNPASVFYVG-----QKGTRTDGIHTQLRQHPNFSW-------------

Query:  ------------EDKPA------------------------QIAQEIKNRPQGTLPNKTE-NPHQQGKEKCKAVTLRSGLEYDGPKYLVNQEMPTCVKFL
                    E KP+                        Q+A  I +R QG+LP+ TE NP Q+ K +C+AVTL +G E         +     VKF+
Subjt:  ------------EDKPA------------------------QIAQEIKNRPQGTLPNKTE-NPHQQGKEKCKAVTLRSGLEYDGPKYLVNQEMPTCVKFL

Query:  KDILSKKRRLGEYETVALTECSSALVKNEIPPNLRT----------------------------------QELGIGEARPTTVTLQLADRSLTQPIGKIE
        KDI+SKKRRLG+YE VALTE  SA+++N++PP L+                                     LG+GEA+ T++TLQLADRSLT P G IE
Subjt:  KDILSKKRRLGEYETVALTECSSALVKNEIPPNLRT----------------------------------QELGIGEARPTTVTLQLADRSLTQPIGKIE

Query:  DVLLK-----------------------------------------------------------------------------------------------
        D+L+K                                                                                               
Subjt:  DVLLK-----------------------------------------------------------------------------------------------

Query:  ---------------------GGITVVENAKNELIPTRTVTRWRDCIDYRRLNKVTRKDHFPLPFIDQLLDRHAGKDYYYFLDGYSGYNQITIAREDQEK
                             GGITVV N  NELIPTRTVT WR C+DYR+LNK TRKDHFPLPFIDQ+LDR AGK++YYFLDGYSGYNQI IA EDQEK
Subjt:  ---------------------GGITVVENAKNELIPTRTVTRWRDCIDYRRLNKVTRKDHFPLPFIDQLLDRHAGKDYYYFLDGYSGYNQITIAREDQEK

Query:  TTFTCPYGTFSFRTMPFKLCNAPATFQRCMMAIFSDMVE
        TTFTCPYGTF+FR MPF LCNAPATFQRCMMAIF+DMVE
Subjt:  TTFTCPYGTFSFRTMPFKLCNAPATFQRCMMAIFSDMVE

XP_016649625.1 PREDICTED: uncharacterized protein LOC103330487 [Prunus mume] (e value: 2.0e-90; % identity: 36.87)
Query:  WLDSHPPNSITSWNDLEEKFLEKFFPSNENAKYRAEIIAFTQSYNEPLDAAWERFQRLVRKCPHHGLPACIILEHFYSGLDQASKALVNTSANRSFLKKS
        WL S P +SI +W+DL +KFL KFFP  + AK+R +I++F Q   EPL  AWERF+ L+RKCPHH LP  I ++ FY+GL Q S+ LV+ +A  + + K+
Subjt:  WLDSHPPNSITSWNDLEEKFLEKFFPSNENAKYRAEIIAFTQSYNEPLDAAWERFQRLVRKCPHHGLPACIILEHFYSGLDQASKALVNTSANRSFLKKS

Query:  ANEALAILDTIATNNRHWGETEPTIILKNLAKAVETEANTTMQAQIKVIHNIMMGMTMSNQVNIAPTNVISSPCCEICGEEHTTDNCPL-NP-ASVFYVG
        A EA  +L+T+A+NN  W      +     A  +E +A   + AQI  +   +      + +++   N  ++  CE+C   H +  C   NP AS   V 
Subjt:  ANEALAILDTIATNNRHWGETEPTIILKNLAKAVETEANTTMQAQIKVIHNIMMGMTMSNQVNIAPTNVISSPCCEICGEEHTTDNCPL-NP-ASVFYVG

Query:  QKG--------TRTDGIHTQLRQHPNFSW------------------------------------------------------EDKPAQIAQEIKNRPQG
        Q G          ++  +   R HPNFSW                                                      E +  Q+A  I  R QG
Subjt:  QKG--------TRTDGIHTQLRQHPNFSW------------------------------------------------------EDKPAQIAQEIKNRPQG

Query:  TLPNKTE-NPHQQGKEKCKAVTLRSGLE----------------------------------------------------------------------YD
          P++ E NP  Q  E+ KA+TLR G +                                                                       D
Subjt:  TLPNKTE-NPHQQGKEKCKAVTLRSGLE----------------------------------------------------------------------YD

Query:  GP---------KYLVN-------QEMPTCVKFLKDILSKKRRLGEYETVALT-ECS---SALVKNEIPPNLRTQELGIGEARPTTVTLQLADRSLTQPI-
        G          K  +N       ++MP+  KF+KDILSKKR+ GE+E + LT ECS   S   +  + PN++  E+   E       L+L D  +  PI 
Subjt:  GP---------KYLVN-------QEMPTCVKFLKDILSKKRRLGEYETVALT-ECS---SALVKNEIPPNLRTQELGIGEARPTTVTLQLADRSLTQPI-

Query:  -----GKIEDVLLKGGITVVENAKNELIPTRTVTRWRDCIDYRRLNKVTRKDHFPLPFIDQLLDRHAGKDYYYFLDGYSGYNQITIAREDQEKTTFTCPY
                + V  KGG+TVV+N  NEL+PTRTVT WR CIDYR+LN  TRKDHFPLPFIDQ+L+R AG  YY FLDGYSGYNQI IA EDQEKTTFTCP+
Subjt:  -----GKIEDVLLKGGITVVENAKNELIPTRTVTRWRDCIDYRRLNKVTRKDHFPLPFIDQLLDRHAGKDYYYFLDGYSGYNQITIAREDQEKTTFTCPY

Query:  GTFSFRTMPFKLCNAPATFQRCMMAIFSDMVE
        GTF++R MPF LCNAPATFQRCM++IFSDMVE
Subjt:  GTFSFRTMPFKLCNAPATFQRCMMAIFSDMVE

XP_020411305.1 uncharacterized protein LOC109946823 [Prunus persica] (e value: 9.1e-91; % identity: 37.17)
Query:  WLDSHPPNSITSWNDLEEKFLEKFFPSNENAKYRAEIIAFTQSYNEPLDAAWERFQRLVRKCPHHGLPACIILEHFYSGLDQASKALVNTSANRSFLKKS
        WL S P +SI +W+DL +KFL KFFP  + AK+R +I++F Q   EPL  AWERF+ L+RKCPHH LP  I ++ FY+GL Q S+ LV+ +A  + + K 
Subjt:  WLDSHPPNSITSWNDLEEKFLEKFFPSNENAKYRAEIIAFTQSYNEPLDAAWERFQRLVRKCPHHGLPACIILEHFYSGLDQASKALVNTSANRSFLKKS

Query:  ANEALAILDTIATNNRHWGETEPTIILKNLAKAVETEANTTMQAQIKVIHNIMMGMTMSNQVNIAPTNVISSPC---CEICGEEHTTDNCPL-NP-ASVF
        A EA  +L+T+A+NN  W      +     A  +E +A   + AQI           ++ +V+    N I++     CE+C   H +  C   NP AS  
Subjt:  ANEALAILDTIATNNRHWGETEPTIILKNLAKAVETEANTTMQAQIKVIHNIMMGMTMSNQVNIAPTNVISSPC---CEICGEEHTTDNCPL-NP-ASVF

Query:  YVGQKG--------TRTDGIHTQLRQHPNFSW------------------------------------------------------EDKPAQIAQEIKNR
         V Q G          ++  +   R HPNFSW                                                      E +  Q+A  I  R
Subjt:  YVGQKG--------TRTDGIHTQLRQHPNFSW------------------------------------------------------EDKPAQIAQEIKNR

Query:  PQGTLPNKTE-NPHQQGKEKCKAVTLRSGLE---------------------------------------------------------------------
         QG  P++ E NP  Q  E+ KA+TLR G +                                                                     
Subjt:  PQGTLPNKTE-NPHQQGKEKCKAVTLRSGLE---------------------------------------------------------------------

Query:  -YDGP---------KYLVN-------QEMPTCVKFLKDILSKKRRLGEYETVALT-ECS---SALVKNEIPPNLRTQELGIGEARPTTVTLQLADRSLTQ
          DG          K  +N       ++MP+  KF+KDILSKKR+ GE+E + LT ECS   S   +  + PN++  E+   E       L+L D  +  
Subjt:  -YDGP---------KYLVN-------QEMPTCVKFLKDILSKKRRLGEYETVALT-ECS---SALVKNEIPPNLRTQELGIGEARPTTVTLQLADRSLTQ

Query:  PI------GKIEDVLLKGGITVVENAKNELIPTRTVTRWRDCIDYRRLNKVTRKDHFPLPFIDQLLDRHAGKDYYYFLDGYSGYNQITIAREDQEKTTFT
        PI         + V  KGG+TVV+N  NEL+PTRTVT WR CIDYR+LN  TRKDHFPLPFIDQ+L+R AG  YY FLDGYSGYNQI IA EDQEKTTFT
Subjt:  PI------GKIEDVLLKGGITVVENAKNELIPTRTVTRWRDCIDYRRLNKVTRKDHFPLPFIDQLLDRHAGKDYYYFLDGYSGYNQITIAREDQEKTTFT

Query:  CPYGTFSFRTMPFKLCNAPATFQRCMMAIFSDMVE
        CP+GTF++R MPF LCNAPATFQRCMM+IFSDMVE
Subjt:  CPYGTFSFRTMPFKLCNAPATFQRCMMAIFSDMVE

XP_020417839.1 uncharacterized protein LOC109948603 [Prunus persica] (e value: 2.0e-90; % identity: 37.17)
Query:  WLDSHPPNSITSWNDLEEKFLEKFFPSNENAKYRAEIIAFTQSYNEPLDAAWERFQRLVRKCPHHGLPACIILEHFYSGLDQASKALVNTSANRSFLKKS
        WL S P +SI +W+DL +KFL KFFP  + AK+R +I++F Q   EPL  AWERF+ L+RKCPHH LP  I ++ FY+GL Q S+ LV+ +A  + + K+
Subjt:  WLDSHPPNSITSWNDLEEKFLEKFFPSNENAKYRAEIIAFTQSYNEPLDAAWERFQRLVRKCPHHGLPACIILEHFYSGLDQASKALVNTSANRSFLKKS

Query:  ANEALAILDTIATNNRHWGETEPTIILKNLAKAVETEANTTMQAQIKVIHNIMMGMTMSNQVNIAPTNVISSPC---CEICGEEHTTDNCPL-NP-ASVF
        A EA  +L+T+A+NN  W      +     A  +E  A   + AQI           ++ +V+    N I++     CE+C   H +  C   NP AS  
Subjt:  ANEALAILDTIATNNRHWGETEPTIILKNLAKAVETEANTTMQAQIKVIHNIMMGMTMSNQVNIAPTNVISSPC---CEICGEEHTTDNCPL-NP-ASVF

Query:  YVGQKG--------TRTDGIHTQLRQHPNFSW------------------------------------------------------EDKPAQIAQEIKNR
         V Q G          ++  +   R HPNFSW                                                      E +  Q+A  I  R
Subjt:  YVGQKG--------TRTDGIHTQLRQHPNFSW------------------------------------------------------EDKPAQIAQEIKNR

Query:  PQGTLPNKTE-NPHQQGKEKCKAVTLRSGLE---------------------------------------------------------------------
         QG  P++ E NP  Q  E+ KA+TLR G +                                                                     
Subjt:  PQGTLPNKTE-NPHQQGKEKCKAVTLRSGLE---------------------------------------------------------------------

Query:  -YDGP---------KYLVN-------QEMPTCVKFLKDILSKKRRLGEYETVALT-ECS---SALVKNEIPPNLRTQELGIGEARPTTVTLQLADRSLTQ
          DG          K  +N       ++MP+  KF+KDILSKKR+ GE+E + LT ECS   S   +  + PN++  E+   E       L+L D  +  
Subjt:  -YDGP---------KYLVN-------QEMPTCVKFLKDILSKKRRLGEYETVALT-ECS---SALVKNEIPPNLRTQELGIGEARPTTVTLQLADRSLTQ

Query:  PI------GKIEDVLLKGGITVVENAKNELIPTRTVTRWRDCIDYRRLNKVTRKDHFPLPFIDQLLDRHAGKDYYYFLDGYSGYNQITIAREDQEKTTFT
        PI         + V  KGG+TVV+N  NEL+PTRTVT WR CIDYR+LN  TRKDHFPLPFIDQ+L+R AG  YY FLDGYSGYNQI IA EDQEKTTFT
Subjt:  PI------GKIEDVLLKGGITVVENAKNELIPTRTVTRWRDCIDYRRLNKVTRKDHFPLPFIDQLLDRHAGKDYYYFLDGYSGYNQITIAREDQEKTTFT

Query:  CPYGTFSFRTMPFKLCNAPATFQRCMMAIFSDMVE
        CP+GTF++R MPF LCNAPATFQRCMM+IFSDMVE
Subjt:  CPYGTFSFRTMPFKLCNAPATFQRCMMAIFSDMVE

TrEMBL top hits (e value, % identity, alignment)
A0A2G9HH15 Reverse transcriptase (e value: 1.4e-84; % identity: 29.57)
Query:  GDVEAWLDSHPPNSITSWNDLEEKFLEKFFPSNENAKYRAEIIAFTQSYNEPLDAAWERFQRLVRKCPHHGLPACIILEHFYSGLDQASKALVNTSANRS
        GD   W +S P +SIT+W  LEE+F+ KFF   + A  RAEI+ F Q  +E +  AW RF++++R CP+H +P  I +  FY GL    K  ++     S
Subjt:  GDVEAWLDSHPPNSITSWNDLEEKFLEKFFPSNENAKYRAEIIAFTQSYNEPLDAAWERFQRLVRKCPHHGLPACIILEHFYSGLDQASKALVNTSANRS

Query:  FLKKSANEALAILDTIATNNRHWGETEPTIILKNLAKAVETEANTTMQAQIKVIHNIMMGMTMSNQVNIAPTNVISSPCCEICGEEHTTDNCPLNPASVF
        FL  +  E   +L+ +  N  H+ +          A  +E +  T + A+I  +   M    + NQV   P        CE CGE H +D CP +  S+ 
Subjt:  FLKKSANEALAILDTIATNNRHWGETEPTIILKNLAKAVETEANTTMQAQIKVIHNIMMGMTMSNQVNIAPTNVISSPCCEICGEEHTTDNCPLNPASVF

Query:  YVG-----QKGTRTDGIHTQLRQHPNFSW-------------------------EDKPA------------------------QIAQEIKNRPQGTLPNK
        +V      Q    ++  +   RQHPNFSW                         E KP+                        Q+A  I +RP+ +LP+ 
Subjt:  YVG-----QKGTRTDGIHTQLRQHPNFSW-------------------------EDKPA------------------------QIAQEIKNRPQGTLPNK

Query:  TE-NPHQQGKEKCKAVTLRSGLEY---------DGPKYLVNQE----------------MPTCVKFLKDILSKKRRLGEYETVALTECSSALVKNEIPPN
        TE NP Q  K +C+AVTLR+G E             K ++++E                MP+ VKF+KDILSKKRRLG+YETVALTE  SA+++N++PP 
Subjt:  TE-NPHQQGKEKCKAVTLRSGLEY---------DGPKYLVNQE----------------MPTCVKFLKDILSKKRRLGEYETVALTECSSALVKNEIPPN

Query:  LRT----------------------------------QELGIGEARPTTVTLQLADRSLTQPIGKIEDVLL-----------------------------
        L+                                   + LG+GEA+PT++TLQLADRSLT P G IED+L+                             
Subjt:  LRT----------------------------------QELGIGEARPTTVTLQLADRSLTQPIGKIEDVLL-----------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------KGGITVVENAKNELIPTRTVTRWRDCIDYRRLNKVTRKDHFPLPFIDQLLDRHAGKDYYY
                                                KGGITVV N  NELIPTRTVT WR C+DYR+LNK TRKDHFPLPFIDQ+LDR AGK++Y 
Subjt:  ----------------------------------------KGGITVVENAKNELIPTRTVTRWRDCIDYRRLNKVTRKDHFPLPFIDQLLDRHAGKDYYY

Query:  FLDGYSGYNQITIAREDQEKTTFTCPYGTFSFRTMPFKLCNAPATFQRCMMAIFSDMVE
        FLDGYSGYNQI I  EDQEKTTFTCPYGTF+FR MPF LCNAPATFQRCMMAIF+DMVE
Subjt:  FLDGYSGYNQITIAREDQEKTTFTCPYGTFSFRTMPFKLCNAPATFQRCMMAIFSDMVE

A0A2G9HJB6 Reverse transcriptase (e value: 2.0e-88; % identity: 35.37)
Query:  IAFTQSYNEPLDAAWERFQRLVRKCPHHGLPACIILEHFYSGLDQASKALVNTSANRSFLKKSANEALAILDTIATNNRHWGETEPTIILKNLAKAVETE
        + F Q  +E +  AW RF++++R CP H +P  I +  FY GL    K  ++     SFL  +  E   +L+ +A N   + +          A+ +E +
Subjt:  IAFTQSYNEPLDAAWERFQRLVRKCPHHGLPACIILEHFYSGLDQASKALVNTSANRSFLKKSANEALAILDTIATNNRHWGETEPTIILKNLAKAVETE

Query:  ANTTMQAQIKVIHNIMMGMTMSNQVNIAPTNVISSPCCEICGEEHTTDNCPLNPASVFYVG-----QKGTRTDGIHTQLRQHPNFSW-------------
          T + A+I  +   M    + NQV   P        CE CGE H ++ CP +   + +V      Q    ++  +   RQHPNFSW             
Subjt:  ANTTMQAQIKVIHNIMMGMTMSNQVNIAPTNVISSPCCEICGEEHTTDNCPLNPASVFYVG-----QKGTRTDGIHTQLRQHPNFSW-------------

Query:  ------------EDKPA------------------------QIAQEIKNRPQGTLPNKTE-NPHQQGKEKCKAVTLRSGLEYDGPKYLVNQEMPTCVKFL
                    E KP+                        Q+A  I +R QG+LP+ TE NP Q+ K +C+AVTL +G E         +     VKF+
Subjt:  ------------EDKPA------------------------QIAQEIKNRPQGTLPNKTE-NPHQQGKEKCKAVTLRSGLEYDGPKYLVNQEMPTCVKFL

Query:  KDILSKKRRLGEYETVALTECSSALVKNEIPPNLRT----------------------------------QELGIGEARPTTVTLQLADRSLTQPIGKIE
        KDI+SKKRRLG+YE VALTE  SA+++N++PP L+                                     LG+GEA+ T++TLQLADRSLT P G IE
Subjt:  KDILSKKRRLGEYETVALTECSSALVKNEIPPNLRT----------------------------------QELGIGEARPTTVTLQLADRSLTQPIGKIE

Query:  DVLLK-----------------------------------------------------------------------------------------------
        D+L+K                                                                                               
Subjt:  DVLLK-----------------------------------------------------------------------------------------------

Query:  ---------------------GGITVVENAKNELIPTRTVTRWRDCIDYRRLNKVTRKDHFPLPFIDQLLDRHAGKDYYYFLDGYSGYNQITIAREDQEK
                             GGITVV N  NELIPTRTVT WR C+DYR+LNK TRKDHFPLPFIDQ+LDR AGK++YYFLDGYSGYNQI IA EDQEK
Subjt:  ---------------------GGITVVENAKNELIPTRTVTRWRDCIDYRRLNKVTRKDHFPLPFIDQLLDRHAGKDYYYFLDGYSGYNQITIAREDQEK

Query:  TTFTCPYGTFSFRTMPFKLCNAPATFQRCMMAIFSDMVE
        TTFTCPYGTF+FR MPF LCNAPATFQRCMMAIF+DMVE
Subjt:  TTFTCPYGTFSFRTMPFKLCNAPATFQRCMMAIFSDMVE

A0A2G9HWF8 Reverse transcriptase (e value: 1.1e-78; % identity: 28.59)
Query:  GDVEAWLDSHPPNSITSWNDLEEKFLEKFFPSNENAKYRAEIIAFTQSYNEPLDAAWERFQRLVRKCPHHGLPACIILEHFYSGLDQASKALVNTSANRS
        GD   W +S P +SIT+W  L+E+F+ KFF   + A  RAEI+ F Q  +E +  AW RF++++R CP+H +P  I +  FY GL +  K  ++     S
Subjt:  GDVEAWLDSHPPNSITSWNDLEEKFLEKFFPSNENAKYRAEIIAFTQSYNEPLDAAWERFQRLVRKCPHHGLPACIILEHFYSGLDQASKALVNTSANRS

Query:  FLKKSANEALAILDTIATNNRHWGETEPTIILKNLAKAVETEANTTMQAQIKVIHNIMMGMTMSNQVNIAPTNVISSPCCEICGEEHTTDNCPLNPASVF
        FL  +  E   +L+ +  N  H+ +          A  +E +  T + A+I  +   M                        CGE H +D CP +  S+ 
Subjt:  FLKKSANEALAILDTIATNNRHWGETEPTIILKNLAKAVETEANTTMQAQIKVIHNIMMGMTMSNQVNIAPTNVISSPCCEICGEEHTTDNCPLNPASVF

Query:  YVG-----QKGTRTDGIHTQLRQHPNFSWEDKPAQIAQEIKNRPQGTLPNKTENPHQQGKEKCKAVTLRSG-----------------------------
        +V      Q    ++  +   RQHPNFSW +   Q         QG   N   NP Q GK +C+AVTLR+G                             
Subjt:  YVG-----QKGTRTDGIHTQLRQHPNFSWEDKPAQIAQEIKNRPQGTLPNKTENPHQQGKEKCKAVTLRSG-----------------------------

Query:  -LEYDGP----------------------------KYLVN-------QEMPTCVKFLKDILSKKRRLGEYETVALTECSSALVKNEIPPNLRT-------
         LE   P                            K  +N       ++MP+ VKF+KDILSKKRRLG+YETVALTE  SA+++N++PP L+        
Subjt:  -LEYDGP----------------------------KYLVN-------QEMPTCVKFLKDILSKKRRLGEYETVALTECSSALVKNEIPPNLRT-------

Query:  --------------QELGIGEARPTTVTLQLADRSLTQPIGKIEDVLL----------------------------------------------------
                      + LG+ EA+PT++TLQLADRSLT P G IED+L+                                                    
Subjt:  --------------QELGIGEARPTTVTLQLADRSLTQPIGKIEDVLL----------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------KGGITVVENAKNELIPTRTVTRWRDCIDYRRLNKVTRKDHFPLPFIDQLLDRHAGKDYYYFLDGYSGYNQITIAREDQEKTTF
                         KGGITVV N  NE IPT+TVT WR C+DYR+LNK TRKDHFPLPFIDQ+LDR AGK++Y FLDGYSGYNQI IA EDQEKTTF
Subjt:  -----------------KGGITVVENAKNELIPTRTVTRWRDCIDYRRLNKVTRKDHFPLPFIDQLLDRHAGKDYYYFLDGYSGYNQITIAREDQEKTTF

Query:  TCPYGTFSFRTMPFKLCNAPATFQRCMMAIFSDMVE
        TCPYGTF+FR +PF+LCNAPATFQRCMMAIF+DMVE
Subjt:  TCPYGTFSFRTMPFKLCNAPATFQRCMMAIFSDMVE

A0A5B6VWJ0 Retroelement pol polyprotein-like (e value: 1.2e-91; % identity: 34.31)
Query:  AWLDSHPPNSITSWNDLEEKFLEKFFPSNENAKYRAEIIAFTQSYNEPLDAAWERFQRLVRKCPHHGLPACIILEHFYSGLDQASKALVNTSANRSFLKK
        AWL+S PPNSI++W +L E+FL K+F  ++NAK R EI AF    +E L  AWERF+ L++KCPHHG+P CI LE FY+GL   ++ +V+ SAN + L K
Subjt:  AWLDSHPPNSITSWNDLEEKFLEKFFPSNENAKYRAEIIAFTQSYNEPLDAAWERFQRLVRKCPHHGLPACIILEHFYSGLDQASKALVNTSANRSFLKK

Query:  SANEALAILDTIATNNRHWGETEPTIILKNLAKAVETEANTTMQAQIKVIHNIMMGMTM--SNQVNIAPTNVISSPCCEICGEEHTTDNCPLNPASVFYV
        S NEA  I++ IA+NN  W  T      + +A   E +A T++ +Q+  I ++   +T   SN     P N   +     CGE H  + CP NP SV+Y+
Subjt:  SANEALAILDTIATNNRHWGETEPTIILKNLAKAVETEANTTMQAQIKVIHNIMMGMTM--SNQVNIAPTNVISSPCCEICGEEHTTDNCPLNPASVFYV

Query:  -------GQKGTRTDGIHTQLRQHPNFSW---------------------------------------------------------------EDKPAQIA
               G++G +++  ++  R H +FSW                                                               E++  Q+A
Subjt:  -------GQKGTRTDGIHTQLRQHPNFSW---------------------------------------------------------------EDKPAQIA

Query:  QEIKNRPQGTLPNKTENPHQQGKEKCKAVTLRS--------------------------------------------------------GLEYDGP----
         E++NR QG LP+ TENP   GKE CKA+TLRS                                                         LE + P    
Subjt:  QEIKNRPQGTLPNKTENPHQQGKEKCKAVTLRS--------------------------------------------------------GLEYDGP----

Query:  ------------------------------KYLVNQEMPTCVKFLKDILSKKRRLGEYETVALT---------ECSSALVKNEIPPNLRTQELG--IGEA
                                       + V++E+P  +   +  L+  R + + +   LT         + S      E PP L  + +G  I + 
Subjt:  ------------------------------KYLVNQEMPTCVKFLKDILSKKRRLGEYETVALT---------ECSSALVKNEIPPNLRTQELG--IGEA

Query:  RPTTVTL----------------------------------QLADRSLTQPI------GKIEDVLLKGGITVVENAKNELIPTRTVTRWRDCIDYRRLNK
        R  + ++                                  +  D  +  PI        ++ V  KGGITVVEN  NELIPTRTVT WR CIDY +LNK
Subjt:  RPTTVTL----------------------------------QLADRSLTQPI------GKIEDVLLKGGITVVENAKNELIPTRTVTRWRDCIDYRRLNK

Query:  VTRKDHFPLPFIDQLLDRHAGKDYYYFLDGYSGYNQITIAREDQEKTTFTCPYGTFSFRTMPFKLCNAPATFQRCMMAIFSDMVE
         TRKDHFPLPF+DQ+LDR AG+DYYYFLDGYSGYN IT+A +DQ K TFTCPYGTF+FR MPF LCNAPATFQRCMM+I +DMVE
Subjt:  VTRKDHFPLPFIDQLLDRHAGKDYYYFLDGYSGYNQITIAREDQEKTTFTCPYGTFSFRTMPFKLCNAPATFQRCMMAIFSDMVE

A0A6P6GGJ1 LOW QUALITY PROTEIN: uncharacterized protein LOC112492876 (e value: 5.7e-83; % identity: 35.91)
Query:  EAWLDSHPPNSITSWNDLEEKFLEKFFPSNENAKYRAEIIAFTQSYNEPLDAAWERFQRLVRKCPHHGLPACIILEHFYSGLDQASKALVNTSANRSFLK
        + WL+S P  +IT+W+ +E KFL+KFFPS++ AK +++I  F Q   E L  AWERF+ L+R+CPHHG P  I +  FY+GLD  +K LV+ +A  S +K
Subjt:  EAWLDSHPPNSITSWNDLEEKFLEKFFPSNENAKYRAEIIAFTQSYNEPLDAAWERFQRLVRKCPHHGLPACIILEHFYSGLDQASKALVNTSANRSFLK

Query:  KSANEALAILDTIATNNRHW-------GETEPTII----LKNLAKAVETEANTTMQAQIKVIHNIMMGMTMSNQVNIAPTN-------------------
        K  +EA  +++ +A NN  +       G ++   +    + NL   + T+   T+  Q             SN  N+   N                   
Subjt:  KSANEALAILDTIATNNRHW-------GETEPTII----LKNLAKAVETEANTTMQAQIKVIHNIMMGMTMSNQVNIAPTN-------------------

Query:  ------VISSPCCEICGEEHTTDNCPLNPASVFYVGQK----GTRTDGI----HTQLRQHPNF---------SWEDKPAQIAQEIKNRPQGTLPNKTE-N
                 SP  +     H   N          + Q        T+        QL  H            S E++  Q+A + + R QGTLP++TE N
Subjt:  ------VISSPCCEICGEEHTTDNCPLNPASVFYVGQK----GTRTDGI----HTQLRQHPNF---------SWEDKPAQIAQEIKNRPQGTLPNKTE-N

Query:  PHQQGKEKCKAVTLRSGLEYDGPK----------------------------------------------------------------------------
        P    KE+ +A+TLRSG +  GPK                                                                            
Subjt:  PHQQGKEKCKAVTLRSGLEYDGPK----------------------------------------------------------------------------

Query:  ----YLVN--------------------QEMPTCVKFLKDILSKKRRLGEYETVALTECSS--ALVKNEIPPNLRTQELGIGEARPTTVTLQLADRSLTQ
            YL                      ++MP+ VKFLK+ILS KRR   YE VAL+E S+    V+++   N   +++  GE       L+L D  +  
Subjt:  ----YLVN--------------------QEMPTCVKFLKDILSKKRRLGEYETVALTECSS--ALVKNEIPPNLRTQELGIGEARPTTVTLQLADRSLTQ

Query:  PI------GKIEDVLLKGGITVVENAKNELIPTRTVTRWRDCIDYRRLNKVTRKDHFPLPFIDQLLDRHAGKDYYYFLDGYSGYNQITIAREDQEKTTFT
        PI        I+ V  KGG+TV EN K+ELIPTR VT WR CIDYR+LNKVTRKDHFPLPFIDQ+L+R AGK+YY FLDGYSGYNQI IA +DQEKTTFT
Subjt:  PI------GKIEDVLLKGGITVVENAKNELIPTRTVTRWRDCIDYRRLNKVTRKDHFPLPFIDQLLDRHAGKDYYYFLDGYSGYNQITIAREDQEKTTFT

Query:  CPYGTFSFRTMPFKLCNAPATFQRCMMAIFSDMVE
        CPYGTF++R MPF LCNAPATFQRCMM+IFS+MVE
Subjt:  CPYGTFSFRTMPFKLCNAPATFQRCMMAIFSDMVE

SwissProt top hits (e value, % identity, alignment)
P04323 Retrovirus-related Pol polyprotein from transposon 17.6 (e value: 9.3e-14; % identity: 34.09)
Query:  KIEDVLLKGGITVVENAKNE---LIPTRT----VTRWRDCIDYRRLNKVTRKDHFPLPFIDQLLDRHAGKDYYYFLDGYSGYNQITIAREDQEKTTFTCP
        +I+D+L +G I    +  N    ++P +       ++R  IDYR+LN++T  D  P+P +D++L +    +Y+  +D   G++QI +  E   KT F+  
Subjt:  KIEDVLLKGGITVVENAKNE---LIPTRT----VTRWRDCIDYRRLNKVTRKDHFPLPFIDQLLDRHAGKDYYYFLDGYSGYNQITIAREDQEKTTFTCP

Query:  YGTFSFRTMPFKLCNAPATFQRCMMAIFSDMV
        +G + +  MPF L NAPATFQRCM  I   ++
Subjt:  YGTFSFRTMPFKLCNAPATFQRCMMAIFSDMV

P20825 Retrovirus-related Pol polyprotein from transposon 297 (e value: 9.3e-14; % identity: 39.18)
Query:  RWRDCIDYRRLNKVTRKDHFPLPFIDQLLDRHAGKDYYYFLDGYSGYNQITIAREDQEKTTFTCPYGTFSFRTMPFKLCNAPATFQRCMMAIFSDMV
        ++R  IDYR+LN++T  D +P+P +D++L +     Y+  +D   G++QI +  E   KT F+   G + +  MPF L NAPATFQRCM  I   ++
Subjt:  RWRDCIDYRRLNKVTRKDHFPLPFIDQLLDRHAGKDYYYFLDGYSGYNQITIAREDQEKTTFTCPYGTFSFRTMPFKLCNAPATFQRCMMAIFSDMV

P31843 RNA-directed DNA polymerase homolog (e value: 7.6e-16; % identity: 46.88)
Query:  RDCIDYRRLNKVTRKDHFPLPFIDQLLDRHAGKDYYYFLDGYSGYNQITIAREDQEKTTFTCPYGTFSFRTMPFKLCNAPATFQRCMMAIFSDMVE
        R CIDYR L KVT K+ +P+P +D L DR A   ++  LD  SGY Q+ IA+ D+ KTT    YG+F FR MPF L NA ATF   M  +  + ++
Subjt:  RDCIDYRRLNKVTRKDHFPLPFIDQLLDRHAGKDYYYFLDGYSGYNQITIAREDQEKTTFTCPYGTFSFRTMPFKLCNAPATFQRCMMAIFSDMVE

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein (e value: 3.7e-18; % identity: 35.23)
Query:  LVKNEIPP------NLRTQ---ELGIGEARPTTVTLQLADRSLTQPIGKIEDVLLKGGITVVENAKNE----LIPTRTVTRWRDCIDYRRLNKVTRKDHF
        +++N++PP      N+  +   E+  G   P      + +++  Q I KI   LL     V   +       L+P +  T +R C+DYR LNK T  D F
Subjt:  LVKNEIPP------NLRTQ---ELGIGEARPTTVTLQLADRSLTQPIGKIEDVLLKGGITVVENAKNE----LIPTRTVTRWRDCIDYRRLNKVTRKDHF

Query:  PLPFIDQLLDRHAGKDYYYFLDGYSGYNQITIAREDQEKTTFTCPYGTFSFRTMPFKLCNAPATFQRCMMAIFSDM
        PLP ID LL R      +  LD +SGY+QI +  +D+ KT F  P G + +  MPF L NAP+TF R M   F D+
Subjt:  PLPFIDQLLDRHAGKDYYYFLDGYSGYNQITIAREDQEKTTFTCPYGTFSFRTMPFKLCNAPATFQRCMMAIFSDM

Q99315 Transposon Ty3-G Gag-Pol polyprotein (e value: 3.7e-18; % identity: 35.23)
Query:  LVKNEIPP------NLRTQ---ELGIGEARPTTVTLQLADRSLTQPIGKIEDVLLKGGITVVENAKNE----LIPTRTVTRWRDCIDYRRLNKVTRKDHF
        +++N++PP      N+  +   E+  G   P      + +++  Q I KI   LL     V   +       L+P +  T +R C+DYR LNK T  D F
Subjt:  LVKNEIPP------NLRTQ---ELGIGEARPTTVTLQLADRSLTQPIGKIEDVLLKGGITVVENAKNE----LIPTRTVTRWRDCIDYRRLNKVTRKDHF

Query:  PLPFIDQLLDRHAGKDYYYFLDGYSGYNQITIAREDQEKTTFTCPYGTFSFRTMPFKLCNAPATFQRCMMAIFSDM
        PLP ID LL R      +  LD +SGY+QI +  +D+ KT F  P G + +  MPF L NAP+TF R M   F D+
Subjt:  PLPFIDQLLDRHAGKDYYYFLDGYSGYNQITIAREDQEKTTFTCPYGTFSFRTMPFKLCNAPATFQRCMMAIFSDM

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGTTAAGTAAGTTACCTAGTGAAGCTACAAATAAATCAACAAGGGTTGTTCATCAGGTTAGTCCTTCAACAAAGAGTTTGATCTCGAAAGGGACGAACGATATCGGTTT
TCATAAAGGGGATATAGGAGAAAGGGCTCGAAAAAGTGTATTCGATGTTATAGGGTTCAGGTTAGTGGATGAACAAGGGTGTCCACTCCTTGATATTGATCCTGAGATAG
AAATAACCTTTCGTCATCGTCGGAGAGAGCAAAGACGAAAGAGAAGGGAACAACAAGAGTTGAGCGCACAGGAACCTCTAGAAGAAGCTTCTTACATACAAGAGTTTCTG
ATGGAACAACCTGGAGTCGATCCTCAAGGAGATGTTGAAGCTTGGTTGGACTCACACCCTCCAAACTCCATCACTTCTTGGAACGATTTGGAAGAGAAATTTTTAGAGAA
GTTTTTTCCTTCTAATGAAAATGCCAAATATAGAGCTGAAATTATTGCATTTACACAATCTTATAATGAACCTCTGGATGCAGCCTGGGAAAGATTTCAAAGGTTGGTTC
GGAAGTGTCCACATCACGGATTGCCAGCTTGCATCATCTTAGAGCATTTTTATAGTGGATTAGATCAAGCTTCGAAGGCACTAGTCAATACATCTGCAAACAGATCTTTC
TTAAAGAAGTCTGCAAATGAGGCACTTGCTATCTTGGACACCATAGCTACAAACAATAGACATTGGGGAGAAACTGAGCCAACAATAATTTTGAAGAATCTGGCTAAAGC
AGTAGAGACAGAGGCTAATACTACAATGCAAGCTCAAATCAAGGTTATCCACAACATAATGATGGGCATGACTATGAGCAACCAAGTGAACATAGCCCCTACCAATGTTA
TTTCTTCTCCCTGCTGTGAAATATGTGGTGAGGAACACACTACTGACAACTGTCCACTAAATCCCGCGTCTGTCTTTTATGTAGGTCAAAAGGGAACCAGAACAGATGGA
ATCCATACTCAGCTACGACAACATCCAAACTTCTCTTGGGAGGACAAGCCGGCTCAAATTGCTCAAGAAATCAAAAATAGACCACAAGGGACATTGCCCAACAAGACTGA
GAACCCTCATCAACAAGGAAAAGAGAAGTGCAAGGCAGTCACCTTGAGAAGTGGACTAGAGTATGATGGCCCAAAATACCTCGTGAATCAAGAAATGCCTACTTGCGTGA
AGTTCCTGAAGGACATTCTATCAAAGAAAAGAAGGTTGGGAGAATATGAAACAGTTGCACTTACTGAATGTTCTAGTGCTCTGGTCAAAAATGAGATTCCTCCAAACTTA
AGGACCCAGGAATTGGGGATAGGAGAAGCAAGACCAACAACCGTAACCTTACAGTTAGCCGACAGATCACTAACACAACCTATTGGAAAAATCGAAGATGTGTTACTCAA
GGGTGGGATAACTGTAGTGGAGAACGCGAAAAACGAATTGATCCCGACAAGGACAGTCACAAGATGGCGGGACTGCATTGACTATCGTCGCCTCAATAAAGTAACCAGGA
AAGACCACTTCCCGTTACCATTCATAGATCAACTGCTGGATAGACATGCAGGGAAAGATTATTACTATTTTTTAGACGGTTACTCAGGTTACAATCAAATAACAATAGCC
CGAGAAGACCAGGAGAAGACTACGTTCACATGTCCGTACGGGACATTTTCCTTTCGAACAATGCCATTCAAATTATGCAATGCACCAGCCACCTTTCAAAGGTGTATGAT
GGCTATATTCTCCGATATGGTAGAATGA
mRNA sequence
ATGTTAAGTAAGTTACCTAGTGAAGCTACAAATAAATCAACAAGGGTTGTTCATCAGGTTAGTCCTTCAACAAAGAGTTTGATCTCGAAAGGGACGAACGATATCGGTTT
TCATAAAGGGGATATAGGAGAAAGGGCTCGAAAAAGTGTATTCGATGTTATAGGGTTCAGGTTAGTGGATGAACAAGGGTGTCCACTCCTTGATATTGATCCTGAGATAG
AAATAACCTTTCGTCATCGTCGGAGAGAGCAAAGACGAAAGAGAAGGGAACAACAAGAGTTGAGCGCACAGGAACCTCTAGAAGAAGCTTCTTACATACAAGAGTTTCTG
ATGGAACAACCTGGAGTCGATCCTCAAGGAGATGTTGAAGCTTGGTTGGACTCACACCCTCCAAACTCCATCACTTCTTGGAACGATTTGGAAGAGAAATTTTTAGAGAA
GTTTTTTCCTTCTAATGAAAATGCCAAATATAGAGCTGAAATTATTGCATTTACACAATCTTATAATGAACCTCTGGATGCAGCCTGGGAAAGATTTCAAAGGTTGGTTC
GGAAGTGTCCACATCACGGATTGCCAGCTTGCATCATCTTAGAGCATTTTTATAGTGGATTAGATCAAGCTTCGAAGGCACTAGTCAATACATCTGCAAACAGATCTTTC
TTAAAGAAGTCTGCAAATGAGGCACTTGCTATCTTGGACACCATAGCTACAAACAATAGACATTGGGGAGAAACTGAGCCAACAATAATTTTGAAGAATCTGGCTAAAGC
AGTAGAGACAGAGGCTAATACTACAATGCAAGCTCAAATCAAGGTTATCCACAACATAATGATGGGCATGACTATGAGCAACCAAGTGAACATAGCCCCTACCAATGTTA
TTTCTTCTCCCTGCTGTGAAATATGTGGTGAGGAACACACTACTGACAACTGTCCACTAAATCCCGCGTCTGTCTTTTATGTAGGTCAAAAGGGAACCAGAACAGATGGA
ATCCATACTCAGCTACGACAACATCCAAACTTCTCTTGGGAGGACAAGCCGGCTCAAATTGCTCAAGAAATCAAAAATAGACCACAAGGGACATTGCCCAACAAGACTGA
GAACCCTCATCAACAAGGAAAAGAGAAGTGCAAGGCAGTCACCTTGAGAAGTGGACTAGAGTATGATGGCCCAAAATACCTCGTGAATCAAGAAATGCCTACTTGCGTGA
AGTTCCTGAAGGACATTCTATCAAAGAAAAGAAGGTTGGGAGAATATGAAACAGTTGCACTTACTGAATGTTCTAGTGCTCTGGTCAAAAATGAGATTCCTCCAAACTTA
AGGACCCAGGAATTGGGGATAGGAGAAGCAAGACCAACAACCGTAACCTTACAGTTAGCCGACAGATCACTAACACAACCTATTGGAAAAATCGAAGATGTGTTACTCAA
GGGTGGGATAACTGTAGTGGAGAACGCGAAAAACGAATTGATCCCGACAAGGACAGTCACAAGATGGCGGGACTGCATTGACTATCGTCGCCTCAATAAAGTAACCAGGA
AAGACCACTTCCCGTTACCATTCATAGATCAACTGCTGGATAGACATGCAGGGAAAGATTATTACTATTTTTTAGACGGTTACTCAGGTTACAATCAAATAACAATAGCC
CGAGAAGACCAGGAGAAGACTACGTTCACATGTCCGTACGGGACATTTTCCTTTCGAACAATGCCATTCAAATTATGCAATGCACCAGCCACCTTTCAAAGGTGTATGAT
GGCTATATTCTCCGATATGGTAGAATGA
Protein sequence
MLSKLPSEATNKSTRVVHQVSPSTKSLISKGTNDIGFHKGDIGERARKSVFDVIGFRLVDEQGCPLLDIDPEIEITFRHRRREQRRKRREQQELSAQEPLEEASYIQEFL
MEQPGVDPQGDVEAWLDSHPPNSITSWNDLEEKFLEKFFPSNENAKYRAEIIAFTQSYNEPLDAAWERFQRLVRKCPHHGLPACIILEHFYSGLDQASKALVNTSANRSF
LKKSANEALAILDTIATNNRHWGETEPTIILKNLAKAVETEANTTMQAQIKVIHNIMMGMTMSNQVNIAPTNVISSPCCEICGEEHTTDNCPLNPASVFYVGQKGTRTDG
IHTQLRQHPNFSWEDKPAQIAQEIKNRPQGTLPNKTENPHQQGKEKCKAVTLRSGLEYDGPKYLVNQEMPTCVKFLKDILSKKRRLGEYETVALTECSSALVKNEIPPNL
RTQELGIGEARPTTVTLQLADRSLTQPIGKIEDVLLKGGITVVENAKNELIPTRTVTRWRDCIDYRRLNKVTRKDHFPLPFIDQLLDRHAGKDYYYFLDGYSGYNQITIA
REDQEKTTFTCPYGTFSFRTMPFKLCNAPATFQRCMMAIFSDMVE
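
The protein sequence above should correspond to a codon-by-codon translation of the CDS. As a minimal sketch of that relationship, assuming the standard genetic code (the page does not say which translation table was used), the snippet below translates the first 27 nucleotides of the CDS; the function name `translate` is a hypothetical helper, not part of any database tool.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (third base varies fastest); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First nine codons of the CDS listed in this record.
print(translate("ATGTTAAGTAAGTTACCTAGTGAAGCT"))  # MLSKLPSEA
```

Passing the full CDS (with its line breaks) to the same function gives a string that can be compared against the protein sequence listed in this record.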