CuGenDBv2

Lag0005905 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0005905
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: LINE-1 retrotransposable element ORF2 protein
Genome location: chr6:33516961..33521591
RNA-Seq Expression: Lag0005905
Synteny: Lag0005905
Gene Ontology terms: NA
InterPro domains: IPR025558 - Domain of unknown function DUF4283
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
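
The genome location in this record is written as `chr:start..end`. The following is a minimal Python sketch for splitting such a string and computing the span it covers. It assumes the coordinates are 1-based and inclusive, which the record itself does not state; `Location` and `parse_location` are hypothetical names introduced only for illustration.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval; coordinates are ASSUMED 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive interval, so both endpoints count.
        return self.end - self.start + 1


# Matches strings such as "chr6:33516961..33521591".
_LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string into a Location (hypothetical helper)."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start {start} is greater than end {end}")
    return Location(match["seqid"], start, end)


loc = parse_location("chr6:33516961..33521591")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption the gene spans 4631 bp on chr6. The CDS listed further down is shorter than this span, so the two lengths should not be expected to match.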


Homology
GenBank top hits (e value, % identity, alignment)
KAA0044449.1 hypothetical protein E6C27_scaffold46G001820 [Cucumis melo var. makuwa] (e value: 6.9e-51; identity: 27.41%)
Query:  TIVVTRRDFHEDWGRILNILQEYTQHS---YIINPFQPNKALLKCSIGEMARLLSTNRGWVTFGPITVKLESWNPHLYERISVIPSYGGWIKFRNIPLHL
        T  + RR FH+DW +I++ L++ T      +   PF  +KALL     E+A+LL  N GW T GP  VK E W+ + +    VIPSYGGW +FR IPLH+
Subjt:  TIVVTRRDFHEDWGRILNILQEYTQHS---YIINPFQPNKALLKCSIGEMARLLSTNRGWVTFGPITVKLESWNPHLYERISVIPSYGGWIKFRNIPLHL

Query:  WSLATFKAIGDCYGGFLEYDQANSNLIECVEVAIKVRGNYCGFIPSEVRLVDRE-QLFIAQAVTFENHNLLISKVVGKHGGFTTEVARNF-----YSGEG
        W+L TF  IG+ YGGF++    + N +E  E  IKV+ NY GF+P+ +++ D E   FI Q VT      L  +    HG FT   A NF     Y+ + 
Subjt:  WSLATFKAIGDCYGGFLEYDQANSNLIECVEVAIKVRGNYCGFIPSEVRLVDRE-QLFIAQAVTFENHNLLISKVVGKHGGFTTEVARNF-----YSGEG

Query:  AWREILETQVTEYEEEREDLPSDFHVCFREEEN--------AIKPAGNHEALVESKSEGEG----------KLPFSRKMVKSLK-------KWNMCIRPI
         +R  L          + DLP       ++  N         IK      + +    EG+           K     +M +  K       ++N  I PI
Subjt:  AWREILETQVTEYEEEREDLPSDFHVCFREEEN--------AIKPAGNHEALVESKSEGEG----------KLPFSRKMVKSLK-------KWNMCIRPI

Query:  SLKGNVA-----------------------------------AKRKTREVTTQI-----RASKKEITTEGTSEEQGL-----------------------
        + K  V+                                   +KR T+ +  ++     R+ ++E +     +++G                        
Subjt:  SLKGNVA-----------------------------------AKRKTREVTTQI-----RASKKEITTEGTSEEQGL-----------------------

Query:  ----DGSY-------------EDSLM--------------------EREGDRSSAQRALIK-------SLFVKINSNVALLQETRMSSSGGILVMWKEDN
              SY             +DSL                     E E D  S +R L          L    NS    +   RM S          +N
Subjt:  ----DGSY-------------EDSLM--------------------EREGDRSSAQRALIK-------SLFVKINSNVALLQETRMSSSGGILVMWKEDN

Query:  ISVEESIIGEFSISISFSCDNYFSGWISGVYSPASNHRRDVFWQELGDLARLSSDFWCLVGDFNVVRWTSEKSKGGRVTRNMRILNAFIDRSELFDVPLK
        +S E+ I G FS+SI    +N  S W+S +Y PA    R +FW+EL +L  +    W L GDFNV+RW  E S     + +M+  N FI    L D PL 
Subjt:  ISVEESIIGEFSISISFSCDNYFSGWISGVYSPASNHRRDVFWQELGDLARLSSDFWCLVGDFNVVRWTSEKSKGGRVTRNMRILNAFIDRSELFDVPLK

Query:  NGIFTLSDLREEPTATKIDRFL---------------------------------PSLGPSPFQFENIWLDHPNF
        N  FT S+LR + T +++DRFL                                  S GPSPF+F N +L  P++
Subjt:  NGIFTLSDLREEPTATKIDRFL---------------------------------PSLGPSPFQFENIWLDHPNF

KAA0056838.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] (e value: 4.2e-48; identity: 26.09%)
Query:  TSRKSFVEVLKGEPSMAKNRRKPKPKNIEQPHDLWFNDVG--KMEVRVINWDETIVVTRRDFHEDWGRILNILQEYTQHSYIINPFQPNKALLKCSIGEM
        +SRKS+ +VL  + S   N+++ K  + +       + +G     +   ++++T+++TRR FH+DW RI+  L++ ++ ++   PFQ +KA+L  +  + 
Subjt:  TSRKSFVEVLKGEPSMAKNRRKPKPKNIEQPHDLWFNDVG--KMEVRVINWDETIVVTRRDFHEDWGRILNILQEYTQHSYIINPFQPNKALLKCSIGEM

Query:  ARLLSTNR---GWVTFGPITVKLESWNPHLYERISVIPSYGGWIKFRNIPLHLWSLATFKAIGDCYGGFLEYDQANSNLIECVEVAIKVRGNYCGFIPSE
        A+LL +N+   GW T G   VK ESW+ +L+   SVIPSYGGW++FR IPLHLW+  TF+ IG   GGFL+  +    + + ++  IKVR NY GF+P+ 
Subjt:  ARLLSTNR---GWVTFGPITVKLESWNPHLYERISVIPSYGGWIKFRNIPLHLWSLATFKAIGDCYGGFLEYDQANSNLIECVEVAIKVRGNYCGFIPSE

Query:  VRLVDRE-QLFIAQAVTFENHNLLISKVVGKHGGFTTEVARNF-----------YSGEGAWREILETQVTEYEEEREDLPS-DFHVCFREEENAIKPAGN
        + + D + + FI   V       L+ + V  HG F T+ A  F           Y+G  A          +Y     D  S  +H   ++  ++      
Subjt:  VRLVDRE-QLFIAQAVTFENHNLLISKVVGKHGGFTTEVARNF-----------YSGEGAWREILETQVTEYEEEREDLPS-DFHVCFREEENAIKPAGN

Query:  HEALVESKSEGEGKL----------PFSRKMVKSLKKWNMCIRPISLKGN-----VAAKRKTREVTT---------------------QIRASKKEITT-
         +  +  + + +GK            +S++  +   +    + P  ++ N     +  K K+ E++T                     +I+   +E T  
Subjt:  HEALVESKSEGEGKL----------PFSRKMVKSLKKWNMCIRPISLKGN-----VAAKRKTREVTT---------------------QIRASKKEITT-

Query:  ---------EGT----------------------------------------------------SEEQGLDGSYEDSLMEREGDRSSAQ--------RAL
                 EG+                                                    S ++G D +   S    EG+   A+        RA 
Subjt:  ---------EGT----------------------------------------------------SEEQGLDGSYEDSLMEREGDRSSAQ--------RAL

Query:  IKSLFVKINSNVALL-----QETRMSSS---------------------GGILVMWKEDNISVEESIIGEFSISISFSCDNYFSGWISGVYSPASNHRRD
         + L + +  N   L      +   SSS                     GGILV+W +    V +  +G +SIS++    N  + W++ VY P   + R 
Subjt:  IKSLFVKINSNVALL-----QETRMSSS---------------------GGILVMWKEDNISVEESIIGEFSISISFSCDNYFSGWISGVYSPASNHRRD

Query:  VFWQELGDLARLSSDFWCLVGDFNVVRWTSEKSKGGRVTRNMRILNAFIDRSELFDVPLKNGIFTLSDLREEPTATKIDRFLPSLG
          W EL  L  L    W + GDFN+VRW  E +      RNM   N FI  +EL D PL N  FT S+LR  PT +++DRFL S G
Subjt:  VFWQELGDLARLSSDFWCLVGDFNVVRWTSEKSKGGRVTRNMRILNAFIDRSELFDVPLKNGIFTLSDLREEPTATKIDRFLPSLG

TYK08190.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] (e value: 5.5e-48; identity: 26.12%)
Query:  TSRKSFVEVLKGEPSMAKNRRKPKPKNIEQPHDLWFNDVG--KMEVRVINWDETIVVTRRDFHEDWGRILNILQEYTQHSYIINPFQPNKALLKCSIGEM
        +SRKS+ +VL  + S   N+++ K  + +       + +G     +   ++++T+++TRR FH+DW RI+  L++ ++ ++   PFQ +KA+L  +  + 
Subjt:  TSRKSFVEVLKGEPSMAKNRRKPKPKNIEQPHDLWFNDVG--KMEVRVINWDETIVVTRRDFHEDWGRILNILQEYTQHSYIINPFQPNKALLKCSIGEM

Query:  ARLLSTNR---GWVTFGPITVKLESWNPHLYERISVIPSYGGWIKFRNIPLHLWSLATFKAIGDCYGGFLEYDQANSNLIECVEVAIKVRGNYCGFIPSE
        A+LL +N+   GW T G   VK ESW+ +L+   SVIPSYGGW++FR IPLHLW+  TF+ IG   GGFL+  +    + + ++  IKVR NY GF+P+ 
Subjt:  ARLLSTNR---GWVTFGPITVKLESWNPHLYERISVIPSYGGWIKFRNIPLHLWSLATFKAIGDCYGGFLEYDQANSNLIECVEVAIKVRGNYCGFIPSE

Query:  VRLVDRE-QLFIAQAVTFENHNLLISKVVGKHGGFTTEVARNF----------------------------YSGEGAWREIL--ETQVTE-YEEEREDLP
        + + D + + FI   V       L+ + V  HG F T+ A  F                            YS   + +  +   TQ  +    E E  P
Subjt:  VRLVDRE-QLFIAQAVTFENHNLLISKVVGKHGGFTTEVARNF----------------------------YSGEGAWREIL--ETQVTE-YEEEREDLP

Query:  SDFHVCFREEENA----------------------------IKPAG-----------------------------------------------------N
         D  +  R +E                              + P G                                                     +
Subjt:  SDFHVCFREEENA----------------------------IKPAG-----------------------------------------------------N

Query:  HEALVESKSEGEGKLPFS---------RKMVKS-----LKKWNMCIRPISLKGNVAAKRKTREVTTQIRASKKEITTEGTSEEQGLDGSYEDSLMEREGD
        H+  ++   EG  ++  S           M++S     L  +N      + K   +A+ K   V+ +  A + +  +  T+E    D      L   E D
Subjt:  HEALVESKSEGEGKLPFS---------RKMVKS-----LKKWNMCIRPISLKGNVAAKRKTREVTTQIRASKKEITTEGTSEEQGLDGSYEDSLMEREGD

Query:  RSSAQRALI-------------------KSLFVKI--NSNVALLQETRMSSSGGILVMWKEDNISVEESIIGEFSISISFSCDNYFSGWISGVYSPASNH
        R+  ++ +I                    S F  I  + N+ +     +   GGILV+W + N  V +  +G +SIS++    N  + W++ VY P   +
Subjt:  RSSAQRALI-------------------KSLFVKI--NSNVALLQETRMSSSGGILVMWKEDNISVEESIIGEFSISISFSCDNYFSGWISGVYSPASNH

Query:  RRDVFWQELGDLARLSSDFWCLVGDFNVVRWTSEKSKGGRVTRNMRILNAFIDRSELFDVPLKNGIFTLSDLREEPTATKIDRFLPSLG
         R   W EL  L  L    W + GDFN+VRW  E +      RNM   N FI  +EL D P  N  FT S+LR  PT +++DRFL S G
Subjt:  RRDVFWQELGDLARLSSDFWCLVGDFNVVRWTSEKSKGGRVTRNMRILNAFIDRSELFDVPLKNGIFTLSDLREEPTATKIDRFLPSLG

TYK08190.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] (e value: 9.5e-08; identity: 44.3%)
Query:  SWLESPFSEEEIHKAIQSLGTLKSLGPDGMTNEFFKKSWNILKPDLVKAFQDFFEKGIIYKRTDETYICLIMKKLRANK
        S L  PF E EI   I S    K+ GPDG T  F+KK W  LK DL+  F+DF + GI+    + T+I LI KK + +K
Subjt:  SWLESPFSEEEIHKAIQSLGTLKSLGPDGMTNEFFKKSWNILKPDLVKAFQDFFEKGIIYKRTDETYICLIMKKLRANK

TYK08190.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] (e value: 5.5e-48; identity: 26.75%)
Query:  DETIVVTRRDFHEDWGRILNILQEYTQHSYIINPFQPNKALLKCSIGEMARLLSTNRGWVTFGPITVKLESWNPHLYERISVIPSYGGWIKFRNIPLHLW
        + T+V+ RR FH+DW +IL  L++ T+ S+  N F   KAL+  S    A LL  N+GW T G  +V+ E W+P  +    +IPSYGGW  FR IPLHLW
Subjt:  DETIVVTRRDFHEDWGRILNILQEYTQHSYIINPFQPNKALLKCSIGEMARLLSTNRGWVTFGPITVKLESWNPHLYERISVIPSYGGWIKFRNIPLHLW

Query:  SLATFKAIGDCYGGFLEYDQANSNLIECVEVAIKVRGNYCGFIPSEVRLVDRE-QLFIAQAVTFENHNLLISKVVGKHGGFTTEVARN------------
        ++ TF+ IG    G ++  +   +    +E  IKVR NY GF+P+ VR+ D E   F  Q VT      LI + V  HG F  + A +            
Subjt:  SLATFKAIGDCYGGFLEYDQANSNLIECVEVAIKVRGNYCGFIPSEVRLVDRE-QLFIAQAVTFENHNLLISKVVGKHGGFTTEVARN------------

Query:  FYSGEGAWREILETQVTEYEEEREDLPSDF-HVCFREEENAIKPAGNHEALV----------ESKSE------GEGKLPFSRKMV-------------KS
        F   E    + L T     +    D PS    V  + + NA  P+  +E LV          +SK E       +G L   ++ V             KS
Subjt:  FYSGEGAWREILETQVTEYEEEREDLPSDF-HVCFREEENAIKPAGNHEALV----------ESKSE------GEGKLPFSRKMV-------------KS

Query:  LKK------------WNMCIRPISLKGNVAAKRKTREVTTQIRASKKEITTEGTSE-----------------------EQGL--------------DGS
         +K            +N    P +   ++ +  K ++V+ +    KK  +T+  S+                       ++GL              + S
Subjt:  LKK------------WNMCIRPISLKGNVAAKRKTREVTTQIRASKKEITTEGTSE-----------------------EQGL--------------DGS

Query:  YED-------------------------------------------------------------------------SLMEREG-------DRSSA-----
         ED                                                                         S +++ G       D S A     
Subjt:  YED-------------------------------------------------------------------------SLMEREG-------DRSSA-----

Query:  -------------QRALIKSLFVKINSNVALLQETRMSSSGGILVMWKEDNISVEESIIGEFSISISFSCDNYFSGWISGVYSPASNHRRDVFWQELGDL
                      + +IKSL+   ++++  + +    SSGGIL++W   N S+     G FS+S +F  +N  S W++G+Y P     R  FW EL +L
Subjt:  -------------QRALIKSLFVKINSNVALLQETRMSSSGGILVMWKEDNISVEESIIGEFSISISFSCDNYFSGWISGVYSPASNHRRDVFWQELGDL

Query:  ARLSSDFWCLVGDFNVVRWTSEKSKGGRVTRNMRILNAFIDRSELFDVPLKNGIFTLSDLREEPTATKIDRFL
          L+S  W L GD NV+R   E +     + N R+LN FI  + L D PL N  FT S+LR  PT ++IDRFL
Subjt:  ARLSSDFWCLVGDFNVVRWTSEKSKGGRVTRNMRILNAFIDRSELFDVPLKNGIFTLSDLREEPTATKIDRFL

XP_022149859.1 uncharacterized protein LOC111018186 [Momordica charantia] (e value: 3.0e-70; identity: 54.85%)
Query:  KSFVEVLKGEPSMAKNRRKPKPKNIEQPHDLWFNDVGKMEVRVINWDETIVVTRRDFHEDWGRILNILQEYTQHSYIINPFQPNKALLKCSIGEMARLLS
        KS  E++K        R   K + I           G  EVR +NW+ETIV+TRRDFH+DW RIL+ ++E T+ SYIINPFQ +KAL+KC   ++A LL 
Subjt:  KSFVEVLKGEPSMAKNRRKPKPKNIEQPHDLWFNDVGKMEVRVINWDETIVVTRRDFHEDWGRILNILQEYTQHSYIINPFQPNKALLKCSIGEMARLLS

Query:  TNRGWVTFGPITVKLESWNPHLYERISVIPSYGGWIKFRNIPLHLWSLATFKAIGDCYGGFLEYDQANSNLIECVEVAIKVRGNYCGFIPSEVRLVDREQ
        TN+GWVTFGP+TVKLE+WNP L+ R  + PSYG W+K RNIPLHLWSLATFKAIG+  GGF++YD  NS  IEC +VAIKV+ NYCGFIP+E+  +D   
Subjt:  TNRGWVTFGPITVKLESWNPHLYERISVIPSYGGWIKFRNIPLHLWSLATFKAIGDCYGGFLEYDQANSNLIECVEVAIKVRGNYCGFIPSEVRLVDREQ

Query:  LFIAQAVTFENHNLLISKVVGKHGGFTTEVARNFYSG
         F A+ V+FE+   L  K VG HGGF++E AR+F+ G
Subjt:  LFIAQAVTFENHNLLISKVVGKHGGFTTEVARNFYSG

TrEMBL top hits (e value, % identity, alignment)
A0A5A7TTA1 DUF4283 domain-containing protein (e value: 3.3e-51; identity: 27.41%)
Query:  TIVVTRRDFHEDWGRILNILQEYTQHS---YIINPFQPNKALLKCSIGEMARLLSTNRGWVTFGPITVKLESWNPHLYERISVIPSYGGWIKFRNIPLHL
        T  + RR FH+DW +I++ L++ T      +   PF  +KALL     E+A+LL  N GW T GP  VK E W+ + +    VIPSYGGW +FR IPLH+
Subjt:  TIVVTRRDFHEDWGRILNILQEYTQHS---YIINPFQPNKALLKCSIGEMARLLSTNRGWVTFGPITVKLESWNPHLYERISVIPSYGGWIKFRNIPLHL

Query:  WSLATFKAIGDCYGGFLEYDQANSNLIECVEVAIKVRGNYCGFIPSEVRLVDRE-QLFIAQAVTFENHNLLISKVVGKHGGFTTEVARNF-----YSGEG
        W+L TF  IG+ YGGF++    + N +E  E  IKV+ NY GF+P+ +++ D E   FI Q VT      L  +    HG FT   A NF     Y+ + 
Subjt:  WSLATFKAIGDCYGGFLEYDQANSNLIECVEVAIKVRGNYCGFIPSEVRLVDRE-QLFIAQAVTFENHNLLISKVVGKHGGFTTEVARNF-----YSGEG

Query:  AWREILETQVTEYEEEREDLPSDFHVCFREEEN--------AIKPAGNHEALVESKSEGEG----------KLPFSRKMVKSLK-------KWNMCIRPI
         +R  L          + DLP       ++  N         IK      + +    EG+           K     +M +  K       ++N  I PI
Subjt:  AWREILETQVTEYEEEREDLPSDFHVCFREEEN--------AIKPAGNHEALVESKSEGEG----------KLPFSRKMVKSLK-------KWNMCIRPI

Query:  SLKGNVA-----------------------------------AKRKTREVTTQI-----RASKKEITTEGTSEEQGL-----------------------
        + K  V+                                   +KR T+ +  ++     R+ ++E +     +++G                        
Subjt:  SLKGNVA-----------------------------------AKRKTREVTTQI-----RASKKEITTEGTSEEQGL-----------------------

Query:  ----DGSY-------------EDSLM--------------------EREGDRSSAQRALIK-------SLFVKINSNVALLQETRMSSSGGILVMWKEDN
              SY             +DSL                     E E D  S +R L          L    NS    +   RM S          +N
Subjt:  ----DGSY-------------EDSLM--------------------EREGDRSSAQRALIK-------SLFVKINSNVALLQETRMSSSGGILVMWKEDN

Query:  ISVEESIIGEFSISISFSCDNYFSGWISGVYSPASNHRRDVFWQELGDLARLSSDFWCLVGDFNVVRWTSEKSKGGRVTRNMRILNAFIDRSELFDVPLK
        +S E+ I G FS+SI    +N  S W+S +Y PA    R +FW+EL +L  +    W L GDFNV+RW  E S     + +M+  N FI    L D PL 
Subjt:  ISVEESIIGEFSISISFSCDNYFSGWISGVYSPASNHRRDVFWQELGDLARLSSDFWCLVGDFNVVRWTSEKSKGGRVTRNMRILNAFIDRSELFDVPLK

Query:  NGIFTLSDLREEPTATKIDRFL---------------------------------PSLGPSPFQFENIWLDHPNF
        N  FT S+LR + T +++DRFL                                  S GPSPF+F N +L  P++
Subjt:  NGIFTLSDLREEPTATKIDRFL---------------------------------PSLGPSPFQFENIWLDHPNF

A0A5A7US62 LINE-1 retrotransposable element ORF2 protein (e value: 2.7e-48; identity: 26.12%)
Query:  TSRKSFVEVLKGEPSMAKNRRKPKPKNIEQPHDLWFNDVG--KMEVRVINWDETIVVTRRDFHEDWGRILNILQEYTQHSYIINPFQPNKALLKCSIGEM
        +SRKS+ +VL  + S   N+++ K  + +       + +G     +   ++++T+++TRR FH+DW RI+  L++ ++ ++   PFQ +KA+L  +  + 
Subjt:  TSRKSFVEVLKGEPSMAKNRRKPKPKNIEQPHDLWFNDVG--KMEVRVINWDETIVVTRRDFHEDWGRILNILQEYTQHSYIINPFQPNKALLKCSIGEM

Query:  ARLLSTNR---GWVTFGPITVKLESWNPHLYERISVIPSYGGWIKFRNIPLHLWSLATFKAIGDCYGGFLEYDQANSNLIECVEVAIKVRGNYCGFIPSE
        A+LL +N+   GW T G   VK ESW+ +L+   SVIPSYGGW++FR IPLHLW+  TF+ IG   GGFL+  +    + + ++  IKVR NY GF+P+ 
Subjt:  ARLLSTNR---GWVTFGPITVKLESWNPHLYERISVIPSYGGWIKFRNIPLHLWSLATFKAIGDCYGGFLEYDQANSNLIECVEVAIKVRGNYCGFIPSE

Query:  VRLVDRE-QLFIAQAVTFENHNLLISKVVGKHGGFTTEVARNF----------------------------YSGEGAWREIL--ETQVTE-YEEEREDLP
        + + D + + FI   V       L+ + V  HG F T+ A  F                            YS   + +  +   TQ  +    E E  P
Subjt:  VRLVDRE-QLFIAQAVTFENHNLLISKVVGKHGGFTTEVARNF----------------------------YSGEGAWREIL--ETQVTE-YEEEREDLP

Query:  SDFHVCFREEENA----------------------------IKPAG-----------------------------------------------------N
         D  +  R +E                              + P G                                                     +
Subjt:  SDFHVCFREEENA----------------------------IKPAG-----------------------------------------------------N

Query:  HEALVESKSEGEGKLPFS---------RKMVKS-----LKKWNMCIRPISLKGNVAAKRKTREVTTQIRASKKEITTEGTSEEQGLDGSYEDSLMEREGD
        H+  ++   EG  ++  S           M++S     L  +N      + K   +A+ K   V+ +  A + +  +  T+E    D      L   E D
Subjt:  HEALVESKSEGEGKLPFS---------RKMVKS-----LKKWNMCIRPISLKGNVAAKRKTREVTTQIRASKKEITTEGTSEEQGLDGSYEDSLMEREGD

Query:  RSSAQRALI-------------------KSLFVKI--NSNVALLQETRMSSSGGILVMWKEDNISVEESIIGEFSISISFSCDNYFSGWISGVYSPASNH
        R+  ++ +I                    S F  I  + N+ +     +   GGILV+W + N  V +  +G +SIS++    N  + W++ VY P   +
Subjt:  RSSAQRALI-------------------KSLFVKI--NSNVALLQETRMSSSGGILVMWKEDNISVEESIIGEFSISISFSCDNYFSGWISGVYSPASNH

Query:  RRDVFWQELGDLARLSSDFWCLVGDFNVVRWTSEKSKGGRVTRNMRILNAFIDRSELFDVPLKNGIFTLSDLREEPTATKIDRFLPSLG
         R   W EL  L  L    W + GDFN+VRW  E +      RNM   N FI  +EL D P  N  FT S+LR  PT +++DRFL S G
Subjt:  RRDVFWQELGDLARLSSDFWCLVGDFNVVRWTSEKSKGGRVTRNMRILNAFIDRSELFDVPLKNGIFTLSDLREEPTATKIDRFLPSLG

A0A5A7US62 LINE-1 retrotransposable element ORF2 protein (e value: 4.6e-08; identity: 44.3%)
Query:  SWLESPFSEEEIHKAIQSLGTLKSLGPDGMTNEFFKKSWNILKPDLVKAFQDFFEKGIIYKRTDETYICLIMKKLRANK
        S L  PF E EI   I S    K+ GPDG T  F+KK W  LK DL+  F+DF + GI+    + T+I LI KK + +K
Subjt:  SWLESPFSEEEIHKAIQSLGTLKSLGPDGMTNEFFKKSWNILKPDLVKAFQDFFEKGIIYKRTDETYICLIMKKLRANK

A0A5A7US62 LINE-1 retrotransposable element ORF2 protein (e value: 2.7e-48; identity: 26.12%)
Query:  TSRKSFVEVLKGEPSMAKNRRKPKPKNIEQPHDLWFNDVG--KMEVRVINWDETIVVTRRDFHEDWGRILNILQEYTQHSYIINPFQPNKALLKCSIGEM
        +SRKS+ +VL  + S   N+++ K  + +       + +G     +   ++++T+++TRR FH+DW RI+  L++ ++ ++   PFQ +KA+L  +  + 
Subjt:  TSRKSFVEVLKGEPSMAKNRRKPKPKNIEQPHDLWFNDVG--KMEVRVINWDETIVVTRRDFHEDWGRILNILQEYTQHSYIINPFQPNKALLKCSIGEM

Query:  ARLLSTNR---GWVTFGPITVKLESWNPHLYERISVIPSYGGWIKFRNIPLHLWSLATFKAIGDCYGGFLEYDQANSNLIECVEVAIKVRGNYCGFIPSE
        A+LL +N+   GW T G   VK ESW+ +L+   SVIPSYGGW++FR IPLHLW+  TF+ IG   GGFL+  +    + + ++  IKVR NY GF+P+ 
Subjt:  ARLLSTNR---GWVTFGPITVKLESWNPHLYERISVIPSYGGWIKFRNIPLHLWSLATFKAIGDCYGGFLEYDQANSNLIECVEVAIKVRGNYCGFIPSE

Query:  VRLVDRE-QLFIAQAVTFENHNLLISKVVGKHGGFTTEVARNF----------------------------YSGEGAWREIL--ETQVTE-YEEEREDLP
        + + D + + FI   V       L+ + V  HG F T+ A  F                            YS   + +  +   TQ  +    E E  P
Subjt:  VRLVDRE-QLFIAQAVTFENHNLLISKVVGKHGGFTTEVARNF----------------------------YSGEGAWREIL--ETQVTE-YEEEREDLP

Query:  SDFHVCFREEENA----------------------------IKPAG-----------------------------------------------------N
         D  +  R +E                              + P G                                                     +
Subjt:  SDFHVCFREEENA----------------------------IKPAG-----------------------------------------------------N

Query:  HEALVESKSEGEGKLPFS---------RKMVKS-----LKKWNMCIRPISLKGNVAAKRKTREVTTQIRASKKEITTEGTSEEQGLDGSYEDSLMEREGD
        H+  ++   EG  ++  S           M++S     L  +N      + K   +A+ K   V+ +  A + +  +  T+E    D      L   E D
Subjt:  HEALVESKSEGEGKLPFS---------RKMVKS-----LKKWNMCIRPISLKGNVAAKRKTREVTTQIRASKKEITTEGTSEEQGLDGSYEDSLMEREGD

Query:  RSSAQRALI-------------------KSLFVKI--NSNVALLQETRMSSSGGILVMWKEDNISVEESIIGEFSISISFSCDNYFSGWISGVYSPASNH
        R+  ++ +I                    S F  I  + N+ +     +   GGILV+W + N  V +  +G +SIS++    N  + W++ VY P   +
Subjt:  RSSAQRALI-------------------KSLFVKI--NSNVALLQETRMSSSGGILVMWKEDNISVEESIIGEFSISISFSCDNYFSGWISGVYSPASNH

Query:  RRDVFWQELGDLARLSSDFWCLVGDFNVVRWTSEKSKGGRVTRNMRILNAFIDRSELFDVPLKNGIFTLSDLREEPTATKIDRFLPSLG
         R   W EL  L  L    W + GDFN+VRW  E +      RNM   N FI  +EL D P  N  FT S+LR  PT +++DRFL S G
Subjt:  RRDVFWQELGDLARLSSDFWCLVGDFNVVRWTSEKSKGGRVTRNMRILNAFIDRSELFDVPLKNGIFTLSDLREEPTATKIDRFLPSLG

A0A5D3BKT8 LINE-1 retrotransposable element ORF2 protein (e value: 2.0e-48; identity: 26.09%)
Query:  TSRKSFVEVLKGEPSMAKNRRKPKPKNIEQPHDLWFNDVG--KMEVRVINWDETIVVTRRDFHEDWGRILNILQEYTQHSYIINPFQPNKALLKCSIGEM
        +SRKS+ +VL  + S   N+++ K  + +       + +G     +   ++++T+++TRR FH+DW RI+  L++ ++ ++   PFQ +KA+L  +  + 
Subjt:  TSRKSFVEVLKGEPSMAKNRRKPKPKNIEQPHDLWFNDVG--KMEVRVINWDETIVVTRRDFHEDWGRILNILQEYTQHSYIINPFQPNKALLKCSIGEM

Query:  ARLLSTNR---GWVTFGPITVKLESWNPHLYERISVIPSYGGWIKFRNIPLHLWSLATFKAIGDCYGGFLEYDQANSNLIECVEVAIKVRGNYCGFIPSE
        A+LL +N+   GW T G   VK ESW+ +L+   SVIPSYGGW++FR IPLHLW+  TF+ IG   GGFL+  +    + + ++  IKVR NY GF+P+ 
Subjt:  ARLLSTNR---GWVTFGPITVKLESWNPHLYERISVIPSYGGWIKFRNIPLHLWSLATFKAIGDCYGGFLEYDQANSNLIECVEVAIKVRGNYCGFIPSE

Query:  VRLVDRE-QLFIAQAVTFENHNLLISKVVGKHGGFTTEVARNF-----------YSGEGAWREILETQVTEYEEEREDLPS-DFHVCFREEENAIKPAGN
        + + D + + FI   V       L+ + V  HG F T+ A  F           Y+G  A          +Y     D  S  +H   ++  ++      
Subjt:  VRLVDRE-QLFIAQAVTFENHNLLISKVVGKHGGFTTEVARNF-----------YSGEGAWREILETQVTEYEEEREDLPS-DFHVCFREEENAIKPAGN

Query:  HEALVESKSEGEGKL----------PFSRKMVKSLKKWNMCIRPISLKGN-----VAAKRKTREVTT---------------------QIRASKKEITT-
         +  +  + + +GK            +S++  +   +    + P  ++ N     +  K K+ E++T                     +I+   +E T  
Subjt:  HEALVESKSEGEGKL----------PFSRKMVKSLKKWNMCIRPISLKGN-----VAAKRKTREVTT---------------------QIRASKKEITT-

Query:  ---------EGT----------------------------------------------------SEEQGLDGSYEDSLMEREGDRSSAQ--------RAL
                 EG+                                                    S ++G D +   S    EG+   A+        RA 
Subjt:  ---------EGT----------------------------------------------------SEEQGLDGSYEDSLMEREGDRSSAQ--------RAL

Query:  IKSLFVKINSNVALL-----QETRMSSS---------------------GGILVMWKEDNISVEESIIGEFSISISFSCDNYFSGWISGVYSPASNHRRD
         + L + +  N   L      +   SSS                     GGILV+W +    V +  +G +SIS++    N  + W++ VY P   + R 
Subjt:  IKSLFVKINSNVALL-----QETRMSSS---------------------GGILVMWKEDNISVEESIIGEFSISISFSCDNYFSGWISGVYSPASNHRRD

Query:  VFWQELGDLARLSSDFWCLVGDFNVVRWTSEKSKGGRVTRNMRILNAFIDRSELFDVPLKNGIFTLSDLREEPTATKIDRFLPSLG
          W EL  L  L    W + GDFN+VRW  E +      RNM   N FI  +EL D PL N  FT S+LR  PT +++DRFL S G
Subjt:  VFWQELGDLARLSSDFWCLVGDFNVVRWTSEKSKGGRVTRNMRILNAFIDRSELFDVPLKNGIFTLSDLREEPTATKIDRFLPSLG

A0A5D3CA17 LINE-1 retrotransposable element ORF2 protein (e value: 4.6e-08; identity: 44.3%)
Query:  SWLESPFSEEEIHKAIQSLGTLKSLGPDGMTNEFFKKSWNILKPDLVKAFQDFFEKGIIYKRTDETYICLIMKKLRANK
        S L  PF E EI   I S    K+ GPDG T  F+KK W  LK DL+  F+DF + GI+    + T+I LI KK + +K
Subjt:  SWLESPFSEEEIHKAIQSLGTLKSLGPDGMTNEFFKKSWNILKPDLVKAFQDFFEKGIIYKRTDETYICLIMKKLRANK

A0A6J1D6X4 uncharacterized protein LOC111018186 (e value: 1.4e-70; identity: 54.85%)
Query:  KSFVEVLKGEPSMAKNRRKPKPKNIEQPHDLWFNDVGKMEVRVINWDETIVVTRRDFHEDWGRILNILQEYTQHSYIINPFQPNKALLKCSIGEMARLLS
        KS  E++K        R   K + I           G  EVR +NW+ETIV+TRRDFH+DW RIL+ ++E T+ SYIINPFQ +KAL+KC   ++A LL 
Subjt:  KSFVEVLKGEPSMAKNRRKPKPKNIEQPHDLWFNDVGKMEVRVINWDETIVVTRRDFHEDWGRILNILQEYTQHSYIINPFQPNKALLKCSIGEMARLLS

Query:  TNRGWVTFGPITVKLESWNPHLYERISVIPSYGGWIKFRNIPLHLWSLATFKAIGDCYGGFLEYDQANSNLIECVEVAIKVRGNYCGFIPSEVRLVDREQ
        TN+GWVTFGP+TVKLE+WNP L+ R  + PSYG W+K RNIPLHLWSLATFKAIG+  GGF++YD  NS  IEC +VAIKV+ NYCGFIP+E+  +D   
Subjt:  TNRGWVTFGPITVKLESWNPHLYERISVIPSYGGWIKFRNIPLHLWSLATFKAIGDCYGGFLEYDQANSNLIECVEVAIKVRGNYCGFIPSEVRLVDREQ

Query:  LFIAQAVTFENHNLLISKVVGKHGGFTTEVARNFYSG
         F A+ V+FE+   L  K VG HGGF++E AR+F+ G
Subjt:  LFIAQAVTFENHNLLISKVVGKHGGFTTEVARNFYSG

SwissProt top hits (e value, % identity, alignment)
O00370 LINE-1 retrotransposable element ORF2 protein (e value: 1.6e-05; identity: 42.5%)
Query:  LNQDQWASWLESPFSEEEIHKAIQSLGTLKSLGPDGMTNEFFKKSWNILKPDLVKAFQDFFEKGIIYKRTDETYICLIMK
        LNQ++  S L  P +  EI   I SL T KS GPDG T EF+++    L P L+K FQ   ++GI+     E  I LI K
Subjt:  LNQDQWASWLESPFSEEEIHKAIQSLGTLKSLGPDGMTNEFFKKSWNILKPDLVKAFQDFFEKGIIYKRTDETYICLIMK

P11369 LINE-1 retrotransposable element ORF2 protein (e value: 1.0e-04; identity: 42.5%)
Query:  LNQDQWASWLESPFSEEEIHKAIQSLGTLKSLGPDGMTNEFFKKSWNILKPDLVKAFQDFFEKGIIYKRTDETYICLIMK
        LNQDQ    L SP S +EI   I SL T KS GPDG + EF++     L P L K F     +G +     E  I LI K
Subjt:  LNQDQWASWLESPFSEEEIHKAIQSLGTLKSLGPDGMTNEFFKKSWNILKPDLVKAFQDFFEKGIIYKRTDETYICLIMK

P14381 Transposon TX1 uncharacterized 149 kDa protein (e value: 3.0e-04; identity: 30.21%)
Query:  SPFQFENIWLDHPNFLNQDQWASWLESPFSEEEIHKAIQSLGTLKSLGPDGMTNEFFKKSWNILKPDLVKAFQDFFEKGIIYKRTDETYICLIMKK
        SP   E +W   P  +  ++    LE+P + +E+ +A++ +   KS G DG+T EFF+  W+ L PD  +   + F+KG +        + L+ KK
Subjt:  SPFQFENIWLDHPNFLNQDQWASWLESPFSEEEIHKAIQSLGTLKSLGPDGMTNEFFKKSWNILKPDLVKAFQDFFEKGIIYKRTDETYICLIMKK

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGGATTTCAAAGGATGGAAAGCCTGTCGTCAACTTCTGCTGGATTTCACAGACGGCCTGCAAAAGATGGAAAGGATAAAGATCGAAGAACTGAAGACCAGTAGAAAATC
TTTTGTGGAGGTGTTAAAGGGTGAACCTTCTATGGCGAAAAATAGAAGAAAACCGAAGCCAAAAAACATTGAGCAACCCCACGACCTTTGGTTTAACGATGTGGGTAAGA
TGGAAGTTAGGGTAATTAACTGGGATGAGACAATAGTGGTTACTAGAAGGGATTTTCATGAAGATTGGGGGAGGATTTTAAACATTCTTCAGGAATATACTCAACATTCC
TATATTATAAACCCTTTCCAGCCCAATAAAGCCCTCCTTAAATGCTCCATAGGGGAGATGGCTAGACTACTATCGACAAACAGGGGATGGGTCACTTTTGGACCAATAAC
GGTGAAATTAGAATCCTGGAACCCTCATTTGTACGAGAGAATAAGTGTCATTCCTTCCTATGGGGGTTGGATCAAATTCAGAAACATTCCTCTGCATTTATGGAGTTTGG
CCACTTTTAAAGCTATTGGGGATTGTTACGGGGGATTTCTTGAGTATGATCAGGCCAACTCAAATCTCATTGAATGCGTGGAAGTGGCTATTAAAGTCAGGGGGAATTAC
TGTGGATTTATCCCTAGTGAAGTGAGACTGGTAGACAGGGAGCAACTCTTTATTGCTCAAGCCGTAACTTTTGAGAACCATAACTTGCTGATCAGTAAAGTCGTCGGAAA
ACATGGAGGTTTCACGACAGAAGTAGCAAGAAATTTTTACAGCGGGGAAGGAGCTTGGAGGGAAATCCTCGAAACACAGGTTACTGAATATGAAGAAGAGAGGGAGGACC
TGCCTTCGGACTTCCATGTCTGCTTCAGGGAGGAGGAAAACGCCATCAAACCAGCAGGCAACCATGAAGCTTTAGTCGAGAGCAAATCTGAAGGGGAGGGAAAACTGCCT
TTCTCAAGGAAAATGGTGAAATCCCTCAAGAAATGGAACATGTGTATCAGACCAATATCCTTGAAAGGAAATGTGGCGGCCAAGAGGAAAACCAGAGAGGTGACAACTCA
AATTCGTGCTTCGAAAAAAGAGATTACAACAGAAGGAACTAGCGAGGAACAGGGCCTTGATGGCTCATATGAAGATAGTCTCATGGAACGTGAGGGGGATCGGAGCTCCG
CCCAAAGAGCTCTCATTAAAAGCCTCTTTGTGAAAATTAACTCGAATGTTGCGTTGCTTCAGGAGACGAGGATGAGCTCGTCTGGTGGCATCCTTGTTATGTGGAAAGAA
GACAACATTTCGGTTGAGGAATCAATTATAGGTGAGTTCTCCATCTCGATTTCTTTCTCTTGTGATAACTATTTTAGCGGTTGGATTTCAGGAGTTTATAGTCCTGCTTC
AAACCACAGAAGAGATGTTTTTTGGCAAGAACTTGGGGATTTGGCTAGGTTATCTTCGGATTTTTGGTGTTTAGTTGGCGACTTTAATGTCGTCAGATGGACTTCTGAGA
AATCAAAGGGTGGCAGAGTCACCAGAAATATGAGAATTCTTAACGCCTTTATTGACAGATCTGAACTCTTCGACGTTCCCTTAAAGAATGGTATTTTCACTTTGTCTGAT
CTTAGGGAGGAACCTACCGCCACAAAAATCGACAGATTTTTGCCAAGCCTGGGCCCTTCCCCATTTCAGTTTGAGAACATATGGCTGGATCACCCAAATTTCTTGAATCA
GGATCAGTGGGCTTCTTGGCTTGAATCTCCATTTTCTGAGGAGGAAATTCACAAGGCTATTCAAAGTTTGGGCACTCTTAAATCTCTGGGTCCGGATGGGATGACAAACG
AATTTTTTAAAAAGTCTTGGAACATCTTGAAGCCTGACCTAGTAAAGGCGTTCCAAGATTTTTTTGAAAAGGGGATTATATATAAACGCACGGATGAGACTTACATATGC
TTGATCATGAAGAAACTAAGAGCCAACAAGGGGCTTCGAGTTGGCAAGGGAAGGGTTTCTATTACCCATCTGCAATATGCAGATGATACTATCTTCTTCAGCCCAGCTGA
CAGAATATTTCTCAAAAGTCGTGAGGAAGAAGCCCCTTGGATAAAGGTCATTATTAGTATTTATGGTATAGATATTAGAGGGTGGTGCACCCTCCCCCGAAAGGGAAAGC
TAGAGGCAGATTGTGATTTGTATAATATCCCGATGAACAGATTGCTACGGTTGCTCAATGCTGGAACTCCGAGGCTAACGATTGGATTTGGGTTTTAG
mRNA sequence
ATGGATTTCAAAGGATGGAAAGCCTGTCGTCAACTTCTGCTGGATTTCACAGACGGCCTGCAAAAGATGGAAAGGATAAAGATCGAAGAACTGAAGACCAGTAGAAAATC
TTTTGTGGAGGTGTTAAAGGGTGAACCTTCTATGGCGAAAAATAGAAGAAAACCGAAGCCAAAAAACATTGAGCAACCCCACGACCTTTGGTTTAACGATGTGGGTAAGA
TGGAAGTTAGGGTAATTAACTGGGATGAGACAATAGTGGTTACTAGAAGGGATTTTCATGAAGATTGGGGGAGGATTTTAAACATTCTTCAGGAATATACTCAACATTCC
TATATTATAAACCCTTTCCAGCCCAATAAAGCCCTCCTTAAATGCTCCATAGGGGAGATGGCTAGACTACTATCGACAAACAGGGGATGGGTCACTTTTGGACCAATAAC
GGTGAAATTAGAATCCTGGAACCCTCATTTGTACGAGAGAATAAGTGTCATTCCTTCCTATGGGGGTTGGATCAAATTCAGAAACATTCCTCTGCATTTATGGAGTTTGG
CCACTTTTAAAGCTATTGGGGATTGTTACGGGGGATTTCTTGAGTATGATCAGGCCAACTCAAATCTCATTGAATGCGTGGAAGTGGCTATTAAAGTCAGGGGGAATTAC
TGTGGATTTATCCCTAGTGAAGTGAGACTGGTAGACAGGGAGCAACTCTTTATTGCTCAAGCCGTAACTTTTGAGAACCATAACTTGCTGATCAGTAAAGTCGTCGGAAA
ACATGGAGGTTTCACGACAGAAGTAGCAAGAAATTTTTACAGCGGGGAAGGAGCTTGGAGGGAAATCCTCGAAACACAGGTTACTGAATATGAAGAAGAGAGGGAGGACC
TGCCTTCGGACTTCCATGTCTGCTTCAGGGAGGAGGAAAACGCCATCAAACCAGCAGGCAACCATGAAGCTTTAGTCGAGAGCAAATCTGAAGGGGAGGGAAAACTGCCT
TTCTCAAGGAAAATGGTGAAATCCCTCAAGAAATGGAACATGTGTATCAGACCAATATCCTTGAAAGGAAATGTGGCGGCCAAGAGGAAAACCAGAGAGGTGACAACTCA
AATTCGTGCTTCGAAAAAAGAGATTACAACAGAAGGAACTAGCGAGGAACAGGGCCTTGATGGCTCATATGAAGATAGTCTCATGGAACGTGAGGGGGATCGGAGCTCCG
CCCAAAGAGCTCTCATTAAAAGCCTCTTTGTGAAAATTAACTCGAATGTTGCGTTGCTTCAGGAGACGAGGATGAGCTCGTCTGGTGGCATCCTTGTTATGTGGAAAGAA
GACAACATTTCGGTTGAGGAATCAATTATAGGTGAGTTCTCCATCTCGATTTCTTTCTCTTGTGATAACTATTTTAGCGGTTGGATTTCAGGAGTTTATAGTCCTGCTTC
AAACCACAGAAGAGATGTTTTTTGGCAAGAACTTGGGGATTTGGCTAGGTTATCTTCGGATTTTTGGTGTTTAGTTGGCGACTTTAATGTCGTCAGATGGACTTCTGAGA
AATCAAAGGGTGGCAGAGTCACCAGAAATATGAGAATTCTTAACGCCTTTATTGACAGATCTGAACTCTTCGACGTTCCCTTAAAGAATGGTATTTTCACTTTGTCTGAT
CTTAGGGAGGAACCTACCGCCACAAAAATCGACAGATTTTTGCCAAGCCTGGGCCCTTCCCCATTTCAGTTTGAGAACATATGGCTGGATCACCCAAATTTCTTGAATCA
GGATCAGTGGGCTTCTTGGCTTGAATCTCCATTTTCTGAGGAGGAAATTCACAAGGCTATTCAAAGTTTGGGCACTCTTAAATCTCTGGGTCCGGATGGGATGACAAACG
AATTTTTTAAAAAGTCTTGGAACATCTTGAAGCCTGACCTAGTAAAGGCGTTCCAAGATTTTTTTGAAAAGGGGATTATATATAAACGCACGGATGAGACTTACATATGC
TTGATCATGAAGAAACTAAGAGCCAACAAGGGGCTTCGAGTTGGCAAGGGAAGGGTTTCTATTACCCATCTGCAATATGCAGATGATACTATCTTCTTCAGCCCAGCTGA
CAGAATATTTCTCAAAAGTCGTGAGGAAGAAGCCCCTTGGATAAAGGTCATTATTAGTATTTATGGTATAGATATTAGAGGGTGGTGCACCCTCCCCCGAAAGGGAAAGC
TAGAGGCAGATTGTGATTTGTATAATATCCCGATGAACAGATTGCTACGGTTGCTCAATGCTGGAACTCCGAGGCTAACGATTGGATTTGGGTTTTAG
Protein sequence
MDFKGWKACRQLLLDFTDGLQKMERIKIEELKTSRKSFVEVLKGEPSMAKNRRKPKPKNIEQPHDLWFNDVGKMEVRVINWDETIVVTRRDFHEDWGRILNILQEYTQHS
YIINPFQPNKALLKCSIGEMARLLSTNRGWVTFGPITVKLESWNPHLYERISVIPSYGGWIKFRNIPLHLWSLATFKAIGDCYGGFLEYDQANSNLIECVEVAIKVRGNY
CGFIPSEVRLVDREQLFIAQAVTFENHNLLISKVVGKHGGFTTEVARNFYSGEGAWREILETQVTEYEEEREDLPSDFHVCFREEENAIKPAGNHEALVESKSEGEGKLP
FSRKMVKSLKKWNMCIRPISLKGNVAAKRKTREVTTQIRASKKEITTEGTSEEQGLDGSYEDSLMEREGDRSSAQRALIKSLFVKINSNVALLQETRMSSSGGILVMWKE
DNISVEESIIGEFSISISFSCDNYFSGWISGVYSPASNHRRDVFWQELGDLARLSSDFWCLVGDFNVVRWTSEKSKGGRVTRNMRILNAFIDRSELFDVPLKNGIFTLSD
LREEPTATKIDRFLPSLGPSPFQFENIWLDHPNFLNQDQWASWLESPFSEEEIHKAIQSLGTLKSLGPDGMTNEFFKKSWNILKPDLVKAFQDFFEKGIIYKRTDETYIC
LIMKKLRANKGLRVGKGRVSITHLQYADDTIFFSPADRIFLKSREEEAPWIKVIISIYGIDIRGWCTLPRKGKLEADCDLYNIPMNRLLRLLNAGTPRLTIGFGF
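
The CDS and protein sequences above can be cross-checked by translating the CDS codon by codon. The following is a minimal Python sketch, assuming the standard nuclear genetic code (NCBI translation table 1); the record does not state which table was used, and `translate` is a hypothetical helper written for illustration. The two short inputs are the first 30 and the last 15 nucleotides of the CDS as listed above.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]  # KeyError on non-ACGT symbols
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 nt and last 15 nt of the CDS listed in this record.
cds_start = "ATGGATTTCAAAGGATGGAAAGCCTGTCGT"
cds_end = "GGATTTGGGTTTTAG"

print(translate(cds_start))  # compare with the start of the protein sequence
print(translate(cds_end))    # compare with the end of the protein sequence
```

To check the whole record, the CDS lines can be joined into one string and passed to `translate`, then compared with the joined protein lines. The sketch raises on ambiguous bases such as `N` rather than guessing a residue.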