; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0008490 (gene) of Snake gourd v1 genome

Gene IDTan0008490
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionMFP1 attachment factor 1-like
Genome locationLG02:82358180..82359060
RNA-Seq ExpressionTan0008490
SyntenyTan0008490
Gene Ontology termsGO:0000278 - mitotic cell cycle (biological process)
GO:0048527 - lateral root development (biological process)
GO:0005634 - nucleus (cellular component)
InterPro domainsIPR025265 - WPP domain
IPR038214 - WPP domain superfamily
IPR044692 - WPP domain-containing protein 1/2/3


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6598350.1 MFP1 attachment factor 1, partial [Cucurbita argyrosperma subsp. sororia]7.3e-5375.64Show/hide
Query:  MSDHDNSTPVDSPAEAESRQPEIKEPIKPPKPFGTKFSIWPPTQRTRDAVSSRLMETLSTPSILSKRYGTIPPDEAAAVAQGIEEEAYATADGSPATEDD
        MSD D+S PV    + ESR+PEIK+PIKP  PF TKFSIWPPTQRTRDAV+SRL+ETLSTPSILSKRYGTIPP+ AA +A+ IE+EAYATADGSP T DD
Subjt:  MSDHDNSTPVDSPAEAESRQPEIKEPIKPPKPFGTKFSIWPPTQRTRDAVSSRLMETLSTPSILSKRYGTIPPDEAAAVAQGIEEEAYATADGSPATEDD

Query:  GIEILQVYSKEISKRMLEAVKGRAIPA---ENGEKEGVTSPVAAPRDESPTLETET
        GIEILQ+YSKEISKRMLEAVKGRAIPA   ENGEKEG +SPV  P DE+PT E E+
Subjt:  GIEILQVYSKEISKRMLEAVKGRAIPA---ENGEKEGVTSPVAAPRDESPTLETET

XP_022961994.1 MFP1 attachment factor 1-like [Cucurbita moschata]5.6e-5375.64Show/hide
Query:  MSDHDNSTPVDSPAEAESRQPEIKEPIKPPKPFGTKFSIWPPTQRTRDAVSSRLMETLSTPSILSKRYGTIPPDEAAAVAQGIEEEAYATADGSPATEDD
        MSD D+S PV    + ESR+PEIK+PIKP  PF TKFSIWPPTQRTRDAV+SRL+ETLSTPSILSKRYGTIPP+ AA +A+ IE+EAYATADGSP T DD
Subjt:  MSDHDNSTPVDSPAEAESRQPEIKEPIKPPKPFGTKFSIWPPTQRTRDAVSSRLMETLSTPSILSKRYGTIPPDEAAAVAQGIEEEAYATADGSPATEDD

Query:  GIEILQVYSKEISKRMLEAVKGRAIPA---ENGEKEGVTSPVAAPRDESPTLETET
        GIEILQ+YSKEISKRMLEAVKGRAIPA   ENGEKEG +SPV  P DE+PT E E+
Subjt:  GIEILQVYSKEISKRMLEAVKGRAIPA---ENGEKEGVTSPVAAPRDESPTLETET

XP_022997483.1 MFP1 attachment factor 1-like [Cucurbita maxima]2.0e-5073.72Show/hide
Query:  MSDHDNSTPVDSPAEAESRQPEIKEPIKPPKPFGTKFSIWPPTQRTRDAVSSRLMETLSTPSILSKRYGTIPPDEAAAVAQGIEEEAYATADGSPATEDD
        MSD D+S PV    + ESR+PEIK+PIK   PF TKFSIWPPTQRTRDAV+SRL+ETLST SILSKRYGTIPP+ AA +A+ IE+EAYATADGSP T DD
Subjt:  MSDHDNSTPVDSPAEAESRQPEIKEPIKPPKPFGTKFSIWPPTQRTRDAVSSRLMETLSTPSILSKRYGTIPPDEAAAVAQGIEEEAYATADGSPATEDD

Query:  GIEILQVYSKEISKRMLEAVKGRAIPA---ENGEKEGVTSPVAAPRDESPTLETET
        GIEILQVYSK+ISKRMLEAVKGRAIPA   ENGEKEG +SPV  P DE+P  E E+
Subjt:  GIEILQVYSKEISKRMLEAVKGRAIPA---ENGEKEGVTSPVAAPRDESPTLETET

XP_023545637.1 MFP1 attachment factor 1-like [Cucurbita pepo subsp. pepo]9.6e-5375.64Show/hide
Query:  MSDHDNSTPVDSPAEAESRQPEIKEPIKPPKPFGTKFSIWPPTQRTRDAVSSRLMETLSTPSILSKRYGTIPPDEAAAVAQGIEEEAYATADGSPATEDD
        MSD D+S PV    + ESR+PEIK+PIKP  PF TKFSIWPPTQRTRDAV+SRL+ETLSTPSILSKRYGTIPP+ A  +A+ IE+EAYATADGSP T DD
Subjt:  MSDHDNSTPVDSPAEAESRQPEIKEPIKPPKPFGTKFSIWPPTQRTRDAVSSRLMETLSTPSILSKRYGTIPPDEAAAVAQGIEEEAYATADGSPATEDD

Query:  GIEILQVYSKEISKRMLEAVKGRAIPA---ENGEKEGVTSPVAAPRDESPTLETET
        GIEILQVYSKEISKRMLEAVKGRAIPA   ENGEKEG +SPV  P DE+PT E E+
Subjt:  GIEILQVYSKEISKRMLEAVKGRAIPA---ENGEKEGVTSPVAAPRDESPTLETET

XP_038885619.1 MFP1 attachment factor 1-like [Benincasa hispida]2.2e-4973.72Show/hide
Query:  MSDHDNSTPVDSPAEAESRQPEIKEPIKPPKPFGTKFSIWPPTQRTRDAVSSRLMETLSTPSILSKRYGTIPPDEAAAVAQGIEEEAYATADGSPATEDD
        MSD DNS         +    EIK+P KPPKPF TKFSIWPPTQRTRDAV SRL+ETLSTPSILSKR+GT+PP+EAAAVA+ IE+EAYATA+GSP T DD
Subjt:  MSDHDNSTPVDSPAEAESRQPEIKEPIKPPKPFGTKFSIWPPTQRTRDAVSSRLMETLSTPSILSKRYGTIPPDEAAAVAQGIEEEAYATADGSPATEDD

Query:  GIEILQVYSKEISKRMLEAVKGRAI---PAENGEKEGVTSPVAAPRDESPTLETET
        GIEILQVYSKEISKRMLEAVKGRAI   PAENGE EGV SPV  P+DE+ TLE+E+
Subjt:  GIEILQVYSKEISKRMLEAVKGRAI---PAENGEKEGVTSPVAAPRDESPTLETET

TrEMBL top hitse value%identityAlignment
A0A1S3BBH7 MFP1 attachment factor 11.2e-4570.51Show/hide
Query:  MSDHDNSTPVDSPAEAESRQPEIKEPIKPPKPFGTKFSIWPPTQRTRDAVSSRLMETLSTPSILSKRYGTIPPDEAAAVAQGIEEEAYATADGSPATEDD
        MSD +NS    S  +      + K+PIKPP PF TKFSIWPPTQRTRDAV SRL+ETLSTPSILSKR+G IPPDEAA VA+ IEEEAYA+A+GSP + DD
Subjt:  MSDHDNSTPVDSPAEAESRQPEIKEPIKPPKPFGTKFSIWPPTQRTRDAVSSRLMETLSTPSILSKRYGTIPPDEAAAVAQGIEEEAYATADGSPATEDD

Query:  GIEILQVYSKEISKRMLEAVKGR---AIPAENGEKEGVTSPVAAPRDESPTLETET
        G+EILQVYSKEISKRMLEAVKGR   A PAENGE E V SPV  P  E+PTLETE+
Subjt:  GIEILQVYSKEISKRMLEAVKGR---AIPAENGEKEGVTSPVAAPRDESPTLETET

A0A5A7V571 MFP1 attachment factor 11.2e-4570.51Show/hide
Query:  MSDHDNSTPVDSPAEAESRQPEIKEPIKPPKPFGTKFSIWPPTQRTRDAVSSRLMETLSTPSILSKRYGTIPPDEAAAVAQGIEEEAYATADGSPATEDD
        MSD +NS    S  +      + K+PIKPP PF TKFSIWPPTQRTRDAV SRL+ETLSTPSILSKR+G IPPDEAA VA+ IEEEAYA+A+GSP + DD
Subjt:  MSDHDNSTPVDSPAEAESRQPEIKEPIKPPKPFGTKFSIWPPTQRTRDAVSSRLMETLSTPSILSKRYGTIPPDEAAAVAQGIEEEAYATADGSPATEDD

Query:  GIEILQVYSKEISKRMLEAVKGR---AIPAENGEKEGVTSPVAAPRDESPTLETET
        G+EILQVYSKEISKRMLEAVKGR   A PAENGE E V SPV  P  E+PTLETE+
Subjt:  GIEILQVYSKEISKRMLEAVKGR---AIPAENGEKEGVTSPVAAPRDESPTLETET

A0A6J1BSC5 MFP1 attachment factor 1-like1.4e-4475.71Show/hide
Query:  AEAESRQPEIKEPIKPPKPFGTKFSIWPPTQRTRDAVSSRLMETLSTPSILSKRYGTIPPDEAAAVAQGIEEEAYATADGSPATEDDGIEILQVYSKEIS
        A +   + E KEP + PKPFGTKFSIWPPTQRTRDAV SRL+ETLSTPSILSKRYGTIPPDEAAA A+ IEEEAY  A  SPATEDDGIEILQVYSKEIS
Subjt:  AEAESRQPEIKEPIKPPKPFGTKFSIWPPTQRTRDAVSSRLMETLSTPSILSKRYGTIPPDEAAAVAQGIEEEAYATADGSPATEDDGIEILQVYSKEIS

Query:  KRMLEAVKGRAI---PAENGEKEGVTSPVAAPRDESPTLE
        +RMLE VKGRAI   PAENGE E  T PV AP +E+PT E
Subjt:  KRMLEAVKGRAI---PAENGEKEGVTSPVAAPRDESPTLE

A0A6J1HFL2 MFP1 attachment factor 1-like2.7e-5375.64Show/hide
Query:  MSDHDNSTPVDSPAEAESRQPEIKEPIKPPKPFGTKFSIWPPTQRTRDAVSSRLMETLSTPSILSKRYGTIPPDEAAAVAQGIEEEAYATADGSPATEDD
        MSD D+S PV    + ESR+PEIK+PIKP  PF TKFSIWPPTQRTRDAV+SRL+ETLSTPSILSKRYGTIPP+ AA +A+ IE+EAYATADGSP T DD
Subjt:  MSDHDNSTPVDSPAEAESRQPEIKEPIKPPKPFGTKFSIWPPTQRTRDAVSSRLMETLSTPSILSKRYGTIPPDEAAAVAQGIEEEAYATADGSPATEDD

Query:  GIEILQVYSKEISKRMLEAVKGRAIPA---ENGEKEGVTSPVAAPRDESPTLETET
        GIEILQ+YSKEISKRMLEAVKGRAIPA   ENGEKEG +SPV  P DE+PT E E+
Subjt:  GIEILQVYSKEISKRMLEAVKGRAIPA---ENGEKEGVTSPVAAPRDESPTLETET

A0A6J1KE17 MFP1 attachment factor 1-like9.7e-5173.72Show/hide
Query:  MSDHDNSTPVDSPAEAESRQPEIKEPIKPPKPFGTKFSIWPPTQRTRDAVSSRLMETLSTPSILSKRYGTIPPDEAAAVAQGIEEEAYATADGSPATEDD
        MSD D+S PV    + ESR+PEIK+PIK   PF TKFSIWPPTQRTRDAV+SRL+ETLST SILSKRYGTIPP+ AA +A+ IE+EAYATADGSP T DD
Subjt:  MSDHDNSTPVDSPAEAESRQPEIKEPIKPPKPFGTKFSIWPPTQRTRDAVSSRLMETLSTPSILSKRYGTIPPDEAAAVAQGIEEEAYATADGSPATEDD

Query:  GIEILQVYSKEISKRMLEAVKGRAIPA---ENGEKEGVTSPVAAPRDESPTLETET
        GIEILQVYSK+ISKRMLEAVKGRAIPA   ENGEKEG +SPV  P DE+P  E E+
Subjt:  GIEILQVYSKEISKRMLEAVKGRAIPA---ENGEKEGVTSPVAAPRDESPTLETET

SwissProt top hitse value%identityAlignment
Q0WQ91 WPP domain-containing protein 32.0e-1648.57Show/hide
Query:  RQPEIKEPIKPPKPFGTKFSIWPPTQRTRDAVSSRLMETLSTPSILSKRYGTIPPDEAAAVAQGIEEEAYATADGSPATEDDGIEILQVYSKEISKRMLE
        R     E  K  K  G  FS+WPP Q++RD V + +++TLST SILS +YGTI P+EA+AVA+ IEE+AY  A  S     DGI+ L+VY  E S+RM+E
Subjt:  RQPEIKEPIKPPKPFGTKFSIWPPTQRTRDAVSSRLMETLSTPSILSKRYGTIPPDEAAAVAQGIEEEAYATADGSPATEDDGIEILQVYSKEISKRMLE

Query:  AVKGR
        + + R
Subjt:  AVKGR

Q9C500 WPP domain-containing protein 26.1e-2651.41Show/hide
Query:  EAESRQPEI-KEPIKPPKPFGTKFSIWPPTQRTRDAVSSRLMETLSTPSILSKRYGTIPPDEAAAVAQGIEEEAYATADGSPATEDDGIEILQVYSKEIS
        EA S+  ++ KE     KP G    IWPPTQ+TRDAV +RL+ETLST SILSKRYGT+  D+A  VA+ IEEEAY  A  + +++DDGI+IL++YSKEIS
Subjt:  EAESRQPEI-KEPIKPPKPFGTKFSIWPPTQRTRDAVSSRLMETLSTPSILSKRYGTIPPDEAAAVAQGIEEEAYATADGSPATEDDGIEILQVYSKEIS

Query:  KRMLEAVKGRA-IPAENGEKEGVTSPVA--APRDESPTLETE
        KRMLE+VK R+     NG  E   +  +  +  D  P  E E
Subjt:  KRMLEAVKGRA-IPAENGEKEGVTSPVA--APRDESPTLETE

Q9FMH6 WPP domain-containing protein 11.5e-2450.35Show/hide
Query:  NSTPVDSPAEAESRQPEIKEPIKPPKPFGTKFSIWPPTQRTRDAVSSRLMETLSTPSILSKRYGTIPPDEAAAVAQGIEEEAYATADGSPATEDDGIEIL
        +S P  S  E  +  P   E  K P P      IWPPTQ+TRDAV +RL+ETLST SILSKR+G++  +EA++VA+ IE+EAYA A  +   +DDGIEIL
Subjt:  NSTPVDSPAEAESRQPEIKEPIKPPKPFGTKFSIWPPTQRTRDAVSSRLMETLSTPSILSKRYGTIPPDEAAAVAQGIEEEAYATADGSPATEDDGIEIL

Query:  QVYSKEISKRMLEAVKGRAIPAENGEK--EGVTSPVAAPRDES
        + YSKEISKRMLE+VK ++  A    K  +G+ S V +  D S
Subjt:  QVYSKEISKRMLEAVKGRAIPAENGEK--EGVTSPVAAPRDES

Q9LE82 RAN GTPase-activating protein 11.2e-0839.18Show/hide
Query:  IWPPTQRTRDAVSSRLMETLSTPSILSKRYGTIPPDEAAAVAQGIEEEAYATADGSPATE--DDGIEILQVYSKEISKRMLEAVKGRAIPAENGEKE
        +WPP++ TR  +  R+ + ++TPSI S++YG +  +EA   A+ IE+ A+ATA+     E   DG   + VY+KE SK ML+ +K    P E  E E
Subjt:  IWPPTQRTRDAVSSRLMETLSTPSILSKRYGTIPPDEAAAVAQGIEEEAYATADGSPATE--DDGIEILQVYSKEISKRMLEAVKGRAIPAENGEKE

Q9M7N6 MFP1 attachment factor 12.2e-3163.79Show/hide
Query:  KPFGTKFSIWPPTQRTRDAVSSRLMETLSTPSILSKRYGTIPPDEAAAVAQGIEEEAYATADGSPATEDDGIEILQVYSKEISKRMLEAVKGRAIPAENG
        KP  T FSIWPPTQRTRDAV +RL+E+LSTPSILSKRYGT+P DEA+  A+ IEEEA+A A  + +  DDGIEILQVYSKEISKRM++ VK R+ PA   
Subjt:  KPFGTKFSIWPPTQRTRDAVSSRLMETLSTPSILSKRYGTIPPDEAAAVAQGIEEEAYATADGSPATEDDGIEILQVYSKEISKRMLEAVKGRAIPAENG

Query:  EKEGVTSPVAAPRDES
          EG + P   P D S
Subjt:  EKEGVTSPVAAPRDES

Arabidopsis top hitse value%identityAlignment
AT1G47200.1 WPP domain protein 24.3e-2751.41Show/hide
Query:  EAESRQPEI-KEPIKPPKPFGTKFSIWPPTQRTRDAVSSRLMETLSTPSILSKRYGTIPPDEAAAVAQGIEEEAYATADGSPATEDDGIEILQVYSKEIS
        EA S+  ++ KE     KP G    IWPPTQ+TRDAV +RL+ETLST SILSKRYGT+  D+A  VA+ IEEEAY  A  + +++DDGI+IL++YSKEIS
Subjt:  EAESRQPEI-KEPIKPPKPFGTKFSIWPPTQRTRDAVSSRLMETLSTPSILSKRYGTIPPDEAAAVAQGIEEEAYATADGSPATEDDGIEILQVYSKEIS

Query:  KRMLEAVKGRA-IPAENGEKEGVTSPVA--APRDESPTLETE
        KRMLE+VK R+     NG  E   +  +  +  D  P  E E
Subjt:  KRMLEAVKGRA-IPAENGEKEGVTSPVA--APRDESPTLETE

AT3G63130.1 RAN GTPase activating protein 18.2e-1039.18Show/hide
Query:  IWPPTQRTRDAVSSRLMETLSTPSILSKRYGTIPPDEAAAVAQGIEEEAYATADGSPATE--DDGIEILQVYSKEISKRMLEAVKGRAIPAENGEKE
        +WPP++ TR  +  R+ + ++TPSI S++YG +  +EA   A+ IE+ A+ATA+     E   DG   + VY+KE SK ML+ +K    P E  E E
Subjt:  IWPPTQRTRDAVSSRLMETLSTPSILSKRYGTIPPDEAAAVAQGIEEEAYATADGSPATE--DDGIEILQVYSKEISKRMLEAVKGRAIPAENGEKE

AT3G63130.2 RAN GTPase activating protein 18.2e-1039.18Show/hide
Query:  IWPPTQRTRDAVSSRLMETLSTPSILSKRYGTIPPDEAAAVAQGIEEEAYATADGSPATE--DDGIEILQVYSKEISKRMLEAVKGRAIPAENGEKE
        +WPP++ TR  +  R+ + ++TPSI S++YG +  +EA   A+ IE+ A+ATA+     E   DG   + VY+KE SK ML+ +K    P E  E E
Subjt:  IWPPTQRTRDAVSSRLMETLSTPSILSKRYGTIPPDEAAAVAQGIEEEAYATADGSPATE--DDGIEILQVYSKEISKRMLEAVKGRAIPAENGEKE

AT5G27940.1 WPP domain protein 31.4e-1748.57Show/hide
Query:  RQPEIKEPIKPPKPFGTKFSIWPPTQRTRDAVSSRLMETLSTPSILSKRYGTIPPDEAAAVAQGIEEEAYATADGSPATEDDGIEILQVYSKEISKRMLE
        R     E  K  K  G  FS+WPP Q++RD V + +++TLST SILS +YGTI P+EA+AVA+ IEE+AY  A  S     DGI+ L+VY  E S+RM+E
Subjt:  RQPEIKEPIKPPKPFGTKFSIWPPTQRTRDAVSSRLMETLSTPSILSKRYGTIPPDEAAAVAQGIEEEAYATADGSPATEDDGIEILQVYSKEISKRMLE

Query:  AVKGR
        + + R
Subjt:  AVKGR

AT5G43070.1 WPP domain protein 11.1e-2550.35Show/hide
Query:  NSTPVDSPAEAESRQPEIKEPIKPPKPFGTKFSIWPPTQRTRDAVSSRLMETLSTPSILSKRYGTIPPDEAAAVAQGIEEEAYATADGSPATEDDGIEIL
        +S P  S  E  +  P   E  K P P      IWPPTQ+TRDAV +RL+ETLST SILSKR+G++  +EA++VA+ IE+EAYA A  +   +DDGIEIL
Subjt:  NSTPVDSPAEAESRQPEIKEPIKPPKPFGTKFSIWPPTQRTRDAVSSRLMETLSTPSILSKRYGTIPPDEAAAVAQGIEEEAYATADGSPATEDDGIEIL

Query:  QVYSKEISKRMLEAVKGRAIPAENGEK--EGVTSPVAAPRDES
        + YSKEISKRMLE+VK ++  A    K  +G+ S V +  D S
Subjt:  QVYSKEISKRMLEAVKGRAIPAENGEK--EGVTSPVAAPRDES


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTGATCACGACAACTCCACTCCCGTCGATTCTCCGGCGGAAGCAGAATCCCGACAGCCGGAAATCAAAGAACCCATTAAACCGCCCAAGCCATTTGGTACCAAATT
CAGCATATGGCCACCAACGCAGCGCACTCGCGACGCCGTTAGCAGCCGTCTGATGGAAACCCTCTCCACCCCTTCCATTCTCTCTAAGCGTTACGGCACGATTCCGCCGG
ACGAGGCCGCCGCCGTCGCGCAGGGCATCGAGGAAGAGGCCTATGCCACCGCCGATGGCTCTCCGGCCACCGAGGACGACGGCATCGAGATTCTCCAGGTCTACTCCAAG
GAGATCAGTAAGCGGATGCTTGAGGCGGTGAAGGGTCGAGCGATTCCGGCGGAAAATGGTGAGAAGGAGGGAGTAACATCGCCTGTTGCTGCACCTAGAGATGAAAGCCC
AACATTGGAAACTGAAACTTGA
mRNA sequenceShow/hide mRNA sequence
AAAAAAATAGAGATATCGAGGTGAAACATTCTCGTCATCACCATCTTCACCGCACATCGTGATCTTCAATTCTAGAAGGGCGATCCGTCGTTTTAACTTCGCCCACAAAT
AAACCCTCATATGGCAACCTAACTGATCATATTCTTCCCCCATTCCACCAATCTCCCGTATTTAATTCAGATCTCATTTTCCTTAATCCCAGATTCAGATTCAGATTCCC
CCTTCACGGCCATGTCTGATCACGACAACTCCACTCCCGTCGATTCTCCGGCGGAAGCAGAATCCCGACAGCCGGAAATCAAAGAACCCATTAAACCGCCCAAGCCATTT
GGTACCAAATTCAGCATATGGCCACCAACGCAGCGCACTCGCGACGCCGTTAGCAGCCGTCTGATGGAAACCCTCTCCACCCCTTCCATTCTCTCTAAGCGTTACGGCAC
GATTCCGCCGGACGAGGCCGCCGCCGTCGCGCAGGGCATCGAGGAAGAGGCCTATGCCACCGCCGATGGCTCTCCGGCCACCGAGGACGACGGCATCGAGATTCTCCAGG
TCTACTCCAAGGAGATCAGTAAGCGGATGCTTGAGGCGGTGAAGGGTCGAGCGATTCCGGCGGAAAATGGTGAGAAGGAGGGAGTAACATCGCCTGTTGCTGCACCTAGA
GATGAAAGCCCAACATTGGAAACTGAAACTTGATCTTAATGTTATTGTATTTGGGGGTATTGTTTGTTATTGCCCTGTTTGAGCTTCTTTTACTGGAAATCTGCATGTCC
ACTGTCCACACATTACTTACAAGATAATTGTTGCTTACAAGATAATTATTGTTCTTCTTGTGTTCTATGGCCTTAAATGCAATGAAAAATAGAAGCCTTTGTTTTTGCCC
A
Protein sequenceShow/hide protein sequence
MSDHDNSTPVDSPAEAESRQPEIKEPIKPPKPFGTKFSIWPPTQRTRDAVSSRLMETLSTPSILSKRYGTIPPDEAAAVAQGIEEEAYATADGSPATEDDGIEILQVYSK
EISKRMLEAVKGRAIPAENGEKEGVTSPVAAPRDESPTLETET