CuGenDBv2

Tan0003468 (gene) of Snake gourd v1 genome

Gene ID: Tan0003468
Organism: Trichosanthes anguina (Snake gourd v1)
Description: DUF1685 domain-containing protein
Genome location: LG05:76520144..76521260
RNA-Seq Expression: Tan0003468
Synteny: Tan0003468
Gene Ontology terms: NA
InterPro domains: IPR012881 - Protein of unknown function DUF1685
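
The genome location string above can be split into a linkage group and a coordinate pair. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (as in GFF-style annotation); the names `Location` and `parse_location` are hypothetical helpers for illustration, not part of CuGenDB.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed 'seqid:start..end' location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumes 1-based, inclusive coordinates, so both endpoints count.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a location string such as 'LG05:76520144..76521260'."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.groups()
    return Location(seqid, int(start), int(end))


loc = parse_location("LG05:76520144..76521260")
print(loc.seqid, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption, the span printed here is the length of the annotated gene region in base pairs.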


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAG6583944.1 hypothetical protein SDJN03_19876, partial [Cucurbita argyrosperma subsp. sororia] | e value: 7.1e-87 | % identity: 76.54
Query:  MDVEQVLNLFDSFWFEREIFNSHPFSSKPQNPKPEN----PLKNSLPPEEPFVPRIRTRSISEDLSSKLSFMSNSNSPDSVLFSPKLQTILSSNDIAETE
        MDVEQVLNLFDS WFEREIFN HPF + PQNP+PEN    PLKNS PPEEPFVPRI  RSISEDLSSKL+FMS+S+SPDSVLFSPKLQTILSS DIA  E
Subjt:  MDVEQVLNLFDSFWFEREIFNSHPFSSKPQNPKPEN----PLKNSLPPEEPFVPRIRTRSISEDLSSKLSFMSNSNSPDSVLFSPKLQTILSSNDIAETE

Query:  SPENDRRGIGRRRKTESRNRVRGRRRRGPESKSLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGKREEEEEQEPEEQEEFNGEISRPYLSEA
         PE  RR + R+++ +SR R+ GR  R  ES+SLSELEFEELKGFMDLGFVFSE DK SSLA IVPGLNRLGKR+EEE       EE  G ISRPYLSEA
Subjt:  SPENDRRGIGRRRKTESRNRVRGRRRRGPESKSLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGKREEEEEQEPEEQEEFNGEISRPYLSEA

Query:  WEAMEKEEELKKPLMMKWRFPANEIDMKDNLKWWAHAVASTVR
        W AME+EEELKK L+MKWR PANEIDMKDNLKWWAHAVASTVR
Subjt:  WEAMEKEEELKKPLMMKWRFPANEIDMKDNLKWWAHAVASTVR

XP_022927119.1 uncharacterized protein LOC111434058 [Cucurbita moschata] | e value: 7.9e-86 | % identity: 75.72
Query:  MDVEQVLNLFDSFWFEREIFNSHPFSSKPQNPKPEN----PLKNSLPPEEPFVPRIRTRSISEDLSSKLSFMSNSNSPDSVLFSPKLQTILSSNDIAETE
        MDVEQVL+LFDS WFEREIFN HPF + PQNP+PEN    PLKNS PPEEPFVPRI  RSISEDLSSKL+FMS+S+SPDSVLFSPKLQTILSS +IA  E
Subjt:  MDVEQVLNLFDSFWFEREIFNSHPFSSKPQNPKPEN----PLKNSLPPEEPFVPRIRTRSISEDLSSKLSFMSNSNSPDSVLFSPKLQTILSSNDIAETE

Query:  SPENDRRGIGRRRKTESRNRVRGRRRRGPESKSLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGKREEEEEQEPEEQEEFNGEISRPYLSEA
         PE  RR +  R+K + R R+ GR  RG ES+SLSELEFEE+KGFMDLGFVFSE DK SSLA IVPGLNRLGKR+EEE       EE  G ISRPYLSEA
Subjt:  SPENDRRGIGRRRKTESRNRVRGRRRRGPESKSLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGKREEEEEQEPEEQEEFNGEISRPYLSEA

Query:  WEAMEKEEELKKPLMMKWRFPANEIDMKDNLKWWAHAVASTVR
        W AME+EEELKK L+MKWR PANEIDMKDNLKWWAHAVASTVR
Subjt:  WEAMEKEEELKKPLMMKWRFPANEIDMKDNLKWWAHAVASTVR

XP_023001638.1 uncharacterized protein LOC111495710 [Cucurbita maxima] | e value: 1.6e-91 | % identity: 78.6
Query:  MDVEQVLNLFDSFWFEREIFNSHPFSSKPQNPKPEN----PLKNSLPPEEPFVPRIRTRSISEDLSSKLSFMSNSNSPDSVLFSPKLQTILSSNDIAETE
        MDVEQVLNLFDSFWFEREIFN HPF + PQNP+PEN    PLKNS PPEEPFVPRI  RSISEDLSSKL+FMS+S+SPDSVLFSPKLQTILSS DIA  E
Subjt:  MDVEQVLNLFDSFWFEREIFNSHPFSSKPQNPKPEN----PLKNSLPPEEPFVPRIRTRSISEDLSSKLSFMSNSNSPDSVLFSPKLQTILSSNDIAETE

Query:  SPENDRRGIGRRRKTESRNRVRGRRRRGPESKSLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGKREEEEEQEPEEQEEFNGEISRPYLSEA
         PE  RR +  R+K + R R+ GR  RG ES+SLSELEFEELKGFMDLGFVFSE DK SSLA IVPGLNRLGKR+EEEE+E EE+EE  G ISRPYLSEA
Subjt:  SPENDRRGIGRRRKTESRNRVRGRRRRGPESKSLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGKREEEEEQEPEEQEEFNGEISRPYLSEA

Query:  WEAMEKEEELKKPLMMKWRFPANEIDMKDNLKWWAHAVASTVR
        W AME+EEE+KK L+MKWR PANEIDMKDNLKWWAHAVASTVR
Subjt:  WEAMEKEEELKKPLMMKWRFPANEIDMKDNLKWWAHAVASTVR

XP_023520224.1 uncharacterized protein LOC111783529 [Cucurbita pepo subsp. pepo] | e value: 1.3e-85 | % identity: 75.31
Query:  MDVEQVLNLFDSFWFEREIFNSHPFSSKPQNPKPEN----PLKNSLPPEEPFVPRIRTRSISEDLSSKLSFMSNSNSPDSVLFSPKLQTILSSNDIAETE
        MDVEQVLNLFDSFWFEREIFN HPF + PQNP+PEN    PL+NS PPEE FVPRI  RSISEDLSSKL+FMS+S+SPDSVLFSPKLQTILSS D+   E
Subjt:  MDVEQVLNLFDSFWFEREIFNSHPFSSKPQNPKPEN----PLKNSLPPEEPFVPRIRTRSISEDLSSKLSFMSNSNSPDSVLFSPKLQTILSSNDIAETE

Query:  SPENDRRGIGRRRKTESRNRVRGRRRRGPESKSLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGKREEEEEQEPEEQEEFNGEISRPYLSEA
        +PE  RR +  R+K + R R+ GR  RG ES+SLSELEFEE+KGFMDLGFVFSE DK SSLA IVPGLNRLGKR+EEE       EE  G ISRPYLSEA
Subjt:  SPENDRRGIGRRRKTESRNRVRGRRRRGPESKSLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGKREEEEEQEPEEQEEFNGEISRPYLSEA

Query:  WEAMEKEEELKKPLMMKWRFPANEIDMKDNLKWWAHAVASTVR
        W AME+EEELKK L+MKWR PANEIDMKDNLKWWAHAVASTVR
Subjt:  WEAMEKEEELKKPLMMKWRFPANEIDMKDNLKWWAHAVASTVR

XP_038895996.1 uncharacterized protein LOC120084174 [Benincasa hispida] | e value: 3.3e-92 | % identity: 80.08
Query:  MDVEQVLNLFDSFWFEREIFNSHPFSSKPQNPKPENPLKNSLPPEEPFVPRIRTRSISEDLSSKLSFMSNSNSPDSVLFSPKLQTILSSNDIAETESPEN
        MD EQ+LNLFDSFWFE EIFN HPF S PQNP+PEN   NSLP E   VPR+RTRSISEDLSSKLSFMSNSNSPDSVLFSPKLQTI SS DIA  ESPEN
Subjt:  MDVEQVLNLFDSFWFEREIFNSHPFSSKPQNPKPENPLKNSLPPEEPFVPRIRTRSISEDLSSKLSFMSNSNSPDSVLFSPKLQTILSSNDIAETESPEN

Query:  DRR-GIGRRRKTESRNRVRGRRRRGPESKSLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGKREEEEEQEPEEQEEFNGEISRPYLSEAWEA
         R+ GI RR KTESR ++RGRR R  ES+SLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGK EEE+ +E EE+ +  GEISRPYLSEAWEA
Subjt:  DRR-GIGRRRKTESRNRVRGRRRRGPESKSLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGKREEEEEQEPEEQEEFNGEISRPYLSEAWEA

Query:  M-EKEEELKKPLMMKWRFPANEIDMKDNLKWWAHAVASTVR
        M E+EEELK PL MKW+FP+N+IDMKDNLKWWAHAVASTVR
Subjt:  M-EKEEELKKPLMMKWRFPANEIDMKDNLKWWAHAVASTVR

TrEMBL top hits (columns: e value, % identity, alignment)
A0A0A0LV26 Uncharacterized protein | e value: 7.2e-85 | % identity: 72.11
Query:  MDVEQVLNLFDSFWFEREIFNSHPFSSKPQNPKPENPLKNSLPPEEPFVPRIRTRSISEDLSSKLSFMSNSNSPDSVLFSPKLQTILSSNDIAETESPEN
        MD + +LNLFDSFWF+R++ N+HPF S PQ  +P+    + LP E   +PR+RTRSISEDLSSKLSFMSNSNSPDSVL SPKLQTI SS DIA  ESPE 
Subjt:  MDVEQVLNLFDSFWFEREIFNSHPFSSKPQNPKPENPLKNSLPPEEPFVPRIRTRSISEDLSSKLSFMSNSNSPDSVLFSPKLQTILSSNDIAETESPEN

Query:  DRR-GIGRRRKTESRNRVRGRRRRGPESKSLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGKRE-------EEEEQEPEEQEEFNGEISRPY
          +  I RR KTE R R+RGRR R  ES+SLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGKRE       EEEE+E EE+ +  GEISRPY
Subjt:  DRR-GIGRRRKTESRNRVRGRRRRGPESKSLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGKRE-------EEEEQEPEEQEEFNGEISRPY

Query:  LSEAWEAM----EKEEELKKPLMMKWRFPANEIDMKDNLKWWAHAVASTVR
        LSEAWEA+    EKEE LK+PLMMKWRFP+N+IDMKDNLKWWAHAVASTVR
Subjt:  LSEAWEAM----EKEEELKKPLMMKWRFPANEIDMKDNLKWWAHAVASTVR

A0A5A7UPL7 DUF1685 domain-containing protein | e value: 3.2e-85 | % identity: 74.7
Query:  MDVEQVLNLFDSFWFEREIFNSHPFSSKPQNPKPENPLKNSLPPEEPFVPRIRTRSISEDLSSKLSFMSNSNSPDSVLFSPKLQTILSSNDIAETESPEN
        MD EQ+LNLFDSFWFER +FN HPF S  Q P+ ++P  +SLP E   +PR+ TRSISEDLSSKLSFMS+SNSPDSVLFSPKLQTI SS DIA  ESPE 
Subjt:  MDVEQVLNLFDSFWFEREIFNSHPFSSKPQNPKPENPLKNSLPPEEPFVPRIRTRSISEDLSSKLSFMSNSNSPDSVLFSPKLQTILSSNDIAETESPEN

Query:  DRR-GIGRRRKTESRNRVRGRRRRGPESKSLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGKRE-----EEEEQEPEEQEEFNGEISRPYLS
         R+  I RR KTE R R RGRR R  ES+SLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGK+E     EEEE+E EE+ +  GEISRPYLS
Subjt:  DRR-GIGRRRKTESRNRVRGRRRRGPESKSLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGKRE-----EEEEQEPEEQEEFNGEISRPYLS

Query:  EAWEAMEKEEE----LKKPLMMKWRFPANEIDMKDNLKWWAHAVASTVR
        EAWEAME+EEE     KKPLMMKWRFP+N+IDMKDNLKWWAHAVASTVR
Subjt:  EAWEAMEKEEE----LKKPLMMKWRFPANEIDMKDNLKWWAHAVASTVR

A0A6J1CEZ9 uncharacterized protein LOC111010901 | e value: 4.2e-85 | % identity: 74.69
Query:  MDVEQVLNLFDSFWFEREIFNSHPFSSKPQNPKPENPLKNSLPPEEPFVPRIRTRSISEDLSSKLSFMSNSNSPDSVLFSPKLQTILSSNDIAETESPEN
        MDVEQ+LNLFDS WFER IFN     S P+    E PLKNS PP    +PRI TRSISEDLSSKLSFMSNSNSPDS+LFSPKLQTI SS DIA TESPEN
Subjt:  MDVEQVLNLFDSFWFEREIFNSHPFSSKPQNPKPENPLKNSLPPEEPFVPRIRTRSISEDLSSKLSFMSNSNSPDSVLFSPKLQTILSSNDIAETESPEN

Query:  DRRGIGRRRKTESR----NRVRGRRRRGPESKSLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGKREEEEEQEPEEQEEFNGEISRPYLSEA
        +R+ I    +TE R     R RGRRRR PESKSLSELEFEELKGFMDLGFVFSEEDK SSLASI+PGLNRL K+E EEE+E EE     GEISRPYLSEA
Subjt:  DRRGIGRRRKTESR----NRVRGRRRRGPESKSLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGKREEEEEQEPEEQEEFNGEISRPYLSEA

Query:  WEAMEKEEELKKPLMMKWRFPA--NEIDMKDNLKWWAHAVASTVR
        WEAMEKE ELKKP +M+W FPA  NEIDMKDNLKWWAH VASTVR
Subjt:  WEAMEKEEELKKPLMMKWRFPA--NEIDMKDNLKWWAHAVASTVR

A0A6J1EH41 uncharacterized protein LOC111434058 | e value: 3.8e-86 | % identity: 75.72
Query:  MDVEQVLNLFDSFWFEREIFNSHPFSSKPQNPKPEN----PLKNSLPPEEPFVPRIRTRSISEDLSSKLSFMSNSNSPDSVLFSPKLQTILSSNDIAETE
        MDVEQVL+LFDS WFEREIFN HPF + PQNP+PEN    PLKNS PPEEPFVPRI  RSISEDLSSKL+FMS+S+SPDSVLFSPKLQTILSS +IA  E
Subjt:  MDVEQVLNLFDSFWFEREIFNSHPFSSKPQNPKPEN----PLKNSLPPEEPFVPRIRTRSISEDLSSKLSFMSNSNSPDSVLFSPKLQTILSSNDIAETE

Query:  SPENDRRGIGRRRKTESRNRVRGRRRRGPESKSLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGKREEEEEQEPEEQEEFNGEISRPYLSEA
         PE  RR +  R+K + R R+ GR  RG ES+SLSELEFEE+KGFMDLGFVFSE DK SSLA IVPGLNRLGKR+EEE       EE  G ISRPYLSEA
Subjt:  SPENDRRGIGRRRKTESRNRVRGRRRRGPESKSLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGKREEEEEQEPEEQEEFNGEISRPYLSEA

Query:  WEAMEKEEELKKPLMMKWRFPANEIDMKDNLKWWAHAVASTVR
        W AME+EEELKK L+MKWR PANEIDMKDNLKWWAHAVASTVR
Subjt:  WEAMEKEEELKKPLMMKWRFPANEIDMKDNLKWWAHAVASTVR

A0A6J1KR35 uncharacterized protein LOC111495710 | e value: 7.9e-92 | % identity: 78.6
Query:  MDVEQVLNLFDSFWFEREIFNSHPFSSKPQNPKPEN----PLKNSLPPEEPFVPRIRTRSISEDLSSKLSFMSNSNSPDSVLFSPKLQTILSSNDIAETE
        MDVEQVLNLFDSFWFEREIFN HPF + PQNP+PEN    PLKNS PPEEPFVPRI  RSISEDLSSKL+FMS+S+SPDSVLFSPKLQTILSS DIA  E
Subjt:  MDVEQVLNLFDSFWFEREIFNSHPFSSKPQNPKPEN----PLKNSLPPEEPFVPRIRTRSISEDLSSKLSFMSNSNSPDSVLFSPKLQTILSSNDIAETE

Query:  SPENDRRGIGRRRKTESRNRVRGRRRRGPESKSLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGKREEEEEQEPEEQEEFNGEISRPYLSEA
         PE  RR +  R+K + R R+ GR  RG ES+SLSELEFEELKGFMDLGFVFSE DK SSLA IVPGLNRLGKR+EEEE+E EE+EE  G ISRPYLSEA
Subjt:  SPENDRRGIGRRRKTESRNRVRGRRRRGPESKSLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGKREEEEEQEPEEQEEFNGEISRPYLSEA

Query:  WEAMEKEEELKKPLMMKWRFPANEIDMKDNLKWWAHAVASTVR
        W AME+EEE+KK L+MKWR PANEIDMKDNLKWWAHAVASTVR
Subjt:  WEAMEKEEELKKPLMMKWRFPANEIDMKDNLKWWAHAVASTVR

SwissProt top hits (columns: e value, % identity, alignment)
No hits found

Arabidopsis top hits (columns: e value, % identity, alignment)
AT2G42760.1 unknown protein | e value: 1.5e-29 | % identity: 40.3
Query:  EQVLNLFDSFWFEREIF--NSHPFSSKPQNPKPENPLKNSLPPEEPF----VPRIRTRSISED----LSSKLSFMSNSN-----SPDSVL----FSPKLQ
        E++L LF+  W ER IF  +    + K +  + E  +      EE      V  +  R++S++     SSK S  S+S+     SP SVL       KLQ
Subjt:  EQVLNLFDSFWFEREIF--NSHPFSSKPQNPKPENPLKNSLPPEEPF----VPRIRTRSISED----LSSKLSFMSNSN-----SPDSVL----FSPKLQ

Query:  TILSSNDIAETESPENDR---RGIGRRRKTESRNRVRGRRRRGPESKSLSELEFEELKGFMDLGFVFSEED-KGSSLASIVPGLNRLGKREE---EEEQE
        TILS  ++      E +R       +R+K + ++ VR R+      KS+S+LE+EELKGFMDLGFVFSE+D K S L SI+PGL RL K+++   +EE+E
Subjt:  TILSSNDIAETESPENDR---RGIGRRRKTESRNRVRGRRRRGPESKSLSELEFEELKGFMDLGFVFSEED-KGSSLASIVPGLNRLGKREE---EEEQE

Query:  PEEQEEFNG-EISRPYLSEAWEAMEKEEELKK-PLMMKWRFP----ANEIDMKDNLKWWAHAVASTVR
         EE+++  G   +RPYLSEAW+     +  K+    +KWR P    A+E+D+KDNL+ WAHAVAST+R
Subjt:  PEEQEEFNG-EISRPYLSEAWEAMEKEEELKK-PLMMKWRFP----ANEIDMKDNLKWWAHAVASTVR
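
In the alignments above, the unlabelled middle line of each block repeats the residue where query and subject match, shows `+` for a conservative substitution, and is blank otherwise. The sketch below counts those columns for one segment only: the final block of the first GenBank hit (KAG6583944.1). The function name `segment_identity` is hypothetical, and the leading padding of the midline has been stripped by hand so that it lines up with the query text. The reported % identity column covers the whole alignment, so a single segment will generally differ from it.

```python
from typing import Tuple


def segment_identity(query: str, midline: str) -> Tuple[int, int, int]:
    """Return (identities, positives, columns) for one alignment segment.

    Assumes BLAST-style midline conventions: a residue letter marks an
    identity, '+' marks a conservative substitution, a space a mismatch.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    # An identity is a column where the midline repeats the query residue.
    identities = sum(1 for q, m in zip(query, midline) if m == q)
    # Positives are identities plus the '+' columns.
    positives = identities + midline.count("+")
    return identities, positives, len(query)


# Last segment of the first GenBank hit, copied from the alignment above.
query = "WEAMEKEEELKKPLMMKWRFPANEIDMKDNLKWWAHAVASTVR"
midline = "W AME+EEELKK L+MKWR PANEIDMKDNLKWWAHAVASTVR"

identities, positives, columns = segment_identity(query, midline)
print(f"identities: {identities}/{columns} ({100 * identities / columns:.2f}%)")
print(f"positives:  {positives}/{columns} ({100 * positives / columns:.2f}%)")
```

Summing identities and columns over every segment of a hit, rather than one, is what would be needed to compare against the tabulated % identity.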


Sequences
CDS sequence
ATGGACGTTGAGCAAGTTCTGAATCTCTTCGATTCTTTCTGGTTCGAGCGTGAAATCTTCAACAGTCATCCTTTTTCATCAAAACCCCAAAACCCAAAACCTGAAAATCC
ATTAAAAAACTCCCTGCCGCCGGAGGAGCCTTTCGTCCCACGGATTCGCACGAGGTCCATAAGCGAAGATTTAAGCTCCAAATTGAGCTTTATGTCCAATTCCAACTCGC
CTGATTCAGTTCTGTTTTCACCAAAGCTTCAAACGATTCTTTCCAGCAACGACATCGCCGAAACGGAGTCGCCGGAGAACGACCGGAGGGGAATTGGGCGGCGGAGGAAA
ACAGAGTCGAGAAACAGAGTTAGAGGTAGGAGAAGAAGGGGGCCGGAAAGTAAGAGCCTGTCGGAGCTGGAATTTGAGGAGCTAAAAGGGTTTATGGATTTGGGATTTGT
TTTCTCGGAAGAGGATAAAGGTTCGAGCTTGGCGTCGATTGTTCCAGGATTGAACAGGCTGGGGAAAAGGGAGGAAGAAGAAGAACAAGAACCAGAAGAACAGGAAGAAT
TTAATGGTGAAATTTCGAGGCCTTATCTTTCGGAAGCTTGGGAGGCTATGGAGAAAGAGGAGGAATTGAAGAAGCCATTGATGATGAAATGGAGGTTTCCGGCTAATGAG
ATTGATATGAAAGATAATCTCAAATGGTGGGCTCATGCTGTTGCTTCTACTGTTAGATGA
mRNA sequence
GATTCACCAACGTCTTTAGCCTCTTTACGTATATAAAGCACATTCATTTGTCAAAACTCAATTAACCAACTCTAAACACAAATACAAATCTCTGTTCTCAGCCGCCATTG
ATGGACGTTGAGCAAGTTCTGAATCTCTTCGATTCTTTCTGGTTCGAGCGTGAAATCTTCAACAGTCATCCTTTTTCATCAAAACCCCAAAACCCAAAACCTGAAAATCC
ATTAAAAAACTCCCTGCCGCCGGAGGAGCCTTTCGTCCCACGGATTCGCACGAGGTCCATAAGCGAAGATTTAAGCTCCAAATTGAGCTTTATGTCCAATTCCAACTCGC
CTGATTCAGTTCTGTTTTCACCAAAGCTTCAAACGATTCTTTCCAGCAACGACATCGCCGAAACGGAGTCGCCGGAGAACGACCGGAGGGGAATTGGGCGGCGGAGGAAA
ACAGAGTCGAGAAACAGAGTTAGAGGTAGGAGAAGAAGGGGGCCGGAAAGTAAGAGCCTGTCGGAGCTGGAATTTGAGGAGCTAAAAGGGTTTATGGATTTGGGATTTGT
TTTCTCGGAAGAGGATAAAGGTTCGAGCTTGGCGTCGATTGTTCCAGGATTGAACAGGCTGGGGAAAAGGGAGGAAGAAGAAGAACAAGAACCAGAAGAACAGGAAGAAT
TTAATGGTGAAATTTCGAGGCCTTATCTTTCGGAAGCTTGGGAGGCTATGGAGAAAGAGGAGGAATTGAAGAAGCCATTGATGATGAAATGGAGGTTTCCGGCTAATGAG
ATTGATATGAAAGATAATCTCAAATGGTGGGCTCATGCTGTTGCTTCTACTGTTAGATGACACTGCTTTTTCATTCATTCTTCTTCTTCTTCTTTGTAATCTTTTTTTTT
TCTTTTGTTAATATTGTAAATTCTTGGGGTTTCTTAAATTTTGATCTAGTCCAAAGTTTTGAATCGAAAGCCATTGTAAAGGAATTATTGTTGCTATTTTTTTTAGGATA
TGGATTTGTGGGAGTGAGAGTTTATGATATTCATGGAATATGGTTATGCTTTGGTGGTCAAATGTGGTAGATATCAAAATTAAAAGCTCATGCTAAAGAGTAGGTAACTT
CGAGGAGTTTTTCGGAG
Protein sequence
MDVEQVLNLFDSFWFEREIFNSHPFSSKPQNPKPENPLKNSLPPEEPFVPRIRTRSISEDLSSKLSFMSNSNSPDSVLFSPKLQTILSSNDIAETESPENDRRGIGRRRK
TESRNRVRGRRRRGPESKSLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGKREEEEEQEPEEQEEFNGEISRPYLSEAWEAMEKEEELKKPLMMKWRFPANE
IDMKDNLKWWAHAVASTVR
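
The relationship between the CDS and protein sequences above can be checked by translating codons. This is a minimal sketch, assuming the standard genetic code (NCBI translation table 1); `CODON_TABLE` and `translate` are hypothetical names, and only the final line of the CDS is used to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon.

    Codons containing anything other than A, C, G, T become 'X'.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )


# Final line of the CDS sequence listed above (in frame, ends with a stop).
last_cds_line = "ATTGATATGAAAGATAATCTCAAATGGTGGGCTCATGCTGTTGCTTCTACTGTTAGATGA"
print(translate(last_cds_line))
```

The printed peptide corresponds to the last line of the protein sequence followed by `*` for the stop codon. Translating the full CDS the same way requires joining its lines into one string first.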