; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi02G016310 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi02G016310
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionUnknown protein
Genome locationchr02:22205293..22208232
RNA-Seq ExpressionLsi02G016310
SyntenyLsi02G016310
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6578695.1 hypothetical protein SDJN03_23143, partial [Cucurbita argyrosperma subsp. sororia]1.1e-11991.9Show/hide
Query:  ETTKPITTKEKKEGRREEQNMEETETKK-SKKNNKKQKHQHPNDQTTKPASDFSFKPSSDVKGLRFGGQFIVKSFTIRRARPLEFLQLLSFPATTRNSGH
        ETT+PITT+EKK    EEQ+MEETETKK +KKN KKQKHQHPNDQTTK  SDFSFKPSS VKGLRFGGQFIVKSFTIRRARPLEFLQLLSFPATTRNSGH
Subjt:  ETTKPITTKEKKEGRREEQNMEETETKK-SKKNNKKQKHQHPNDQTTKPASDFSFKPSSDVKGLRFGGQFIVKSFTIRRARPLEFLQLLSFPATTRNSGH

Query:  KPPFPSATAFIPTNFTILAHHAWHTLTLGLGTKKSKVLLFVFENETMKAAIDRVWPTEIPLGEVNKKMIRGLSGCEMARFKFRKGCITFYVYAVRREGCF
        KPPFPSATAFIPTNFTILAHHAWHTLTLGLGTKKSKVLLFVFENE MKAAIDRVW  EIPLGEVNKKMIRGLSGCEMARFKFRKGCITFYVYAVRREGCF
Subjt:  KPPFPSATAFIPTNFTILAHHAWHTLTLGLGTKKSKVLLFVFENETMKAAIDRVWPTEIPLGEVNKKMIRGLSGCEMARFKFRKGCITFYVYAVRREGCF

Query:  GFSCTDDLRTILESVVALKDFLDHTAMLAMPNQRTISFAVPPVAMAY
        GF+C DDLRTILESVVALKDFLDHTAMLAMPNQ+TISFA PPVAMAY
Subjt:  GFSCTDDLRTILESVVALKDFLDHTAMLAMPNQRTISFAVPPVAMAY

XP_004142759.2 uncharacterized protein LOC101214484 [Cucumis sativus]2.3e-12595.93Show/hide
Query:  ETTKPITTKEKKEGRREEQNMEETETKKSKKNNKKQKHQHPNDQTTKPASDFSFKPSSDVKGLRFGGQFIVKSFTIRRARPLEFLQLLSFPATTRNSGHK
        ETTKPITTKEKKE   EEQ MEETETKKSKKNNKKQKHQHPNDQTTK  SDFSFKP SDVKGLRFGGQFIVKSFTIRRARPLE L+LLSFPATTRNSGHK
Subjt:  ETTKPITTKEKKEGRREEQNMEETETKKSKKNNKKQKHQHPNDQTTKPASDFSFKPSSDVKGLRFGGQFIVKSFTIRRARPLEFLQLLSFPATTRNSGHK

Query:  PPFPSATAFIPTNFTILAHHAWHTLTLGLGTKKSKVLLFVFENETMKAAIDRVWPTEIPLGEVNKKMIRGLSGCEMARFKFRKGCITFYVYAVRREGCFG
        PPFPSATAFIPTNFTILAHHAWHTLTLGLGT+KSKVLLFVFENETMKAAIDRVWPTEIPLGEVNKKMIRGLSGCEMARFKFRKGCITFYVYAVRREGCFG
Subjt:  PPFPSATAFIPTNFTILAHHAWHTLTLGLGTKKSKVLLFVFENETMKAAIDRVWPTEIPLGEVNKKMIRGLSGCEMARFKFRKGCITFYVYAVRREGCFG

Query:  FSCTDDLRTILESVVALKDFLDHTAMLAMPNQRTISFAVPPVAMAY
        FSCTDDLRTILESVVALKDFLDHTAMLAMPNQRTISFAVPPVAMAY
Subjt:  FSCTDDLRTILESVVALKDFLDHTAMLAMPNQRTISFAVPPVAMAY

XP_008458876.1 PREDICTED: uncharacterized protein LOC103498150 [Cucumis melo]8.6e-12896.34Show/hide
Query:  ETTKPITTKEKKEGRREEQNMEETETKKSKKNNKKQKHQHPNDQTTKPASDFSFKPSSDVKGLRFGGQFIVKSFTIRRARPLEFLQLLSFPATTRNSGHK
        ETTKPITTKEKKE  +EEQ+MEETETKKSKK+NKKQKHQHPNDQTTK ASDFSFKP SDVKGLRFGGQFIVKSFTIRRARPLE LQLLSFPATTRNSGHK
Subjt:  ETTKPITTKEKKEGRREEQNMEETETKKSKKNNKKQKHQHPNDQTTKPASDFSFKPSSDVKGLRFGGQFIVKSFTIRRARPLEFLQLLSFPATTRNSGHK

Query:  PPFPSATAFIPTNFTILAHHAWHTLTLGLGTKKSKVLLFVFENETMKAAIDRVWPTEIPLGEVNKKMIRGLSGCEMARFKFRKGCITFYVYAVRREGCFG
        PPFPSATAFIPTNFTILAHHAWHTLTLGLGT+KSKVLLFVFENETMKAAIDRVWPTEIPLGEVNKKMIRGLSGCEMARFKFRKGCITFYVYAVRREGCFG
Subjt:  PPFPSATAFIPTNFTILAHHAWHTLTLGLGTKKSKVLLFVFENETMKAAIDRVWPTEIPLGEVNKKMIRGLSGCEMARFKFRKGCITFYVYAVRREGCFG

Query:  FSCTDDLRTILESVVALKDFLDHTAMLAMPNQRTISFAVPPVAMAY
        FSCTDDLRTILESVVALKDFLDHTAMLAMPNQRTISFAVPPVAMAY
Subjt:  FSCTDDLRTILESVVALKDFLDHTAMLAMPNQRTISFAVPPVAMAY

XP_022939264.1 uncharacterized protein LOC111445234 [Cucurbita moschata]1.1e-11991.9Show/hide
Query:  ETTKPITTKEKKEGRREEQNMEETETKK-SKKNNKKQKHQHPNDQTTKPASDFSFKPSSDVKGLRFGGQFIVKSFTIRRARPLEFLQLLSFPATTRNSGH
        ETT+PITT+EKK    EEQ+MEETETKK +KKN KKQKHQHPNDQTTK  SDFSFKPSS VKGLRFGGQFIVKSFTIRRARPLEFLQLLSFPATTRNSGH
Subjt:  ETTKPITTKEKKEGRREEQNMEETETKK-SKKNNKKQKHQHPNDQTTKPASDFSFKPSSDVKGLRFGGQFIVKSFTIRRARPLEFLQLLSFPATTRNSGH

Query:  KPPFPSATAFIPTNFTILAHHAWHTLTLGLGTKKSKVLLFVFENETMKAAIDRVWPTEIPLGEVNKKMIRGLSGCEMARFKFRKGCITFYVYAVRREGCF
        KPPFPSATAFIPTNFTILAHHAWHTLTLGLGTKKSKVLLFVFENE MKAAIDRVW  EIPLGEVNKKMIRGLSGCEMARFKFRKGCITFYVYAVRREGCF
Subjt:  KPPFPSATAFIPTNFTILAHHAWHTLTLGLGTKKSKVLLFVFENETMKAAIDRVWPTEIPLGEVNKKMIRGLSGCEMARFKFRKGCITFYVYAVRREGCF

Query:  GFSCTDDLRTILESVVALKDFLDHTAMLAMPNQRTISFAVPPVAMAY
        GF+C DDLRTILESVVALKDFLDHTAMLAMPNQ+TISFA PPVAMAY
Subjt:  GFSCTDDLRTILESVVALKDFLDHTAMLAMPNQRTISFAVPPVAMAY

XP_038889618.1 uncharacterized protein LOC120079488 [Benincasa hispida]3.5e-12997.15Show/hide
Query:  ETTKPITTKEKKEGRREEQNMEETETKKSKKNNKKQKHQHPNDQTTKPASDFSFKPSSDVKGLRFGGQFIVKSFTIRRARPLEFLQLLSFPATTRNSGHK
        ETTKPITTKEKKEGR+EEQ+MEETE KKSKKNNKKQKHQHPNDQ TK ASDFSFKPSSDVKGLRFGGQFIVKSFTIRRARPLE LQLLSFPATTRNSGHK
Subjt:  ETTKPITTKEKKEGRREEQNMEETETKKSKKNNKKQKHQHPNDQTTKPASDFSFKPSSDVKGLRFGGQFIVKSFTIRRARPLEFLQLLSFPATTRNSGHK

Query:  PPFPSATAFIPTNFTILAHHAWHTLTLGLGTKKSKVLLFVFENETMKAAIDRVWPTEIPLGEVNKKMIRGLSGCEMARFKFRKGCITFYVYAVRREGCFG
        PPFPSATAFIPTNFTILAHHAWHTLTLGLGTKKSKVLLFVFENETMK AIDRVWPTEIPLGEVNKKMIRGLSGCEMARFKFRKGCITFYVYAVRREGCFG
Subjt:  PPFPSATAFIPTNFTILAHHAWHTLTLGLGTKKSKVLLFVFENETMKAAIDRVWPTEIPLGEVNKKMIRGLSGCEMARFKFRKGCITFYVYAVRREGCFG

Query:  FSCTDDLRTILESVVALKDFLDHTAMLAMPNQRTISFAVPPVAMAY
        FSCTDDLRTILESVVALKDFLDHTAMLAMPNQRTISFAVPPVAMAY
Subjt:  FSCTDDLRTILESVVALKDFLDHTAMLAMPNQRTISFAVPPVAMAY

TrEMBL top hitse value%identityAlignment
A0A0A0KNW4 Uncharacterized protein1.1e-12595.93Show/hide
Query:  ETTKPITTKEKKEGRREEQNMEETETKKSKKNNKKQKHQHPNDQTTKPASDFSFKPSSDVKGLRFGGQFIVKSFTIRRARPLEFLQLLSFPATTRNSGHK
        ETTKPITTKEKKE   EEQ MEETETKKSKKNNKKQKHQHPNDQTTK  SDFSFKP SDVKGLRFGGQFIVKSFTIRRARPLE L+LLSFPATTRNSGHK
Subjt:  ETTKPITTKEKKEGRREEQNMEETETKKSKKNNKKQKHQHPNDQTTKPASDFSFKPSSDVKGLRFGGQFIVKSFTIRRARPLEFLQLLSFPATTRNSGHK

Query:  PPFPSATAFIPTNFTILAHHAWHTLTLGLGTKKSKVLLFVFENETMKAAIDRVWPTEIPLGEVNKKMIRGLSGCEMARFKFRKGCITFYVYAVRREGCFG
        PPFPSATAFIPTNFTILAHHAWHTLTLGLGT+KSKVLLFVFENETMKAAIDRVWPTEIPLGEVNKKMIRGLSGCEMARFKFRKGCITFYVYAVRREGCFG
Subjt:  PPFPSATAFIPTNFTILAHHAWHTLTLGLGTKKSKVLLFVFENETMKAAIDRVWPTEIPLGEVNKKMIRGLSGCEMARFKFRKGCITFYVYAVRREGCFG

Query:  FSCTDDLRTILESVVALKDFLDHTAMLAMPNQRTISFAVPPVAMAY
        FSCTDDLRTILESVVALKDFLDHTAMLAMPNQRTISFAVPPVAMAY
Subjt:  FSCTDDLRTILESVVALKDFLDHTAMLAMPNQRTISFAVPPVAMAY

A0A1S3C904 uncharacterized protein LOC1034981504.2e-12896.34Show/hide
Query:  ETTKPITTKEKKEGRREEQNMEETETKKSKKNNKKQKHQHPNDQTTKPASDFSFKPSSDVKGLRFGGQFIVKSFTIRRARPLEFLQLLSFPATTRNSGHK
        ETTKPITTKEKKE  +EEQ+MEETETKKSKK+NKKQKHQHPNDQTTK ASDFSFKP SDVKGLRFGGQFIVKSFTIRRARPLE LQLLSFPATTRNSGHK
Subjt:  ETTKPITTKEKKEGRREEQNMEETETKKSKKNNKKQKHQHPNDQTTKPASDFSFKPSSDVKGLRFGGQFIVKSFTIRRARPLEFLQLLSFPATTRNSGHK

Query:  PPFPSATAFIPTNFTILAHHAWHTLTLGLGTKKSKVLLFVFENETMKAAIDRVWPTEIPLGEVNKKMIRGLSGCEMARFKFRKGCITFYVYAVRREGCFG
        PPFPSATAFIPTNFTILAHHAWHTLTLGLGT+KSKVLLFVFENETMKAAIDRVWPTEIPLGEVNKKMIRGLSGCEMARFKFRKGCITFYVYAVRREGCFG
Subjt:  PPFPSATAFIPTNFTILAHHAWHTLTLGLGTKKSKVLLFVFENETMKAAIDRVWPTEIPLGEVNKKMIRGLSGCEMARFKFRKGCITFYVYAVRREGCFG

Query:  FSCTDDLRTILESVVALKDFLDHTAMLAMPNQRTISFAVPPVAMAY
        FSCTDDLRTILESVVALKDFLDHTAMLAMPNQRTISFAVPPVAMAY
Subjt:  FSCTDDLRTILESVVALKDFLDHTAMLAMPNQRTISFAVPPVAMAY

A0A5A7T4D4 Uncharacterized protein4.2e-12896.34Show/hide
Query:  ETTKPITTKEKKEGRREEQNMEETETKKSKKNNKKQKHQHPNDQTTKPASDFSFKPSSDVKGLRFGGQFIVKSFTIRRARPLEFLQLLSFPATTRNSGHK
        ETTKPITTKEKKE  +EEQ+MEETETKKSKK+NKKQKHQHPNDQTTK ASDFSFKP SDVKGLRFGGQFIVKSFTIRRARPLE LQLLSFPATTRNSGHK
Subjt:  ETTKPITTKEKKEGRREEQNMEETETKKSKKNNKKQKHQHPNDQTTKPASDFSFKPSSDVKGLRFGGQFIVKSFTIRRARPLEFLQLLSFPATTRNSGHK

Query:  PPFPSATAFIPTNFTILAHHAWHTLTLGLGTKKSKVLLFVFENETMKAAIDRVWPTEIPLGEVNKKMIRGLSGCEMARFKFRKGCITFYVYAVRREGCFG
        PPFPSATAFIPTNFTILAHHAWHTLTLGLGT+KSKVLLFVFENETMKAAIDRVWPTEIPLGEVNKKMIRGLSGCEMARFKFRKGCITFYVYAVRREGCFG
Subjt:  PPFPSATAFIPTNFTILAHHAWHTLTLGLGTKKSKVLLFVFENETMKAAIDRVWPTEIPLGEVNKKMIRGLSGCEMARFKFRKGCITFYVYAVRREGCFG

Query:  FSCTDDLRTILESVVALKDFLDHTAMLAMPNQRTISFAVPPVAMAY
        FSCTDDLRTILESVVALKDFLDHTAMLAMPNQRTISFAVPPVAMAY
Subjt:  FSCTDDLRTILESVVALKDFLDHTAMLAMPNQRTISFAVPPVAMAY

A0A6J1FGM9 uncharacterized protein LOC1114452345.4e-12091.9Show/hide
Query:  ETTKPITTKEKKEGRREEQNMEETETKK-SKKNNKKQKHQHPNDQTTKPASDFSFKPSSDVKGLRFGGQFIVKSFTIRRARPLEFLQLLSFPATTRNSGH
        ETT+PITT+EKK    EEQ+MEETETKK +KKN KKQKHQHPNDQTTK  SDFSFKPSS VKGLRFGGQFIVKSFTIRRARPLEFLQLLSFPATTRNSGH
Subjt:  ETTKPITTKEKKEGRREEQNMEETETKK-SKKNNKKQKHQHPNDQTTKPASDFSFKPSSDVKGLRFGGQFIVKSFTIRRARPLEFLQLLSFPATTRNSGH

Query:  KPPFPSATAFIPTNFTILAHHAWHTLTLGLGTKKSKVLLFVFENETMKAAIDRVWPTEIPLGEVNKKMIRGLSGCEMARFKFRKGCITFYVYAVRREGCF
        KPPFPSATAFIPTNFTILAHHAWHTLTLGLGTKKSKVLLFVFENE MKAAIDRVW  EIPLGEVNKKMIRGLSGCEMARFKFRKGCITFYVYAVRREGCF
Subjt:  KPPFPSATAFIPTNFTILAHHAWHTLTLGLGTKKSKVLLFVFENETMKAAIDRVWPTEIPLGEVNKKMIRGLSGCEMARFKFRKGCITFYVYAVRREGCF

Query:  GFSCTDDLRTILESVVALKDFLDHTAMLAMPNQRTISFAVPPVAMAY
        GF+C DDLRTILESVVALKDFLDHTAMLAMPNQ+TISFA PPVAMAY
Subjt:  GFSCTDDLRTILESVVALKDFLDHTAMLAMPNQRTISFAVPPVAMAY

A0A6J1JQM2 uncharacterized protein LOC1114889603.3e-11790.24Show/hide
Query:  ETTKPITTKEKKEGRREEQNMEETETKKSKKNNKKQKHQHPNDQTTKPASDFSFKPSSDVKGLRFGGQFIVKSFTIRRARPLEFLQLLSFPATTRNSGHK
        ETT+PITT+EKK    EEQ+MEETET   KKNNK QKHQHPNDQTTK  SDFSFKP+S VKGLRFGGQ IVKSFTIRRARPLEFLQLLSFPATTRNS HK
Subjt:  ETTKPITTKEKKEGRREEQNMEETETKKSKKNNKKQKHQHPNDQTTKPASDFSFKPSSDVKGLRFGGQFIVKSFTIRRARPLEFLQLLSFPATTRNSGHK

Query:  PPFPSATAFIPTNFTILAHHAWHTLTLGLGTKKSKVLLFVFENETMKAAIDRVWPTEIPLGEVNKKMIRGLSGCEMARFKFRKGCITFYVYAVRREGCFG
        PPFPSATAFIPTNFTILAHHAWHTLTLGLGTKKSKVLLFVFENE MKAAIDRVW  EIPLGEVNKKMIRGLSGCEMARFKFRKGCITFYVYAVRREGCFG
Subjt:  PPFPSATAFIPTNFTILAHHAWHTLTLGLGTKKSKVLLFVFENETMKAAIDRVWPTEIPLGEVNKKMIRGLSGCEMARFKFRKGCITFYVYAVRREGCFG

Query:  FSCTDDLRTILESVVALKDFLDHTAMLAMPNQRTISFAVPPVAMAY
        F+C DDLRTILESVVALKDFLDHTAMLAMPNQ+TISFA PPVAMAY
Subjt:  FSCTDDLRTILESVVALKDFLDHTAMLAMPNQRTISFAVPPVAMAY

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G55420.1 unknown protein1.3e-7364.5Show/hide
Query:  MEETETKKSKKNNKKQKHQHPNDQTTKPASDFSFKPSSDVKGLRFGGQFIVKSFTIRRARPLEFLQLLSFPATTRNSGHKPPFPSATAFIPTNFTILAHH
        M E E KK+KK  KK+KHQ         +SD SFKPSSDVKGL+FGGQ IVKSFTIRRAR  E L+LLS P     S   PP  S  A++PTNFTILAHH
Subjt:  MEETETKKSKKNNKKQKHQHPNDQTTKPASDFSFKPSSDVKGLRFGGQFIVKSFTIRRARPLEFLQLLSFPATTRNSGHKPPFPSATAFIPTNFTILAHH

Query:  AWHTLTLGLGTKKSKVLLFVFENETMK----AAIDRVWPTEIPLGEVNKKMIRGLSGCEMARFKFRKGCITFYVYAVRREGCFGFSCTDDLRTILESVVA
        AWHTLTLGLGT+KSKV++FVFE E MK    AA   +WP+EIPLG+VNKKMIR L   EMARFKFRKGCITFYVYAVR  G  GF+  +DL+ IL++VVA
Subjt:  AWHTLTLGLGTKKSKVLLFVFENETMK----AAIDRVWPTEIPLGEVNKKMIRGLSGCEMARFKFRKGCITFYVYAVRREGCFGFSCTDDLRTILESVVA

Query:  LKDFLDHTAMLAMPNQRTISF-AVPPVAMAY
        LKDF+DHTAML MP+Q++I++ + PP AMA+
Subjt:  LKDFLDHTAMLAMPNQRTISF-AVPPVAMAY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAACAACAAAACCCATCACCACAAAGGAGAAGAAAGAAGGGCGCAGAGAAGAACAAAACATGGAAGAAACAGAGACAAAGAAGAGCAAGAAGAACAACAAGAAGCA
AAAACACCAACACCCAAATGACCAAACCACAAAACCAGCCTCTGATTTCTCTTTCAAGCCAAGCTCTGATGTAAAGGGTCTTAGATTCGGTGGCCAATTCATTGTTAAAT
CCTTCACAATCCGGCGAGCTAGGCCTTTAGAGTTCCTCCAGCTTCTCTCTTTTCCGGCTACCACCAGAAATTCCGGCCACAAACCACCGTTCCCATCAGCCACAGCTTTC
ATTCCTACAAACTTCACAATCCTTGCTCACCATGCGTGGCACACACTCACTCTAGGCCTTGGCACTAAGAAGTCCAAAGTTCTTCTCTTTGTGTTTGAAAATGAGACCAT
GAAAGCAGCCATAGATCGTGTTTGGCCAACAGAAATCCCTCTAGGCGAAGTGAACAAGAAGATGATTAGAGGGCTAAGTGGGTGTGAGATGGCTCGGTTCAAATTCAGAA
AAGGTTGCATAACTTTTTATGTTTATGCAGTTCGAAGAGAAGGGTGTTTTGGGTTTTCTTGTACTGATGATTTGAGAACTATTTTGGAGTCTGTTGTTGCTCTCAAAGAT
TTCTTGGATCACACTGCTATGCTTGCTATGCCTAATCAGAGAACCATCAGCTTTGCTGTGCCTCCTGTTGCAATGGCTTATTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAAACAACAAAACCCATCACCACAAAGGAGAAGAAAGAAGGGCGCAGAGAAGAACAAAACATGGAAGAAACAGAGACAAAGAAGAGCAAGAAGAACAACAAGAAGCA
AAAACACCAACACCCAAATGACCAAACCACAAAACCAGCCTCTGATTTCTCTTTCAAGCCAAGCTCTGATGTAAAGGGTCTTAGATTCGGTGGCCAATTCATTGTTAAAT
CCTTCACAATCCGGCGAGCTAGGCCTTTAGAGTTCCTCCAGCTTCTCTCTTTTCCGGCTACCACCAGAAATTCCGGCCACAAACCACCGTTCCCATCAGCCACAGCTTTC
ATTCCTACAAACTTCACAATCCTTGCTCACCATGCGTGGCACACACTCACTCTAGGCCTTGGCACTAAGAAGTCCAAAGTTCTTCTCTTTGTGTTTGAAAATGAGACCAT
GAAAGCAGCCATAGATCGTGTTTGGCCAACAGAAATCCCTCTAGGCGAAGTGAACAAGAAGATGATTAGAGGGCTAAGTGGGTGTGAGATGGCTCGGTTCAAATTCAGAA
AAGGTTGCATAACTTTTTATGTTTATGCAGTTCGAAGAGAAGGGTGTTTTGGGTTTTCTTGTACTGATGATTTGAGAACTATTTTGGAGTCTGTTGTTGCTCTCAAAGAT
TTCTTGGATCACACTGCTATGCTTGCTATGCCTAATCAGAGAACCATCAGCTTTGCTGTGCCTCCTGTTGCAATGGCTTATTAG
Protein sequenceShow/hide protein sequence
METTKPITTKEKKEGRREEQNMEETETKKSKKNNKKQKHQHPNDQTTKPASDFSFKPSSDVKGLRFGGQFIVKSFTIRRARPLEFLQLLSFPATTRNSGHKPPFPSATAF
IPTNFTILAHHAWHTLTLGLGTKKSKVLLFVFENETMKAAIDRVWPTEIPLGEVNKKMIRGLSGCEMARFKFRKGCITFYVYAVRREGCFGFSCTDDLRTILESVVALKD
FLDHTAMLAMPNQRTISFAVPPVAMAY