; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Csor.00g314000 (gene) of Silver-seed gourd (wild; sororia) v1 genome

Gene IDCsor.00g314000
OrganismCucurbita argyrosperma subsp. sororia (Silver-seed gourd (wild; sororia) v1)
DescriptionUnknown protein
Genome locationCsor_Chr15:1788866..1789609
RNA-Seq ExpressionCsor.00g314000
SyntenyCsor.00g314000
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6578695.1 hypothetical protein SDJN03_23143, partial [Cucurbita argyrosperma subsp. sororia]3.93e-172100Show/hide
Query:  MEAAETTEPITTREKKEEQDMEETETKKNNKKNTKKQKHQHPNDQTTKSPSDFSFKPSSHVKGLRFGGQFIVKSFTIRRARPLEFLQLLSFPATTRNSGH
        MEAAETTEPITTREKKEEQDMEETETKKNNKKNTKKQKHQHPNDQTTKSPSDFSFKPSSHVKGLRFGGQFIVKSFTIRRARPLEFLQLLSFPATTRNSGH
Subjt:  MEAAETTEPITTREKKEEQDMEETETKKNNKKNTKKQKHQHPNDQTTKSPSDFSFKPSSHVKGLRFGGQFIVKSFTIRRARPLEFLQLLSFPATTRNSGH

Query:  KPPFPSATAFIPTNFTILAHHAWHTLTLGLGTKKSKVLLFVFENEAMKAAIDRVWAAEIPLGEVNKKMIRGLSGCEMARFKFRKGCITFYVYAVRREGCF
        KPPFPSATAFIPTNFTILAHHAWHTLTLGLGTKKSKVLLFVFENEAMKAAIDRVWAAEIPLGEVNKKMIRGLSGCEMARFKFRKGCITFYVYAVRREGCF
Subjt:  KPPFPSATAFIPTNFTILAHHAWHTLTLGLGTKKSKVLLFVFENEAMKAAIDRVWAAEIPLGEVNKKMIRGLSGCEMARFKFRKGCITFYVYAVRREGCF

Query:  GFACADDLRTILESVVALKDFLDHTAMLAMPNQKTISFAAPPVAMAY
        GFACADDLRTILESVVALKDFLDHTAMLAMPNQKTISFAAPPVAMAY
Subjt:  GFACADDLRTILESVVALKDFLDHTAMLAMPNQKTISFAAPPVAMAY

XP_008458876.1 PREDICTED: uncharacterized protein LOC103498150 [Cucumis melo]6.03e-15490.44Show/hide
Query:  MEAAETTEPITTREKKE----EQDMEETETKKNNKKNTKKQKHQHPNDQTTKSPSDFSFKPSSHVKGLRFGGQFIVKSFTIRRARPLEFLQLLSFPATTR
        ME+ ETT+PITT+EKKE    EQDMEETETKK+ KK+ KKQKHQHPNDQTTKS SDFSFKP S VKGLRFGGQFIVKSFTIRRARPLE LQLLSFPATTR
Subjt:  MEAAETTEPITTREKKE----EQDMEETETKKNNKKNTKKQKHQHPNDQTTKSPSDFSFKPSSHVKGLRFGGQFIVKSFTIRRARPLEFLQLLSFPATTR

Query:  NSGHKPPFPSATAFIPTNFTILAHHAWHTLTLGLGTKKSKVLLFVFENEAMKAAIDRVWAAEIPLGEVNKKMIRGLSGCEMARFKFRKGCITFYVYAVRR
        NSGHKPPFPSATAFIPTNFTILAHHAWHTLTLGLGT+KSKVLLFVFENE MKAAIDRVW  EIPLGEVNKKMIRGLSGCEMARFKFRKGCITFYVYAVRR
Subjt:  NSGHKPPFPSATAFIPTNFTILAHHAWHTLTLGLGTKKSKVLLFVFENEAMKAAIDRVWAAEIPLGEVNKKMIRGLSGCEMARFKFRKGCITFYVYAVRR

Query:  EGCFGFACADDLRTILESVVALKDFLDHTAMLAMPNQKTISFAAPPVAMAY
        EGCFGF+C DDLRTILESVVALKDFLDHTAMLAMPNQ+TISFA PPVAMAY
Subjt:  EGCFGFACADDLRTILESVVALKDFLDHTAMLAMPNQKTISFAAPPVAMAY

XP_022939264.1 uncharacterized protein LOC111445234 [Cucurbita moschata]5.66e-172100Show/hide
Query:  MEAAETTEPITTREKKEEQDMEETETKKNNKKNTKKQKHQHPNDQTTKSPSDFSFKPSSHVKGLRFGGQFIVKSFTIRRARPLEFLQLLSFPATTRNSGH
        MEAAETTEPITTREKKEEQDMEETETKKNNKKNTKKQKHQHPNDQTTKSPSDFSFKPSSHVKGLRFGGQFIVKSFTIRRARPLEFLQLLSFPATTRNSGH
Subjt:  MEAAETTEPITTREKKEEQDMEETETKKNNKKNTKKQKHQHPNDQTTKSPSDFSFKPSSHVKGLRFGGQFIVKSFTIRRARPLEFLQLLSFPATTRNSGH

Query:  KPPFPSATAFIPTNFTILAHHAWHTLTLGLGTKKSKVLLFVFENEAMKAAIDRVWAAEIPLGEVNKKMIRGLSGCEMARFKFRKGCITFYVYAVRREGCF
        KPPFPSATAFIPTNFTILAHHAWHTLTLGLGTKKSKVLLFVFENEAMKAAIDRVWAAEIPLGEVNKKMIRGLSGCEMARFKFRKGCITFYVYAVRREGCF
Subjt:  KPPFPSATAFIPTNFTILAHHAWHTLTLGLGTKKSKVLLFVFENEAMKAAIDRVWAAEIPLGEVNKKMIRGLSGCEMARFKFRKGCITFYVYAVRREGCF

Query:  GFACADDLRTILESVVALKDFLDHTAMLAMPNQKTISFAAPPVAMAY
        GFACADDLRTILESVVALKDFLDHTAMLAMPNQKTISFAAPPVAMAY
Subjt:  GFACADDLRTILESVVALKDFLDHTAMLAMPNQKTISFAAPPVAMAY

XP_022992697.1 uncharacterized protein LOC111488960 [Cucurbita maxima]4.08e-16496.76Show/hide
Query:  MEAAETTEPITTREKKEEQDMEETETKKNNKKNTKKQKHQHPNDQTTKSPSDFSFKPSSHVKGLRFGGQFIVKSFTIRRARPLEFLQLLSFPATTRNSGH
        MEAAETTEPITTREKKEEQDMEETETKKNNK     QKHQHPNDQTTKSPSDFSFKP+SHVKGLRFGGQ IVKSFTIRRARPLEFLQLLSFPATTRNS H
Subjt:  MEAAETTEPITTREKKEEQDMEETETKKNNKKNTKKQKHQHPNDQTTKSPSDFSFKPSSHVKGLRFGGQFIVKSFTIRRARPLEFLQLLSFPATTRNSGH

Query:  KPPFPSATAFIPTNFTILAHHAWHTLTLGLGTKKSKVLLFVFENEAMKAAIDRVWAAEIPLGEVNKKMIRGLSGCEMARFKFRKGCITFYVYAVRREGCF
        KPPFPSATAFIPTNFTILAHHAWHTLTLGLGTKKSKVLLFVFENEAMKAAIDRVWAAEIPLGEVNKKMIRGLSGCEMARFKFRKGCITFYVYAVRREGCF
Subjt:  KPPFPSATAFIPTNFTILAHHAWHTLTLGLGTKKSKVLLFVFENEAMKAAIDRVWAAEIPLGEVNKKMIRGLSGCEMARFKFRKGCITFYVYAVRREGCF

Query:  GFACADDLRTILESVVALKDFLDHTAMLAMPNQKTISFAAPPVAMAY
        GFACADDLRTILESVVALKDFLDHTAMLAMPNQKTISFAAPPVAMAY
Subjt:  GFACADDLRTILESVVALKDFLDHTAMLAMPNQKTISFAAPPVAMAY

XP_023549698.1 uncharacterized protein LOC111808119 [Cucurbita pepo subsp. pepo]7.60e-17098.79Show/hide
Query:  MEAAETTEPITTREKKEEQDMEETETKKNNKKNTKKQKHQHPNDQTTKSPSDFSFKPSSHVKGLRFGGQFIVKSFTIRRARPLEFLQLLSFPATTRNSGH
        MEAAETTEPITTREKKEEQDMEETETKKNN+KNTKKQKHQHPNDQTTKS SDFSFKPSSHVKGLRFGGQFIVKSFTIRRARPLEFLQLLSFPATTRNSGH
Subjt:  MEAAETTEPITTREKKEEQDMEETETKKNNKKNTKKQKHQHPNDQTTKSPSDFSFKPSSHVKGLRFGGQFIVKSFTIRRARPLEFLQLLSFPATTRNSGH

Query:  KPPFPSATAFIPTNFTILAHHAWHTLTLGLGTKKSKVLLFVFENEAMKAAIDRVWAAEIPLGEVNKKMIRGLSGCEMARFKFRKGCITFYVYAVRREGCF
        KPPFPSAT FIPTNFTILAHHAWHTLTLGLGTKKSKVLLFVFENEAMKAAIDRVWAAEIPLGEVNKKMIRGLSGCEMARFKFRKGCITFYVYAVRREGCF
Subjt:  KPPFPSATAFIPTNFTILAHHAWHTLTLGLGTKKSKVLLFVFENEAMKAAIDRVWAAEIPLGEVNKKMIRGLSGCEMARFKFRKGCITFYVYAVRREGCF

Query:  GFACADDLRTILESVVALKDFLDHTAMLAMPNQKTISFAAPPVAMAY
        GFACADDLRTILESVVALKDFLDHTAMLAMPNQKTISFAAPPVAMAY
Subjt:  GFACADDLRTILESVVALKDFLDHTAMLAMPNQKTISFAAPPVAMAY

TrEMBL top hitse value%identityAlignment
A0A0A0KNW4 Uncharacterized protein2.13e-15189.64Show/hide
Query:  MEAAETTEPITTREKKE----EQDMEETETKKNNKKNTKKQKHQHPNDQTTKSPSDFSFKPSSHVKGLRFGGQFIVKSFTIRRARPLEFLQLLSFPATTR
        ME+ ETT+PITT+EKKE    EQ+MEETETKK+ KKN KKQKHQHPNDQTTKS  DFSFKP S VKGLRFGGQFIVKSFTIRRARPLE L+LLSFPATTR
Subjt:  MEAAETTEPITTREKKE----EQDMEETETKKNNKKNTKKQKHQHPNDQTTKSPSDFSFKPSSHVKGLRFGGQFIVKSFTIRRARPLEFLQLLSFPATTR

Query:  NSGHKPPFPSATAFIPTNFTILAHHAWHTLTLGLGTKKSKVLLFVFENEAMKAAIDRVWAAEIPLGEVNKKMIRGLSGCEMARFKFRKGCITFYVYAVRR
        NSGHKPPFPSATAFIPTNFTILAHHAWHTLTLGLGT+KSKVLLFVFENE MKAAIDRVW  EIPLGEVNKKMIRGLSGCEMARFKFRKGCITFYVYAVRR
Subjt:  NSGHKPPFPSATAFIPTNFTILAHHAWHTLTLGLGTKKSKVLLFVFENEAMKAAIDRVWAAEIPLGEVNKKMIRGLSGCEMARFKFRKGCITFYVYAVRR

Query:  EGCFGFACADDLRTILESVVALKDFLDHTAMLAMPNQKTISFAAPPVAMAY
        EGCFGF+C DDLRTILESVVALKDFLDHTAMLAMPNQ+TISFA PPVAMAY
Subjt:  EGCFGFACADDLRTILESVVALKDFLDHTAMLAMPNQKTISFAAPPVAMAY

A0A1S3C904 uncharacterized protein LOC1034981502.92e-15490.44Show/hide
Query:  MEAAETTEPITTREKKE----EQDMEETETKKNNKKNTKKQKHQHPNDQTTKSPSDFSFKPSSHVKGLRFGGQFIVKSFTIRRARPLEFLQLLSFPATTR
        ME+ ETT+PITT+EKKE    EQDMEETETKK+ KK+ KKQKHQHPNDQTTKS SDFSFKP S VKGLRFGGQFIVKSFTIRRARPLE LQLLSFPATTR
Subjt:  MEAAETTEPITTREKKE----EQDMEETETKKNNKKNTKKQKHQHPNDQTTKSPSDFSFKPSSHVKGLRFGGQFIVKSFTIRRARPLEFLQLLSFPATTR

Query:  NSGHKPPFPSATAFIPTNFTILAHHAWHTLTLGLGTKKSKVLLFVFENEAMKAAIDRVWAAEIPLGEVNKKMIRGLSGCEMARFKFRKGCITFYVYAVRR
        NSGHKPPFPSATAFIPTNFTILAHHAWHTLTLGLGT+KSKVLLFVFENE MKAAIDRVW  EIPLGEVNKKMIRGLSGCEMARFKFRKGCITFYVYAVRR
Subjt:  NSGHKPPFPSATAFIPTNFTILAHHAWHTLTLGLGTKKSKVLLFVFENEAMKAAIDRVWAAEIPLGEVNKKMIRGLSGCEMARFKFRKGCITFYVYAVRR

Query:  EGCFGFACADDLRTILESVVALKDFLDHTAMLAMPNQKTISFAAPPVAMAY
        EGCFGF+C DDLRTILESVVALKDFLDHTAMLAMPNQ+TISFA PPVAMAY
Subjt:  EGCFGFACADDLRTILESVVALKDFLDHTAMLAMPNQKTISFAAPPVAMAY

A0A5A7T4D4 Uncharacterized protein2.92e-15490.44Show/hide
Query:  MEAAETTEPITTREKKE----EQDMEETETKKNNKKNTKKQKHQHPNDQTTKSPSDFSFKPSSHVKGLRFGGQFIVKSFTIRRARPLEFLQLLSFPATTR
        ME+ ETT+PITT+EKKE    EQDMEETETKK+ KK+ KKQKHQHPNDQTTKS SDFSFKP S VKGLRFGGQFIVKSFTIRRARPLE LQLLSFPATTR
Subjt:  MEAAETTEPITTREKKE----EQDMEETETKKNNKKNTKKQKHQHPNDQTTKSPSDFSFKPSSHVKGLRFGGQFIVKSFTIRRARPLEFLQLLSFPATTR

Query:  NSGHKPPFPSATAFIPTNFTILAHHAWHTLTLGLGTKKSKVLLFVFENEAMKAAIDRVWAAEIPLGEVNKKMIRGLSGCEMARFKFRKGCITFYVYAVRR
        NSGHKPPFPSATAFIPTNFTILAHHAWHTLTLGLGT+KSKVLLFVFENE MKAAIDRVW  EIPLGEVNKKMIRGLSGCEMARFKFRKGCITFYVYAVRR
Subjt:  NSGHKPPFPSATAFIPTNFTILAHHAWHTLTLGLGTKKSKVLLFVFENEAMKAAIDRVWAAEIPLGEVNKKMIRGLSGCEMARFKFRKGCITFYVYAVRR

Query:  EGCFGFACADDLRTILESVVALKDFLDHTAMLAMPNQKTISFAAPPVAMAY
        EGCFGF+C DDLRTILESVVALKDFLDHTAMLAMPNQ+TISFA PPVAMAY
Subjt:  EGCFGFACADDLRTILESVVALKDFLDHTAMLAMPNQKTISFAAPPVAMAY

A0A6J1FGM9 uncharacterized protein LOC1114452342.74e-172100Show/hide
Query:  MEAAETTEPITTREKKEEQDMEETETKKNNKKNTKKQKHQHPNDQTTKSPSDFSFKPSSHVKGLRFGGQFIVKSFTIRRARPLEFLQLLSFPATTRNSGH
        MEAAETTEPITTREKKEEQDMEETETKKNNKKNTKKQKHQHPNDQTTKSPSDFSFKPSSHVKGLRFGGQFIVKSFTIRRARPLEFLQLLSFPATTRNSGH
Subjt:  MEAAETTEPITTREKKEEQDMEETETKKNNKKNTKKQKHQHPNDQTTKSPSDFSFKPSSHVKGLRFGGQFIVKSFTIRRARPLEFLQLLSFPATTRNSGH

Query:  KPPFPSATAFIPTNFTILAHHAWHTLTLGLGTKKSKVLLFVFENEAMKAAIDRVWAAEIPLGEVNKKMIRGLSGCEMARFKFRKGCITFYVYAVRREGCF
        KPPFPSATAFIPTNFTILAHHAWHTLTLGLGTKKSKVLLFVFENEAMKAAIDRVWAAEIPLGEVNKKMIRGLSGCEMARFKFRKGCITFYVYAVRREGCF
Subjt:  KPPFPSATAFIPTNFTILAHHAWHTLTLGLGTKKSKVLLFVFENEAMKAAIDRVWAAEIPLGEVNKKMIRGLSGCEMARFKFRKGCITFYVYAVRREGCF

Query:  GFACADDLRTILESVVALKDFLDHTAMLAMPNQKTISFAAPPVAMAY
        GFACADDLRTILESVVALKDFLDHTAMLAMPNQKTISFAAPPVAMAY
Subjt:  GFACADDLRTILESVVALKDFLDHTAMLAMPNQKTISFAAPPVAMAY

A0A6J1JQM2 uncharacterized protein LOC1114889601.98e-16496.76Show/hide
Query:  MEAAETTEPITTREKKEEQDMEETETKKNNKKNTKKQKHQHPNDQTTKSPSDFSFKPSSHVKGLRFGGQFIVKSFTIRRARPLEFLQLLSFPATTRNSGH
        MEAAETTEPITTREKKEEQDMEETETKKNNK     QKHQHPNDQTTKSPSDFSFKP+SHVKGLRFGGQ IVKSFTIRRARPLEFLQLLSFPATTRNS H
Subjt:  MEAAETTEPITTREKKEEQDMEETETKKNNKKNTKKQKHQHPNDQTTKSPSDFSFKPSSHVKGLRFGGQFIVKSFTIRRARPLEFLQLLSFPATTRNSGH

Query:  KPPFPSATAFIPTNFTILAHHAWHTLTLGLGTKKSKVLLFVFENEAMKAAIDRVWAAEIPLGEVNKKMIRGLSGCEMARFKFRKGCITFYVYAVRREGCF
        KPPFPSATAFIPTNFTILAHHAWHTLTLGLGTKKSKVLLFVFENEAMKAAIDRVWAAEIPLGEVNKKMIRGLSGCEMARFKFRKGCITFYVYAVRREGCF
Subjt:  KPPFPSATAFIPTNFTILAHHAWHTLTLGLGTKKSKVLLFVFENEAMKAAIDRVWAAEIPLGEVNKKMIRGLSGCEMARFKFRKGCITFYVYAVRREGCF

Query:  GFACADDLRTILESVVALKDFLDHTAMLAMPNQKTISFAAPPVAMAY
        GFACADDLRTILESVVALKDFLDHTAMLAMPNQKTISFAAPPVAMAY
Subjt:  GFACADDLRTILESVVALKDFLDHTAMLAMPNQKTISFAAPPVAMAY

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G55420.1 unknown protein8.4e-7364.91Show/hide
Query:  ETKKNNKKNTKKQKHQHPNDQTTKSPSDFSFKPSSHVKGLRFGGQFIVKSFTIRRARPLEFLQLLSFPATTRNSGHKPPFPSATAFIPTNFTILAHHAWH
        E +K  KK  KK+KHQ          SD SFKPSS VKGL+FGGQ IVKSFTIRRAR  E L+LLS P     S   PP  S  A++PTNFTILAHHAWH
Subjt:  ETKKNNKKNTKKQKHQHPNDQTTKSPSDFSFKPSSHVKGLRFGGQFIVKSFTIRRARPLEFLQLLSFPATTRNSGHKPPFPSATAFIPTNFTILAHHAWH

Query:  TLTLGLGTKKSKVLLFVFENEAMK----AAIDRVWAAEIPLGEVNKKMIRGLSGCEMARFKFRKGCITFYVYAVRREGCFGFACADDLRTILESVVALKD
        TLTLGLGT+KSKV++FVFE EAMK    AA   +W +EIPLG+VNKKMIR L   EMARFKFRKGCITFYVYAVR  G  GFA A+DL+ IL++VVALKD
Subjt:  TLTLGLGTKKSKVLLFVFENEAMK----AAIDRVWAAEIPLGEVNKKMIRGLSGCEMARFKFRKGCITFYVYAVRREGCFGFACADDLRTILESVVALKD

Query:  FLDHTAMLAMPNQKTISFAA-PPVAMAY
        F+DHTAML MP+QK+I++++ PP AMA+
Subjt:  FLDHTAMLAMPNQKTISFAA-PPVAMAY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGCAGCAGAAACAACAGAGCCCATCACCACAAGGGAGAAGAAAGAAGAACAAGACATGGAGGAAACAGAGACAAAGAAGAACAACAAGAAGAACACCAAG
AAGCAAAAACATCAACACCCAAATGACCAAACCACAAAATCACCCTCTGATTTCTCTTTCAAGCCAAGCTCTCATGTAAAGGGTCTCAGATTCGGTGGCCAATTC
ATCGTTAAATCCTTCACAATCCGGCGAGCTAGGCCTTTAGAGTTTCTCCAGCTCCTCTCTTTTCCGGCCACCACCAGAAATTCCGGCCATAAACCGCCGTTCCCA
TCTGCCACAGCTTTCATTCCCACAAACTTCACAATCCTCGCTCACCATGCGTGGCACACACTCACTCTAGGCCTCGGCACAAAGAAGTCCAAAGTTCTTCTCTTT
GTGTTTGAAAATGAGGCCATGAAGGCAGCAATAGACCGTGTTTGGGCAGCAGAAATCCCTCTAGGCGAAGTGAATAAGAAGATGATTAGAGGGCTAAGTGGGTGT
GAGATGGCTCGGTTCAAATTCAGAAAAGGGTGCATAACTTTTTATGTTTATGCAGTTCGAAGAGAAGGGTGTTTTGGGTTTGCTTGTGCTGATGATTTGAGAACC
ATTTTGGAGTCTGTTGTGGCTCTCAAAGATTTCTTGGATCACACTGCTATGCTTGCTATGCCTAACCAGAAAACCATCAGCTTTGCGGCGCCTCCTGTTGCAATG
GCTTATTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAAGCAGCAGAAACAACAGAGCCCATCACCACAAGGGAGAAGAAAGAAGAACAAGACATGGAGGAAACAGAGACAAAGAAGAACAACAAGAAGAACACCAAG
AAGCAAAAACATCAACACCCAAATGACCAAACCACAAAATCACCCTCTGATTTCTCTTTCAAGCCAAGCTCTCATGTAAAGGGTCTCAGATTCGGTGGCCAATTC
ATCGTTAAATCCTTCACAATCCGGCGAGCTAGGCCTTTAGAGTTTCTCCAGCTCCTCTCTTTTCCGGCCACCACCAGAAATTCCGGCCATAAACCGCCGTTCCCA
TCTGCCACAGCTTTCATTCCCACAAACTTCACAATCCTCGCTCACCATGCGTGGCACACACTCACTCTAGGCCTCGGCACAAAGAAGTCCAAAGTTCTTCTCTTT
GTGTTTGAAAATGAGGCCATGAAGGCAGCAATAGACCGTGTTTGGGCAGCAGAAATCCCTCTAGGCGAAGTGAATAAGAAGATGATTAGAGGGCTAAGTGGGTGT
GAGATGGCTCGGTTCAAATTCAGAAAAGGGTGCATAACTTTTTATGTTTATGCAGTTCGAAGAGAAGGGTGTTTTGGGTTTGCTTGTGCTGATGATTTGAGAACC
ATTTTGGAGTCTGTTGTGGCTCTCAAAGATTTCTTGGATCACACTGCTATGCTTGCTATGCCTAACCAGAAAACCATCAGCTTTGCGGCGCCTCCTGTTGCAATG
GCTTATTAG
Protein sequenceShow/hide protein sequence
MEAAETTEPITTREKKEEQDMEETETKKNNKKNTKKQKHQHPNDQTTKSPSDFSFKPSSHVKGLRFGGQFIVKSFTIRRARPLEFLQLLSFPATTRNSGHKPPFP
SATAFIPTNFTILAHHAWHTLTLGLGTKKSKVLLFVFENEAMKAAIDRVWAAEIPLGEVNKKMIRGLSGCEMARFKFRKGCITFYVYAVRREGCFGFACADDLRT
ILESVVALKDFLDHTAMLAMPNQKTISFAAPPVAMAY