CuGenDBv2

Pay0000048 (gene) of Melon (Payzawat) v1 genome

Gene ID: Pay0000048
Organism: Cucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
Description: Nodulin homeobox isoform X2
Genome location: chr10:25919080..25924806
RNA-Seq Expression: Pay0000048
Synteny: Pay0000048
Gene Ontology terms:
  GO:0009908 - flower development (biological process)
  GO:0005634 - nucleus (cellular component)
  GO:0003697 - single-stranded DNA binding (molecular function)
InterPro domains: IPR039325 - Nodulin homeobox protein
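
The genome location above follows a `seqid:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not say so), the span can be parsed and its length computed as below. `parse_location` is a hypothetical helper for illustration, not part of CuGenDB.

```python
import re

def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)

seqid, start, end = parse_location("chr10:25919080..25924806")
# Assumption: 1-based inclusive coordinates, so length = end - start + 1.
span_length = end - start + 1
print(seqid, start, end, span_length)
```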


Homology

GenBank top hits
KAA0054201.1 nodulin homeobox isoform X2 [Cucumis melo var. makuwa] (e-value: 1.8e-96; identity: 100%)
Query:  MNRPNNRKARLARTARDRRATLEADNAIPDKQGGMAAGSCDSPDSQCEDKNVHNIGRDRRTASRTNTTNNSKNSTMKFNDSGPTELVHFKPRLYVILIDV
        MNRPNNRKARLARTARDRRATLEADNAIPDKQGGMAAGSCDSPDSQCEDKNVHNIGRDRRTASRTNTTNNSKNSTMKFNDSGPTELVHFKPRLYVILIDV
Subjt:  MNRPNNRKARLARTARDRRATLEADNAIPDKQGGMAAGSCDSPDSQCEDKNVHNIGRDRRTASRTNTTNNSKNSTMKFNDSGPTELVHFKPRLYVILIDV

Query:  LGEEIVKGKVHQVHGKWYGRNLEEFETLVVDIDELKTDKNTVLPYPHEDTCTLFHEAETKIGVMRVLWDFNKIFMLKSQ
        LGEEIVKGKVHQVHGKWYGRNLEEFETLVVDIDELKTDKNTVLPYPHEDTCTLFHEAETKIGVMRVLWDFNKIFMLKSQ
Subjt:  LGEEIVKGKVHQVHGKWYGRNLEEFETLVVDIDELKTDKNTVLPYPHEDTCTLFHEAETKIGVMRVLWDFNKIFMLKSQ

XP_008457501.1 PREDICTED: nodulin homeobox isoform X2 [Cucumis melo] (e-value: 8.7e-80; identity: 75.83%)
Query:  YAHGTVYYGQNFSLKWLKNLGWLVFLNVCACRMNRPNNRKARLARTARDRRATLEADNAIPDKQGGMAAGSCDSPDSQCEDKNVHNIGRDRRTASRTNTT
        +A   + YG   +   LKN  WL             NNRKARLARTARD RATLEADNAIPDKQGG+AAGSCDSPDS CEDK+V N GRDRRTASRTNT 
Subjt:  YAHGTVYYGQNFSLKWLKNLGWLVFLNVCACRMNRPNNRKARLARTARDRRATLEADNAIPDKQGGMAAGSCDSPDSQCEDKNVHNIGRDRRTASRTNTT

Query:  NNSKNSTMKFNDSGPTELVHFKPRLYVILIDVLGEEIVKGKVHQVHGKWYGRNLEEFETLVVDIDELKTDKNTVLPYPHEDTCTLFHEAETKIGVMRVLW
        NN KNST +FNDSGPTE VHFKP  YVIL+DVLGEEI KGKVHQVHGKWYGRNLEE ETLV+DIDELK DKNTVLPYP+E T T FHEAETKIGVMRVLW
Subjt:  NNSKNSTMKFNDSGPTELVHFKPRLYVILIDVLGEEIVKGKVHQVHGKWYGRNLEEFETLVVDIDELKTDKNTVLPYPHEDTCTLFHEAETKIGVMRVLW

Query:  DFNKIFMLKSQ
        DFNKIFML+SQ
Subjt:  DFNKIFMLKSQ

XP_011658033.1 nodulin homeobox isoform X1 [Cucumis sativus] (e-value: 5.1e-80; identity: 76.3%)
Query:  YAHGTVYYGQNFSLKWLKNLGWLVFLNVCACRMNRPNNRKARLARTARDRRATLEADNAIPDKQGGMAAGSCDSPDSQCEDKNVHNIGRDRRTASRTNTT
        +A   + YG   +   LKN  WL             NNRKARLARTARD RATLEADNAIPDKQGGM AGSCDSPDS CEDK+V N GRDRR+ASRTNT 
Subjt:  YAHGTVYYGQNFSLKWLKNLGWLVFLNVCACRMNRPNNRKARLARTARDRRATLEADNAIPDKQGGMAAGSCDSPDSQCEDKNVHNIGRDRRTASRTNTT

Query:  NNSKNSTMKFNDSGPTELVHFKPRLYVILIDVLGEEIVKGKVHQVHGKWYGRNLEEFETLVVDIDELKTDKNTVLPYPHEDTCTLFHEAETKIGVMRVLW
        NNSKNST +FNDSGPTE VHFKP  YVIL+DVLGEEI KGKVHQVHGKWYGRNLEE ETLVVDIDELK DKNTVLPYP+E T T FHEAETKIGVMRVLW
Subjt:  NNSKNSTMKFNDSGPTELVHFKPRLYVILIDVLGEEIVKGKVHQVHGKWYGRNLEEFETLVVDIDELKTDKNTVLPYPHEDTCTLFHEAETKIGVMRVLW

Query:  DFNKIFMLKSQ
        DFNKIFML+SQ
Subjt:  DFNKIFMLKSQ

XP_011658036.1 nodulin homeobox isoform X2 [Cucumis sativus] (e-value: 5.1e-80; identity: 76.3%)
Query:  YAHGTVYYGQNFSLKWLKNLGWLVFLNVCACRMNRPNNRKARLARTARDRRATLEADNAIPDKQGGMAAGSCDSPDSQCEDKNVHNIGRDRRTASRTNTT
        +A   + YG   +   LKN  WL             NNRKARLARTARD RATLEADNAIPDKQGGM AGSCDSPDS CEDK+V N GRDRR+ASRTNT 
Subjt:  YAHGTVYYGQNFSLKWLKNLGWLVFLNVCACRMNRPNNRKARLARTARDRRATLEADNAIPDKQGGMAAGSCDSPDSQCEDKNVHNIGRDRRTASRTNTT

Query:  NNSKNSTMKFNDSGPTELVHFKPRLYVILIDVLGEEIVKGKVHQVHGKWYGRNLEEFETLVVDIDELKTDKNTVLPYPHEDTCTLFHEAETKIGVMRVLW
        NNSKNST +FNDSGPTE VHFKP  YVIL+DVLGEEI KGKVHQVHGKWYGRNLEE ETLVVDIDELK DKNTVLPYP+E T T FHEAETKIGVMRVLW
Subjt:  NNSKNSTMKFNDSGPTELVHFKPRLYVILIDVLGEEIVKGKVHQVHGKWYGRNLEEFETLVVDIDELKTDKNTVLPYPHEDTCTLFHEAETKIGVMRVLW

Query:  DFNKIFMLKSQ
        DFNKIFML+SQ
Subjt:  DFNKIFMLKSQ

XP_031744039.1 nodulin homeobox isoform X3 [Cucumis sativus] (e-value: 5.1e-80; identity: 76.3%)
Query:  YAHGTVYYGQNFSLKWLKNLGWLVFLNVCACRMNRPNNRKARLARTARDRRATLEADNAIPDKQGGMAAGSCDSPDSQCEDKNVHNIGRDRRTASRTNTT
        +A   + YG   +   LKN  WL             NNRKARLARTARD RATLEADNAIPDKQGGM AGSCDSPDS CEDK+V N GRDRR+ASRTNT 
Subjt:  YAHGTVYYGQNFSLKWLKNLGWLVFLNVCACRMNRPNNRKARLARTARDRRATLEADNAIPDKQGGMAAGSCDSPDSQCEDKNVHNIGRDRRTASRTNTT

Query:  NNSKNSTMKFNDSGPTELVHFKPRLYVILIDVLGEEIVKGKVHQVHGKWYGRNLEEFETLVVDIDELKTDKNTVLPYPHEDTCTLFHEAETKIGVMRVLW
        NNSKNST +FNDSGPTE VHFKP  YVIL+DVLGEEI KGKVHQVHGKWYGRNLEE ETLVVDIDELK DKNTVLPYP+E T T FHEAETKIGVMRVLW
Subjt:  NNSKNSTMKFNDSGPTELVHFKPRLYVILIDVLGEEIVKGKVHQVHGKWYGRNLEEFETLVVDIDELKTDKNTVLPYPHEDTCTLFHEAETKIGVMRVLW

Query:  DFNKIFMLKSQ
        DFNKIFML+SQ
Subjt:  DFNKIFMLKSQ

TrEMBL top hits
A0A0A0LVA2 Homeobox domain-containing protein (e-value: 2.5e-80; identity: 76.3%)
Query:  YAHGTVYYGQNFSLKWLKNLGWLVFLNVCACRMNRPNNRKARLARTARDRRATLEADNAIPDKQGGMAAGSCDSPDSQCEDKNVHNIGRDRRTASRTNTT
        +A   + YG   +   LKN  WL             NNRKARLARTARD RATLEADNAIPDKQGGM AGSCDSPDS CEDK+V N GRDRR+ASRTNT 
Subjt:  YAHGTVYYGQNFSLKWLKNLGWLVFLNVCACRMNRPNNRKARLARTARDRRATLEADNAIPDKQGGMAAGSCDSPDSQCEDKNVHNIGRDRRTASRTNTT

Query:  NNSKNSTMKFNDSGPTELVHFKPRLYVILIDVLGEEIVKGKVHQVHGKWYGRNLEEFETLVVDIDELKTDKNTVLPYPHEDTCTLFHEAETKIGVMRVLW
        NNSKNST +FNDSGPTE VHFKP  YVIL+DVLGEEI KGKVHQVHGKWYGRNLEE ETLVVDIDELK DKNTVLPYP+E T T FHEAETKIGVMRVLW
Subjt:  NNSKNSTMKFNDSGPTELVHFKPRLYVILIDVLGEEIVKGKVHQVHGKWYGRNLEEFETLVVDIDELKTDKNTVLPYPHEDTCTLFHEAETKIGVMRVLW

Query:  DFNKIFMLKSQ
        DFNKIFML+SQ
Subjt:  DFNKIFMLKSQ

A0A1S3C587 nodulin homeobox isoform X1 (e-value: 4.2e-80; identity: 75.83%)
Query:  YAHGTVYYGQNFSLKWLKNLGWLVFLNVCACRMNRPNNRKARLARTARDRRATLEADNAIPDKQGGMAAGSCDSPDSQCEDKNVHNIGRDRRTASRTNTT
        +A   + YG   +   LKN  WL             NNRKARLARTARD RATLEADNAIPDKQGG+AAGSCDSPDS CEDK+V N GRDRRTASRTNT 
Subjt:  YAHGTVYYGQNFSLKWLKNLGWLVFLNVCACRMNRPNNRKARLARTARDRRATLEADNAIPDKQGGMAAGSCDSPDSQCEDKNVHNIGRDRRTASRTNTT

Query:  NNSKNSTMKFNDSGPTELVHFKPRLYVILIDVLGEEIVKGKVHQVHGKWYGRNLEEFETLVVDIDELKTDKNTVLPYPHEDTCTLFHEAETKIGVMRVLW
        NN KNST +FNDSGPTE VHFKP  YVIL+DVLGEEI KGKVHQVHGKWYGRNLEE ETLV+DIDELK DKNTVLPYP+E T T FHEAETKIGVMRVLW
Subjt:  NNSKNSTMKFNDSGPTELVHFKPRLYVILIDVLGEEIVKGKVHQVHGKWYGRNLEEFETLVVDIDELKTDKNTVLPYPHEDTCTLFHEAETKIGVMRVLW

Query:  DFNKIFMLKSQ
        DFNKIFML+SQ
Subjt:  DFNKIFMLKSQ

A0A1S4E1L6 nodulin homeobox isoform X2 (e-value: 4.2e-80; identity: 75.83%)
Query:  YAHGTVYYGQNFSLKWLKNLGWLVFLNVCACRMNRPNNRKARLARTARDRRATLEADNAIPDKQGGMAAGSCDSPDSQCEDKNVHNIGRDRRTASRTNTT
        +A   + YG   +   LKN  WL             NNRKARLARTARD RATLEADNAIPDKQGG+AAGSCDSPDS CEDK+V N GRDRRTASRTNT 
Subjt:  YAHGTVYYGQNFSLKWLKNLGWLVFLNVCACRMNRPNNRKARLARTARDRRATLEADNAIPDKQGGMAAGSCDSPDSQCEDKNVHNIGRDRRTASRTNTT

Query:  NNSKNSTMKFNDSGPTELVHFKPRLYVILIDVLGEEIVKGKVHQVHGKWYGRNLEEFETLVVDIDELKTDKNTVLPYPHEDTCTLFHEAETKIGVMRVLW
        NN KNST +FNDSGPTE VHFKP  YVIL+DVLGEEI KGKVHQVHGKWYGRNLEE ETLV+DIDELK DKNTVLPYP+E T T FHEAETKIGVMRVLW
Subjt:  NNSKNSTMKFNDSGPTELVHFKPRLYVILIDVLGEEIVKGKVHQVHGKWYGRNLEEFETLVVDIDELKTDKNTVLPYPHEDTCTLFHEAETKIGVMRVLW

Query:  DFNKIFMLKSQ
        DFNKIFML+SQ
Subjt:  DFNKIFMLKSQ

A0A5A7UKY3 Nodulin homeobox isoform X2 (e-value: 8.5e-97; identity: 100%)
Query:  MNRPNNRKARLARTARDRRATLEADNAIPDKQGGMAAGSCDSPDSQCEDKNVHNIGRDRRTASRTNTTNNSKNSTMKFNDSGPTELVHFKPRLYVILIDV
        MNRPNNRKARLARTARDRRATLEADNAIPDKQGGMAAGSCDSPDSQCEDKNVHNIGRDRRTASRTNTTNNSKNSTMKFNDSGPTELVHFKPRLYVILIDV
Subjt:  MNRPNNRKARLARTARDRRATLEADNAIPDKQGGMAAGSCDSPDSQCEDKNVHNIGRDRRTASRTNTTNNSKNSTMKFNDSGPTELVHFKPRLYVILIDV

Query:  LGEEIVKGKVHQVHGKWYGRNLEEFETLVVDIDELKTDKNTVLPYPHEDTCTLFHEAETKIGVMRVLWDFNKIFMLKSQ
        LGEEIVKGKVHQVHGKWYGRNLEEFETLVVDIDELKTDKNTVLPYPHEDTCTLFHEAETKIGVMRVLWDFNKIFMLKSQ
Subjt:  LGEEIVKGKVHQVHGKWYGRNLEEFETLVVDIDELKTDKNTVLPYPHEDTCTLFHEAETKIGVMRVLWDFNKIFMLKSQ

A0A5D3DAP2 Nodulin homeobox isoform X2 (e-value: 4.2e-80; identity: 75.83%)
Query:  YAHGTVYYGQNFSLKWLKNLGWLVFLNVCACRMNRPNNRKARLARTARDRRATLEADNAIPDKQGGMAAGSCDSPDSQCEDKNVHNIGRDRRTASRTNTT
        +A   + YG   +   LKN  WL             NNRKARLARTARD RATLEADNAIPDKQGG+AAGSCDSPDS CEDK+V N GRDRRTASRTNT 
Subjt:  YAHGTVYYGQNFSLKWLKNLGWLVFLNVCACRMNRPNNRKARLARTARDRRATLEADNAIPDKQGGMAAGSCDSPDSQCEDKNVHNIGRDRRTASRTNTT

Query:  NNSKNSTMKFNDSGPTELVHFKPRLYVILIDVLGEEIVKGKVHQVHGKWYGRNLEEFETLVVDIDELKTDKNTVLPYPHEDTCTLFHEAETKIGVMRVLW
        NN KNST +FNDSGPTE VHFKP  YVIL+DVLGEEI KGKVHQVHGKWYGRNLEE ETLV+DIDELK DKNTVLPYP+E T T FHEAETKIGVMRVLW
Subjt:  NNSKNSTMKFNDSGPTELVHFKPRLYVILIDVLGEEIVKGKVHQVHGKWYGRNLEEFETLVVDIDELKTDKNTVLPYPHEDTCTLFHEAETKIGVMRVLW

Query:  DFNKIFMLKSQ
        DFNKIFML+SQ
Subjt:  DFNKIFMLKSQ

SwissProt top hits
A9LNK9 30-kDa cleavage and polyadenylation specificity factor 30 (e-value: 4.9e-09; identity: 66.67%)
Query:  CEKMMSRIGDFVSGGNWKYAHGTVYYGQNFSLKWLK
        C KM SRIG ++ GGNWK+ HGT  YG+NFS+KWLK
Subjt:  CEKMMSRIGDFVSGGNWKYAHGTVYYGQNFSLKWLK

F4JI44 Nodulin homeobox (e-value: 6.4e-09; identity: 29.89%)
Query:  NNRKARLARTARDRRATLEADNA--IPDKQGGMAAGSCDSPDSQCEDKNVHNIGRDRRTASRTNTTNNSKNSTMKFNDSGPTELVHFKPRLYVILIDVLG
        NNRKA+LAR  +      + +++  +P+  G         P +  +D          +T + T  T  +   T   ++ G       K    V L+D  G
Subjt:  NNRKARLARTARDRRATLEADNA--IPDKQGGMAAGSCDSPDSQCEDKNVHNIGRDRRTASRTNTTNNSKNSTMKFNDSGPTELVHFKPRLYVILIDVLG

Query:  EEIVKGKVHQVHGKWYGRNLEEFETLVVDIDELKTD---KNTVLPYPHEDTCTLFHEAETKIGVMRVLWDFNKI
        +EI KG V +  G+W G +LE  +  VVD+ EL         ++PY  +D    F EA ++ GVMRV WD NK+
Subjt:  EEIVKGKVHQVHGKWYGRNLEEFETLVVDIDELKTD---KNTVLPYPHEDTCTLFHEAETKIGVMRVLWDFNKI

Q0DA50 Zinc finger CCCH domain-containing protein 45 (e-value: 3.8e-09; identity: 66.67%)
Query:  CEKMMSRIGDFVSGGNWKYAHGTVYYGQNFSLKWLK
        C KM SRIG ++ GGNWK AHGT +YG+NFS++WLK
Subjt:  CEKMMSRIGDFVSGGNWKYAHGTVYYGQNFSLKWLK

Arabidopsis top hits
AT1G30460.1 cleavage and polyadenylation specificity factor 30 (e-value: 3.5e-10; identity: 66.67%)
Query:  CEKMMSRIGDFVSGGNWKYAHGTVYYGQNFSLKWLK
        C KM SRIG ++ GGNWK+ HGT  YG+NFS+KWLK
Subjt:  CEKMMSRIGDFVSGGNWKYAHGTVYYGQNFSLKWLK

AT4G03090.1 sequence-specific DNA binding; sequence-specific DNA binding transcription factors (e-value: 4.6e-10; identity: 29.89%)
Query:  NNRKARLARTARDRRATLEADNA--IPDKQGGMAAGSCDSPDSQCEDKNVHNIGRDRRTASRTNTTNNSKNSTMKFNDSGPTELVHFKPRLYVILIDVLG
        NNRKA+LAR  +      + +++  +P+  G         P +  +D          +T + T  T  +   T   ++ G       K    V L+D  G
Subjt:  NNRKARLARTARDRRATLEADNA--IPDKQGGMAAGSCDSPDSQCEDKNVHNIGRDRRTASRTNTTNNSKNSTMKFNDSGPTELVHFKPRLYVILIDVLG

Query:  EEIVKGKVHQVHGKWYGRNLEEFETLVVDIDELKTD---KNTVLPYPHEDTCTLFHEAETKIGVMRVLWDFNKI
        +EI KG V +  G+W G +LE  +  VVD+ EL         ++PY  +D    F EA ++ GVMRV WD NK+
Subjt:  EEIVKGKVHQVHGKWYGRNLEEFETLVVDIDELKTD---KNTVLPYPHEDTCTLFHEAETKIGVMRVLWDFNKI

AT4G03090.2 sequence-specific DNA binding; sequence-specific DNA binding transcription factors (e-value: 4.6e-10; identity: 29.89%)
Query:  NNRKARLARTARDRRATLEADNA--IPDKQGGMAAGSCDSPDSQCEDKNVHNIGRDRRTASRTNTTNNSKNSTMKFNDSGPTELVHFKPRLYVILIDVLG
        NNRKA+LAR  +      + +++  +P+  G         P +  +D          +T + T  T  +   T   ++ G       K    V L+D  G
Subjt:  NNRKARLARTARDRRATLEADNA--IPDKQGGMAAGSCDSPDSQCEDKNVHNIGRDRRTASRTNTTNNSKNSTMKFNDSGPTELVHFKPRLYVILIDVLG

Query:  EEIVKGKVHQVHGKWYGRNLEEFETLVVDIDELKTD---KNTVLPYPHEDTCTLFHEAETKIGVMRVLWDFNKI
        +EI KG V +  G+W G +LE  +  VVD+ EL         ++PY  +D    F EA ++ GVMRV WD NK+
Subjt:  EEIVKGKVHQVHGKWYGRNLEEFETLVVDIDELKTD---KNTVLPYPHEDTCTLFHEAETKIGVMRVLWDFNKI
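
The % identity reported for each hit can be cross-checked against the alignment's middle line. In the usual BLAST-style layout a letter there marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. Because the Subjt rows as captured on this page repeat the Query text, the sketch below reads identities from the midline instead. It assumes the midline is supplied at full alignment width (trailing spaces intact); `percent_identity` is a hypothetical helper name.

```python
def percent_identity(midline):
    """Percent identity from a BLAST-style midline.

    Letters count as identical positions; '+' and spaces do not.
    Assumes len(midline) equals the alignment length.
    """
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / len(midline), 2)

# Midline of the 36-column A9LNK9 alignment, leading indentation removed.
midline = "C KM SRIG ++ GGNWK+ HGT  YG+NFS+KWLK"
print(percent_identity(midline))
```

For this alignment, 24 identical positions over 36 columns agrees with the 66.67 listed for the hit.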


Sequences

CDS sequence
ATGGTGGTGTACAATCCTCTTGGATGGATCCATAAGGATGTTGTTAGAATACTGGCTTGTGAAAAGATGATGTCCAGAATTGGTGATTTTGTTAGTGGGGGCAATTGGAA
ATATGCACATGGAACTGTATATTATGGTCAAAACTTTTCACTCAAATGGCTTAAGAATCTTGGTTGGCTTGTGTTTTTAAATGTTTGTGCTTGTAGAATGAACAGGCCGA
ACAATAGGAAAGCGAGGCTAGCACGCACAGCTAGGGATAGGCGTGCAACCTTAGAAGCTGACAATGCAATTCCAGATAAGCAAGGGGGTATGGCAGCTGGATCCTGTGAT
TCACCTGATAGCCAATGTGAAGATAAAAATGTACATAATATAGGAAGGGATCGAAGAACTGCATCAAGAACTAACACGACTAATAATTCTAAGAATTCAACAATGAAGTT
CAATGACAGTGGCCCAACAGAACTTGTTCACTTCAAGCCACGACTGTATGTCATTCTTATAGACGTGCTCGGAGAGGAGATTGTGAAAGGAAAAGTGCATCAGGTACATG
GTAAATGGTATGGGAGAAACCTGGAGGAATTTGAAACATTAGTTGTTGATATTGATGAATTGAAGACTGATAAAAACACAGTGCTTCCATACCCACACGAGGACACATGC
ACCTTATTCCATGAGGCAGAAACTAAAATTGGTGTTATGAGAGTTTTGTGGGATTTTAACAAAATCTTCATGTTAAAGTCACAATGA
mRNA sequence
ATGGTGGTGTACAATCCTCTTGGATGGATCCATAAGGATGTTGTTAGAATACTGGCTTGTGAAAAGATGATGTCCAGAATTGGTGATTTTGTTAGTGGGGGCAATTGGAA
ATATGCACATGGAACTGTATATTATGGTCAAAACTTTTCACTCAAATGGCTTAAGAATCTTGGTTGGCTTGTGTTTTTAAATGTTTGTGCTTGTAGAATGAACAGGCCGA
ACAATAGGAAAGCGAGGCTAGCACGCACAGCTAGGGATAGGCGTGCAACCTTAGAAGCTGACAATGCAATTCCAGATAAGCAAGGGGGTATGGCAGCTGGATCCTGTGAT
TCACCTGATAGCCAATGTGAAGATAAAAATGTACATAATATAGGAAGGGATCGAAGAACTGCATCAAGAACTAACACGACTAATAATTCTAAGAATTCAACAATGAAGTT
CAATGACAGTGGCCCAACAGAACTTGTTCACTTCAAGCCACGACTGTATGTCATTCTTATAGACGTGCTCGGAGAGGAGATTGTGAAAGGAAAAGTGCATCAGGTACATG
GTAAATGGTATGGGAGAAACCTGGAGGAATTTGAAACATTAGTTGTTGATATTGATGAATTGAAGACTGATAAAAACACAGTGCTTCCATACCCACACGAGGACACATGC
ACCTTATTCCATGAGGCAGAAACTAAAATTGGTGTTATGAGAGTTTTGTGGGATTTTAACAAAATCTTCATGTTAAAGTCACAATGA
Protein sequence
MVVYNPLGWIHKDVVRILACEKMMSRIGDFVSGGNWKYAHGTVYYGQNFSLKWLKNLGWLVFLNVCACRMNRPNNRKARLARTARDRRATLEADNAIPDKQGGMAAGSCD
SPDSQCEDKNVHNIGRDRRTASRTNTTNNSKNSTMKFNDSGPTELVHFKPRLYVILIDVLGEEIVKGKVHQVHGKWYGRNLEEFETLVVDIDELKTDKNTVLPYPHEDTC
TLFHEAETKIGVMRVLWDFNKIFMLKSQ
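
The protein sequence should correspond to a translation of the CDS. The following is a minimal sketch of that check, assuming the standard nuclear genetic code and reading frame 1; `translate` is a hypothetical helper written for illustration, and only the first 30 and last 12 nucleotides of the CDS are used here.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nt and last 12 nt of the CDS listed above.
print(translate("ATGGTGGTGTACAATCCTCTTGGATGGATC"))
print(translate("AAGTCACAATGA"))
```

The two fragments translate to the first ten and last three residues of the protein sequence listed above (MVVYNPLGWI and KSQ).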