CuGenDBv2

Moc06g16460 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc06g16460
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Protein of unknown function (DUF707)
Genome location: chr6:12957144..12968165
RNA-Seq Expression: Moc06g16460
Synteny: Moc06g16460
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR007877 - Protein of unknown function DUF707


Homology

GenBank top hits | e value | % identity
XP_004142563.1 uncharacterized protein LOC101221459 isoform X2 [Cucumis sativus] | 5.8e-208 | 87.56
Query:  MKFSSCLPFLAEAKSRNSCLCTLFPTASLLCLVLFLGSAYVAPDYREKISRWGIDGLVGSKFNKCENQCRPYGSEPLPKDIVANASNLEMRPLWGASRKY
        MKFS CLP LAE KSRNSCLC+  PTASLLCL LF+GS YVAPDYREKISRWGIDGLVGSKFNKCE QCRP GSEPLPKDIV  ASNLEMRPLWGAS++ 
Subjt:  MKFSSCLPFLAEAKSRNSCLCTLFPTASLLCLVLFLGSAYVAPDYREKISRWGIDGLVGSKFNKCENQCRPYGSEPLPKDIVANASNLEMRPLWGASRKY

Query:  YRNPVNSSSNLFTVAVGIKQKDLVNKMVTKFLSSDFAVMLFHYDSIMDEWKEFNWSNRVIHVTAVNQTKWWFAKRFLHPDIVAEYNYVFLWDEDLGVDNF
        Y+NPVNSSSN+F +AVGIKQKDLVNKMVTKFLSSDFAVMLFHYD I+DEWK FNWSNRVIHVTAVNQTKWWFAKRFLHPDIV EYNYVFLWDEDLGVDNF
Subjt:  YRNPVNSSSNLFTVAVGIKQKDLVNKMVTKFLSSDFAVMLFHYDSIMDEWKEFNWSNRVIHVTAVNQTKWWFAKRFLHPDIVAEYNYVFLWDEDLGVDNF

Query:  DPKQYVNIIKSEGLEISQPALDPYKSEVHHQITARGRRSTVHRRTFRPNNGGKGCDVNSKSPPCTGWIEMMAPVFSRAAWRCAWYMIQNDLIHAWGLDMQ
        +PK YV+II+SEGLEISQPALDPYKSEVHHQITARGRRSTVHRRTFRP+NGGKGCDVNS +PPCTGWIEMMAPVFSRAAWRC WYMIQNDLIHAWGLDMQ
Subjt:  DPKQYVNIIKSEGLEISQPALDPYKSEVHHQITARGRRSTVHRRTFRPNNGGKGCDVNSKSPPCTGWIEMMAPVFSRAAWRCAWYMIQNDLIHAWGLDMQ

Query:  LGYCAQGDRTKNVGVVDAEYVVHYGRPTLGGPEENETSSNSPVKDHRADVRRQSYIELGIFRTRWRNAVKQDECWKDPYPGTIASK
        LGYCAQGDRTKNVGVVD+EYV+HYGRPTLGGPEENETSS S VKDHRADVRRQSYIEL +FR RW+ A +QDECW+DPYP T+  K
Subjt:  LGYCAQGDRTKNVGVVDAEYVVHYGRPTLGGPEENETSSNSPVKDHRADVRRQSYIELGIFRTRWRNAVKQDECWKDPYPGTIASK

XP_008443712.1 PREDICTED: uncharacterized protein LOC103487236 isoform X1 [Cucumis melo] | 1.9e-206 | 86.79
Query:  MKFSSCLPFLAEAKSRNSCLCTLFPTASLLCLVLFLGSAYVAPDYREKISRWGIDGLVGSKFNKCENQCRPYGSEPLPKDIVANASNLEMRPLWGASRKY
        MK S CLP LAE KSR+SCLC+L PTASLLCL LF+GS YVAP+YREKISRWGIDGLV SKFNKCE QCRP GSEPLPKDIV  ASNLEMRPLWGAS++ 
Subjt:  MKFSSCLPFLAEAKSRNSCLCTLFPTASLLCLVLFLGSAYVAPDYREKISRWGIDGLVGSKFNKCENQCRPYGSEPLPKDIVANASNLEMRPLWGASRKY

Query:  YRNPVNSSSNLFTVAVGIKQKDLVNKMVTKFLSSDFAVMLFHYDSIMDEWKEFNWSNRVIHVTAVNQTKWWFAKRFLHPDIVAEYNYVFLWDEDLGVDNF
        Y+NPVNSSSN+F +AVGIKQKDLVNKMVTKFLSSDFAVMLFHYD I+DEWK+F+WSNRVIHVTAVNQTKWWFAKRFLHPDIV EY+YVFLWDEDLGVDNF
Subjt:  YRNPVNSSSNLFTVAVGIKQKDLVNKMVTKFLSSDFAVMLFHYDSIMDEWKEFNWSNRVIHVTAVNQTKWWFAKRFLHPDIVAEYNYVFLWDEDLGVDNF

Query:  DPKQYVNIIKSEGLEISQPALDPYKSEVHHQITARGRRSTVHRRTFRPNNGGKGCDVNSKSPPCTGWIEMMAPVFSRAAWRCAWYMIQNDLIHAWGLDMQ
        DPK YV+II+SEGLEISQPALDPYKSEVHHQITARGRRSTVHRRTFRP+NGGKGCDVNS +PPCTGWIEMMAPVFSRAAWRC WYMIQNDLIHAWGLDMQ
Subjt:  DPKQYVNIIKSEGLEISQPALDPYKSEVHHQITARGRRSTVHRRTFRPNNGGKGCDVNSKSPPCTGWIEMMAPVFSRAAWRCAWYMIQNDLIHAWGLDMQ

Query:  LGYCAQGDRTKNVGVVDAEYVVHYGRPTLGGPEENETSSNSPVKDHRADVRRQSYIELGIFRTRWRNAVKQDECWKDPYPGTIASK
        LGYCAQGDRTKNVGVVD+EYV+HYGRPTLGGPEENETSSNS VKDHRADVRRQSYIEL +FR RW+ A +QDECW+DPYP T+  K
Subjt:  LGYCAQGDRTKNVGVVDAEYVVHYGRPTLGGPEENETSSNSPVKDHRADVRRQSYIELGIFRTRWRNAVKQDECWKDPYPGTIASK

XP_022158746.1 uncharacterized protein LOC111025211 isoform X1 [Momordica charantia] | 1.5e-232 | 99.74
Query:  MKFSSCLPFLAEAKSRNSCLCTLFPTASLLCLVLFLGSAYVAPDYREKISRWGIDGLVGSKFNKCENQCRPYGSEPLPKDIVANASNLEMRPLWGASRKY
        MKFSSCLPFLAEAKSRNSCLCTLFPTASLLCLVLFLGSAYVAPDYREKISRWGIDGLVGSKFNKCENQCRPYGSEPLPKDIVANASNLEMRPLWGASRKY
Subjt:  MKFSSCLPFLAEAKSRNSCLCTLFPTASLLCLVLFLGSAYVAPDYREKISRWGIDGLVGSKFNKCENQCRPYGSEPLPKDIVANASNLEMRPLWGASRKY

Query:  YRNPVNSSSNLFTVAVGIKQKDLVNKMVTKFLSSDFAVMLFHYDSIMDEWKEFNWSNRVIHVTAVNQTKWWFAKRFLHPDIVAEYNYVFLWDEDLGVDNF
        YRNPVNSSSNLFTVAVGIKQKDLVNKMVTKFLSSDFAVMLFHYDSIMDEWKEFNWSNRVIHVTAVNQTKWWFAKRFLHPDIVAEYNYVFLWDEDLGVDNF
Subjt:  YRNPVNSSSNLFTVAVGIKQKDLVNKMVTKFLSSDFAVMLFHYDSIMDEWKEFNWSNRVIHVTAVNQTKWWFAKRFLHPDIVAEYNYVFLWDEDLGVDNF

Query:  DPKQYVNIIKSEGLEISQPALDPYKSEVHHQITARGRRSTVHRRTFRPNNGGKGCDVNSKSPPCTGWIEMMAPVFSRAAWRCAWYMIQNDLIHAWGLDMQ
        DPKQYVNIIKSEGLEISQPALDPYKSEVHHQITARGRRSTVHRRTFRPNNGGKGCDVNSKSPPCTGWIEMMAPVFSRAAWRCAWYMIQNDLIHAWGLDMQ
Subjt:  DPKQYVNIIKSEGLEISQPALDPYKSEVHHQITARGRRSTVHRRTFRPNNGGKGCDVNSKSPPCTGWIEMMAPVFSRAAWRCAWYMIQNDLIHAWGLDMQ

Query:  LGYCAQGDRTKNVGVVDAEYVVHYGRPTLGGPEENETSSNSPVKDHRADVRRQSYIELGIFRTRWRNAVKQDECWKDPYPGTI
        LGYCAQGDRTKNVGVVDAEYVVHYGRPTLGGPEENETSSNSPVKDHRADVRRQSYIELGIFRTRWRNAVKQDECWKDPYPGT+
Subjt:  LGYCAQGDRTKNVGVVDAEYVVHYGRPTLGGPEENETSSNSPVKDHRADVRRQSYIELGIFRTRWRNAVKQDECWKDPYPGTI

XP_031736155.1 uncharacterized protein LOC101221459 isoform X1 [Cucumis sativus] | 6.7e-204 | 82.44
Query:  MKFSSCLPFLAEAKSRNSCLCTLFPTASLLCLVLFLGSAYVAPDYREKISRWGIDGLVGSKFNKCENQCRPYGSEPLPKDIVANASNLEMRPLWGASRKY
        MKFS CLP LAE KSRNSCLC+  PTASLLCL LF+GS YVAPDYREKISRWGIDGLVGSKFNKCE QCRP GSEPLPKDIV  ASNLEMRPLWGAS++ 
Subjt:  MKFSSCLPFLAEAKSRNSCLCTLFPTASLLCLVLFLGSAYVAPDYREKISRWGIDGLVGSKFNKCENQCRPYGSEPLPKDIVANASNLEMRPLWGASRKY

Query:  YRNPVNSSSNLFTVAVGIKQKDLVNKMVTKFLSSDFAVMLFHYDSIMDEWKEFNWSNRVIHVTAVNQTKWWFAKRFLHPDIVAEYNYVFLWDEDLGVDNF
        Y+NPVNSSSN+F +AVGIKQKDLVNKMVTKFLSSDFAVMLFHYD I+DEWK FNWSNRVIHVTAVNQTKWWFAKRFLHPDIV EYNYVFLWDEDLGVDNF
Subjt:  YRNPVNSSSNLFTVAVGIKQKDLVNKMVTKFLSSDFAVMLFHYDSIMDEWKEFNWSNRVIHVTAVNQTKWWFAKRFLHPDIVAEYNYVFLWDEDLGVDNF

Query:  DPKQYVNIIKSEGLEISQPALDPYKSEVHHQITARGRRSTVHRRTFRPNNGGKGCDVNSKSPPCTGWIEMMAPVFSRAAWRCAWYMIQNDLIHAWGLDMQ
        +PK YV+II+SEGLEISQPALDPYKSEVHHQITARGRRSTVHRRTFRP+NGGKGCDVNS +PPCTGWIEMMAPVFSRAAWRC WYMIQNDLIHAWGLDMQ
Subjt:  DPKQYVNIIKSEGLEISQPALDPYKSEVHHQITARGRRSTVHRRTFRPNNGGKGCDVNSKSPPCTGWIEMMAPVFSRAAWRCAWYMIQNDLIHAWGLDMQ

Query:  LGYCAQGDRTKNVGVVDAEYVVHYGRPTLGGPEENETSSNSPVKDHRAD------------------------VRRQSYIELGIFRTRWRNAVKQDECWK
        LGYCAQGDRTKNVGVVD+EYV+HYGRPTLGGPEENETSS S VKDHRAD                        VRRQSYIEL +FR RW+ A +QDECW+
Subjt:  LGYCAQGDRTKNVGVVDAEYVVHYGRPTLGGPEENETSSNSPVKDHRAD------------------------VRRQSYIELGIFRTRWRNAVKQDECWK

Query:  DPYPGTIASK
        DPYP T+  K
Subjt:  DPYPGTIASK

XP_038880303.1 uncharacterized protein LOC120071938 isoform X1 [Benincasa hispida] | 1.3e-207 | 87.47
Query:  MKFSSCLPFLAEAKSRNSCLCTLFPTASLLCLVLFLGSAYVAPDYREKISRWGIDGLVGSKFNKCENQCRPYGSEPLPKDIVANASNLEMRPLWGASRKY
        MKFS CLP L+E KSRNSCLC+L PTASL+CLVLF+GS YVAPDYREKISRWGIDGLVGSKFNKCENQCRP GSEPLPKDIV  ASNLEMRPLWGAS++ 
Subjt:  MKFSSCLPFLAEAKSRNSCLCTLFPTASLLCLVLFLGSAYVAPDYREKISRWGIDGLVGSKFNKCENQCRPYGSEPLPKDIVANASNLEMRPLWGASRKY

Query:  YRNPVNSSSNLFTVAVGIKQKDLVNKMVTKFLSSDFAVMLFHYDSIMDEWKEFNWSNRVIHVTAVNQTKWWFAKRFLHPDIVAEYNYVFLWDEDLGVDNF
        Y+NPVNSS NLF +AVGIKQKDLVNKMVTKFLSSDFAVMLFHYD I+DEW+EF+WSNRV+HVTAVNQTKWWFAKRFLHPDIVAEYNY+FLWDEDLGVDNF
Subjt:  YRNPVNSSSNLFTVAVGIKQKDLVNKMVTKFLSSDFAVMLFHYDSIMDEWKEFNWSNRVIHVTAVNQTKWWFAKRFLHPDIVAEYNYVFLWDEDLGVDNF

Query:  DPKQYVNIIKSEGLEISQPALDPYKSEVHHQITARGRRSTVHRRTFRPNNGGKGCDVNSKSPPCTGWIEMMAPVFSRAAWRCAWYMIQNDLIHAWGLDMQ
        +P+QYV+II SEGLEISQPALDPYKSEVHHQITARGRRSTVHRRTFRP+NGGKGCD NS +PPCTGWIEMMAPVFSRAAWRC WYMIQNDLIHAWGLDMQ
Subjt:  DPKQYVNIIKSEGLEISQPALDPYKSEVHHQITARGRRSTVHRRTFRPNNGGKGCDVNSKSPPCTGWIEMMAPVFSRAAWRCAWYMIQNDLIHAWGLDMQ

Query:  LGYCAQGDRTKNVGVVDAEYVVHYGRPTLGGPEENETSSNSPVKDHRADVRRQSYIELGIFRTRWRNAVKQDECWKDPYPGTI
        LGYCAQGDRTKNVGVVD+EYV+HYGRPTLGGPEENETSS S VKDHRADVRRQSYIEL +FR RW+ A +QDECW DPYP T+
Subjt:  LGYCAQGDRTKNVGVVDAEYVVHYGRPTLGGPEENETSSNSPVKDHRADVRRQSYIELGIFRTRWRNAVKQDECWKDPYPGTI

TrEMBL top hits | e value | % identity
A0A0A0M0M3 Uncharacterized protein | 2.8e-208 | 87.56
Query:  MKFSSCLPFLAEAKSRNSCLCTLFPTASLLCLVLFLGSAYVAPDYREKISRWGIDGLVGSKFNKCENQCRPYGSEPLPKDIVANASNLEMRPLWGASRKY
        MKFS CLP LAE KSRNSCLC+  PTASLLCL LF+GS YVAPDYREKISRWGIDGLVGSKFNKCE QCRP GSEPLPKDIV  ASNLEMRPLWGAS++ 
Subjt:  MKFSSCLPFLAEAKSRNSCLCTLFPTASLLCLVLFLGSAYVAPDYREKISRWGIDGLVGSKFNKCENQCRPYGSEPLPKDIVANASNLEMRPLWGASRKY

Query:  YRNPVNSSSNLFTVAVGIKQKDLVNKMVTKFLSSDFAVMLFHYDSIMDEWKEFNWSNRVIHVTAVNQTKWWFAKRFLHPDIVAEYNYVFLWDEDLGVDNF
        Y+NPVNSSSN+F +AVGIKQKDLVNKMVTKFLSSDFAVMLFHYD I+DEWK FNWSNRVIHVTAVNQTKWWFAKRFLHPDIV EYNYVFLWDEDLGVDNF
Subjt:  YRNPVNSSSNLFTVAVGIKQKDLVNKMVTKFLSSDFAVMLFHYDSIMDEWKEFNWSNRVIHVTAVNQTKWWFAKRFLHPDIVAEYNYVFLWDEDLGVDNF

Query:  DPKQYVNIIKSEGLEISQPALDPYKSEVHHQITARGRRSTVHRRTFRPNNGGKGCDVNSKSPPCTGWIEMMAPVFSRAAWRCAWYMIQNDLIHAWGLDMQ
        +PK YV+II+SEGLEISQPALDPYKSEVHHQITARGRRSTVHRRTFRP+NGGKGCDVNS +PPCTGWIEMMAPVFSRAAWRC WYMIQNDLIHAWGLDMQ
Subjt:  DPKQYVNIIKSEGLEISQPALDPYKSEVHHQITARGRRSTVHRRTFRPNNGGKGCDVNSKSPPCTGWIEMMAPVFSRAAWRCAWYMIQNDLIHAWGLDMQ

Query:  LGYCAQGDRTKNVGVVDAEYVVHYGRPTLGGPEENETSSNSPVKDHRADVRRQSYIELGIFRTRWRNAVKQDECWKDPYPGTIASK
        LGYCAQGDRTKNVGVVD+EYV+HYGRPTLGGPEENETSS S VKDHRADVRRQSYIEL +FR RW+ A +QDECW+DPYP T+  K
Subjt:  LGYCAQGDRTKNVGVVDAEYVVHYGRPTLGGPEENETSSNSPVKDHRADVRRQSYIELGIFRTRWRNAVKQDECWKDPYPGTIASK

A0A1S3B872 uncharacterized protein LOC103487236 isoform X1 | 9.1e-207 | 86.79
Query:  MKFSSCLPFLAEAKSRNSCLCTLFPTASLLCLVLFLGSAYVAPDYREKISRWGIDGLVGSKFNKCENQCRPYGSEPLPKDIVANASNLEMRPLWGASRKY
        MK S CLP LAE KSR+SCLC+L PTASLLCL LF+GS YVAP+YREKISRWGIDGLV SKFNKCE QCRP GSEPLPKDIV  ASNLEMRPLWGAS++ 
Subjt:  MKFSSCLPFLAEAKSRNSCLCTLFPTASLLCLVLFLGSAYVAPDYREKISRWGIDGLVGSKFNKCENQCRPYGSEPLPKDIVANASNLEMRPLWGASRKY

Query:  YRNPVNSSSNLFTVAVGIKQKDLVNKMVTKFLSSDFAVMLFHYDSIMDEWKEFNWSNRVIHVTAVNQTKWWFAKRFLHPDIVAEYNYVFLWDEDLGVDNF
        Y+NPVNSSSN+F +AVGIKQKDLVNKMVTKFLSSDFAVMLFHYD I+DEWK+F+WSNRVIHVTAVNQTKWWFAKRFLHPDIV EY+YVFLWDEDLGVDNF
Subjt:  YRNPVNSSSNLFTVAVGIKQKDLVNKMVTKFLSSDFAVMLFHYDSIMDEWKEFNWSNRVIHVTAVNQTKWWFAKRFLHPDIVAEYNYVFLWDEDLGVDNF

Query:  DPKQYVNIIKSEGLEISQPALDPYKSEVHHQITARGRRSTVHRRTFRPNNGGKGCDVNSKSPPCTGWIEMMAPVFSRAAWRCAWYMIQNDLIHAWGLDMQ
        DPK YV+II+SEGLEISQPALDPYKSEVHHQITARGRRSTVHRRTFRP+NGGKGCDVNS +PPCTGWIEMMAPVFSRAAWRC WYMIQNDLIHAWGLDMQ
Subjt:  DPKQYVNIIKSEGLEISQPALDPYKSEVHHQITARGRRSTVHRRTFRPNNGGKGCDVNSKSPPCTGWIEMMAPVFSRAAWRCAWYMIQNDLIHAWGLDMQ

Query:  LGYCAQGDRTKNVGVVDAEYVVHYGRPTLGGPEENETSSNSPVKDHRADVRRQSYIELGIFRTRWRNAVKQDECWKDPYPGTIASK
        LGYCAQGDRTKNVGVVD+EYV+HYGRPTLGGPEENETSSNS VKDHRADVRRQSYIEL +FR RW+ A +QDECW+DPYP T+  K
Subjt:  LGYCAQGDRTKNVGVVDAEYVVHYGRPTLGGPEENETSSNSPVKDHRADVRRQSYIELGIFRTRWRNAVKQDECWKDPYPGTIASK

A0A6J1E1V6 uncharacterized protein LOC111025211 isoform X1 | 7.4e-233 | 99.74
Query:  MKFSSCLPFLAEAKSRNSCLCTLFPTASLLCLVLFLGSAYVAPDYREKISRWGIDGLVGSKFNKCENQCRPYGSEPLPKDIVANASNLEMRPLWGASRKY
        MKFSSCLPFLAEAKSRNSCLCTLFPTASLLCLVLFLGSAYVAPDYREKISRWGIDGLVGSKFNKCENQCRPYGSEPLPKDIVANASNLEMRPLWGASRKY
Subjt:  MKFSSCLPFLAEAKSRNSCLCTLFPTASLLCLVLFLGSAYVAPDYREKISRWGIDGLVGSKFNKCENQCRPYGSEPLPKDIVANASNLEMRPLWGASRKY

Query:  YRNPVNSSSNLFTVAVGIKQKDLVNKMVTKFLSSDFAVMLFHYDSIMDEWKEFNWSNRVIHVTAVNQTKWWFAKRFLHPDIVAEYNYVFLWDEDLGVDNF
        YRNPVNSSSNLFTVAVGIKQKDLVNKMVTKFLSSDFAVMLFHYDSIMDEWKEFNWSNRVIHVTAVNQTKWWFAKRFLHPDIVAEYNYVFLWDEDLGVDNF
Subjt:  YRNPVNSSSNLFTVAVGIKQKDLVNKMVTKFLSSDFAVMLFHYDSIMDEWKEFNWSNRVIHVTAVNQTKWWFAKRFLHPDIVAEYNYVFLWDEDLGVDNF

Query:  DPKQYVNIIKSEGLEISQPALDPYKSEVHHQITARGRRSTVHRRTFRPNNGGKGCDVNSKSPPCTGWIEMMAPVFSRAAWRCAWYMIQNDLIHAWGLDMQ
        DPKQYVNIIKSEGLEISQPALDPYKSEVHHQITARGRRSTVHRRTFRPNNGGKGCDVNSKSPPCTGWIEMMAPVFSRAAWRCAWYMIQNDLIHAWGLDMQ
Subjt:  DPKQYVNIIKSEGLEISQPALDPYKSEVHHQITARGRRSTVHRRTFRPNNGGKGCDVNSKSPPCTGWIEMMAPVFSRAAWRCAWYMIQNDLIHAWGLDMQ

Query:  LGYCAQGDRTKNVGVVDAEYVVHYGRPTLGGPEENETSSNSPVKDHRADVRRQSYIELGIFRTRWRNAVKQDECWKDPYPGTI
        LGYCAQGDRTKNVGVVDAEYVVHYGRPTLGGPEENETSSNSPVKDHRADVRRQSYIELGIFRTRWRNAVKQDECWKDPYPGT+
Subjt:  LGYCAQGDRTKNVGVVDAEYVVHYGRPTLGGPEENETSSNSPVKDHRADVRRQSYIELGIFRTRWRNAVKQDECWKDPYPGTI

A0A6J1F2L3 uncharacterized protein LOC111439192 isoform X2 | 1.2e-201 | 85.9
Query:  MKFSSCLPFLAEAKSRNSCLCTLFPTASLLCLVLFLGSAYVAPDYREKISRWGIDGLVGSKFNKCENQCRPYGSEPLPKDIVANASNLEMRPLWGASRKY
        MKFS CLP LAE KSRNS LC +FPT SLLCLVLF+GSAYVAP YRE+I RWGIDGLV SKFNKCENQCRP GSEPLPKDIV  ASNLEMRPLWGAS+  
Subjt:  MKFSSCLPFLAEAKSRNSCLCTLFPTASLLCLVLFLGSAYVAPDYREKISRWGIDGLVGSKFNKCENQCRPYGSEPLPKDIVANASNLEMRPLWGASRKY

Query:  YRNPVNSSSNLFTVAVGIKQKDLVNKMVTKFLSSDFAVMLFHYDSIMDEWKEFNWSNRVIHVTAVNQTKWWFAKRFLHPDIVAEYNYVFLWDEDLGVDNF
        Y+NPVNSSSNLF  AVGIKQKDLVNKMVTKFLSSDFAVMLFHYD I+DEWK+F+WSNRVIHVTA+NQTKWWFAKRFLHPDIVAEYNYVFLWDEDLGV+ F
Subjt:  YRNPVNSSSNLFTVAVGIKQKDLVNKMVTKFLSSDFAVMLFHYDSIMDEWKEFNWSNRVIHVTAVNQTKWWFAKRFLHPDIVAEYNYVFLWDEDLGVDNF

Query:  DPKQYVNIIKSEGLEISQPALDPYKSEVHHQITARGRRSTVHRRTFRPNNGGKGCDVNSKSPPCTGWIEMMAPVFSRAAWRCAWYMIQNDLIHAWGLDMQ
        +PKQYV+II SEGLEISQPALDPY+SEVHHQITARGRRSTVHRRTF+ +NGGK CDVNSK+PPCTGWIEMMAPVFSRAAWRC WYMIQNDLIH WGLDMQ
Subjt:  DPKQYVNIIKSEGLEISQPALDPYKSEVHHQITARGRRSTVHRRTFRPNNGGKGCDVNSKSPPCTGWIEMMAPVFSRAAWRCAWYMIQNDLIHAWGLDMQ

Query:  LGYCAQGDRTKNVGVVDAEYVVHYGRPTLGGPEENETSSNSPVKDHRADVRRQSYIELGIFRTRWRNAVKQDECWKDPYPGTI
        LGYCAQGDRTKNVGVVDAEY++HYGRPTLGGPEENETSS S VKDHRADVRRQSYIEL +FR RW+ A +QDECW+DPY  T+
Subjt:  LGYCAQGDRTKNVGVVDAEYVVHYGRPTLGGPEENETSSNSPVKDHRADVRRQSYIELGIFRTRWRNAVKQDECWKDPYPGTI

A0A6J1KNU9 uncharacterized protein LOC111495920 isoform X2 | 7.2e-204 | 86.42
Query:  MKFSSCLPFLAEAKSRNSCLCTLFPTASLLCLVLFLGSAYVAPDYREKISRWGIDGLVGSKFNKCENQCRPYGSEPLPKDIVANASNLEMRPLWGASRKY
        MKFS CLP LAE KSRNS LC +FP ASLLCLVLF+GSAYVAPDYRE+I RWGIDGLV SKFNKCENQCRP GSEPLPKDIV  ASNLEMRPLWGAS+  
Subjt:  MKFSSCLPFLAEAKSRNSCLCTLFPTASLLCLVLFLGSAYVAPDYREKISRWGIDGLVGSKFNKCENQCRPYGSEPLPKDIVANASNLEMRPLWGASRKY

Query:  YRNPVNSSSNLFTVAVGIKQKDLVNKMVTKFLSSDFAVMLFHYDSIMDEWKEFNWSNRVIHVTAVNQTKWWFAKRFLHPDIVAEYNYVFLWDEDLGVDNF
        Y+NPVNSSSNLF  AVGIKQKDLVNKMVTKFLSSDFAVMLFHYD I+DEWK+F+WSNRVIHVTA+NQTKWWFAKRFLHPDIVAEYNY+FLWDEDLGV+ F
Subjt:  YRNPVNSSSNLFTVAVGIKQKDLVNKMVTKFLSSDFAVMLFHYDSIMDEWKEFNWSNRVIHVTAVNQTKWWFAKRFLHPDIVAEYNYVFLWDEDLGVDNF

Query:  DPKQYVNIIKSEGLEISQPALDPYKSEVHHQITARGRRSTVHRRTFRPNNGGKGCDVNSKSPPCTGWIEMMAPVFSRAAWRCAWYMIQNDLIHAWGLDMQ
        +PKQYV+II+SEGLEISQPALDPY+SEVHHQITARGRRSTVHRRTF+ +NGGK CDVNSK+PPCTGWIEMMAPVFSRAAWRC WYMIQNDLIH WGLDMQ
Subjt:  DPKQYVNIIKSEGLEISQPALDPYKSEVHHQITARGRRSTVHRRTFRPNNGGKGCDVNSKSPPCTGWIEMMAPVFSRAAWRCAWYMIQNDLIHAWGLDMQ

Query:  LGYCAQGDRTKNVGVVDAEYVVHYGRPTLGGPEENETSSNSPVKDHRADVRRQSYIELGIFRTRWRNAVKQDECWKDPYPGTI
        LGYCAQGDRTKNVGVVDAEY++HYGRPTLGGPEENETSS S VKDHRADVRRQSYIEL +FR RW+ A KQDECW+DPYP T+
Subjt:  LGYCAQGDRTKNVGVVDAEYVVHYGRPTLGGPEENETSSNSPVKDHRADVRRQSYIELGIFRTRWRNAVKQDECWKDPYPGTI

SwissProt top hits | e value | % identity
No hits found

Arabidopsis top hits | e value | % identity
AT1G11170.1 Protein of unknown function (DUF707) | 4.8e-99 | 51.41
Query:  LPKDIVANASNLEMRPLWGASRKYYRNPVNSSSNLFTVAVGIKQKDLVNKMVTKFLSSDFAVMLFHYDSIMDEWKEFNWSNRVIHVTAVNQTKWWFAKRF
        LP+ I+ + S+LE++PLW       +    ++ NL  + VG+KQK  V+ +V KFL ++F ++LFHYD  MD+W +  WS++ IH+ A NQTKWWFAKRF
Subjt:  LPKDIVANASNLEMRPLWGASRKYYRNPVNSSSNLFTVAVGIKQKDLVNKMVTKFLSSDFAVMLFHYDSIMDEWKEFNWSNRVIHVTAVNQTKWWFAKRF

Query:  LHPDIVAEYNYVFLWDEDLGVDNFDPKQYVNIIKSEGLEISQPALDPYKSEVHHQITARGRRSTVHRRTFRPNNGGKGCDVNSKSPPCTGWIEMMAPVFS
        LHPD+V+ Y+Y+FLWDEDLGV+NF+P++Y+ I+KS GLEISQPALD   +E+HH+IT R +    HRR +  N G K C   S  PPCTG++E MAPVFS
Subjt:  LHPDIVAEYNYVFLWDEDLGVDNFDPKQYVNIIKSEGLEISQPALDPYKSEVHHQITARGRRSTVHRRTFRPNNGGKGCDVNSKSPPCTGWIEMMAPVFS

Query:  RAAWRCAWYMIQNDLIHAWGLDMQLGYCAQGDRTKNVGVVDAEYVVHYGRPTLGG--PEENETSSNSPVK-------DHRADVRRQSYIELGIFRTRWRN
        +AAW C W +IQNDL+H WG+DM+LGYCAQGDRTKNVG+VD+EY++H G  TLG   PE+ +T+ +   +       D R ++RRQS  EL  F+ RW  
Subjt:  RAAWRCAWYMIQNDLIHAWGLDMQLGYCAQGDRTKNVGVVDAEYVVHYGRPTLGG--PEENETSSNSPVK-------DHRADVRRQSYIELGIFRTRWRN

Query:  AVKQDECWKDPYPGTIASK
        AV++D  W DP   +  +K
Subjt:  AVKQDECWKDPYPGTIASK

AT1G61240.1 Protein of unknown function (DUF707) | 7.6e-97 | 52.75
Query:  LPKDIVANASNLEMRPLWGASRKYYRNPVNSSSNLFTVAVGIKQKDLVNKMVTKFLSSDFAVMLFHYDSIMDEWKEFNWSNRVIHVTAVNQTKWWFAKRF
        LP  I+   S+LE++PLW +S    ++   ++ NL  + VG+KQKD V+ +V KFL ++F V+LFHYD  MD+W +  WS++ IH+ A NQTKWWFAKRF
Subjt:  LPKDIVANASNLEMRPLWGASRKYYRNPVNSSSNLFTVAVGIKQKDLVNKMVTKFLSSDFAVMLFHYDSIMDEWKEFNWSNRVIHVTAVNQTKWWFAKRF

Query:  LHPDIVAEYNYVFLWDEDLGVDNFDPKQYVNIIKSEGLEISQPALDPYKSEVHHQITARGRRSTVHRRTFRPNNGGKGCDVNSKSPPCTGWIEMMAPVFS
        LHPDIV+ Y+YVFLWDEDLGV+NF+P++Y+ I+K+ GLEISQPAL P  +EVHH+IT R R    HRR +  + G   C   S+ PPCTG++E MAPVFS
Subjt:  LHPDIVAEYNYVFLWDEDLGVDNFDPKQYVNIIKSEGLEISQPALDPYKSEVHHQITARGRRSTVHRRTFRPNNGGKGCDVNSKSPPCTGWIEMMAPVFS

Query:  RAAWRCAWYMIQNDLIHAWGLDMQLGYCAQGDRTKNVGVVDAEYVVHYGRPTLGG---PEENETSSN-------SPVKDHRADVRRQSYIELGIFRTRWR
        R+AW C W +IQNDL+H WG+DM+LGYCAQGDR+K VG+VD+EY+ H G  TLGG   P++  ++ +       S   D R ++RRQS  EL  F+ RW 
Subjt:  RAAWRCAWYMIQNDLIHAWGLDMQLGYCAQGDRTKNVGVVDAEYVVHYGRPTLGG---PEENETSSN-------SPVKDHRADVRRQSYIELGIFRTRWR

Query:  NAVKQDECW
         AV +D+ W
Subjt:  NAVKQDECW

AT4G12840.1 Protein of unknown function (DUF707) | 3.2e-127 | 57.07
Query:  KSRNSCLCTLFPTASLLCLVLFLGSAYVAPDYREKISRW-GIDGLVGSKFNKCENQCRPYGSEPLPKDIVANASNLEMRPLWGASRKYYRNPVNSSSNLF
        K +   L  LF     L ++  +G+A++  DY+E I+ W  I  L  +K   C+ Q RP GSE LP+ IVA+ S+LEMRPLWGA R     P     +L 
Subjt:  KSRNSCLCTLFPTASLLCLVLFLGSAYVAPDYREKISRW-GIDGLVGSKFNKCENQCRPYGSEPLPKDIVANASNLEMRPLWGASRKYYRNPVNSSSNLF

Query:  TVAVGIKQKDLVNKMVTKFLSSDFAVMLFHYDSIMDEWKEFNWSNRVIHVTAVNQTKWWFAKRFLHPDIVAEYNYVFLWDEDLGVDNFDPKQYVNIIKSE
         +AVGI+QK+ VNK+V KF SS+F VMLFHYD  +DEWKEF WS+  IH++ VNQTKWWFAKRFLHPDIV+ Y+Y+FLWDEDLGVD+FD ++YV+IIK E
Subjt:  TVAVGIKQKDLVNKMVTKFLSSDFAVMLFHYDSIMDEWKEFNWSNRVIHVTAVNQTKWWFAKRFLHPDIVAEYNYVFLWDEDLGVDNFDPKQYVNIIKSE

Query:  GLEISQPALDPYKSEVHHQITARGRRSTVHRRTFRPNNGGKGCDVNSKSPPCTGWIEMMAPVFSRAAWRCAWYMIQNDLIHAWGLDMQLGYCAQGDRTKN
         LEISQPALDP  SEVHHQ+T+R ++S VHRRT++   G   C+ NS  PPCTG++EMMAPVFSRAAWRC W+MIQNDL H WG+D QLGYCAQGDRTKN
Subjt:  GLEISQPALDPYKSEVHHQITARGRRSTVHRRTFRPNNGGKGCDVNSKSPPCTGWIEMMAPVFSRAAWRCAWYMIQNDLIHAWGLDMQLGYCAQGDRTKN

Query:  VGVVDAEYVVHYGRPTL-GGPEENETSS--------------NSPVKDHRADVRRQSYIELGIFRTRWRNAVKQDECWKDPY
        +G+VD+EY++H G PTL GG  EN+T S              +S V   R +VR+Q+Y+EL  F+ RW+NAVK DECW D +
Subjt:  VGVVDAEYVVHYGRPTL-GGPEENETSS--------------NSPVKDHRADVRRQSYIELGIFRTRWRNAVKQDECWKDPY

AT4G12840.2 Protein of unknown function (DUF707) | 3.2e-127 | 57.07
Query:  KSRNSCLCTLFPTASLLCLVLFLGSAYVAPDYREKISRW-GIDGLVGSKFNKCENQCRPYGSEPLPKDIVANASNLEMRPLWGASRKYYRNPVNSSSNLF
        K +   L  LF     L ++  +G+A++  DY+E I+ W  I  L  +K   C+ Q RP GSE LP+ IVA+ S+LEMRPLWGA R     P     +L 
Subjt:  KSRNSCLCTLFPTASLLCLVLFLGSAYVAPDYREKISRW-GIDGLVGSKFNKCENQCRPYGSEPLPKDIVANASNLEMRPLWGASRKYYRNPVNSSSNLF

Query:  TVAVGIKQKDLVNKMVTKFLSSDFAVMLFHYDSIMDEWKEFNWSNRVIHVTAVNQTKWWFAKRFLHPDIVAEYNYVFLWDEDLGVDNFDPKQYVNIIKSE
         +AVGI+QK+ VNK+V KF SS+F VMLFHYD  +DEWKEF WS+  IH++ VNQTKWWFAKRFLHPDIV+ Y+Y+FLWDEDLGVD+FD ++YV+IIK E
Subjt:  TVAVGIKQKDLVNKMVTKFLSSDFAVMLFHYDSIMDEWKEFNWSNRVIHVTAVNQTKWWFAKRFLHPDIVAEYNYVFLWDEDLGVDNFDPKQYVNIIKSE

Query:  GLEISQPALDPYKSEVHHQITARGRRSTVHRRTFRPNNGGKGCDVNSKSPPCTGWIEMMAPVFSRAAWRCAWYMIQNDLIHAWGLDMQLGYCAQGDRTKN
         LEISQPALDP  SEVHHQ+T+R ++S VHRRT++   G   C+ NS  PPCTG++EMMAPVFSRAAWRC W+MIQNDL H WG+D QLGYCAQGDRTKN
Subjt:  GLEISQPALDPYKSEVHHQITARGRRSTVHRRTFRPNNGGKGCDVNSKSPPCTGWIEMMAPVFSRAAWRCAWYMIQNDLIHAWGLDMQLGYCAQGDRTKN

Query:  VGVVDAEYVVHYGRPTL-GGPEENETSS--------------NSPVKDHRADVRRQSYIELGIFRTRWRNAVKQDECWKDPY
        +G+VD+EY++H G PTL GG  EN+T S              +S V   R +VR+Q+Y+EL  F+ RW+NAVK DECW D +
Subjt:  VGVVDAEYVVHYGRPTL-GGPEENETSS--------------NSPVKDHRADVRRQSYIELGIFRTRWRNAVKQDECWKDPY

AT4G18530.1 Protein of unknown function (DUF707) | 1.2e-129 | 55.24
Query:  LAEAKSRNSCLCTLFPTASLLCLVLFLGSAYVAPDYREKISRWGIDGLVGSKFNK---------CENQCRPYGSEPLPKDIVANASNLEMRPLWGASRKY
        + +     SCLC++  T +L+C   F+ +AY+A D++EK+ +W I   + +  +K         C+N  +P G+E LP+ I+   SNLE + LW      
Subjt:  LAEAKSRNSCLCTLFPTASLLCLVLFLGSAYVAPDYREKISRWGIDGLVGSKFNK---------CENQCRPYGSEPLPKDIVANASNLEMRPLWGASRKY

Query:  YRNPVNSSSNLFTVAVGIKQKDLVNKMVTKFLSSDFAVMLFHYDSIMDEWKEFNWSNRVIHVTAVNQTKWWFAKRFLHPDIVAEYNYVFLWDEDLGVDNF
         R P N S +L  +AVGIKQK+LVNK++ KF   DFAVMLFHYD ++D+WK++ W+N  IHV+ +NQTKWWFAKRFLHPDIVAEY Y+FLWDEDLGV +F
Subjt:  YRNPVNSSSNLFTVAVGIKQKDLVNKMVTKFLSSDFAVMLFHYDSIMDEWKEFNWSNRVIHVTAVNQTKWWFAKRFLHPDIVAEYNYVFLWDEDLGVDNF

Query:  DPKQYVNIIKSEGLEISQPALDPYKSEVHHQITARGRRSTVHRRTFRPNNGGKGCDVNSKSPPCTGWIEMMAPVFSRAAWRCAWYMIQNDLIHAWGLDMQ
        +P++Y++I+K EGLEISQPALD  KSEVHH ITAR ++S VHRR ++    G+ CD +S +PPC GW+EMMAPVFSRAAWRC+WYMIQNDLIHAWGLD Q
Subjt:  DPKQYVNIIKSEGLEISQPALDPYKSEVHHQITARGRRSTVHRRTFRPNNGGKGCDVNSKSPPCTGWIEMMAPVFSRAAWRCAWYMIQNDLIHAWGLDMQ

Query:  LGYCAQGDRTKNVGVVDAEYVVHYGRPTLGGPE------ENETSSNSPVK------DHRADVRRQSYIELGIFRTRWRNAVKQDECWKDPY
        LGYCAQGDR KNVGVVDAEY++HYG PTLG  E       NET S S         D+R +VR +S++E+  F+ RW+ AV+ D CW DPY
Subjt:  LGYCAQGDRTKNVGVVDAEYVVHYGRPTLGGPE------ENETSSNSPVK------DHRADVRRQSYIELGIFRTRWRNAVKQDECWKDPY


Sequences

CDS sequence
ATGAAGTTCTCCAGTTGTTTGCCCTTTCTTGCAGAGGCTAAAAGTAGGAATTCATGTCTTTGTACCCTTTTTCCAACTGCTTCTTTGCTTTGTCTTGTACTGTTCTTGGG
GAGTGCATATGTAGCACCAGACTATAGGGAGAAAATTTCTAGATGGGGAATAGATGGTTTAGTTGGTTCAAAGTTCAATAAATGTGAGAATCAATGCAGGCCGTATGGAA
GCGAGCCGCTTCCTAAAGACATTGTTGCCAATGCATCTAACTTGGAAATGCGACCATTATGGGGTGCATCAAGGAAATATTATCGGAATCCTGTTAACTCATCAAGTAAT
TTATTTACTGTGGCCGTTGGGATCAAACAAAAAGATCTTGTGAATAAAATGGTAACAAAGTTTCTTTCTAGCGACTTTGCTGTGATGCTCTTCCATTACGATAGTATCAT
GGACGAGTGGAAGGAGTTTAATTGGAGTAACCGCGTAATACACGTAACAGCAGTCAATCAAACTAAATGGTGGTTTGCCAAGCGCTTCTTACATCCTGATATTGTGGCAG
AATATAACTATGTCTTTCTTTGGGATGAGGACCTTGGAGTTGACAATTTCGATCCGAAACAGTATGTAAATATTATTAAAAGTGAAGGGTTAGAGATATCACAACCAGCA
CTTGATCCATATAAATCAGAGGTGCACCATCAAATTACTGCTCGTGGGAGGCGATCGACAGTGCACAGGAGAACGTTTAGGCCTAATAATGGTGGAAAAGGTTGTGATGT
CAACAGTAAATCTCCTCCATGCACCGGATGGATAGAAATGATGGCCCCGGTTTTTTCCCGAGCGGCATGGCGTTGTGCTTGGTATATGATCCAGAATGATTTGATCCATG
CTTGGGGCTTGGATATGCAACTTGGATATTGTGCACAGGGCGATCGAACAAAGAACGTCGGTGTTGTTGACGCCGAGTATGTAGTCCATTATGGACGACCTACACTTGGT
GGTCCAGAAGAAAATGAGACATCTTCCAACTCTCCCGTAAAGGATCATAGAGCTGACGTGAGAAGACAGTCCTATATCGAACTAGGTATATTTAGAACAAGATGGAGGAA
TGCTGTTAAGCAAGACGAGTGCTGGAAAGATCCATACCCCGGGACGATTGCATCCAAAGCAATTACAGGATTTGGGAGTGAAAGAGCAGAAACCAAATGGCCTGTTGGGG
ATATTGCAATTGATGGCATAGCAACAAGGAGAAGGCAAGCACAGGCCCTTAGCGCCTCCACAGCAAGTGCAGGGCGCGCATATTTGGAAGCTCCCCAGTTTCCTCCTGGT
CTCATTCATCTCCATCATCTTCATCTTTCCCTCCTCAATCATCGCGATTCCGGCGAGAAAGAGCAGGAAAACGAGGGAGAATCCGGCGCCATATATAGACATCATGCAAT
TCCAAAGGAATTCGAACTGTGGAACTGTGAAATGGGAGTTTCAGAAACTCGTGGGGGAGATCGATTCAGAACATGA

mRNA sequence
ATGAAGTTCTCCAGTTGTTTGCCCTTTCTTGCAGAGGCTAAAAGTAGGAATTCATGTCTTTGTACCCTTTTTCCAACTGCTTCTTTGCTTTGTCTTGTACTGTTCTTGGG
GAGTGCATATGTAGCACCAGACTATAGGGAGAAAATTTCTAGATGGGGAATAGATGGTTTAGTTGGTTCAAAGTTCAATAAATGTGAGAATCAATGCAGGCCGTATGGAA
GCGAGCCGCTTCCTAAAGACATTGTTGCCAATGCATCTAACTTGGAAATGCGACCATTATGGGGTGCATCAAGGAAATATTATCGGAATCCTGTTAACTCATCAAGTAAT
TTATTTACTGTGGCCGTTGGGATCAAACAAAAAGATCTTGTGAATAAAATGGTAACAAAGTTTCTTTCTAGCGACTTTGCTGTGATGCTCTTCCATTACGATAGTATCAT
GGACGAGTGGAAGGAGTTTAATTGGAGTAACCGCGTAATACACGTAACAGCAGTCAATCAAACTAAATGGTGGTTTGCCAAGCGCTTCTTACATCCTGATATTGTGGCAG
AATATAACTATGTCTTTCTTTGGGATGAGGACCTTGGAGTTGACAATTTCGATCCGAAACAGTATGTAAATATTATTAAAAGTGAAGGGTTAGAGATATCACAACCAGCA
CTTGATCCATATAAATCAGAGGTGCACCATCAAATTACTGCTCGTGGGAGGCGATCGACAGTGCACAGGAGAACGTTTAGGCCTAATAATGGTGGAAAAGGTTGTGATGT
CAACAGTAAATCTCCTCCATGCACCGGATGGATAGAAATGATGGCCCCGGTTTTTTCCCGAGCGGCATGGCGTTGTGCTTGGTATATGATCCAGAATGATTTGATCCATG
CTTGGGGCTTGGATATGCAACTTGGATATTGTGCACAGGGCGATCGAACAAAGAACGTCGGTGTTGTTGACGCCGAGTATGTAGTCCATTATGGACGACCTACACTTGGT
GGTCCAGAAGAAAATGAGACATCTTCCAACTCTCCCGTAAAGGATCATAGAGCTGACGTGAGAAGACAGTCCTATATCGAACTAGGTATATTTAGAACAAGATGGAGGAA
TGCTGTTAAGCAAGACGAGTGCTGGAAAGATCCATACCCCGGGACGATTGCATCCAAAGCAATTACAGGATTTGGGAGTGAAAGAGCAGAAACCAAATGGCCTGTTGGGG
ATATTGCAATTGATGGCATAGCAACAAGGAGAAGGCAAGCACAGGCCCTTAGCGCCTCCACAGCAAGTGCAGGGCGCGCATATTTGGAAGCTCCCCAGTTTCCTCCTGGT
CTCATTCATCTCCATCATCTTCATCTTTCCCTCCTCAATCATCGCGATTCCGGCGAGAAAGAGCAGGAAAACGAGGGAGAATCCGGCGCCATATATAGACATCATGCAAT
TCCAAAGGAATTCGAACTGTGGAACTGTGAAATGGGAGTTTCAGAAACTCGTGGGGGAGATCGATTCAGAACATGA

Protein sequence
MKFSSCLPFLAEAKSRNSCLCTLFPTASLLCLVLFLGSAYVAPDYREKISRWGIDGLVGSKFNKCENQCRPYGSEPLPKDIVANASNLEMRPLWGASRKYYRNPVNSSSN
LFTVAVGIKQKDLVNKMVTKFLSSDFAVMLFHYDSIMDEWKEFNWSNRVIHVTAVNQTKWWFAKRFLHPDIVAEYNYVFLWDEDLGVDNFDPKQYVNIIKSEGLEISQPA
LDPYKSEVHHQITARGRRSTVHRRTFRPNNGGKGCDVNSKSPPCTGWIEMMAPVFSRAAWRCAWYMIQNDLIHAWGLDMQLGYCAQGDRTKNVGVVDAEYVVHYGRPTLG
GPEENETSSNSPVKDHRADVRRQSYIELGIFRTRWRNAVKQDECWKDPYPGTIASKAITGFGSERAETKWPVGDIAIDGIATRRRQAQALSASTASAGRAYLEAPQFPPG
LIHLHHLHLSLLNHRDSGEKEQENEGESGAIYRHHAIPKEFELWNCEMGVSETRGGDRFRT