CuGenDBv2

Cmc04g0098811 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc04g0098811
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: CCHC-type domain-containing protein
Genome location: CMiso1.1chr04:14520366..14526912
RNA-Seq Expression: Cmc04g0098811
Synteny: Cmc04g0098811
Gene Ontology terms:
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
  IPR001878 - Zinc finger, CCHC-type
  IPR036875 - Zinc finger, CCHC-type superfamily


Homology
GenBank top hits (columns: e-value, % identity, alignment)
KAA0038119.1 Zinc knuckle family protein isoform 1 [Cucumis melo var. makuwa] (e-value: 2.3e-259; identity: 100%)
Query:  MANSPDVDGDDDFSELYKEYTGPPRSTTVVSQEKTNTNKRSHAGSDEEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT
        MANSPDVDGDDDFSELYKEYTGPPRSTTVVSQEKTNTNKRSHAGSDEEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT
Subjt:  MANSPDVDGDDDFSELYKEYTGPPRSTTVVSQEKTNTNKRSHAGSDEEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT

Query:  QGCPSTLGSSRKSQDFFERVPARDKHVRAIFTDRVIQKIEKDVGCKIKMDEKFIIVSGKDRLILLKGLDAVNKLIKEDGDQKGSSSSHMSKSRSPDRSPA
        QGCPSTLGSSRKSQDFFERVPARDKHVRAIFTDRVIQKIEKDVGCKIKMDEKFIIVSGKDRLILLKGLDAVNKLIKEDGDQKGSSSSHMSKSRSPDRSPA
Subjt:  QGCPSTLGSSRKSQDFFERVPARDKHVRAIFTDRVIQKIEKDVGCKIKMDEKFIIVSGKDRLILLKGLDAVNKLIKEDGDQKGSSSSHMSKSRSPDRSPA

Query:  GSRSQRSDVHRSHSGPTNASQFQPRFSRQEKVVENRARDDLQKYPRSSVQAYGNDRVRGRSSHSKSPAHPPYSGSSFGSYDSYQNKGAYGRTEGWDNERR
        GSRSQRSDVHRSHSGPTNASQFQPRFSRQEKVVENRARDDLQKYPRSSVQAYGNDRVRGRSSHSKSPAHPPYSGSSFGSYDSYQNKGAYGRTEGWDNERR
Subjt:  GSRSQRSDVHRSHSGPTNASQFQPRFSRQEKVVENRARDDLQKYPRSSVQAYGNDRVRGRSSHSKSPAHPPYSGSSFGSYDSYQNKGAYGRTEGWDNERR

Query:  GSDLQSSRQFEYPAFPQSLEELEMEYKREATELGKIRDKEEDEENYKHRETIREMRESYTKKLTHLRGTHAKQWDEFLQLDAQRRQQQVHQQMAASSFGG
        GSDLQSSRQFEYPAFPQSLEELEMEYKREATELGKIRDKEEDEENYKHRETIREMRESYTKKLTHLRGTHAKQWDEFLQLDAQRRQQQVHQQMAASSFGG
Subjt:  GSDLQSSRQFEYPAFPQSLEELEMEYKREATELGKIRDKEEDEENYKHRETIREMRESYTKKLTHLRGTHAKQWDEFLQLDAQRRQQQVHQQMAASSFGG

Query:  YKQHNYSEYEGGSVNAHYEGANLAALDSRSKYQNHMENYPSRPHGNFGEFQRQRRDDYENAYKRY
        YKQHNYSEYEGGSVNAHYEGANLAALDSRSKYQNHMENYPSRPHGNFGEFQRQRRDDYENAYKRY
Subjt:  YKQHNYSEYEGGSVNAHYEGANLAALDSRSKYQNHMENYPSRPHGNFGEFQRQRRDDYENAYKRY

XP_008447472.1 PREDICTED: uncharacterized protein LOC103489910 [Cucumis melo] (e-value: 2.3e-259; identity: 100%)
Query:  MANSPDVDGDDDFSELYKEYTGPPRSTTVVSQEKTNTNKRSHAGSDEEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT
        MANSPDVDGDDDFSELYKEYTGPPRSTTVVSQEKTNTNKRSHAGSDEEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT
Subjt:  MANSPDVDGDDDFSELYKEYTGPPRSTTVVSQEKTNTNKRSHAGSDEEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT

Query:  QGCPSTLGSSRKSQDFFERVPARDKHVRAIFTDRVIQKIEKDVGCKIKMDEKFIIVSGKDRLILLKGLDAVNKLIKEDGDQKGSSSSHMSKSRSPDRSPA
        QGCPSTLGSSRKSQDFFERVPARDKHVRAIFTDRVIQKIEKDVGCKIKMDEKFIIVSGKDRLILLKGLDAVNKLIKEDGDQKGSSSSHMSKSRSPDRSPA
Subjt:  QGCPSTLGSSRKSQDFFERVPARDKHVRAIFTDRVIQKIEKDVGCKIKMDEKFIIVSGKDRLILLKGLDAVNKLIKEDGDQKGSSSSHMSKSRSPDRSPA

Query:  GSRSQRSDVHRSHSGPTNASQFQPRFSRQEKVVENRARDDLQKYPRSSVQAYGNDRVRGRSSHSKSPAHPPYSGSSFGSYDSYQNKGAYGRTEGWDNERR
        GSRSQRSDVHRSHSGPTNASQFQPRFSRQEKVVENRARDDLQKYPRSSVQAYGNDRVRGRSSHSKSPAHPPYSGSSFGSYDSYQNKGAYGRTEGWDNERR
Subjt:  GSRSQRSDVHRSHSGPTNASQFQPRFSRQEKVVENRARDDLQKYPRSSVQAYGNDRVRGRSSHSKSPAHPPYSGSSFGSYDSYQNKGAYGRTEGWDNERR

Query:  GSDLQSSRQFEYPAFPQSLEELEMEYKREATELGKIRDKEEDEENYKHRETIREMRESYTKKLTHLRGTHAKQWDEFLQLDAQRRQQQVHQQMAASSFGG
        GSDLQSSRQFEYPAFPQSLEELEMEYKREATELGKIRDKEEDEENYKHRETIREMRESYTKKLTHLRGTHAKQWDEFLQLDAQRRQQQVHQQMAASSFGG
Subjt:  GSDLQSSRQFEYPAFPQSLEELEMEYKREATELGKIRDKEEDEENYKHRETIREMRESYTKKLTHLRGTHAKQWDEFLQLDAQRRQQQVHQQMAASSFGG

Query:  YKQHNYSEYEGGSVNAHYEGANLAALDSRSKYQNHMENYPSRPHGNFGEFQRQRRDDYENAYKRY
        YKQHNYSEYEGGSVNAHYEGANLAALDSRSKYQNHMENYPSRPHGNFGEFQRQRRDDYENAYKRY
Subjt:  YKQHNYSEYEGGSVNAHYEGANLAALDSRSKYQNHMENYPSRPHGNFGEFQRQRRDDYENAYKRY

XP_011651535.1 uncharacterized protein LOC101215062 isoform X2 [Cucumis sativus] (e-value: 1.7e-254; identity: 98.28%)
Query:  MANSPDVDGDDDFSELYKEYTGPPRSTTVVSQEKTNTNKRSHAGSDEEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT
        MANSPDVD DDDFSELYKEYTGPPRSTTVV QEKTNTNKRSHAGSDEEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICK CGESGHFT
Subjt:  MANSPDVDGDDDFSELYKEYTGPPRSTTVVSQEKTNTNKRSHAGSDEEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT

Query:  QGCPSTLGSSRKSQDFFERVPARDKHVRAIFTDRVIQKIEKDVGCKIKMDEKFIIVSGKDRLILLKGLDAVNKLIKEDGDQKGSSSSHMSKSRSPDRSPA
        QGCPSTLGSSRKSQDFFERVPARDKHVRAIFTDRVIQKIEKDVGCKIKMDEKFIIVSGKDRLILLKGLDAVNKLIKEDGDQKGSSSSHMS+SRSPDRSPA
Subjt:  QGCPSTLGSSRKSQDFFERVPARDKHVRAIFTDRVIQKIEKDVGCKIKMDEKFIIVSGKDRLILLKGLDAVNKLIKEDGDQKGSSSSHMSKSRSPDRSPA

Query:  GSRSQRSDVHRSHSGPTNASQFQPRFSRQEKVVENRARDDLQKYPRSSVQAYGNDRVRGRSSHSKSPAHPPYSGSSFGSYDSYQNKGAYGRTEGWDNERR
        GSRSQRSDVHRSHSGPTNASQFQPRFSRQEKVVENRARDDLQKYPRSSVQAYGNDRVRGRSSHSKSPAHPPYSGSSFGSYDSYQNK AYGRTEGWDNERR
Subjt:  GSRSQRSDVHRSHSGPTNASQFQPRFSRQEKVVENRARDDLQKYPRSSVQAYGNDRVRGRSSHSKSPAHPPYSGSSFGSYDSYQNKGAYGRTEGWDNERR

Query:  GSDLQSSRQFEYPAFPQSLEELEMEYKREATELGKIRDKEEDEENYKHRETIREMRESYTKKLTHLRGTHAKQWDEFLQLDAQRRQQQVHQQMAASSFGG
        GSDLQSSRQFEYPAFPQSLEELE+EYKREATELGKIRDKEEDEENYK+RETIREMRESYTKKLTHLRGTHAKQWDEFLQLDAQRRQQQVHQQMAAS FGG
Subjt:  GSDLQSSRQFEYPAFPQSLEELEMEYKREATELGKIRDKEEDEENYKHRETIREMRESYTKKLTHLRGTHAKQWDEFLQLDAQRRQQQVHQQMAASSFGG

Query:  YKQHNYSEYEGGSVNAHYEGANLAALDSRSKYQNHMENYPSRPHGNFGEFQRQRRDDYENAYKRY
        YKQHNYSEYEGGSVNAHYEGANLAALDSRSKYQNHMENYPSRPHGNFGEFQRQRRDDYENAYKRY
Subjt:  YKQHNYSEYEGGSVNAHYEGANLAALDSRSKYQNHMENYPSRPHGNFGEFQRQRRDDYENAYKRY

XP_031738009.1 uncharacterized protein LOC101215062 isoform X1 [Cucumis sativus] (e-value: 5.5e-253; identity: 97.86%)
Query:  MANSPDVDGDDDFSELYKEYTGPPRSTTVVSQEKTNTNKRSHAGSDEEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT
        MANSPDVD DDDFSELYKEYTGPPRSTTVV QEKTNTNKRSHAGSDEEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICK CGESGHFT
Subjt:  MANSPDVDGDDDFSELYKEYTGPPRSTTVVSQEKTNTNKRSHAGSDEEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT

Query:  QGCPSTLGSSRKSQDFFERVPARDKHVRAIFTDRVIQKIEKDVGCKIKMDEKFIIVSGKDRLILLKGLDAVNKLIKEDGDQKGSSSSHMSKSRSPDRSPA
        QGCPSTLGSSRKSQDFFERVPARDKHVRAIFTDRVIQKIEKDVGCKIKMDEKFIIVSGKDRLILLKGLDAVNKLIKEDGDQKGSSSSHMS+SRSPDRSPA
Subjt:  QGCPSTLGSSRKSQDFFERVPARDKHVRAIFTDRVIQKIEKDVGCKIKMDEKFIIVSGKDRLILLKGLDAVNKLIKEDGDQKGSSSSHMSKSRSPDRSPA

Query:  GSRSQRSDVHRSHSGPTNASQFQPRFSRQEKVVENRARDDLQKYPRSSV--QAYGNDRVRGRSSHSKSPAHPPYSGSSFGSYDSYQNKGAYGRTEGWDNE
        GSRSQRSDVHRSHSGPTNASQFQPRFSRQEKVVENRARDDLQKYPRSSV  QAYGNDRVRGRSSHSKSPAHPPYSGSSFGSYDSYQNK AYGRTEGWDNE
Subjt:  GSRSQRSDVHRSHSGPTNASQFQPRFSRQEKVVENRARDDLQKYPRSSV--QAYGNDRVRGRSSHSKSPAHPPYSGSSFGSYDSYQNKGAYGRTEGWDNE

Query:  RRGSDLQSSRQFEYPAFPQSLEELEMEYKREATELGKIRDKEEDEENYKHRETIREMRESYTKKLTHLRGTHAKQWDEFLQLDAQRRQQQVHQQMAASSF
        RRGSDLQSSRQFEYPAFPQSLEELE+EYKREATELGKIRDKEEDEENYK+RETIREMRESYTKKLTHLRGTHAKQWDEFLQLDAQRRQQQVHQQMAAS F
Subjt:  RRGSDLQSSRQFEYPAFPQSLEELEMEYKREATELGKIRDKEEDEENYKHRETIREMRESYTKKLTHLRGTHAKQWDEFLQLDAQRRQQQVHQQMAASSF

Query:  GGYKQHNYSEYEGGSVNAHYEGANLAALDSRSKYQNHMENYPSRPHGNFGEFQRQRRDDYENAYKRY
        GGYKQHNYSEYEGGSVNAHYEGANLAALDSRSKYQNHMENYPSRPHGNFGEFQRQRRDDYENAYKRY
Subjt:  GGYKQHNYSEYEGGSVNAHYEGANLAALDSRSKYQNHMENYPSRPHGNFGEFQRQRRDDYENAYKRY

XP_038905439.1 uncharacterized protein LOC120091471 isoform X2 [Benincasa hispida] (e-value: 2.7e-247; identity: 95.05%)
Query:  MANSPDVDGDDDFSELYKEYTGPPRSTTVVSQEKTNTNKRSHAGSDEEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT
        MANSPDVD DDDFSELYKEYTGPPRSTTVVSQEKTNTNKRSHAGSDEE EPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT
Subjt:  MANSPDVDGDDDFSELYKEYTGPPRSTTVVSQEKTNTNKRSHAGSDEEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT

Query:  QGCPSTLGSSRKSQDFFERVPARDKHVRAIFTDRVIQKIEKDVGCKIKMDEKFIIVSGKDRLILLKGLDAVNKLIKEDGDQKGSSSSHMSKSRSPDRSPA
        QGCPSTLGS+RKSQDFFERVPARDKHVRAIFTDRV+QKIEKDVGCKIKMDEKFIIVSGKDRLIL+KG+DAV+KLIKE+GDQKGSSSSHMS+SRSPDRSP 
Subjt:  QGCPSTLGSSRKSQDFFERVPARDKHVRAIFTDRVIQKIEKDVGCKIKMDEKFIIVSGKDRLILLKGLDAVNKLIKEDGDQKGSSSSHMSKSRSPDRSPA

Query:  GSRSQRSDVHRSHSGPTNASQFQPRFSRQEKVVENRARDDLQKYPRSSVQAYGNDRVRGRSSHSKSPAHPPYSGSSFGSYDSYQNKGAYGRTEGWDNERR
        GSRSQRSDVHRSHSGPTN+SQFQPRFSRQEKVVENR RDDLQKY RSSVQAYGNDRVRGRSSHSKSPAHPPYSG S  SYDSYQNK AYGRTEGWDNERR
Subjt:  GSRSQRSDVHRSHSGPTNASQFQPRFSRQEKVVENRARDDLQKYPRSSVQAYGNDRVRGRSSHSKSPAHPPYSGSSFGSYDSYQNKGAYGRTEGWDNERR

Query:  GSDLQSSRQFEYPAFPQSLEELEMEYKREATELGKIRDKEEDEENYKHRETIREMRESYTKKLTHLRGTHAKQWDEFLQLDAQRRQQQVHQQMAASSFGG
        GSDLQSSRQFEYPAFPQSLEELEMEYKREA ELGKIRDKEEDEENYKHRETIREMRESYTKKLTH+RGTHAKQWDEFLQLDAQRRQQQVHQQMAAS F G
Subjt:  GSDLQSSRQFEYPAFPQSLEELEMEYKREATELGKIRDKEEDEENYKHRETIREMRESYTKKLTHLRGTHAKQWDEFLQLDAQRRQQQVHQQMAASSFGG

Query:  YKQHNYSEYEGGSVNAHYEGANLAALDSRSKYQNHMENYPSRPHGNFGEFQRQRRDDYENAYKRY
        YKQHNYSEYEGGSVN+HYEGANLAALDSRSKYQNHMENYPSRPHGNFGEFQRQRRDDYENA+KRY
Subjt:  YKQHNYSEYEGGSVNAHYEGANLAALDSRSKYQNHMENYPSRPHGNFGEFQRQRRDDYENAYKRY

TrEMBL top hits (columns: e-value, % identity, alignment)
A0A0A0L866 CCHC-type domain-containing protein (e-value: 8.3e-255; identity: 98.28%)
Query:  MANSPDVDGDDDFSELYKEYTGPPRSTTVVSQEKTNTNKRSHAGSDEEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT
        MANSPDVD DDDFSELYKEYTGPPRSTTVV QEKTNTNKRSHAGSDEEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICK CGESGHFT
Subjt:  MANSPDVDGDDDFSELYKEYTGPPRSTTVVSQEKTNTNKRSHAGSDEEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT

Query:  QGCPSTLGSSRKSQDFFERVPARDKHVRAIFTDRVIQKIEKDVGCKIKMDEKFIIVSGKDRLILLKGLDAVNKLIKEDGDQKGSSSSHMSKSRSPDRSPA
        QGCPSTLGSSRKSQDFFERVPARDKHVRAIFTDRVIQKIEKDVGCKIKMDEKFIIVSGKDRLILLKGLDAVNKLIKEDGDQKGSSSSHMS+SRSPDRSPA
Subjt:  QGCPSTLGSSRKSQDFFERVPARDKHVRAIFTDRVIQKIEKDVGCKIKMDEKFIIVSGKDRLILLKGLDAVNKLIKEDGDQKGSSSSHMSKSRSPDRSPA

Query:  GSRSQRSDVHRSHSGPTNASQFQPRFSRQEKVVENRARDDLQKYPRSSVQAYGNDRVRGRSSHSKSPAHPPYSGSSFGSYDSYQNKGAYGRTEGWDNERR
        GSRSQRSDVHRSHSGPTNASQFQPRFSRQEKVVENRARDDLQKYPRSSVQAYGNDRVRGRSSHSKSPAHPPYSGSSFGSYDSYQNK AYGRTEGWDNERR
Subjt:  GSRSQRSDVHRSHSGPTNASQFQPRFSRQEKVVENRARDDLQKYPRSSVQAYGNDRVRGRSSHSKSPAHPPYSGSSFGSYDSYQNKGAYGRTEGWDNERR

Query:  GSDLQSSRQFEYPAFPQSLEELEMEYKREATELGKIRDKEEDEENYKHRETIREMRESYTKKLTHLRGTHAKQWDEFLQLDAQRRQQQVHQQMAASSFGG
        GSDLQSSRQFEYPAFPQSLEELE+EYKREATELGKIRDKEEDEENYK+RETIREMRESYTKKLTHLRGTHAKQWDEFLQLDAQRRQQQVHQQMAAS FGG
Subjt:  GSDLQSSRQFEYPAFPQSLEELEMEYKREATELGKIRDKEEDEENYKHRETIREMRESYTKKLTHLRGTHAKQWDEFLQLDAQRRQQQVHQQMAASSFGG

Query:  YKQHNYSEYEGGSVNAHYEGANLAALDSRSKYQNHMENYPSRPHGNFGEFQRQRRDDYENAYKRY
        YKQHNYSEYEGGSVNAHYEGANLAALDSRSKYQNHMENYPSRPHGNFGEFQRQRRDDYENAYKRY
Subjt:  YKQHNYSEYEGGSVNAHYEGANLAALDSRSKYQNHMENYPSRPHGNFGEFQRQRRDDYENAYKRY

A0A1S3BHI0 uncharacterized protein LOC103489910 (e-value: 1.1e-259; identity: 100%)
Query:  MANSPDVDGDDDFSELYKEYTGPPRSTTVVSQEKTNTNKRSHAGSDEEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT
        MANSPDVDGDDDFSELYKEYTGPPRSTTVVSQEKTNTNKRSHAGSDEEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT
Subjt:  MANSPDVDGDDDFSELYKEYTGPPRSTTVVSQEKTNTNKRSHAGSDEEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT

Query:  QGCPSTLGSSRKSQDFFERVPARDKHVRAIFTDRVIQKIEKDVGCKIKMDEKFIIVSGKDRLILLKGLDAVNKLIKEDGDQKGSSSSHMSKSRSPDRSPA
        QGCPSTLGSSRKSQDFFERVPARDKHVRAIFTDRVIQKIEKDVGCKIKMDEKFIIVSGKDRLILLKGLDAVNKLIKEDGDQKGSSSSHMSKSRSPDRSPA
Subjt:  QGCPSTLGSSRKSQDFFERVPARDKHVRAIFTDRVIQKIEKDVGCKIKMDEKFIIVSGKDRLILLKGLDAVNKLIKEDGDQKGSSSSHMSKSRSPDRSPA

Query:  GSRSQRSDVHRSHSGPTNASQFQPRFSRQEKVVENRARDDLQKYPRSSVQAYGNDRVRGRSSHSKSPAHPPYSGSSFGSYDSYQNKGAYGRTEGWDNERR
        GSRSQRSDVHRSHSGPTNASQFQPRFSRQEKVVENRARDDLQKYPRSSVQAYGNDRVRGRSSHSKSPAHPPYSGSSFGSYDSYQNKGAYGRTEGWDNERR
Subjt:  GSRSQRSDVHRSHSGPTNASQFQPRFSRQEKVVENRARDDLQKYPRSSVQAYGNDRVRGRSSHSKSPAHPPYSGSSFGSYDSYQNKGAYGRTEGWDNERR

Query:  GSDLQSSRQFEYPAFPQSLEELEMEYKREATELGKIRDKEEDEENYKHRETIREMRESYTKKLTHLRGTHAKQWDEFLQLDAQRRQQQVHQQMAASSFGG
        GSDLQSSRQFEYPAFPQSLEELEMEYKREATELGKIRDKEEDEENYKHRETIREMRESYTKKLTHLRGTHAKQWDEFLQLDAQRRQQQVHQQMAASSFGG
Subjt:  GSDLQSSRQFEYPAFPQSLEELEMEYKREATELGKIRDKEEDEENYKHRETIREMRESYTKKLTHLRGTHAKQWDEFLQLDAQRRQQQVHQQMAASSFGG

Query:  YKQHNYSEYEGGSVNAHYEGANLAALDSRSKYQNHMENYPSRPHGNFGEFQRQRRDDYENAYKRY
        YKQHNYSEYEGGSVNAHYEGANLAALDSRSKYQNHMENYPSRPHGNFGEFQRQRRDDYENAYKRY
Subjt:  YKQHNYSEYEGGSVNAHYEGANLAALDSRSKYQNHMENYPSRPHGNFGEFQRQRRDDYENAYKRY

A0A5D3DAF0 Zinc knuckle family protein isoform 1 (e-value: 1.1e-259; identity: 100%)
Query:  MANSPDVDGDDDFSELYKEYTGPPRSTTVVSQEKTNTNKRSHAGSDEEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT
        MANSPDVDGDDDFSELYKEYTGPPRSTTVVSQEKTNTNKRSHAGSDEEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT
Subjt:  MANSPDVDGDDDFSELYKEYTGPPRSTTVVSQEKTNTNKRSHAGSDEEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT

Query:  QGCPSTLGSSRKSQDFFERVPARDKHVRAIFTDRVIQKIEKDVGCKIKMDEKFIIVSGKDRLILLKGLDAVNKLIKEDGDQKGSSSSHMSKSRSPDRSPA
        QGCPSTLGSSRKSQDFFERVPARDKHVRAIFTDRVIQKIEKDVGCKIKMDEKFIIVSGKDRLILLKGLDAVNKLIKEDGDQKGSSSSHMSKSRSPDRSPA
Subjt:  QGCPSTLGSSRKSQDFFERVPARDKHVRAIFTDRVIQKIEKDVGCKIKMDEKFIIVSGKDRLILLKGLDAVNKLIKEDGDQKGSSSSHMSKSRSPDRSPA

Query:  GSRSQRSDVHRSHSGPTNASQFQPRFSRQEKVVENRARDDLQKYPRSSVQAYGNDRVRGRSSHSKSPAHPPYSGSSFGSYDSYQNKGAYGRTEGWDNERR
        GSRSQRSDVHRSHSGPTNASQFQPRFSRQEKVVENRARDDLQKYPRSSVQAYGNDRVRGRSSHSKSPAHPPYSGSSFGSYDSYQNKGAYGRTEGWDNERR
Subjt:  GSRSQRSDVHRSHSGPTNASQFQPRFSRQEKVVENRARDDLQKYPRSSVQAYGNDRVRGRSSHSKSPAHPPYSGSSFGSYDSYQNKGAYGRTEGWDNERR

Query:  GSDLQSSRQFEYPAFPQSLEELEMEYKREATELGKIRDKEEDEENYKHRETIREMRESYTKKLTHLRGTHAKQWDEFLQLDAQRRQQQVHQQMAASSFGG
        GSDLQSSRQFEYPAFPQSLEELEMEYKREATELGKIRDKEEDEENYKHRETIREMRESYTKKLTHLRGTHAKQWDEFLQLDAQRRQQQVHQQMAASSFGG
Subjt:  GSDLQSSRQFEYPAFPQSLEELEMEYKREATELGKIRDKEEDEENYKHRETIREMRESYTKKLTHLRGTHAKQWDEFLQLDAQRRQQQVHQQMAASSFGG

Query:  YKQHNYSEYEGGSVNAHYEGANLAALDSRSKYQNHMENYPSRPHGNFGEFQRQRRDDYENAYKRY
        YKQHNYSEYEGGSVNAHYEGANLAALDSRSKYQNHMENYPSRPHGNFGEFQRQRRDDYENAYKRY
Subjt:  YKQHNYSEYEGGSVNAHYEGANLAALDSRSKYQNHMENYPSRPHGNFGEFQRQRRDDYENAYKRY

A0A6J1DRY4 uncharacterized protein LOC111023835 isoform X1 (e-value: 2.6e-240; identity: 91.22%)
Query:  MANSPDVDGDDDFSELYKEYTGPPRSTTVVSQEKTNTNKRSHAGSDEEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT
        MANSPDVD DDDFSELYKEYTGPPRST+V +QEKT T KRSHAGSDEEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT
Subjt:  MANSPDVDGDDDFSELYKEYTGPPRSTTVVSQEKTNTNKRSHAGSDEEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT

Query:  QGCPSTLGSSRKSQDFFERVPARDKHVRAIFTDRVIQKIEKDVGCKIKMDEKFIIVSGKDRLILLKGLDAVNKLIKEDGDQKGSSSSHMSKSRSPDRSPA
        QGCPSTLGS+RKSQDFFER+PARDKHVRA+FTD+V+QKIEKD+GCKIK+DEKFIIVSGKDRLILLKG+DAV+K+IKE+GDQKGSSSSHMS+SRSP+RSP 
Subjt:  QGCPSTLGSSRKSQDFFERVPARDKHVRAIFTDRVIQKIEKDVGCKIKMDEKFIIVSGKDRLILLKGLDAVNKLIKEDGDQKGSSSSHMSKSRSPDRSPA

Query:  GSRSQRSDVHRSHSGPTNASQFQPRFSRQEKVVENRARDDLQKYPRSSVQ--AYGNDRVRGRSSHSKSPAHPPYSGSSFGSYDSYQNKGAYGRTEGWDNE
        GSRSQRS+VHRSHSGPTNASQFQPRFSR+EKVVENR RDDLQKYPR S+Q  AYGNDR RGRSSHSKSPAHPPYSGSSF SYDSYQNK AYGRTEGWDNE
Subjt:  GSRSQRSDVHRSHSGPTNASQFQPRFSRQEKVVENRARDDLQKYPRSSVQ--AYGNDRVRGRSSHSKSPAHPPYSGSSFGSYDSYQNKGAYGRTEGWDNE

Query:  RRGSDLQSSRQFEYPAFPQSLEELEMEYKREATELGKIRDKEEDEENYKHRETIREMRESYTKKLTHLRGTHAKQWDEFLQLDAQRRQQQVHQQMAASSF
        RRGSDLQSSRQFEYPAFPQSLEELEMEYKREATELGKIRDKEEDEENYKHRETIREMRE+Y KKL HLRGTHAKQWDEFLQLDAQRRQQQVHQQMAAS F
Subjt:  RRGSDLQSSRQFEYPAFPQSLEELEMEYKREATELGKIRDKEEDEENYKHRETIREMRESYTKKLTHLRGTHAKQWDEFLQLDAQRRQQQVHQQMAASSF

Query:  GGYKQHNYSEYEGGSVNAHYEGANLAALDSRSKYQNHMENYPSRPHGNFGEFQRQRRDDYENAYKRY
         GYKQHNYSEY+GGSVN+HY+GANLA LDSRSKY NHMENYPSRPHGNFGEFQRQRRDDY NAYKRY
Subjt:  GGYKQHNYSEYEGGSVNAHYEGANLAALDSRSKYQNHMENYPSRPHGNFGEFQRQRRDDYENAYKRY

A0A6J1DWP5 uncharacterized protein LOC111023835 isoform X2 (e-value: 8.1e-242; identity: 91.61%)
Query:  MANSPDVDGDDDFSELYKEYTGPPRSTTVVSQEKTNTNKRSHAGSDEEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT
        MANSPDVD DDDFSELYKEYTGPPRST+V +QEKT T KRSHAGSDEEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT
Subjt:  MANSPDVDGDDDFSELYKEYTGPPRSTTVVSQEKTNTNKRSHAGSDEEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT

Query:  QGCPSTLGSSRKSQDFFERVPARDKHVRAIFTDRVIQKIEKDVGCKIKMDEKFIIVSGKDRLILLKGLDAVNKLIKEDGDQKGSSSSHMSKSRSPDRSPA
        QGCPSTLGS+RKSQDFFER+PARDKHVRA+FTD+V+QKIEKD+GCKIK+DEKFIIVSGKDRLILLKG+DAV+K+IKE+GDQKGSSSSHMS+SRSP+RSP 
Subjt:  QGCPSTLGSSRKSQDFFERVPARDKHVRAIFTDRVIQKIEKDVGCKIKMDEKFIIVSGKDRLILLKGLDAVNKLIKEDGDQKGSSSSHMSKSRSPDRSPA

Query:  GSRSQRSDVHRSHSGPTNASQFQPRFSRQEKVVENRARDDLQKYPRSSVQAYGNDRVRGRSSHSKSPAHPPYSGSSFGSYDSYQNKGAYGRTEGWDNERR
        GSRSQRS+VHRSHSGPTNASQFQPRFSR+EKVVENR RDDLQKYPR S+QAYGNDR RGRSSHSKSPAHPPYSGSSF SYDSYQNK AYGRTEGWDNERR
Subjt:  GSRSQRSDVHRSHSGPTNASQFQPRFSRQEKVVENRARDDLQKYPRSSVQAYGNDRVRGRSSHSKSPAHPPYSGSSFGSYDSYQNKGAYGRTEGWDNERR

Query:  GSDLQSSRQFEYPAFPQSLEELEMEYKREATELGKIRDKEEDEENYKHRETIREMRESYTKKLTHLRGTHAKQWDEFLQLDAQRRQQQVHQQMAASSFGG
        GSDLQSSRQFEYPAFPQSLEELEMEYKREATELGKIRDKEEDEENYKHRETIREMRE+Y KKL HLRGTHAKQWDEFLQLDAQRRQQQVHQQMAAS F G
Subjt:  GSDLQSSRQFEYPAFPQSLEELEMEYKREATELGKIRDKEEDEENYKHRETIREMRESYTKKLTHLRGTHAKQWDEFLQLDAQRRQQQVHQQMAASSFGG

Query:  YKQHNYSEYEGGSVNAHYEGANLAALDSRSKYQNHMENYPSRPHGNFGEFQRQRRDDYENAYKRY
        YKQHNYSEY+GGSVN+HY+GANLA LDSRSKY NHMENYPSRPHGNFGEFQRQRRDDY NAYKRY
Subjt:  YKQHNYSEYEGGSVNAHYEGANLAALDSRSKYQNHMENYPSRPHGNFGEFQRQRRDDYENAYKRY

SwissProt top hits (columns: e-value, % identity, alignment)
No hits found
Arabidopsis top hits (columns: e-value, % identity, alignment)
AT3G62330.1 Zinc knuckle (CCHC-type) family protein (e-value: 3.8e-135; identity: 58.95%)
Query:  DVDGDDDFSELYKEYTGPPRSTT---VVSQEKTNTNKRSHAGSDEEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFTQG
        D + DDDFSE+YKEYTGP  + T   +  ++K    +      +EE++  DPN+VPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFTQG
Subjt:  DVDGDDDFSELYKEYTGPPRSTT---VVSQEKTNTNKRSHAGSDEEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFTQG

Query:  CPSTLGSSRKSQDFFERVPARDKHVRAIFTDRVIQKIEKDVGCKIKMDEKFIIVSGKDRLILLKGLDAVNKLIKEDGDQKGSSSSHMSKSRSPDRSPAG-
        CPSTLG++RKSQ+FFERVPARD +VR +FT++V++ IE++  CKIK+DEKFIIVSGKDRLIL KG+DAV+K +KEDG+ K SS SH S+SRSP R+  G 
Subjt:  CPSTLGSSRKSQDFFERVPARDKHVRAIFTDRVIQKIEKDVGCKIKMDEKFIIVSGKDRLILLKGLDAVNKLIKEDGDQKGSSSSHMSKSRSPDRSPAG-

Query:  SRSQRSDVHRSHSGPTNASQFQPRFSRQEKVVEN------RARDDLQKYPRSSVQAYGNDRVRGRSSHSKSPAHPPYSGSSFGSYDSYQNKGAYGRTEGW
        SR++ S+  R       +S F  R  RQ+K V+N      R R++ +  PR S QAYG+DR R RS+HSKSP  P YSG     YD  + + +  R+E W
Subjt:  SRSQRSDVHRSHSGPTNASQFQPRFSRQEKVVEN------RARDDLQKYPRSSVQAYGNDRVRGRSSHSKSPAHPPYSGSSFGSYDSYQNKGAYGRTEGW

Query:  DNERRG--SDLQSSRQFEYPAFPQSLEELEMEYKREATELGKIRDKEEDEENYKHRETIREMRESYTKKLTHLRGTHAKQWDEFLQLDAQRRQQQVHQQM
        D ER G  SD+Q S QFE P FPQ+LEELE+EY R+A EL K RDKEEDEEN KHRETIRE+RESY KKL  LRG +AKQWD+FLQLDAQRRQQQ  QQ 
Subjt:  DNERRG--SDLQSSRQFEYPAFPQSLEELEMEYKREATELGKIRDKEEDEENYKHRETIREMRESYTKKLTHLRGTHAKQWDEFLQLDAQRRQQQVHQQM

Query:  AASSFGGYKQH-NYSEYEGG-SVNAHYEGANLAALDSRSKYQNHMENYPSR-PHGNFGEFQRQRRDDYENAYKRY
        +  S+G Y+Q   Y+E++ G S N    G N   +DS+ +Y NH +NY SR    N+G FQRQRR++Y  AY RY
Subjt:  AASSFGGYKQH-NYSEYEGG-SVNAHYEGANLAALDSRSKYQNHMENYPSR-PHGNFGEFQRQRRDDYENAYKRY
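The % identity values listed for each hit appear to be consistent with counting exact-match columns in the alignment midline (letters, as opposed to `+` for a conservative substitution or a space for a mismatch or gap) and dividing by the alignment length. The following is a minimal sketch under that assumption; the function name `percent_identity` is hypothetical and is not part of CuGenDB or BLAST.

```python
def percent_identity(midline: str) -> float:
    """Percent identity from a BLAST-style midline.

    Assumes letters mark identical residues, while '+' and spaces mark
    non-identical or gapped columns. Trailing spaces are significant,
    so the midline must not be stripped before it is passed in.
    """
    if not midline:
        raise ValueError("midline must not be empty")
    matches = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * matches / len(midline)


# First 20 columns of the midline for hit XP_011651535.1 (one mismatch).
fragment = "MANSPDVD DDDFSELYKEY"
print(round(percent_identity(fragment), 2))  # prints 95.0
```

To reproduce a full-length value, concatenate the midline segments of every block of one hit (without the leading indentation) before calling the function.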


Sequences
CDS sequence
ATGGCAAATTCACCAGATGTGGATGGAGATGATGACTTTAGTGAACTCTACAAGGAATACACAGGCCCTCCACGATCGACCACTGTTGTTTCACAAGAGAAGACGAATAC
AAATAAAAGGTCTCATGCCGGTTCCGATGAGGAGGATGAACCTCGTGATCCCAATGCTGTGCCAACTGATTTTACCAGCCGAGAAGCCAAGGTTTGGGAGGCCAAGTCAA
AAGCTACAGAGAGGAATTGGAAGAAGAGAAAAGAGGAAGAAATGATCTGCAAAATATGTGGTGAATCAGGCCATTTTACTCAGGGATGCCCCTCAACGTTGGGATCAAGT
CGTAAATCTCAAGATTTTTTTGAAAGGGTACCAGCCAGGGATAAACACGTGAGAGCAATTTTCACTGATAGAGTAATACAGAAGATAGAAAAGGACGTTGGTTGTAAGAT
CAAGATGGATGAGAAATTCATAATTGTTAGTGGCAAGGACAGGTTAATTTTGTTAAAGGGATTGGATGCAGTCAACAAGTTAATTAAGGAGGACGGCGATCAAAAGGGTT
CTTCTAGTTCTCATATGAGTAAATCCAGGTCACCTGATCGAAGCCCTGCTGGTTCAAGATCACAACGTTCTGATGTCCATAGATCACATTCTGGTCCTACAAATGCATCA
CAATTTCAACCTAGGTTTAGCAGACAGGAGAAAGTTGTTGAAAACCGTGCTCGTGATGATTTGCAGAAATATCCAAGGAGCTCGGTTCAAGCTTATGGCAATGACAGAGT
TAGAGGTCGTTCAAGCCACTCAAAGTCTCCAGCTCATCCACCTTATTCTGGCAGCTCGTTTGGTTCATATGATAGTTATCAGAACAAGGGTGCATATGGTAGAACCGAAG
GATGGGACAATGAGAGAAGAGGATCCGATTTGCAATCTAGTCGTCAGTTCGAGTATCCAGCTTTTCCCCAATCTCTTGAAGAACTGGAGATGGAGTATAAAAGGGAAGCA
ACTGAACTTGGAAAGATTCGTGATAAAGAAGAAGATGAAGAAAATTATAAACACCGTGAGACTATAAGGGAGATGAGAGAGAGCTACACGAAGAAATTGACTCATTTGAG
GGGCACACATGCAAAGCAGTGGGATGAGTTTCTCCAACTTGATGCCCAAAGGCGTCAGCAACAAGTGCACCAGCAAATGGCCGCTTCAAGTTTTGGTGGTTATAAGCAGC
ATAACTATTCTGAATATGAAGGTGGCTCAGTCAATGCCCATTACGAAGGGGCTAATTTGGCGGCCCTCGATTCAAGAAGCAAGTATCAAAATCACATGGAGAATTATCCT
TCAAGACCTCATGGTAATTTTGGGGAGTTTCAACGTCAGAGGCGCGATGATTACGAGAATGCTTACAAACGATACTAA
mRNA sequence
GGGGTGTATTAGTAATTTAACTAATATCATCAAACCGTTCAAACCCTATCGGAGTCCCTGGTTTTATTCCGCGGCAGTTCCACCCATAAACACACAGACCCTCTACCTTT
CTTCCGGCAGCAGCCTCCGCCTCCGGCAGCTTCTTCACTTCGCCGACAAGGAGCTCAACTGAATTTGCTCTTTTTGTGATGGCAAATTCACCAGATGTGGATGGAGATGA
TGACTTTAGTGAACTCTACAAGGAATACACAGGCCCTCCACGATCGACCACTGTTGTTTCACAAGAGAAGACGAATACAAATAAAAGGTCTCATGCCGGTTCCGATGAGG
AGGATGAACCTCGTGATCCCAATGCTGTGCCAACTGATTTTACCAGCCGAGAAGCCAAGGTTTGGGAGGCCAAGTCAAAAGCTACAGAGAGGAATTGGAAGAAGAGAAAA
GAGGAAGAAATGATCTGCAAAATATGTGGTGAATCAGGCCATTTTACTCAGGGATGCCCCTCAACGTTGGGATCAAGTCGTAAATCTCAAGATTTTTTTGAAAGGGTACC
AGCCAGGGATAAACACGTGAGAGCAATTTTCACTGATAGAGTAATACAGAAGATAGAAAAGGACGTTGGTTGTAAGATCAAGATGGATGAGAAATTCATAATTGTTAGTG
GCAAGGACAGGTTAATTTTGTTAAAGGGATTGGATGCAGTCAACAAGTTAATTAAGGAGGACGGCGATCAAAAGGGTTCTTCTAGTTCTCATATGAGTAAATCCAGGTCA
CCTGATCGAAGCCCTGCTGGTTCAAGATCACAACGTTCTGATGTCCATAGATCACATTCTGGTCCTACAAATGCATCACAATTTCAACCTAGGTTTAGCAGACAGGAGAA
AGTTGTTGAAAACCGTGCTCGTGATGATTTGCAGAAATATCCAAGGAGCTCGGTTCAAGCTTATGGCAATGACAGAGTTAGAGGTCGTTCAAGCCACTCAAAGTCTCCAG
CTCATCCACCTTATTCTGGCAGCTCGTTTGGTTCATATGATAGTTATCAGAACAAGGGTGCATATGGTAGAACCGAAGGATGGGACAATGAGAGAAGAGGATCCGATTTG
CAATCTAGTCGTCAGTTCGAGTATCCAGCTTTTCCCCAATCTCTTGAAGAACTGGAGATGGAGTATAAAAGGGAAGCAACTGAACTTGGAAAGATTCGTGATAAAGAAGA
AGATGAAGAAAATTATAAACACCGTGAGACTATAAGGGAGATGAGAGAGAGCTACACGAAGAAATTGACTCATTTGAGGGGCACACATGCAAAGCAGTGGGATGAGTTTC
TCCAACTTGATGCCCAAAGGCGTCAGCAACAAGTGCACCAGCAAATGGCCGCTTCAAGTTTTGGTGGTTATAAGCAGCATAACTATTCTGAATATGAAGGTGGCTCAGTC
AATGCCCATTACGAAGGGGCTAATTTGGCGGCCCTCGATTCAAGAAGCAAGTATCAAAATCACATGGAGAATTATCCTTCAAGACCTCATGGTAATTTTGGGGAGTTTCA
ACGTCAGAGGCGCGATGATTACGAGAATGCTTACAAACGATACTAATTTGTTAACCAGGAAGAAGTTGGATTCAGATCCATCTTGGGTAGGGCATAATGCTTGGGAGGAA
CGCCTAATGAAAGGTAAGTATGTATCTTACGGTCTTGGACCAATTAGCATTCGTAGTAGCCTACTACCAGTTGTTTTTATAGTTAAGCTTGATAGTTCATCTGGTTCAGA
ACTTGTTGTAAAGAGCGGCATGCAATGCCATCTTGGATTTATTAGTCAGATGAAGTTCTCATGAACGGTTGAGATTTTGAATTGGTTATAAATTGATACCTTGGCAAACT
TTTCTTTCTACTGTTTTCCCTATCTAATTGTTGGGAACAAGCGGGATGACTGTATTGAAACAGTCCTTCACTGTCATGGAAGCTCAGGATAGTTGACCTTTACCGGTTGT
TAAGTTTAAAAACTGTTTATCACTGCTATCTGGACTTGGGAGTGTTGAATAGTGTACTAGTTGTGATCCGTTATAAATTGGTAATGCCACACAGTTCACCGATTATGTTT
TATATACTTATCTGAATCTGTAAACGCTTTTCCAGAATCCCACAACCCATAACCCATAAACTATAGGTATGGCTTGGTCCATATGTTTTTGGCTGGGAAATGAATGGATT
TTGATGTAGAGAAGGGGGAATTCTTTAATGTTTCGTGTGGTCTTCTTTTTCTTTCTTATTCATATGAAATAGGAATGTCACAGAAGGGTATAGGTTTTCTTAGTATATAA
TCTTTAGAAATGACTGTTTTACTTTAAATTTTTATTTCATCAGAACTGCTTAATTTGAAGAAATGCGTCTGGAAAAGTAGTAAGTAAATGTAAGTAGAAGCATTTGGTTC
TACCGGATATTAAGATATTAATTGAAGCTAATGGAAAATATTACATTTGCCATCATCATTCCCCTACTTTATGGTATTGAACAAATCGCCTGAATTCCTGTGATGACATA
ATAGGCCCCAAATTGAACGTTCGACTTCCAGTTAGGTCATGCAAATTACCATTGAGATAAGCTCGTTTATGCATATAATACACCCAATTGTTGAGAAGGAACTGACTAAA
ACTTTTCTTACACATTAAGGTTTTGGAGTTTTGGGTCTTTTCATTAGCCTGCATCAGAATATTCCATCAGATGAAATTATTTTGTATGTACTTGTGAAGGCCAGGAAGTG
ACATGTTTTGATATTAAATAACTGCTTTAACAGGGGACTAGTGTGGATTATTGGGGTAAACTAGAAGTTTGGGTAAAGTCTATCTTAATTATAGTTTTTTAAGTGTCTAG
TTGTGTTATTATAAGTAGTGATGAATTCTGTTGACTTGCATGGGGTGGTCAAGGGTGGGCTTCAACTTATAGGGAAGAAGCCTTTCAATGGATAGAACCATTTTATGTGG
ATAATACATATTCTAGTTATTTTATTTGATAACGACCTATTAAGATGTTCATAACCAAGAAATATTGGATACTTCGCTGGGTATGTTCGATTCTCTTATTTCCCTTTTTA
TCTATATAACGCAAATGGGGAGAAACGACAGGAATATATTTTGGAACATTCATTTGGAGGATAGGAGAGGAACATATATTTATTTTGGTCATGGTGCTAGGAAATCCTAG
GATAACACCTGAACACTTTTGGTTGATTTGTATTTCTATTATTGAAGGTTCCATGTGTGTTTGATTCTAGAAGGTCAAAACTGTGTGAGCGCTTAATTCTTCCAATCTCT
TGTGACTAATATCAGGTTTTATTTTATTTATATTTATTATCCAAGGGATTCGTAGGTTTAGGAAGAGATTTCATTACTATTTCAATTTCAATTCAAATAGGAGTTTATTC
CTCCGATTTGGCTTTGTACTCGATAATAGTTAGATTTTCTCAAATGATATCGTAAAATATCAATTCAAGATAAAAAGGTTGTGTATTTTCGATTTCTTAGCTTTCAAATG
CTGTACCTATTAGGATAAATCAGTGACGTGGAA
Protein sequence
MANSPDVDGDDDFSELYKEYTGPPRSTTVVSQEKTNTNKRSHAGSDEEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFTQGCPSTLGSS
RKSQDFFERVPARDKHVRAIFTDRVIQKIEKDVGCKIKMDEKFIIVSGKDRLILLKGLDAVNKLIKEDGDQKGSSSSHMSKSRSPDRSPAGSRSQRSDVHRSHSGPTNAS
QFQPRFSRQEKVVENRARDDLQKYPRSSVQAYGNDRVRGRSSHSKSPAHPPYSGSSFGSYDSYQNKGAYGRTEGWDNERRGSDLQSSRQFEYPAFPQSLEELEMEYKREA
TELGKIRDKEEDEENYKHRETIREMRESYTKKLTHLRGTHAKQWDEFLQLDAQRRQQQVHQQMAASSFGGYKQHNYSEYEGGSVNAHYEGANLAALDSRSKYQNHMENYP
SRPHGNFGEFQRQRRDDYENAYKRY
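
The CDS and protein sequences above can be cross-checked by translating the CDS with the standard genetic code. The sketch below assumes the standard nuclear code and an in-frame CDS starting at the first base; `translate` and `CODON_TABLE` are illustrative names, not part of any CuGenDB tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First nine codons of the CDS listed above.
print(translate("ATGGCAAATTCACCAGATGTGGATGGA"))  # prints MANSPDVDG
```

Pasting the full CDS into `translate` and comparing the result with the listed protein sequence (after joining its lines) is a quick consistency check on the record.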