CuGenDBv2

MC11g0630 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC11g0630
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: Protein of unknown function (DUF1005)
Genome location: MC11:5062449..5066153
RNA-Seq Expression: MC11g0630
Synteny: MC11g0630
Gene Ontology terms: NA
InterPro domains: IPR010410 - Protein of unknown function DUF1005
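
The genome location field above packs a sequence ID and a coordinate range into one string. A minimal sketch of how it could be split apart, assuming the coordinates are 1-based and inclusive (the page does not state this); `Location` and `parse_location` are hypothetical names introduced only for this illustration:

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed 'seqid:start..end' location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: both coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Split a string such as 'MC11:5062449..5066153' into its parts."""
    match = re.fullmatch(r"\s*([^:\s]+):(\d+)\.\.(\d+)\s*", text)
    if match is None:
        raise ValueError(f"not a 'seqid:start..end' location: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError(f"start {start} is greater than end {end}")
    return Location(seqid, start, end)


loc = parse_location("MC11:5062449..5066153")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 3,705 bases on MC11.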


Homology
GenBank top hits | e value | % identity | Alignment
KAG7024602.1 hypothetical protein SDJN02_13420, partial [Cucurbita argyrosperma subsp. argyrosperma] | 3.62e-314 | 94.85
Query:  MDPCPFVRLTVGNLALKVPVASKPARSLVHPSSSPCFCKIKFRKLPLQTAVVPFIQPENQFPDGQVQSTAAATFHLSKSDLDKLAGKSLFASKPCLKISI
        MDPCPFVRLTVGNLALKVPVASKPARS+VHPSSSPCFCKIKF+KLP+QTAVVPFIQP NQFPDGQV STAA TFHLSK DLDKLAGKSLFASKPCLKISI
Subjt:  MDPCPFVRLTVGNLALKVPVASKPARSLVHPSSSPCFCKIKFRKLPLQTAVVPFIQPENQFPDGQVQSTAAATFHLSKSDLDKLAGKSLFASKPCLKISI

Query:  YSGRRGTTCGVDSGRLLGKVSVPLDLAGTESRATVFHNGWISVGKESKGSCAQFHLNVKAEPDPRFVFQFDGEPECSPQVFQIQGNIRQPVFTCKFSFRT
        YSGRRGTTCGVDSGRLLGKVSVPLDLAGTE+RATVFHNGWISVGKESK SCAQFHLNVKAEPDPRFVFQFDGEPECSPQVFQIQGNIRQPVFTCKFSFRT
Subjt:  YSGRRGTTCGVDSGRLLGKVSVPLDLAGTESRATVFHNGWISVGKESKGSCAQFHLNVKAEPDPRFVFQFDGEPECSPQVFQIQGNIRQPVFTCKFSFRT

Query:  GDRTQRSRSLPTESSSSRGWLSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRSNPGSWLILRPGDGTWKPWGRLEAWRERGG
        GDR QRSRSLPTESS SRGWLSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRSNPGSWLILRPGDGTWKPWGRLEAWRERGG
Subjt:  GDRTQRSRSLPTESSSSRGWLSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRSNPGSWLILRPGDGTWKPWGRLEAWRERGG

Query:  SDGLGYRFELMPDTSGGMSAAGIVLAESALNSNKGGKFVIDLGGNSNGRLTPGNLTSPACSPRSSGDYGYGLWPYCVYRGFVMAASVEGEGKCSKPTVEV
        SDGLGYRFEL+PDT+GGMSAAGIVLAESALN NKGGKFVIDLGG+SNGR TP N TSPACSPRSSGDYGYGLWPYCVYRGFVM ASVEGEGKCSKP VEV
Subjt:  SDGLGYRFELMPDTSGGMSAAGIVLAESALNSNKGGKFVIDLGGNSNGRLTPGNLTSPACSPRSSGDYGYGLWPYCVYRGFVMAASVEGEGKCSKPTVEV

Query:  SVQHVNCTEDAAAFVALAAAIDLSMDACRLFSHRLRKELCQPLDLLA
        SVQHVNCTEDAAAFVALAAAIDLSMDACRLFS +LRKELCQPLDL+A
Subjt:  SVQHVNCTEDAAAFVALAAAIDLSMDACRLFSHRLRKELCQPLDLLA

XP_022936339.1 uncharacterized protein LOC111442989 [Cucurbita moschata] | 1.89e-314 | 94.85
Query:  MDPCPFVRLTVGNLALKVPVASKPARSLVHPSSSPCFCKIKFRKLPLQTAVVPFIQPENQFPDGQVQSTAAATFHLSKSDLDKLAGKSLFASKPCLKISI
        MDPCPFVRLTVGNLALKVPVASKPARS+VHPSSSPCFCKIKF+KLP+QTAVVPFIQP+NQFPDGQVQSTAA TFHLSK DLDKLAGKSLFASKPCLKISI
Subjt:  MDPCPFVRLTVGNLALKVPVASKPARSLVHPSSSPCFCKIKFRKLPLQTAVVPFIQPENQFPDGQVQSTAAATFHLSKSDLDKLAGKSLFASKPCLKISI

Query:  YSGRRGTTCGVDSGRLLGKVSVPLDLAGTESRATVFHNGWISVGKESKGSCAQFHLNVKAEPDPRFVFQFDGEPECSPQVFQIQGNIRQPVFTCKFSFRT
        YSGRRGTTCGVDSGRLLGKVSVPLDLAGTE+RATVFHNGWISVGKESK SCAQFHLNVKAEPDPRFVFQFDGEPECSPQVFQIQGNIRQPVFTCKFSFRT
Subjt:  YSGRRGTTCGVDSGRLLGKVSVPLDLAGTESRATVFHNGWISVGKESKGSCAQFHLNVKAEPDPRFVFQFDGEPECSPQVFQIQGNIRQPVFTCKFSFRT

Query:  GDRTQRSRSLPTESSSSRGWLSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRSNPGSWLILRPGDGTWKPWGRLEAWRERGG
        GDR QRSRSLPTESS SRGWLSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRS PGSWLILRPGDGTWKPWGRLEAWRERGG
Subjt:  GDRTQRSRSLPTESSSSRGWLSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRSNPGSWLILRPGDGTWKPWGRLEAWRERGG

Query:  SDGLGYRFELMPDTSGGMSAAGIVLAESALNSNKGGKFVIDLGGNSNGRLTPGNLTSPACSPRSSGDYGYGLWPYCVYRGFVMAASVEGEGKCSKPTVEV
        SDGLGYRFEL+PDT+GGMSAAGIVLAESALN NKGGKFVIDLGG+SNGR TP N TSPACSPRSSGDYGYGLWPYCVYRGFVM ASVEGEGKCSKP VEV
Subjt:  SDGLGYRFELMPDTSGGMSAAGIVLAESALNSNKGGKFVIDLGGNSNGRLTPGNLTSPACSPRSSGDYGYGLWPYCVYRGFVMAASVEGEGKCSKPTVEV

Query:  SVQHVNCTEDAAAFVALAAAIDLSMDACRLFSHRLRKELCQPLDLLA
        SVQHVNCTEDAAAFVALAAAIDLSMDACRLFS +LRKELCQPLDL+A
Subjt:  SVQHVNCTEDAAAFVALAAAIDLSMDACRLFSHRLRKELCQPLDLLA

XP_022976329.1 uncharacterized protein LOC111476762 [Cucurbita maxima] | 2.37e-310 | 94.18
Query:  MDPCPFVRLTVGNLALKVPVASKPARSLVHPSSSPCFCKIKFRKLPLQTAVVPFIQPENQFPDGQVQSTAAATFHLSKSDLDKLAGKSLFASKPCLKISI
        MDPC FVRLTVGNLALKVPVASKPARS+VHPSSSPCFCKIKF+KL +QTAVVPFIQ +NQFPDGQVQSTAA TFHLSK DL KLAGKSLFASKPCLKISI
Subjt:  MDPCPFVRLTVGNLALKVPVASKPARSLVHPSSSPCFCKIKFRKLPLQTAVVPFIQPENQFPDGQVQSTAAATFHLSKSDLDKLAGKSLFASKPCLKISI

Query:  YSGRRGTTCGVDSGRLLGKVSVPLDLAGTESRATVFHNGWISVGKESKGSCAQFHLNVKAEPDPRFVFQFDGEPECSPQVFQIQGNIRQPVFTCKFSFRT
        YSGRRGTTCGVDSGRLLGKVSVPLDLAGTE+RATVFHNGWISVGKESK SCAQFHLNVKAEPDPRFVFQFDGEPECSPQVFQIQGNIRQPVFTCKFSFRT
Subjt:  YSGRRGTTCGVDSGRLLGKVSVPLDLAGTESRATVFHNGWISVGKESKGSCAQFHLNVKAEPDPRFVFQFDGEPECSPQVFQIQGNIRQPVFTCKFSFRT

Query:  GDRTQRSRSLPTESSSSRGWLSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRSNPGSWLILRPGDGTWKPWGRLEAWRERGG
        GDR QRSRSLPTESS SRGWLSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRSNPGSWLILRPGDGTWKPWGRLEAWRERGG
Subjt:  GDRTQRSRSLPTESSSSRGWLSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRSNPGSWLILRPGDGTWKPWGRLEAWRERGG

Query:  SDGLGYRFELMPDTSGGMSAAGIVLAESALNSNKGGKFVIDLGGNSNGRLTPGNLTSPACSPRSSGDYGYGLWPYCVYRGFVMAASVEGEGKCSKPTVEV
        SDGLGYRFEL+PDT+GGMSAAGIVLAESALN NKGGKFVIDLGG+SNGR TP N TSPACSPRSSGDYGYGLWPYCVYRGFVM ASVEGEGKCSKP VEV
Subjt:  SDGLGYRFELMPDTSGGMSAAGIVLAESALNSNKGGKFVIDLGGNSNGRLTPGNLTSPACSPRSSGDYGYGLWPYCVYRGFVMAASVEGEGKCSKPTVEV

Query:  SVQHVNCTEDAAAFVALAAAIDLSMDACRLFSHRLRKELCQPLDLLA
        SVQHVNCTEDAAAFVALAAAIDLSMDACRLFS +LRKELCQPLDL+A
Subjt:  SVQHVNCTEDAAAFVALAAAIDLSMDACRLFSHRLRKELCQPLDLLA

XP_022980993.1 uncharacterized protein LOC111480276 [Cucurbita maxima] | 1.09e-310 | 94.41
Query:  MDPCPFVRLTVGNLALKVPVASKPARSLVHPSSSPCFCKIKFRKLPLQTAVVPFIQPENQFPDGQVQSTAAATFHLSKSDLDKLAGKSLFASKPCLKISI
        MDPCPFVRLTVGNLALKVPVASKP RS+VHPSSSPCFCKIKF+KLP+QTAVVPF   ENQFPDGQV STAA TFHLSKSDL+KLAGKSLFAS+PCLKISI
Subjt:  MDPCPFVRLTVGNLALKVPVASKPARSLVHPSSSPCFCKIKFRKLPLQTAVVPFIQPENQFPDGQVQSTAAATFHLSKSDLDKLAGKSLFASKPCLKISI

Query:  YSGRRGTTCGVDSGRLLGKVSVPLDLAGTESRATVFHNGWISVGKESKGSCAQFHLNVKAEPDPRFVFQFDGEPECSPQVFQIQGNIRQPVFTCKFSFRT
        YSGRRGTTCGVDSGRLLGKVSVPL LAGTESRATVFHNGWISVGK+SKGSCAQFHLNVKAEPDPRFVFQFDGEPECSPQVFQIQGNIRQPVFTCKFSFRT
Subjt:  YSGRRGTTCGVDSGRLLGKVSVPLDLAGTESRATVFHNGWISVGKESKGSCAQFHLNVKAEPDPRFVFQFDGEPECSPQVFQIQGNIRQPVFTCKFSFRT

Query:  GDRTQRSRSLPTESSSSRGWLSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRSNPGSWLILRPGDGTWKPWGRLEAWRERGG
        GDRTQRSRSLPTESSSSRGWLSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRSNPGSWLILRPGDGTWKPWGRLEAWRERGG
Subjt:  GDRTQRSRSLPTESSSSRGWLSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRSNPGSWLILRPGDGTWKPWGRLEAWRERGG

Query:  SDGLGYRFELMPDTSGGMSAAGIVLAESALNSNKGGKFVIDLGGNSNGRLTPGNLTSPACSPRSSGDYGYGLWPYCVYRGFVMAASVEGEGKCSKPTVEV
        SDGLGYRFELMPDT GGMSAAGIVLAESALN+NKGGKFVIDLGG+S GR TP N TSPACSPRSSGDYGYGLWPYCVYRGFVM AS+EGEGKCSKP VEV
Subjt:  SDGLGYRFELMPDTSGGMSAAGIVLAESALNSNKGGKFVIDLGGNSNGRLTPGNLTSPACSPRSSGDYGYGLWPYCVYRGFVMAASVEGEGKCSKPTVEV

Query:  SVQHVNCTEDAAAFVALAAAIDLSMDACRLFSHRLRKELCQPLDLLA
        SVQHVNCTEDAAAFVALAAAIDLSMDACRLFS RLRKELCQPLD LA
Subjt:  SVQHVNCTEDAAAFVALAAAIDLSMDACRLFSHRLRKELCQPLDLLA

XP_023535757.1 uncharacterized protein LOC111797093 [Cucurbita pepo subsp. pepo] | 1.89e-314 | 94.85
Query:  MDPCPFVRLTVGNLALKVPVASKPARSLVHPSSSPCFCKIKFRKLPLQTAVVPFIQPENQFPDGQVQSTAAATFHLSKSDLDKLAGKSLFASKPCLKISI
        MDPCPFVRLTVGNLALKVPVASKPARS+VHPSSSPCFCKIKF+KLP+QTAVVPFIQP+NQFPDGQVQSTAA TFHLSK DLDKLAGKSLFASKPCLKISI
Subjt:  MDPCPFVRLTVGNLALKVPVASKPARSLVHPSSSPCFCKIKFRKLPLQTAVVPFIQPENQFPDGQVQSTAAATFHLSKSDLDKLAGKSLFASKPCLKISI

Query:  YSGRRGTTCGVDSGRLLGKVSVPLDLAGTESRATVFHNGWISVGKESKGSCAQFHLNVKAEPDPRFVFQFDGEPECSPQVFQIQGNIRQPVFTCKFSFRT
        YSGRRGTTCGVDSGRLLGKVSVPLDLAGTE+RATVFHNGWISVGKESK SCAQFHLNVKAEPDPRFVFQFDGEPECSPQVFQIQGNIRQPVFTCKFSFRT
Subjt:  YSGRRGTTCGVDSGRLLGKVSVPLDLAGTESRATVFHNGWISVGKESKGSCAQFHLNVKAEPDPRFVFQFDGEPECSPQVFQIQGNIRQPVFTCKFSFRT

Query:  GDRTQRSRSLPTESSSSRGWLSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRSNPGSWLILRPGDGTWKPWGRLEAWRERGG
        GDR QRSRSLPTESS SRGWLSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRSNPGSWLILRPGDGTWKPWGRLEAWRERGG
Subjt:  GDRTQRSRSLPTESSSSRGWLSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRSNPGSWLILRPGDGTWKPWGRLEAWRERGG

Query:  SDGLGYRFELMPDTSGGMSAAGIVLAESALNSNKGGKFVIDLGGNSNGRLTPGNLTSPACSPRSSGDYGYGLWPYCVYRGFVMAASVEGEGKCSKPTVEV
        SDGLGYRFEL+PDT+GGMSAAGIVLAESALN NKGGKFVIDLGG+SNGR TP N TSPACSPRSSGDYGYGLWPYCVYRGFVM ASVEGEGKCSKP VEV
Subjt:  SDGLGYRFELMPDTSGGMSAAGIVLAESALNSNKGGKFVIDLGGNSNGRLTPGNLTSPACSPRSSGDYGYGLWPYCVYRGFVMAASVEGEGKCSKPTVEV

Query:  SVQHVNCTEDAAAFVALAAAIDLSMDACRLFSHRLRKELCQPLDLLA
        SVQHVNCTEDAAAFVALAAAIDLSMDACRLFS +LRKELCQP DL+A
Subjt:  SVQHVNCTEDAAAFVALAAAIDLSMDACRLFSHRLRKELCQPLDLLA

TrEMBL top hits | e value | % identity | Alignment
A0A0A0LEU0 Uncharacterized protein | 2.80e-306 | 92.39
Query:  MDPCPFVRLTVGNLALKVPVASKPARSLVHPSSSPCFCKIKFRKLPLQTAVVPFIQPENQFPDGQVQSTAAATFHLSKSDLDKLAGKSLFASKPCLKISI
        MDPCPFVRLTVGNLALKVPVASKPARS+VHPSSSPCFCKIKF+KLP+QT VVPFIQ  NQFPDGQVQSTAA TFHLSK DLDKLAGKSLFASKPCLKISI
Subjt:  MDPCPFVRLTVGNLALKVPVASKPARSLVHPSSSPCFCKIKFRKLPLQTAVVPFIQPENQFPDGQVQSTAAATFHLSKSDLDKLAGKSLFASKPCLKISI

Query:  YSGRRGTTCGVDSGRLLGKVSVPLDLAGTESRATVFHNGWISVGKESKGSCAQFHLNVKAEPDPRFVFQFDGEPECSPQVFQIQGNIRQPVFTCKFSFRT
        YSGRRGTTCG+DSGRLLG+VSVPLDL GTES+ATVFHNGWISVGK+SK SCAQFHLNVKAEPDPRFVFQFDGEPECSPQVFQIQGNIRQPVFTCKFSFRT
Subjt:  YSGRRGTTCGVDSGRLLGKVSVPLDLAGTESRATVFHNGWISVGKESKGSCAQFHLNVKAEPDPRFVFQFDGEPECSPQVFQIQGNIRQPVFTCKFSFRT

Query:  GDRTQRSRSLPTESSSSRGWLSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRSNPGSWLILRPGDGTWKPWGRLEAWRERGG
        GDRTQR RSLPTESS  RGWLSSFGSERERPGKERKGWSIT+HDLSGSPVAAASMVTPFVASPGSDRVSRSNPGSWLILRPGD TWKPWGRLEAWRERGG
Subjt:  GDRTQRSRSLPTESSSSRGWLSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRSNPGSWLILRPGDGTWKPWGRLEAWRERGG

Query:  SDGLGYRFELMPDTSGGMSAAGIVLAESALNSNKGGKFVIDLGGNSNGRLTPGNLTSPACSPRSSGDYGYGLWPYCVYRGFVMAASVEGEGKCSKPTVEV
        SDGLGYRFEL+PDT+GGMSAAGIVLAESALN NKGGKF+IDLGG+SNGR TP N  SPACSPRSSGDYGYGLWPYCVYRGFVM ASVEGEGKCSKP VEV
Subjt:  SDGLGYRFELMPDTSGGMSAAGIVLAESALNSNKGGKFVIDLGGNSNGRLTPGNLTSPACSPRSSGDYGYGLWPYCVYRGFVMAASVEGEGKCSKPTVEV

Query:  SVQHVNCTEDAAAFVALAAAIDLSMDACRLFSHRLRKELCQPLDLLA
         VQHVNCTED AAFVALAAAIDLS+DACRLFSH+LRKELCQPLDLLA
Subjt:  SVQHVNCTEDAAAFVALAAAIDLSMDACRLFSHRLRKELCQPLDLLA

A0A6J1F809 uncharacterized protein LOC111442989 | 9.17e-315 | 94.85
Query:  MDPCPFVRLTVGNLALKVPVASKPARSLVHPSSSPCFCKIKFRKLPLQTAVVPFIQPENQFPDGQVQSTAAATFHLSKSDLDKLAGKSLFASKPCLKISI
        MDPCPFVRLTVGNLALKVPVASKPARS+VHPSSSPCFCKIKF+KLP+QTAVVPFIQP+NQFPDGQVQSTAA TFHLSK DLDKLAGKSLFASKPCLKISI
Subjt:  MDPCPFVRLTVGNLALKVPVASKPARSLVHPSSSPCFCKIKFRKLPLQTAVVPFIQPENQFPDGQVQSTAAATFHLSKSDLDKLAGKSLFASKPCLKISI

Query:  YSGRRGTTCGVDSGRLLGKVSVPLDLAGTESRATVFHNGWISVGKESKGSCAQFHLNVKAEPDPRFVFQFDGEPECSPQVFQIQGNIRQPVFTCKFSFRT
        YSGRRGTTCGVDSGRLLGKVSVPLDLAGTE+RATVFHNGWISVGKESK SCAQFHLNVKAEPDPRFVFQFDGEPECSPQVFQIQGNIRQPVFTCKFSFRT
Subjt:  YSGRRGTTCGVDSGRLLGKVSVPLDLAGTESRATVFHNGWISVGKESKGSCAQFHLNVKAEPDPRFVFQFDGEPECSPQVFQIQGNIRQPVFTCKFSFRT

Query:  GDRTQRSRSLPTESSSSRGWLSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRSNPGSWLILRPGDGTWKPWGRLEAWRERGG
        GDR QRSRSLPTESS SRGWLSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRS PGSWLILRPGDGTWKPWGRLEAWRERGG
Subjt:  GDRTQRSRSLPTESSSSRGWLSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRSNPGSWLILRPGDGTWKPWGRLEAWRERGG

Query:  SDGLGYRFELMPDTSGGMSAAGIVLAESALNSNKGGKFVIDLGGNSNGRLTPGNLTSPACSPRSSGDYGYGLWPYCVYRGFVMAASVEGEGKCSKPTVEV
        SDGLGYRFEL+PDT+GGMSAAGIVLAESALN NKGGKFVIDLGG+SNGR TP N TSPACSPRSSGDYGYGLWPYCVYRGFVM ASVEGEGKCSKP VEV
Subjt:  SDGLGYRFELMPDTSGGMSAAGIVLAESALNSNKGGKFVIDLGGNSNGRLTPGNLTSPACSPRSSGDYGYGLWPYCVYRGFVMAASVEGEGKCSKPTVEV

Query:  SVQHVNCTEDAAAFVALAAAIDLSMDACRLFSHRLRKELCQPLDLLA
        SVQHVNCTEDAAAFVALAAAIDLSMDACRLFS +LRKELCQPLDL+A
Subjt:  SVQHVNCTEDAAAFVALAAAIDLSMDACRLFSHRLRKELCQPLDLLA

A0A6J1FMY7 uncharacterized protein LOC111446645 | 2.51e-309 | 94.18
Query:  MDPCPFVRLTVGNLALKVPVASKPARSLVHPSSSPCFCKIKFRKLPLQTAVVPFIQPENQFPDGQVQSTAAATFHLSKSDLDKLAGKSLFASKPCLKISI
        MDPCPFVRLTVGNLALKVPVASKP RS+VHPSSSPCFCKIKF+KLP+QTAVVPF   ENQFPDGQV STAA TFHLSKSDL KLAGKSLFASKPCLKISI
Subjt:  MDPCPFVRLTVGNLALKVPVASKPARSLVHPSSSPCFCKIKFRKLPLQTAVVPFIQPENQFPDGQVQSTAAATFHLSKSDLDKLAGKSLFASKPCLKISI

Query:  YSGRRGTTCGVDSGRLLGKVSVPLDLAGTESRATVFHNGWISVGKESKGSCAQFHLNVKAEPDPRFVFQFDGEPECSPQVFQIQGNIRQPVFTCKFSFRT
        YSGRRG+TCGVDSGRLLGKVSVPL LAGTESRATVFHNGWISVGK+SK SCAQFHLNVKAEPDPRFVFQFDGEPECSPQVFQIQGNIRQPVFTCKFSFRT
Subjt:  YSGRRGTTCGVDSGRLLGKVSVPLDLAGTESRATVFHNGWISVGKESKGSCAQFHLNVKAEPDPRFVFQFDGEPECSPQVFQIQGNIRQPVFTCKFSFRT

Query:  GDRTQRSRSLPTESSSSRGWLSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRSNPGSWLILRPGDGTWKPWGRLEAWRERGG
        GDRTQRSRSLPTESSSSRGWLSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRSNPGSWLILRPGDGTWKPWGRLEAWRERGG
Subjt:  GDRTQRSRSLPTESSSSRGWLSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRSNPGSWLILRPGDGTWKPWGRLEAWRERGG

Query:  SDGLGYRFELMPDTSGGMSAAGIVLAESALNSNKGGKFVIDLGGNSNGRLTPGNLTSPACSPRSSGDYGYGLWPYCVYRGFVMAASVEGEGKCSKPTVEV
        SDGLGYRFELMPDT+GGMSAAGIVLAESALN+NKGGKFVIDLGG+S GR TP N TSPACSPRSSGDYGYGLWPYCVYRGFVM AS+EGEGKCSKP VEV
Subjt:  SDGLGYRFELMPDTSGGMSAAGIVLAESALNSNKGGKFVIDLGGNSNGRLTPGNLTSPACSPRSSGDYGYGLWPYCVYRGFVMAASVEGEGKCSKPTVEV

Query:  SVQHVNCTEDAAAFVALAAAIDLSMDACRLFSHRLRKELCQPLDLLA
        SVQHVNCTEDAAAFVALAAAIDLSMDACRLFS RLRKELCQPLD LA
Subjt:  SVQHVNCTEDAAAFVALAAAIDLSMDACRLFSHRLRKELCQPLDLLA

A0A6J1IN75 uncharacterized protein LOC111476762 | 1.15e-310 | 94.18
Query:  MDPCPFVRLTVGNLALKVPVASKPARSLVHPSSSPCFCKIKFRKLPLQTAVVPFIQPENQFPDGQVQSTAAATFHLSKSDLDKLAGKSLFASKPCLKISI
        MDPC FVRLTVGNLALKVPVASKPARS+VHPSSSPCFCKIKF+KL +QTAVVPFIQ +NQFPDGQVQSTAA TFHLSK DL KLAGKSLFASKPCLKISI
Subjt:  MDPCPFVRLTVGNLALKVPVASKPARSLVHPSSSPCFCKIKFRKLPLQTAVVPFIQPENQFPDGQVQSTAAATFHLSKSDLDKLAGKSLFASKPCLKISI

Query:  YSGRRGTTCGVDSGRLLGKVSVPLDLAGTESRATVFHNGWISVGKESKGSCAQFHLNVKAEPDPRFVFQFDGEPECSPQVFQIQGNIRQPVFTCKFSFRT
        YSGRRGTTCGVDSGRLLGKVSVPLDLAGTE+RATVFHNGWISVGKESK SCAQFHLNVKAEPDPRFVFQFDGEPECSPQVFQIQGNIRQPVFTCKFSFRT
Subjt:  YSGRRGTTCGVDSGRLLGKVSVPLDLAGTESRATVFHNGWISVGKESKGSCAQFHLNVKAEPDPRFVFQFDGEPECSPQVFQIQGNIRQPVFTCKFSFRT

Query:  GDRTQRSRSLPTESSSSRGWLSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRSNPGSWLILRPGDGTWKPWGRLEAWRERGG
        GDR QRSRSLPTESS SRGWLSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRSNPGSWLILRPGDGTWKPWGRLEAWRERGG
Subjt:  GDRTQRSRSLPTESSSSRGWLSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRSNPGSWLILRPGDGTWKPWGRLEAWRERGG

Query:  SDGLGYRFELMPDTSGGMSAAGIVLAESALNSNKGGKFVIDLGGNSNGRLTPGNLTSPACSPRSSGDYGYGLWPYCVYRGFVMAASVEGEGKCSKPTVEV
        SDGLGYRFEL+PDT+GGMSAAGIVLAESALN NKGGKFVIDLGG+SNGR TP N TSPACSPRSSGDYGYGLWPYCVYRGFVM ASVEGEGKCSKP VEV
Subjt:  SDGLGYRFELMPDTSGGMSAAGIVLAESALNSNKGGKFVIDLGGNSNGRLTPGNLTSPACSPRSSGDYGYGLWPYCVYRGFVMAASVEGEGKCSKPTVEV

Query:  SVQHVNCTEDAAAFVALAAAIDLSMDACRLFSHRLRKELCQPLDLLA
        SVQHVNCTEDAAAFVALAAAIDLSMDACRLFS +LRKELCQPLDL+A
Subjt:  SVQHVNCTEDAAAFVALAAAIDLSMDACRLFSHRLRKELCQPLDLLA

A0A6J1J0W7 uncharacterized protein LOC111480276 | 5.28e-311 | 94.41
Query:  MDPCPFVRLTVGNLALKVPVASKPARSLVHPSSSPCFCKIKFRKLPLQTAVVPFIQPENQFPDGQVQSTAAATFHLSKSDLDKLAGKSLFASKPCLKISI
        MDPCPFVRLTVGNLALKVPVASKP RS+VHPSSSPCFCKIKF+KLP+QTAVVPF   ENQFPDGQV STAA TFHLSKSDL+KLAGKSLFAS+PCLKISI
Subjt:  MDPCPFVRLTVGNLALKVPVASKPARSLVHPSSSPCFCKIKFRKLPLQTAVVPFIQPENQFPDGQVQSTAAATFHLSKSDLDKLAGKSLFASKPCLKISI

Query:  YSGRRGTTCGVDSGRLLGKVSVPLDLAGTESRATVFHNGWISVGKESKGSCAQFHLNVKAEPDPRFVFQFDGEPECSPQVFQIQGNIRQPVFTCKFSFRT
        YSGRRGTTCGVDSGRLLGKVSVPL LAGTESRATVFHNGWISVGK+SKGSCAQFHLNVKAEPDPRFVFQFDGEPECSPQVFQIQGNIRQPVFTCKFSFRT
Subjt:  YSGRRGTTCGVDSGRLLGKVSVPLDLAGTESRATVFHNGWISVGKESKGSCAQFHLNVKAEPDPRFVFQFDGEPECSPQVFQIQGNIRQPVFTCKFSFRT

Query:  GDRTQRSRSLPTESSSSRGWLSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRSNPGSWLILRPGDGTWKPWGRLEAWRERGG
        GDRTQRSRSLPTESSSSRGWLSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRSNPGSWLILRPGDGTWKPWGRLEAWRERGG
Subjt:  GDRTQRSRSLPTESSSSRGWLSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRSNPGSWLILRPGDGTWKPWGRLEAWRERGG

Query:  SDGLGYRFELMPDTSGGMSAAGIVLAESALNSNKGGKFVIDLGGNSNGRLTPGNLTSPACSPRSSGDYGYGLWPYCVYRGFVMAASVEGEGKCSKPTVEV
        SDGLGYRFELMPDT GGMSAAGIVLAESALN+NKGGKFVIDLGG+S GR TP N TSPACSPRSSGDYGYGLWPYCVYRGFVM AS+EGEGKCSKP VEV
Subjt:  SDGLGYRFELMPDTSGGMSAAGIVLAESALNSNKGGKFVIDLGGNSNGRLTPGNLTSPACSPRSSGDYGYGLWPYCVYRGFVMAASVEGEGKCSKPTVEV

Query:  SVQHVNCTEDAAAFVALAAAIDLSMDACRLFSHRLRKELCQPLDLLA
        SVQHVNCTEDAAAFVALAAAIDLSMDACRLFS RLRKELCQPLD LA
Subjt:  SVQHVNCTEDAAAFVALAAAIDLSMDACRLFSHRLRKELCQPLDLLA

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
AT1G10020.1 Protein of unknown function (DUF1005) | 6.8e-190 | 73.59
Query:  MDPCPFVRLTVGNLALKVPVASKPARSLVHPSSSPCFCKIKFRKLPLQTAVVPFIQPE-NQFPDGQVQSTAAATFHLSKSDLDKLAGKSLFASKPCLKIS
        MDPCPF+RLT+GNLALKVP+A+K   S+VHPSSSPCFCKIK +  P QTA +P+I  E  QFP+ Q   T AATFHLS SD+ +LA +S+F SKPCLKI 
Subjt:  MDPCPFVRLTVGNLALKVPVASKPARSLVHPSSSPCFCKIKFRKLPLQTAVVPFIQPE-NQFPDGQVQSTAAATFHLSKSDLDKLAGKSLFASKPCLKIS

Query:  IYSGRRGTTCGVDSGRLLGKVSVPLDLAGTESRATVFHNGWISVGKES--KGSCAQFHLNVKAEPDPRFVFQFDGEPECSPQVFQIQGNIRQPVFTCKFS
        IY+GR G  CGV SGRLL KVSVPLDL+GT+S+  VFHNGWISVGK +    S AQFHLNVKAEPDPRFVFQFDGEPECSPQV QIQGNIRQPVFTCKFS
Subjt:  IYSGRRGTTCGVDSGRLLGKVSVPLDLAGTESRATVFHNGWISVGKES--KGSCAQFHLNVKAEPDPRFVFQFDGEPECSPQVFQIQGNIRQPVFTCKFS

Query:  FR-TGDRTQRSRSLPTESSSSRGWLSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRSNPGSWLILRPGDGTWKPWGRLEAWR
         R TGDRTQRSRSLPTE+S SR WL+SFGSERERPGKERKGWSITVHDLSGSPVA AS+VTPFVASPG+DRVSRSNPGSWLILRPGD TW+PWGRLEAWR
Subjt:  FR-TGDRTQRSRSLPTESSSSRGWLSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRSNPGSWLILRPGDGTWKPWGRLEAWR

Query:  ERGG-SDGLGYRFELMPDTSGGMSAAGIVLAESALNSNKGGKFVIDLGGNSN----------------GRLTPGNLTSPACSPR-SSGDYGYGLWPYCVY
        ERGG +DGLGYRFEL+PD   G S AGIVLAES ++S++GGKF I+LG + +                G    G   SPA SPR  SGDYGYGLWP+ VY
Subjt:  ERGG-SDGLGYRFELMPDTSGGMSAAGIVLAESALNSNKGGKFVIDLGGNSN----------------GRLTPGNLTSPACSPR-SSGDYGYGLWPYCVY

Query:  RGFVMAASVEGEGKCSKPTVEVSVQHVNCTEDAAAFVALAAAIDLSMDACRLFSHRLRKELC
        +GFVM+ASVEGEGKCSKP VEVSVQHV+C EDAAA+VAL+AAIDLSMDACRLF+ R+RKELC
Subjt:  RGFVMAASVEGEGKCSKPTVEVSVQHVNCTEDAAAFVALAAAIDLSMDACRLFSHRLRKELC

AT1G50040.1 Protein of unknown function (DUF1005) | 1.0e-108 | 50
Query:  MDPCPFVRLTVGNLALKVP--------VASKPARSLVHPSSSPCFCKIKFRKLPLQTAVVPFI-----QPENQFPDGQVQSTAAATFHLSKSDLDKLAGK
        MDPC FVR+ VGNLA++ P         +S    S+   SS  C+CKIKF+  P Q   VP +     + E++   G V ST AA F LSKS ++    K
Subjt:  MDPCPFVRLTVGNLALKVP--------VASKPARSLVHPSSSPCFCKIKFRKLPLQTAVVPFI-----QPENQFPDGQVQSTAAATFHLSKSDLDKLAGK

Query:  SLFASKPCLKISIYSGRRGTTCG---VDSGRLLGKVSVPLDLAGTESRATVFHNGWISVGKESK-----GSCAQFHLNVKAEPDPRFVFQFDGEPECSPQ
        + ++    L + +YS RR  +CG       +L+G+  V LDL   ES+  + HNGW+ +G +SK     GS  + H++V+ EPD RFVFQFDGEPECSPQ
Subjt:  SLFASKPCLKISIYSGRRGTTCG---VDSGRLLGKVSVPLDLAGTESRATVFHNGWISVGKESK-----GSCAQFHLNVKAEPDPRFVFQFDGEPECSPQ

Query:  VFQIQGNIRQPVFTCKFSFR-TGDRTQRSRSLPTESSSSRGWLSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRSNPGSWLI
        VFQ+QGN +Q VFTCKF FR +GD   R+ SL          LSS  S +E+  KERKGWSIT+HDLSGSPVA ASMVTPFV SPGS+RVSRS+PG+WLI
Subjt:  VFQIQGNIRQPVFTCKFSFR-TGDRTQRSRSLPTESSSSRGWLSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRSNPGSWLI

Query:  LRPGDGTWKPWGRLEAWRERGGSDGLGYRFELMPDTSGGMSAAGIVLAESALNSNKGGKFVID---------LGGNSNGRLTPGNLTSPACSPRSSG---
        LRP   TWKPW RL+AWRE G SD LGYRFEL  D   G++ A  V A S++++  GG F+ID            +S G     + +S   S   SG   
Subjt:  LRPGDGTWKPWGRLEAWRERGGSDGLGYRFELMPDTSGGMSAAGIVLAESALNSNKGGKFVID---------LGGNSNGRLTPGNLTSPACSPRSSG---

Query:  DYGYGLWPYCVYRGFVMAASVEGEGKCSKPTVEVSVQHVNCTEDAAAFVALAAAIDLSMDACRLFSHRLRKELCQP
        D+ + L       GFVM+  V+G  K SKP VEV V+HV CTEDAAA VALAAA+DLSMDACRLFS +LR EL QP
Subjt:  DYGYGLWPYCVYRGFVMAASVEGEGKCSKPTVEVSVQHVNCTEDAAAFVALAAAIDLSMDACRLFSHRLRKELCQP

AT3G19680.1 Protein of unknown function (DUF1005) | 1.2e-117 | 50.81
Query:  MDPCPFVRLTVGNLALKVPVASK-------PARSLVHPSSSPCFCKIKFRKLPLQTAVVPFI-----QPENQFPDGQVQSTAAATFHLSKSDLDKLAGKS
        MDPC FVR+ VGNLA++ P +S        P+ S ++P++  C+CKI+F+  P +   VP +     + E +       ST AA F LSK+ ++    K 
Subjt:  MDPCPFVRLTVGNLALKVPVASK-------PARSLVHPSSSPCFCKIKFRKLPLQTAVVPFI-----QPENQFPDGQVQSTAAATFHLSKSDLDKLAGKS

Query:  LFASKPCLKISIYS--------GRRGTTCGVDSG--RLLGKVSVPLDLAGTESRATVFHNGWISV----GKESKGSCAQFHLNVKAEPDPRFVFQFDGEP
         F+    L +  YS        G  G +CG+ +   +LLG+  V LDL   E+++ + HNGW+++     K   GS  + H++V+ EPDPRFVFQFDGEP
Subjt:  LFASKPCLKISIYS--------GRRGTTCGVDSG--RLLGKVSVPLDLAGTESRATVFHNGWISV----GKESKGSCAQFHLNVKAEPDPRFVFQFDGEP

Query:  ECSPQVFQIQGNIRQPVFTCKFSFR---TGDRT-QRSRSLPTESSSSRGWLSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSR
        ECSPQVFQ+QGN +Q VFTCKF  R   +GDR    S S+ +E SS+R  +SS  SE+E+P KERKGWSITVHDLSGSPVA ASMVTPFV SPGS+RV+R
Subjt:  ECSPQVFQIQGNIRQPVFTCKFSFR---TGDRT-QRSRSLPTESSSSRGWLSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSR

Query:  SNPGSWLILRPGDGTWKPWGRLEAWRERGGSDGLGYRFELMPDTSGGMSAAGIVLAESALNSNKGGKFVIDL-GGNSNGRLTP----------GNLTS--
        S+PG+WLILRP   TWKPWGRLEAWRE G SD LGYRFEL  D   G++ A  V A S+++   GG FVID+ GG S    TP          G+ +S  
Subjt:  SNPGSWLILRPGDGTWKPWGRLEAWRERGGSDGLGYRFELMPDTSGGMSAAGIVLAESALNSNKGGKFVIDL-GGNSNGRLTP----------GNLTS--

Query:  --PACSP--RSSGDYGYGLWPY----CVYRGFVMAASVEGEGKCSKPTVEVSVQHVNCTEDAAAFVALAAAIDLSMDACRLFSHRLRKELCQPLDL
          PA  P   S  D+GY L  +       RGFVM+A+VEG GK SKP VEV V HV CTEDAAA VALAAA+DLS+DACRLFSH+LRKEL Q   L
Subjt:  --PACSP--RSSGDYGYGLWPY----CVYRGFVMAASVEGEGKCSKPTVEVSVQHVNCTEDAAAFVALAAAIDLSMDACRLFSHRLRKELCQPLDL

AT4G29310.1 Protein of unknown function (DUF1005) | 1.2e-125 | 54.2
Query:  MDPCPFVRLTVGNLALKVP--VASKPARSLVHPSSSPCFCKIKFRKLPLQTAVVPFIQPENQFPDGQ---VQSTAAATFHLSKSDLDKLAGKSLFASKPC
        MDPCPFVRLT+ +LAL++P    +K     VHPSS+PC+CK++ +  P Q A++P     + F D       ST+A  FHL    + +++GK     K  
Subjt:  MDPCPFVRLTVGNLALKVP--VASKPARSLVHPSSSPCFCKIKFRKLPLQTAVVPFIQPENQFPDGQ---VQSTAAATFHLSKSDLDKLAGKSLFASKPC

Query:  LKISIYSGRRGTTCGVDSGRLLGKVSVPLDLAGTESRATVFHNGWISVGKESKGSCAQFHLNVKAEPDPRFVFQFDGEPECSPQVFQIQGNIRQPVFTCK
        L++S+Y+GR G TCGV SG+LLGKV V +DLA   SR   FHNGW  +G +     A+ HL V AEPDPRFVFQF GEPECSP V+QIQ N++QPVF+CK
Subjt:  LKISIYSGRRGTTCGVDSGRLLGKVSVPLDLAGTESRATVFHNGWISVGKESKGSCAQFHLNVKAEPDPRFVFQFDGEPECSPQVFQIQGNIRQPVFTCK

Query:  FSFRTGDRTQRSRSLPTE-SSSSRGWLS---SFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRSNPGSWLILRPGD---GTWKP
        FS    DR  RSRSLP+  + SSRGW++   S     ++  +ERKGW IT+HDLSGSPVAAASM+TPFVASPGSDRVSRSNPG+WLILRP      +WKP
Subjt:  FSFRTGDRTQRSRSLPTE-SSSSRGWLS---SFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRSNPGSWLILRPGD---GTWKP

Query:  WGRLEAWRERGGSDGLGYRFELMPDTSGGMSAAGIVLAESALNSNKGGKFVIDLGGNSNGRLTPGNLTSPACSPRSSGDYGYGLWPYCVYRGFVMAASVE
        WGRLEAWRERG  DGLGY+FEL+ D S   ++ GI +AE  +++ +GGKF ID       R   G   SPA S                 +GFVM +SVE
Subjt:  WGRLEAWRERGGSDGLGYRFELMPDTSGGMSAAGIVLAESALNSNKGGKFVIDLGGNSNGRLTPGNLTSPACSPRSSGDYGYGLWPYCVYRGFVMAASVE

Query:  GEGKCSKPTVEVSVQHVNCTEDAAAFVALAAAIDLSMDACRLFSHRLRKELC
        GEGK SKP V V  QHV C  DAA FVAL+AA+DLS+DAC+LFS +LRKELC
Subjt:  GEGKCSKPTVEVSVQHVNCTEDAAAFVALAAAIDLSMDACRLFSHRLRKELC

AT5G17640.1 Protein of unknown function (DUF1005) | 1.7e-87 | 42.02
Query:  MDPCPFVRLTVGNLALKVP---VASKPARSLVHPSSSPCFCKIKFRKLPLQTAVVPFIQPENQFPDGQVQSTAAATFHLSKSDLDKLAGKSLFASKPC-L
        MDP  F+RL+VG+LAL++P   + S    +     SS C C+IK R  P+QT  +P +   +  PD    ST   +F+L +SDL  L     F S    L
Subjt:  MDPCPFVRLTVGNLALKVP---VASKPARSLVHPSSSPCFCKIKFRKLPLQTAVVPFIQPENQFPDGQVQSTAAATFHLSKSDLDKLAGKSLFASKPC-L

Query:  KISIYSGRRGTTCGVDSGR-LLGKVSVPLDLAGTESRATVFHNGWISVGKESKGSCAQFHLNVKAEPDPRFVFQFDGEPECSPQVFQIQGNIRQPVFTCK
        +IS+++G++   CGV   R  +G   + +     E +  +  NGWIS+GK  +   A+ HL VK +PDPR+VFQF+     SPQ+ Q++G+++QP+F+CK
Subjt:  KISIYSGRRGTTCGVDSGR-LLGKVSVPLDLAGTESRATVFHNGWISVGKESKGSCAQFHLNVKAEPDPRFVFQFDGEPECSPQVFQIQGNIRQPVFTCK

Query:  FSFRTGDRTQRSRSLPTESSSSRGWLSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRSNPGSWLILRPG---DGTWKPWGRL
        FS    DR  +   L    SS     S  G+E E   +ERKGW + +HDLSGS VAAA + TPFV S G D V++SNPG+WL++RP      +W+PWG+L
Subjt:  FSFRTGDRTQRSRSLPTESSSSRGWLSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRSNPGSWLILRPG---DGTWKPWGRL

Query:  EAWRERGGSDGLGYRFELMPDTSGGMSAAGIVLAESALNSNKGGKFVIDLGGNSNGRLTPGNLTSPACSPRSSGDYGYGLWPYCVYRGFVMAASVEGEGK
        EAWRERG  D +  RF L+   S G+    ++++E  +++ KGG+F+ID        LT     +P  SP+SSGD+  GL       GFVM++ V+GEGK
Subjt:  EAWRERGGSDGLGYRFELMPDTSGGMSAAGIVLAESALNSNKGGKFVIDLGGNSNGRLTPGNLTSPACSPRSSGDYGYGLWPYCVYRGFVMAASVEGEGK

Query:  CSKPTVEVSVQHVNCTEDAAAFVALAAAIDLSMDACRLFSHRLRK
         SKP V+++++HV C EDAA F+ALAAA+DLS+ AC+ F    R+
Subjt:  CSKPTVEVSVQHVNCTEDAAAFVALAAAIDLSMDACRLFSHRLRK


Sequences
CDS sequence
ATGGATCCCTGCCCTTTTGTGCGTCTCACCGTTGGGAATCTCGCCCTTAAGGTTCCGGTGGCGTCGAAGCCTGCTCGCTCCCTGGTTCATCCTTCTTCGTCTCCCTGTTT
CTGTAAAATCAAGTTCAGGAAACTGCCGTTGCAAACGGCGGTTGTGCCATTTATTCAGCCGGAGAATCAGTTTCCGGATGGTCAGGTTCAGTCCACGGCTGCTGCTACTT
TTCATCTCAGTAAGTCTGATCTCGACAAGCTCGCTGGAAAGTCTTTGTTCGCTTCGAAACCATGCCTCAAAATCTCCATCTACAGCGGGCGGAGAGGCACGACCTGCGGT
GTCGACTCCGGGAGGCTCTTGGGTAAAGTGTCCGTGCCTTTGGATCTGGCGGGAACTGAATCTAGGGCAACGGTGTTCCACAATGGGTGGATTTCGGTGGGGAAAGAGTC
CAAAGGTTCGTGTGCGCAATTTCATTTGAATGTGAAAGCAGAACCTGACCCGAGATTCGTGTTTCAGTTCGACGGCGAGCCCGAATGTAGTCCACAGGTGTTTCAGATTC
AAGGCAACATCAGACAACCCGTATTCACCTGCAAGTTCAGTTTCAGAACCGGCGACCGAACCCAGAGATCCAGGTCGTTGCCCACAGAATCGAGCAGCTCGAGGGGGTGG
CTCAGCTCGTTCGGAAGCGAAAGGGAACGCCCAGGAAAGGAGCGGAAGGGTTGGTCAATCACCGTCCACGACCTTTCCGGTTCTCCGGTCGCGGCTGCTTCTATGGTTAC
ACCCTTTGTAGCGTCGCCGGGTTCTGACCGTGTCAGCCGGTCCAACCCCGGGTCGTGGCTCATTCTCCGCCCTGGGGACGGCACCTGGAAGCCATGGGGCCGGCTCGAGG
CTTGGCGGGAGCGTGGTGGATCGGACGGACTTGGGTACAGGTTCGAGCTAATGCCGGACACGAGCGGCGGGATGAGCGCGGCGGGGATTGTACTGGCGGAGTCGGCTCTG
AACTCGAACAAGGGCGGAAAGTTCGTGATCGACTTGGGGGGAAACTCGAACGGGAGATTGACGCCGGGGAATTTGACGTCGCCGGCGTGCAGCCCGAGGAGCAGCGGAGA
CTACGGGTACGGGCTTTGGCCGTACTGCGTGTATAGAGGGTTCGTGATGGCAGCTAGCGTAGAGGGCGAAGGAAAATGCAGCAAGCCCACGGTGGAAGTTAGCGTGCAGC
ACGTGAACTGCACAGAAGACGCGGCCGCTTTCGTGGCATTGGCAGCGGCCATCGATCTTAGCATGGACGCTTGCAGGCTTTTCTCTCACAGGCTCAGGAAGGAGCTCTGC
CAGCCCTTGGATCTTCTTGCTTGA
mRNA sequence
TGGAAATGAAGGGAAAATTTCGAACGTGGGAACGGTGGAGCCCACAAGTATAGGGGTAGAATTTTATGATGGAGCCGAGGGAGGGACGTGGTTAGTTAAGACTTGAGTAT
AAGTTTAATAATAATTAATAATATAATAGAAATTAGAAATAGATGGCGGGGTTTGAATAATAAAGAAAAAGGGAGATTAGAGGGACTGGATTGGAATTAGATTGACAGGC
TGTACACAGCTGGAAACCAAATTCTGAAACCTTCTCCAAACGCAGCCCCATTCCCAGCCTGTTCGCAACTAATGCTTCTCTCTACTCTACTCGCCCTCTCTAAAACTCCT
CTTCTCCCCTCTCTACTTTCGTACTTCTCTTCTCCTACGCACAGCAGCAGTAATTGCCTTCCATTTTCAGTCTCATTGATTCAACTGTTTTTCCCTTTCTGCGGGAAACT
CTGAGCGCAAGAATCGCTGATGAATCGAGGTTCTGTTGGTAATGGGGTTTCGGAGGGTATGTGTGCGCCATTGTGCATGCATTTCTAGGTTTTGCTTCCATGATTAAGGC
TGCTTGTTCTGACAGATTCAACTTTCTCTGCGAGTTGTTCTTCTTGTTCCGACTCGCTCTTGTGTGATATTGAGTTTTGGAACCGAGGGGCGTCCATGGATCCCTGCCCT
TTTGTGCGTCTCACCGTTGGGAATCTCGCCCTTAAGGTTCCGGTGGCGTCGAAGCCTGCTCGCTCCCTGGTTCATCCTTCTTCGTCTCCCTGTTTCTGTAAAATCAAGTT
CAGGAAACTGCCGTTGCAAACGGCGGTTGTGCCATTTATTCAGCCGGAGAATCAGTTTCCGGATGGTCAGGTTCAGTCCACGGCTGCTGCTACTTTTCATCTCAGTAAGT
CTGATCTCGACAAGCTCGCTGGAAAGTCTTTGTTCGCTTCGAAACCATGCCTCAAAATCTCCATCTACAGCGGGCGGAGAGGCACGACCTGCGGTGTCGACTCCGGGAGG
CTCTTGGGTAAAGTGTCCGTGCCTTTGGATCTGGCGGGAACTGAATCTAGGGCAACGGTGTTCCACAATGGGTGGATTTCGGTGGGGAAAGAGTCCAAAGGTTCGTGTGC
GCAATTTCATTTGAATGTGAAAGCAGAACCTGACCCGAGATTCGTGTTTCAGTTCGACGGCGAGCCCGAATGTAGTCCACAGGTGTTTCAGATTCAAGGCAACATCAGAC
AACCCGTATTCACCTGCAAGTTCAGTTTCAGAACCGGCGACCGAACCCAGAGATCCAGGTCGTTGCCCACAGAATCGAGCAGCTCGAGGGGGTGGCTCAGCTCGTTCGGA
AGCGAAAGGGAACGCCCAGGAAAGGAGCGGAAGGGTTGGTCAATCACCGTCCACGACCTTTCCGGTTCTCCGGTCGCGGCTGCTTCTATGGTTACACCCTTTGTAGCGTC
GCCGGGTTCTGACCGTGTCAGCCGGTCCAACCCCGGGTCGTGGCTCATTCTCCGCCCTGGGGACGGCACCTGGAAGCCATGGGGCCGGCTCGAGGCTTGGCGGGAGCGTG
GTGGATCGGACGGACTTGGGTACAGGTTCGAGCTAATGCCGGACACGAGCGGCGGGATGAGCGCGGCGGGGATTGTACTGGCGGAGTCGGCTCTGAACTCGAACAAGGGC
GGAAAGTTCGTGATCGACTTGGGGGGAAACTCGAACGGGAGATTGACGCCGGGGAATTTGACGTCGCCGGCGTGCAGCCCGAGGAGCAGCGGAGACTACGGGTACGGGCT
TTGGCCGTACTGCGTGTATAGAGGGTTCGTGATGGCAGCTAGCGTAGAGGGCGAAGGAAAATGCAGCAAGCCCACGGTGGAAGTTAGCGTGCAGCACGTGAACTGCACAG
AAGACGCGGCCGCTTTCGTGGCATTGGCAGCGGCCATCGATCTTAGCATGGACGCTTGCAGGCTTTTCTCTCACAGGCTCAGGAAGGAGCTCTGCCAGCCCTTGGATCTT
CTTGCTTGAGATTCTGAACAAAATGGAAGTCTCGTGATCATCCTCTTCTTTTTTTTTTTTCTTTTTGGGTTTATTATGGTTTCTTTCTTTAATGCGTTTGGGTTTTTTGA
GTGGCGCAATTTGTCGATGGGTTTTTTATTCCGTCGCCTGTGAGGAATTTGGTTCTGCTAACAAACATTTTACAGAGTATTGTACAGTCTGTGAGAGAGAGAAAGAGAGA
GTAGAGAGTCAGAGACAGACAGGGCACTGTTTTTTTCTCTCTCTTTTTCCCTTTTTCTATAACTGGTGTGGCATCTGTCCGATTTCCATCCTTTTTTTGAATGGTTTGGT
AAAAGAAAAAGGTTGGTTATGCAGATAAATTTTCAGTGTCTCTGACTTGACCATCCCATTTTTCATGTAACTTTGTTGATAAATTCACCTCTGACAATTAACCTACTTGG
TTAAGTTTATGTCAGTGCTGAATGGAACAATTGTTAATTTGTTATATATATATATATATATATATAGCAAGCCTCTCTCTCTCTCTCTCCTTTTTATTATGTTATCAAGT
ATAGGTTTTATCTTTAATTATAGCCTATTTTGTTGAAAAACAGGCATGAAATTATCTTTAATAATTTCAGGAGGAGTTGGCAAGCTGTATTTTGCTGTGAGTTCAATCTA
GTGGGCGGAGGAGAGGAGACCTTTCCAAACTCCAATGGTTTTAATATTCTTGATTACTTTTCTTTGATCCTGTCTTATGACTTGGGTTTTAGGTGATGCCTCCAACGTAG
AAAGATTGAACTGTAGAATTAGATAAATGTTGTGTTGATTTAATATGAGAGCCGAGGGAATAGAAAATTTGAAAAATTGTCTAGTTTGGTGAGAGCAAATGGGAACATGA
ATGAGGATGATGATGGAAGAATGTAGAGCAAGGCATAGATGGTCCAAAATAGGAGCAACCAATGGTCCAAATGGCGCCATAATAGGACACACTCAGTTGCCCATCTCAAG
ATCTGTCACGCGCCCTCTTTCAAAACTCTCAGCCAGCTTAACGAATGGTCAGAAACCAGTCAGAGTTCACGTTCATCTTGTTTTAAAAAATTAAGCAAAAATGACAACTC
CGGGGGGCCTTGGTAGTTAAGAGAAATTAGAGGAGAATTAAGGCATAATCACAAAATTAGAGATAATGG
Protein sequence
MDPCPFVRLTVGNLALKVPVASKPARSLVHPSSSPCFCKIKFRKLPLQTAVVPFIQPENQFPDGQVQSTAAATFHLSKSDLDKLAGKSLFASKPCLKISIYSGRRGTTCG
VDSGRLLGKVSVPLDLAGTESRATVFHNGWISVGKESKGSCAQFHLNVKAEPDPRFVFQFDGEPECSPQVFQIQGNIRQPVFTCKFSFRTGDRTQRSRSLPTESSSSRGW
LSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRSNPGSWLILRPGDGTWKPWGRLEAWRERGGSDGLGYRFELMPDTSGGMSAAGIVLAESAL
NSNKGGKFVIDLGGNSNGRLTPGNLTSPACSPRSSGDYGYGLWPYCVYRGFVMAASVEGEGKCSKPTVEVSVQHVNCTEDAAAFVALAAAIDLSMDACRLFSHRLRKELC
QPLDLLA
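
The CDS and protein sequences above can be cross-checked against each other by translating the CDS. A minimal sketch, assuming the standard nuclear genetic code applies to this gene; `translate` is a hypothetical helper, and the two fragments below are the first 30 and last 24 bases of the CDS listed on this page:

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0; '*' marks a stop codon, 'X' an unknown codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks from the wrapped listing
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq), 3)
    )


print(translate("ATGGATCCCTGCCCTTTTGTGCGTCTCACC"))  # start of the CDS
print(translate("CAGCCCTTGGATCTTCTTGCTTGA"))        # end of the CDS, including the stop codon
```

Pasting the full CDS into `translate` and dropping the trailing `*` should reproduce the protein sequence if the two listings are consistent.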