; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS008314 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS008314
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionProtein of unknown function (DUF1005)
Genome locationscaffold4:373734..375609
RNA-Seq ExpressionMS008314
SyntenyMS008314
Gene Ontology termsNA
InterPro domainsIPR010410 - Protein of unknown function DUF1005


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7024602.1 hypothetical protein SDJN02_13420, partial [Cucurbita argyrosperma subsp. argyrosperma]2.9e-24395.01Show/hide
Query:  MDPCPFVRLTVGNLALKVPVASKPARSLVHPSSSPCFCKIKFRKLPLQTAVVPFIQPENQFPDGQVQSTAAATFHLSKSDLDKLAGKSLFASKPCLKISI
        MDPCPFVRLTVGNLALKVPVASKPARS+VHPSSSPCFCKIKF+KLP+QTAVVPFIQP NQFPDGQV ST AATFHLSK DLDKLAGKSLFASKPCLKISI
Subjt:  MDPCPFVRLTVGNLALKVPVASKPARSLVHPSSSPCFCKIKFRKLPLQTAVVPFIQPENQFPDGQVQSTAAATFHLSKSDLDKLAGKSLFASKPCLKISI

Query:  YSGRRGTTCGVDSGRLLGKVSVPLDLAGTESRATVFHNGWISVGKESKGSCAQFHLNVKAEPDPRFVFQFDGEPECSPQVFQIQGNIRQPVFTCKFSFRT
        YSGRRGTTCGVDSGRLLGKVSVPLDLAGTE+RATVFHNGWISVGKESK SCAQFHLNVKAEPDPRFVFQFDGEPECSPQVFQIQGNIRQPVFTCKFSFRT
Subjt:  YSGRRGTTCGVDSGRLLGKVSVPLDLAGTESRATVFHNGWISVGKESKGSCAQFHLNVKAEPDPRFVFQFDGEPECSPQVFQIQGNIRQPVFTCKFSFRT

Query:  GDRTQRSRSLPTESSSSRGWLSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRSNPGSWLILRPGDGTWKPWGRLEAWRERGG
        GDR QRSRSLPTESS SRGWLSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRSNPGSWLILRPGDGTWKPWGRLEAWRERGG
Subjt:  GDRTQRSRSLPTESSSSRGWLSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRSNPGSWLILRPGDGTWKPWGRLEAWRERGG

Query:  SDGLGYRFELMPDTSGGMSAAGIVLAESALNSNKGGKFVIDLGGNSNGRLTPGNSTSPACSPRSSGDYGYGLWPYCVYRGFVMAASVEGEGKCSKPTVEV
        SDGLGYRFEL+PDT+GGMSAAGIVLAESALN NKGGKFVIDLGG+SNGR TP N TSPACSPRSSGDYGYGLWPYCVYRGFVM ASVEGEGKCSKP VEV
Subjt:  SDGLGYRFELMPDTSGGMSAAGIVLAESALNSNKGGKFVIDLGGNSNGRLTPGNSTSPACSPRSSGDYGYGLWPYCVYRGFVMAASVEGEGKCSKPTVEV

Query:  SVQHVNCTEDAAAFVALAAAIDLSMDACRLFSHRLRKELCQ
        SVQHVNCTEDAAAFVALAAAIDLSMDACRLFS +LRKELCQ
Subjt:  SVQHVNCTEDAAAFVALAAAIDLSMDACRLFSHRLRKELCQ

XP_022936339.1 uncharacterized protein LOC111442989 [Cucurbita moschata]1.7e-24395.01Show/hide
Query:  MDPCPFVRLTVGNLALKVPVASKPARSLVHPSSSPCFCKIKFRKLPLQTAVVPFIQPENQFPDGQVQSTAAATFHLSKSDLDKLAGKSLFASKPCLKISI
        MDPCPFVRLTVGNLALKVPVASKPARS+VHPSSSPCFCKIKF+KLP+QTAVVPFIQP+NQFPDGQVQST AATFHLSK DLDKLAGKSLFASKPCLKISI
Subjt:  MDPCPFVRLTVGNLALKVPVASKPARSLVHPSSSPCFCKIKFRKLPLQTAVVPFIQPENQFPDGQVQSTAAATFHLSKSDLDKLAGKSLFASKPCLKISI

Query:  YSGRRGTTCGVDSGRLLGKVSVPLDLAGTESRATVFHNGWISVGKESKGSCAQFHLNVKAEPDPRFVFQFDGEPECSPQVFQIQGNIRQPVFTCKFSFRT
        YSGRRGTTCGVDSGRLLGKVSVPLDLAGTE+RATVFHNGWISVGKESK SCAQFHLNVKAEPDPRFVFQFDGEPECSPQVFQIQGNIRQPVFTCKFSFRT
Subjt:  YSGRRGTTCGVDSGRLLGKVSVPLDLAGTESRATVFHNGWISVGKESKGSCAQFHLNVKAEPDPRFVFQFDGEPECSPQVFQIQGNIRQPVFTCKFSFRT

Query:  GDRTQRSRSLPTESSSSRGWLSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRSNPGSWLILRPGDGTWKPWGRLEAWRERGG
        GDR QRSRSLPTESS SRGWLSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRS PGSWLILRPGDGTWKPWGRLEAWRERGG
Subjt:  GDRTQRSRSLPTESSSSRGWLSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRSNPGSWLILRPGDGTWKPWGRLEAWRERGG

Query:  SDGLGYRFELMPDTSGGMSAAGIVLAESALNSNKGGKFVIDLGGNSNGRLTPGNSTSPACSPRSSGDYGYGLWPYCVYRGFVMAASVEGEGKCSKPTVEV
        SDGLGYRFEL+PDT+GGMSAAGIVLAESALN NKGGKFVIDLGG+SNGR TP N TSPACSPRSSGDYGYGLWPYCVYRGFVM ASVEGEGKCSKP VEV
Subjt:  SDGLGYRFELMPDTSGGMSAAGIVLAESALNSNKGGKFVIDLGGNSNGRLTPGNSTSPACSPRSSGDYGYGLWPYCVYRGFVMAASVEGEGKCSKPTVEV

Query:  SVQHVNCTEDAAAFVALAAAIDLSMDACRLFSHRLRKELCQ
        SVQHVNCTEDAAAFVALAAAIDLSMDACRLFS +LRKELCQ
Subjt:  SVQHVNCTEDAAAFVALAAAIDLSMDACRLFSHRLRKELCQ

XP_022980993.1 uncharacterized protein LOC111480276 [Cucurbita maxima]3.5e-24194.78Show/hide
Query:  MDPCPFVRLTVGNLALKVPVASKPARSLVHPSSSPCFCKIKFRKLPLQTAVVPFIQPENQFPDGQVQSTAAATFHLSKSDLDKLAGKSLFASKPCLKISI
        MDPCPFVRLTVGNLALKVPVASKP RS+VHPSSSPCFCKIKF+KLP+QTAVVPF   ENQFPDGQV ST AATFHLSKSDL+KLAGKSLFAS+PCLKISI
Subjt:  MDPCPFVRLTVGNLALKVPVASKPARSLVHPSSSPCFCKIKFRKLPLQTAVVPFIQPENQFPDGQVQSTAAATFHLSKSDLDKLAGKSLFASKPCLKISI

Query:  YSGRRGTTCGVDSGRLLGKVSVPLDLAGTESRATVFHNGWISVGKESKGSCAQFHLNVKAEPDPRFVFQFDGEPECSPQVFQIQGNIRQPVFTCKFSFRT
        YSGRRGTTCGVDSGRLLGKVSVPL LAGTESRATVFHNGWISVGK+SKGSCAQFHLNVKAEPDPRFVFQFDGEPECSPQVFQIQGNIRQPVFTCKFSFRT
Subjt:  YSGRRGTTCGVDSGRLLGKVSVPLDLAGTESRATVFHNGWISVGKESKGSCAQFHLNVKAEPDPRFVFQFDGEPECSPQVFQIQGNIRQPVFTCKFSFRT

Query:  GDRTQRSRSLPTESSSSRGWLSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRSNPGSWLILRPGDGTWKPWGRLEAWRERGG
        GDRTQRSRSLPTESSSSRGWLSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRSNPGSWLILRPGDGTWKPWGRLEAWRERGG
Subjt:  GDRTQRSRSLPTESSSSRGWLSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRSNPGSWLILRPGDGTWKPWGRLEAWRERGG

Query:  SDGLGYRFELMPDTSGGMSAAGIVLAESALNSNKGGKFVIDLGGNSNGRLTPGNSTSPACSPRSSGDYGYGLWPYCVYRGFVMAASVEGEGKCSKPTVEV
        SDGLGYRFELMPDT GGMSAAGIVLAESALN+NKGGKFVIDLGG+S GR TP NSTSPACSPRSSGDYGYGLWPYCVYRGFVM AS+EGEGKCSKP VEV
Subjt:  SDGLGYRFELMPDTSGGMSAAGIVLAESALNSNKGGKFVIDLGGNSNGRLTPGNSTSPACSPRSSGDYGYGLWPYCVYRGFVMAASVEGEGKCSKPTVEV

Query:  SVQHVNCTEDAAAFVALAAAIDLSMDACRLFSHRLRKELCQ
        SVQHVNCTEDAAAFVALAAAIDLSMDACRLFS RLRKELCQ
Subjt:  SVQHVNCTEDAAAFVALAAAIDLSMDACRLFSHRLRKELCQ

XP_023525893.1 uncharacterized protein LOC111789374 [Cucurbita pepo subsp. pepo]1.8e-24094.33Show/hide
Query:  MDPCPFVRLTVGNLALKVPVASKPARSLVHPSSSPCFCKIKFRKLPLQTAVVPFIQPENQFPDGQVQSTAAATFHLSKSDLDKLAGKSLFASKPCLKISI
        MDPCPFVRLTVGNLALKVPVASKP RS+VHPSSSPCFCKIKF+KLP+QTAVVPF   ENQFPDGQV ST AATFHLSKSDL+KLAGKSLFAS+PCLKISI
Subjt:  MDPCPFVRLTVGNLALKVPVASKPARSLVHPSSSPCFCKIKFRKLPLQTAVVPFIQPENQFPDGQVQSTAAATFHLSKSDLDKLAGKSLFASKPCLKISI

Query:  YSGRRGTTCGVDSGRLLGKVSVPLDLAGTESRATVFHNGWISVGKESKGSCAQFHLNVKAEPDPRFVFQFDGEPECSPQVFQIQGNIRQPVFTCKFSFRT
        YSGRRGTTCGVDSGRLLGKVSVPL LAGTES+ TVFHNGWISVGK+SKGSCAQFHLNVKAEPDPRFVFQFDGEPECSPQVFQIQGNIRQPVFTCKFSFRT
Subjt:  YSGRRGTTCGVDSGRLLGKVSVPLDLAGTESRATVFHNGWISVGKESKGSCAQFHLNVKAEPDPRFVFQFDGEPECSPQVFQIQGNIRQPVFTCKFSFRT

Query:  GDRTQRSRSLPTESSSSRGWLSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRSNPGSWLILRPGDGTWKPWGRLEAWRERGG
        GDRTQRSRSLPTESSSSRGWLSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRSNPGSWLILRPGDGTWKPWGRLEAWRERGG
Subjt:  GDRTQRSRSLPTESSSSRGWLSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRSNPGSWLILRPGDGTWKPWGRLEAWRERGG

Query:  SDGLGYRFELMPDTSGGMSAAGIVLAESALNSNKGGKFVIDLGGNSNGRLTPGNSTSPACSPRSSGDYGYGLWPYCVYRGFVMAASVEGEGKCSKPTVEV
        SDGLGYRFELMPDT+GGMSAAGIVLAESALN+NKGGKFVIDLGG+S GR TP NSTSPACSPRSSGDYGYGLWPYCVYRGFVM AS+EGEGKCSKP VEV
Subjt:  SDGLGYRFELMPDTSGGMSAAGIVLAESALNSNKGGKFVIDLGGNSNGRLTPGNSTSPACSPRSSGDYGYGLWPYCVYRGFVMAASVEGEGKCSKPTVEV

Query:  SVQHVNCTEDAAAFVALAAAIDLSMDACRLFSHRLRKELCQ
        SVQHVNCTEDAAAFVALAAAIDLSMDACRLFS RLRKELCQ
Subjt:  SVQHVNCTEDAAAFVALAAAIDLSMDACRLFSHRLRKELCQ

XP_023535757.1 uncharacterized protein LOC111797093 [Cucurbita pepo subsp. pepo]3.4e-24495.24Show/hide
Query:  MDPCPFVRLTVGNLALKVPVASKPARSLVHPSSSPCFCKIKFRKLPLQTAVVPFIQPENQFPDGQVQSTAAATFHLSKSDLDKLAGKSLFASKPCLKISI
        MDPCPFVRLTVGNLALKVPVASKPARS+VHPSSSPCFCKIKF+KLP+QTAVVPFIQP+NQFPDGQVQST AATFHLSK DLDKLAGKSLFASKPCLKISI
Subjt:  MDPCPFVRLTVGNLALKVPVASKPARSLVHPSSSPCFCKIKFRKLPLQTAVVPFIQPENQFPDGQVQSTAAATFHLSKSDLDKLAGKSLFASKPCLKISI

Query:  YSGRRGTTCGVDSGRLLGKVSVPLDLAGTESRATVFHNGWISVGKESKGSCAQFHLNVKAEPDPRFVFQFDGEPECSPQVFQIQGNIRQPVFTCKFSFRT
        YSGRRGTTCGVDSGRLLGKVSVPLDLAGTE+RATVFHNGWISVGKESK SCAQFHLNVKAEPDPRFVFQFDGEPECSPQVFQIQGNIRQPVFTCKFSFRT
Subjt:  YSGRRGTTCGVDSGRLLGKVSVPLDLAGTESRATVFHNGWISVGKESKGSCAQFHLNVKAEPDPRFVFQFDGEPECSPQVFQIQGNIRQPVFTCKFSFRT

Query:  GDRTQRSRSLPTESSSSRGWLSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRSNPGSWLILRPGDGTWKPWGRLEAWRERGG
        GDR QRSRSLPTESS SRGWLSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRSNPGSWLILRPGDGTWKPWGRLEAWRERGG
Subjt:  GDRTQRSRSLPTESSSSRGWLSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRSNPGSWLILRPGDGTWKPWGRLEAWRERGG

Query:  SDGLGYRFELMPDTSGGMSAAGIVLAESALNSNKGGKFVIDLGGNSNGRLTPGNSTSPACSPRSSGDYGYGLWPYCVYRGFVMAASVEGEGKCSKPTVEV
        SDGLGYRFEL+PDT+GGMSAAGIVLAESALN NKGGKFVIDLGG+SNGR TP N TSPACSPRSSGDYGYGLWPYCVYRGFVM ASVEGEGKCSKP VEV
Subjt:  SDGLGYRFELMPDTSGGMSAAGIVLAESALNSNKGGKFVIDLGGNSNGRLTPGNSTSPACSPRSSGDYGYGLWPYCVYRGFVMAASVEGEGKCSKPTVEV

Query:  SVQHVNCTEDAAAFVALAAAIDLSMDACRLFSHRLRKELCQ
        SVQHVNCTEDAAAFVALAAAIDLSMDACRLFS +LRKELCQ
Subjt:  SVQHVNCTEDAAAFVALAAAIDLSMDACRLFSHRLRKELCQ

TrEMBL top hitse value%identityAlignment
A0A0A0LEU0 Uncharacterized protein5.7e-23792.29Show/hide
Query:  MDPCPFVRLTVGNLALKVPVASKPARSLVHPSSSPCFCKIKFRKLPLQTAVVPFIQPENQFPDGQVQSTAAATFHLSKSDLDKLAGKSLFASKPCLKISI
        MDPCPFVRLTVGNLALKVPVASKPARS+VHPSSSPCFCKIKF+KLP+QT VVPFIQ  NQFPDGQVQST AATFHLSK DLDKLAGKSLFASKPCLKISI
Subjt:  MDPCPFVRLTVGNLALKVPVASKPARSLVHPSSSPCFCKIKFRKLPLQTAVVPFIQPENQFPDGQVQSTAAATFHLSKSDLDKLAGKSLFASKPCLKISI

Query:  YSGRRGTTCGVDSGRLLGKVSVPLDLAGTESRATVFHNGWISVGKESKGSCAQFHLNVKAEPDPRFVFQFDGEPECSPQVFQIQGNIRQPVFTCKFSFRT
        YSGRRGTTCG+DSGRLLG+VSVPLDL GTES+ATVFHNGWISVGK+SK SCAQFHLNVKAEPDPRFVFQFDGEPECSPQVFQIQGNIRQPVFTCKFSFRT
Subjt:  YSGRRGTTCGVDSGRLLGKVSVPLDLAGTESRATVFHNGWISVGKESKGSCAQFHLNVKAEPDPRFVFQFDGEPECSPQVFQIQGNIRQPVFTCKFSFRT

Query:  GDRTQRSRSLPTESSSSRGWLSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRSNPGSWLILRPGDGTWKPWGRLEAWRERGG
        GDRTQR RSLPTE  SSRGWLSSFGSERERPGKERKGWSIT+HDLSGSPVAAASMVTPFVASPGSDRVSRSNPGSWLILRPGD TWKPWGRLEAWRERGG
Subjt:  GDRTQRSRSLPTESSSSRGWLSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRSNPGSWLILRPGDGTWKPWGRLEAWRERGG

Query:  SDGLGYRFELMPDTSGGMSAAGIVLAESALNSNKGGKFVIDLGGNSNGRLTPGNSTSPACSPRSSGDYGYGLWPYCVYRGFVMAASVEGEGKCSKPTVEV
        SDGLGYRFEL+PDT+GGMSAAGIVLAESALN NKGGKF+IDLGG+SNGR TP N  SPACSPRSSGDYGYGLWPYCVYRGFVM ASVEGEGKCSKP VEV
Subjt:  SDGLGYRFELMPDTSGGMSAAGIVLAESALNSNKGGKFVIDLGGNSNGRLTPGNSTSPACSPRSSGDYGYGLWPYCVYRGFVMAASVEGEGKCSKPTVEV

Query:  SVQHVNCTEDAAAFVALAAAIDLSMDACRLFSHRLRKELCQ
         VQHVNCTED AAFVALAAAIDLS+DACRLFSH+LRKELCQ
Subjt:  SVQHVNCTEDAAAFVALAAAIDLSMDACRLFSHRLRKELCQ

A0A6J1F809 uncharacterized protein LOC1114429898.2e-24495.01Show/hide
Query:  MDPCPFVRLTVGNLALKVPVASKPARSLVHPSSSPCFCKIKFRKLPLQTAVVPFIQPENQFPDGQVQSTAAATFHLSKSDLDKLAGKSLFASKPCLKISI
        MDPCPFVRLTVGNLALKVPVASKPARS+VHPSSSPCFCKIKF+KLP+QTAVVPFIQP+NQFPDGQVQST AATFHLSK DLDKLAGKSLFASKPCLKISI
Subjt:  MDPCPFVRLTVGNLALKVPVASKPARSLVHPSSSPCFCKIKFRKLPLQTAVVPFIQPENQFPDGQVQSTAAATFHLSKSDLDKLAGKSLFASKPCLKISI

Query:  YSGRRGTTCGVDSGRLLGKVSVPLDLAGTESRATVFHNGWISVGKESKGSCAQFHLNVKAEPDPRFVFQFDGEPECSPQVFQIQGNIRQPVFTCKFSFRT
        YSGRRGTTCGVDSGRLLGKVSVPLDLAGTE+RATVFHNGWISVGKESK SCAQFHLNVKAEPDPRFVFQFDGEPECSPQVFQIQGNIRQPVFTCKFSFRT
Subjt:  YSGRRGTTCGVDSGRLLGKVSVPLDLAGTESRATVFHNGWISVGKESKGSCAQFHLNVKAEPDPRFVFQFDGEPECSPQVFQIQGNIRQPVFTCKFSFRT

Query:  GDRTQRSRSLPTESSSSRGWLSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRSNPGSWLILRPGDGTWKPWGRLEAWRERGG
        GDR QRSRSLPTESS SRGWLSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRS PGSWLILRPGDGTWKPWGRLEAWRERGG
Subjt:  GDRTQRSRSLPTESSSSRGWLSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRSNPGSWLILRPGDGTWKPWGRLEAWRERGG

Query:  SDGLGYRFELMPDTSGGMSAAGIVLAESALNSNKGGKFVIDLGGNSNGRLTPGNSTSPACSPRSSGDYGYGLWPYCVYRGFVMAASVEGEGKCSKPTVEV
        SDGLGYRFEL+PDT+GGMSAAGIVLAESALN NKGGKFVIDLGG+SNGR TP N TSPACSPRSSGDYGYGLWPYCVYRGFVM ASVEGEGKCSKP VEV
Subjt:  SDGLGYRFELMPDTSGGMSAAGIVLAESALNSNKGGKFVIDLGGNSNGRLTPGNSTSPACSPRSSGDYGYGLWPYCVYRGFVMAASVEGEGKCSKPTVEV

Query:  SVQHVNCTEDAAAFVALAAAIDLSMDACRLFSHRLRKELCQ
        SVQHVNCTEDAAAFVALAAAIDLSMDACRLFS +LRKELCQ
Subjt:  SVQHVNCTEDAAAFVALAAAIDLSMDACRLFSHRLRKELCQ

A0A6J1FMY7 uncharacterized protein LOC1114466453.2e-24094.56Show/hide
Query:  MDPCPFVRLTVGNLALKVPVASKPARSLVHPSSSPCFCKIKFRKLPLQTAVVPFIQPENQFPDGQVQSTAAATFHLSKSDLDKLAGKSLFASKPCLKISI
        MDPCPFVRLTVGNLALKVPVASKP RS+VHPSSSPCFCKIKF+KLP+QTAVVPF   ENQFPDGQV ST AATFHLSKSDL KLAGKSLFASKPCLKISI
Subjt:  MDPCPFVRLTVGNLALKVPVASKPARSLVHPSSSPCFCKIKFRKLPLQTAVVPFIQPENQFPDGQVQSTAAATFHLSKSDLDKLAGKSLFASKPCLKISI

Query:  YSGRRGTTCGVDSGRLLGKVSVPLDLAGTESRATVFHNGWISVGKESKGSCAQFHLNVKAEPDPRFVFQFDGEPECSPQVFQIQGNIRQPVFTCKFSFRT
        YSGRRG+TCGVDSGRLLGKVSVPL LAGTESRATVFHNGWISVGK+SK SCAQFHLNVKAEPDPRFVFQFDGEPECSPQVFQIQGNIRQPVFTCKFSFRT
Subjt:  YSGRRGTTCGVDSGRLLGKVSVPLDLAGTESRATVFHNGWISVGKESKGSCAQFHLNVKAEPDPRFVFQFDGEPECSPQVFQIQGNIRQPVFTCKFSFRT

Query:  GDRTQRSRSLPTESSSSRGWLSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRSNPGSWLILRPGDGTWKPWGRLEAWRERGG
        GDRTQRSRSLPTESSSSRGWLSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRSNPGSWLILRPGDGTWKPWGRLEAWRERGG
Subjt:  GDRTQRSRSLPTESSSSRGWLSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRSNPGSWLILRPGDGTWKPWGRLEAWRERGG

Query:  SDGLGYRFELMPDTSGGMSAAGIVLAESALNSNKGGKFVIDLGGNSNGRLTPGNSTSPACSPRSSGDYGYGLWPYCVYRGFVMAASVEGEGKCSKPTVEV
        SDGLGYRFELMPDT+GGMSAAGIVLAESALN+NKGGKFVIDLGG+S GR TP NSTSPACSPRSSGDYGYGLWPYCVYRGFVM AS+EGEGKCSKP VEV
Subjt:  SDGLGYRFELMPDTSGGMSAAGIVLAESALNSNKGGKFVIDLGGNSNGRLTPGNSTSPACSPRSSGDYGYGLWPYCVYRGFVMAASVEGEGKCSKPTVEV

Query:  SVQHVNCTEDAAAFVALAAAIDLSMDACRLFSHRLRKELCQ
        SVQHVNCTEDAAAFVALAAAIDLSMDACRLFS RLRKELCQ
Subjt:  SVQHVNCTEDAAAFVALAAAIDLSMDACRLFSHRLRKELCQ

A0A6J1IN75 uncharacterized protein LOC1114767621.1e-24094.33Show/hide
Query:  MDPCPFVRLTVGNLALKVPVASKPARSLVHPSSSPCFCKIKFRKLPLQTAVVPFIQPENQFPDGQVQSTAAATFHLSKSDLDKLAGKSLFASKPCLKISI
        MDPC FVRLTVGNLALKVPVASKPARS+VHPSSSPCFCKIKF+KL +QTAVVPFIQ +NQFPDGQVQST AATFHLSK DL KLAGKSLFASKPCLKISI
Subjt:  MDPCPFVRLTVGNLALKVPVASKPARSLVHPSSSPCFCKIKFRKLPLQTAVVPFIQPENQFPDGQVQSTAAATFHLSKSDLDKLAGKSLFASKPCLKISI

Query:  YSGRRGTTCGVDSGRLLGKVSVPLDLAGTESRATVFHNGWISVGKESKGSCAQFHLNVKAEPDPRFVFQFDGEPECSPQVFQIQGNIRQPVFTCKFSFRT
        YSGRRGTTCGVDSGRLLGKVSVPLDLAGTE+RATVFHNGWISVGKESK SCAQFHLNVKAEPDPRFVFQFDGEPECSPQVFQIQGNIRQPVFTCKFSFRT
Subjt:  YSGRRGTTCGVDSGRLLGKVSVPLDLAGTESRATVFHNGWISVGKESKGSCAQFHLNVKAEPDPRFVFQFDGEPECSPQVFQIQGNIRQPVFTCKFSFRT

Query:  GDRTQRSRSLPTESSSSRGWLSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRSNPGSWLILRPGDGTWKPWGRLEAWRERGG
        GDR QRSRSLPTESS SRGWLSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRSNPGSWLILRPGDGTWKPWGRLEAWRERGG
Subjt:  GDRTQRSRSLPTESSSSRGWLSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRSNPGSWLILRPGDGTWKPWGRLEAWRERGG

Query:  SDGLGYRFELMPDTSGGMSAAGIVLAESALNSNKGGKFVIDLGGNSNGRLTPGNSTSPACSPRSSGDYGYGLWPYCVYRGFVMAASVEGEGKCSKPTVEV
        SDGLGYRFEL+PDT+GGMSAAGIVLAESALN NKGGKFVIDLGG+SNGR TP N TSPACSPRSSGDYGYGLWPYCVYRGFVM ASVEGEGKCSKP VEV
Subjt:  SDGLGYRFELMPDTSGGMSAAGIVLAESALNSNKGGKFVIDLGGNSNGRLTPGNSTSPACSPRSSGDYGYGLWPYCVYRGFVMAASVEGEGKCSKPTVEV

Query:  SVQHVNCTEDAAAFVALAAAIDLSMDACRLFSHRLRKELCQ
        SVQHVNCTEDAAAFVALAAAIDLSMDACRLFS +LRKELCQ
Subjt:  SVQHVNCTEDAAAFVALAAAIDLSMDACRLFSHRLRKELCQ

A0A6J1J0W7 uncharacterized protein LOC1114802761.7e-24194.78Show/hide
Query:  MDPCPFVRLTVGNLALKVPVASKPARSLVHPSSSPCFCKIKFRKLPLQTAVVPFIQPENQFPDGQVQSTAAATFHLSKSDLDKLAGKSLFASKPCLKISI
        MDPCPFVRLTVGNLALKVPVASKP RS+VHPSSSPCFCKIKF+KLP+QTAVVPF   ENQFPDGQV ST AATFHLSKSDL+KLAGKSLFAS+PCLKISI
Subjt:  MDPCPFVRLTVGNLALKVPVASKPARSLVHPSSSPCFCKIKFRKLPLQTAVVPFIQPENQFPDGQVQSTAAATFHLSKSDLDKLAGKSLFASKPCLKISI

Query:  YSGRRGTTCGVDSGRLLGKVSVPLDLAGTESRATVFHNGWISVGKESKGSCAQFHLNVKAEPDPRFVFQFDGEPECSPQVFQIQGNIRQPVFTCKFSFRT
        YSGRRGTTCGVDSGRLLGKVSVPL LAGTESRATVFHNGWISVGK+SKGSCAQFHLNVKAEPDPRFVFQFDGEPECSPQVFQIQGNIRQPVFTCKFSFRT
Subjt:  YSGRRGTTCGVDSGRLLGKVSVPLDLAGTESRATVFHNGWISVGKESKGSCAQFHLNVKAEPDPRFVFQFDGEPECSPQVFQIQGNIRQPVFTCKFSFRT

Query:  GDRTQRSRSLPTESSSSRGWLSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRSNPGSWLILRPGDGTWKPWGRLEAWRERGG
        GDRTQRSRSLPTESSSSRGWLSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRSNPGSWLILRPGDGTWKPWGRLEAWRERGG
Subjt:  GDRTQRSRSLPTESSSSRGWLSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRSNPGSWLILRPGDGTWKPWGRLEAWRERGG

Query:  SDGLGYRFELMPDTSGGMSAAGIVLAESALNSNKGGKFVIDLGGNSNGRLTPGNSTSPACSPRSSGDYGYGLWPYCVYRGFVMAASVEGEGKCSKPTVEV
        SDGLGYRFELMPDT GGMSAAGIVLAESALN+NKGGKFVIDLGG+S GR TP NSTSPACSPRSSGDYGYGLWPYCVYRGFVM AS+EGEGKCSKP VEV
Subjt:  SDGLGYRFELMPDTSGGMSAAGIVLAESALNSNKGGKFVIDLGGNSNGRLTPGNSTSPACSPRSSGDYGYGLWPYCVYRGFVMAASVEGEGKCSKPTVEV

Query:  SVQHVNCTEDAAAFVALAAAIDLSMDACRLFSHRLRKELCQ
        SVQHVNCTEDAAAFVALAAAIDLSMDACRLFS RLRKELCQ
Subjt:  SVQHVNCTEDAAAFVALAAAIDLSMDACRLFSHRLRKELCQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G10020.1 Protein of unknown function (DUF1005)5.2e-19073.59Show/hide
Query:  MDPCPFVRLTVGNLALKVPVASKPARSLVHPSSSPCFCKIKFRKLPLQTAVVPFIQPE-NQFPDGQVQSTAAATFHLSKSDLDKLAGKSLFASKPCLKIS
        MDPCPF+RLT+GNLALKVP+A+K   S+VHPSSSPCFCKIK +  P QTA +P+I  E  QFP+ Q   T AATFHLS SD+ +LA +S+F SKPCLKI 
Subjt:  MDPCPFVRLTVGNLALKVPVASKPARSLVHPSSSPCFCKIKFRKLPLQTAVVPFIQPE-NQFPDGQVQSTAAATFHLSKSDLDKLAGKSLFASKPCLKIS

Query:  IYSGRRGTTCGVDSGRLLGKVSVPLDLAGTESRATVFHNGWISVGKES--KGSCAQFHLNVKAEPDPRFVFQFDGEPECSPQVFQIQGNIRQPVFTCKFS
        IY+GR G  CGV SGRLL KVSVPLDL+GT+S+  VFHNGWISVGK +    S AQFHLNVKAEPDPRFVFQFDGEPECSPQV QIQGNIRQPVFTCKFS
Subjt:  IYSGRRGTTCGVDSGRLLGKVSVPLDLAGTESRATVFHNGWISVGKES--KGSCAQFHLNVKAEPDPRFVFQFDGEPECSPQVFQIQGNIRQPVFTCKFS

Query:  FR-TGDRTQRSRSLPTESSSSRGWLSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRSNPGSWLILRPGDGTWKPWGRLEAWR
         R TGDRTQRSRSLPTE+S SR WL+SFGSERERPGKERKGWSITVHDLSGSPVA AS+VTPFVASPG+DRVSRSNPGSWLILRPGD TW+PWGRLEAWR
Subjt:  FR-TGDRTQRSRSLPTESSSSRGWLSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRSNPGSWLILRPGDGTWKPWGRLEAWR

Query:  ERGG-SDGLGYRFELMPDTSGGMSAAGIVLAESALNSNKGGKFVIDLGGNSN----------------GRLTPGNSTSPACSPR-SSGDYGYGLWPYCVY
        ERGG +DGLGYRFEL+PD   G S AGIVLAES ++S++GGKF I+LG + +                G    G   SPA SPR  SGDYGYGLWP+ VY
Subjt:  ERGG-SDGLGYRFELMPDTSGGMSAAGIVLAESALNSNKGGKFVIDLGGNSN----------------GRLTPGNSTSPACSPR-SSGDYGYGLWPYCVY

Query:  RGFVMAASVEGEGKCSKPTVEVSVQHVNCTEDAAAFVALAAAIDLSMDACRLFSHRLRKELC
        +GFVM+ASVEGEGKCSKP VEVSVQHV+C EDAAA+VAL+AAIDLSMDACRLF+ R+RKELC
Subjt:  RGFVMAASVEGEGKCSKPTVEVSVQHVNCTEDAAAFVALAAAIDLSMDACRLFSHRLRKELC

AT1G50040.1 Protein of unknown function (DUF1005)1.9e-10750.1Show/hide
Query:  MDPCPFVRLTVGNLALKVP--------VASKPARSLVHPSSSPCFCKIKFRKLPLQTAVVPFI-----QPENQFPDGQVQSTAAATFHLSKSDLDKLAGK
        MDPC FVR+ VGNLA++ P         +S    S+   SS  C+CKIKF+  P Q   VP +     + E++   G V ST AA F LSKS ++    K
Subjt:  MDPCPFVRLTVGNLALKVP--------VASKPARSLVHPSSSPCFCKIKFRKLPLQTAVVPFI-----QPENQFPDGQVQSTAAATFHLSKSDLDKLAGK

Query:  SLFASKPCLKISIYSGRRGTTCG---VDSGRLLGKVSVPLDLAGTESRATVFHNGWISVGKESK-----GSCAQFHLNVKAEPDPRFVFQFDGEPECSPQ
        + ++    L + +YS RR  +CG       +L+G+  V LDL   ES+  + HNGW+ +G +SK     GS  + H++V+ EPD RFVFQFDGEPECSPQ
Subjt:  SLFASKPCLKISIYSGRRGTTCG---VDSGRLLGKVSVPLDLAGTESRATVFHNGWISVGKESK-----GSCAQFHLNVKAEPDPRFVFQFDGEPECSPQ

Query:  VFQIQGNIRQPVFTCKFSFR-TGDRTQRSRSLPTESSSSRGWLSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRSNPGSWLI
        VFQ+QGN +Q VFTCKF FR +GD   R+ SL          LSS  S +E+  KERKGWSIT+HDLSGSPVA ASMVTPFV SPGS+RVSRS+PG+WLI
Subjt:  VFQIQGNIRQPVFTCKFSFR-TGDRTQRSRSLPTESSSSRGWLSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRSNPGSWLI

Query:  LRPGDGTWKPWGRLEAWRERGGSDGLGYRFELMPDTSGGMSAAGIVLAESALNSNKGGKFVIDLGGNSNGRLTPGNSTSPA-------CSPRSS------
        LRP   TWKPW RL+AWRE G SD LGYRFEL  D   G++ A  V A S++++  GG F+ID  G+++   T   S+S          S RSS      
Subjt:  LRPGDGTWKPWGRLEAWRERGGSDGLGYRFELMPDTSGGMSAAGIVLAESALNSNKGGKFVIDLGGNSNGRLTPGNSTSPA-------CSPRSS------

Query:  -GDYGYGLWPYCVYRGFVMAASVEGEGKCSKPTVEVSVQHVNCTEDAAAFVALAAAIDLSMDACRLFSHRLRKELCQ
          D+ + L       GFVM+  V+G  K SKP VEV V+HV CTEDAAA VALAAA+DLSMDACRLFS +LR EL Q
Subjt:  -GDYGYGLWPYCVYRGFVMAASVEGEGKCSKPTVEVSVQHVNCTEDAAAFVALAAAIDLSMDACRLFSHRLRKELCQ

AT3G19680.1 Protein of unknown function (DUF1005)9.0e-11850.81Show/hide
Query:  MDPCPFVRLTVGNLALKVPVASK-------PARSLVHPSSSPCFCKIKFRKLPLQTAVVPFI-----QPENQFPDGQVQSTAAATFHLSKSDLDKLAGKS
        MDPC FVR+ VGNLA++ P +S        P+ S ++P++  C+CKI+F+  P +   VP +     + E +       ST AA F LSK+ ++    K 
Subjt:  MDPCPFVRLTVGNLALKVPVASK-------PARSLVHPSSSPCFCKIKFRKLPLQTAVVPFI-----QPENQFPDGQVQSTAAATFHLSKSDLDKLAGKS

Query:  LFASKPCLKISIYS--------GRRGTTCGVDSG--RLLGKVSVPLDLAGTESRATVFHNGWISV----GKESKGSCAQFHLNVKAEPDPRFVFQFDGEP
         F+    L +  YS        G  G +CG+ +   +LLG+  V LDL   E+++ + HNGW+++     K   GS  + H++V+ EPDPRFVFQFDGEP
Subjt:  LFASKPCLKISIYS--------GRRGTTCGVDSG--RLLGKVSVPLDLAGTESRATVFHNGWISV----GKESKGSCAQFHLNVKAEPDPRFVFQFDGEP

Query:  ECSPQVFQIQGNIRQPVFTCKFSFR---TGDRT-QRSRSLPTESSSSRGWLSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSR
        ECSPQVFQ+QGN +Q VFTCKF  R   +GDR    S S+ +E SS+R  +SS  SE+E+P KERKGWSITVHDLSGSPVA ASMVTPFV SPGS+RV+R
Subjt:  ECSPQVFQIQGNIRQPVFTCKFSFR---TGDRT-QRSRSLPTESSSSRGWLSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSR

Query:  SNPGSWLILRPGDGTWKPWGRLEAWRERGGSDGLGYRFELMPDTSGGMSAAGIVLAESALNSNKGGKFVIDL-GGNSNGRLTPGNS--------------
        S+PG+WLILRP   TWKPWGRLEAWRE G SD LGYRFEL  D   G++ A  V A S+++   GG FVID+ GG S    TP  S              
Subjt:  SNPGSWLILRPGDGTWKPWGRLEAWRERGGSDGLGYRFELMPDTSGGMSAAGIVLAESALNSNKGGKFVIDL-GGNSNGRLTPGNS--------------

Query:  TSPACSP--RSSGDYGYGLWPY----CVYRGFVMAASVEGEGKCSKPTVEVSVQHVNCTEDAAAFVALAAAIDLSMDACRLFSHRLRKELCQ
        + PA  P   S  D+GY L  +       RGFVM+A+VEG GK SKP VEV V HV CTEDAAA VALAAA+DLS+DACRLFSH+LRKEL Q
Subjt:  TSPACSP--RSSGDYGYGLWPY----CVYRGFVMAASVEGEGKCSKPTVEVSVQHVNCTEDAAAFVALAAAIDLSMDACRLFSHRLRKELCQ

AT4G29310.1 Protein of unknown function (DUF1005)6.9e-12654.2Show/hide
Query:  MDPCPFVRLTVGNLALKVP--VASKPARSLVHPSSSPCFCKIKFRKLPLQTAVVPFIQPENQFPDGQ---VQSTAAATFHLSKSDLDKLAGKSLFASKPC
        MDPCPFVRLT+ +LAL++P    +K     VHPSS+PC+CK++ +  P Q A++P     + F D       ST+A  FHL    + +++GK     K  
Subjt:  MDPCPFVRLTVGNLALKVP--VASKPARSLVHPSSSPCFCKIKFRKLPLQTAVVPFIQPENQFPDGQ---VQSTAAATFHLSKSDLDKLAGKSLFASKPC

Query:  LKISIYSGRRGTTCGVDSGRLLGKVSVPLDLAGTESRATVFHNGWISVGKESKGSCAQFHLNVKAEPDPRFVFQFDGEPECSPQVFQIQGNIRQPVFTCK
        L++S+Y+GR G TCGV SG+LLGKV V +DLA   SR   FHNGW  +G +     A+ HL V AEPDPRFVFQF GEPECSP V+QIQ N++QPVF+CK
Subjt:  LKISIYSGRRGTTCGVDSGRLLGKVSVPLDLAGTESRATVFHNGWISVGKESKGSCAQFHLNVKAEPDPRFVFQFDGEPECSPQVFQIQGNIRQPVFTCK

Query:  FSFRTGDRTQRSRSLPTE-SSSSRGWLS---SFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRSNPGSWLILRPGD---GTWKP
        FS    DR  RSRSLP+  + SSRGW++   S     ++  +ERKGW IT+HDLSGSPVAAASM+TPFVASPGSDRVSRSNPG+WLILRP      +WKP
Subjt:  FSFRTGDRTQRSRSLPTE-SSSSRGWLS---SFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRSNPGSWLILRPGD---GTWKP

Query:  WGRLEAWRERGGSDGLGYRFELMPDTSGGMSAAGIVLAESALNSNKGGKFVIDLGGNSNGRLTPGNSTSPACSPRSSGDYGYGLWPYCVYRGFVMAASVE
        WGRLEAWRERG  DGLGY+FEL+ D S   ++ GI +AE  +++ +GGKF ID       R   G   SPA S                 +GFVM +SVE
Subjt:  WGRLEAWRERGGSDGLGYRFELMPDTSGGMSAAGIVLAESALNSNKGGKFVIDLGGNSNGRLTPGNSTSPACSPRSSGDYGYGLWPYCVYRGFVMAASVE

Query:  GEGKCSKPTVEVSVQHVNCTEDAAAFVALAAAIDLSMDACRLFSHRLRKELC
        GEGK SKP V V  QHV C  DAA FVAL+AA+DLS+DAC+LFS +LRKELC
Subjt:  GEGKCSKPTVEVSVQHVNCTEDAAAFVALAAAIDLSMDACRLFSHRLRKELC

AT5G17640.1 Protein of unknown function (DUF1005)1.7e-8742.02Show/hide
Query:  MDPCPFVRLTVGNLALKVP---VASKPARSLVHPSSSPCFCKIKFRKLPLQTAVVPFIQPENQFPDGQVQSTAAATFHLSKSDLDKLAGKSLFASKPC-L
        MDP  F+RL+VG+LAL++P   + S    +     SS C C+IK R  P+QT  +P +   +  PD    ST   +F+L +SDL  L     F S    L
Subjt:  MDPCPFVRLTVGNLALKVP---VASKPARSLVHPSSSPCFCKIKFRKLPLQTAVVPFIQPENQFPDGQVQSTAAATFHLSKSDLDKLAGKSLFASKPC-L

Query:  KISIYSGRRGTTCGVDSGR-LLGKVSVPLDLAGTESRATVFHNGWISVGKESKGSCAQFHLNVKAEPDPRFVFQFDGEPECSPQVFQIQGNIRQPVFTCK
        +IS+++G++   CGV   R  +G   + +     E +  +  NGWIS+GK  +   A+ HL VK +PDPR+VFQF+     SPQ+ Q++G+++QP+F+CK
Subjt:  KISIYSGRRGTTCGVDSGR-LLGKVSVPLDLAGTESRATVFHNGWISVGKESKGSCAQFHLNVKAEPDPRFVFQFDGEPECSPQVFQIQGNIRQPVFTCK

Query:  FSFRTGDRTQRSRSLPTESSSSRGWLSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRSNPGSWLILRPG---DGTWKPWGRL
        FS    DR  +   L    SS     S  G+E E   +ERKGW + +HDLSGS VAAA + TPFV S G D V++SNPG+WL++RP      +W+PWG+L
Subjt:  FSFRTGDRTQRSRSLPTESSSSRGWLSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRSNPGSWLILRPG---DGTWKPWGRL

Query:  EAWRERGGSDGLGYRFELMPDTSGGMSAAGIVLAESALNSNKGGKFVIDLGGNSNGRLTPGNSTSPACSPRSSGDYGYGLWPYCVYRGFVMAASVEGEGK
        EAWRERG  D +  RF L+   S G+    ++++E  +++ KGG+F+ID        LT   + +P  SP+SSGD+  GL       GFVM++ V+GEGK
Subjt:  EAWRERGGSDGLGYRFELMPDTSGGMSAAGIVLAESALNSNKGGKFVIDLGGNSNGRLTPGNSTSPACSPRSSGDYGYGLWPYCVYRGFVMAASVEGEGK

Query:  CSKPTVEVSVQHVNCTEDAAAFVALAAAIDLSMDACRLFSHRLRK
         SKP V+++++HV C EDAA F+ALAAA+DLS+ AC+ F    R+
Subjt:  CSKPTVEVSVQHVNCTEDAAAFVALAAAIDLSMDACRLFSHRLRK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATCCCTGCCCTTTTGTGCGTCTCACCGTTGGGAATCTCGCCCTTAAGGTTCCGGTGGCGTCGAAGCCTGCTCGCTCCCTGGTTCATCCTTCTTCGTCTCCCTGTTT
CTGTAAAATCAAGTTCAGGAAACTGCCGTTGCAAACGGCGGTTGTGCCATTTATTCAGCCGGAGAATCAGTTTCCGGATGGTCAGGTTCAGTCCACGGCTGCTGCTACTT
TTCATCTCAGTAAGTCTGATCTCGACAAGCTCGCTGGAAAGTCTTTGTTCGCTTCGAAACCATGCCTCAAAATCTCCATCTACAGCGGGCGGAGAGGCACGACCTGCGGT
GTCGACTCCGGGAGGCTCTTGGGTAAAGTGTCCGTGCCTTTGGATCTGGCGGGAACTGAATCTAGGGCAACGGTGTTCCACAATGGGTGGATTTCGGTGGGGAAAGAGTC
CAAAGGTTCGTGTGCGCAATTTCATTTGAATGTGAAAGCAGAACCTGACCCGAGATTCGTGTTTCAGTTCGACGGCGAGCCCGAATGTAGTCCACAGGTGTTTCAGATTC
AAGGCAACATCAGACAACCCGTATTCACCTGCAAGTTCAGTTTCAGAACCGGCGACCGAACCCAGAGATCCAGGTCGTTGCCCACAGAATCGAGCAGCTCGAGGGGGTGG
CTCAGCTCGTTCGGAAGCGAAAGGGAACGCCCAGGAAAGGAGCGGAAGGGTTGGTCAATCACCGTCCACGACCTTTCCGGTTCTCCGGTCGCGGCTGCTTCTATGGTTAC
ACCCTTTGTAGCGTCGCCGGGTTCTGACCGTGTCAGCCGGTCCAACCCCGGGTCGTGGCTCATTCTCCGCCCTGGGGACGGCACCTGGAAGCCATGGGGCCGGCTCGAGG
CTTGGCGGGAGCGTGGTGGATCGGACGGACTTGGGTACCGGTTCGAGCTAATGCCGGACACGAGCGGCGGGATGAGCGCGGCGGGGATTGTACTGGCGGAGTCGGCTCTG
AACTCGAACAAGGGCGGAAAGTTCGTGATCGACTTGGGGGGAAACTCGAACGGGAGATTGACGCCGGGGAATTCGACGTCGCCGGCGTGCAGCCCGAGGAGCAGCGGAGA
CTACGGGTACGGGCTTTGGCCGTACTGCGTGTATAGAGGGTTCGTGATGGCAGCTAGCGTAGAGGGCGAAGGAAAATGCAGCAAGCCCACGGTGGAAGTTAGCGTGCAGC
ACGTGAACTGCACAGAAGACGCGGCCGCTTTCGTGGCATTGGCAGCGGCCATCGATCTTAGCATGGACGCTTGCAGGCTTTTCTCTCACAGGCTCAGGAAGGAGCTCTGC
CAG
mRNA sequenceShow/hide mRNA sequence
ATGGATCCCTGCCCTTTTGTGCGTCTCACCGTTGGGAATCTCGCCCTTAAGGTTCCGGTGGCGTCGAAGCCTGCTCGCTCCCTGGTTCATCCTTCTTCGTCTCCCTGTTT
CTGTAAAATCAAGTTCAGGAAACTGCCGTTGCAAACGGCGGTTGTGCCATTTATTCAGCCGGAGAATCAGTTTCCGGATGGTCAGGTTCAGTCCACGGCTGCTGCTACTT
TTCATCTCAGTAAGTCTGATCTCGACAAGCTCGCTGGAAAGTCTTTGTTCGCTTCGAAACCATGCCTCAAAATCTCCATCTACAGCGGGCGGAGAGGCACGACCTGCGGT
GTCGACTCCGGGAGGCTCTTGGGTAAAGTGTCCGTGCCTTTGGATCTGGCGGGAACTGAATCTAGGGCAACGGTGTTCCACAATGGGTGGATTTCGGTGGGGAAAGAGTC
CAAAGGTTCGTGTGCGCAATTTCATTTGAATGTGAAAGCAGAACCTGACCCGAGATTCGTGTTTCAGTTCGACGGCGAGCCCGAATGTAGTCCACAGGTGTTTCAGATTC
AAGGCAACATCAGACAACCCGTATTCACCTGCAAGTTCAGTTTCAGAACCGGCGACCGAACCCAGAGATCCAGGTCGTTGCCCACAGAATCGAGCAGCTCGAGGGGGTGG
CTCAGCTCGTTCGGAAGCGAAAGGGAACGCCCAGGAAAGGAGCGGAAGGGTTGGTCAATCACCGTCCACGACCTTTCCGGTTCTCCGGTCGCGGCTGCTTCTATGGTTAC
ACCCTTTGTAGCGTCGCCGGGTTCTGACCGTGTCAGCCGGTCCAACCCCGGGTCGTGGCTCATTCTCCGCCCTGGGGACGGCACCTGGAAGCCATGGGGCCGGCTCGAGG
CTTGGCGGGAGCGTGGTGGATCGGACGGACTTGGGTACCGGTTCGAGCTAATGCCGGACACGAGCGGCGGGATGAGCGCGGCGGGGATTGTACTGGCGGAGTCGGCTCTG
AACTCGAACAAGGGCGGAAAGTTCGTGATCGACTTGGGGGGAAACTCGAACGGGAGATTGACGCCGGGGAATTCGACGTCGCCGGCGTGCAGCCCGAGGAGCAGCGGAGA
CTACGGGTACGGGCTTTGGCCGTACTGCGTGTATAGAGGGTTCGTGATGGCAGCTAGCGTAGAGGGCGAAGGAAAATGCAGCAAGCCCACGGTGGAAGTTAGCGTGCAGC
ACGTGAACTGCACAGAAGACGCGGCCGCTTTCGTGGCATTGGCAGCGGCCATCGATCTTAGCATGGACGCTTGCAGGCTTTTCTCTCACAGGCTCAGGAAGGAGCTCTGC
CAG
Protein sequenceShow/hide protein sequence
MDPCPFVRLTVGNLALKVPVASKPARSLVHPSSSPCFCKIKFRKLPLQTAVVPFIQPENQFPDGQVQSTAAATFHLSKSDLDKLAGKSLFASKPCLKISIYSGRRGTTCG
VDSGRLLGKVSVPLDLAGTESRATVFHNGWISVGKESKGSCAQFHLNVKAEPDPRFVFQFDGEPECSPQVFQIQGNIRQPVFTCKFSFRTGDRTQRSRSLPTESSSSRGW
LSSFGSERERPGKERKGWSITVHDLSGSPVAAASMVTPFVASPGSDRVSRSNPGSWLILRPGDGTWKPWGRLEAWRERGGSDGLGYRFELMPDTSGGMSAAGIVLAESAL
NSNKGGKFVIDLGGNSNGRLTPGNSTSPACSPRSSGDYGYGLWPYCVYRGFVMAASVEGEGKCSKPTVEVSVQHVNCTEDAAAFVALAAAIDLSMDACRLFSHRLRKELC
Q