; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg10184 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg10184
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionNuclear factor 1 A-type isoform 2
Genome locationCarg_Chr17:223590..227300
RNA-Seq ExpressionCarg10184
SyntenyCarg10184
Gene Ontology termsNA
InterPro domainsIPR010410 - Protein of unknown function DUF1005


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6574807.1 hypothetical protein SDJN03_25446, partial [Cucurbita argyrosperma subsp. sororia]5.4e-258100Show/hide
Query:  MDPQAFIRLSIGSLGLRIPGAALNSAKPGVNAFSAPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        MDPQAFIRLSIGSLGLRIPGAALNSAKPGVNAFSAPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRIPGAALNSAKPGVNAFSAPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGNGKPVVLFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLLGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDVGPEWGNGKPVVLFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLLGSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGNGKPVVLFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLLGSIKQPIFSCKFS

Query:  RDRVSQVDSLSNYWSGSGSGDGSDWLGSGDGSDLKVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSAGCDWVARSNPGSWLIVRPDVCIPESWQPWGK
        RDRVSQVDSLSNYWSGSGSGDGSDWLGSGDGSDLKVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSAGCDWVARSNPGSWLIVRPDVCIPESWQPWGK
Subjt:  RDRVSQVDSLSNYWSGSGSGDGSDWLGSGDGSDLKVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSAGCDWVARSNPGSWLIVRPDVCIPESWQPWGK

Query:  LEAWRERGIRDTVCCRFHLLSEAQEGGELLMSEIHINAERGGEFFIDTDKQLRAATVPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMR
        LEAWRERGIRDTVCCRFHLLSEAQEGGELLMSEIHINAERGGEFFIDTDKQLRAATVPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMR
Subjt:  LEAWRERGIRDTVCCRFHLLSEAQEGGELLMSEIHINAERGGEFFIDTDKQLRAATVPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMR

Query:  HVTCTEDAAIFMALAAAVGLSIEACRPFRRKIRRASRHS
        HVTCTEDAAIFMALAAAVGLSIEACRPFRRKIRRASRHS
Subjt:  HVTCTEDAAIFMALAAAVGLSIEACRPFRRKIRRASRHS

XP_022959142.1 uncharacterized protein LOC111460223 [Cucurbita moschata]7.8e-25799.77Show/hide
Query:  MDPQAFIRLSIGSLGLRIPGAALNSAKPGVNAFSAPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        MDPQAFIRLSIGSLGLRIPGAALNSAKPGVNAFSAPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRIPGAALNSAKPGVNAFSAPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGNGKPVVLFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLLGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDVGPEWGNGKPVVLFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLLGSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGNGKPVVLFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLLGSIKQPIFSCKFS

Query:  RDRVSQVDSLSNYWSGSGSGDGSDWLGSGDGSDLKVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSAGCDWVARSNPGSWLIVRPDVCIPESWQPWGK
        RDRVSQVDSLSNYWSGSGSGDGSDWLGSGDGSDLKVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSAGCDWVARSNPGSWLIVRPDVCIPESWQPWGK
Subjt:  RDRVSQVDSLSNYWSGSGSGDGSDWLGSGDGSDLKVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSAGCDWVARSNPGSWLIVRPDVCIPESWQPWGK

Query:  LEAWRERGIRDTVCCRFHLLSEAQEGGELLMSEIHINAERGGEFFIDTDKQLRAATVPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMR
        LEAWRERGIRDTVCCRFHLLSEAQEGGELLMSEIHINAERGGEFFIDTDKQLRAATVPI SPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMR
Subjt:  LEAWRERGIRDTVCCRFHLLSEAQEGGELLMSEIHINAERGGEFFIDTDKQLRAATVPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMR

Query:  HVTCTEDAAIFMALAAAVGLSIEACRPFRRKIRRASRHS
        HVTCTEDAAIFMALAAAVGLSIEACRPFRRKIRRASRHS
Subjt:  HVTCTEDAAIFMALAAAVGLSIEACRPFRRKIRRASRHS

XP_023006148.1 uncharacterized protein LOC111498975 [Cucurbita maxima]3.0e-25699.09Show/hide
Query:  MDPQAFIRLSIGSLGLRIPGAALNSAKPGVNAFSAPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        MDPQAFIRLSIGSLGLRIPGAALNS KPGVNAFSAPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRIPGAALNSAKPGVNAFSAPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGNGKPVVLFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLLGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDVGPEWG+GKPVVLFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLLGSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGNGKPVVLFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLLGSIKQPIFSCKFS

Query:  RDRVSQVDSLSNYWSGSGSGDGSDWLGSGDGSDLKVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSAGCDWVARSNPGSWLIVRPDVCIPESWQPWGK
        RDRVSQ+DSLSNYWSGSGSGDGSDWLGSGDGSDLKVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSAGCDWVARSNPGSWLIVRPDVCIPESWQPWGK
Subjt:  RDRVSQVDSLSNYWSGSGSGDGSDWLGSGDGSDLKVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSAGCDWVARSNPGSWLIVRPDVCIPESWQPWGK

Query:  LEAWRERGIRDTVCCRFHLLSEAQEGGELLMSEIHINAERGGEFFIDTDKQLRAATVPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMR
        LEAWRERGIRDTVCCRFHLLSEAQEGGELLMSEIHINAERGGEFFIDTDKQLRAATVPIPSPQSSGDFAALGQ+VGGFVMSCRVQGEGKSSKPMVQLAMR
Subjt:  LEAWRERGIRDTVCCRFHLLSEAQEGGELLMSEIHINAERGGEFFIDTDKQLRAATVPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMR

Query:  HVTCTEDAAIFMALAAAVGLSIEACRPFRRKIRRASRHS
        HVTCTEDAAIFMALAAAVGLSIEACRPFRRKIRRASRHS
Subjt:  HVTCTEDAAIFMALAAAVGLSIEACRPFRRKIRRASRHS

XP_023548058.1 uncharacterized protein LOC111806814 [Cucurbita pepo subsp. pepo]2.3e-25699.32Show/hide
Query:  MDPQAFIRLSIGSLGLRIPGAALNSAKPGVNAFSAPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        MDPQAFIRLSIGSLGLRIPGAALNS KPGVNAFSAPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRIPGAALNSAKPGVNAFSAPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGNGKPVVLFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLLGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDVGPEWG+GKPVVLFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLLGSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGNGKPVVLFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLLGSIKQPIFSCKFS

Query:  RDRVSQVDSLSNYWSGSGSGDGSDWLGSGDGSDLKVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSAGCDWVARSNPGSWLIVRPDVCIPESWQPWGK
        RDRVSQVDSLSNYW GSGSGDGSDWLGSGDGSDLKVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSAGCDWVARSNPGSWLIVRPDVCIPESWQPWGK
Subjt:  RDRVSQVDSLSNYWSGSGSGDGSDWLGSGDGSDLKVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSAGCDWVARSNPGSWLIVRPDVCIPESWQPWGK

Query:  LEAWRERGIRDTVCCRFHLLSEAQEGGELLMSEIHINAERGGEFFIDTDKQLRAATVPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMR
        LEAWRERGIRDTVCCRFHLLSEAQEGGELLMSEIHINAERGGEFFIDTDKQLRAATVPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMR
Subjt:  LEAWRERGIRDTVCCRFHLLSEAQEGGELLMSEIHINAERGGEFFIDTDKQLRAATVPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMR

Query:  HVTCTEDAAIFMALAAAVGLSIEACRPFRRKIRRASRHS
        HVTCTEDAAIFMALAAAVGLSIEACRPFRRKIRRASRHS
Subjt:  HVTCTEDAAIFMALAAAVGLSIEACRPFRRKIRRASRHS

XP_038874344.1 uncharacterized protein LOC120067041 [Benincasa hispida]5.1e-24093.39Show/hide
Query:  MDPQAFIRLSIGSLGLRIPGAALNSAKPGVNAFSAPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        MDPQAFIRLSIGSLGLRIPG +LNS KPGVNAFS+PCSCEIRLRGFPVQTSSIPL+PSPEAIPDSHSI SSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRIPGAALNSAKPGVNAFSAPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGNGKPVVLFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLLGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDVGPEWG+GKPV+LFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQL GSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGNGKPVVLFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLLGSIKQPIFSCKFS

Query:  RDRVSQVDSLSNYWSGSGSGDGSDWLGSGDGSDLKVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSAGCDWVARSNPGSWLIVRPDVCIPESWQPWGK
        RDRVSQ DSLSNYWS           G GDGSDL+VERRERKGWKVKIHDLSGSAVAAAFITTPFVPS GCDWVARSNPGSWLIVRPDVCIPESWQPWGK
Subjt:  RDRVSQVDSLSNYWSGSGSGDGSDWLGSGDGSDLKVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSAGCDWVARSNPGSWLIVRPDVCIPESWQPWGK

Query:  LEAWRERGIRDTVCCRFHLLSEAQEGGELLMSEIHINAERGGEFFIDTDKQLRAATVPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMR
        LEAWRERGIRDTVCCRFHLLSEAQEGGELLMSEIHINAE+GGEFFIDTDKQLRAAT PIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMR
Subjt:  LEAWRERGIRDTVCCRFHLLSEAQEGGELLMSEIHINAERGGEFFIDTDKQLRAATVPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMR

Query:  HVTCTEDAAIFMALAAAVGLSIEACRPFRRKIRRASRHS
        HVTC EDAAIFMALAAAV LSIEACRPFRRKIRRA RHS
Subjt:  HVTCTEDAAIFMALAAAVGLSIEACRPFRRKIRRASRHS

TrEMBL top hitse value%identityAlignment
A0A0A0K922 Uncharacterized protein1.6e-23691.57Show/hide
Query:  MDPQAFIRLSIGSLGLRIPGAALNSAKPGVNAFSAPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        MDPQAFIRLSIGSLGLRIPG +LNS KPGVNAFS+PCSCEIRLRGFP+QTSSIPL+PSPEAIPDSH IASSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRIPGAALNSAKPGVNAFSAPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGNGKPVVLFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLLGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDV PEWG+GKPV+LFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQF+DVTR SPQ+VQL GSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGNGKPVVLFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLLGSIKQPIFSCKFS

Query:  RDRVSQVDSLSNYWSGSGSGDGSDWLGSGDGSDLKVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSAGCDWVARSNPGSWLIVRPDVCIPESWQPWGK
        RDRVSQ DSLSNYWS           G GDGSDL+VERRERKGWKVKIHDLSGSAVAAAFITTPFVPS GCDWVARSNPGSWLIVRPDVCIPESWQPWGK
Subjt:  RDRVSQVDSLSNYWSGSGSGDGSDWLGSGDGSDLKVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSAGCDWVARSNPGSWLIVRPDVCIPESWQPWGK

Query:  LEAWRERGIRDTVCCRFHLLSEAQEGGELLMSEIHINAERGGEFFIDTDKQLRAATVPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMR
        LEAWRERGIRDTVCCRFHLLSEAQEGGELLMSEIHINAE+GGEFFIDTDKQLRAAT PIPSPQSSGDFAALGQVVGGFVMSCRVQGEG+SSKP VQLAMR
Subjt:  LEAWRERGIRDTVCCRFHLLSEAQEGGELLMSEIHINAERGGEFFIDTDKQLRAATVPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMR

Query:  HVTCTEDAAIFMALAAAVGLSIEACRPFRRKIRRASRHS
        HVTC EDAAIFMALAAAV LSIEACRPFRRKIRR  RHS
Subjt:  HVTCTEDAAIFMALAAAVGLSIEACRPFRRKIRRASRHS

A0A1S3CCZ5 uncharacterized protein LOC1034990022.2e-23691.8Show/hide
Query:  MDPQAFIRLSIGSLGLRIPGAALNSAKPGVNAFSAPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        MDPQAFIRLSIGSLGLRIPG +LNS KPGVNAFS+PCSCEIRLRGFP+QTSSIPL+PSPEAIPDSH IASSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRIPGAALNSAKPGVNAFSAPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGNGKPVVLFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLLGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDV PEWG+GKPV+LFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQF+DVTR SPQIVQL GSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGNGKPVVLFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLLGSIKQPIFSCKFS

Query:  RDRVSQVDSLSNYWSGSGSGDGSDWLGSGDGSDLKVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSAGCDWVARSNPGSWLIVRPDVCIPESWQPWGK
        RDRVSQ DSLSNYWS           G GDGSDL+VERRERKGWKVKIHDLSGSAVAAAFITTPFVPS GCDWVARSNPGSWLIVRPDVCIPESWQPWGK
Subjt:  RDRVSQVDSLSNYWSGSGSGDGSDWLGSGDGSDLKVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSAGCDWVARSNPGSWLIVRPDVCIPESWQPWGK

Query:  LEAWRERGIRDTVCCRFHLLSEAQEGGELLMSEIHINAERGGEFFIDTDKQLRAATVPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMR
        LEAWRERGIRDTVCCRFHLLSEAQEGGELLMSEIHINAE+GGEFFIDTDKQLRAAT PIPSPQSSGDFAALGQVVGGFVMSCRVQGEG+SSKP VQLAMR
Subjt:  LEAWRERGIRDTVCCRFHLLSEAQEGGELLMSEIHINAERGGEFFIDTDKQLRAATVPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMR

Query:  HVTCTEDAAIFMALAAAVGLSIEACRPFRRKIRRASRHS
        HVTC EDAAIFMALAAAV LSIEACRPFRRKIRR  RHS
Subjt:  HVTCTEDAAIFMALAAAVGLSIEACRPFRRKIRRASRHS

A0A5A7TE92 Nuclear factor 1 A-type isoform 29.7e-23792.03Show/hide
Query:  MDPQAFIRLSIGSLGLRIPGAALNSAKPGVNAFSAPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        MDPQAFIRLSIGSLGLRIPG +LNS KPGVNAFS+PCSCEIRLRGFP+QTSSIPLLPSPEAIPDSH IASSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRIPGAALNSAKPGVNAFSAPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGNGKPVVLFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLLGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDV PEWG+GKPV+LFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQF+DVTR SPQIVQL GSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGNGKPVVLFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLLGSIKQPIFSCKFS

Query:  RDRVSQVDSLSNYWSGSGSGDGSDWLGSGDGSDLKVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSAGCDWVARSNPGSWLIVRPDVCIPESWQPWGK
        RDRVSQ DSLSNYWS           G GDGSDL+VERRERKGWKVKIHDLSGSAVAAAFITTPFVPS GCDWVARSNPGSWLIVRPDVCIPESWQPWGK
Subjt:  RDRVSQVDSLSNYWSGSGSGDGSDWLGSGDGSDLKVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSAGCDWVARSNPGSWLIVRPDVCIPESWQPWGK

Query:  LEAWRERGIRDTVCCRFHLLSEAQEGGELLMSEIHINAERGGEFFIDTDKQLRAATVPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMR
        LEAWRERGIRDTVCCRFHLLSEAQEGGELLMSEIHINAE+GGEFFIDTDKQLRAAT PIPSPQSSGDFAALGQVVGGFVMSCRVQGEG+SSKP VQLAMR
Subjt:  LEAWRERGIRDTVCCRFHLLSEAQEGGELLMSEIHINAERGGEFFIDTDKQLRAATVPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMR

Query:  HVTCTEDAAIFMALAAAVGLSIEACRPFRRKIRRASRHS
        HVTC EDAAIFMALAAAV LSIEACRPFRRKIRR  RHS
Subjt:  HVTCTEDAAIFMALAAAVGLSIEACRPFRRKIRRASRHS

A0A6J1H3Q9 uncharacterized protein LOC1114602233.8e-25799.77Show/hide
Query:  MDPQAFIRLSIGSLGLRIPGAALNSAKPGVNAFSAPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        MDPQAFIRLSIGSLGLRIPGAALNSAKPGVNAFSAPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRIPGAALNSAKPGVNAFSAPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGNGKPVVLFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLLGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDVGPEWGNGKPVVLFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLLGSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGNGKPVVLFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLLGSIKQPIFSCKFS

Query:  RDRVSQVDSLSNYWSGSGSGDGSDWLGSGDGSDLKVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSAGCDWVARSNPGSWLIVRPDVCIPESWQPWGK
        RDRVSQVDSLSNYWSGSGSGDGSDWLGSGDGSDLKVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSAGCDWVARSNPGSWLIVRPDVCIPESWQPWGK
Subjt:  RDRVSQVDSLSNYWSGSGSGDGSDWLGSGDGSDLKVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSAGCDWVARSNPGSWLIVRPDVCIPESWQPWGK

Query:  LEAWRERGIRDTVCCRFHLLSEAQEGGELLMSEIHINAERGGEFFIDTDKQLRAATVPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMR
        LEAWRERGIRDTVCCRFHLLSEAQEGGELLMSEIHINAERGGEFFIDTDKQLRAATVPI SPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMR
Subjt:  LEAWRERGIRDTVCCRFHLLSEAQEGGELLMSEIHINAERGGEFFIDTDKQLRAATVPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMR

Query:  HVTCTEDAAIFMALAAAVGLSIEACRPFRRKIRRASRHS
        HVTCTEDAAIFMALAAAVGLSIEACRPFRRKIRRASRHS
Subjt:  HVTCTEDAAIFMALAAAVGLSIEACRPFRRKIRRASRHS

A0A6J1L446 uncharacterized protein LOC1114989751.4e-25699.09Show/hide
Query:  MDPQAFIRLSIGSLGLRIPGAALNSAKPGVNAFSAPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        MDPQAFIRLSIGSLGLRIPGAALNS KPGVNAFSAPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRIPGAALNSAKPGVNAFSAPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGNGKPVVLFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLLGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDVGPEWG+GKPVVLFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLLGSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGNGKPVVLFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLLGSIKQPIFSCKFS

Query:  RDRVSQVDSLSNYWSGSGSGDGSDWLGSGDGSDLKVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSAGCDWVARSNPGSWLIVRPDVCIPESWQPWGK
        RDRVSQ+DSLSNYWSGSGSGDGSDWLGSGDGSDLKVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSAGCDWVARSNPGSWLIVRPDVCIPESWQPWGK
Subjt:  RDRVSQVDSLSNYWSGSGSGDGSDWLGSGDGSDLKVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSAGCDWVARSNPGSWLIVRPDVCIPESWQPWGK

Query:  LEAWRERGIRDTVCCRFHLLSEAQEGGELLMSEIHINAERGGEFFIDTDKQLRAATVPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMR
        LEAWRERGIRDTVCCRFHLLSEAQEGGELLMSEIHINAERGGEFFIDTDKQLRAATVPIPSPQSSGDFAALGQ+VGGFVMSCRVQGEGKSSKPMVQLAMR
Subjt:  LEAWRERGIRDTVCCRFHLLSEAQEGGELLMSEIHINAERGGEFFIDTDKQLRAATVPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMR

Query:  HVTCTEDAAIFMALAAAVGLSIEACRPFRRKIRRASRHS
        HVTCTEDAAIFMALAAAVGLSIEACRPFRRKIRRASRHS
Subjt:  HVTCTEDAAIFMALAAAVGLSIEACRPFRRKIRRASRHS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G10020.1 Protein of unknown function (DUF1005)2.9e-9241.28Show/hide
Query:  MDPQAFIRLSIGSLGLRIPGAALNSAKPGVNAFSAPCSCEIRLRGFPVQTSSIPLLP-SPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISV
        MDP  FIRL+IG+L L++P AA  ++   V+  S+PC C+I+L+ FP QT++IP +P      P+  ++A++F+L  SD++  LA    + +  CL+I +
Subjt:  MDPQAFIRLSIGSLGLRIPGAALNSAKPGVNAFSAPCSCEIRLRGFPVQTSSIPLLP-SPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISV

Query:  FSGRKGSHCGVGIKRQLIGTFKLDVGPEWGNGKPVVLFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLLGSIKQPIFSCKF
        ++GR G+ CGV   R L+    + +       KP V  NGWI +GK   ++    A+ HL VK +PDPR+VFQF+     SPQ+VQ+ G+I+QP+F+CKF
Subjt:  FSGRKGSHCGVGIKRQLIGTFKLDVGPEWGNGKPVVLFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLLGSIKQPIFSCKF

Query:  S----RDRVSQVDSLSNYWSGSGSGDGSDWLGSGDGSDLKVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSAGCDWVARSNPGSWLIVRPDVCIPESW
        S     DR  +  SL    S S S     WL S  GS+ +   +ERKGW + +HDLSGS VA A I TPFV S G D V+RSNPGSWLI+RP  C   +W
Subjt:  S----RDRVSQVDSLSNYWSGSGSGDGSDWLGSGDGSDLKVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSAGCDWVARSNPGSWLIVRPDVCIPESW

Query:  QPWGKLEAWRER-GIRDTVCCRFHLLSEAQEGGELLMSEIHINAERGGEFFIDTDKQLRAAT----------------------VPIPSPQ-SSGDFAA-
        +PWG+LEAWRER G  D +  RF L+ +   G  ++++E  I++ RGG+F I+      +++                       P  SP+  SGD+   
Subjt:  QPWGKLEAWRER-GIRDTVCCRFHLLSEAQEGGELLMSEIHINAERGGEFFIDTDKQLRAAT----------------------VPIPSPQ-SSGDFAA-

Query:  --LGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCTEDAAIFMALAAAVGLSIEACRPFRRKIRRASRH
             V  GFVMS  V+GEGK SKP V+++++HV+C EDAA ++AL+AA+ LS++ACR F +++R+   H
Subjt:  --LGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCTEDAAIFMALAAAVGLSIEACRPFRRKIRRASRH

AT1G50040.1 Protein of unknown function (DUF1005)1.4e-7035.34Show/hide
Query:  MDPQAFIRLSIGSLGLRIP------GAALNSAKPGV-NAFSAPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSH-------SIASSFYLEESDLKALLAPG
        MDP +F+R+ +G+L +R P       ++ +S+ P V +  S  C C+I+ + FP Q  S+P+L   E+  +S        ++A+ F L +S ++  L   
Subjt:  MDPQAFIRLSIGSLGLRIP------GAALNSAKPGV-NAFSAPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSH-------SIASSFYLEESDLKALLAPG

Query:  CFYNTHACLEISVFSGRKGSHCG--VGIKRQLIGTFKLDVGPEWGNGKPVVLFNGWIGIGKSKNENGRHGA--ELHLRVKLDPDPRYVFQFEDVTRLSPQ
         +    + L + V+S R+ + CG       +LIG F++ +  +    K  +  NGW+ +G     N + G+  ELH+ V+++PD R+VFQF+     SPQ
Subjt:  CFYNTHACLEISVFSGRKGSHCG--VGIKRQLIGTFKLDVGPEWGNGKPVVLFNGWIGIGKSKNENGRHGA--ELHLRVKLDPDPRYVFQFEDVTRLSPQ

Query:  IVQLLGSIKQPIFSCKF----SRDRVSQVDSLSNYWSGSGSGDGSDWLGSGDGSDLKVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSAGCDWVARSN
        + Q+ G+ KQ +F+CKF    S DR   + SLS+  SG                  +   +ERKGW + IHDLSGS VA A + TPFVPS G + V+RS+
Subjt:  IVQLLGSIKQPIFSCKF----SRDRVSQVDSLSNYWSGSGSGDGSDWLGSGDGSDLKVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSAGCDWVARSN

Query:  PGSWLIVRPDVCIPESWQPWGKLEAWRERGIRDTVCCRFHLLSEAQEGGELLMSEIHINAERGGEFFIDTDKQLRAATVPIPSPQSSGDFAALGQVVG--
        PG+WLI+RPD     +W+PW +L+AWRE G+ D +  RF L  +       + +   I+ + GG F ID        T    S + S D ++   +    
Subjt:  PGSWLIVRPDVCIPESWQPWGKLEAWRERGIRDTVCCRFHLLSEAQEGGELLMSEIHINAERGGEFFIDTDKQLRAATVPIPSPQSSGDFAALGQVVG--

Query:  --------------------GFVMSCRVQGEGKSSKPMVQLAMRHVTCTEDAAIFMALAAAVGLSIEACRPFRRKIRRASR
                            GFVMS RVQG  K SKP V++ ++HVTCTEDAA  +ALAAAV LS++ACR F +K+R   R
Subjt:  --------------------GFVMSCRVQGEGKSSKPMVQLAMRHVTCTEDAAIFMALAAAVGLSIEACRPFRRKIRRASR

AT3G19680.1 Protein of unknown function (DUF1005)2.7e-7435.76Show/hide
Query:  MDPQAFIRLSIGSLGLRIPGAALNSAK------PGVNAFSAPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSH--------SIASSFYLEESDLKALLAPG
        MDP +F+R+ +G+L +R P ++ +S+        G+N  +  C C+IR + FP +  S+P++   E+  ++         ++A+ F L ++ ++A L   
Subjt:  MDPQAFIRLSIGSLGLRIPGAALNSAK------PGVNAFSAPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSH--------SIASSFYLEESDLKALLAPG

Query:  CFYNTHACLEISVFS--------GRKGSHCGVGIK-RQLIGTFKLDVGPEWGNGKPVVLFNGWIGI--GKSKNENGRHGAELHLRVKLDPDPRYVFQFED
         F    + L +  +S        G  G+ CG+     +L+G F++ +  +    K  +  NGW+ +   K+K++ G    ELH+ V+++PDPR+VFQF+ 
Subjt:  CFYNTHACLEISVFS--------GRKGSHCGVGIK-RQLIGTFKLDVGPEWGNGKPVVLFNGWIGI--GKSKNENGRHGAELHLRVKLDPDPRYVFQFED

Query:  VTRLSPQIVQLLGSIKQPIFSCKF-SRDRVSQVDSLSNYWSGSGSGDGSDWLGSGDGSDLKVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSAGCDWV
            SPQ+ Q+ G+ KQ +F+CKF SR+  S   +L +  S       +    S   S+ +   +ERKGW + +HDLSGS VA A + TPFVPS G + V
Subjt:  VTRLSPQIVQLLGSIKQPIFSCKF-SRDRVSQVDSLSNYWSGSGSGDGSDWLGSGDGSDLKVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSAGCDWV

Query:  ARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRDTVCCRFHLLSEAQEG-GELLMSEIHINAERGGEFFID-TDKQLRAATVPIPSPQSSGDFAALG
         RS+PG+WLI+RPD C   +W+PWG+LEAWRE G  DT+  RF L    Q+G    + +   I+ + GG F ID T      A+ P  SPQ S D  + G
Subjt:  ARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRDTVCCRFHLLSEAQEG-GELLMSEIHINAERGGEFFID-TDKQLRAATVPIPSPQSSGDFAALG

Query:  QVVG------------------------------GFVMSCRVQGEGKSSKPMVQLAMRHVTCTEDAAIFMALAAAVGLSIEACRPFRRKIRRASR
           G                              GFVMS  V+G GK SKP V++ + HVTCTEDAA  +ALAAAV LS++ACR F  K+R+  R
Subjt:  QVVG------------------------------GFVMSCRVQGEGKSSKPMVQLAMRHVTCTEDAAIFMALAAAVGLSIEACRPFRRKIRRASR

AT4G29310.1 Protein of unknown function (DUF1005)9.3e-8340.54Show/hide
Query:  MDPQAFIRLSIGSLGLRIPGAALNSAKPG-VNAFSAPCSCEIRLRGFPVQTSSIPLLPSPEAI--PDSHSIASSFYLEESDLKALLAPGCFYNTHACLEI
        MDP  F+RL+I SL LR+P  A N    G V+  S PC C++R++ FP Q + +PL    +A   P+S + A  F+L+   ++ +            L +
Subjt:  MDPQAFIRLSIGSLGLRIPGAALNSAKPG-VNAFSAPCSCEIRLRGFPVQTSSIPLLPSPEAI--PDSHSIASSFYLEESDLKALLAPGCFYNTHACLEI

Query:  SVFSGRKGSHCGVGIKRQLIGTFKLDVGPEWGNGKPVVLFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLLGSIKQPIFSC
        SV++GR G  CGV    +L+G  ++ V       + V   NGW  +G    +  +  A LHL V  +PDPR+VFQF      SP + Q+  ++KQP+FSC
Subjt:  SVFSGRKGSHCGVGIKRQLIGTFKLDVGPEWGNGKPVVLFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLLGSIKQPIFSC

Query:  KFSRDRVSQVDSLSNYWSGSGSGDGSDWLG---SGDGSDLKVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSAGCDWVARSNPGSWLIVRPDVCIPES
        KFS DR  +  SL + ++ S  G    W+    SGD  + K + RERKGW + IHDLSGS VAAA + TPFV S G D V+RSNPG+WLI+RP      S
Subjt:  KFSRDRVSQVDSLSNYWSGSGSGDGSDWLG---SGDGSDLKVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSAGCDWVARSNPGSWLIVRPDVCIPES

Query:  WQPWGKLEAWRERGIRDTVCCRFHLLSEAQEGGELLMSEIHINAERGGEFFIDTDKQLRAATVPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPM
        W+PWG+LEAWRERG  D +  +F L+ +      + ++E  ++ ++GG+F ID                  G+  A+   V GFVM   V+GEGK SKP+
Subjt:  WQPWGKLEAWRERGIRDTVCCRFHLLSEAQEGGELLMSEIHINAERGGEFFIDTDKQLRAATVPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPM

Query:  VQLAMRHVTCTEDAAIFMALAAAVGLSIEACRPFRRKIRRASRH
        V +  +HVTC  DAA+F+AL+AAV LS++AC+ F RK+R+   H
Subjt:  VQLAMRHVTCTEDAAIFMALAAAVGLSIEACRPFRRKIRRASRH

AT5G17640.1 Protein of unknown function (DUF1005)5.7e-18974.27Show/hide
Query:  MDPQAFIRLSIGSLGLRIPGAALNSAKPG--VNAFSAPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEIS
        MDPQAFIRLS+GSL LRIP   +NS         FS+ CSCEI+LRGFPVQT+SIPL+PS +A PD HSI++SFYLEESDL+ALL PGCFY+ HA LEIS
Subjt:  MDPQAFIRLSIGSLGLRIPGAALNSAKPG--VNAFSAPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEIS

Query:  VFSGRKGSHCGVGIKRQLIGTFKLDVGPEWGNGKPVVLFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLLGSIKQPIFSCK
        VF+G+K  +CGVG KRQ IG FKL+VGPEWG GKP++LFNGWI IGK+K +     AELHL+VKLDPDPRYVFQFEDVT LSPQIVQL GS+KQPIFSCK
Subjt:  VFSGRKGSHCGVGIKRQLIGTFKLDVGPEWGNGKPVVLFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLLGSIKQPIFSCK

Query:  FSRDRVSQVDSLSNYWSGSGSGDGSDWLGSGDGSDLKVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSAGCDWVARSNPGSWLIVRPDVCIPESWQPW
        FSRDRVSQVD L+ YWS            SGDG++L+ ERRERKGWKVKIHDLSGSAVAAAFITTPFVPS GCDWVA+SNPG+WL+VRPD   P SWQPW
Subjt:  FSRDRVSQVDSLSNYWSGSGSGDGSDWLGSGDGSDLKVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSAGCDWVARSNPGSWLIVRPDVCIPESWQPW

Query:  GKLEAWRERGIRDTVCCRFHLLSEAQEGGELLMSEIHINAERGGEFFIDTDKQ-LRAATVPIPSPQSSGDFAALGQVV--GGFVMSCRVQGEGKSSKPMV
        GKLEAWRERGIRD+VCCRFHLLS   E G++LMSEI I+AE+GGEF IDTDKQ L  A  PIPSPQSSGDF+ LGQ V  GGFVMS RVQGEGKSSKP+V
Subjt:  GKLEAWRERGIRDTVCCRFHLLSEAQEGGELLMSEIHINAERGGEFFIDTDKQ-LRAATVPIPSPQSSGDFAALGQVV--GGFVMSCRVQGEGKSSKPMV

Query:  QLAMRHVTCTEDAAIFMALAAAVGLSIEACRPFRRKIRRASRH
        QLAMRHVTC EDAAIFMALAAAV LSI AC+PFRR  RR  RH
Subjt:  QLAMRHVTCTEDAAIFMALAAAVGLSIEACRPFRRKIRRASRH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATCCTCAGGCTTTTATTAGGTTGTCAATTGGATCTTTGGGATTGAGAATCCCAGGAGCTGCTCTAAACTCTGCAAAACCTGGAGTCAATGCTTTCTCTGCACCGTG
TTCGTGTGAAATTCGTCTTCGGGGTTTCCCTGTGCAGACGTCTTCAATCCCGCTTCTCCCGTCTCCTGAAGCCATACCTGATTCTCATAGCATTGCCTCAAGCTTCTATC
TTGAAGAGTCTGATCTGAAAGCATTACTAGCACCTGGCTGCTTCTATAACACTCATGCCTGTCTTGAAATATCTGTCTTCTCTGGAAGGAAGGGATCTCATTGTGGTGTT
GGCATCAAGAGGCAGCTGATTGGGACGTTTAAACTGGATGTCGGTCCCGAATGGGGCAATGGGAAGCCAGTCGTTCTTTTCAATGGGTGGATAGGCATTGGCAAAAGCAA
GAATGAGAATGGAAGACATGGAGCAGAGCTTCATTTGAGAGTGAAACTTGATCCTGATCCAAGATATGTTTTTCAGTTTGAAGATGTTACGAGGTTAAGCCCGCAAATCG
TCCAGCTTCTAGGCTCGATCAAACAGCCGATCTTCAGCTGCAAATTTAGTCGAGACAGGGTATCCCAGGTGGATTCATTGAGCAACTATTGGTCAGGGTCAGGTTCTGGT
GATGGCTCAGATTGGTTAGGTTCTGGTGATGGCTCGGACCTCAAGGTCGAGCGGAGAGAAAGAAAAGGCTGGAAGGTGAAGATACATGATCTTTCTGGCTCGGCTGTTGC
AGCAGCCTTCATAACTACTCCCTTTGTGCCATCAGCAGGTTGCGATTGGGTTGCCAGGTCAAACCCCGGGTCTTGGCTGATTGTTCGTCCTGATGTCTGCATACCTGAAA
GTTGGCAGCCATGGGGAAAGCTCGAGGCATGGCGCGAGCGAGGAATTAGAGACACTGTCTGCTGTCGCTTTCACCTTCTCTCCGAAGCTCAAGAGGGAGGGGAACTTCTC
ATGTCTGAGATCCATATCAATGCTGAGAGAGGTGGGGAGTTCTTCATAGACACTGACAAACAGTTACGAGCAGCAACAGTTCCAATACCGAGCCCGCAGAGCAGCGGAGA
CTTTGCAGCATTAGGCCAAGTGGTCGGAGGTTTCGTCATGAGTTGTAGAGTACAAGGGGAAGGAAAGAGCAGTAAGCCAATGGTGCAACTCGCCATGCGACATGTTACAT
GTACAGAGGATGCTGCCATTTTCATGGCACTAGCTGCAGCAGTTGGTCTCAGCATTGAGGCGTGTAGGCCATTCCGAAGGAAGATTAGGAGAGCGTCTCGGCATTCTTGA
mRNA sequenceShow/hide mRNA sequence
CACTCATCTTCAAACTCCAAATCAAAGTCAAAATTTCGCATCCAAATTTCACTTTCTTTTTTGGATCTTTCTCAAAATTTCACCTCAAACAACCCCCCCCCCCTACCCCT
TAAATTTCTCCGCCTATTTCGTGACAAATAGTGTAGAGTTAAGAGGAATAGGGAGTTTTTTCTTGGCGGCCTCGTGATCTGTGAAGTCTCTACGTGATATTCAAGACTTT
TGTTGCTATCGGGATTTGGGTTTTTCTTTTTCTAAAAGCTCGTTGTTAGATCTCTAAAAACATTCATCCTTTCACATTTAGAGGGAAAATTAGAAGTGGGATTTGATTTT
TGGGTGAAATGTTGCTGTTCTTCTTGTTCTTCAGGGATTCTGCCTGCTCTGCTTTCTGTGGAGCACTACAGTTCTGTTTCTACTTGGCAACTAATCACAAGGATACTTGA
GCTGGTGGCATTATAAACTATCTATATCAAATGGACAAAAAATTGTGACCCAAATGGATCCTCAGGCTTTTATTAGGTTGTCAATTGGATCTTTGGGATTGAGAATCCCA
GGAGCTGCTCTAAACTCTGCAAAACCTGGAGTCAATGCTTTCTCTGCACCGTGTTCGTGTGAAATTCGTCTTCGGGGTTTCCCTGTGCAGACGTCTTCAATCCCGCTTCT
CCCGTCTCCTGAAGCCATACCTGATTCTCATAGCATTGCCTCAAGCTTCTATCTTGAAGAGTCTGATCTGAAAGCATTACTAGCACCTGGCTGCTTCTATAACACTCATG
CCTGTCTTGAAATATCTGTCTTCTCTGGAAGGAAGGGATCTCATTGTGGTGTTGGCATCAAGAGGCAGCTGATTGGGACGTTTAAACTGGATGTCGGTCCCGAATGGGGC
AATGGGAAGCCAGTCGTTCTTTTCAATGGGTGGATAGGCATTGGCAAAAGCAAGAATGAGAATGGAAGACATGGAGCAGAGCTTCATTTGAGAGTGAAACTTGATCCTGA
TCCAAGATATGTTTTTCAGTTTGAAGATGTTACGAGGTTAAGCCCGCAAATCGTCCAGCTTCTAGGCTCGATCAAACAGCCGATCTTCAGCTGCAAATTTAGTCGAGACA
GGGTATCCCAGGTGGATTCATTGAGCAACTATTGGTCAGGGTCAGGTTCTGGTGATGGCTCAGATTGGTTAGGTTCTGGTGATGGCTCGGACCTCAAGGTCGAGCGGAGA
GAAAGAAAAGGCTGGAAGGTGAAGATACATGATCTTTCTGGCTCGGCTGTTGCAGCAGCCTTCATAACTACTCCCTTTGTGCCATCAGCAGGTTGCGATTGGGTTGCCAG
GTCAAACCCCGGGTCTTGGCTGATTGTTCGTCCTGATGTCTGCATACCTGAAAGTTGGCAGCCATGGGGAAAGCTCGAGGCATGGCGCGAGCGAGGAATTAGAGACACTG
TCTGCTGTCGCTTTCACCTTCTCTCCGAAGCTCAAGAGGGAGGGGAACTTCTCATGTCTGAGATCCATATCAATGCTGAGAGAGGTGGGGAGTTCTTCATAGACACTGAC
AAACAGTTACGAGCAGCAACAGTTCCAATACCGAGCCCGCAGAGCAGCGGAGACTTTGCAGCATTAGGCCAAGTGGTCGGAGGTTTCGTCATGAGTTGTAGAGTACAAGG
GGAAGGAAAGAGCAGTAAGCCAATGGTGCAACTCGCCATGCGACATGTTACATGTACAGAGGATGCTGCCATTTTCATGGCACTAGCTGCAGCAGTTGGTCTCAGCATTG
AGGCGTGTAGGCCATTCCGAAGGAAGATTAGGAGAGCGTCTCGGCATTCTTGA
Protein sequenceShow/hide protein sequence
MDPQAFIRLSIGSLGLRIPGAALNSAKPGVNAFSAPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVFSGRKGSHCGV
GIKRQLIGTFKLDVGPEWGNGKPVVLFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLLGSIKQPIFSCKFSRDRVSQVDSLSNYWSGSGSG
DGSDWLGSGDGSDLKVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSAGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRDTVCCRFHLLSEAQEGGELL
MSEIHINAERGGEFFIDTDKQLRAATVPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCTEDAAIFMALAAAVGLSIEACRPFRRKIRRASRHS