; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS007892 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS007892
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionNuclear factor 1 A-type isoform 2
Genome locationscaffold13:2146748..2148153
RNA-Seq ExpressionMS007892
SyntenyMS007892
Gene Ontology termsNA
InterPro domainsIPR010410 - Protein of unknown function DUF1005


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039981.1 Nuclear factor 1 A-type isoform 2 [Cucumis melo var. makuwa]2.9e-24094.61Show/hide
Query:  MNPQAFIRLSIGSLGLRVPGDSLNSTKPGINSFSSPCSCEIRLRGFPVQSSSIPILPSPEAIPESHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        M+PQAFIRLSIGSLGLR+PG SLNSTKPG+N+FSSPCSCEIRLRGFP+Q+SSIP+LPSPEAIP+SH IASSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MNPQAFIRLSIGSLGLRVPGDSLNSTKPGINSFSSPCSCEIRLRGFPVQSSSIPILPSPEAIPESHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDV PEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQF+DVTR SPQIVQLQGSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS

Query:  RDRVSQADSLSNYWSGSADGLDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSSPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
        RDRVSQADSLSNYWSG  DG DLE ERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARS+PGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
Subjt:  RDRVSQADSLSNYWSGSADGLDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSSPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD

Query:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIF
        TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAA SPIPSPQSSGDFAALGQVVGGFVMSCRVQGEG+SSKP VQLAMRHVTC+EDAAIF
Subjt:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIF

Query:  MALAAAVDLSIEACRPFRRKIRRAPRH
        MALAAAVDLSIEACRPFRRKIRR PRH
Subjt:  MALAAAVDLSIEACRPFRRKIRRAPRH

XP_004153015.1 uncharacterized protein LOC101204096 [Cucumis sativus]1.4e-23993.91Show/hide
Query:  MNPQAFIRLSIGSLGLRVPGDSLNSTKPGINSFSSPCSCEIRLRGFPVQSSSIPILPSPEAIPESHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        M+PQAFIRLSIGSLGLR+PG SLNSTKPG+N+FSSPCSCEIRLRGFP+Q+SSIP++PSPEAIP+SH IASSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MNPQAFIRLSIGSLGLRVPGDSLNSTKPGINSFSSPCSCEIRLRGFPVQSSSIPILPSPEAIPESHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDV PEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQF+DVTR SPQ+VQL+GSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS

Query:  RDRVSQADSLSNYWSGSADGLDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSSPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
        RDRVSQADSLSNYWSG  DG DLE ERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARS+PGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
Subjt:  RDRVSQADSLSNYWSGSADGLDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSSPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD

Query:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIF
        TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAA SPIPSPQSSGDFAALGQVVGGFVMSCRVQGEG+SSKP VQLAMRHVTC+EDAAIF
Subjt:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIF

Query:  MALAAAVDLSIEACRPFRRKIRRAPRH
        MALAAAVDLSIEACRPFRRKIRR PRH
Subjt:  MALAAAVDLSIEACRPFRRKIRRAPRH

XP_008460086.1 PREDICTED: uncharacterized protein LOC103499002 [Cucumis melo]6.4e-24094.38Show/hide
Query:  MNPQAFIRLSIGSLGLRVPGDSLNSTKPGINSFSSPCSCEIRLRGFPVQSSSIPILPSPEAIPESHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        M+PQAFIRLSIGSLGLR+PG SLNSTKPG+N+FSSPCSCEIRLRGFP+Q+SSIP++PSPEAIP+SH IASSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MNPQAFIRLSIGSLGLRVPGDSLNSTKPGINSFSSPCSCEIRLRGFPVQSSSIPILPSPEAIPESHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDV PEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQF+DVTR SPQIVQLQGSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS

Query:  RDRVSQADSLSNYWSGSADGLDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSSPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
        RDRVSQADSLSNYWSG  DG DLE ERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARS+PGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
Subjt:  RDRVSQADSLSNYWSGSADGLDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSSPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD

Query:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIF
        TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAA SPIPSPQSSGDFAALGQVVGGFVMSCRVQGEG+SSKP VQLAMRHVTC+EDAAIF
Subjt:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIF

Query:  MALAAAVDLSIEACRPFRRKIRRAPRH
        MALAAAVDLSIEACRPFRRKIRR PRH
Subjt:  MALAAAVDLSIEACRPFRRKIRRAPRH

XP_022155757.1 uncharacterized protein LOC111022801 [Momordica charantia]2.4e-25099.77Show/hide
Query:  MNPQAFIRLSIGSLGLRVPGDSLNSTKPGINSFSSPCSCEIRLRGFPVQSSSIPILPSPEAIPESHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        MNPQAFIRLSIGSLGLRVPGDSLNSTKPGINSFSSPCSCEIRLRGFPVQSSSIPILPSPEAIPESHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MNPQAFIRLSIGSLGLRVPGDSLNSTKPGINSFSSPCSCEIRLRGFPVQSSSIPILPSPEAIPESHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHL+VKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS

Query:  RDRVSQADSLSNYWSGSADGLDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSSPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
        RDRVSQADSLSNYWSGSADGLDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSSPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
Subjt:  RDRVSQADSLSNYWSGSADGLDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSSPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD

Query:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIF
        TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIF
Subjt:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIF

Query:  MALAAAVDLSIEACRPFRRKIRRAPRH
        MALAAAVDLSIEACRPFRRKIRRAPRH
Subjt:  MALAAAVDLSIEACRPFRRKIRRAPRH

XP_038874344.1 uncharacterized protein LOC120067041 [Benincasa hispida]7.4e-24496.02Show/hide
Query:  MNPQAFIRLSIGSLGLRVPGDSLNSTKPGINSFSSPCSCEIRLRGFPVQSSSIPILPSPEAIPESHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        M+PQAFIRLSIGSLGLR+PG SLNSTKPG+N+FSSPCSCEIRLRGFPVQ+SSIP++PSPEAIP+SHSI SSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MNPQAFIRLSIGSLGLRVPGDSLNSTKPGINSFSSPCSCEIRLRGFPVQSSSIPILPSPEAIPESHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS

Query:  RDRVSQADSLSNYWSGSADGLDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSSPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
        RDRVSQADSLSNYWSG  DG DLE ERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARS+PGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
Subjt:  RDRVSQADSLSNYWSGSADGLDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSSPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD

Query:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIF
        TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAA SPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTC+EDAAIF
Subjt:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIF

Query:  MALAAAVDLSIEACRPFRRKIRRAPRH
        MALAAAVDLSIEACRPFRRKIRRAPRH
Subjt:  MALAAAVDLSIEACRPFRRKIRRAPRH

TrEMBL top hitse value%identityAlignment
A0A0A0K922 Uncharacterized protein6.9e-24093.91Show/hide
Query:  MNPQAFIRLSIGSLGLRVPGDSLNSTKPGINSFSSPCSCEIRLRGFPVQSSSIPILPSPEAIPESHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        M+PQAFIRLSIGSLGLR+PG SLNSTKPG+N+FSSPCSCEIRLRGFP+Q+SSIP++PSPEAIP+SH IASSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MNPQAFIRLSIGSLGLRVPGDSLNSTKPGINSFSSPCSCEIRLRGFPVQSSSIPILPSPEAIPESHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDV PEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQF+DVTR SPQ+VQL+GSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS

Query:  RDRVSQADSLSNYWSGSADGLDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSSPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
        RDRVSQADSLSNYWSG  DG DLE ERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARS+PGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
Subjt:  RDRVSQADSLSNYWSGSADGLDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSSPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD

Query:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIF
        TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAA SPIPSPQSSGDFAALGQVVGGFVMSCRVQGEG+SSKP VQLAMRHVTC+EDAAIF
Subjt:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIF

Query:  MALAAAVDLSIEACRPFRRKIRRAPRH
        MALAAAVDLSIEACRPFRRKIRR PRH
Subjt:  MALAAAVDLSIEACRPFRRKIRRAPRH

A0A1S3CCZ5 uncharacterized protein LOC1034990023.1e-24094.38Show/hide
Query:  MNPQAFIRLSIGSLGLRVPGDSLNSTKPGINSFSSPCSCEIRLRGFPVQSSSIPILPSPEAIPESHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        M+PQAFIRLSIGSLGLR+PG SLNSTKPG+N+FSSPCSCEIRLRGFP+Q+SSIP++PSPEAIP+SH IASSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MNPQAFIRLSIGSLGLRVPGDSLNSTKPGINSFSSPCSCEIRLRGFPVQSSSIPILPSPEAIPESHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDV PEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQF+DVTR SPQIVQLQGSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS

Query:  RDRVSQADSLSNYWSGSADGLDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSSPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
        RDRVSQADSLSNYWSG  DG DLE ERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARS+PGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
Subjt:  RDRVSQADSLSNYWSGSADGLDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSSPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD

Query:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIF
        TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAA SPIPSPQSSGDFAALGQVVGGFVMSCRVQGEG+SSKP VQLAMRHVTC+EDAAIF
Subjt:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIF

Query:  MALAAAVDLSIEACRPFRRKIRRAPRH
        MALAAAVDLSIEACRPFRRKIRR PRH
Subjt:  MALAAAVDLSIEACRPFRRKIRRAPRH

A0A5A7TE92 Nuclear factor 1 A-type isoform 21.4e-24094.61Show/hide
Query:  MNPQAFIRLSIGSLGLRVPGDSLNSTKPGINSFSSPCSCEIRLRGFPVQSSSIPILPSPEAIPESHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        M+PQAFIRLSIGSLGLR+PG SLNSTKPG+N+FSSPCSCEIRLRGFP+Q+SSIP+LPSPEAIP+SH IASSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MNPQAFIRLSIGSLGLRVPGDSLNSTKPGINSFSSPCSCEIRLRGFPVQSSSIPILPSPEAIPESHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDV PEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQF+DVTR SPQIVQLQGSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS

Query:  RDRVSQADSLSNYWSGSADGLDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSSPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
        RDRVSQADSLSNYWSG  DG DLE ERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARS+PGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
Subjt:  RDRVSQADSLSNYWSGSADGLDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSSPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD

Query:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIF
        TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAA SPIPSPQSSGDFAALGQVVGGFVMSCRVQGEG+SSKP VQLAMRHVTC+EDAAIF
Subjt:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIF

Query:  MALAAAVDLSIEACRPFRRKIRRAPRH
        MALAAAVDLSIEACRPFRRKIRR PRH
Subjt:  MALAAAVDLSIEACRPFRRKIRRAPRH

A0A6J1DQ78 uncharacterized protein LOC1110228011.1e-25099.77Show/hide
Query:  MNPQAFIRLSIGSLGLRVPGDSLNSTKPGINSFSSPCSCEIRLRGFPVQSSSIPILPSPEAIPESHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        MNPQAFIRLSIGSLGLRVPGDSLNSTKPGINSFSSPCSCEIRLRGFPVQSSSIPILPSPEAIPESHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MNPQAFIRLSIGSLGLRVPGDSLNSTKPGINSFSSPCSCEIRLRGFPVQSSSIPILPSPEAIPESHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHL+VKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS

Query:  RDRVSQADSLSNYWSGSADGLDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSSPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
        RDRVSQADSLSNYWSGSADGLDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSSPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
Subjt:  RDRVSQADSLSNYWSGSADGLDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSSPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD

Query:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIF
        TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIF
Subjt:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIF

Query:  MALAAAVDLSIEACRPFRRKIRRAPRH
        MALAAAVDLSIEACRPFRRKIRRAPRH
Subjt:  MALAAAVDLSIEACRPFRRKIRRAPRH

A0A6J1HM61 uncharacterized protein LOC1114642382.6e-23993.68Show/hide
Query:  MNPQAFIRLSIGSLGLRVPGDSLNSTKPGINSFSSPCSCEIRLRGFPVQSSSIPILPSPEAIPESHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        M+PQAFIRLSIGSLGLR+PG ++NSTKPG+++ SSPCSCEIRLRGFPVQ+SSIP++ SPEAIP+SHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MNPQAFIRLSIGSLGLRVPGDSLNSTKPGINSFSSPCSCEIRLRGFPVQSSSIPILPSPEAIPESHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPV+LFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQL+GSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS

Query:  RDRVSQADSLSNYWSGSADGLDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSSPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
        RDRVSQADSLSNYWSGS DG D+EAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARS+PGSWLIVRPDV IP+SWQPWGKLEAWRERGIRD
Subjt:  RDRVSQADSLSNYWSGSADGLDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSSPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD

Query:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIF
        TVCCRFHLLSE QEGGELLMSEIHINAEKGGEFFIDTDKQLRAA SPIPSPQSSGDFAALGQV+GGFVMSCRVQGEGKSSKPMVQLAMRHVTC+EDAAIF
Subjt:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIF

Query:  MALAAAVDLSIEACRPFRRKIRRAPRH
        MA+AAAVDLSIEACRPFRRKIRRAPRH
Subjt:  MALAAAVDLSIEACRPFRRKIRRAPRH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G10020.1 Protein of unknown function (DUF1005)9.7e-9341.08Show/hide
Query:  MNPQAFIRLSIGSLGLRVPGDSLNSTKPGINSFSSPCSCEIRLRGFPVQSSSIPILP-SPEAIPESHSIASSFYLEESDLKALLAPGCFYNTHACLEISV
        M+P  FIRL+IG+L L+VP  +  +T   ++  SSPC C+I+L+ FP Q+++IP +P      PE  ++A++F+L  SD++  LA    + +  CL+I +
Subjt:  MNPQAFIRLSIGSLGLRVPGDSLNSTKPGINSFSSPCSCEIRLRGFPVQSSSIPILP-SPEAIPESHSIASSFYLEESDLKALLAPGCFYNTHACLEISV

Query:  FSGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKF
        ++GR G+ CGV   R L+    + +       KP +  NGWI +GK   ++    A+ HL VK +PDPR+VFQF+     SPQ+VQ+QG+I+QP+F+CKF
Subjt:  FSGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKF

Query:  S----RDRVSQADSL------SNYWSGSADGLDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSSPGSWLIVRPDVCIPESWQPWGK
        S     DR  ++ SL      S  W  S  G + E   +ERKGW + +HDLSGS VA A I TPFV S G D V+RS+PGSWLI+RP  C   +W+PWG+
Subjt:  S----RDRVSQADSL------SNYWSGSADGLDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSSPGSWLIVRPDVCIPESWQPWGK

Query:  LEAWRER-GIRDTVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAA----------------------ASPIPSPQ-SSGDFAA---LGQ
        LEAWRER G  D +  RF L+ +   G  ++++E  I++ +GG+F I+      ++                      ASP  SP+  SGD+        
Subjt:  LEAWRER-GIRDTVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAA----------------------ASPIPSPQ-SSGDFAA---LGQ

Query:  VVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIFMALAAAVDLSIEACRPFRRKIRRAPRH
        V  GFVMS  V+GEGK SKP V+++++HV+C+EDAA ++AL+AA+DLS++ACR F +++R+   H
Subjt:  VVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIFMALAAAVDLSIEACRPFRRKIRRAPRH

AT1G50040.1 Protein of unknown function (DUF1005)3.5e-7437.23Show/hide
Query:  MNPQAFIRLSIGSLGLRVP------GDSLNSTKPGINSFSS-PCSCEIRLRGFPVQSSSIPILPSPEAIPESH-------SIASSFYLEESDLKALLAPG
        M+P +F+R+ +G+L +R P        S +S+ P ++  SS  C C+I+ + FP Q  S+P+L   E+  ES        ++A+ F L +S ++  L   
Subjt:  MNPQAFIRLSIGSLGLRVP------GDSLNSTKPGINSFSS-PCSCEIRLRGFPVQSSSIPILPSPEAIPESH-------SIASSFYLEESDLKALLAPG

Query:  CFYNTHACLEISVFSGRKGSHCG--VGIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKNENGRHGA--ELHLRVKLDPDPRYVFQFEDVTRLSPQ
         +    + L + V+S R+ + CG       +LIG F++ +  +  + K  +  NGW+ +G     N + G+  ELH+ V+++PD R+VFQF+     SPQ
Subjt:  CFYNTHACLEISVFSGRKGSHCG--VGIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKNENGRHGA--ELHLRVKLDPDPRYVFQFEDVTRLSPQ

Query:  IVQLQGSIKQPIFSCKF----SRDRVSQADSLSNYWSGSADGLDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSSPGSWLIVRPDV
        + Q+QG+ KQ +F+CKF    S DR + + SLS+  SG       E   +ERKGW + IHDLSGS VA A + TPFVPS G + V+RSSPG+WLI+RPD 
Subjt:  IVQLQGSIKQPIFSCKF----SRDRVSQADSLSNYWSGSADGLDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSSPGSWLIVRPDV

Query:  CIPESWQPWGKLEAWRERGIRDTVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVG-------------
            +W+PW +L+AWRE G+ D +  RF L  +       + +   I+ + GG F ID        AS   S + S D ++   +               
Subjt:  CIPESWQPWGKLEAWRERGIRDTVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVG-------------

Query:  ---------GFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIFMALAAAVDLSIEACRPFRRKIRRAPR
                 GFVMS RVQG  K SKP V++ ++HVTC EDAA  +ALAAAVDLS++ACR F +K+R   R
Subjt:  ---------GFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIFMALAAAVDLSIEACRPFRRKIRRAPR

AT3G19680.1 Protein of unknown function (DUF1005)3.7e-7636.16Show/hide
Query:  MNPQAFIRLSIGSLGLRVPGDSLNSTK------PGINSFSSPCSCEIRLRGFPVQSSSIPILPSPEAIPESH--------SIASSFYLEESDLKALLAPG
        M+P +F+R+ +G+L +R P  S +S+        GIN  +  C C+IR + FP +  S+P++   E+  E+         ++A+ F L ++ ++A L   
Subjt:  MNPQAFIRLSIGSLGLRVPGDSLNSTK------PGINSFSSPCSCEIRLRGFPVQSSSIPILPSPEAIPESH--------SIASSFYLEESDLKALLAPG

Query:  CFYNTHACLEISVFS--------GRKGSHCGVGIK-RQLIGTFKLDVGPEWGDGKPVILFNGWIGI--GKSKNENGRHGAELHLRVKLDPDPRYVFQFED
         F    + L +  +S        G  G+ CG+     +L+G F++ +  +  + K  +  NGW+ +   K+K++ G    ELH+ V+++PDPR+VFQF+ 
Subjt:  CFYNTHACLEISVFS--------GRKGSHCGVGIK-RQLIGTFKLDVGPEWGDGKPVILFNGWIGI--GKSKNENGRHGAELHLRVKLDPDPRYVFQFED

Query:  VTRLSPQIVQLQGSIKQPIFSCKF--------SRDRVSQADSLSNYWSG----SADGLDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWV
            SPQ+ Q+QG+ KQ +F+CKF         R+ +  +  +S   S     S+   + E   +ERKGW + +HDLSGS VA A + TPFVPS G + V
Subjt:  VTRLSPQIVQLQGSIKQPIFSCKF--------SRDRVSQADSLSNYWSG----SADGLDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWV

Query:  ARSSPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRDTVCCRFHLLSEAQEG-GELLMSEIHINAEKGGEFFID-TDKQLRAAASPIPSPQSSGDFAALG
         RSSPG+WLI+RPD C   +W+PWG+LEAWRE G  DT+  RF L    Q+G    + +   I+ + GG F ID T      A++P  SPQ S D  + G
Subjt:  ARSSPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRDTVCCRFHLLSEAQEG-GELLMSEIHINAEKGGEFFID-TDKQLRAAASPIPSPQSSGDFAALG

Query:  QVVG------------------------------GFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIFMALAAAVDLSIEACRPFRRKIRRAPR
           G                              GFVMS  V+G GK SKP V++ + HVTC EDAA  +ALAAAVDLS++ACR F  K+R+  R
Subjt:  QVVG------------------------------GFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIFMALAAAVDLSIEACRPFRRKIRRAPR

AT4G29310.1 Protein of unknown function (DUF1005)4.5e-8239.41Show/hide
Query:  MNPQAFIRLSIGSLGLRVPGDSLNSTKPG-INSFSSPCSCEIRLRGFPVQSSSIPILPSPEAI--PESHSIASSFYLEESDLKALLAPGCFYNTHACLEI
        M+P  F+RL+I SL LR+P  + N    G ++  S+PC C++R++ FP Q + +P+    +A   PES + A  F+L+   ++ +            L +
Subjt:  MNPQAFIRLSIGSLGLRVPGDSLNSTKPG-INSFSSPCSCEIRLRGFPVQSSSIPILPSPEAI--PESHSIASSFYLEESDLKALLAPGCFYNTHACLEI

Query:  SVFSGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSC
        SV++GR G  CGV    +L+G  ++ V       + V   NGW  +G    +  +  A LHL V  +PDPR+VFQF      SP + Q+Q ++KQP+FSC
Subjt:  SVFSGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSC

Query:  KFSRDRVSQADSLSNYWSGSADGLDL---------EAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSSPGSWLIVRPDVCIPESWQPWG
        KFS DR  ++ SL + ++ S+ G            + + RERKGW + IHDLSGS VAAA + TPFV S G D V+RS+PG+WLI+RP      SW+PWG
Subjt:  KFSRDRVSQADSLSNYWSGSADGLDL---------EAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSSPGSWLIVRPDVCIPESWQPWG

Query:  KLEAWRERGIRDTVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAM
        +LEAWRERG  D +  +F L+ +      + ++E  ++ ++GG+F ID                  G+  A+   V GFVM   V+GEGK SKP+V +  
Subjt:  KLEAWRERGIRDTVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAM

Query:  RHVTCVEDAAIFMALAAAVDLSIEACRPFRRKIRRAPRH
        +HVTC+ DAA+F+AL+AAVDLS++AC+ F RK+R+   H
Subjt:  RHVTCVEDAAIFMALAAAVDLSIEACRPFRRKIRRAPRH

AT5G17640.1 Protein of unknown function (DUF1005)4.1e-19276.39Show/hide
Query:  MNPQAFIRLSIGSLGLRVPGDSLNSTKPG--INSFSSPCSCEIRLRGFPVQSSSIPILPSPEAIPESHSIASSFYLEESDLKALLAPGCFYNTHACLEIS
        M+PQAFIRLS+GSL LR+P   +NST       +FSS CSCEI+LRGFPVQ++SIP++PS +A P+ HSI++SFYLEESDL+ALL PGCFY+ HA LEIS
Subjt:  MNPQAFIRLSIGSLGLRVPGDSLNSTKPG--INSFSSPCSCEIRLRGFPVQSSSIPILPSPEAIPESHSIASSFYLEESDLKALLAPGCFYNTHACLEIS

Query:  VFSGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCK
        VF+G+K  +CGVG KRQ IG FKL+VGPEWG+GKP+ILFNGWI IGK+K +     AELHL+VKLDPDPRYVFQFEDVT LSPQIVQL+GS+KQPIFSCK
Subjt:  VFSGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCK

Query:  FSRDRVSQADSLSNYWSGSADGLDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSSPGSWLIVRPDVCIPESWQPWGKLEAWRERGI
        FSRDRVSQ D L+ YWS S DG +LE+ERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVA+S+PG+WL+VRPD   P SWQPWGKLEAWRERGI
Subjt:  FSRDRVSQADSLSNYWSGSADGLDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSSPGSWLIVRPDVCIPESWQPWGKLEAWRERGI

Query:  RDTVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQ-LRAAASPIPSPQSSGDFAALGQVV--GGFVMSCRVQGEGKSSKPMVQLAMRHVTCVE
        RD+VCCRFHLLS   E G++LMSEI I+AEKGGEF IDTDKQ L  AA+PIPSPQSSGDF+ LGQ V  GGFVMS RVQGEGKSSKP+VQLAMRHVTCVE
Subjt:  RDTVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQ-LRAAASPIPSPQSSGDFAALGQVV--GGFVMSCRVQGEGKSSKPMVQLAMRHVTCVE

Query:  DAAIFMALAAAVDLSIEACRPFRRKIRRAPRH
        DAAIFMALAAAVDLSI AC+PFRR  RR  RH
Subjt:  DAAIFMALAAAVDLSIEACRPFRRKIRRAPRH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATCCGCAGGCTTTTATTAGGTTGTCGATTGGCTCGCTGGGATTGAGAGTCCCGGGAGATTCTCTAAACTCTACAAAACCTGGAATCAATTCTTTCTCTTCTCCATG
CTCATGCGAAATTCGTCTTCGAGGTTTCCCCGTGCAGTCATCTTCAATTCCCATACTCCCGTCTCCCGAAGCCATACCTGAGTCTCATAGCATTGCCTCAAGCTTCTATC
TTGAGGAGTCTGATCTTAAAGCATTACTTGCACCTGGCTGCTTCTACAACACTCATGCCTGTCTTGAAATATCTGTGTTCTCTGGAAGGAAAGGATCCCACTGTGGTGTT
GGCATCAAAAGGCAGCTGATTGGGACGTTTAAGCTGGATGTCGGTCCCGAATGGGGCGATGGGAAGCCGGTCATTCTTTTCAATGGGTGGATAGGCATTGGCAAAAGTAA
GAATGAGAATGGAAGGCATGGTGCAGAGCTTCATTTGAGAGTGAAACTGGATCCAGATCCAAGATATGTTTTCCAGTTTGAAGACGTCACGAGGCTAAGTCCACAAATCG
TCCAGCTTCAAGGCTCGATCAAACAGCCGATCTTCAGCTGCAAATTTAGTCGAGACAGGGTGTCCCAGGCGGATTCATTGAGCAACTATTGGTCAGGTTCGGCCGATGGC
CTCGATCTTGAGGCCGAGAGAAGAGAAAGAAAAGGCTGGAAGGTGAAGATACATGATCTTTCTGGCTCGGCTGTTGCAGCAGCCTTCATAACTACTCCCTTTGTGCCATC
AACAGGTTGCGATTGGGTTGCCAGGTCGAGCCCCGGGTCCTGGCTGATTGTTCGTCCCGATGTTTGCATACCTGAAAGTTGGCAGCCATGGGGAAAGCTTGAGGCGTGGC
GCGAGCGTGGAATTAGAGACACCGTCTGCTGTCGCTTCCACCTTCTCTCTGAAGCGCAGGAGGGAGGTGAACTTCTCATGTCTGAGATCCACATCAACGCCGAGAAAGGT
GGGGAGTTCTTCATTGACACCGATAAACAGCTACGAGCTGCAGCAAGTCCAATACCGAGCCCACAGAGCAGCGGGGACTTTGCAGCATTAGGCCAAGTGGTGGGAGGTTT
CGTGATGAGCTGCAGAGTACAAGGGGAAGGAAAGAGCAGTAAGCCGATGGTTCAACTCGCAATGCGACATGTGACGTGTGTGGAGGACGCTGCTATTTTCATGGCGCTTG
CAGCTGCCGTCGACCTGAGCATCGAGGCGTGCAGGCCGTTCCGAAGGAAGATCAGGAGAGCGCCTCGGCAT
mRNA sequenceShow/hide mRNA sequence
ATGAATCCGCAGGCTTTTATTAGGTTGTCGATTGGCTCGCTGGGATTGAGAGTCCCGGGAGATTCTCTAAACTCTACAAAACCTGGAATCAATTCTTTCTCTTCTCCATG
CTCATGCGAAATTCGTCTTCGAGGTTTCCCCGTGCAGTCATCTTCAATTCCCATACTCCCGTCTCCCGAAGCCATACCTGAGTCTCATAGCATTGCCTCAAGCTTCTATC
TTGAGGAGTCTGATCTTAAAGCATTACTTGCACCTGGCTGCTTCTACAACACTCATGCCTGTCTTGAAATATCTGTGTTCTCTGGAAGGAAAGGATCCCACTGTGGTGTT
GGCATCAAAAGGCAGCTGATTGGGACGTTTAAGCTGGATGTCGGTCCCGAATGGGGCGATGGGAAGCCGGTCATTCTTTTCAATGGGTGGATAGGCATTGGCAAAAGTAA
GAATGAGAATGGAAGGCATGGTGCAGAGCTTCATTTGAGAGTGAAACTGGATCCAGATCCAAGATATGTTTTCCAGTTTGAAGACGTCACGAGGCTAAGTCCACAAATCG
TCCAGCTTCAAGGCTCGATCAAACAGCCGATCTTCAGCTGCAAATTTAGTCGAGACAGGGTGTCCCAGGCGGATTCATTGAGCAACTATTGGTCAGGTTCGGCCGATGGC
CTCGATCTTGAGGCCGAGAGAAGAGAAAGAAAAGGCTGGAAGGTGAAGATACATGATCTTTCTGGCTCGGCTGTTGCAGCAGCCTTCATAACTACTCCCTTTGTGCCATC
AACAGGTTGCGATTGGGTTGCCAGGTCGAGCCCCGGGTCCTGGCTGATTGTTCGTCCCGATGTTTGCATACCTGAAAGTTGGCAGCCATGGGGAAAGCTTGAGGCGTGGC
GCGAGCGTGGAATTAGAGACACCGTCTGCTGTCGCTTCCACCTTCTCTCTGAAGCGCAGGAGGGAGGTGAACTTCTCATGTCTGAGATCCACATCAACGCCGAGAAAGGT
GGGGAGTTCTTCATTGACACCGATAAACAGCTACGAGCTGCAGCAAGTCCAATACCGAGCCCACAGAGCAGCGGGGACTTTGCAGCATTAGGCCAAGTGGTGGGAGGTTT
CGTGATGAGCTGCAGAGTACAAGGGGAAGGAAAGAGCAGTAAGCCGATGGTTCAACTCGCAATGCGACATGTGACGTGTGTGGAGGACGCTGCTATTTTCATGGCGCTTG
CAGCTGCCGTCGACCTGAGCATCGAGGCGTGCAGGCCGTTCCGAAGGAAGATCAGGAGAGCGCCTCGGCAT
Protein sequenceShow/hide protein sequence
MNPQAFIRLSIGSLGLRVPGDSLNSTKPGINSFSSPCSCEIRLRGFPVQSSSIPILPSPEAIPESHSIASSFYLEESDLKALLAPGCFYNTHACLEISVFSGRKGSHCGV
GIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFSRDRVSQADSLSNYWSGSADG
LDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSSPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRDTVCCRFHLLSEAQEGGELLMSEIHINAEKG
GEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIFMALAAAVDLSIEACRPFRRKIRRAPRH