; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc11g0314031 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc11g0314031
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionNuclear factor 1 A-type isoform 1
Genome locationCMiso1.1chr11:33070087..33075424
RNA-Seq ExpressionCmc11g0314031
SyntenyCmc11g0314031
Gene Ontology termsNA
InterPro domainsIPR010410 - Protein of unknown function DUF1005


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039981.1 Nuclear factor 1 A-type isoform 2 [Cucumis melo var. makuwa]8.7e-25399.77Show/hide
Query:  MDPQAFIRLSIGSLGLRIPGTSLNSTKPGVNAFSSPCSCEIRLRGFPMQTSSIPLVPSPEAIPDSHVIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        MDPQAFIRLSIGSLGLRIPGTSLNSTKPGVNAFSSPCSCEIRLRGFPMQTSSIPL+PSPEAIPDSHVIASSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRIPGTSLNSTKPGVNAFSSPCSCEIRLRGFPMQTSSIPLVPSPEAIPDSHVIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFQDVTRSSPQIVQLQGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFQDVTRSSPQIVQLQGSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFQDVTRSSPQIVQLQGSIKQPIFSCKFS

Query:  RDRVSQADSLSNYWSGLGDGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
        RDRVSQADSLSNYWSGLGDGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
Subjt:  RDRVSQADSLSNYWSGLGDGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD

Query:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGRSSKPTVQLAMRHVTCIEDAAIF
        TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGRSSKPTVQLAMRHVTCIEDAAIF
Subjt:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGRSSKPTVQLAMRHVTCIEDAAIF

Query:  MALAAAVDLSIEACRPFRRKIRRTPRHS
        MALAAAVDLSIEACRPFRRKIRRTPRHS
Subjt:  MALAAAVDLSIEACRPFRRKIRRTPRHS

KAG6593741.1 hypothetical protein SDJN03_13217, partial [Cucurbita argyrosperma subsp. sororia]2.0e-24194.63Show/hide
Query:  MDPQAFIRLSIGSLGLRIPGTSLNSTKPGVNAFSSPCSCEIRLRGFPMQTSSIPLVPSPEAIPDSHVIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        MDPQAFIRLSIGSLGLRIPGT++NSTKPGV A SSPCSCEIRLRGFP+QTSSIP+V SPEAIPDSH IASSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRIPGTSLNSTKPGVNAFSSPCSCEIRLRGFPMQTSSIPLVPSPEAIPDSHVIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFQDVTRSSPQIVQLQGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDV PEWGDGKPV+LFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQF+DVTR SPQIVQL+GSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFQDVTRSSPQIVQLQGSIKQPIFSCKFS

Query:  RDRVSQADSLSNYWSGLGDGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
        RDRVSQADSLSNYWSG GDGSD+E ERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDV IP+SWQPWGKLEAWRERGIRD
Subjt:  RDRVSQADSLSNYWSGLGDGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD

Query:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGRSSKPTVQLAMRHVTCIEDAAIF
        TVCCRFHLLSE QEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQV+GGFVMSCRVQGEG+SSKP VQLAMRHVTCIEDAAIF
Subjt:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGRSSKPTVQLAMRHVTCIEDAAIF

Query:  MALAAAVDLSIEACRPFRRKIRRTPRHS
        MA+AAAVDLSIEACRPFRRKIRRTPRHS
Subjt:  MALAAAVDLSIEACRPFRRKIRRTPRHS

XP_004153015.1 uncharacterized protein LOC101204096 [Cucumis sativus]9.6e-25299.3Show/hide
Query:  MDPQAFIRLSIGSLGLRIPGTSLNSTKPGVNAFSSPCSCEIRLRGFPMQTSSIPLVPSPEAIPDSHVIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        MDPQAFIRLSIGSLGLRIPGTSLNSTKPGVNAFSSPCSCEIRLRGFPMQTSSIPLVPSPEAIPDSH IASSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRIPGTSLNSTKPGVNAFSSPCSCEIRLRGFPMQTSSIPLVPSPEAIPDSHVIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFQDVTRSSPQIVQLQGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFQDVTRSSPQ+VQL+GSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFQDVTRSSPQIVQLQGSIKQPIFSCKFS

Query:  RDRVSQADSLSNYWSGLGDGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
        RDRVSQADSLSNYWSGLGDGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
Subjt:  RDRVSQADSLSNYWSGLGDGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD

Query:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGRSSKPTVQLAMRHVTCIEDAAIF
        TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGRSSKPTVQLAMRHVTCIEDAAIF
Subjt:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGRSSKPTVQLAMRHVTCIEDAAIF

Query:  MALAAAVDLSIEACRPFRRKIRRTPRHS
        MALAAAVDLSIEACRPFRRKIRRTPRHS
Subjt:  MALAAAVDLSIEACRPFRRKIRRTPRHS

XP_008460086.1 PREDICTED: uncharacterized protein LOC103499002 [Cucumis melo]3.9e-253100Show/hide
Query:  MDPQAFIRLSIGSLGLRIPGTSLNSTKPGVNAFSSPCSCEIRLRGFPMQTSSIPLVPSPEAIPDSHVIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        MDPQAFIRLSIGSLGLRIPGTSLNSTKPGVNAFSSPCSCEIRLRGFPMQTSSIPLVPSPEAIPDSHVIASSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRIPGTSLNSTKPGVNAFSSPCSCEIRLRGFPMQTSSIPLVPSPEAIPDSHVIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFQDVTRSSPQIVQLQGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFQDVTRSSPQIVQLQGSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFQDVTRSSPQIVQLQGSIKQPIFSCKFS

Query:  RDRVSQADSLSNYWSGLGDGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
        RDRVSQADSLSNYWSGLGDGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
Subjt:  RDRVSQADSLSNYWSGLGDGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD

Query:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGRSSKPTVQLAMRHVTCIEDAAIF
        TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGRSSKPTVQLAMRHVTCIEDAAIF
Subjt:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGRSSKPTVQLAMRHVTCIEDAAIF

Query:  MALAAAVDLSIEACRPFRRKIRRTPRHS
        MALAAAVDLSIEACRPFRRKIRRTPRHS
Subjt:  MALAAAVDLSIEACRPFRRKIRRTPRHS

XP_038874344.1 uncharacterized protein LOC120067041 [Benincasa hispida]1.4e-24797.66Show/hide
Query:  MDPQAFIRLSIGSLGLRIPGTSLNSTKPGVNAFSSPCSCEIRLRGFPMQTSSIPLVPSPEAIPDSHVIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        MDPQAFIRLSIGSLGLRIPGTSLNSTKPGVNAFSSPCSCEIRLRGFP+QTSSIPLVPSPEAIPDSH I SSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRIPGTSLNSTKPGVNAFSSPCSCEIRLRGFPMQTSSIPLVPSPEAIPDSHVIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFQDVTRSSPQIVQLQGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDV PEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQF+DVTR SPQIVQLQGSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFQDVTRSSPQIVQLQGSIKQPIFSCKFS

Query:  RDRVSQADSLSNYWSGLGDGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
        RDRVSQADSLSNYWSG GDGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
Subjt:  RDRVSQADSLSNYWSGLGDGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD

Query:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGRSSKPTVQLAMRHVTCIEDAAIF
        TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEG+SSKP VQLAMRHVTCIEDAAIF
Subjt:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGRSSKPTVQLAMRHVTCIEDAAIF

Query:  MALAAAVDLSIEACRPFRRKIRRTPRHS
        MALAAAVDLSIEACRPFRRKIRR PRHS
Subjt:  MALAAAVDLSIEACRPFRRKIRRTPRHS

TrEMBL top hitse value%identityAlignment
A0A0A0K922 Uncharacterized protein4.6e-25299.3Show/hide
Query:  MDPQAFIRLSIGSLGLRIPGTSLNSTKPGVNAFSSPCSCEIRLRGFPMQTSSIPLVPSPEAIPDSHVIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        MDPQAFIRLSIGSLGLRIPGTSLNSTKPGVNAFSSPCSCEIRLRGFPMQTSSIPLVPSPEAIPDSH IASSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRIPGTSLNSTKPGVNAFSSPCSCEIRLRGFPMQTSSIPLVPSPEAIPDSHVIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFQDVTRSSPQIVQLQGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFQDVTRSSPQ+VQL+GSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFQDVTRSSPQIVQLQGSIKQPIFSCKFS

Query:  RDRVSQADSLSNYWSGLGDGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
        RDRVSQADSLSNYWSGLGDGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
Subjt:  RDRVSQADSLSNYWSGLGDGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD

Query:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGRSSKPTVQLAMRHVTCIEDAAIF
        TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGRSSKPTVQLAMRHVTCIEDAAIF
Subjt:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGRSSKPTVQLAMRHVTCIEDAAIF

Query:  MALAAAVDLSIEACRPFRRKIRRTPRHS
        MALAAAVDLSIEACRPFRRKIRRTPRHS
Subjt:  MALAAAVDLSIEACRPFRRKIRRTPRHS

A0A1S3CCZ5 uncharacterized protein LOC1034990021.9e-253100Show/hide
Query:  MDPQAFIRLSIGSLGLRIPGTSLNSTKPGVNAFSSPCSCEIRLRGFPMQTSSIPLVPSPEAIPDSHVIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        MDPQAFIRLSIGSLGLRIPGTSLNSTKPGVNAFSSPCSCEIRLRGFPMQTSSIPLVPSPEAIPDSHVIASSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRIPGTSLNSTKPGVNAFSSPCSCEIRLRGFPMQTSSIPLVPSPEAIPDSHVIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFQDVTRSSPQIVQLQGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFQDVTRSSPQIVQLQGSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFQDVTRSSPQIVQLQGSIKQPIFSCKFS

Query:  RDRVSQADSLSNYWSGLGDGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
        RDRVSQADSLSNYWSGLGDGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
Subjt:  RDRVSQADSLSNYWSGLGDGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD

Query:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGRSSKPTVQLAMRHVTCIEDAAIF
        TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGRSSKPTVQLAMRHVTCIEDAAIF
Subjt:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGRSSKPTVQLAMRHVTCIEDAAIF

Query:  MALAAAVDLSIEACRPFRRKIRRTPRHS
        MALAAAVDLSIEACRPFRRKIRRTPRHS
Subjt:  MALAAAVDLSIEACRPFRRKIRRTPRHS

A0A5A7TE92 Nuclear factor 1 A-type isoform 24.2e-25399.77Show/hide
Query:  MDPQAFIRLSIGSLGLRIPGTSLNSTKPGVNAFSSPCSCEIRLRGFPMQTSSIPLVPSPEAIPDSHVIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        MDPQAFIRLSIGSLGLRIPGTSLNSTKPGVNAFSSPCSCEIRLRGFPMQTSSIPL+PSPEAIPDSHVIASSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRIPGTSLNSTKPGVNAFSSPCSCEIRLRGFPMQTSSIPLVPSPEAIPDSHVIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFQDVTRSSPQIVQLQGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFQDVTRSSPQIVQLQGSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFQDVTRSSPQIVQLQGSIKQPIFSCKFS

Query:  RDRVSQADSLSNYWSGLGDGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
        RDRVSQADSLSNYWSGLGDGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
Subjt:  RDRVSQADSLSNYWSGLGDGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD

Query:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGRSSKPTVQLAMRHVTCIEDAAIF
        TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGRSSKPTVQLAMRHVTCIEDAAIF
Subjt:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGRSSKPTVQLAMRHVTCIEDAAIF

Query:  MALAAAVDLSIEACRPFRRKIRRTPRHS
        MALAAAVDLSIEACRPFRRKIRRTPRHS
Subjt:  MALAAAVDLSIEACRPFRRKIRRTPRHS

A0A6J1DQ78 uncharacterized protein LOC1110228011.3e-24194.16Show/hide
Query:  MDPQAFIRLSIGSLGLRIPGTSLNSTKPGVNAFSSPCSCEIRLRGFPMQTSSIPLVPSPEAIPDSHVIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        M+PQAFIRLSIGSLGLR+PG SLNSTKPG+N+FSSPCSCEIRLRGFP+Q+SSIP++PSPEAIP+SH IASSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRIPGTSLNSTKPGVNAFSSPCSCEIRLRGFPMQTSSIPLVPSPEAIPDSHVIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFQDVTRSSPQIVQLQGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDV PEWGDGKPVILFNGWIGIGKSKNENGRHGAELHL+VKLDPDPRYVFQF+DVTR SPQIVQLQGSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFQDVTRSSPQIVQLQGSIKQPIFSCKFS

Query:  RDRVSQADSLSNYWSGLGDGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
        RDRVSQADSLSNYWSG  DG DLE ERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARS+PGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
Subjt:  RDRVSQADSLSNYWSGLGDGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD

Query:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGRSSKPTVQLAMRHVTCIEDAAIF
        TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAA SPIPSPQSSGDFAALGQVVGGFVMSCRVQGEG+SSKP VQLAMRHVTC+EDAAIF
Subjt:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGRSSKPTVQLAMRHVTCIEDAAIF

Query:  MALAAAVDLSIEACRPFRRKIRRTPRHS
        MALAAAVDLSIEACRPFRRKIRR PRHS
Subjt:  MALAAAVDLSIEACRPFRRKIRRTPRHS

A0A6J1HM61 uncharacterized protein LOC1114642381.4e-24094.16Show/hide
Query:  MDPQAFIRLSIGSLGLRIPGTSLNSTKPGVNAFSSPCSCEIRLRGFPMQTSSIPLVPSPEAIPDSHVIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        MDPQAFIRLSIGSLGLRIPGT++NSTKPGV+A SSPCSCEIRLRGFP+QTSSIP+V SPEAIPDSH IASSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRIPGTSLNSTKPGVNAFSSPCSCEIRLRGFPMQTSSIPLVPSPEAIPDSHVIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFQDVTRSSPQIVQLQGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDV PEWGDGKPV+LFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQF+DVTR SPQIVQL+GSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFQDVTRSSPQIVQLQGSIKQPIFSCKFS

Query:  RDRVSQADSLSNYWSGLGDGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
        RDRVSQADSLSNYWSG GDGSD+E ERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDV IP+SWQPWGKLEAWRERGIRD
Subjt:  RDRVSQADSLSNYWSGLGDGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD

Query:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGRSSKPTVQLAMRHVTCIEDAAIF
        TVCCRFHLLSE QEGGELLMSEIHINAEKGGEFFIDTDKQLRAA SPIPSPQSSGDFAALGQV+GGFVMSCRVQGEG+SSKP VQLAMRHVTCIEDAAIF
Subjt:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGRSSKPTVQLAMRHVTCIEDAAIF

Query:  MALAAAVDLSIEACRPFRRKIRRTPRHS
        MA+AAAVDLSIEACRPFRRKIRR PRHS
Subjt:  MALAAAVDLSIEACRPFRRKIRRTPRHS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G10020.1 Protein of unknown function (DUF1005)1.4e-9441.08Show/hide
Query:  MDPQAFIRLSIGSLGLRIPGTSLNSTKPGVNAFSSPCSCEIRLRGFPMQTSSIPLVP-SPEAIPDSHVIASSFYLEESDLKALLAPGCFYNTHACLEISV
        MDP  FIRL+IG+L L++P  +  +T   V+  SSPC C+I+L+ FP QT++IP +P      P+   +A++F+L  SD++  LA    + +  CL+I +
Subjt:  MDPQAFIRLSIGSLGLRIPGTSLNSTKPGVNAFSSPCSCEIRLRGFPMQTSSIPLVP-SPEAIPDSHVIASSFYLEESDLKALLAPGCFYNTHACLEISV

Query:  FSGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFQDVTRSSPQIVQLQGSIKQPIFSCKF
        ++GR G+ CGV   R L+    + +       KP +  NGWI +GK   ++    A+ HL VK +PDPR+VFQF      SPQ+VQ+QG+I+QP+F+CKF
Subjt:  FSGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFQDVTRSSPQIVQLQGSIKQPIFSCKF

Query:  S----RDRVSQADSL------SNYWSGLGDGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGK
        S     DR  ++ SL      S  W     GS+ E   +ERKGW + +HDLSGS VA A I TPFV S G D V+RSNPGSWLI+RP  C   +W+PWG+
Subjt:  S----RDRVSQADSL------SNYWSGLGDGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGK

Query:  LEAWRER-GIRDTVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAT----------------------SPIPSPQ-SSGDFAA---LGQ
        LEAWRER G  D +  RF L+ +   G  ++++E  I++ +GG+F I+      +++                      SP  SP+  SGD+        
Subjt:  LEAWRER-GIRDTVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAT----------------------SPIPSPQ-SSGDFAA---LGQ

Query:  VVGGFVMSCRVQGEGRSSKPTVQLAMRHVTCIEDAAIFMALAAAVDLSIEACRPFRRKIRRTPRH
        V  GFVMS  V+GEG+ SKP V+++++HV+C+EDAA ++AL+AA+DLS++ACR F +++R+   H
Subjt:  VVGGFVMSCRVQGEGRSSKPTVQLAMRHVTCIEDAAIFMALAAAVDLSIEACRPFRRKIRRTPRH

AT1G50040.1 Protein of unknown function (DUF1005)4.5e-7436.6Show/hide
Query:  MDPQAFIRLSIGSLGLRIP------GTSLNSTKPGVNAFSS-PCSCEIRLRGFPMQTSSIPLVPSPEAIPDSH-------VIASSFYLEESDLKALLAPG
        MDP +F+R+ +G+L +R P       +S +S+ P V+  SS  C C+I+ + FP Q  S+P++   E+  +S         +A+ F L +S ++  L   
Subjt:  MDPQAFIRLSIGSLGLRIP------GTSLNSTKPGVNAFSS-PCSCEIRLRGFPMQTSSIPLVPSPEAIPDSH-------VIASSFYLEESDLKALLAPG

Query:  CFYNTHACLEISVFSGRKGSHCG--VGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKSKNENGRHGA--ELHLRVKLDPDPRYVFQFQDVTRSSPQ
         +    + L + V+S R+ + CG       +LIG F++ +  +  + K  +  NGW+ +G     N + G+  ELH+ V+++PD R+VFQF      SPQ
Subjt:  CFYNTHACLEISVFSGRKGSHCG--VGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKSKNENGRHGA--ELHLRVKLDPDPRYVFQFQDVTRSSPQ

Query:  IVQLQGSIKQPIFSCKF----SRDRVSQADSLSNYWSGLGDGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDV
        + Q+QG+ KQ +F+CKF    S DR + + SLS+  SG       E   +ERKGW + IHDLSGS VA A + TPFVPS G + V+RS+PG+WLI+RPD 
Subjt:  IVQLQGSIKQPIFSCKF----SRDRVSQADSLSNYWSGLGDGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDV

Query:  CIPESWQPWGKLEAWRERGIRDTVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVG-------------
            +W+PW +L+AWRE G+ D +  RF L  +       + +   I+ + GG F ID        T+   S + S D ++   +               
Subjt:  CIPESWQPWGKLEAWRERGIRDTVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVG-------------

Query:  ---------GFVMSCRVQGEGRSSKPTVQLAMRHVTCIEDAAIFMALAAAVDLSIEACRPFRRKIRRTPR
                 GFVMS RVQG  + SKP V++ ++HVTC EDAA  +ALAAAVDLS++ACR F +K+R   R
Subjt:  ---------GFVMSCRVQGEGRSSKPTVQLAMRHVTCIEDAAIFMALAAAVDLSIEACRPFRRKIRRTPR

AT3G19680.1 Protein of unknown function (DUF1005)3.3e-7736.16Show/hide
Query:  MDPQAFIRLSIGSLGLRIPGTSLNSTK------PGVNAFSSPCSCEIRLRGFPMQTSSIPLVPSPEAIPDSH--------VIASSFYLEESDLKALLAPG
        MDP +F+R+ +G+L +R P +S +S+        G+N  +  C C+IR + FP +  S+P++   E+  ++          +A+ F L ++ ++A L   
Subjt:  MDPQAFIRLSIGSLGLRIPGTSLNSTK------PGVNAFSSPCSCEIRLRGFPMQTSSIPLVPSPEAIPDSH--------VIASSFYLEESDLKALLAPG

Query:  CFYNTHACLEISVFS--------GRKGSHCGVGIK-RQLIGTFKLDVSPEWGDGKPVILFNGWIGI--GKSKNENGRHGAELHLRVKLDPDPRYVFQFQD
         F    + L +  +S        G  G+ CG+     +L+G F++ +  +  + K  +  NGW+ +   K+K++ G    ELH+ V+++PDPR+VFQF  
Subjt:  CFYNTHACLEISVFS--------GRKGSHCGVGIK-RQLIGTFKLDVSPEWGDGKPVILFNGWIGI--GKSKNENGRHGAELHLRVKLDPDPRYVFQFQD

Query:  VTRSSPQIVQLQGSIKQPIFSCKF------SRDR-----VSQADSLSNYWSGLGD-GSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWV
            SPQ+ Q+QG+ KQ +F+CKF      S DR      S    +S+  S +    S+ E   +ERKGW + +HDLSGS VA A + TPFVPS G + V
Subjt:  VTRSSPQIVQLQGSIKQPIFSCKF------SRDR-----VSQADSLSNYWSGLGD-GSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWV

Query:  ARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRDTVCCRFHLLSEAQEG-GELLMSEIHINAEKGGEFFID-TDKQLRAATSPIPSPQSSGDFAALG
         RS+PG+WLI+RPD C   +W+PWG+LEAWRE G  DT+  RF L    Q+G    + +   I+ + GG F ID T      A++P  SPQ S D  + G
Subjt:  ARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRDTVCCRFHLLSEAQEG-GELLMSEIHINAEKGGEFFID-TDKQLRAATSPIPSPQSSGDFAALG

Query:  QVVG------------------------------GFVMSCRVQGEGRSSKPTVQLAMRHVTCIEDAAIFMALAAAVDLSIEACRPFRRKIRRTPR
           G                              GFVMS  V+G G+ SKP V++ + HVTC EDAA  +ALAAAVDLS++ACR F  K+R+  R
Subjt:  QVVG------------------------------GFVMSCRVQGEGRSSKPTVQLAMRHVTCIEDAAIFMALAAAVDLSIEACRPFRRKIRRTPR

AT4G29310.1 Protein of unknown function (DUF1005)2.8e-8440.55Show/hide
Query:  MDPQAFIRLSIGSLGLRIPGTSLNSTKPG-VNAFSSPCSCEIRLRGFPMQTSSIPLVPSPEAI--PDSHVIASSFYLEESDLKALLAPGCFYNTHACLEI
        MDP  F+RL+I SL LR+P T+ N    G V+  S+PC C++R++ FP Q + +PL    +A   P+S   A  F+L+   ++ +            L +
Subjt:  MDPQAFIRLSIGSLGLRIPGTSLNSTKPG-VNAFSSPCSCEIRLRGFPMQTSSIPLVPSPEAI--PDSHVIASSFYLEESDLKALLAPGCFYNTHACLEI

Query:  SVFSGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFQDVTRSSPQIVQLQGSIKQPIFSC
        SV++GR G  CGV    +L+G  ++ V       + V   NGW  +G    +  +  A LHL V  +PDPR+VFQF      SP + Q+Q ++KQP+FSC
Subjt:  SVFSGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFQDVTRSSPQIVQLQGSIKQPIFSC

Query:  KFSRDRVSQADSLSN--YWSGLG------DGSDLEVER-RERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWG
        KFS DR  ++ SL +   +S  G       G   E ++ RERKGW + IHDLSGS VAAA + TPFV S G D V+RSNPG+WLI+RP      SW+PWG
Subjt:  KFSRDRVSQADSLSN--YWSGLG------DGSDLEVER-RERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWG

Query:  KLEAWRERGIRDTVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGRSSKPTVQLAM
        +LEAWRERG  D +  +F L+ +      + ++E  ++ ++GG+F ID                  G+  A+   V GFVM   V+GEG+ SKP V +  
Subjt:  KLEAWRERGIRDTVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGRSSKPTVQLAM

Query:  RHVTCIEDAAIFMALAAAVDLSIEACRPFRRKIRRTPRH
        +HVTC+ DAA+F+AL+AAVDLS++AC+ F RK+R+   H
Subjt:  RHVTCIEDAAIFMALAAAVDLSIEACRPFRRKIRRTPRH

AT5G17640.1 Protein of unknown function (DUF1005)7.0e-19275.93Show/hide
Query:  MDPQAFIRLSIGSLGLRIPGTSLNSTKPG--VNAFSSPCSCEIRLRGFPMQTSSIPLVPSPEAIPDSHVIASSFYLEESDLKALLAPGCFYNTHACLEIS
        MDPQAFIRLS+GSL LRIP   +NST        FSS CSCEI+LRGFP+QT+SIPL+PS +A PD H I++SFYLEESDL+ALL PGCFY+ HA LEIS
Subjt:  MDPQAFIRLSIGSLGLRIPGTSLNSTKPG--VNAFSSPCSCEIRLRGFPMQTSSIPLVPSPEAIPDSHVIASSFYLEESDLKALLAPGCFYNTHACLEIS

Query:  VFSGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFQDVTRSSPQIVQLQGSIKQPIFSCK
        VF+G+K  +CGVG KRQ IG FKL+V PEWG+GKP+ILFNGWI IGK+K +     AELHL+VKLDPDPRYVFQF+DVT  SPQIVQL+GS+KQPIFSCK
Subjt:  VFSGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFQDVTRSSPQIVQLQGSIKQPIFSCK

Query:  FSRDRVSQADSLSNYWSGLGDGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGI
        FSRDRVSQ D L+ YWS  GDG++LE ERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVA+SNPG+WL+VRPD   P SWQPWGKLEAWRERGI
Subjt:  FSRDRVSQADSLSNYWSGLGDGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGI

Query:  RDTVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQ-LRAATSPIPSPQSSGDFAALGQVV--GGFVMSCRVQGEGRSSKPTVQLAMRHVTCIE
        RD+VCCRFHLLS   E G++LMSEI I+AEKGGEF IDTDKQ L  A +PIPSPQSSGDF+ LGQ V  GGFVMS RVQGEG+SSKP VQLAMRHVTC+E
Subjt:  RDTVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQ-LRAATSPIPSPQSSGDFAALGQVV--GGFVMSCRVQGEGRSSKPTVQLAMRHVTCIE

Query:  DAAIFMALAAAVDLSIEACRPFRRKIRRTPRH
        DAAIFMALAAAVDLSI AC+PFRR  RR  RH
Subjt:  DAAIFMALAAAVDLSIEACRPFRRKIRRTPRH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATCCTCAGGCGTTTATTAGGTTGTCAATTGGATCATTGGGATTGAGAATCCCAGGAACTTCTCTAAACTCTACAAAACCTGGAGTCAATGCTTTCTCTTCTCCGTG
TTCGTGTGAAATTCGTCTTCGGGGTTTCCCTATGCAGACATCTTCAATCCCTCTAGTCCCGTCTCCTGAAGCAATACCTGACTCTCATGTCATTGCCTCAAGCTTCTATC
TTGAAGAGTCCGATCTGAAAGCATTACTGGCACCTGGCTGCTTCTACAACACCCATGCCTGTCTTGAAATATCTGTCTTCTCGGGAAGGAAGGGATCCCACTGTGGTGTT
GGGATCAAAAGGCAGCTGATCGGGACGTTTAAACTGGACGTCAGTCCTGAGTGGGGTGATGGGAAGCCAGTCATTCTTTTTAATGGGTGGATAGGCATTGGCAAAAGCAA
GAATGAGAATGGAAGACATGGAGCAGAGCTTCATTTGAGAGTTAAACTAGATCCTGATCCAAGATATGTTTTTCAGTTCCAAGATGTTACTAGATCAAGCCCCCAAATCG
TCCAGCTTCAAGGCTCAATCAAGCAGCCGATCTTCAGTTGCAAATTTAGTCGAGACAGGGTATCCCAGGCGGATTCCTTGAGCAACTATTGGTCAGGTCTTGGTGATGGC
TCGGATCTTGAGGTCGAGCGAAGAGAAAGAAAAGGCTGGAAGGTGAAGATACATGATCTTTCTGGCTCGGCTGTTGCAGCAGCCTTCATAACTACTCCCTTTGTGCCATC
AACAGGTTGCGATTGGGTTGCCAGGTCAAACCCCGGGTCCTGGCTGATTGTTCGTCCTGATGTTTGCATACCTGAAAGTTGGCAGCCATGGGGAAAGCTCGAGGCGTGGC
GCGAGCGTGGGATTAGAGACACTGTCTGCTGTCGCTTTCACCTTCTCTCTGAAGCCCAAGAGGGAGGTGAGCTTCTCATGTCTGAGATCCATATCAATGCCGAGAAAGGC
GGGGAATTCTTTATAGACACCGACAAACAATTGCGAGCAGCAACAAGTCCAATACCGAGTCCACAGAGCAGTGGAGACTTTGCAGCATTAGGCCAAGTGGTTGGAGGCTT
TGTCATGAGTTGCAGAGTACAAGGGGAAGGAAGAAGCAGTAAGCCAACGGTTCAACTCGCAATGCGACACGTGACATGTATAGAAGATGCTGCTATTTTCATGGCGCTTG
CAGCTGCAGTTGATCTCAGCATTGAGGCATGTAGGCCGTTCCGAAGGAAGATTAGGAGAACGCCTCGGCATTCATAG
mRNA sequenceShow/hide mRNA sequence
TGATAAATGCAAGAGTGGCATTTGGGATGAGAACAGTTGAGAAGATGAAGTCAGGCACATGTATTATGGATAGCGATCCTAATCCTTTTTTTCTCTTTCTTTTTTCACAT
TTCTCAATCTTTCTCTCTTTTTTTAATTCCAATATTCTTTTGATTTTATTTATTTCTTCTTCATAATTTAATGATTTATGAAGGAAGAACATCAACAAACAAACAACAAC
AACAACAACAACAGCAATATGCTCTTCTCTGATAAGCCAAATTGGTTACAGCTGTTTCCACGATTTATCTCTCTTTCCATCGCTTTTTTTTTTTTTTTTTTCTTTCTTCT
TCCACTTTCCACTATTCCTCTCTTCTTTTCCAACTCAACAGTGACGTTCAAAACTCCAAAAATCAGAGTAAAAATTTTCCCTTTTCTTTATTGGGGTCTTTCCAAAAAAA
AAAATTTCACCTCAATTTTATCGCTTATTTCTCTCTGTTTTGTTCTTGTTTCTTCTGCAAATCAAAAGGGGTTGGTAGTATTACCCCGTACCCATCTCTTTCTTTATTCG
AATTAATTCAACTGTTTGGTTTTTTGAGAAAATTAAACAAAACCACCCTTTCTCTTCTTCTTCCCTTAAATTTCTCCGTCTATTTCGTGACAAATAATGCAGATTTCAGA
GAGGGAAAAAGGGTTTATTATAAATTTCTCTTCCCGATTTCAGTGAGAAAGAGAGGGAAATAGGGAGTTTCCTTTTCTGGGTTTGGTTTATTTCGGGGTTTGTGAGGCAA
TCTCGTGATCTGTGAAGTCTTCTTCTACGTGACATTCAAGACTTTTGCTGCTATGGGGATTGGGGTTTGTTCTTTTTTAATAGCTTCTTGTTAGATCTCTAAGAACATTC
ATCCTTTCACATTTAGAGAGAAATTTAGAGGGAAAGTGAGGGGGAAGTGGGATTATTTGAGTTTGGGGGAAATGTTGCTGTTTTTCTTCTTCAGGAATTTCGGCTGCTTC
GCCTTTTGTGGAGCATTGCAGTACTGTTTCTACCTGGCAACTAATCACAAGAAGTGAATAGGGAAGCAAAAAGAAAAAAAGTGGACCGATCAAATATCAATTCTTTAGGA
TTGAGACGATTGTCTATATAAGGACACTTGAGCTTGTCGAAATATTAATGATCTATCAAATGGACAAAAAATTGTGACCGGAATGGATCCTCAGGCGTTTATTAGGTTGT
CAATTGGATCATTGGGATTGAGAATCCCAGGAACTTCTCTAAACTCTACAAAACCTGGAGTCAATGCTTTCTCTTCTCCGTGTTCGTGTGAAATTCGTCTTCGGGGTTTC
CCTATGCAGACATCTTCAATCCCTCTAGTCCCGTCTCCTGAAGCAATACCTGACTCTCATGTCATTGCCTCAAGCTTCTATCTTGAAGAGTCCGATCTGAAAGCATTACT
GGCACCTGGCTGCTTCTACAACACCCATGCCTGTCTTGAAATATCTGTCTTCTCGGGAAGGAAGGGATCCCACTGTGGTGTTGGGATCAAAAGGCAGCTGATCGGGACGT
TTAAACTGGACGTCAGTCCTGAGTGGGGTGATGGGAAGCCAGTCATTCTTTTTAATGGGTGGATAGGCATTGGCAAAAGCAAGAATGAGAATGGAAGACATGGAGCAGAG
CTTCATTTGAGAGTTAAACTAGATCCTGATCCAAGATATGTTTTTCAGTTCCAAGATGTTACTAGATCAAGCCCCCAAATCGTCCAGCTTCAAGGCTCAATCAAGCAGCC
GATCTTCAGTTGCAAATTTAGTCGAGACAGGGTATCCCAGGCGGATTCCTTGAGCAACTATTGGTCAGGTCTTGGTGATGGCTCGGATCTTGAGGTCGAGCGAAGAGAAA
GAAAAGGCTGGAAGGTGAAGATACATGATCTTTCTGGCTCGGCTGTTGCAGCAGCCTTCATAACTACTCCCTTTGTGCCATCAACAGGTTGCGATTGGGTTGCCAGGTCA
AACCCCGGGTCCTGGCTGATTGTTCGTCCTGATGTTTGCATACCTGAAAGTTGGCAGCCATGGGGAAAGCTCGAGGCGTGGCGCGAGCGTGGGATTAGAGACACTGTCTG
CTGTCGCTTTCACCTTCTCTCTGAAGCCCAAGAGGGAGGTGAGCTTCTCATGTCTGAGATCCATATCAATGCCGAGAAAGGCGGGGAATTCTTTATAGACACCGACAAAC
AATTGCGAGCAGCAACAAGTCCAATACCGAGTCCACAGAGCAGTGGAGACTTTGCAGCATTAGGCCAAGTGGTTGGAGGCTTTGTCATGAGTTGCAGAGTACAAGGGGAA
GGAAGAAGCAGTAAGCCAACGGTTCAACTCGCAATGCGACACGTGACATGTATAGAAGATGCTGCTATTTTCATGGCGCTTGCAGCTGCAGTTGATCTCAGCATTGAGGC
ATGTAGGCCGTTCCGAAGGAAGATTAGGAGAACGCCTCGGCATTCATAGGGAGACAAGGGAAAGGAAAAGGAAAAAGCCAAAGAAAAAAAAACTGTTTAAGGTAAAATTG
ATGTGTAAGAAACATAAACAAAGATGGGGATACATTTGTATGAATTTATCTCTCAAGGCTATAAAATTTGGCATTTTGTACAGCAAAAGGTGATGTCTCTTGCATCTTAT
CTTACCTTCTAGGCTCCATCATATCCTTTGTTATCTTTATAGACCCAATTTAATTACTTATTCTTTTCAATTCCCAGAACATAAGATTATTTGGAAGGAAGGAAAGTTAA
AAGGAGAAAAAACGAAAGTAAAAAAAGAAGCAAAACGCCACTTTTTCATCTCTGTCGTCAATGGAAGTTCACTAAATATATATAAAAATAATAACAATAATAATTGTCGT
AAATGGCAAAATTCAGACAAGGAAACTCTCTCGACCTTCACGTATATTTGACTATGACCTATAAGCAAAAAGATAAAACACTGGTTTGGTGCTTGTCATTATCACTATGA
CCTATAAGTAAAAAGAGAACACAAGTGTAGTGTTTGTCATTATTTAGAAAAGGGTTAGAGAAAGTGTATGAAATATTTTATCCTGGGTGATGCCTACGACCTCTTATTGA
TAAAGGAGTGGATTTGGAATTCCGAATTCAAAAGAAATTAGGAAATACAAATTCAGGCCGATCAAGGAGTTTTGGATCGTAGCATATGAGAAGTAAGCCTTCTGGAATCG
GAAAGGCACACAGTGTGACCTCCGAAGCCAAGGATCAAAGCCTCCAAGGTCGAGGGGAGCAAGTTCAGCCCCTCCTTTCCCTAGTCCTTATTCCATCGGTCCCTTTCCTA
CATTCCTCTTGTACTTATTTGATCGACATCATTTCAATTATACCTTAAAACAATCTATCTCTTGAGGTCATGGTTCTTGATTCAAAAGTAACGGATAACAATTGAATTGT
ATTAACTTGTTTTAAAAGAAAGAAAAGAAAAGCTTCAAATATTTTTTTGTTGACGTAGTTATCAAATGAGTTCAACTCACCAAAAACCACCTAATCCTCATGGTCATATT
TCTTTTAAATTACATTAAGTTCTTGAATTCAACAAATTTCTTGAGTTTTGTTTGGTTATCTACTCTTATGTTCATGAGATTAGTTGTATTTGAACTATATTATTTTTAAT
TGACCATTTAGCCCCTATAATTTAAACATTGCTTTTAAAGTTCTTGGAAGATTTTGGAGTTATATATTATGTTCTAATT
Protein sequenceShow/hide protein sequence
MDPQAFIRLSIGSLGLRIPGTSLNSTKPGVNAFSSPCSCEIRLRGFPMQTSSIPLVPSPEAIPDSHVIASSFYLEESDLKALLAPGCFYNTHACLEISVFSGRKGSHCGV
GIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFQDVTRSSPQIVQLQGSIKQPIFSCKFSRDRVSQADSLSNYWSGLGDG
SDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRDTVCCRFHLLSEAQEGGELLMSEIHINAEKG
GEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGRSSKPTVQLAMRHVTCIEDAAIFMALAAAVDLSIEACRPFRRKIRRTPRHS