CuGenDBv2

Clc06G00650 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc06G00650
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: Nuclear factor 1 A-type isoform 2
Genome location: ClcChr06:573100..576295
RNA-Seq Expression: Clc06G00650
Synteny: Clc06G00650
Gene Ontology terms: NA
InterPro domains: IPR010410 - Protein of unknown function DUF1005


Homology
GenBank top hits (e value, % identity, alignment)
KAA0039981.1 Nuclear factor 1 A-type isoform 2 [Cucumis melo var. makuwa] (e value: 1.2e-245; % identity: 96.96)
Query:  MDPQAFIRLSIGSLGLRIPGAALNSTKPGVNAFSSPCSCEIRLRGFPVQTSSIPLVPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        MDPQAFIRLSIGSLGLRIPG +LNSTKPGVNAFSSPCSCEIRLRGFP+QTSSIPL+PSPEAIPDSH IASSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRIPGAALNSTKPGVNAFSSPCSCEIRLRGFPVQTSSIPLVPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQF+DVTR SPQIVQLQGSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS

Query:  RDRVSQADSLSNYWSGPGDFSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
        RDRVSQADSLSNYWSG GD SDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
Subjt:  RDRVSQADSLSNYWSGPGDFSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD

Query:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIF
        TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEG+SSKP VQLAMRHVTC+EDAAIF
Subjt:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIF

Query:  MALAAAVDLSIEACRPFRRKIRRAPRHS
        MALAAAVDLSIEACRPFRRKIRR PRHS
Subjt:  MALAAAVDLSIEACRPFRRKIRRAPRHS

XP_004153015.1 uncharacterized protein LOC101204096 [Cucumis sativus] (e value: 1.2e-245; % identity: 96.73)
Query:  MDPQAFIRLSIGSLGLRIPGAALNSTKPGVNAFSSPCSCEIRLRGFPVQTSSIPLVPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        MDPQAFIRLSIGSLGLRIPG +LNSTKPGVNAFSSPCSCEIRLRGFP+QTSSIPLVPSPEAIPDSH IASSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRIPGAALNSTKPGVNAFSSPCSCEIRLRGFPVQTSSIPLVPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQF+DVTR SPQ+VQL+GSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS

Query:  RDRVSQADSLSNYWSGPGDFSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
        RDRVSQADSLSNYWSG GD SDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
Subjt:  RDRVSQADSLSNYWSGPGDFSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD

Query:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIF
        TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEG+SSKP VQLAMRHVTC+EDAAIF
Subjt:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIF

Query:  MALAAAVDLSIEACRPFRRKIRRAPRHS
        MALAAAVDLSIEACRPFRRKIRR PRHS
Subjt:  MALAAAVDLSIEACRPFRRKIRRAPRHS

XP_008460086.1 PREDICTED: uncharacterized protein LOC103499002 [Cucumis melo] (e value: 5.3e-246; % identity: 97.2)
Query:  MDPQAFIRLSIGSLGLRIPGAALNSTKPGVNAFSSPCSCEIRLRGFPVQTSSIPLVPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        MDPQAFIRLSIGSLGLRIPG +LNSTKPGVNAFSSPCSCEIRLRGFP+QTSSIPLVPSPEAIPDSH IASSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRIPGAALNSTKPGVNAFSSPCSCEIRLRGFPVQTSSIPLVPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQF+DVTR SPQIVQLQGSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS

Query:  RDRVSQADSLSNYWSGPGDFSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
        RDRVSQADSLSNYWSG GD SDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
Subjt:  RDRVSQADSLSNYWSGPGDFSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD

Query:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIF
        TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEG+SSKP VQLAMRHVTC+EDAAIF
Subjt:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIF

Query:  MALAAAVDLSIEACRPFRRKIRRAPRHS
        MALAAAVDLSIEACRPFRRKIRR PRHS
Subjt:  MALAAAVDLSIEACRPFRRKIRRAPRHS

XP_022155757.1 uncharacterized protein LOC111022801 [Momordica charantia] (e value: 3.2e-243; % identity: 95.56)
Query:  MDPQAFIRLSIGSLGLRIPGAALNSTKPGVNAFSSPCSCEIRLRGFPVQTSSIPLVPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        M+PQAFIRLSIGSLGLR+PG +LNSTKPG+N+FSSPCSCEIRLRGFPVQ+SSIP++PSPEAIP+SHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRIPGAALNSTKPGVNAFSSPCSCEIRLRGFPVQTSSIPLVPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDV PEWGDGKPVILFNGWIGIGKSKNENGRHGAELHL+VKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS

Query:  RDRVSQADSLSNYWSGPGDFSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
        RDRVSQADSLSNYWSG  D  DLE ERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARS+PGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
Subjt:  RDRVSQADSLSNYWSGPGDFSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD

Query:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIF
        TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAA SPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIF
Subjt:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIF

Query:  MALAAAVDLSIEACRPFRRKIRRAPRHS
        MALAAAVDLSIEACRPFRRKIRRAPRHS
Subjt:  MALAAAVDLSIEACRPFRRKIRRAPRHS

XP_038874344.1 uncharacterized protein LOC120067041 [Benincasa hispida] (e value: 7.9e-250; % identity: 98.6)
Query:  MDPQAFIRLSIGSLGLRIPGAALNSTKPGVNAFSSPCSCEIRLRGFPVQTSSIPLVPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        MDPQAFIRLSIGSLGLRIPG +LNSTKPGVNAFSSPCSCEIRLRGFPVQTSSIPLVPSPEAIPDSHSI SSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRIPGAALNSTKPGVNAFSSPCSCEIRLRGFPVQTSSIPLVPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDV PEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS

Query:  RDRVSQADSLSNYWSGPGDFSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
        RDRVSQADSLSNYWSGPGD SDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
Subjt:  RDRVSQADSLSNYWSGPGDFSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD

Query:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIF
        TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTC+EDAAIF
Subjt:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIF

Query:  MALAAAVDLSIEACRPFRRKIRRAPRHS
        MALAAAVDLSIEACRPFRRKIRRAPRHS
Subjt:  MALAAAVDLSIEACRPFRRKIRRAPRHS

TrEMBL top hits (e value, % identity, alignment)
A0A0A0K922 Uncharacterized protein (e value: 5.7e-246; % identity: 96.73)
Query:  MDPQAFIRLSIGSLGLRIPGAALNSTKPGVNAFSSPCSCEIRLRGFPVQTSSIPLVPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        MDPQAFIRLSIGSLGLRIPG +LNSTKPGVNAFSSPCSCEIRLRGFP+QTSSIPLVPSPEAIPDSH IASSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRIPGAALNSTKPGVNAFSSPCSCEIRLRGFPVQTSSIPLVPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQF+DVTR SPQ+VQL+GSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS

Query:  RDRVSQADSLSNYWSGPGDFSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
        RDRVSQADSLSNYWSG GD SDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
Subjt:  RDRVSQADSLSNYWSGPGDFSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD

Query:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIF
        TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEG+SSKP VQLAMRHVTC+EDAAIF
Subjt:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIF

Query:  MALAAAVDLSIEACRPFRRKIRRAPRHS
        MALAAAVDLSIEACRPFRRKIRR PRHS
Subjt:  MALAAAVDLSIEACRPFRRKIRRAPRHS

A0A1S3CCZ5 uncharacterized protein LOC103499002 (e value: 2.6e-246; % identity: 97.2)
Query:  MDPQAFIRLSIGSLGLRIPGAALNSTKPGVNAFSSPCSCEIRLRGFPVQTSSIPLVPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        MDPQAFIRLSIGSLGLRIPG +LNSTKPGVNAFSSPCSCEIRLRGFP+QTSSIPLVPSPEAIPDSH IASSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRIPGAALNSTKPGVNAFSSPCSCEIRLRGFPVQTSSIPLVPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQF+DVTR SPQIVQLQGSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS

Query:  RDRVSQADSLSNYWSGPGDFSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
        RDRVSQADSLSNYWSG GD SDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
Subjt:  RDRVSQADSLSNYWSGPGDFSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD

Query:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIF
        TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEG+SSKP VQLAMRHVTC+EDAAIF
Subjt:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIF

Query:  MALAAAVDLSIEACRPFRRKIRRAPRHS
        MALAAAVDLSIEACRPFRRKIRR PRHS
Subjt:  MALAAAVDLSIEACRPFRRKIRRAPRHS

A0A5A7TE92 Nuclear factor 1 A-type isoform 2 (e value: 5.7e-246; % identity: 96.96)
Query:  MDPQAFIRLSIGSLGLRIPGAALNSTKPGVNAFSSPCSCEIRLRGFPVQTSSIPLVPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        MDPQAFIRLSIGSLGLRIPG +LNSTKPGVNAFSSPCSCEIRLRGFP+QTSSIPL+PSPEAIPDSH IASSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRIPGAALNSTKPGVNAFSSPCSCEIRLRGFPVQTSSIPLVPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQF+DVTR SPQIVQLQGSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS

Query:  RDRVSQADSLSNYWSGPGDFSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
        RDRVSQADSLSNYWSG GD SDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
Subjt:  RDRVSQADSLSNYWSGPGDFSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD

Query:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIF
        TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEG+SSKP VQLAMRHVTC+EDAAIF
Subjt:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIF

Query:  MALAAAVDLSIEACRPFRRKIRRAPRHS
        MALAAAVDLSIEACRPFRRKIRR PRHS
Subjt:  MALAAAVDLSIEACRPFRRKIRRAPRHS

A0A6J1DQ78 uncharacterized protein LOC111022801 (e value: 1.6e-243; % identity: 95.56)
Query:  MDPQAFIRLSIGSLGLRIPGAALNSTKPGVNAFSSPCSCEIRLRGFPVQTSSIPLVPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        M+PQAFIRLSIGSLGLR+PG +LNSTKPG+N+FSSPCSCEIRLRGFPVQ+SSIP++PSPEAIP+SHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRIPGAALNSTKPGVNAFSSPCSCEIRLRGFPVQTSSIPLVPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDV PEWGDGKPVILFNGWIGIGKSKNENGRHGAELHL+VKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS

Query:  RDRVSQADSLSNYWSGPGDFSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
        RDRVSQADSLSNYWSG  D  DLE ERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARS+PGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
Subjt:  RDRVSQADSLSNYWSGPGDFSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD

Query:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIF
        TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAA SPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIF
Subjt:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIF

Query:  MALAAAVDLSIEACRPFRRKIRRAPRHS
        MALAAAVDLSIEACRPFRRKIRRAPRHS
Subjt:  MALAAAVDLSIEACRPFRRKIRRAPRHS

A0A6J1HM61 uncharacterized protein LOC111464238 (e value: 1.7e-242; % identity: 95.33)
Query:  MDPQAFIRLSIGSLGLRIPGAALNSTKPGVNAFSSPCSCEIRLRGFPVQTSSIPLVPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        MDPQAFIRLSIGSLGLRIPG A+NSTKPGV+A SSPCSCEIRLRGFPVQTSSIP+V SPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRIPGAALNSTKPGVNAFSSPCSCEIRLRGFPVQTSSIPLVPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDV PEWGDGKPV+LFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQL+GSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS

Query:  RDRVSQADSLSNYWSGPGDFSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
        RDRVSQADSLSNYWSG GD SD+E ERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDV IP+SWQPWGKLEAWRERGIRD
Subjt:  RDRVSQADSLSNYWSGPGDFSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD

Query:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIF
        TVCCRFHLLSE QEGGELLMSEIHINAEKGGEFFIDTDKQLRAA SPIPSPQSSGDFAALGQV+GGFVMSCRVQGEGKSSKPMVQLAMRHVTC+EDAAIF
Subjt:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIF

Query:  MALAAAVDLSIEACRPFRRKIRRAPRHS
        MA+AAAVDLSIEACRPFRRKIRRAPRHS
Subjt:  MALAAAVDLSIEACRPFRRKIRRAPRHS

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G10020.1 Protein of unknown function (DUF1005) (e value: 1.2e-94; % identity: 41.38)
Query:  MDPQAFIRLSIGSLGLRIPGAALNSTKPGVNAFSSPCSCEIRLRGFPVQTSSIPLVP-SPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISV
        MDP  FIRL+IG+L L++P AA  +T   V+  SSPC C+I+L+ FP QT++IP +P      P+  ++A++F+L  SD++  LA    + +  CL+I +
Subjt:  MDPQAFIRLSIGSLGLRIPGAALNSTKPGVNAFSSPCSCEIRLRGFPVQTSSIPLVP-SPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISV

Query:  FSGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKF
        ++GR G+ CGV   R L+    + +       KP +  NGWI +GK   ++    A+ HL VK +PDPR+VFQF+     SPQ+VQ+QG+I+QP+F+CKF
Subjt:  FSGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKF

Query:  S----RDRVSQADSLSNYWSGPGDF-----SDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKL
        S     DR  ++ SL    S    +     S+ E   +ERKGW + +HDLSGS VA A I TPFV S G D V+RSNPGSWLI+RP  C   +W+PWG+L
Subjt:  S----RDRVSQADSLSNYWSGPGDF-----SDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKL

Query:  EAWRER-GIRDTVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAT----------------------SPIPSPQ-SSGDFAA---LGQV
        EAWRER G  D +  RF L+ +   G  ++++E  I++ +GG+F I+      +++                      SP  SP+  SGD+        V
Subjt:  EAWRER-GIRDTVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAT----------------------SPIPSPQ-SSGDFAA---LGQV

Query:  VGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIFMALAAAVDLSIEACRPFRRKIRRAPRH
          GFVMS  V+GEGK SKP V+++++HV+C+EDAA ++AL+AA+DLS++ACR F +++R+   H
Subjt:  VGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIFMALAAAVDLSIEACRPFRRKIRRAPRH

AT1G50040.1 Protein of unknown function (DUF1005) (e value: 8.0e-75; % identity: 36.81)
Query:  MDPQAFIRLSIGSLGLRIP------GAALNSTKPGVNAFSS-PCSCEIRLRGFPVQTSSIPLVPSPEAIPDSH-------SIASSFYLEESDLKALLAPG
        MDP +F+R+ +G+L +R P       ++ +S+ P V+  SS  C C+I+ + FP Q  S+P++   E+  +S        ++A+ F L +S ++  L   
Subjt:  MDPQAFIRLSIGSLGLRIP------GAALNSTKPGVNAFSS-PCSCEIRLRGFPVQTSSIPLVPSPEAIPDSH-------SIASSFYLEESDLKALLAPG

Query:  CFYNTHACLEISVFSGRKGSHCG--VGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKSKNENGRHGA--ELHLRVKLDPDPRYVFQFEDVTRLSPQ
         +    + L + V+S R+ + CG       +LIG F++ +  +  + K  +  NGW+ +G     N + G+  ELH+ V+++PD R+VFQF+     SPQ
Subjt:  CFYNTHACLEISVFSGRKGSHCG--VGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKSKNENGRHGA--ELHLRVKLDPDPRYVFQFEDVTRLSPQ

Query:  IVQLQGSIKQPIFSCKF----SRDRVSQADSLSNYWSGPGDFSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDV
        + Q+QG+ KQ +F+CKF    S DR + + SLS+  SG   FS      +ERKGW + IHDLSGS VA A + TPFVPS G + V+RS+PG+WLI+RPD 
Subjt:  IVQLQGSIKQPIFSCKF----SRDRVSQADSLSNYWSGPGDFSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDV

Query:  CIPESWQPWGKLEAWRERGIRDTVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVG-------------
            +W+PW +L+AWRE G+ D +  RF L  +       + +   I+ + GG F ID        T+   S + S D ++   +               
Subjt:  CIPESWQPWGKLEAWRERGIRDTVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVG-------------

Query:  ---------GFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIFMALAAAVDLSIEACRPFRRKIRRAPR
                 GFVMS RVQG  K SKP V++ ++HVTC EDAA  +ALAAAVDLS++ACR F +K+R   R
Subjt:  ---------GFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIFMALAAAVDLSIEACRPFRRKIRRAPR

AT3G19680.1 Protein of unknown function (DUF1005) (e value: 5.0e-77; % identity: 35.56)
Query:  MDPQAFIRLSIGSLGLRIPGAALNSTK------PGVNAFSSPCSCEIRLRGFPVQTSSIPLVPSPEAIPDSH--------SIASSFYLEESDLKALLAPG
        MDP +F+R+ +G+L +R P ++ +S+        G+N  +  C C+IR + FP +  S+P++   E+  ++         ++A+ F L ++ ++A L   
Subjt:  MDPQAFIRLSIGSLGLRIPGAALNSTK------PGVNAFSSPCSCEIRLRGFPVQTSSIPLVPSPEAIPDSH--------SIASSFYLEESDLKALLAPG

Query:  CFYNTHACLEISVFS--------GRKGSHCGVGIK-RQLIGTFKLDVSPEWGDGKPVILFNGWIGI--GKSKNENGRHGAELHLRVKLDPDPRYVFQFED
         F    + L +  +S        G  G+ CG+     +L+G F++ +  +  + K  +  NGW+ +   K+K++ G    ELH+ V+++PDPR+VFQF+ 
Subjt:  CFYNTHACLEISVFS--------GRKGSHCGVGIK-RQLIGTFKLDVSPEWGDGKPVILFNGWIGI--GKSKNENGRHGAELHLRVKLDPDPRYVFQFED

Query:  VTRLSPQIVQLQGSIKQPIFSCKF--------SRDRVSQADSLSNYWSGPGDFSDLEVER----RERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWV
            SPQ+ Q+QG+ KQ +F+CKF         R+ +  +  +S   S     S +  E+    +ERKGW + +HDLSGS VA A + TPFVPS G + V
Subjt:  VTRLSPQIVQLQGSIKQPIFSCKF--------SRDRVSQADSLSNYWSGPGDFSDLEVER----RERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWV

Query:  ARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRDTVCCRFHLLSEAQEG-GELLMSEIHINAEKGGEFFID-TDKQLRAATSPIPSPQSSGDFAALG
         RS+PG+WLI+RPD C   +W+PWG+LEAWRE G  DT+  RF L    Q+G    + +   I+ + GG F ID T      A++P  SPQ S D  + G
Subjt:  ARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRDTVCCRFHLLSEAQEG-GELLMSEIHINAEKGGEFFID-TDKQLRAATSPIPSPQSSGDFAALG

Query:  QVVG------------------------------GFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIFMALAAAVDLSIEACRPFRRKIRRAPR
           G                              GFVMS  V+G GK SKP V++ + HVTC EDAA  +ALAAAVDLS++ACR F  K+R+  R
Subjt:  QVVG------------------------------GFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIFMALAAAVDLSIEACRPFRRKIRRAPR

AT4G29310.1 Protein of unknown function (DUF1005) (e value: 1.2e-83; % identity: 40.68)
Query:  MDPQAFIRLSIGSLGLRIPGAALNSTKPG-VNAFSSPCSCEIRLRGFPVQTSSIPLVPSPEAI--PDSHSIASSFYLEESDLKALLAPGCFYNTHACLEI
        MDP  F+RL+I SL LR+P  A N    G V+  S+PC C++R++ FP Q + +PL    +A   P+S + A  F+L+   ++ +            L +
Subjt:  MDPQAFIRLSIGSLGLRIPGAALNSTKPG-VNAFSSPCSCEIRLRGFPVQTSSIPLVPSPEAI--PDSHSIASSFYLEESDLKALLAPGCFYNTHACLEI

Query:  SVFSGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSC
        SV++GR G  CGV    +L+G  ++ V       + V   NGW  +G    +  +  A LHL V  +PDPR+VFQF      SP + Q+Q ++KQP+FSC
Subjt:  SVFSGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSC

Query:  KFSRDRVSQADSL-------SNYW---SGPGDFSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPW
        KFS DR  ++ SL       S  W   +  GD  + + + RERKGW + IHDLSGS VAAA + TPFV S G D V+RSNPG+WLI+RP      SW+PW
Subjt:  KFSRDRVSQADSL-------SNYW---SGPGDFSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPW

Query:  GKLEAWRERGIRDTVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLA
        G+LEAWRERG  D +  +F L+ +      + ++E  ++ ++GG+F ID                  G+  A+   V GFVM   V+GEGK SKP+V + 
Subjt:  GKLEAWRERGIRDTVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLA

Query:  MRHVTCVEDAAIFMALAAAVDLSIEACRPFRRKIRRAPRH
         +HVTC+ DAA+F+AL+AAVDLS++AC+ F RK+R+   H
Subjt:  MRHVTCVEDAAIFMALAAAVDLSIEACRPFRRKIRRAPRH

AT5G17640.1 Protein of unknown function (DUF1005) (e value: 2.5e-193; % identity: 77.08)
Query:  MDPQAFIRLSIGSLGLRIPGAALNSTKPG--VNAFSSPCSCEIRLRGFPVQTSSIPLVPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEIS
        MDPQAFIRLS+GSL LRIP   +NST        FSS CSCEI+LRGFPVQT+SIPL+PS +A PD HSI++SFYLEESDL+ALL PGCFY+ HA LEIS
Subjt:  MDPQAFIRLSIGSLGLRIPGAALNSTKPG--VNAFSSPCSCEIRLRGFPVQTSSIPLVPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEIS

Query:  VFSGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCK
        VF+G+K  +CGVG KRQ IG FKL+V PEWG+GKP+ILFNGWI IGK+K +     AELHL+VKLDPDPRYVFQFEDVT LSPQIVQL+GS+KQPIFSCK
Subjt:  VFSGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCK

Query:  FSRDRVSQADSLSNYWSGPGDFSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGI
        FSRDRVSQ D L+ YWS  GD ++LE ERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVA+SNPG+WL+VRPD   P SWQPWGKLEAWRERGI
Subjt:  FSRDRVSQADSLSNYWSGPGDFSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGI

Query:  RDTVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQ-LRAATSPIPSPQSSGDFAALGQVV--GGFVMSCRVQGEGKSSKPMVQLAMRHVTCVE
        RD+VCCRFHLLS   E G++LMSEI I+AEKGGEF IDTDKQ L  A +PIPSPQSSGDF+ LGQ V  GGFVMS RVQGEGKSSKP+VQLAMRHVTCVE
Subjt:  RDTVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQ-LRAATSPIPSPQSSGDFAALGQVV--GGFVMSCRVQGEGKSSKPMVQLAMRHVTCVE

Query:  DAAIFMALAAAVDLSIEACRPFRRKIRRAPRH
        DAAIFMALAAAVDLSI AC+PFRR  RR  RH
Subjt:  DAAIFMALAAAVDLSIEACRPFRRKIRRAPRH


Sequences
CDS sequence
ATGTTGCTGTTGTTCTTCTTCTTCAGGAATTTTGGCTGCTTCGCTTTCTGTGGAGCACTGCAGTACTGTTTCTACTTGGCAACTAATCACAAGTTATGTCGTGATTCAAA
CATAACAAAAACGGGGTTTGGATACTTGAGCTCGTCGAAATATAAACGATCTATATCAAATGGACAAAAGATTGTGACCAAAATGGATCCTCAGGCATTTATTAGGTTGT
CAATTGGATCGCTGGGATTGAGAATCCCAGGGGCTGCTCTAAACTCTACAAAACCTGGAGTCAATGCTTTCTCTTCTCCATGTTCATGTGAAATTCGTCTTCGGGGTTTC
CCTGTGCAGACGTCTTCAATCCCACTAGTCCCGTCTCCTGAAGCAATACCTGACTCTCATAGCATTGCCTCAAGCTTCTATCTTGAAGAGTCTGATCTGAAAGCATTACT
GGCACCTGGCTGCTTCTACAACACTCATGCCTGTCTTGAAATATCTGTCTTCTCTGGAAGGAAGGGATCCCACTGTGGAGTTGGCATCAAAAGGCAGCTGATCGGGACGT
TTAAACTGGATGTCAGTCCCGAATGGGGTGATGGGAAGCCAGTCATTCTTTTCAATGGGTGGATAGGGATTGGCAAAAGCAAAAATGAGAATGGAAGACATGGAGCAGAG
CTTCATTTGAGAGTGAAACTAGATCCTGATCCAAGATATGTTTTTCAGTTTGAAGATGTTACTAGGTTAAGCCCGCAAATTGTCCAGCTTCAAGGCTCAATCAAACAGCC
GATCTTCAGCTGCAAATTTAGTCGAGACAGGGTATCCCAGGCGGATTCATTGAGCAACTATTGGTCAGGTCCTGGTGATTTCTCAGATCTCGAGGTCGAGCGAAGGGAAA
GAAAAGGCTGGAAGGTGAAGATACATGATCTTTCTGGCTCGGCTGTTGCAGCAGCCTTCATAACTACTCCCTTTGTGCCATCAACAGGTTGCGATTGGGTTGCCAGGTCG
AACCCCGGTTCCTGGCTGATTGTTCGTCCGGATGTTTGCATACCTGAAAGTTGGCAGCCATGGGGAAAGCTCGAGGCATGGCGTGAGCGTGGGATTAGAGACACTGTTTG
CTGTCGCTTTCACCTTCTCTCCGAAGCCCAAGAGGGAGGGGAACTTCTCATGTCTGAGATCCATATCAATGCTGAGAAAGGTGGAGAGTTCTTCATAGACACTGACAAAC
AATTGCGAGCAGCAACAAGTCCAATACCGAGCCCACAGAGCAGTGGAGACTTTGCAGCATTAGGCCAAGTGGTCGGAGGTTTCGTCATGAGTTGCAGAGTACAAGGGGAA
GGAAAAAGCAGCAAGCCGATGGTTCAACTCGCAATGCGACATGTGACATGTGTAGAAGATGCTGCCATTTTCATGGCGCTTGCAGCTGCAGTTGATCTCAGCATCGAGGC
GTGTAGGCCGTTCCGAAGGAAGATCAGGAGAGCGCCTCGGCATTCATAG
mRNA sequence
ATGTTGCTGTTGTTCTTCTTCTTCAGGAATTTTGGCTGCTTCGCTTTCTGTGGAGCACTGCAGTACTGTTTCTACTTGGCAACTAATCACAAGTTATGTCGTGATTCAAA
CATAACAAAAACGGGGTTTGGATACTTGAGCTCGTCGAAATATAAACGATCTATATCAAATGGACAAAAGATTGTGACCAAAATGGATCCTCAGGCATTTATTAGGTTGT
CAATTGGATCGCTGGGATTGAGAATCCCAGGGGCTGCTCTAAACTCTACAAAACCTGGAGTCAATGCTTTCTCTTCTCCATGTTCATGTGAAATTCGTCTTCGGGGTTTC
CCTGTGCAGACGTCTTCAATCCCACTAGTCCCGTCTCCTGAAGCAATACCTGACTCTCATAGCATTGCCTCAAGCTTCTATCTTGAAGAGTCTGATCTGAAAGCATTACT
GGCACCTGGCTGCTTCTACAACACTCATGCCTGTCTTGAAATATCTGTCTTCTCTGGAAGGAAGGGATCCCACTGTGGAGTTGGCATCAAAAGGCAGCTGATCGGGACGT
TTAAACTGGATGTCAGTCCCGAATGGGGTGATGGGAAGCCAGTCATTCTTTTCAATGGGTGGATAGGGATTGGCAAAAGCAAAAATGAGAATGGAAGACATGGAGCAGAG
CTTCATTTGAGAGTGAAACTAGATCCTGATCCAAGATATGTTTTTCAGTTTGAAGATGTTACTAGGTTAAGCCCGCAAATTGTCCAGCTTCAAGGCTCAATCAAACAGCC
GATCTTCAGCTGCAAATTTAGTCGAGACAGGGTATCCCAGGCGGATTCATTGAGCAACTATTGGTCAGGTCCTGGTGATTTCTCAGATCTCGAGGTCGAGCGAAGGGAAA
GAAAAGGCTGGAAGGTGAAGATACATGATCTTTCTGGCTCGGCTGTTGCAGCAGCCTTCATAACTACTCCCTTTGTGCCATCAACAGGTTGCGATTGGGTTGCCAGGTCG
AACCCCGGTTCCTGGCTGATTGTTCGTCCGGATGTTTGCATACCTGAAAGTTGGCAGCCATGGGGAAAGCTCGAGGCATGGCGTGAGCGTGGGATTAGAGACACTGTTTG
CTGTCGCTTTCACCTTCTCTCCGAAGCCCAAGAGGGAGGGGAACTTCTCATGTCTGAGATCCATATCAATGCTGAGAAAGGTGGAGAGTTCTTCATAGACACTGACAAAC
AATTGCGAGCAGCAACAAGTCCAATACCGAGCCCACAGAGCAGTGGAGACTTTGCAGCATTAGGCCAAGTGGTCGGAGGTTTCGTCATGAGTTGCAGAGTACAAGGGGAA
GGAAAAAGCAGCAAGCCGATGGTTCAACTCGCAATGCGACATGTGACATGTGTAGAAGATGCTGCCATTTTCATGGCGCTTGCAGCTGCAGTTGATCTCAGCATCGAGGC
GTGTAGGCCGTTCCGAAGGAAGATCAGGAGAGCGCCTCGGCATTCATAG
Protein sequence
MLLLFFFFRNFGCFAFCGALQYCFYLATNHKLCRDSNITKTGFGYLSSSKYKRSISNGQKIVTKMDPQAFIRLSIGSLGLRIPGAALNSTKPGVNAFSSPCSCEIRLRGF
PVQTSSIPLVPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVFSGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKSKNENGRHGAE
LHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFSRDRVSQADSLSNYWSGPGDFSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARS
NPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRDTVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGE
GKSSKPMVQLAMRHVTCVEDAAIFMALAAAVDLSIEACRPFRRKIRRAPRHS