CuGenDBv2

Tan0006228 (gene) of Snake gourd v1 genome

Gene ID: Tan0006228
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Nuclear factor 1 A-type isoform 2
Genome location: LG06:72326877..72331091
RNA-Seq Expression: Tan0006228
Synteny: Tan0006228
Gene Ontology terms: NA
InterPro domains: IPR010410 - Protein of unknown function DUF1005


Homology
GenBank top hits (hit | e value | % identity; alignment follows each hit)
KAA0039981.1 Nuclear factor 1 A-type isoform 2 [Cucumis melo var. makuwa] | 9.6e-244 | 96.26
Query:  MDPQAFIRLSIGSLGLRIPGAALNSTKPGVNAFSSPCSCEIRLRGFPVQTSSIPLVPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        MDPQAFIRLSIGSLGLRIPG +LNSTKPGVNAFSSPCSCEIRLRGFP+QTSSIPL+PSPEAIPDSH IASSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRIPGAALNSTKPGVNAFSSPCSCEIRLRGFPVQTSSIPLVPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWCDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDV PEW DGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQF+DVTR SPQIVQLQGSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWCDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS

Query:  RDRVSQVDSLSNYWSGSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
        RDRVSQ DSLSNYWSG GDGSDLE ERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
Subjt:  RDRVSQVDSLSNYWSGSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD

Query:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF
        TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAA SPIPSPQSSGDFAALGQVVGGFVMSCRVQGEG+SSKP VQLAMRHVTCIEDAAIF
Subjt:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF

Query:  MALAAAVDLSIEACRPFRRKIRRAPRHS
        MALAAAVDLSIEACRPFRRKIRR PRHS
Subjt:  MALAAAVDLSIEACRPFRRKIRRAPRHS

XP_004153015.1 uncharacterized protein LOC101204096 [Cucumis sativus] | 9.6e-244 | 96.03
Query:  MDPQAFIRLSIGSLGLRIPGAALNSTKPGVNAFSSPCSCEIRLRGFPVQTSSIPLVPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        MDPQAFIRLSIGSLGLRIPG +LNSTKPGVNAFSSPCSCEIRLRGFP+QTSSIPLVPSPEAIPDSH IASSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRIPGAALNSTKPGVNAFSSPCSCEIRLRGFPVQTSSIPLVPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWCDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDV PEW DGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQF+DVTR SPQ+VQL+GSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWCDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS

Query:  RDRVSQVDSLSNYWSGSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
        RDRVSQ DSLSNYWSG GDGSDLE ERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
Subjt:  RDRVSQVDSLSNYWSGSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD

Query:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF
        TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAA SPIPSPQSSGDFAALGQVVGGFVMSCRVQGEG+SSKP VQLAMRHVTCIEDAAIF
Subjt:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF

Query:  MALAAAVDLSIEACRPFRRKIRRAPRHS
        MALAAAVDLSIEACRPFRRKIRR PRHS
Subjt:  MALAAAVDLSIEACRPFRRKIRRAPRHS

XP_008460086.1 PREDICTED: uncharacterized protein LOC103499002 [Cucumis melo] | 4.3e-244 | 96.5
Query:  MDPQAFIRLSIGSLGLRIPGAALNSTKPGVNAFSSPCSCEIRLRGFPVQTSSIPLVPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        MDPQAFIRLSIGSLGLRIPG +LNSTKPGVNAFSSPCSCEIRLRGFP+QTSSIPLVPSPEAIPDSH IASSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRIPGAALNSTKPGVNAFSSPCSCEIRLRGFPVQTSSIPLVPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWCDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDV PEW DGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQF+DVTR SPQIVQLQGSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWCDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS

Query:  RDRVSQVDSLSNYWSGSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
        RDRVSQ DSLSNYWSG GDGSDLE ERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
Subjt:  RDRVSQVDSLSNYWSGSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD

Query:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF
        TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAA SPIPSPQSSGDFAALGQVVGGFVMSCRVQGEG+SSKP VQLAMRHVTCIEDAAIF
Subjt:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF

Query:  MALAAAVDLSIEACRPFRRKIRRAPRHS
        MALAAAVDLSIEACRPFRRKIRR PRHS
Subjt:  MALAAAVDLSIEACRPFRRKIRRAPRHS

XP_022155757.1 uncharacterized protein LOC111022801 [Momordica charantia] | 1.1e-244 | 96.03
Query:  MDPQAFIRLSIGSLGLRIPGAALNSTKPGVNAFSSPCSCEIRLRGFPVQTSSIPLVPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        M+PQAFIRLSIGSLGLR+PG +LNSTKPG+N+FSSPCSCEIRLRGFPVQ+SSIP++PSPEAIP+SHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRIPGAALNSTKPGVNAFSSPCSCEIRLRGFPVQTSSIPLVPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWCDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDVGPEW DGKPVILFNGWIGIGKSKNENGRHGAELHL+VKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWCDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS

Query:  RDRVSQVDSLSNYWSGSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
        RDRVSQ DSLSNYWSGS DG DLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARS+PGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
Subjt:  RDRVSQVDSLSNYWSGSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD

Query:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF
        TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTC+EDAAIF
Subjt:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF

Query:  MALAAAVDLSIEACRPFRRKIRRAPRHS
        MALAAAVDLSIEACRPFRRKIRRAPRHS
Subjt:  MALAAAVDLSIEACRPFRRKIRRAPRHS

XP_038874344.1 uncharacterized protein LOC120067041 [Benincasa hispida] | 4.9e-248 | 98.13
Query:  MDPQAFIRLSIGSLGLRIPGAALNSTKPGVNAFSSPCSCEIRLRGFPVQTSSIPLVPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        MDPQAFIRLSIGSLGLRIPG +LNSTKPGVNAFSSPCSCEIRLRGFPVQTSSIPLVPSPEAIPDSHSI SSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRIPGAALNSTKPGVNAFSSPCSCEIRLRGFPVQTSSIPLVPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWCDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDVGPEW DGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWCDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS

Query:  RDRVSQVDSLSNYWSGSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
        RDRVSQ DSLSNYWSG GDGSDLE ERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
Subjt:  RDRVSQVDSLSNYWSGSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD

Query:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF
        TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAA SPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF
Subjt:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF

Query:  MALAAAVDLSIEACRPFRRKIRRAPRHS
        MALAAAVDLSIEACRPFRRKIRRAPRHS
Subjt:  MALAAAVDLSIEACRPFRRKIRRAPRHS

TrEMBL top hits (hit | e value | % identity; alignment follows each hit)
A0A0A0K922 Uncharacterized protein | 4.7e-244 | 96.03
Query:  MDPQAFIRLSIGSLGLRIPGAALNSTKPGVNAFSSPCSCEIRLRGFPVQTSSIPLVPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        MDPQAFIRLSIGSLGLRIPG +LNSTKPGVNAFSSPCSCEIRLRGFP+QTSSIPLVPSPEAIPDSH IASSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRIPGAALNSTKPGVNAFSSPCSCEIRLRGFPVQTSSIPLVPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWCDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDV PEW DGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQF+DVTR SPQ+VQL+GSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWCDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS

Query:  RDRVSQVDSLSNYWSGSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
        RDRVSQ DSLSNYWSG GDGSDLE ERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
Subjt:  RDRVSQVDSLSNYWSGSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD

Query:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF
        TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAA SPIPSPQSSGDFAALGQVVGGFVMSCRVQGEG+SSKP VQLAMRHVTCIEDAAIF
Subjt:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF

Query:  MALAAAVDLSIEACRPFRRKIRRAPRHS
        MALAAAVDLSIEACRPFRRKIRR PRHS
Subjt:  MALAAAVDLSIEACRPFRRKIRRAPRHS

A0A1S3CCZ5 uncharacterized protein LOC103499002 | 2.1e-244 | 96.5
Query:  MDPQAFIRLSIGSLGLRIPGAALNSTKPGVNAFSSPCSCEIRLRGFPVQTSSIPLVPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        MDPQAFIRLSIGSLGLRIPG +LNSTKPGVNAFSSPCSCEIRLRGFP+QTSSIPLVPSPEAIPDSH IASSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRIPGAALNSTKPGVNAFSSPCSCEIRLRGFPVQTSSIPLVPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWCDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDV PEW DGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQF+DVTR SPQIVQLQGSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWCDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS

Query:  RDRVSQVDSLSNYWSGSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
        RDRVSQ DSLSNYWSG GDGSDLE ERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
Subjt:  RDRVSQVDSLSNYWSGSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD

Query:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF
        TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAA SPIPSPQSSGDFAALGQVVGGFVMSCRVQGEG+SSKP VQLAMRHVTCIEDAAIF
Subjt:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF

Query:  MALAAAVDLSIEACRPFRRKIRRAPRHS
        MALAAAVDLSIEACRPFRRKIRR PRHS
Subjt:  MALAAAVDLSIEACRPFRRKIRRAPRHS

A0A5A7TE92 Nuclear factor 1 A-type isoform 2 | 4.7e-244 | 96.26
Query:  MDPQAFIRLSIGSLGLRIPGAALNSTKPGVNAFSSPCSCEIRLRGFPVQTSSIPLVPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        MDPQAFIRLSIGSLGLRIPG +LNSTKPGVNAFSSPCSCEIRLRGFP+QTSSIPL+PSPEAIPDSH IASSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRIPGAALNSTKPGVNAFSSPCSCEIRLRGFPVQTSSIPLVPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWCDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDV PEW DGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQF+DVTR SPQIVQLQGSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWCDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS

Query:  RDRVSQVDSLSNYWSGSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
        RDRVSQ DSLSNYWSG GDGSDLE ERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
Subjt:  RDRVSQVDSLSNYWSGSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD

Query:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF
        TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAA SPIPSPQSSGDFAALGQVVGGFVMSCRVQGEG+SSKP VQLAMRHVTCIEDAAIF
Subjt:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF

Query:  MALAAAVDLSIEACRPFRRKIRRAPRHS
        MALAAAVDLSIEACRPFRRKIRR PRHS
Subjt:  MALAAAVDLSIEACRPFRRKIRRAPRHS

A0A6J1DQ78 uncharacterized protein LOC111022801 | 5.5e-245 | 96.03
Query:  MDPQAFIRLSIGSLGLRIPGAALNSTKPGVNAFSSPCSCEIRLRGFPVQTSSIPLVPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        M+PQAFIRLSIGSLGLR+PG +LNSTKPG+N+FSSPCSCEIRLRGFPVQ+SSIP++PSPEAIP+SHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRIPGAALNSTKPGVNAFSSPCSCEIRLRGFPVQTSSIPLVPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWCDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDVGPEW DGKPVILFNGWIGIGKSKNENGRHGAELHL+VKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWCDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS

Query:  RDRVSQVDSLSNYWSGSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
        RDRVSQ DSLSNYWSGS DG DLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARS+PGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
Subjt:  RDRVSQVDSLSNYWSGSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD

Query:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF
        TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTC+EDAAIF
Subjt:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF

Query:  MALAAAVDLSIEACRPFRRKIRRAPRHS
        MALAAAVDLSIEACRPFRRKIRRAPRHS
Subjt:  MALAAAVDLSIEACRPFRRKIRRAPRHS

A0A6J1HM61 uncharacterized protein LOC111464238 | 1.0e-243 | 96.03
Query:  MDPQAFIRLSIGSLGLRIPGAALNSTKPGVNAFSSPCSCEIRLRGFPVQTSSIPLVPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        MDPQAFIRLSIGSLGLRIPG A+NSTKPGV+A SSPCSCEIRLRGFPVQTSSIP+V SPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRIPGAALNSTKPGVNAFSSPCSCEIRLRGFPVQTSSIPLVPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWCDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDVGPEW DGKPV+LFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQL+GSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWCDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS

Query:  RDRVSQVDSLSNYWSGSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
        RDRVSQ DSLSNYWSGSGDGSD+EAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDV IP+SWQPWGKLEAWRERGIRD
Subjt:  RDRVSQVDSLSNYWSGSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD

Query:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF
        TVCCRFHLLSE QEGGELLMSEIHINAEKGGEFFIDTDKQLRAA SPIPSPQSSGDFAALGQV+GGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF
Subjt:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF

Query:  MALAAAVDLSIEACRPFRRKIRRAPRHS
        MA+AAAVDLSIEACRPFRRKIRRAPRHS
Subjt:  MALAAAVDLSIEACRPFRRKIRRAPRHS

SwissProt top hits (hit | e value | % identity; alignment follows each hit)
No hits found
Arabidopsis top hits (hit | e value | % identity; alignment follows each hit)
AT1G10020.1 Protein of unknown function (DUF1005) | 5.5e-96 | 42.15
Query:  MDPQAFIRLSIGSLGLRIPGAALNSTKPGVNAFSSPCSCEIRLRGFPVQTSSIPLVP-SPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISV
        MDP  FIRL+IG+L L++P AA  +T   V+  SSPC C+I+L+ FP QT++IP +P      P+  ++A++F+L  SD++  LA    + +  CL+I +
Subjt:  MDPQAFIRLSIGSLGLRIPGAALNSTKPGVNAFSSPCSCEIRLRGFPVQTSSIPLVP-SPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISV

Query:  FSGRKGSHCGVGIKRQLIGTFKLDVGPEWCDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKF
        ++GR G+ CGV   R L+    + +       KP +  NGWI +GK   ++    A+ HL VK +PDPR+VFQF+     SPQ+VQ+QG+I+QP+F+CKF
Subjt:  FSGRKGSHCGVGIKRQLIGTFKLDVGPEWCDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKF

Query:  S----RDRVSQVDSL------SNYWSGSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGK
        S     DR  +  SL      S  W  S  GS+ E   +ERKGW + +HDLSGS VA A I TPFV S G D V+RSNPGSWLI+RP  C   +W+PWG+
Subjt:  S----RDRVSQVDSL------SNYWSGSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGK

Query:  LEAWRER-GIRDTVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAA----------------------ASPIPSPQ-SSGDFAA---LGQ
        LEAWRER G  D +  RF L+ +   G  ++++E  I++ +GG+F I+      ++                      ASP  SP+  SGD+        
Subjt:  LEAWRER-GIRDTVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAA----------------------ASPIPSPQ-SSGDFAA---LGQ

Query:  VVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIFMALAAAVDLSIEACRPFRRKIRRAPRH
        V  GFVMS  V+GEGK SKP V+++++HV+C+EDAA ++AL+AA+DLS++ACR F +++R+   H
Subjt:  VVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIFMALAAAVDLSIEACRPFRRKIRRAPRH

AT1G50040.1 Protein of unknown function (DUF1005) | 3.5e-74 | 36.81
Query:  MDPQAFIRLSIGSLGLRIP------GAALNSTKPGVNAFSS-PCSCEIRLRGFPVQTSSIPLVPSPEAIPDSH-------SIASSFYLEESDLKALLAPG
        MDP +F+R+ +G+L +R P       ++ +S+ P V+  SS  C C+I+ + FP Q  S+P++   E+  +S        ++A+ F L +S ++  L   
Subjt:  MDPQAFIRLSIGSLGLRIP------GAALNSTKPGVNAFSS-PCSCEIRLRGFPVQTSSIPLVPSPEAIPDSH-------SIASSFYLEESDLKALLAPG

Query:  CFYNTHACLEISVFSGRKGSHCG--VGIKRQLIGTFKLDVGPEWCDGKPVILFNGWIGIGKSKNENGRHGA--ELHLRVKLDPDPRYVFQFEDVTRLSPQ
         +    + L + V+S R+ + CG       +LIG F++ +  +  + K  +  NGW+ +G     N + G+  ELH+ V+++PD R+VFQF+     SPQ
Subjt:  CFYNTHACLEISVFSGRKGSHCG--VGIKRQLIGTFKLDVGPEWCDGKPVILFNGWIGIGKSKNENGRHGA--ELHLRVKLDPDPRYVFQFEDVTRLSPQ

Query:  IVQLQGSIKQPIFSCKF----SRDRVSQVDSLSNYWSGSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDV
        + Q+QG+ KQ +F+CKF    S DR   + SLS+  SG       E   +ERKGW + IHDLSGS VA A + TPFVPS G + V+RS+PG+WLI+RPD 
Subjt:  IVQLQGSIKQPIFSCKF----SRDRVSQVDSLSNYWSGSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDV

Query:  CIPESWQPWGKLEAWRERGIRDTVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVG-------------
            +W+PW +L+AWRE G+ D +  RF L  +       + +   I+ + GG F ID        AS   S + S D ++   +               
Subjt:  CIPESWQPWGKLEAWRERGIRDTVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVG-------------

Query:  ---------GFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIFMALAAAVDLSIEACRPFRRKIRRAPR
                 GFVMS RVQG  K SKP V++ ++HVTC EDAA  +ALAAAVDLS++ACR F +K+R   R
Subjt:  ---------GFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIFMALAAAVDLSIEACRPFRRKIRRAPR

AT3G19680.1 Protein of unknown function (DUF1005) | 9.7e-77 | 36.36
Query:  MDPQAFIRLSIGSLGLRIPGAALNSTK------PGVNAFSSPCSCEIRLRGFPVQTSSIPLVPSPEAIPDSH--------SIASSFYLEESDLKALLAPG
        MDP +F+R+ +G+L +R P ++ +S+        G+N  +  C C+IR + FP +  S+P++   E+  ++         ++A+ F L ++ ++A L   
Subjt:  MDPQAFIRLSIGSLGLRIPGAALNSTK------PGVNAFSSPCSCEIRLRGFPVQTSSIPLVPSPEAIPDSH--------SIASSFYLEESDLKALLAPG

Query:  CFYNTHACLEISVFS--------GRKGSHCGVGIK-RQLIGTFKLDVGPEWCDGKPVILFNGWIGI--GKSKNENGRHGAELHLRVKLDPDPRYVFQFED
         F    + L +  +S        G  G+ CG+     +L+G F++ +  +  + K  +  NGW+ +   K+K++ G    ELH+ V+++PDPR+VFQF+ 
Subjt:  CFYNTHACLEISVFS--------GRKGSHCGVGIK-RQLIGTFKLDVGPEWCDGKPVILFNGWIGI--GKSKNENGRHGAELHLRVKLDPDPRYVFQFED

Query:  VTRLSPQIVQLQGSIKQPIFSCKF------SRDR-----VSQVDSLSNYWSG-SGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWV
            SPQ+ Q+QG+ KQ +F+CKF      S DR      S +  +S+  S  S   S+ E   +ERKGW + +HDLSGS VA A + TPFVPS G + V
Subjt:  VTRLSPQIVQLQGSIKQPIFSCKF------SRDR-----VSQVDSLSNYWSG-SGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWV

Query:  ARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRDTVCCRFHLLSEAQEG-GELLMSEIHINAEKGGEFFID-TDKQLRAAASPIPSPQSSGDFAALG
         RS+PG+WLI+RPD C   +W+PWG+LEAWRE G  DT+  RF L    Q+G    + +   I+ + GG F ID T      A++P  SPQ S D  + G
Subjt:  ARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRDTVCCRFHLLSEAQEG-GELLMSEIHINAEKGGEFFID-TDKQLRAAASPIPSPQSSGDFAALG

Query:  QVVG------------------------------GFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIFMALAAAVDLSIEACRPFRRKIRRAPR
           G                              GFVMS  V+G GK SKP V++ + HVTC EDAA  +ALAAAVDLS++ACR F  K+R+  R
Subjt:  QVVG------------------------------GFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIFMALAAAVDLSIEACRPFRRKIRRAPR

AT4G29310.1 Protein of unknown function (DUF1005) | 8.2e-84 | 40.91
Query:  MDPQAFIRLSIGSLGLRIPGAALNSTKPG-VNAFSSPCSCEIRLRGFPVQTSSIPLVPSPEAI--PDSHSIASSFYLEESDLKALLAPGCFYNTHACLEI
        MDP  F+RL+I SL LR+P  A N    G V+  S+PC C++R++ FP Q + +PL    +A   P+S + A  F+L+   ++ +            L +
Subjt:  MDPQAFIRLSIGSLGLRIPGAALNSTKPG-VNAFSSPCSCEIRLRGFPVQTSSIPLVPSPEAI--PDSHSIASSFYLEESDLKALLAPGCFYNTHACLEI

Query:  SVFSGRKGSHCGVGIKRQLIGTFKLDVGPEWCDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSC
        SV++GR G  CGV    +L+G  ++ V       + V   NGW  +G    +  +  A LHL V  +PDPR+VFQF      SP + Q+Q ++KQP+FSC
Subjt:  SVFSGRKGSHCGVGIKRQLIGTFKLDVGPEWCDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSC

Query:  KFSRDRVSQVDSL-------SNYW---SGSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPW
        KFS DR  +  SL       S  W   + SGD  + + + RERKGW + IHDLSGS VAAA + TPFV S G D V+RSNPG+WLI+RP      SW+PW
Subjt:  KFSRDRVSQVDSL-------SNYW---SGSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPW

Query:  GKLEAWRERGIRDTVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLA
        G+LEAWRERG  D +  +F L+ +      + ++E  ++ ++GG+F ID                  G+  A+   V GFVM   V+GEGK SKP+V + 
Subjt:  GKLEAWRERGIRDTVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLA

Query:  MRHVTCIEDAAIFMALAAAVDLSIEACRPFRRKIRRAPRH
         +HVTC+ DAA+F+AL+AAVDLS++AC+ F RK+R+   H
Subjt:  MRHVTCIEDAAIFMALAAAVDLSIEACRPFRRKIRRAPRH

AT5G17640.1 Protein of unknown function (DUF1005) | 1.4e-195 | 77.78
Query:  MDPQAFIRLSIGSLGLRIPGAALNSTKPG--VNAFSSPCSCEIRLRGFPVQTSSIPLVPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEIS
        MDPQAFIRLS+GSL LRIP   +NST        FSS CSCEI+LRGFPVQT+SIPL+PS +A PD HSI++SFYLEESDL+ALL PGCFY+ HA LEIS
Subjt:  MDPQAFIRLSIGSLGLRIPGAALNSTKPG--VNAFSSPCSCEIRLRGFPVQTSSIPLVPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEIS

Query:  VFSGRKGSHCGVGIKRQLIGTFKLDVGPEWCDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCK
        VF+G+K  +CGVG KRQ IG FKL+VGPEW +GKP+ILFNGWI IGK+K +     AELHL+VKLDPDPRYVFQFEDVT LSPQIVQL+GS+KQPIFSCK
Subjt:  VFSGRKGSHCGVGIKRQLIGTFKLDVGPEWCDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCK

Query:  FSRDRVSQVDSLSNYWSGSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGI
        FSRDRVSQVD L+ YWS SGDG++LE+ERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVA+SNPG+WL+VRPD   P SWQPWGKLEAWRERGI
Subjt:  FSRDRVSQVDSLSNYWSGSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGI

Query:  RDTVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQ-LRAAASPIPSPQSSGDFAALGQVV--GGFVMSCRVQGEGKSSKPMVQLAMRHVTCIE
        RD+VCCRFHLLS   E G++LMSEI I+AEKGGEF IDTDKQ L  AA+PIPSPQSSGDF+ LGQ V  GGFVMS RVQGEGKSSKP+VQLAMRHVTC+E
Subjt:  RDTVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQ-LRAAASPIPSPQSSGDFAALGQVV--GGFVMSCRVQGEGKSSKPMVQLAMRHVTCIE

Query:  DAAIFMALAAAVDLSIEACRPFRRKIRRAPRH
        DAAIFMALAAAVDLSI AC+PFRR  RR  RH
Subjt:  DAAIFMALAAAVDLSIEACRPFRRKIRRAPRH
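
The % identity values can be related to the alignment rows above. In each three-line block, the unlabeled middle row repeats the residue where query and hit agree, shows `+` for a conservative substitution, and is blank otherwise. The Subjt rows as displayed on this page appear identical to the Query rows, so the following minimal Python sketch relies on the middle row only. It assumes the usual BLAST midline convention and counts every aligned column, gaps included, in the denominator; `count_identities` is a hypothetical helper, not part of CuGenDB. The example uses only the final 28-residue block of the first GenBank hit, so the result describes that block and not the whole-alignment figure listed for the hit.

```python
def count_identities(query: str, midline: str) -> tuple[int, int]:
    """Return (identical columns, aligned columns) for one alignment block.

    Assumes the midline row repeats the residue letter at identical
    positions; '+' and blanks are treated as non-identical.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline rows must have the same length")
    identical = sum(
        1 for q, m in zip(query, midline)
        if q != "-" and q == m  # skip gap columns in the query
    )
    return identical, len(query)


# Last block of the KAA0039981.1 alignment, copied from this page.
query = "MALAAAVDLSIEACRPFRRKIRRAPRHS"
midline = "MALAAAVDLSIEACRPFRRKIRR PRHS"

identical, aligned = count_identities(query, midline)
percent_identity = 100.0 * identical / aligned
print(f"{identical}/{aligned} identical ({percent_identity:.2f}%)")
```

To reproduce a whole-hit value, the counts from every block of that hit would need to be summed before dividing. Trailing blanks in a midline row are easily lost when copying, so each row may need padding to the query length first.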


Sequences
CDS sequence
ATGGATCCTCAGGCTTTTATTAGGTTGTCAATTGGCTCTTTGGGATTGAGAATCCCAGGAGCTGCTCTAAATTCTACAAAACCTGGAGTCAATGCTTTCTCTTCTCCATG
TTCCTGTGAAATTCGTCTTCGGGGGTTCCCTGTGCAGACGTCTTCAATCCCGCTTGTCCCGTCTCCCGAAGCCATACCTGATTCTCATAGCATTGCCTCAAGCTTCTATC
TTGAAGAGTCTGATTTGAAGGCATTACTCGCACCTGGCTGCTTCTACAACACTCATGCCTGTCTTGAAATATCTGTCTTCTCTGGAAGGAAGGGATCCCACTGCGGTGTT
GGCATCAAAAGACAGCTGATCGGGACATTTAAACTGGATGTCGGCCCCGAATGGTGTGATGGGAAGCCAGTCATTCTTTTCAATGGGTGGATAGGCATTGGCAAAAGTAA
GAATGAGAATGGAAGACATGGAGCAGAGCTTCATTTGAGAGTGAAACTAGATCCTGATCCAAGATACGTTTTTCAGTTTGAAGATGTCACGAGGTTAAGCCCACAAATCG
TCCAGCTTCAAGGCTCAATCAAACAGCCAATCTTCAGCTGCAAATTTAGTCGAGACAGGGTGTCCCAGGTGGATTCATTGAGCAACTATTGGTCAGGTTCTGGTGATGGC
TCGGATCTTGAAGCCGAGCGGAGAGAAAGAAAAGGCTGGAAGGTGAAGATACATGATCTTTCTGGCTCGGCTGTTGCAGCAGCCTTCATAACTACTCCCTTTGTGCCATC
AACAGGTTGCGATTGGGTTGCCAGGTCGAACCCCGGGTCCTGGCTGATTGTTCGTCCTGATGTTTGCATACCTGAAAGTTGGCAGCCATGGGGAAAGCTCGAGGCGTGGC
GCGAGCGTGGGATTAGAGACACAGTCTGCTGTCGCTTTCACCTTCTCTCCGAAGCGCAAGAGGGAGGGGAGCTTCTCATGTCTGAGATCCATATCAATGCCGAGAAAGGC
GGGGAGTTCTTCATAGACACTGACAAACAGTTGCGAGCAGCAGCCAGTCCGATACCAAGCCCGCAGAGCAGCGGAGACTTTGCAGCATTAGGCCAAGTGGTTGGAGGTTT
CGTCATGAGCTGCAGAGTACAAGGGGAAGGAAAGAGCAGTAAGCCAATGGTTCAACTCGCAATGCGACATGTGACATGTATAGAGGATGCTGCCATTTTCATGGCACTTG
CAGCTGCGGTCGATCTCAGCATCGAGGCGTGTAGGCCATTCCGAAGGAAGATAAGGAGAGCGCCTCGACATTCTTAG
mRNA sequence
CTCTTCTCTCATAAGCCAATTGGTTACAGCTGTTTCCACGATTTCTCTCTCTCTCTCTCTATCGCTTTTTCTTCCACTGCCACTATTCCTCTGTTCTTTCCAACCCAACA
GGGACGTTCAAACTCCAAATCAGAGTAAAAAATCGCTTAAAATTTCCCTTTCTCTAACAAAATTTCACCTCAATTCTATCGCTTCTTCTTTATTCTGTTCTTGTTTCTTC
TCAAAATCAAAGGGGTAGTAGTATTTTTGTATAGTATTGTTTTTTGTAGGTCCAGATTATGTATTACCCATCTCTTTCTTTATTCGATTTAAGCCATTTGAACTGTTTGG
TTTGGTGAGAAAATTAAACAAAACCCCCCTCCCCTTAAATTTCTCCGTCTATTTCGTGACAAATAATGCAGATTTCTCTGAGAAAAAGGGGTATATTATAAACTTCTCTT
ATCGATTTGAGTGAGTGAGAGAGAGAAATAGGGAGTTTCCTTTTCTGGGTTGGGTTTATTTTGGGGTTTGTGAAGCGGCCTCGTGATCTTTGAAGTCTCTACGTGGCATT
CGAGACTCTTGTTCATATGGGGGTTGGATTTGGGGGATTGGGGATTCTTTAAAAGCTCATTATTGGATCTGTGTTTGTTTGTTTCATGAACCCTTTTCTAGAAACATTCA
TCCTTTCACATTTAGTGAGAAAATTGGAGAGAGGGAGGAGGGGAAGTGGGATTTGATTTCGGTGTTTGTTTTGGGGGGAAATGTTGCTGTTCTTCTTGTTGTTCTTCAGG
GATTCTGGCTGCTCTGCTTTCTGTGGAGCACTGAAGTACTTTTTCTACTTGGCTACTAATCACAAGGATACTTGAGCTCGTCGAAATATAAACGATTGTATAACAAATGG
ACAAAAAATTGTGACCGAAATGGATCCTCAGGCTTTTATTAGGTTGTCAATTGGCTCTTTGGGATTGAGAATCCCAGGAGCTGCTCTAAATTCTACAAAACCTGGAGTCA
ATGCTTTCTCTTCTCCATGTTCCTGTGAAATTCGTCTTCGGGGGTTCCCTGTGCAGACGTCTTCAATCCCGCTTGTCCCGTCTCCCGAAGCCATACCTGATTCTCATAGC
ATTGCCTCAAGCTTCTATCTTGAAGAGTCTGATTTGAAGGCATTACTCGCACCTGGCTGCTTCTACAACACTCATGCCTGTCTTGAAATATCTGTCTTCTCTGGAAGGAA
GGGATCCCACTGCGGTGTTGGCATCAAAAGACAGCTGATCGGGACATTTAAACTGGATGTCGGCCCCGAATGGTGTGATGGGAAGCCAGTCATTCTTTTCAATGGGTGGA
TAGGCATTGGCAAAAGTAAGAATGAGAATGGAAGACATGGAGCAGAGCTTCATTTGAGAGTGAAACTAGATCCTGATCCAAGATACGTTTTTCAGTTTGAAGATGTCACG
AGGTTAAGCCCACAAATCGTCCAGCTTCAAGGCTCAATCAAACAGCCAATCTTCAGCTGCAAATTTAGTCGAGACAGGGTGTCCCAGGTGGATTCATTGAGCAACTATTG
GTCAGGTTCTGGTGATGGCTCGGATCTTGAAGCCGAGCGGAGAGAAAGAAAAGGCTGGAAGGTGAAGATACATGATCTTTCTGGCTCGGCTGTTGCAGCAGCCTTCATAA
CTACTCCCTTTGTGCCATCAACAGGTTGCGATTGGGTTGCCAGGTCGAACCCCGGGTCCTGGCTGATTGTTCGTCCTGATGTTTGCATACCTGAAAGTTGGCAGCCATGG
GGAAAGCTCGAGGCGTGGCGCGAGCGTGGGATTAGAGACACAGTCTGCTGTCGCTTTCACCTTCTCTCCGAAGCGCAAGAGGGAGGGGAGCTTCTCATGTCTGAGATCCA
TATCAATGCCGAGAAAGGCGGGGAGTTCTTCATAGACACTGACAAACAGTTGCGAGCAGCAGCCAGTCCGATACCAAGCCCGCAGAGCAGCGGAGACTTTGCAGCATTAG
GCCAAGTGGTTGGAGGTTTCGTCATGAGCTGCAGAGTACAAGGGGAAGGAAAGAGCAGTAAGCCAATGGTTCAACTCGCAATGCGACATGTGACATGTATAGAGGATGCT
GCCATTTTCATGGCACTTGCAGCTGCGGTCGATCTCAGCATCGAGGCGTGTAGGCCATTCCGAAGGAAGATAAGGAGAGCGCCTCGACATTCTTAGGGAGAAAAGGGAAA
AGGAAAAGGAAAAAAAAAAAGTGTTTTAAGGTAAAATTGATGTGTACAAACATAAACAAAGATGGGCATACATCTGTATGAATTTATATCTGAAGGCTATAAAATTTGGC
ATTTTGTACA
Protein sequence
MDPQAFIRLSIGSLGLRIPGAALNSTKPGVNAFSSPCSCEIRLRGFPVQTSSIPLVPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVFSGRKGSHCGV
GIKRQLIGTFKLDVGPEWCDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFSRDRVSQVDSLSNYWSGSGDG
SDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRDTVCCRFHLLSEAQEGGELLMSEIHINAEKG
GEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIFMALAAAVDLSIEACRPFRRKIRRAPRHS
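
The CDS and protein sequences above can be checked against each other by translating the CDS codon by codon. The following is a minimal Python sketch, assuming the standard nuclear genetic code; `translate` and the two short test fragments (the first and last 24 nucleotides of the CDS) are illustrative choices, not part of CuGenDB.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "")
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First and last 24 nucleotides of the CDS listed on this page.
cds_start = "ATGGATCCTCAGGCTTTTATTAGG"
cds_end = "AGGAGAGCGCCTCGACATTCTTAG"

print(translate(cds_start))
print(translate(cds_end))
```

Passing the full CDS with its line breaks joined should, under the same assumption, give the complete protein sequence listed above.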