CuGenDBv2

CmoCh08G009540 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh08G009540
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: Nuclear factor 1 A-type isoform 2
Genome location: Cmo_Chr08:6200408..6202060
RNA-Seq Expression: CmoCh08G009540
Synteny: CmoCh08G009540
Gene Ontology terms: NA
InterPro domains: IPR010410 - Protein of unknown function DUF1005
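
The genome location above follows a chromosome:start..end pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption; the page does not state the convention), the string can be split into its parts and the span length computed. The name `parse_location` is illustrative and not part of CuGenDB.

```python
import re

def parse_location(location):
    """Split 'chrom:start..end' into (chrom, start, end, span_length).

    Assumes 1-based inclusive coordinates, so span = end - start + 1.
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return chrom, start, end, end - start + 1

chrom, start, end, span = parse_location("Cmo_Chr08:6200408..6202060")
print(chrom, start, end, span)
```

Under the inclusive-coordinate assumption, the gene spans 1653 bp on Cmo_Chr08.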


Homology
GenBank top hits | e value | % identity
KAG6593741.1 hypothetical protein SDJN03_13217, partial [Cucurbita argyrosperma subsp. sororia] | 3.4e-249 | 99.3
Query:  MDPQAFIRLSIGSLGLRIPGTAINSTKPGVSALSSPCSCEIRLRGFPVQTSSIPVVLSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        MDPQAFIRLSIGSLGLRIPGTAINSTKPGV ALSSPCSCEIRLRGFPVQTSSIPVVLSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRIPGTAINSTKPGVSALSSPCSCEIRLRGFPVQTSSIPVVLSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVVLFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLRGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVVLFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLRGSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVVLFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLRGSIKQPIFSCKFS

Query:  RDRVSQADSLSNYWSGSGDGSDVEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVSIPDSWQPWGKLEAWRERGIRD
        RDRVSQADSLSNYWSGSGDGSDVEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVSIPDSWQPWGKLEAWRERGIRD
Subjt:  RDRVSQADSLSNYWSGSGDGSDVEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVSIPDSWQPWGKLEAWRERGIRD

Query:  TVCCRFHLLSETQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAMSPIPSPQSSGDFAALGQVIGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF
        TVCCRFHLLSETQEGGELLMSEIHINAEKGGEFFIDTDKQLRAA SPIPSPQSSGDFAALGQVIGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF
Subjt:  TVCCRFHLLSETQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAMSPIPSPQSSGDFAALGQVIGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF

Query:  MAVAAAVDLSIEACRPFRRKIRRAPRHS
        MAVAAAVDLSIEACRPFRRKIRR PRHS
Subjt:  MAVAAAVDLSIEACRPFRRKIRRAPRHS
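
In standard BLAST pairwise output, the unlabeled line between Query and Subjt marks each column: a letter for an identical residue, `+` for a conservative substitution, and a blank for a mismatch. The sketch below, a hedged illustration rather than the method used by the database, counts identical columns in one alignment block. The function name `block_identity` is hypothetical, and the strings are passed without the 8-character `Query:  ` prefix.

```python
def block_identity(query, midline):
    """Return (identical_columns, alignment_columns) for one alignment block.

    The midline is padded to the query length because trailing blanks
    (mismatches at the end of a block) are often stripped from text output.
    """
    padded = midline.ljust(len(query))
    identical = sum(1 for ch in padded[:len(query)] if ch.isalpha())
    return identical, len(query)

# Last block of the alignment above, with the line prefixes removed.
query = "MAVAAAVDLSIEACRPFRRKIRRAPRHS"
midline = "MAVAAAVDLSIEACRPFRRKIRR PRHS"
identical, columns = block_identity(query, midline)
print(identical, columns, round(100 * identical / columns, 2))
```

Summing the two counts over every block of a hit and dividing gives the overall percent identity for that hit.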

XP_022964109.1 uncharacterized protein LOC111464238 [Cucurbita moschata] | 8.1e-251 | 100
Query:  MDPQAFIRLSIGSLGLRIPGTAINSTKPGVSALSSPCSCEIRLRGFPVQTSSIPVVLSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        MDPQAFIRLSIGSLGLRIPGTAINSTKPGVSALSSPCSCEIRLRGFPVQTSSIPVVLSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRIPGTAINSTKPGVSALSSPCSCEIRLRGFPVQTSSIPVVLSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVVLFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLRGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVVLFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLRGSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVVLFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLRGSIKQPIFSCKFS

Query:  RDRVSQADSLSNYWSGSGDGSDVEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVSIPDSWQPWGKLEAWRERGIRD
        RDRVSQADSLSNYWSGSGDGSDVEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVSIPDSWQPWGKLEAWRERGIRD
Subjt:  RDRVSQADSLSNYWSGSGDGSDVEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVSIPDSWQPWGKLEAWRERGIRD

Query:  TVCCRFHLLSETQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAMSPIPSPQSSGDFAALGQVIGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF
        TVCCRFHLLSETQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAMSPIPSPQSSGDFAALGQVIGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF
Subjt:  TVCCRFHLLSETQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAMSPIPSPQSSGDFAALGQVIGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF

Query:  MAVAAAVDLSIEACRPFRRKIRRAPRHS
        MAVAAAVDLSIEACRPFRRKIRRAPRHS
Subjt:  MAVAAAVDLSIEACRPFRRKIRRAPRHS

XP_023000027.1 uncharacterized protein LOC111494335 [Cucurbita maxima] | 9.9e-249 | 98.83
Query:  MDPQAFIRLSIGSLGLRIPGTAINSTKPGVSALSSPCSCEIRLRGFPVQTSSIPVVLSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        MDPQAFIRLSIGSLGLRIPGTAINSTKPGV ALSSPCSCEIRLRGFPVQTSSIPVVLSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRIPGTAINSTKPGVSALSSPCSCEIRLRGFPVQTSSIPVVLSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVVLFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLRGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQL+GTFKLDVGPEWGDGKPVVLFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLRGSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVVLFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLRGSIKQPIFSCKFS

Query:  RDRVSQADSLSNYWSGSGDGSDVEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVSIPDSWQPWGKLEAWRERGIRD
        RDRVSQADSLSNYWSGSGDGSDVEA+RRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVSIPDSWQPWGKLEAWRERGIRD
Subjt:  RDRVSQADSLSNYWSGSGDGSDVEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVSIPDSWQPWGKLEAWRERGIRD

Query:  TVCCRFHLLSETQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAMSPIPSPQSSGDFAALGQVIGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF
        TVCCRFHLLSETQEGGELLMSEIHINAEKGGEFFIDTDKQLRAA SPIPSPQSSGDFAALGQVIGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF
Subjt:  TVCCRFHLLSETQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAMSPIPSPQSSGDFAALGQVIGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF

Query:  MAVAAAVDLSIEACRPFRRKIRRAPRHS
        MAVAAAVDLSIEACRPFRRKIRRAP+HS
Subjt:  MAVAAAVDLSIEACRPFRRKIRRAPRHS

XP_023514401.1 uncharacterized protein LOC111778675 [Cucurbita pepo subsp. pepo] | 2.6e-249 | 99.3
Query:  MDPQAFIRLSIGSLGLRIPGTAINSTKPGVSALSSPCSCEIRLRGFPVQTSSIPVVLSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        MDPQAFIRLSIGSLGLRIPGTAINSTKPGV ALSSPCSCEIRLRGFPVQTSSIPVVLSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRIPGTAINSTKPGVSALSSPCSCEIRLRGFPVQTSSIPVVLSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVVLFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLRGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVVLFNGWIGIGK+KNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLRGSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVVLFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLRGSIKQPIFSCKFS

Query:  RDRVSQADSLSNYWSGSGDGSDVEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVSIPDSWQPWGKLEAWRERGIRD
        RDRVSQADSLSNYWSGSGDGSDVEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVSIPDSWQPWGKLEAWRERGIRD
Subjt:  RDRVSQADSLSNYWSGSGDGSDVEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVSIPDSWQPWGKLEAWRERGIRD

Query:  TVCCRFHLLSETQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAMSPIPSPQSSGDFAALGQVIGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF
        TVCCRFHLLSETQEGGELLMSEIHINAEKGGEFFIDTDKQLRAA SPIPSPQSSGDFAALGQVIGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF
Subjt:  TVCCRFHLLSETQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAMSPIPSPQSSGDFAALGQVIGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF

Query:  MAVAAAVDLSIEACRPFRRKIRRAPRHS
        MAVAAAVDLSIEACRPFRRKIRRAPRHS
Subjt:  MAVAAAVDLSIEACRPFRRKIRRAPRHS

XP_038874344.1 uncharacterized protein LOC120067041 [Benincasa hispida] | 4.8e-243 | 95.79
Query:  MDPQAFIRLSIGSLGLRIPGTAINSTKPGVSALSSPCSCEIRLRGFPVQTSSIPVVLSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        MDPQAFIRLSIGSLGLRIPGT++NSTKPGV+A SSPCSCEIRLRGFPVQTSSIP+V SPEAIPDSHSI SSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRIPGTAINSTKPGVSALSSPCSCEIRLRGFPVQTSSIPVVLSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVVLFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLRGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPV+LFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQL+GSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVVLFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLRGSIKQPIFSCKFS

Query:  RDRVSQADSLSNYWSGSGDGSDVEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVSIPDSWQPWGKLEAWRERGIRD
        RDRVSQADSLSNYWSG GDGSD+E ERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDV IP+SWQPWGKLEAWRERGIRD
Subjt:  RDRVSQADSLSNYWSGSGDGSDVEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVSIPDSWQPWGKLEAWRERGIRD

Query:  TVCCRFHLLSETQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAMSPIPSPQSSGDFAALGQVIGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF
        TVCCRFHLLSE QEGGELLMSEIHINAEKGGEFFIDTDKQLRAA SPIPSPQSSGDFAALGQV+GGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF
Subjt:  TVCCRFHLLSETQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAMSPIPSPQSSGDFAALGQVIGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF

Query:  MAVAAAVDLSIEACRPFRRKIRRAPRHS
        MA+AAAVDLSIEACRPFRRKIRRAPRHS
Subjt:  MAVAAAVDLSIEACRPFRRKIRRAPRHS

TrEMBL top hits | e value | % identity
A0A0A0K922 Uncharacterized protein | 5.3e-240 | 94.16
Query:  MDPQAFIRLSIGSLGLRIPGTAINSTKPGVSALSSPCSCEIRLRGFPVQTSSIPVVLSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        MDPQAFIRLSIGSLGLRIPGT++NSTKPGV+A SSPCSCEIRLRGFP+QTSSIP+V SPEAIPDSH IASSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRIPGTAINSTKPGVSALSSPCSCEIRLRGFPVQTSSIPVVLSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVVLFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLRGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDV PEWGDGKPV+LFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQF+DVTR SPQ+VQLRGSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVVLFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLRGSIKQPIFSCKFS

Query:  RDRVSQADSLSNYWSGSGDGSDVEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVSIPDSWQPWGKLEAWRERGIRD
        RDRVSQADSLSNYWSG GDGSD+E ERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDV IP+SWQPWGKLEAWRERGIRD
Subjt:  RDRVSQADSLSNYWSGSGDGSDVEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVSIPDSWQPWGKLEAWRERGIRD

Query:  TVCCRFHLLSETQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAMSPIPSPQSSGDFAALGQVIGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF
        TVCCRFHLLSE QEGGELLMSEIHINAEKGGEFFIDTDKQLRAA SPIPSPQSSGDFAALGQV+GGFVMSCRVQGEG+SSKP VQLAMRHVTCIEDAAIF
Subjt:  TVCCRFHLLSETQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAMSPIPSPQSSGDFAALGQVIGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF

Query:  MAVAAAVDLSIEACRPFRRKIRRAPRHS
        MA+AAAVDLSIEACRPFRRKIRR PRHS
Subjt:  MAVAAAVDLSIEACRPFRRKIRRAPRHS

A0A1S3CCZ5 uncharacterized protein LOC103499002 | 2.0e-239 | 94.16
Query:  MDPQAFIRLSIGSLGLRIPGTAINSTKPGVSALSSPCSCEIRLRGFPVQTSSIPVVLSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        MDPQAFIRLSIGSLGLRIPGT++NSTKPGV+A SSPCSCEIRLRGFP+QTSSIP+V SPEAIPDSH IASSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRIPGTAINSTKPGVSALSSPCSCEIRLRGFPVQTSSIPVVLSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVVLFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLRGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDV PEWGDGKPV+LFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQF+DVTR SPQIVQL+GSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVVLFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLRGSIKQPIFSCKFS

Query:  RDRVSQADSLSNYWSGSGDGSDVEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVSIPDSWQPWGKLEAWRERGIRD
        RDRVSQADSLSNYWSG GDGSD+E ERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDV IP+SWQPWGKLEAWRERGIRD
Subjt:  RDRVSQADSLSNYWSGSGDGSDVEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVSIPDSWQPWGKLEAWRERGIRD

Query:  TVCCRFHLLSETQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAMSPIPSPQSSGDFAALGQVIGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF
        TVCCRFHLLSE QEGGELLMSEIHINAEKGGEFFIDTDKQLRAA SPIPSPQSSGDFAALGQV+GGFVMSCRVQGEG+SSKP VQLAMRHVTCIEDAAIF
Subjt:  TVCCRFHLLSETQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAMSPIPSPQSSGDFAALGQVIGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF

Query:  MAVAAAVDLSIEACRPFRRKIRRAPRHS
        MA+AAAVDLSIEACRPFRRKIRR PRHS
Subjt:  MAVAAAVDLSIEACRPFRRKIRRAPRHS

A0A6J1DQ78 uncharacterized protein LOC111022801 | 1.6e-239 | 93.46
Query:  MDPQAFIRLSIGSLGLRIPGTAINSTKPGVSALSSPCSCEIRLRGFPVQTSSIPVVLSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        M+PQAFIRLSIGSLGLR+PG ++NSTKPG+++ SSPCSCEIRLRGFPVQ+SSIP++ SPEAIP+SHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRIPGTAINSTKPGVSALSSPCSCEIRLRGFPVQTSSIPVVLSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVVLFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLRGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPV+LFNGWIGIGKSKNENGRHGAELHL+VKLDPDPRYVFQFEDVTRLSPQIVQL+GSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVVLFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLRGSIKQPIFSCKFS

Query:  RDRVSQADSLSNYWSGSGDGSDVEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVSIPDSWQPWGKLEAWRERGIRD
        RDRVSQADSLSNYWSGS DG D+EAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARS+PGSWLIVRPDV IP+SWQPWGKLEAWRERGIRD
Subjt:  RDRVSQADSLSNYWSGSGDGSDVEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVSIPDSWQPWGKLEAWRERGIRD

Query:  TVCCRFHLLSETQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAMSPIPSPQSSGDFAALGQVIGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF
        TVCCRFHLLSE QEGGELLMSEIHINAEKGGEFFIDTDKQLRAA SPIPSPQSSGDFAALGQV+GGFVMSCRVQGEGKSSKPMVQLAMRHVTC+EDAAIF
Subjt:  TVCCRFHLLSETQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAMSPIPSPQSSGDFAALGQVIGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF

Query:  MAVAAAVDLSIEACRPFRRKIRRAPRHS
        MA+AAAVDLSIEACRPFRRKIRRAPRHS
Subjt:  MAVAAAVDLSIEACRPFRRKIRRAPRHS

A0A6J1HM61 uncharacterized protein LOC111464238 | 3.9e-251 | 100
Query:  MDPQAFIRLSIGSLGLRIPGTAINSTKPGVSALSSPCSCEIRLRGFPVQTSSIPVVLSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        MDPQAFIRLSIGSLGLRIPGTAINSTKPGVSALSSPCSCEIRLRGFPVQTSSIPVVLSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRIPGTAINSTKPGVSALSSPCSCEIRLRGFPVQTSSIPVVLSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVVLFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLRGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVVLFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLRGSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVVLFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLRGSIKQPIFSCKFS

Query:  RDRVSQADSLSNYWSGSGDGSDVEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVSIPDSWQPWGKLEAWRERGIRD
        RDRVSQADSLSNYWSGSGDGSDVEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVSIPDSWQPWGKLEAWRERGIRD
Subjt:  RDRVSQADSLSNYWSGSGDGSDVEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVSIPDSWQPWGKLEAWRERGIRD

Query:  TVCCRFHLLSETQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAMSPIPSPQSSGDFAALGQVIGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF
        TVCCRFHLLSETQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAMSPIPSPQSSGDFAALGQVIGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF
Subjt:  TVCCRFHLLSETQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAMSPIPSPQSSGDFAALGQVIGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF

Query:  MAVAAAVDLSIEACRPFRRKIRRAPRHS
        MAVAAAVDLSIEACRPFRRKIRRAPRHS
Subjt:  MAVAAAVDLSIEACRPFRRKIRRAPRHS

A0A6J1KCE0 uncharacterized protein LOC111494335 | 4.8e-249 | 98.83
Query:  MDPQAFIRLSIGSLGLRIPGTAINSTKPGVSALSSPCSCEIRLRGFPVQTSSIPVVLSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        MDPQAFIRLSIGSLGLRIPGTAINSTKPGV ALSSPCSCEIRLRGFPVQTSSIPVVLSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRIPGTAINSTKPGVSALSSPCSCEIRLRGFPVQTSSIPVVLSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVVLFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLRGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQL+GTFKLDVGPEWGDGKPVVLFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLRGSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVVLFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLRGSIKQPIFSCKFS

Query:  RDRVSQADSLSNYWSGSGDGSDVEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVSIPDSWQPWGKLEAWRERGIRD
        RDRVSQADSLSNYWSGSGDGSDVEA+RRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVSIPDSWQPWGKLEAWRERGIRD
Subjt:  RDRVSQADSLSNYWSGSGDGSDVEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVSIPDSWQPWGKLEAWRERGIRD

Query:  TVCCRFHLLSETQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAMSPIPSPQSSGDFAALGQVIGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF
        TVCCRFHLLSETQEGGELLMSEIHINAEKGGEFFIDTDKQLRAA SPIPSPQSSGDFAALGQVIGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF
Subjt:  TVCCRFHLLSETQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAMSPIPSPQSSGDFAALGQVIGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF

Query:  MAVAAAVDLSIEACRPFRRKIRRAPRHS
        MAVAAAVDLSIEACRPFRRKIRRAP+HS
Subjt:  MAVAAAVDLSIEACRPFRRKIRRAPRHS

SwissProt top hits | e value | % identity
No hits found
Arabidopsis top hits | e value | % identity
AT1G10020.1 Protein of unknown function (DUF1005) | 3.7e-92 | 41.29
Query:  MDPQAFIRLSIGSLGLRIPGTAINSTKPGVSALSSPCSCEIRLRGFPVQTSSIPVV-LSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISV
        MDP  FIRL+IG+L L++P  A  +T   V   SSPC C+I+L+ FP QT++IP + L     P+  ++A++F+L  SD++  LA    + +  CL+I +
Subjt:  MDPQAFIRLSIGSLGLRIPGTAINSTKPGVSALSSPCSCEIRLRGFPVQTSSIPVV-LSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISV

Query:  FSGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVVLFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLRGSIKQPIFSCKF
        ++GR G+ CGV   R L+    + +       KP V  NGWI +GK   ++    A+ HL VK +PDPR+VFQF+     SPQ+VQ++G+I+QP+F+CKF
Subjt:  FSGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVVLFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLRGSIKQPIFSCKF

Query:  S----RDRVSQADSL------SNYWSGSGDGSDVEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVSIPDSWQPWGK
        S     DR  ++ SL      S  W  S  GS+ E   +ERKGW + +HDLSGS VA A I TPFV S G D V+RSNPGSWLI+RP      +W+PWG+
Subjt:  S----RDRVSQADSL------SNYWSGSGDGSDVEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVSIPDSWQPWGK

Query:  LEAWRER-GIRDTVCCRFHLLSETQEGGELLMSEIHINAEKGGEFFIDTDKQLRAA----------------------MSPIPSPQ-SSGDFAA---LGQ
        LEAWRER G  D +  RF L+ +   G  ++++E  I++ +GG+F I+      ++                       SP  SP+  SGD+        
Subjt:  LEAWRER-GIRDTVCCRFHLLSETQEGGELLMSEIHINAEKGGEFFIDTDKQLRAA----------------------MSPIPSPQ-SSGDFAA---LGQ

Query:  VIGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIFMAVAAAVDLSIEACRPFRRKIRRAPRH
        V  GFVMS  V+GEGK SKP V+++++HV+C+EDAA ++A++AA+DLS++ACR F +++R+   H
Subjt:  VIGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIFMAVAAAVDLSIEACRPFRRKIRRAPRH

AT1G50040.1 Protein of unknown function (DUF1005) | 1.7e-73 | 37.87
Query:  MDPQAFIRLSIGSLGLRIP------GTAINSTKPGVSALSS-PCSCEIRLRGFPVQTSSIPVVLSPEAIPDSH-------SIASSFYLEESDLKALLAPG
        MDP +F+R+ +G+L +R P       ++ +S+ P VS +SS  C C+I+ + FP Q  S+PV+L  E+  +S        ++A+ F L +S ++  L   
Subjt:  MDPQAFIRLSIGSLGLRIP------GTAINSTKPGVSALSS-PCSCEIRLRGFPVQTSSIPVVLSPEAIPDSH-------SIASSFYLEESDLKALLAPG

Query:  CFYNTHACLEISVFSGRKGSHCG--VGIKRQLIGTFKLDVGPEWGDGKPVVLFNGWIGIGKSKNENGRHGA--ELHLRVKLDPDPRYVFQFEDVTRLSPQ
         +    + L + V+S R+ + CG       +LIG F++ +  +  + K  +  NGW+ +G     N + G+  ELH+ V+++PD R+VFQF+     SPQ
Subjt:  CFYNTHACLEISVFSGRKGSHCG--VGIKRQLIGTFKLDVGPEWGDGKPVVLFNGWIGIGKSKNENGRHGA--ELHLRVKLDPDPRYVFQFEDVTRLSPQ

Query:  IVQLRGSIKQPIFSCKF----SRDRVSQADSLSNYWSGSGDGSDVEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDV
        + Q++G+ KQ +F+CKF    S DR + + SLS+  SG       E   +ERKGW + IHDLSGS VA A + TPFVPS G + V+RS+PG+WLI+RPD 
Subjt:  IVQLRGSIKQPIFSCKF----SRDRVSQADSLSNYWSGSGDGSDVEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDV

Query:  SIPDSWQPWGKLEAWRERGIRDTVCCRFHLLSETQEGGELLMS-EIHINAEKGGEFFIDTDKQLR--------------AAMSPIPSPQ----SSGDF--
            +W+PW +L+AWRE G+ D +  RF L    ++G  + +S    I+ + GG F ID                    ++ S I S +    S  DF  
Subjt:  SIPDSWQPWGKLEAWRERGIRDTVCCRFHLLSETQEGGELLMS-EIHINAEKGGEFFIDTDKQLR--------------AAMSPIPSPQ----SSGDF--

Query:  -AALGQVIGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIFMAVAAAVDLSIEACRPFRRKIRRAPR
          +  Q   GFVMS RVQG  K SKP V++ ++HVTC EDAA  +A+AAAVDLS++ACR F +K+R   R
Subjt:  -AALGQVIGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIFMAVAAAVDLSIEACRPFRRKIRRAPR

AT3G19680.1 Protein of unknown function (DUF1005) | 3.5e-74 | 35.22
Query:  MDPQAFIRLSIGSLGLRIPGTAINSTK------PGVSALSSPCSCEIRLRGFPVQTSSIPVVLSPEAIPDSH--------SIASSFYLEESDLKALLAPG
        MDP +F+R+ +G+L +R P ++ +S+        G++  +  C C+IR + FP +  S+PV+   E+  ++         ++A+ F L ++ ++A L   
Subjt:  MDPQAFIRLSIGSLGLRIPGTAINSTK------PGVSALSSPCSCEIRLRGFPVQTSSIPVVLSPEAIPDSH--------SIASSFYLEESDLKALLAPG

Query:  CFYNTHACLEISVFS--------GRKGSHCGVGIK-RQLIGTFKLDVGPEWGDGKPVVLFNGWIGI--GKSKNENGRHGAELHLRVKLDPDPRYVFQFED
         F    + L +  +S        G  G+ CG+     +L+G F++ +  +  + K  +  NGW+ +   K+K++ G    ELH+ V+++PDPR+VFQF+ 
Subjt:  CFYNTHACLEISVFS--------GRKGSHCGVGIK-RQLIGTFKLDVGPEWGDGKPVVLFNGWIGI--GKSKNENGRHGAELHLRVKLDPDPRYVFQFED

Query:  VTRLSPQIVQLRGSIKQPIFSCKF--------SRDRVSQADSLSNYWSG----SGDGSDVEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWV
            SPQ+ Q++G+ KQ +F+CKF         R+ +  +  +S   S     S   S+ E   +ERKGW + +HDLSGS VA A + TPFVPS G + V
Subjt:  VTRLSPQIVQLRGSIKQPIFSCKF--------SRDRVSQADSLSNYWSG----SGDGSDVEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWV

Query:  ARSNPGSWLIVRPDVSIPDSWQPWGKLEAWRERGIRDTVCCRFHLLSETQEG-GELLMSEIHINAEKGGEFFID-TDKQLRAAMSPIPSPQSSGDF----
         RS+PG+WLI+RPD     +W+PWG+LEAWRE G  DT+  RF L    Q+G    + +   I+ + GG F ID T      A +P  SPQ S D     
Subjt:  ARSNPGSWLIVRPDVSIPDSWQPWGKLEAWRERGIRDTVCCRFHLLSETQEG-GELLMSEIHINAEKGGEFFID-TDKQLRAAMSPIPSPQSSGDF----

Query:  -----------AALGQVIG--------------GFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIFMAVAAAVDLSIEACRPFRRKIRRAPR
                   +  G   G              GFVMS  V+G GK SKP V++ + HVTC EDAA  +A+AAAVDLS++ACR F  K+R+  R
Subjt:  -----------AALGQVIG--------------GFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIFMAVAAAVDLSIEACRPFRRKIRRAPR

AT4G29310.1 Protein of unknown function (DUF1005) | 3.1e-83 | 39.64
Query:  MDPQAFIRLSIGSLGLRIPGTAINSTKPG-VSALSSPCSCEIRLRGFPVQTSSIPVVLSPEAI--PDSHSIASSFYLEESDLKALLAPGCFYNTHACLEI
        MDP  F+RL+I SL LR+P TA N    G V   S+PC C++R++ FP Q + +P+    +A   P+S + A  F+L+   ++ +            L +
Subjt:  MDPQAFIRLSIGSLGLRIPGTAINSTKPG-VSALSSPCSCEIRLRGFPVQTSSIPVVLSPEAI--PDSHSIASSFYLEESDLKALLAPGCFYNTHACLEI

Query:  SVFSGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVVLFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLRGSIKQPIFSC
        SV++GR G  CGV    +L+G  ++ V       + V   NGW  +G    +  +  A LHL V  +PDPR+VFQF      SP + Q++ ++KQP+FSC
Subjt:  SVFSGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVVLFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLRGSIKQPIFSC

Query:  KFSRDRVSQADSLSNYWSGSGDG---------SDVEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVSIPDSWQPWG
        KFS DR  ++ SL + ++ S  G            + + RERKGW + IHDLSGS VAAA + TPFV S G D V+RSNPG+WLI+RP  +   SW+PWG
Subjt:  KFSRDRVSQADSLSNYWSGSGDG---------SDVEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVSIPDSWQPWG

Query:  KLEAWRERGIRDTVCCRFHLLSETQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAMSPIPSPQSSGDFAALGQVIGGFVMSCRVQGEGKSSKPMVQLAM
        +LEAWRERG  D +  +F L+ +      + ++E  ++ ++GG+F ID                  G+  A+   + GFVM   V+GEGK SKP+V +  
Subjt:  KLEAWRERGIRDTVCCRFHLLSETQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAMSPIPSPQSSGDFAALGQVIGGFVMSCRVQGEGKSSKPMVQLAM

Query:  RHVTCIEDAAIFMAVAAAVDLSIEACRPFRRKIRRAPRH
        +HVTC+ DAA+F+A++AAVDLS++AC+ F RK+R+   H
Subjt:  RHVTCIEDAAIFMAVAAAVDLSIEACRPFRRKIRRAPRH

AT5G17640.1 Protein of unknown function (DUF1005) | 4.8e-193 | 76.62
Query:  MDPQAFIRLSIGSLGLRIPGTAINSTKPG--VSALSSPCSCEIRLRGFPVQTSSIPVVLSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEIS
        MDPQAFIRLS+GSL LRIP   INST         SS CSCEI+LRGFPVQT+SIP++ S +A PD HSI++SFYLEESDL+ALL PGCFY+ HA LEIS
Subjt:  MDPQAFIRLSIGSLGLRIPGTAINSTKPG--VSALSSPCSCEIRLRGFPVQTSSIPVVLSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEIS

Query:  VFSGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVVLFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLRGSIKQPIFSCK
        VF+G+K  +CGVG KRQ IG FKL+VGPEWG+GKP++LFNGWI IGK+K +     AELHL+VKLDPDPRYVFQFEDVT LSPQIVQLRGS+KQPIFSCK
Subjt:  VFSGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVVLFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLRGSIKQPIFSCK

Query:  FSRDRVSQADSLSNYWSGSGDGSDVEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVSIPDSWQPWGKLEAWRERGI
        FSRDRVSQ D L+ YWS SGDG+++E+ERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVA+SNPG+WL+VRPD S P+SWQPWGKLEAWRERGI
Subjt:  FSRDRVSQADSLSNYWSGSGDGSDVEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVSIPDSWQPWGKLEAWRERGI

Query:  RDTVCCRFHLLSETQEGGELLMSEIHINAEKGGEFFIDTDKQ-LRAAMSPIPSPQSSGDFAALGQVI--GGFVMSCRVQGEGKSSKPMVQLAMRHVTCIE
        RD+VCCRFHLLS   E G++LMSEI I+AEKGGEF IDTDKQ L  A +PIPSPQSSGDF+ LGQ +  GGFVMS RVQGEGKSSKP+VQLAMRHVTC+E
Subjt:  RDTVCCRFHLLSETQEGGELLMSEIHINAEKGGEFFIDTDKQ-LRAAMSPIPSPQSSGDFAALGQVI--GGFVMSCRVQGEGKSSKPMVQLAMRHVTCIE

Query:  DAAIFMAVAAAVDLSIEACRPFRRKIRRAPRH
        DAAIFMA+AAAVDLSI AC+PFRR  RR  RH
Subjt:  DAAIFMAVAAAVDLSIEACRPFRRKIRRAPRH


Sequences
CDS sequence
ATGGATCCTCAGGCTTTTATTCGGTTGTCAATTGGTTCTTTGGGATTGAGAATCCCAGGAACTGCTATAAATTCTACAAAACCTGGAGTCAGTGCTTTGTCTTCTCCGTG
TTCGTGTGAAATTCGTCTTCGGGGTTTCCCTGTGCAGACATCTTCAATCCCAGTAGTCCTGTCTCCTGAAGCCATACCTGATTCTCATAGCATTGCTTCAAGCTTCTATC
TTGAAGAGTCTGATTTGAAAGCATTACTGGCACCTGGTTGCTTCTATAATACTCATGCCTGTCTTGAAATATCTGTCTTCTCTGGGAGGAAGGGATCTCACTGCGGTGTT
GGCATCAAAAGGCAGCTGATCGGGACGTTTAAACTGGATGTCGGTCCCGAATGGGGTGATGGGAAGCCAGTCGTTCTTTTCAATGGGTGGATAGGCATTGGCAAAAGTAA
GAATGAGAATGGAAGACATGGAGCAGAGCTTCATTTGAGAGTGAAACTAGATCCTGATCCAAGATATGTTTTCCAGTTTGAAGATGTCACGAGGTTAAGCCCGCAAATCG
TCCAGCTTCGAGGCTCGATCAAACAGCCGATCTTCAGCTGCAAATTTAGTCGAGACAGGGTGTCCCAGGCGGATTCATTGAGCAACTATTGGTCAGGCTCTGGTGATGGC
TCGGATGTCGAGGCCGAGCGGAGGGAAAGAAAAGGCTGGAAGGTGAAGATACATGATCTTTCTGGCTCGGCTGTTGCAGCAGCCTTCATAACTACTCCCTTTGTGCCATC
AACAGGTTGCGATTGGGTTGCCAGGTCGAACCCCGGGTCCTGGCTGATCGTTCGTCCTGATGTTTCCATTCCTGATAGTTGGCAGCCATGGGGAAAGCTCGAGGCATGGC
GCGAGCGTGGGATTAGAGACACTGTCTGCTGTCGCTTTCACCTTCTCTCTGAAACACAAGAGGGAGGAGAACTTCTCATGTCTGAGATCCATATCAATGCCGAGAAAGGT
GGCGAATTCTTCATCGACACTGACAAACAGTTACGAGCTGCAATGAGTCCAATACCAAGCCCGCAGAGTAGCGGAGACTTTGCAGCATTAGGCCAAGTAATCGGAGGCTT
CGTCATGAGCTGCAGAGTACAAGGAGAAGGAAAGAGCAGTAAGCCAATGGTTCAACTAGCAATGCGACATGTGACATGTATAGAGGACGCTGCCATTTTCATGGCGGTTG
CAGCTGCAGTTGATCTCAGCATCGAGGCATGTAGGCCATTCCGTAGGAAGATTCGGCGAGCACCTCGACATTCTTAG
mRNA sequence
ATGGATCCTCAGGCTTTTATTCGGTTGTCAATTGGTTCTTTGGGATTGAGAATCCCAGGAACTGCTATAAATTCTACAAAACCTGGAGTCAGTGCTTTGTCTTCTCCGTG
TTCGTGTGAAATTCGTCTTCGGGGTTTCCCTGTGCAGACATCTTCAATCCCAGTAGTCCTGTCTCCTGAAGCCATACCTGATTCTCATAGCATTGCTTCAAGCTTCTATC
TTGAAGAGTCTGATTTGAAAGCATTACTGGCACCTGGTTGCTTCTATAATACTCATGCCTGTCTTGAAATATCTGTCTTCTCTGGGAGGAAGGGATCTCACTGCGGTGTT
GGCATCAAAAGGCAGCTGATCGGGACGTTTAAACTGGATGTCGGTCCCGAATGGGGTGATGGGAAGCCAGTCGTTCTTTTCAATGGGTGGATAGGCATTGGCAAAAGTAA
GAATGAGAATGGAAGACATGGAGCAGAGCTTCATTTGAGAGTGAAACTAGATCCTGATCCAAGATATGTTTTCCAGTTTGAAGATGTCACGAGGTTAAGCCCGCAAATCG
TCCAGCTTCGAGGCTCGATCAAACAGCCGATCTTCAGCTGCAAATTTAGTCGAGACAGGGTGTCCCAGGCGGATTCATTGAGCAACTATTGGTCAGGCTCTGGTGATGGC
TCGGATGTCGAGGCCGAGCGGAGGGAAAGAAAAGGCTGGAAGGTGAAGATACATGATCTTTCTGGCTCGGCTGTTGCAGCAGCCTTCATAACTACTCCCTTTGTGCCATC
AACAGGTTGCGATTGGGTTGCCAGGTCGAACCCCGGGTCCTGGCTGATCGTTCGTCCTGATGTTTCCATTCCTGATAGTTGGCAGCCATGGGGAAAGCTCGAGGCATGGC
GCGAGCGTGGGATTAGAGACACTGTCTGCTGTCGCTTTCACCTTCTCTCTGAAACACAAGAGGGAGGAGAACTTCTCATGTCTGAGATCCATATCAATGCCGAGAAAGGT
GGCGAATTCTTCATCGACACTGACAAACAGTTACGAGCTGCAATGAGTCCAATACCAAGCCCGCAGAGTAGCGGAGACTTTGCAGCATTAGGCCAAGTAATCGGAGGCTT
CGTCATGAGCTGCAGAGTACAAGGAGAAGGAAAGAGCAGTAAGCCAATGGTTCAACTAGCAATGCGACATGTGACATGTATAGAGGACGCTGCCATTTTCATGGCGGTTG
CAGCTGCAGTTGATCTCAGCATCGAGGCATGTAGGCCATTCCGTAGGAAGATTCGGCGAGCACCTCGACATTCTTAGAGAGAAAACAGAAGGAAAAAAGGGTCAGAATAA
GTGCTTTTAAGGTAAAATTCATGTGTAGAAACATAAACAAAGATGGGGCATACATCATTATGAAACTTATCTTTGAGGGCTATAAAATTTGGCATTTTGTACAGTATCAG
CTGATGTTCTTGCTTATCTTAAAATTTAAAGTGTGTATATATATATATATAATTTAGCATAATGGCTGGTTGAGGTGGATTCACTAGCCAAGCATAG
Protein sequence
MDPQAFIRLSIGSLGLRIPGTAINSTKPGVSALSSPCSCEIRLRGFPVQTSSIPVVLSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVFSGRKGSHCGV
GIKRQLIGTFKLDVGPEWGDGKPVVLFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLRGSIKQPIFSCKFSRDRVSQADSLSNYWSGSGDG
SDVEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVSIPDSWQPWGKLEAWRERGIRDTVCCRFHLLSETQEGGELLMSEIHINAEKG
GEFFIDTDKQLRAAMSPIPSPQSSGDFAALGQVIGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIFMAVAAAVDLSIEACRPFRRKIRRAPRHS
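
The CDS and protein sequences above can be cross-checked by translating the CDS. The following is a minimal sketch that assumes the standard nuclear genetic code (NCBI translation table 1) and uses only the first 30 and last 12 nucleotides of the CDS for brevity. The function name `translate` is illustrative, and the codon table is built from the conventional TCAG-ordered amino acid string.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (first, second, third base).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna):
    """Translate a DNA string codon by codon; '*' marks a stop, 'X' an unknown codon.

    Trailing bases that do not fill a complete codon are ignored.
    """
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, usable, 3)
    )

cds_start = "ATGGATCCTCAGGCTTTTATTCGGTTGTCA"  # first 30 nt of the CDS above
cds_end = "CGACATTCTTAG"                       # last 12 nt of the CDS above
print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to the first ten residues of the listed protein and to its last three residues followed by the stop codon. Passing the full CDS (with line breaks removed) would allow the same comparison over the whole protein.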