; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10010828 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10010828
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionNuclear factor 1 A-type isoform 2
Genome locationChr06:26285153..26286585
RNA-Seq ExpressionHG10010828
SyntenyHG10010828
Gene Ontology termsNA
InterPro domainsIPR010410 - Protein of unknown function DUF1005


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039981.1 Nuclear factor 1 A-type isoform 2 [Cucumis melo var. makuwa]1.8e-24596.73Show/hide
Query:  MDPQAFIRLSIGSLGLRIPGAALNSRKPGVNAFSSPCSCEIRLRGFPVQTSSIPLVPSPESIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        MDPQAFIRLSIGSLGLRIPG +LNS KPGVNAFSSPCSCEIRLRGFP+QTSSIPL+PSPE+IPDSH IASSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRIPGAALNSRKPGVNAFSSPCSCEIRLRGFPVQTSSIPLVPSPESIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKTKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGK+KNENGRHGAELHLRVKLDPDPRYVFQF+DVTR SPQIVQLQGSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKTKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS

Query:  RDRVSQADSLSNYWSGPGDGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
        RDRVSQADSLSNYWSG GDGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
Subjt:  RDRVSQADSLSNYWSGPGDGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD

Query:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF
        TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEG+SSKP VQLAMRHVTCIEDAAIF
Subjt:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF

Query:  MALAAAVDLSIEACRPFRRKIRRAPRHS
        MALAAAVDLSIEACRPFRRKIRR PRHS
Subjt:  MALAAAVDLSIEACRPFRRKIRRAPRHS

XP_004153015.1 uncharacterized protein LOC101204096 [Cucumis sativus]1.8e-24596.5Show/hide
Query:  MDPQAFIRLSIGSLGLRIPGAALNSRKPGVNAFSSPCSCEIRLRGFPVQTSSIPLVPSPESIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        MDPQAFIRLSIGSLGLRIPG +LNS KPGVNAFSSPCSCEIRLRGFP+QTSSIPLVPSPE+IPDSH IASSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRIPGAALNSRKPGVNAFSSPCSCEIRLRGFPVQTSSIPLVPSPESIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKTKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGK+KNENGRHGAELHLRVKLDPDPRYVFQF+DVTR SPQ+VQL+GSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKTKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS

Query:  RDRVSQADSLSNYWSGPGDGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
        RDRVSQADSLSNYWSG GDGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
Subjt:  RDRVSQADSLSNYWSGPGDGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD

Query:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF
        TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEG+SSKP VQLAMRHVTCIEDAAIF
Subjt:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF

Query:  MALAAAVDLSIEACRPFRRKIRRAPRHS
        MALAAAVDLSIEACRPFRRKIRR PRHS
Subjt:  MALAAAVDLSIEACRPFRRKIRRAPRHS

XP_008460086.1 PREDICTED: uncharacterized protein LOC103499002 [Cucumis melo]7.9e-24696.96Show/hide
Query:  MDPQAFIRLSIGSLGLRIPGAALNSRKPGVNAFSSPCSCEIRLRGFPVQTSSIPLVPSPESIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        MDPQAFIRLSIGSLGLRIPG +LNS KPGVNAFSSPCSCEIRLRGFP+QTSSIPLVPSPE+IPDSH IASSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRIPGAALNSRKPGVNAFSSPCSCEIRLRGFPVQTSSIPLVPSPESIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKTKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGK+KNENGRHGAELHLRVKLDPDPRYVFQF+DVTR SPQIVQLQGSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKTKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS

Query:  RDRVSQADSLSNYWSGPGDGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
        RDRVSQADSLSNYWSG GDGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
Subjt:  RDRVSQADSLSNYWSGPGDGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD

Query:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF
        TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEG+SSKP VQLAMRHVTCIEDAAIF
Subjt:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF

Query:  MALAAAVDLSIEACRPFRRKIRRAPRHS
        MALAAAVDLSIEACRPFRRKIRR PRHS
Subjt:  MALAAAVDLSIEACRPFRRKIRRAPRHS

XP_023514401.1 uncharacterized protein LOC111778675 [Cucurbita pepo subsp. pepo]3.7e-24395.56Show/hide
Query:  MDPQAFIRLSIGSLGLRIPGAALNSRKPGVNAFSSPCSCEIRLRGFPVQTSSIPLVPSPESIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        MDPQAFIRLSIGSLGLRIPG A+NS KPGV+A SSPCSCEIRLRGFPVQTSSIP+V SPE+IPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRIPGAALNSRKPGVNAFSSPCSCEIRLRGFPVQTSSIPLVPSPESIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKTKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDV PEWGDGKPV+LFNGWIGIGKTKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQL+GSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKTKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS

Query:  RDRVSQADSLSNYWSGPGDGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
        RDRVSQADSLSNYWSG GDGSD+E ERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDV IP+SWQPWGKLEAWRERGIRD
Subjt:  RDRVSQADSLSNYWSGPGDGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD

Query:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF
        TVCCRFHLLSE QEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQV+GGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF
Subjt:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF

Query:  MALAAAVDLSIEACRPFRRKIRRAPRHS
        MA+AAAVDLSIEACRPFRRKIRRAPRHS
Subjt:  MALAAAVDLSIEACRPFRRKIRRAPRHS

XP_038874344.1 uncharacterized protein LOC120067041 [Benincasa hispida]1.2e-24998.36Show/hide
Query:  MDPQAFIRLSIGSLGLRIPGAALNSRKPGVNAFSSPCSCEIRLRGFPVQTSSIPLVPSPESIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        MDPQAFIRLSIGSLGLRIPG +LNS KPGVNAFSSPCSCEIRLRGFPVQTSSIPLVPSPE+IPDSHSI SSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRIPGAALNSRKPGVNAFSSPCSCEIRLRGFPVQTSSIPLVPSPESIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKTKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDV PEWGDGKPVILFNGWIGIGK+KNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKTKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS

Query:  RDRVSQADSLSNYWSGPGDGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
        RDRVSQADSLSNYWSGPGDGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
Subjt:  RDRVSQADSLSNYWSGPGDGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD

Query:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF
        TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF
Subjt:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF

Query:  MALAAAVDLSIEACRPFRRKIRRAPRHS
        MALAAAVDLSIEACRPFRRKIRRAPRHS
Subjt:  MALAAAVDLSIEACRPFRRKIRRAPRHS

TrEMBL top hitse value%identityAlignment
A0A0A0K922 Uncharacterized protein8.5e-24696.5Show/hide
Query:  MDPQAFIRLSIGSLGLRIPGAALNSRKPGVNAFSSPCSCEIRLRGFPVQTSSIPLVPSPESIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        MDPQAFIRLSIGSLGLRIPG +LNS KPGVNAFSSPCSCEIRLRGFP+QTSSIPLVPSPE+IPDSH IASSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRIPGAALNSRKPGVNAFSSPCSCEIRLRGFPVQTSSIPLVPSPESIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKTKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGK+KNENGRHGAELHLRVKLDPDPRYVFQF+DVTR SPQ+VQL+GSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKTKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS

Query:  RDRVSQADSLSNYWSGPGDGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
        RDRVSQADSLSNYWSG GDGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
Subjt:  RDRVSQADSLSNYWSGPGDGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD

Query:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF
        TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEG+SSKP VQLAMRHVTCIEDAAIF
Subjt:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF

Query:  MALAAAVDLSIEACRPFRRKIRRAPRHS
        MALAAAVDLSIEACRPFRRKIRR PRHS
Subjt:  MALAAAVDLSIEACRPFRRKIRRAPRHS

A0A1S3CCZ5 uncharacterized protein LOC1034990023.8e-24696.96Show/hide
Query:  MDPQAFIRLSIGSLGLRIPGAALNSRKPGVNAFSSPCSCEIRLRGFPVQTSSIPLVPSPESIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        MDPQAFIRLSIGSLGLRIPG +LNS KPGVNAFSSPCSCEIRLRGFP+QTSSIPLVPSPE+IPDSH IASSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRIPGAALNSRKPGVNAFSSPCSCEIRLRGFPVQTSSIPLVPSPESIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKTKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGK+KNENGRHGAELHLRVKLDPDPRYVFQF+DVTR SPQIVQLQGSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKTKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS

Query:  RDRVSQADSLSNYWSGPGDGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
        RDRVSQADSLSNYWSG GDGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
Subjt:  RDRVSQADSLSNYWSGPGDGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD

Query:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF
        TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEG+SSKP VQLAMRHVTCIEDAAIF
Subjt:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF

Query:  MALAAAVDLSIEACRPFRRKIRRAPRHS
        MALAAAVDLSIEACRPFRRKIRR PRHS
Subjt:  MALAAAVDLSIEACRPFRRKIRRAPRHS

A0A5A7TE92 Nuclear factor 1 A-type isoform 28.5e-24696.73Show/hide
Query:  MDPQAFIRLSIGSLGLRIPGAALNSRKPGVNAFSSPCSCEIRLRGFPVQTSSIPLVPSPESIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        MDPQAFIRLSIGSLGLRIPG +LNS KPGVNAFSSPCSCEIRLRGFP+QTSSIPL+PSPE+IPDSH IASSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRIPGAALNSRKPGVNAFSSPCSCEIRLRGFPVQTSSIPLVPSPESIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKTKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGK+KNENGRHGAELHLRVKLDPDPRYVFQF+DVTR SPQIVQLQGSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKTKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS

Query:  RDRVSQADSLSNYWSGPGDGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
        RDRVSQADSLSNYWSG GDGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
Subjt:  RDRVSQADSLSNYWSGPGDGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD

Query:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF
        TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEG+SSKP VQLAMRHVTCIEDAAIF
Subjt:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF

Query:  MALAAAVDLSIEACRPFRRKIRRAPRHS
        MALAAAVDLSIEACRPFRRKIRR PRHS
Subjt:  MALAAAVDLSIEACRPFRRKIRRAPRHS

A0A6J1DQ78 uncharacterized protein LOC1110228013.9e-24394.86Show/hide
Query:  MDPQAFIRLSIGSLGLRIPGAALNSRKPGVNAFSSPCSCEIRLRGFPVQTSSIPLVPSPESIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        M+PQAFIRLSIGSLGLR+PG +LNS KPG+N+FSSPCSCEIRLRGFPVQ+SSIP++PSPE+IP+SHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRIPGAALNSRKPGVNAFSSPCSCEIRLRGFPVQTSSIPLVPSPESIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKTKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDV PEWGDGKPVILFNGWIGIGK+KNENGRHGAELHL+VKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKTKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS

Query:  RDRVSQADSLSNYWSGPGDGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
        RDRVSQADSLSNYWSG  DG DLE ERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARS+PGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
Subjt:  RDRVSQADSLSNYWSGPGDGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD

Query:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF
        TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAA SPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTC+EDAAIF
Subjt:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF

Query:  MALAAAVDLSIEACRPFRRKIRRAPRHS
        MALAAAVDLSIEACRPFRRKIRRAPRHS
Subjt:  MALAAAVDLSIEACRPFRRKIRRAPRHS

A0A6J1HM61 uncharacterized protein LOC1114642382.6e-24295.09Show/hide
Query:  MDPQAFIRLSIGSLGLRIPGAALNSRKPGVNAFSSPCSCEIRLRGFPVQTSSIPLVPSPESIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        MDPQAFIRLSIGSLGLRIPG A+NS KPGV+A SSPCSCEIRLRGFPVQTSSIP+V SPE+IPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRIPGAALNSRKPGVNAFSSPCSCEIRLRGFPVQTSSIPLVPSPESIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKTKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDV PEWGDGKPV+LFNGWIGIGK+KNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQL+GSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKTKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS

Query:  RDRVSQADSLSNYWSGPGDGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
        RDRVSQADSLSNYWSG GDGSD+E ERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDV IP+SWQPWGKLEAWRERGIRD
Subjt:  RDRVSQADSLSNYWSGPGDGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD

Query:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF
        TVCCRFHLLSE QEGGELLMSEIHINAEKGGEFFIDTDKQLRAA SPIPSPQSSGDFAALGQV+GGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF
Subjt:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF

Query:  MALAAAVDLSIEACRPFRRKIRRAPRHS
        MA+AAAVDLSIEACRPFRRKIRRAPRHS
Subjt:  MALAAAVDLSIEACRPFRRKIRRAPRHS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G10020.1 Protein of unknown function (DUF1005)1.0e-9441.51Show/hide
Query:  MDPQAFIRLSIGSLGLRIPGAALNSRKPGVNAFSSPCSCEIRLRGFPVQTSSIPLVP-SPESIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISV
        MDP  FIRL+IG+L L++P AA  +    V+  SSPC C+I+L+ FP QT++IP +P      P+  ++A++F+L  SD++  LA    + +  CL+I +
Subjt:  MDPQAFIRLSIGSLGLRIPGAALNSRKPGVNAFSSPCSCEIRLRGFPVQTSSIPLVP-SPESIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISV

Query:  FSGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKTKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKF
        ++GR G+ CGV   R L+    + +       KP +  NGWI +GK   ++    A+ HL VK +PDPR+VFQF+     SPQ+VQ+QG+I+QP+F+CKF
Subjt:  FSGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKTKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKF

Query:  S----RDRVSQADSL------SNYWSGPGDGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGK
        S     DR  ++ SL      S  W     GS+ E   +ERKGW + +HDLSGS VA A I TPFV S G D V+RSNPGSWLI+RP  C   +W+PWG+
Subjt:  S----RDRVSQADSL------SNYWSGPGDGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGK

Query:  LEAWRER-GIRDTVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAT----------------------SPIPSPQ-SSGDFAA---LGQ
        LEAWRER G  D +  RF L+ +   G  ++++E  I++ +GG+F I+      +++                      SP  SP+  SGD+        
Subjt:  LEAWRER-GIRDTVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAT----------------------SPIPSPQ-SSGDFAA---LGQ

Query:  VVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIFMALAAAVDLSIEACRPFRRKIRRAPRH
        V  GFVMS  V+GEGK SKP V+++++HV+C+EDAA ++AL+AA+DLS++ACR F +++R+   H
Subjt:  VVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIFMALAAAVDLSIEACRPFRRKIRRAPRH

AT1G50040.1 Protein of unknown function (DUF1005)5.9e-7436.81Show/hide
Query:  MDPQAFIRLSIGSLGLRIP------GAALNSRKPGVNAFSS-PCSCEIRLRGFPVQTSSIPLVPSPESIPDSH-------SIASSFYLEESDLKALLAPG
        MDP +F+R+ +G+L +R P       ++ +S  P V+  SS  C C+I+ + FP Q  S+P++   ES  +S        ++A+ F L +S ++  L   
Subjt:  MDPQAFIRLSIGSLGLRIP------GAALNSRKPGVNAFSS-PCSCEIRLRGFPVQTSSIPLVPSPESIPDSH-------SIASSFYLEESDLKALLAPG

Query:  CFYNTHACLEISVFSGRKGSHCG--VGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKTKNENGRHGA--ELHLRVKLDPDPRYVFQFEDVTRLSPQ
         +    + L + V+S R+ + CG       +LIG F++ +  +  + K  +  NGW+ +G     N + G+  ELH+ V+++PD R+VFQF+     SPQ
Subjt:  CFYNTHACLEISVFSGRKGSHCG--VGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKTKNENGRHGA--ELHLRVKLDPDPRYVFQFEDVTRLSPQ

Query:  IVQLQGSIKQPIFSCKF----SRDRVSQADSLSNYWSGPGDGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDV
        + Q+QG+ KQ +F+CKF    S DR + + SLS+  SG       E   +ERKGW + IHDLSGS VA A + TPFVPS G + V+RS+PG+WLI+RPD 
Subjt:  IVQLQGSIKQPIFSCKF----SRDRVSQADSLSNYWSGPGDGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDV

Query:  CIPESWQPWGKLEAWRERGIRDTVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVG-------------
            +W+PW +L+AWRE G+ D +  RF L  +       + +   I+ + GG F ID        T+   S + S D ++   +               
Subjt:  CIPESWQPWGKLEAWRERGIRDTVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVG-------------

Query:  ---------GFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIFMALAAAVDLSIEACRPFRRKIRRAPR
                 GFVMS RVQG  K SKP V++ ++HVTC EDAA  +ALAAAVDLS++ACR F +K+R   R
Subjt:  ---------GFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIFMALAAAVDLSIEACRPFRRKIRRAPR

AT3G19680.1 Protein of unknown function (DUF1005)4.4e-7735.96Show/hide
Query:  MDPQAFIRLSIGSLGLRIPGAALNSRK------PGVNAFSSPCSCEIRLRGFPVQTSSIPLVPSPESIPDSH--------SIASSFYLEESDLKALLAPG
        MDP +F+R+ +G+L +R P ++ +S         G+N  +  C C+IR + FP +  S+P++   ES  ++         ++A+ F L ++ ++A L   
Subjt:  MDPQAFIRLSIGSLGLRIPGAALNSRK------PGVNAFSSPCSCEIRLRGFPVQTSSIPLVPSPESIPDSH--------SIASSFYLEESDLKALLAPG

Query:  CFYNTHACLEISVFS--------GRKGSHCGVGIK-RQLIGTFKLDVSPEWGDGKPVILFNGWIGI--GKTKNENGRHGAELHLRVKLDPDPRYVFQFED
         F    + L +  +S        G  G+ CG+     +L+G F++ +  +  + K  +  NGW+ +   KTK++ G    ELH+ V+++PDPR+VFQF+ 
Subjt:  CFYNTHACLEISVFS--------GRKGSHCGVGIK-RQLIGTFKLDVSPEWGDGKPVILFNGWIGI--GKTKNENGRHGAELHLRVKLDPDPRYVFQFED

Query:  VTRLSPQIVQLQGSIKQPIFSCKF--------SRDRVSQADSLSNYWSGPGDGSDLEVER----RERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWV
            SPQ+ Q+QG+ KQ +F+CKF         R+ +  +  +S   S     S +  E+    +ERKGW + +HDLSGS VA A + TPFVPS G + V
Subjt:  VTRLSPQIVQLQGSIKQPIFSCKF--------SRDRVSQADSLSNYWSGPGDGSDLEVER----RERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWV

Query:  ARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRDTVCCRFHLLSEAQEG-GELLMSEIHINAEKGGEFFID-TDKQLRAATSPIPSPQSSGDFAALG
         RS+PG+WLI+RPD C   +W+PWG+LEAWRE G  DT+  RF L    Q+G    + +   I+ + GG F ID T      A++P  SPQ S D  + G
Subjt:  ARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRDTVCCRFHLLSEAQEG-GELLMSEIHINAEKGGEFFID-TDKQLRAATSPIPSPQSSGDFAALG

Query:  QVVG------------------------------GFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIFMALAAAVDLSIEACRPFRRKIRRAPR
           G                              GFVMS  V+G GK SKP V++ + HVTC EDAA  +ALAAAVDLS++ACR F  K+R+  R
Subjt:  QVVG------------------------------GFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIFMALAAAVDLSIEACRPFRRKIRRAPR

AT4G29310.1 Protein of unknown function (DUF1005)2.2e-8440.77Show/hide
Query:  MDPQAFIRLSIGSLGLRIPGAALNSRKPG-VNAFSSPCSCEIRLRGFPVQTSSIPL--VPSPESIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEI
        MDP  F+RL+I SL LR+P  A N +  G V+  S+PC C++R++ FP Q + +PL       S P+S + A  F+L+   ++ +            L +
Subjt:  MDPQAFIRLSIGSLGLRIPGAALNSRKPG-VNAFSSPCSCEIRLRGFPVQTSSIPL--VPSPESIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEI

Query:  SVFSGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKTKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSC
        SV++GR G  CGV    +L+G  ++ V       + V   NGW  +G    +  +  A LHL V  +PDPR+VFQF      SP + Q+Q ++KQP+FSC
Subjt:  SVFSGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKTKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSC

Query:  KFSRDRVSQADSLSN--YWSGPG------DGSDLEVER-RERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWG
        KFS DR  ++ SL +   +S  G       G   E ++ RERKGW + IHDLSGS VAAA + TPFV S G D V+RSNPG+WLI+RP      SW+PWG
Subjt:  KFSRDRVSQADSLSN--YWSGPG------DGSDLEVER-RERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWG

Query:  KLEAWRERGIRDTVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAM
        +LEAWRERG  D +  +F L+ +      + ++E  ++ ++GG+F ID                  G+  A+   V GFVM   V+GEGK SKP+V +  
Subjt:  KLEAWRERGIRDTVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAM

Query:  RHVTCIEDAAIFMALAAAVDLSIEACRPFRRKIRRAPRH
        +HVTC+ DAA+F+AL+AAVDLS++AC+ F RK+R+   H
Subjt:  RHVTCIEDAAIFMALAAAVDLSIEACRPFRRKIRRAPRH

AT5G17640.1 Protein of unknown function (DUF1005)9.7e-19476.85Show/hide
Query:  MDPQAFIRLSIGSLGLRIPGAALN--SRKPGVNAFSSPCSCEIRLRGFPVQTSSIPLVPSPESIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEIS
        MDPQAFIRLS+GSL LRIP   +N  S+      FSS CSCEI+LRGFPVQT+SIPL+PS ++ PD HSI++SFYLEESDL+ALL PGCFY+ HA LEIS
Subjt:  MDPQAFIRLSIGSLGLRIPGAALN--SRKPGVNAFSSPCSCEIRLRGFPVQTSSIPLVPSPESIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEIS

Query:  VFSGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKTKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCK
        VF+G+K  +CGVG KRQ IG FKL+V PEWG+GKP+ILFNGWI IGKTK +     AELHL+VKLDPDPRYVFQFEDVT LSPQIVQL+GS+KQPIFSCK
Subjt:  VFSGRKGSHCGVGIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKTKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCK

Query:  FSRDRVSQADSLSNYWSGPGDGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGI
        FSRDRVSQ D L+ YWS  GDG++LE ERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVA+SNPG+WL+VRPD   P SWQPWGKLEAWRERGI
Subjt:  FSRDRVSQADSLSNYWSGPGDGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGI

Query:  RDTVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQ-LRAATSPIPSPQSSGDFAALGQVV--GGFVMSCRVQGEGKSSKPMVQLAMRHVTCIE
        RD+VCCRFHLLS   E G++LMSEI I+AEKGGEF IDTDKQ L  A +PIPSPQSSGDF+ LGQ V  GGFVMS RVQGEGKSSKP+VQLAMRHVTC+E
Subjt:  RDTVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQ-LRAATSPIPSPQSSGDFAALGQVV--GGFVMSCRVQGEGKSSKPMVQLAMRHVTCIE

Query:  DAAIFMALAAAVDLSIEACRPFRRKIRRAPRH
        DAAIFMALAAAVDLSI AC+PFRR  RR  RH
Subjt:  DAAIFMALAAAVDLSIEACRPFRRKIRRAPRH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATCCTCAGGCGTTTATTAGGTTGTCAATTGGATCATTGGGATTGAGAATCCCAGGAGCTGCTCTAAACTCTAGAAAACCTGGAGTCAATGCTTTCTCTTCTCCGTG
TTCTTGTGAAATTCGTCTGCGAGGTTTCCCTGTGCAGACGTCTTCAATCCCTCTAGTCCCATCTCCTGAATCTATACCCGACTCTCATAGCATTGCCTCAAGCTTCTATC
TTGAAGAGTCCGATCTGAAAGCATTACTGGCACCTGGCTGCTTCTACAACACTCATGCCTGTCTTGAAATATCTGTCTTCTCTGGAAGAAAGGGATCCCACTGTGGTGTT
GGCATCAAAAGGCAGCTGATCGGGACGTTTAAACTGGACGTCAGTCCTGAATGGGGTGATGGGAAGCCAGTCATTCTTTTCAATGGGTGGATAGGCATTGGCAAAACCAA
GAATGAGAATGGAAGACATGGAGCAGAGCTTCATTTGAGAGTGAAACTAGATCCTGATCCAAGATATGTTTTTCAGTTTGAAGATGTTACTAGGTTAAGCCCGCAAATCG
TCCAGCTTCAAGGCTCGATTAAACAGCCGATCTTCAGCTGCAAATTTAGTCGAGACAGGGTATCCCAGGCGGATTCATTGAGCAACTATTGGTCAGGTCCTGGTGATGGC
TCAGATCTCGAGGTCGAGCGAAGAGAAAGAAAAGGCTGGAAGGTGAAGATACATGATCTTTCTGGCTCGGCTGTTGCAGCAGCCTTCATAACTACTCCCTTTGTGCCATC
AACAGGTTGTGATTGGGTTGCCAGGTCGAACCCCGGGTCCTGGCTGATTGTCCGTCCTGATGTTTGCATACCTGAAAGTTGGCAGCCATGGGGAAAGCTTGAGGCGTGGC
GTGAGCGTGGGATTAGAGACACTGTCTGCTGTCGCTTTCACCTTCTCTCCGAAGCCCAAGAGGGAGGGGAACTTCTCATGTCTGAGATTCATATCAATGCCGAGAAAGGC
GGGGAGTTCTTCATAGACACTGACAAACAGTTGCGAGCAGCAACAAGTCCAATACCGAGCCCGCAGAGCAGCGGAGACTTTGCAGCATTAGGCCAAGTGGTTGGAGGTTT
CGTCATGAGCTGCAGAGTACAAGGGGAAGGAAAAAGCAGTAAGCCGATGGTTCAACTCGCAATGCGACATGTGACATGTATAGAAGATGCTGCCATTTTCATGGCGCTTG
CAGCTGCAGTTGATCTCAGCATCGAGGCATGTAGGCCGTTCCGAAGGAAGATCAGGAGAGCGCCTCGGCATTCATAG
mRNA sequenceShow/hide mRNA sequence
ATGGATCCTCAGGCGTTTATTAGGTTGTCAATTGGATCATTGGGATTGAGAATCCCAGGAGCTGCTCTAAACTCTAGAAAACCTGGAGTCAATGCTTTCTCTTCTCCGTG
TTCTTGTGAAATTCGTCTGCGAGGTTTCCCTGTGCAGACGTCTTCAATCCCTCTAGTCCCATCTCCTGAATCTATACCCGACTCTCATAGCATTGCCTCAAGCTTCTATC
TTGAAGAGTCCGATCTGAAAGCATTACTGGCACCTGGCTGCTTCTACAACACTCATGCCTGTCTTGAAATATCTGTCTTCTCTGGAAGAAAGGGATCCCACTGTGGTGTT
GGCATCAAAAGGCAGCTGATCGGGACGTTTAAACTGGACGTCAGTCCTGAATGGGGTGATGGGAAGCCAGTCATTCTTTTCAATGGGTGGATAGGCATTGGCAAAACCAA
GAATGAGAATGGAAGACATGGAGCAGAGCTTCATTTGAGAGTGAAACTAGATCCTGATCCAAGATATGTTTTTCAGTTTGAAGATGTTACTAGGTTAAGCCCGCAAATCG
TCCAGCTTCAAGGCTCGATTAAACAGCCGATCTTCAGCTGCAAATTTAGTCGAGACAGGGTATCCCAGGCGGATTCATTGAGCAACTATTGGTCAGGTCCTGGTGATGGC
TCAGATCTCGAGGTCGAGCGAAGAGAAAGAAAAGGCTGGAAGGTGAAGATACATGATCTTTCTGGCTCGGCTGTTGCAGCAGCCTTCATAACTACTCCCTTTGTGCCATC
AACAGGTTGTGATTGGGTTGCCAGGTCGAACCCCGGGTCCTGGCTGATTGTCCGTCCTGATGTTTGCATACCTGAAAGTTGGCAGCCATGGGGAAAGCTTGAGGCGTGGC
GTGAGCGTGGGATTAGAGACACTGTCTGCTGTCGCTTTCACCTTCTCTCCGAAGCCCAAGAGGGAGGGGAACTTCTCATGTCTGAGATTCATATCAATGCCGAGAAAGGC
GGGGAGTTCTTCATAGACACTGACAAACAGTTGCGAGCAGCAACAAGTCCAATACCGAGCCCGCAGAGCAGCGGAGACTTTGCAGCATTAGGCCAAGTGGTTGGAGGTTT
CGTCATGAGCTGCAGAGTACAAGGGGAAGGAAAAAGCAGTAAGCCGATGGTTCAACTCGCAATGCGACATGTGACATGTATAGAAGATGCTGCCATTTTCATGGCGCTTG
CAGCTGCAGTTGATCTCAGCATCGAGGCATGTAGGCCGTTCCGAAGGAAGATCAGGAGAGCGCCTCGGCATTCATAG
Protein sequenceShow/hide protein sequence
MDPQAFIRLSIGSLGLRIPGAALNSRKPGVNAFSSPCSCEIRLRGFPVQTSSIPLVPSPESIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVFSGRKGSHCGV
GIKRQLIGTFKLDVSPEWGDGKPVILFNGWIGIGKTKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFSRDRVSQADSLSNYWSGPGDG
SDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRDTVCCRFHLLSEAQEGGELLMSEIHINAEKG
GEFFIDTDKQLRAATSPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIFMALAAAVDLSIEACRPFRRKIRRAPRHS