; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0016306 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0016306
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionNuclear factor 1 A-type isoform 2
Genome locationchr12:36080337..36083651
RNA-Seq ExpressionLag0016306
SyntenyLag0016306
Gene Ontology termsNA
InterPro domainsIPR010410 - Protein of unknown function DUF1005


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039981.1 Nuclear factor 1 A-type isoform 2 [Cucumis melo var. makuwa]1.8e-24496.5Show/hide
Query:  MDPQAFIRLSIGSLGLRVPGAAINSTKPGVNAFSSPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        MDPQAFIRLSIGSLGLR+PG ++NSTKPGVNAFSSPCSCEIRLRGFP+QTSSIPLLPSPEAIPDSH IASSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRVPGAAINSTKPGVNAFSSPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDV PEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQF+DVTR SPQIVQLQGSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS

Query:  RDRVSQADSLSNYWSGSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
        RDRVSQADSLSNYWSG GDGSDLE ERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
Subjt:  RDRVSQADSLSNYWSGSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD

Query:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF
        TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAA SPIPSPQSSGDFAALGQVVGGFVMSCRVQGEG+SSKP VQLAMRHVTCIEDAAIF
Subjt:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF

Query:  MALAAAVDLSIEACRPFRRKIRRAPRHS
        MALAAAVDLSIEACRPFRRKIRR PRHS
Subjt:  MALAAAVDLSIEACRPFRRKIRRAPRHS

XP_008460086.1 PREDICTED: uncharacterized protein LOC103499002 [Cucumis melo]3.9e-24496.26Show/hide
Query:  MDPQAFIRLSIGSLGLRVPGAAINSTKPGVNAFSSPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        MDPQAFIRLSIGSLGLR+PG ++NSTKPGVNAFSSPCSCEIRLRGFP+QTSSIPL+PSPEAIPDSH IASSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRVPGAAINSTKPGVNAFSSPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDV PEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQF+DVTR SPQIVQLQGSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS

Query:  RDRVSQADSLSNYWSGSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
        RDRVSQADSLSNYWSG GDGSDLE ERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
Subjt:  RDRVSQADSLSNYWSGSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD

Query:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF
        TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAA SPIPSPQSSGDFAALGQVVGGFVMSCRVQGEG+SSKP VQLAMRHVTCIEDAAIF
Subjt:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF

Query:  MALAAAVDLSIEACRPFRRKIRRAPRHS
        MALAAAVDLSIEACRPFRRKIRR PRHS
Subjt:  MALAAAVDLSIEACRPFRRKIRRAPRHS

XP_022155757.1 uncharacterized protein LOC111022801 [Momordica charantia]1.2e-24596.73Show/hide
Query:  MDPQAFIRLSIGSLGLRVPGAAINSTKPGVNAFSSPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        M+PQAFIRLSIGSLGLRVPG ++NSTKPG+N+FSSPCSCEIRLRGFPVQ+SSIP+LPSPEAIP+SHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRVPGAAINSTKPGVNAFSSPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHL+VKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS

Query:  RDRVSQADSLSNYWSGSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
        RDRVSQADSLSNYWSGS DG DLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARS+PGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
Subjt:  RDRVSQADSLSNYWSGSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD

Query:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF
        TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTC+EDAAIF
Subjt:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF

Query:  MALAAAVDLSIEACRPFRRKIRRAPRHS
        MALAAAVDLSIEACRPFRRKIRRAPRHS
Subjt:  MALAAAVDLSIEACRPFRRKIRRAPRHS

XP_022964109.1 uncharacterized protein LOC111464238 [Cucurbita moschata]6.7e-24496.26Show/hide
Query:  MDPQAFIRLSIGSLGLRVPGAAINSTKPGVNAFSSPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        MDPQAFIRLSIGSLGLR+PG AINSTKPGV+A SSPCSCEIRLRGFPVQTSSIP++ SPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRVPGAAINSTKPGVNAFSSPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPV+LFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQL+GSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS

Query:  RDRVSQADSLSNYWSGSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
        RDRVSQADSLSNYWSGSGDGSD+EAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDV IP+SWQPWGKLEAWRERGIRD
Subjt:  RDRVSQADSLSNYWSGSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD

Query:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF
        TVCCRFHLLSE QEGGELLMSEIHINAEKGGEFFIDTDKQLRAA SPIPSPQSSGDFAALGQV+GGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF
Subjt:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF

Query:  MALAAAVDLSIEACRPFRRKIRRAPRHS
        MA+AAAVDLSIEACRPFRRKIRRAPRHS
Subjt:  MALAAAVDLSIEACRPFRRKIRRAPRHS

XP_038874344.1 uncharacterized protein LOC120067041 [Benincasa hispida]4.5e-24897.9Show/hide
Query:  MDPQAFIRLSIGSLGLRVPGAAINSTKPGVNAFSSPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        MDPQAFIRLSIGSLGLR+PG ++NSTKPGVNAFSSPCSCEIRLRGFPVQTSSIPL+PSPEAIPDSHSI SSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRVPGAAINSTKPGVNAFSSPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS

Query:  RDRVSQADSLSNYWSGSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
        RDRVSQADSLSNYWSG GDGSDLE ERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
Subjt:  RDRVSQADSLSNYWSGSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD

Query:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF
        TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAA SPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF
Subjt:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF

Query:  MALAAAVDLSIEACRPFRRKIRRAPRHS
        MALAAAVDLSIEACRPFRRKIRRAPRHS
Subjt:  MALAAAVDLSIEACRPFRRKIRRAPRHS

TrEMBL top hitse value%identityAlignment
A0A0A0K922 Uncharacterized protein4.3e-24495.79Show/hide
Query:  MDPQAFIRLSIGSLGLRVPGAAINSTKPGVNAFSSPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        MDPQAFIRLSIGSLGLR+PG ++NSTKPGVNAFSSPCSCEIRLRGFP+QTSSIPL+PSPEAIPDSH IASSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRVPGAAINSTKPGVNAFSSPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDV PEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQF+DVTR SPQ+VQL+GSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS

Query:  RDRVSQADSLSNYWSGSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
        RDRVSQADSLSNYWSG GDGSDLE ERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
Subjt:  RDRVSQADSLSNYWSGSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD

Query:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF
        TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAA SPIPSPQSSGDFAALGQVVGGFVMSCRVQGEG+SSKP VQLAMRHVTCIEDAAIF
Subjt:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF

Query:  MALAAAVDLSIEACRPFRRKIRRAPRHS
        MALAAAVDLSIEACRPFRRKIRR PRHS
Subjt:  MALAAAVDLSIEACRPFRRKIRRAPRHS

A0A1S3CCZ5 uncharacterized protein LOC1034990021.9e-24496.26Show/hide
Query:  MDPQAFIRLSIGSLGLRVPGAAINSTKPGVNAFSSPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        MDPQAFIRLSIGSLGLR+PG ++NSTKPGVNAFSSPCSCEIRLRGFP+QTSSIPL+PSPEAIPDSH IASSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRVPGAAINSTKPGVNAFSSPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDV PEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQF+DVTR SPQIVQLQGSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS

Query:  RDRVSQADSLSNYWSGSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
        RDRVSQADSLSNYWSG GDGSDLE ERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
Subjt:  RDRVSQADSLSNYWSGSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD

Query:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF
        TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAA SPIPSPQSSGDFAALGQVVGGFVMSCRVQGEG+SSKP VQLAMRHVTCIEDAAIF
Subjt:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF

Query:  MALAAAVDLSIEACRPFRRKIRRAPRHS
        MALAAAVDLSIEACRPFRRKIRR PRHS
Subjt:  MALAAAVDLSIEACRPFRRKIRRAPRHS

A0A5A7TE92 Nuclear factor 1 A-type isoform 28.6e-24596.5Show/hide
Query:  MDPQAFIRLSIGSLGLRVPGAAINSTKPGVNAFSSPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        MDPQAFIRLSIGSLGLR+PG ++NSTKPGVNAFSSPCSCEIRLRGFP+QTSSIPLLPSPEAIPDSH IASSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRVPGAAINSTKPGVNAFSSPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDV PEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQF+DVTR SPQIVQLQGSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS

Query:  RDRVSQADSLSNYWSGSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
        RDRVSQADSLSNYWSG GDGSDLE ERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
Subjt:  RDRVSQADSLSNYWSGSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD

Query:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF
        TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAA SPIPSPQSSGDFAALGQVVGGFVMSCRVQGEG+SSKP VQLAMRHVTCIEDAAIF
Subjt:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF

Query:  MALAAAVDLSIEACRPFRRKIRRAPRHS
        MALAAAVDLSIEACRPFRRKIRR PRHS
Subjt:  MALAAAVDLSIEACRPFRRKIRRAPRHS

A0A6J1DQ78 uncharacterized protein LOC1110228015.9e-24696.73Show/hide
Query:  MDPQAFIRLSIGSLGLRVPGAAINSTKPGVNAFSSPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        M+PQAFIRLSIGSLGLRVPG ++NSTKPG+N+FSSPCSCEIRLRGFPVQ+SSIP+LPSPEAIP+SHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRVPGAAINSTKPGVNAFSSPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHL+VKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS

Query:  RDRVSQADSLSNYWSGSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
        RDRVSQADSLSNYWSGS DG DLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARS+PGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
Subjt:  RDRVSQADSLSNYWSGSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD

Query:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF
        TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTC+EDAAIF
Subjt:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF

Query:  MALAAAVDLSIEACRPFRRKIRRAPRHS
        MALAAAVDLSIEACRPFRRKIRRAPRHS
Subjt:  MALAAAVDLSIEACRPFRRKIRRAPRHS

A0A6J1HM61 uncharacterized protein LOC1114642383.3e-24496.26Show/hide
Query:  MDPQAFIRLSIGSLGLRVPGAAINSTKPGVNAFSSPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
        MDPQAFIRLSIGSLGLR+PG AINSTKPGV+A SSPCSCEIRLRGFPVQTSSIP++ SPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRVPGAAINSTKPGVNAFSSPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS
        SGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPV+LFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQL+GSIKQPIFSCKFS
Subjt:  SGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS

Query:  RDRVSQADSLSNYWSGSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
        RDRVSQADSLSNYWSGSGDGSD+EAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDV IP+SWQPWGKLEAWRERGIRD
Subjt:  RDRVSQADSLSNYWSGSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD

Query:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF
        TVCCRFHLLSE QEGGELLMSEIHINAEKGGEFFIDTDKQLRAA SPIPSPQSSGDFAALGQV+GGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF
Subjt:  TVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIF

Query:  MALAAAVDLSIEACRPFRRKIRRAPRHS
        MA+AAAVDLSIEACRPFRRKIRRAPRHS
Subjt:  MALAAAVDLSIEACRPFRRKIRRAPRHS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G10020.1 Protein of unknown function (DUF1005)1.5e-9542.37Show/hide
Query:  MDPQAFIRLSIGSLGLRVPGAAINSTKPGVNAFSSPCSCEIRLRGFPVQTSSIPLLP-SPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISV
        MDP  FIRL+IG+L L+VP AA  +T   V+  SSPC C+I+L+ FP QT++IP +P      P+  ++A++F+L  SD++  LA    + +  CL+I +
Subjt:  MDPQAFIRLSIGSLGLRVPGAAINSTKPGVNAFSSPCSCEIRLRGFPVQTSSIPLLP-SPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISV

Query:  FSGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKF
        ++GR G+ CGV   R L+    + +       KP +  NGWI +GK   ++    A+ HL VK +PDPR+VFQF+     SPQ+VQ+QG+I+QP+F+CKF
Subjt:  FSGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKF

Query:  S----RDRVSQADSL------SNYWSGSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGK
        S     DR  ++ SL      S  W  S  GS+ E   +ERKGW + +HDLSGS VA A I TPFV S G D V+RSNPGSWLI+RP  C   +W+PWG+
Subjt:  S----RDRVSQADSL------SNYWSGSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGK

Query:  LEAWRER-GIRDTVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAA----------------------ASPIPSPQ-SSGDFAA---LGQ
        LEAWRER G  D +  RF L+ +   G  ++++E  I++ +GG+F I+      ++                      ASP  SP+  SGD+        
Subjt:  LEAWRER-GIRDTVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAA----------------------ASPIPSPQ-SSGDFAA---LGQ

Query:  VVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIFMALAAAVDLSIEACRPFRRKIRRAPRH
        V  GFVMS  V+GEGK SKP V+++++HV+C+EDAA ++AL+AA+DLS++ACR F +++R+   H
Subjt:  VVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIFMALAAAVDLSIEACRPFRRKIRRAPRH

AT1G50040.1 Protein of unknown function (DUF1005)1.2e-7337.02Show/hide
Query:  MDPQAFIRLSIGSLGLRVP------GAAINSTKPGVNAFSS-PCSCEIRLRGFPVQTSSIPLLPSPEAIPDSH-------SIASSFYLEESDLKALLAPG
        MDP +F+R+ +G+L +R P       ++ +S+ P V+  SS  C C+I+ + FP Q  S+P+L   E+  +S        ++A+ F L +S ++  L   
Subjt:  MDPQAFIRLSIGSLGLRVP------GAAINSTKPGVNAFSS-PCSCEIRLRGFPVQTSSIPLLPSPEAIPDSH-------SIASSFYLEESDLKALLAPG

Query:  CFYNTHACLEISVFSGRKGSHCG--VGIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKNENGRHGA--ELHLRVKLDPDPRYVFQFEDVTRLSPQ
         +    + L + V+S R+ + CG       +LIG F++ +  +  + K  +  NGW+ +G     N + G+  ELH+ V+++PD R+VFQF+     SPQ
Subjt:  CFYNTHACLEISVFSGRKGSHCG--VGIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKNENGRHGA--ELHLRVKLDPDPRYVFQFEDVTRLSPQ

Query:  IVQLQGSIKQPIFSCKF----SRDRVSQADSLSNYWSGSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDV
        + Q+QG+ KQ +F+CKF    S DR + + SLS+  SG       E   +ERKGW + IHDLSGS VA A + TPFVPS G + V+RS+PG+WLI+RPD 
Subjt:  IVQLQGSIKQPIFSCKF----SRDRVSQADSLSNYWSGSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDV

Query:  CIPESWQPWGKLEAWRERGIRDTVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVG-------------
            +W+PW +L+AWRE G+ D +  RF L  +       + +   I+ + GG F ID        AS   S + S D ++   +               
Subjt:  CIPESWQPWGKLEAWRERGIRDTVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVG-------------

Query:  ---------GFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIFMALAAAVDLSIEACRPFRRKIRRAPR
                 GFVMS RVQG  K SKP V++ ++HVTC EDAA  +ALAAAVDLS++ACR F +K+R   R
Subjt:  ---------GFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIFMALAAAVDLSIEACRPFRRKIRRAPR

AT3G19680.1 Protein of unknown function (DUF1005)5.8e-7635.76Show/hide
Query:  MDPQAFIRLSIGSLGLRVPGAAINSTK------PGVNAFSSPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSH--------SIASSFYLEESDLKALLAPG
        MDP +F+R+ +G+L +R P ++ +S+        G+N  +  C C+IR + FP +  S+P++   E+  ++         ++A+ F L ++ ++A L   
Subjt:  MDPQAFIRLSIGSLGLRVPGAAINSTK------PGVNAFSSPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSH--------SIASSFYLEESDLKALLAPG

Query:  CFYNTHACLEISVFS--------GRKGSHCGVGIK-RQLIGTFKLDVGPEWGDGKPVILFNGWIGI--GKSKNENGRHGAELHLRVKLDPDPRYVFQFED
         F    + L +  +S        G  G+ CG+     +L+G F++ +  +  + K  +  NGW+ +   K+K++ G    ELH+ V+++PDPR+VFQF+ 
Subjt:  CFYNTHACLEISVFS--------GRKGSHCGVGIK-RQLIGTFKLDVGPEWGDGKPVILFNGWIGI--GKSKNENGRHGAELHLRVKLDPDPRYVFQFED

Query:  VTRLSPQIVQLQGSIKQPIFSCKF--------SRDRVSQADSLSNYWSG----SGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWV
            SPQ+ Q+QG+ KQ +F+CKF         R+ +  +  +S   S     S   S+ E   +ERKGW + +HDLSGS VA A + TPFVPS G + V
Subjt:  VTRLSPQIVQLQGSIKQPIFSCKF--------SRDRVSQADSLSNYWSG----SGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWV

Query:  ARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRDTVCCRFHLLSEAQEG-GELLMSEIHINAEKGGEFFID-TDKQLRAAASPIPSPQSSGDFAALG
         RS+PG+WLI+RPD C   +W+PWG+LEAWRE G  DT+  RF L    Q+G    + +   I+ + GG F ID T      A++P  SPQ S D  + G
Subjt:  ARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRDTVCCRFHLLSEAQEG-GELLMSEIHINAEKGGEFFID-TDKQLRAAASPIPSPQSSGDFAALG

Query:  QVVG------------------------------GFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIFMALAAAVDLSIEACRPFRRKIRRAPR
           G                              GFVMS  V+G GK SKP V++ + HVTC EDAA  +ALAAAVDLS++ACR F  K+R+  R
Subjt:  QVVG------------------------------GFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIFMALAAAVDLSIEACRPFRRKIRRAPR

AT4G29310.1 Protein of unknown function (DUF1005)2.9e-8340.91Show/hide
Query:  MDPQAFIRLSIGSLGLRVPGAAINSTKPG-VNAFSSPCSCEIRLRGFPVQTSSIPLLPSPEAI--PDSHSIASSFYLEESDLKALLAPGCFYNTHACLEI
        MDP  F+RL+I SL LR+P  A N    G V+  S+PC C++R++ FP Q + +PL    +A   P+S + A  F+L+   ++ +            L +
Subjt:  MDPQAFIRLSIGSLGLRVPGAAINSTKPG-VNAFSSPCSCEIRLRGFPVQTSSIPLLPSPEAI--PDSHSIASSFYLEESDLKALLAPGCFYNTHACLEI

Query:  SVFSGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSC
        SV++GR G  CGV    +L+G  ++ V       + V   NGW  +G    +  +  A LHL V  +PDPR+VFQF      SP + Q+Q ++KQP+FSC
Subjt:  SVFSGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSC

Query:  KFSRDRVSQADSL-------SNYW---SGSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPW
        KFS DR  ++ SL       S  W   + SGD  + + + RERKGW + IHDLSGS VAAA + TPFV S G D V+RSNPG+WLI+RP      SW+PW
Subjt:  KFSRDRVSQADSL-------SNYW---SGSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPW

Query:  GKLEAWRERGIRDTVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLA
        G+LEAWRERG  D +  +F L+ +      + ++E  ++ ++GG+F ID                  G+  A+   V GFVM   V+GEGK SKP+V + 
Subjt:  GKLEAWRERGIRDTVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLA

Query:  MRHVTCIEDAAIFMALAAAVDLSIEACRPFRRKIRRAPRH
         +HVTC+ DAA+F+AL+AAVDLS++AC+ F RK+R+   H
Subjt:  MRHVTCIEDAAIFMALAAAVDLSIEACRPFRRKIRRAPRH

AT5G17640.1 Protein of unknown function (DUF1005)1.2e-19577.78Show/hide
Query:  MDPQAFIRLSIGSLGLRVPGAAINSTKPG--VNAFSSPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEIS
        MDPQAFIRLS+GSL LR+P   INST        FSS CSCEI+LRGFPVQT+SIPL+PS +A PD HSI++SFYLEESDL+ALL PGCFY+ HA LEIS
Subjt:  MDPQAFIRLSIGSLGLRVPGAAINSTKPG--VNAFSSPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEIS

Query:  VFSGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCK
        VF+G+K  +CGVG KRQ IG FKL+VGPEWG+GKP+ILFNGWI IGK+K +     AELHL+VKLDPDPRYVFQFEDVT LSPQIVQL+GS+KQPIFSCK
Subjt:  VFSGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCK

Query:  FSRDRVSQADSLSNYWSGSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGI
        FSRDRVSQ D L+ YWS SGDG++LE+ERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVA+SNPG+WL+VRPD   P SWQPWGKLEAWRERGI
Subjt:  FSRDRVSQADSLSNYWSGSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGI

Query:  RDTVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQ-LRAAASPIPSPQSSGDFAALGQVV--GGFVMSCRVQGEGKSSKPMVQLAMRHVTCIE
        RD+VCCRFHLLS   E G++LMSEI I+AEKGGEF IDTDKQ L  AA+PIPSPQSSGDF+ LGQ V  GGFVMS RVQGEGKSSKP+VQLAMRHVTC+E
Subjt:  RDTVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQ-LRAAASPIPSPQSSGDFAALGQVV--GGFVMSCRVQGEGKSSKPMVQLAMRHVTCIE

Query:  DAAIFMALAAAVDLSIEACRPFRRKIRRAPRH
        DAAIFMALAAAVDLSI AC+PFRR  RR  RH
Subjt:  DAAIFMALAAAVDLSIEACRPFRRKIRRAPRH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTACTCTTCTTCTTCTTCTTCTTCTTCAAGGATTCTGGCTGCTCTGCTTTCTGTGGAGCACTGCAGTACTGTTTCTACTTGGCTACTAATCACAAGTTTACAAACAG
TATTCTACGTGTTTTCTTGGCAATCGAATGTAGTAGGGTCAGACAATTGTGCTTGTGTCGTGAGATTAGTTGGGGATACTTGAGCTTGTCGAAATTTAAACAATCTATAT
CAAATGGACAAAAAACTGTGACCAAAATGGATCCTCAGGCTTTTATTAGGTTGTCAATTGGCTCTTTGGGATTGAGAGTCCCAGGAGCTGCTATAAATTCTACAAAACCT
GGAGTCAATGCTTTCTCTTCTCCGTGTTCGTGTGAAATTCGTCTTCGGGGTTTCCCTGTGCAGACGTCTTCCATACCGCTACTCCCATCTCCTGAAGCCATACCTGACTC
TCATAGCATTGCCTCAAGCTTCTATCTTGAAGAGTCTGATCTTAAAGCATTGCTGGCACCTGGTTGCTTCTACAACACTCATGCTTGTCTTGAAATATCTGTCTTCTCTG
GAAGGAAGGGATCCCATTGTGGCGTTGGCATCAAAAGGCAGCTGATTGGGACGTTTAAACTGGATGTCGGTCCCGAATGGGGTGATGGGAAACCCGTCATTCTTTTCAAT
GGGTGGATAGGCATTGGCAAAAGTAAGAATGAGAATGGAAGACATGGAGCCGAGCTTCACTTGAGAGTGAAACTAGATCCTGATCCACGATACGTTTTTCAGTTTGAAGA
TGTTACGAGGTTAAGCCCGCAAATCGTGCAGCTTCAAGGTTCGATCAAACAGCCGATCTTCAGCTGCAAATTTAGTCGAGACAGGGTGTCCCAGGCGGATTCATTGAGCA
ACTATTGGTCAGGTTCTGGTGATGGCTCGGACCTCGAGGCCGAGCGGAGAGAAAGAAAAGGCTGGAAGGTGAAGATACATGATCTTTCTGGCTCGGCTGTTGCAGCAGCC
TTCATAACTACTCCCTTTGTGCCATCAACAGGTTGCGATTGGGTTGCCAGGTCGAACCCCGGGTCCTGGCTGATTGTTCGTCCTGATGTTTGCATACCTGAAAGTTGGCA
ACCATGGGGAAAGCTCGAGGCGTGGCGCGAGCGTGGGATTAGAGACACTGTCTGCTGTCGCTTTCACCTTCTCTCTGAAGCGCAAGAGGGAGGTGAACTTCTCATGTCTG
AGATACATATCAATGCTGAGAAAGGCGGAGAGTTCTTCATAGACACTGACAAACAGTTGCGAGCAGCAGCGAGTCCAATACCGAGCCCGCAGAGCAGCGGAGACTTTGCT
GCATTGGGTCAAGTGGTTGGAGGTTTCGTCATGAGCTGCAGAGTACAAGGGGAAGGAAAGAGCAGCAAGCCAATGGTTCAACTCGCAATGCGACATGTGACATGTATAGA
GGATGCTGCCATTTTCATGGCACTCGCAGCTGCAGTCGATCTCAGCATCGAGGCGTGTAGGCCGTTCCGAAGGAAGATCAGGAGAGCGCCTCGACATTCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGTTACTCTTCTTCTTCTTCTTCTTCTTCAAGGATTCTGGCTGCTCTGCTTTCTGTGGAGCACTGCAGTACTGTTTCTACTTGGCTACTAATCACAAGTTTACAAACAG
TATTCTACGTGTTTTCTTGGCAATCGAATGTAGTAGGGTCAGACAATTGTGCTTGTGTCGTGAGATTAGTTGGGGATACTTGAGCTTGTCGAAATTTAAACAATCTATAT
CAAATGGACAAAAAACTGTGACCAAAATGGATCCTCAGGCTTTTATTAGGTTGTCAATTGGCTCTTTGGGATTGAGAGTCCCAGGAGCTGCTATAAATTCTACAAAACCT
GGAGTCAATGCTTTCTCTTCTCCGTGTTCGTGTGAAATTCGTCTTCGGGGTTTCCCTGTGCAGACGTCTTCCATACCGCTACTCCCATCTCCTGAAGCCATACCTGACTC
TCATAGCATTGCCTCAAGCTTCTATCTTGAAGAGTCTGATCTTAAAGCATTGCTGGCACCTGGTTGCTTCTACAACACTCATGCTTGTCTTGAAATATCTGTCTTCTCTG
GAAGGAAGGGATCCCATTGTGGCGTTGGCATCAAAAGGCAGCTGATTGGGACGTTTAAACTGGATGTCGGTCCCGAATGGGGTGATGGGAAACCCGTCATTCTTTTCAAT
GGGTGGATAGGCATTGGCAAAAGTAAGAATGAGAATGGAAGACATGGAGCCGAGCTTCACTTGAGAGTGAAACTAGATCCTGATCCACGATACGTTTTTCAGTTTGAAGA
TGTTACGAGGTTAAGCCCGCAAATCGTGCAGCTTCAAGGTTCGATCAAACAGCCGATCTTCAGCTGCAAATTTAGTCGAGACAGGGTGTCCCAGGCGGATTCATTGAGCA
ACTATTGGTCAGGTTCTGGTGATGGCTCGGACCTCGAGGCCGAGCGGAGAGAAAGAAAAGGCTGGAAGGTGAAGATACATGATCTTTCTGGCTCGGCTGTTGCAGCAGCC
TTCATAACTACTCCCTTTGTGCCATCAACAGGTTGCGATTGGGTTGCCAGGTCGAACCCCGGGTCCTGGCTGATTGTTCGTCCTGATGTTTGCATACCTGAAAGTTGGCA
ACCATGGGGAAAGCTCGAGGCGTGGCGCGAGCGTGGGATTAGAGACACTGTCTGCTGTCGCTTTCACCTTCTCTCTGAAGCGCAAGAGGGAGGTGAACTTCTCATGTCTG
AGATACATATCAATGCTGAGAAAGGCGGAGAGTTCTTCATAGACACTGACAAACAGTTGCGAGCAGCAGCGAGTCCAATACCGAGCCCGCAGAGCAGCGGAGACTTTGCT
GCATTGGGTCAAGTGGTTGGAGGTTTCGTCATGAGCTGCAGAGTACAAGGGGAAGGAAAGAGCAGCAAGCCAATGGTTCAACTCGCAATGCGACATGTGACATGTATAGA
GGATGCTGCCATTTTCATGGCACTCGCAGCTGCAGTCGATCTCAGCATCGAGGCGTGTAGGCCGTTCCGAAGGAAGATCAGGAGAGCGCCTCGACATTCTTAG
Protein sequenceShow/hide protein sequence
MLLFFFFFFFKDSGCSAFCGALQYCFYLATNHKFTNSILRVFLAIECSRVRQLCLCREISWGYLSLSKFKQSISNGQKTVTKMDPQAFIRLSIGSLGLRVPGAAINSTKP
GVNAFSSPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSHSIASSFYLEESDLKALLAPGCFYNTHACLEISVFSGRKGSHCGVGIKRQLIGTFKLDVGPEWGDGKPVILFN
GWIGIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFSRDRVSQADSLSNYWSGSGDGSDLEAERRERKGWKVKIHDLSGSAVAAA
FITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRDTVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTDKQLRAAASPIPSPQSSGDFA
ALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIFMALAAAVDLSIEACRPFRRKIRRAPRHS