; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0027816 (gene) of Chayote v1 genome

Gene IDSed0027816
OrganismSechium edule (Chayote v1)
DescriptionNuclear factor 1 A-type isoform 2
Genome locationLG04:35594104..35598245
RNA-Seq ExpressionSed0027816
SyntenySed0027816
Gene Ontology termsNA
InterPro domainsIPR010410 - Protein of unknown function DUF1005


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039981.1 Nuclear factor 1 A-type isoform 2 [Cucumis melo var. makuwa]2.9e-23291.84Show/hide
Query:  MDPQAFIRLSIGSLGLRIPGAALNAANPGVNAFSSLCSCEIRLRGFPVQTSSIPLLPSPESVPDSHSIASIFYLEESDLKALLAPGCFYNTHACLEVSVF
        MDPQAFIRLSIGSLGLRIPG +LN+  PGVNAFSS CSCEIRLRGFP+QTSSIPLLPSPE++PDSH IAS FYLEESDLKALLAPGCFYNTHACLE+SVF
Subjt:  MDPQAFIRLSIGSLGLRIPGAALNAANPGVNAFSSLCSCEIRLRGFPVQTSSIPLLPSPESVPDSHSIASIFYLEESDLKALLAPGCFYNTHACLEVSVF

Query:  SGRKGSSHCGVGIKRQLIGSFKLDVGPEWGDGKPVILFNGWISIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKF
        SGRKG SHCGVGIKRQLIG+FKLDV PEWGDGKPVILFNGWI IGKSKNENGRHGAELHLRVKLDPDPRYVFQF+DVTR SPQIVQLQGSIKQPIFSCKF
Subjt:  SGRKGSSHCGVGIKRQLIGSFKLDVGPEWGDGKPVILFNGWISIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKF

Query:  SRDRVSQADSLSNYWSSSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGAWLIVRPDVCIPDNWQPWGKLEAWRERGIR
        SRDRVSQADSLSNYWS  GDGSDLE ERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPG+WLIVRPDVCIP++WQPWGKLEAWRERGIR
Subjt:  SRDRVSQADSLSNYWSSSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGAWLIVRPDVCIPDNWQPWGKLEAWRERGIR

Query:  DTVCCRFHLLSEAPERGELLMSEIHINAEKGGEFFIDTDKQLRAASSPIPSPRSSGDFTALCQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAI
        DTVCCRFHLLSEA E GELLMSEIHINAEKGGEFFIDTDKQLRAA+SPIPSP+SSGDF AL QVVGGFVMSCRVQGEG+SSKP VQLAMRHVTCIEDAAI
Subjt:  DTVCCRFHLLSEAPERGELLMSEIHINAEKGGEFFIDTDKQLRAASSPIPSPRSSGDFTALCQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAI

Query:  FMALAAAVDLSIEACRPFQRKIRRASRHS
        FMALAAAVDLSIEACRPF+RKIRR  RHS
Subjt:  FMALAAAVDLSIEACRPFQRKIRRASRHS

XP_008460086.1 PREDICTED: uncharacterized protein LOC103499002 [Cucumis melo]6.5e-23291.61Show/hide
Query:  MDPQAFIRLSIGSLGLRIPGAALNAANPGVNAFSSLCSCEIRLRGFPVQTSSIPLLPSPESVPDSHSIASIFYLEESDLKALLAPGCFYNTHACLEVSVF
        MDPQAFIRLSIGSLGLRIPG +LN+  PGVNAFSS CSCEIRLRGFP+QTSSIPL+PSPE++PDSH IAS FYLEESDLKALLAPGCFYNTHACLE+SVF
Subjt:  MDPQAFIRLSIGSLGLRIPGAALNAANPGVNAFSSLCSCEIRLRGFPVQTSSIPLLPSPESVPDSHSIASIFYLEESDLKALLAPGCFYNTHACLEVSVF

Query:  SGRKGSSHCGVGIKRQLIGSFKLDVGPEWGDGKPVILFNGWISIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKF
        SGRKG SHCGVGIKRQLIG+FKLDV PEWGDGKPVILFNGWI IGKSKNENGRHGAELHLRVKLDPDPRYVFQF+DVTR SPQIVQLQGSIKQPIFSCKF
Subjt:  SGRKGSSHCGVGIKRQLIGSFKLDVGPEWGDGKPVILFNGWISIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKF

Query:  SRDRVSQADSLSNYWSSSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGAWLIVRPDVCIPDNWQPWGKLEAWRERGIR
        SRDRVSQADSLSNYWS  GDGSDLE ERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPG+WLIVRPDVCIP++WQPWGKLEAWRERGIR
Subjt:  SRDRVSQADSLSNYWSSSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGAWLIVRPDVCIPDNWQPWGKLEAWRERGIR

Query:  DTVCCRFHLLSEAPERGELLMSEIHINAEKGGEFFIDTDKQLRAASSPIPSPRSSGDFTALCQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAI
        DTVCCRFHLLSEA E GELLMSEIHINAEKGGEFFIDTDKQLRAA+SPIPSP+SSGDF AL QVVGGFVMSCRVQGEG+SSKP VQLAMRHVTCIEDAAI
Subjt:  DTVCCRFHLLSEAPERGELLMSEIHINAEKGGEFFIDTDKQLRAASSPIPSPRSSGDFTALCQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAI

Query:  FMALAAAVDLSIEACRPFQRKIRRASRHS
        FMALAAAVDLSIEACRPF+RKIRR  RHS
Subjt:  FMALAAAVDLSIEACRPFQRKIRRASRHS

XP_022155757.1 uncharacterized protein LOC111022801 [Momordica charantia]1.0e-23291.38Show/hide
Query:  MDPQAFIRLSIGSLGLRIPGAALNAANPGVNAFSSLCSCEIRLRGFPVQTSSIPLLPSPESVPDSHSIASIFYLEESDLKALLAPGCFYNTHACLEVSVF
        M+PQAFIRLSIGSLGLR+PG +LN+  PG+N+FSS CSCEIRLRGFPVQ+SSIP+LPSPE++P+SHSIAS FYLEESDLKALLAPGCFYNTHACLE+SVF
Subjt:  MDPQAFIRLSIGSLGLRIPGAALNAANPGVNAFSSLCSCEIRLRGFPVQTSSIPLLPSPESVPDSHSIASIFYLEESDLKALLAPGCFYNTHACLEVSVF

Query:  SGRKGSSHCGVGIKRQLIGSFKLDVGPEWGDGKPVILFNGWISIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKF
        SGRKG SHCGVGIKRQLIG+FKLDVGPEWGDGKPVILFNGWI IGKSKNENGRHGAELHL+VKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKF
Subjt:  SGRKGSSHCGVGIKRQLIGSFKLDVGPEWGDGKPVILFNGWISIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKF

Query:  SRDRVSQADSLSNYWSSSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGAWLIVRPDVCIPDNWQPWGKLEAWRERGIR
        SRDRVSQADSLSNYWS S DG DLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARS+PG+WLIVRPDVCIP++WQPWGKLEAWRERGIR
Subjt:  SRDRVSQADSLSNYWSSSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGAWLIVRPDVCIPDNWQPWGKLEAWRERGIR

Query:  DTVCCRFHLLSEAPERGELLMSEIHINAEKGGEFFIDTDKQLRAASSPIPSPRSSGDFTALCQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAI
        DTVCCRFHLLSEA E GELLMSEIHINAEKGGEFFIDTDKQLRAA+SPIPSP+SSGDF AL QVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTC+EDAAI
Subjt:  DTVCCRFHLLSEAPERGELLMSEIHINAEKGGEFFIDTDKQLRAASSPIPSPRSSGDFTALCQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAI

Query:  FMALAAAVDLSIEACRPFQRKIRRASRHS
        FMALAAAVDLSIEACRPF+RKIRRA RHS
Subjt:  FMALAAAVDLSIEACRPFQRKIRRASRHS

XP_022964109.1 uncharacterized protein LOC111464238 [Cucurbita moschata]6.5e-23291.61Show/hide
Query:  MDPQAFIRLSIGSLGLRIPGAALNAANPGVNAFSSLCSCEIRLRGFPVQTSSIPLLPSPESVPDSHSIASIFYLEESDLKALLAPGCFYNTHACLEVSVF
        MDPQAFIRLSIGSLGLRIPG A+N+  PGV+A SS CSCEIRLRGFPVQTSSIP++ SPE++PDSHSIAS FYLEESDLKALLAPGCFYNTHACLE+SVF
Subjt:  MDPQAFIRLSIGSLGLRIPGAALNAANPGVNAFSSLCSCEIRLRGFPVQTSSIPLLPSPESVPDSHSIASIFYLEESDLKALLAPGCFYNTHACLEVSVF

Query:  SGRKGSSHCGVGIKRQLIGSFKLDVGPEWGDGKPVILFNGWISIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKF
        SGRKG SHCGVGIKRQLIG+FKLDVGPEWGDGKPV+LFNGWI IGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQL+GSIKQPIFSCKF
Subjt:  SGRKGSSHCGVGIKRQLIGSFKLDVGPEWGDGKPVILFNGWISIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKF

Query:  SRDRVSQADSLSNYWSSSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGAWLIVRPDVCIPDNWQPWGKLEAWRERGIR
        SRDRVSQADSLSNYWS SGDGSD+EAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPG+WLIVRPDV IPD+WQPWGKLEAWRERGIR
Subjt:  SRDRVSQADSLSNYWSSSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGAWLIVRPDVCIPDNWQPWGKLEAWRERGIR

Query:  DTVCCRFHLLSEAPERGELLMSEIHINAEKGGEFFIDTDKQLRAASSPIPSPRSSGDFTALCQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAI
        DTVCCRFHLLSE  E GELLMSEIHINAEKGGEFFIDTDKQLRAA SPIPSP+SSGDF AL QV+GGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAI
Subjt:  DTVCCRFHLLSEAPERGELLMSEIHINAEKGGEFFIDTDKQLRAASSPIPSPRSSGDFTALCQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAI

Query:  FMALAAAVDLSIEACRPFQRKIRRASRHS
        FMA+AAAVDLSIEACRPF+RKIRRA RHS
Subjt:  FMALAAAVDLSIEACRPFQRKIRRASRHS

XP_038874344.1 uncharacterized protein LOC120067041 [Benincasa hispida]7.4e-23693.24Show/hide
Query:  MDPQAFIRLSIGSLGLRIPGAALNAANPGVNAFSSLCSCEIRLRGFPVQTSSIPLLPSPESVPDSHSIASIFYLEESDLKALLAPGCFYNTHACLEVSVF
        MDPQAFIRLSIGSLGLRIPG +LN+  PGVNAFSS CSCEIRLRGFPVQTSSIPL+PSPE++PDSHSI S FYLEESDLKALLAPGCFYNTHACLE+SVF
Subjt:  MDPQAFIRLSIGSLGLRIPGAALNAANPGVNAFSSLCSCEIRLRGFPVQTSSIPLLPSPESVPDSHSIASIFYLEESDLKALLAPGCFYNTHACLEVSVF

Query:  SGRKGSSHCGVGIKRQLIGSFKLDVGPEWGDGKPVILFNGWISIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKF
        SGRKG SHCGVGIKRQLIG+FKLDVGPEWGDGKPVILFNGWI IGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKF
Subjt:  SGRKGSSHCGVGIKRQLIGSFKLDVGPEWGDGKPVILFNGWISIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKF

Query:  SRDRVSQADSLSNYWSSSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGAWLIVRPDVCIPDNWQPWGKLEAWRERGIR
        SRDRVSQADSLSNYWS  GDGSDLE ERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPG+WLIVRPDVCIP++WQPWGKLEAWRERGIR
Subjt:  SRDRVSQADSLSNYWSSSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGAWLIVRPDVCIPDNWQPWGKLEAWRERGIR

Query:  DTVCCRFHLLSEAPERGELLMSEIHINAEKGGEFFIDTDKQLRAASSPIPSPRSSGDFTALCQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAI
        DTVCCRFHLLSEA E GELLMSEIHINAEKGGEFFIDTDKQLRAA+SPIPSP+SSGDF AL QVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAI
Subjt:  DTVCCRFHLLSEAPERGELLMSEIHINAEKGGEFFIDTDKQLRAASSPIPSPRSSGDFTALCQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAI

Query:  FMALAAAVDLSIEACRPFQRKIRRASRHS
        FMALAAAVDLSIEACRPF+RKIRRA RHS
Subjt:  FMALAAAVDLSIEACRPFQRKIRRASRHS

TrEMBL top hitse value%identityAlignment
A0A0A0K922 Uncharacterized protein7.0e-23291.14Show/hide
Query:  MDPQAFIRLSIGSLGLRIPGAALNAANPGVNAFSSLCSCEIRLRGFPVQTSSIPLLPSPESVPDSHSIASIFYLEESDLKALLAPGCFYNTHACLEVSVF
        MDPQAFIRLSIGSLGLRIPG +LN+  PGVNAFSS CSCEIRLRGFP+QTSSIPL+PSPE++PDSH IAS FYLEESDLKALLAPGCFYNTHACLE+SVF
Subjt:  MDPQAFIRLSIGSLGLRIPGAALNAANPGVNAFSSLCSCEIRLRGFPVQTSSIPLLPSPESVPDSHSIASIFYLEESDLKALLAPGCFYNTHACLEVSVF

Query:  SGRKGSSHCGVGIKRQLIGSFKLDVGPEWGDGKPVILFNGWISIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKF
        SGRKG SHCGVGIKRQLIG+FKLDV PEWGDGKPVILFNGWI IGKSKNENGRHGAELHLRVKLDPDPRYVFQF+DVTR SPQ+VQL+GSIKQPIFSCKF
Subjt:  SGRKGSSHCGVGIKRQLIGSFKLDVGPEWGDGKPVILFNGWISIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKF

Query:  SRDRVSQADSLSNYWSSSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGAWLIVRPDVCIPDNWQPWGKLEAWRERGIR
        SRDRVSQADSLSNYWS  GDGSDLE ERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPG+WLIVRPDVCIP++WQPWGKLEAWRERGIR
Subjt:  SRDRVSQADSLSNYWSSSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGAWLIVRPDVCIPDNWQPWGKLEAWRERGIR

Query:  DTVCCRFHLLSEAPERGELLMSEIHINAEKGGEFFIDTDKQLRAASSPIPSPRSSGDFTALCQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAI
        DTVCCRFHLLSEA E GELLMSEIHINAEKGGEFFIDTDKQLRAA+SPIPSP+SSGDF AL QVVGGFVMSCRVQGEG+SSKP VQLAMRHVTCIEDAAI
Subjt:  DTVCCRFHLLSEAPERGELLMSEIHINAEKGGEFFIDTDKQLRAASSPIPSPRSSGDFTALCQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAI

Query:  FMALAAAVDLSIEACRPFQRKIRRASRHS
        FMALAAAVDLSIEACRPF+RKIRR  RHS
Subjt:  FMALAAAVDLSIEACRPFQRKIRRASRHS

A0A1S3CCZ5 uncharacterized protein LOC1034990023.1e-23291.61Show/hide
Query:  MDPQAFIRLSIGSLGLRIPGAALNAANPGVNAFSSLCSCEIRLRGFPVQTSSIPLLPSPESVPDSHSIASIFYLEESDLKALLAPGCFYNTHACLEVSVF
        MDPQAFIRLSIGSLGLRIPG +LN+  PGVNAFSS CSCEIRLRGFP+QTSSIPL+PSPE++PDSH IAS FYLEESDLKALLAPGCFYNTHACLE+SVF
Subjt:  MDPQAFIRLSIGSLGLRIPGAALNAANPGVNAFSSLCSCEIRLRGFPVQTSSIPLLPSPESVPDSHSIASIFYLEESDLKALLAPGCFYNTHACLEVSVF

Query:  SGRKGSSHCGVGIKRQLIGSFKLDVGPEWGDGKPVILFNGWISIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKF
        SGRKG SHCGVGIKRQLIG+FKLDV PEWGDGKPVILFNGWI IGKSKNENGRHGAELHLRVKLDPDPRYVFQF+DVTR SPQIVQLQGSIKQPIFSCKF
Subjt:  SGRKGSSHCGVGIKRQLIGSFKLDVGPEWGDGKPVILFNGWISIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKF

Query:  SRDRVSQADSLSNYWSSSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGAWLIVRPDVCIPDNWQPWGKLEAWRERGIR
        SRDRVSQADSLSNYWS  GDGSDLE ERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPG+WLIVRPDVCIP++WQPWGKLEAWRERGIR
Subjt:  SRDRVSQADSLSNYWSSSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGAWLIVRPDVCIPDNWQPWGKLEAWRERGIR

Query:  DTVCCRFHLLSEAPERGELLMSEIHINAEKGGEFFIDTDKQLRAASSPIPSPRSSGDFTALCQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAI
        DTVCCRFHLLSEA E GELLMSEIHINAEKGGEFFIDTDKQLRAA+SPIPSP+SSGDF AL QVVGGFVMSCRVQGEG+SSKP VQLAMRHVTCIEDAAI
Subjt:  DTVCCRFHLLSEAPERGELLMSEIHINAEKGGEFFIDTDKQLRAASSPIPSPRSSGDFTALCQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAI

Query:  FMALAAAVDLSIEACRPFQRKIRRASRHS
        FMALAAAVDLSIEACRPF+RKIRR  RHS
Subjt:  FMALAAAVDLSIEACRPFQRKIRRASRHS

A0A5A7TE92 Nuclear factor 1 A-type isoform 21.4e-23291.84Show/hide
Query:  MDPQAFIRLSIGSLGLRIPGAALNAANPGVNAFSSLCSCEIRLRGFPVQTSSIPLLPSPESVPDSHSIASIFYLEESDLKALLAPGCFYNTHACLEVSVF
        MDPQAFIRLSIGSLGLRIPG +LN+  PGVNAFSS CSCEIRLRGFP+QTSSIPLLPSPE++PDSH IAS FYLEESDLKALLAPGCFYNTHACLE+SVF
Subjt:  MDPQAFIRLSIGSLGLRIPGAALNAANPGVNAFSSLCSCEIRLRGFPVQTSSIPLLPSPESVPDSHSIASIFYLEESDLKALLAPGCFYNTHACLEVSVF

Query:  SGRKGSSHCGVGIKRQLIGSFKLDVGPEWGDGKPVILFNGWISIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKF
        SGRKG SHCGVGIKRQLIG+FKLDV PEWGDGKPVILFNGWI IGKSKNENGRHGAELHLRVKLDPDPRYVFQF+DVTR SPQIVQLQGSIKQPIFSCKF
Subjt:  SGRKGSSHCGVGIKRQLIGSFKLDVGPEWGDGKPVILFNGWISIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKF

Query:  SRDRVSQADSLSNYWSSSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGAWLIVRPDVCIPDNWQPWGKLEAWRERGIR
        SRDRVSQADSLSNYWS  GDGSDLE ERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPG+WLIVRPDVCIP++WQPWGKLEAWRERGIR
Subjt:  SRDRVSQADSLSNYWSSSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGAWLIVRPDVCIPDNWQPWGKLEAWRERGIR

Query:  DTVCCRFHLLSEAPERGELLMSEIHINAEKGGEFFIDTDKQLRAASSPIPSPRSSGDFTALCQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAI
        DTVCCRFHLLSEA E GELLMSEIHINAEKGGEFFIDTDKQLRAA+SPIPSP+SSGDF AL QVVGGFVMSCRVQGEG+SSKP VQLAMRHVTCIEDAAI
Subjt:  DTVCCRFHLLSEAPERGELLMSEIHINAEKGGEFFIDTDKQLRAASSPIPSPRSSGDFTALCQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAI

Query:  FMALAAAVDLSIEACRPFQRKIRRASRHS
        FMALAAAVDLSIEACRPF+RKIRR  RHS
Subjt:  FMALAAAVDLSIEACRPFQRKIRRASRHS

A0A6J1DQ78 uncharacterized protein LOC1110228014.8e-23391.38Show/hide
Query:  MDPQAFIRLSIGSLGLRIPGAALNAANPGVNAFSSLCSCEIRLRGFPVQTSSIPLLPSPESVPDSHSIASIFYLEESDLKALLAPGCFYNTHACLEVSVF
        M+PQAFIRLSIGSLGLR+PG +LN+  PG+N+FSS CSCEIRLRGFPVQ+SSIP+LPSPE++P+SHSIAS FYLEESDLKALLAPGCFYNTHACLE+SVF
Subjt:  MDPQAFIRLSIGSLGLRIPGAALNAANPGVNAFSSLCSCEIRLRGFPVQTSSIPLLPSPESVPDSHSIASIFYLEESDLKALLAPGCFYNTHACLEVSVF

Query:  SGRKGSSHCGVGIKRQLIGSFKLDVGPEWGDGKPVILFNGWISIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKF
        SGRKG SHCGVGIKRQLIG+FKLDVGPEWGDGKPVILFNGWI IGKSKNENGRHGAELHL+VKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKF
Subjt:  SGRKGSSHCGVGIKRQLIGSFKLDVGPEWGDGKPVILFNGWISIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKF

Query:  SRDRVSQADSLSNYWSSSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGAWLIVRPDVCIPDNWQPWGKLEAWRERGIR
        SRDRVSQADSLSNYWS S DG DLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARS+PG+WLIVRPDVCIP++WQPWGKLEAWRERGIR
Subjt:  SRDRVSQADSLSNYWSSSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGAWLIVRPDVCIPDNWQPWGKLEAWRERGIR

Query:  DTVCCRFHLLSEAPERGELLMSEIHINAEKGGEFFIDTDKQLRAASSPIPSPRSSGDFTALCQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAI
        DTVCCRFHLLSEA E GELLMSEIHINAEKGGEFFIDTDKQLRAA+SPIPSP+SSGDF AL QVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTC+EDAAI
Subjt:  DTVCCRFHLLSEAPERGELLMSEIHINAEKGGEFFIDTDKQLRAASSPIPSPRSSGDFTALCQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAI

Query:  FMALAAAVDLSIEACRPFQRKIRRASRHS
        FMALAAAVDLSIEACRPF+RKIRRA RHS
Subjt:  FMALAAAVDLSIEACRPFQRKIRRASRHS

A0A6J1HM61 uncharacterized protein LOC1114642383.1e-23291.61Show/hide
Query:  MDPQAFIRLSIGSLGLRIPGAALNAANPGVNAFSSLCSCEIRLRGFPVQTSSIPLLPSPESVPDSHSIASIFYLEESDLKALLAPGCFYNTHACLEVSVF
        MDPQAFIRLSIGSLGLRIPG A+N+  PGV+A SS CSCEIRLRGFPVQTSSIP++ SPE++PDSHSIAS FYLEESDLKALLAPGCFYNTHACLE+SVF
Subjt:  MDPQAFIRLSIGSLGLRIPGAALNAANPGVNAFSSLCSCEIRLRGFPVQTSSIPLLPSPESVPDSHSIASIFYLEESDLKALLAPGCFYNTHACLEVSVF

Query:  SGRKGSSHCGVGIKRQLIGSFKLDVGPEWGDGKPVILFNGWISIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKF
        SGRKG SHCGVGIKRQLIG+FKLDVGPEWGDGKPV+LFNGWI IGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQL+GSIKQPIFSCKF
Subjt:  SGRKGSSHCGVGIKRQLIGSFKLDVGPEWGDGKPVILFNGWISIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKF

Query:  SRDRVSQADSLSNYWSSSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGAWLIVRPDVCIPDNWQPWGKLEAWRERGIR
        SRDRVSQADSLSNYWS SGDGSD+EAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPG+WLIVRPDV IPD+WQPWGKLEAWRERGIR
Subjt:  SRDRVSQADSLSNYWSSSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGAWLIVRPDVCIPDNWQPWGKLEAWRERGIR

Query:  DTVCCRFHLLSEAPERGELLMSEIHINAEKGGEFFIDTDKQLRAASSPIPSPRSSGDFTALCQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAI
        DTVCCRFHLLSE  E GELLMSEIHINAEKGGEFFIDTDKQLRAA SPIPSP+SSGDF AL QV+GGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAI
Subjt:  DTVCCRFHLLSEAPERGELLMSEIHINAEKGGEFFIDTDKQLRAASSPIPSPRSSGDFTALCQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAI

Query:  FMALAAAVDLSIEACRPFQRKIRRASRHS
        FMA+AAAVDLSIEACRPF+RKIRRA RHS
Subjt:  FMALAAAVDLSIEACRPFQRKIRRASRHS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G10020.1 Protein of unknown function (DUF1005)1.7e-9241.42Show/hide
Query:  MDPQAFIRLSIGSLGLRIPGAALNAANPGVNAFSSLCSCEIRLRGFPVQTSSIPLLP-SPESVPDSHSIASIFYLEESDLKALLAPGCFYNTHACLEVSV
        MDP  FIRL+IG+L L++P AA   ++  V+  SS C C+I+L+ FP QT++IP +P      P+  ++A+ F+L  SD++  LA    + +  CL++ +
Subjt:  MDPQAFIRLSIGSLGLRIPGAALNAANPGVNAFSSLCSCEIRLRGFPVQTSSIPLLP-SPESVPDSHSIASIFYLEESDLKALLAPGCFYNTHACLEVSV

Query:  FSGRKGSSHCGVGIKRQLIGSFKLDVGPEWGDGKPVILFNGWISIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCK
        ++GR G++ CGV   R L+    + +       KP +  NGWIS+GK   ++    A+ HL VK +PDPR+VFQF+     SPQ+VQ+QG+I+QP+F+CK
Subjt:  FSGRKGSSHCGVGIKRQLIGSFKLDVGPEWGDGKPVILFNGWISIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCK

Query:  FS----RDRVSQADSL------SNYWSSSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGAWLIVRPDVCIPDNWQPWG
        FS     DR  ++ SL      S  W +S  GS+ E   +ERKGW + +HDLSGS VA A I TPFV S G D V+RSNPG+WLI+RP  C    W+PWG
Subjt:  FS----RDRVSQADSL------SNYWSSSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGAWLIVRPDVCIPDNWQPWG

Query:  KLEAWRER-GIRDTVCCRFHLLSEAPERGELLMSEIHINAEKGGEFFIDTDKQLRAAS----------------------SPIPSPR-SSGDF---TALC
        +LEAWRER G  D +  RF L+ +      ++++E  I++ +GG+F I+      ++S                      SP  SPR  SGD+       
Subjt:  KLEAWRER-GIRDTVCCRFHLLSEAPERGELLMSEIHINAEKGGEFFIDTDKQLRAAS----------------------SPIPSPR-SSGDF---TALC

Query:  QVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIFMALAAAVDLSIEACRPFQRKIRRASRH
         V  GFVMS  V+GEGK SKP V+++++HV+C+EDAA ++AL+AA+DLS++ACR F +++R+   H
Subjt:  QVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIFMALAAAVDLSIEACRPFQRKIRRASRH

AT1G50040.1 Protein of unknown function (DUF1005)7.0e-7538.15Show/hide
Query:  MDPQAFIRLSIGSLGLRIP------GAALNAANPGVNAFSS-LCSCEIRLRGFPVQTSSIPLLPSPESVPDSH-------SIASIFYLEESDLKALLAPG
        MDP +F+R+ +G+L +R P       ++ +++ P V+  SS  C C+I+ + FP Q  S+P+L   ES  +S        ++A+ F L +S ++  L   
Subjt:  MDPQAFIRLSIGSLGLRIP------GAALNAANPGVNAFSS-LCSCEIRLRGFPVQTSSIPLLPSPESVPDSH-------SIASIFYLEESDLKALLAPG

Query:  CFYNTHACLEVSVFSGRKGSSHCGVGIKRQLIGSFKLDVGPEWGDGKPVILFNGWISIGKSKNENGRHGA--ELHLRVKLDPDPRYVFQFEDVTRLSPQI
         +    + L V V+S R  S         +LIG F++ +  +  + K  +  NGW+ +G     N + G+  ELH+ V+++PD R+VFQF+     SPQ+
Subjt:  CFYNTHACLEVSVFSGRKGSSHCGVGIKRQLIGSFKLDVGPEWGDGKPVILFNGWISIGKSKNENGRHGA--ELHLRVKLDPDPRYVFQFEDVTRLSPQI

Query:  VQLQGSIKQPIFSCKFSRDRVSQADSLSNYWSSSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGAWLIVRPDVCIPDN
         Q+QG+ KQ +F+CKF   R S   +LS   SS   G   E   +ERKGW + IHDLSGS VA A + TPFVPS G + V+RS+PGAWLI+RPD      
Subjt:  VQLQGSIKQPIFSCKFSRDRVSQADSLSNYWSSSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGAWLIVRPDVCIPDN

Query:  WQPWGKLEAWRERGIRDTVCCRFHLLSEAPERGELLMSEIHINAEKGGEFFIDTDKQLRAAS--------------SPIPSPR----SSGDF---TALCQ
        W+PW +L+AWRE G+ D +  RF L  +       + +   I+ + GG F ID        +              S I S R    S  DF    +  Q
Subjt:  WQPWGKLEAWRERGIRDTVCCRFHLLSEAPERGELLMSEIHINAEKGGEFFIDTDKQLRAAS--------------SPIPSPR----SSGDF---TALCQ

Query:  VVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIFMALAAAVDLSIEACRPFQRKIRRASR
           GFVMS RVQG  K SKP V++ ++HVTC EDAA  +ALAAAVDLS++ACR F +K+R   R
Subjt:  VVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIFMALAAAVDLSIEACRPFQRKIRRASR

AT3G19680.1 Protein of unknown function (DUF1005)3.7e-7635.83Show/hide
Query:  MDPQAFIRLSIGSLGLRIPGAALNAAN------PGVNAFSSLCSCEIRLRGFPVQTSSIPLLPSPESVPDSH--------SIASIFYLEESDLKALLAPG
        MDP +F+R+ +G+L +R P ++ ++++       G+N  +  C C+IR + FP +  S+P++   ES  ++         ++A+ F L ++ ++A L   
Subjt:  MDPQAFIRLSIGSLGLRIPGAALNAAN------PGVNAFSSLCSCEIRLRGFPVQTSSIPLLPSPESVPDSH--------SIASIFYLEESDLKALLAPG

Query:  CFYNTHACLEVSVFS--------GRKGSSHCGVGIK-RQLIGSFKLDVGPEWGDGKPVILFNGWISI--GKSKNENGRHGAELHLRVKLDPDPRYVFQFE
         F    + L V  +S        G  G+S CG+     +L+G F++ +  +  + K  +  NGW+++   K+K++ G    ELH+ V+++PDPR+VFQF+
Subjt:  CFYNTHACLEVSVFS--------GRKGSSHCGVGIK-RQLIGSFKLDVGPEWGDGKPVILFNGWISI--GKSKNENGRHGAELHLRVKLDPDPRYVFQFE

Query:  DVTRLSPQIVQLQGSIKQPIFSCKF--------SRDRVSQADSLSNYWSS----SGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDW
             SPQ+ Q+QG+ KQ +F+CKF         R+ +  +  +S   S+    S   S+ E   +ERKGW + +HDLSGS VA A + TPFVPS G + 
Subjt:  DVTRLSPQIVQLQGSIKQPIFSCKF--------SRDRVSQADSLSNYWSS----SGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDW

Query:  VARSNPGAWLIVRPDVCIPDNWQPWGKLEAWRERGIRDTVCCRFHLLSEAPERGELLMSEIHINAEKGGEFFID-TDKQLRAASSPIPSPRSSGDF----
        V RS+PGAWLI+RPD C    W+PWG+LEAWRE G  DT+  RF L  +         S I +  + GG F ID T      AS+P  SP+ S D     
Subjt:  VARSNPGAWLIVRPDVCIPDNWQPWGKLEAWRERGIRDTVCCRFHLLSEAPERGELLMSEIHINAEKGGEFFID-TDKQLRAASSPIPSPRSSGDF----

Query:  -------------------------TALCQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIFMALAAAVDLSIEACRPFQRKIRRASR
                                  +      GFVMS  V+G GK SKP V++ + HVTC EDAA  +ALAAAVDLS++ACR F  K+R+  R
Subjt:  -------------------------TALCQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIFMALAAAVDLSIEACRPFQRKIRRASR

AT4G29310.1 Protein of unknown function (DUF1005)2.7e-8241.27Show/hide
Query:  MDPQAFIRLSIGSLGLRIPGAALNAANPG-VNAFSSLCSCEIRLRGFPVQTSSIPL--LPSPESVPDSHSIASIFYLEESDLKALLAPGCFYNTHACLEV
        MDP  F+RL+I SL LR+P  A N    G V+  S+ C C++R++ FP Q + +PL       S P+S + A  F+L+   ++ +            L V
Subjt:  MDPQAFIRLSIGSLGLRIPGAALNAANPG-VNAFSSLCSCEIRLRGFPVQTSSIPL--LPSPESVPDSHSIASIFYLEESDLKALLAPGCFYNTHACLEV

Query:  SVFSGRKGSSHCGVGIKRQLIGSFKLDVGPEWGDGKPVILFNGWISIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFS
        SV++GR G + CGV    +L+G  ++ V       + V   NGW  +G    +  +  A LHL V  +PDPR+VFQF      SP + Q+Q ++KQP+FS
Subjt:  SVFSGRKGSSHCGVGIKRQLIGSFKLDVGPEWGDGKPVILFNGWISIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFS

Query:  CKFSRDRVSQADSL-------SNYW---SSSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGAWLIVRPDVCIPDNWQP
        CKFS DR  ++ SL       S  W   + SGD  + + + RERKGW + IHDLSGS VAAA + TPFV S G D V+RSNPGAWLI+RP      +W+P
Subjt:  CKFSRDRVSQADSL-------SNYW---SSSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGAWLIVRPDVCIPDNWQP

Query:  WGKLEAWRERGIRDTVCCRFHLLSEAPERGELLMSEIHINAEKGGEFFIDTDKQLRAASSPIPSPRSSGDFTALCQVVGGFVMSCRVQGEGKSSKPMVQL
        WG+LEAWRERG  D +  +F L+ +      + ++E  ++ ++GG+F ID     +  S  I SP            V GFVM   V+GEGK SKP+V +
Subjt:  WGKLEAWRERGIRDTVCCRFHLLSEAPERGELLMSEIHINAEKGGEFFIDTDKQLRAASSPIPSPRSSGDFTALCQVVGGFVMSCRVQGEGKSSKPMVQL

Query:  AMRHVTCIEDAAIFMALAAAVDLSIEACRPFQRKIRRASRH
          +HVTC+ DAA+F+AL+AAVDLS++AC+ F RK+R+   H
Subjt:  AMRHVTCIEDAAIFMALAAAVDLSIEACRPFQRKIRRASRH

AT5G17640.1 Protein of unknown function (DUF1005)4.1e-19276.21Show/hide
Query:  MDPQAFIRLSIGSLGLRIPGAALNAANPG--VNAFSSLCSCEIRLRGFPVQTSSIPLLPSPESVPDSHSIASIFYLEESDLKALLAPGCFYNTHACLEVS
        MDPQAFIRLS+GSL LRIP   +N+ +       FSS CSCEI+LRGFPVQT+SIPL+PS ++ PD HSI++ FYLEESDL+ALL PGCFY+ HA LE+S
Subjt:  MDPQAFIRLSIGSLGLRIPGAALNAANPG--VNAFSSLCSCEIRLRGFPVQTSSIPLLPSPESVPDSHSIASIFYLEESDLKALLAPGCFYNTHACLEVS

Query:  VFSGRKGSSHCGVGIKRQLIGSFKLDVGPEWGDGKPVILFNGWISIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSC
        VF+G+K S +CGVG KRQ IG FKL+VGPEWG+GKP+ILFNGWISIGK+K +     AELHL+VKLDPDPRYVFQFEDVT LSPQIVQL+GS+KQPIFSC
Subjt:  VFSGRKGSSHCGVGIKRQLIGSFKLDVGPEWGDGKPVILFNGWISIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSC

Query:  KFSRDRVSQADSLSNYWSSSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGAWLIVRPDVCIPDNWQPWGKLEAWRERG
        KFSRDRVSQ D L+ YWSSSGDG++LE+ERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVA+SNPGAWL+VRPD   P++WQPWGKLEAWRERG
Subjt:  KFSRDRVSQADSLSNYWSSSGDGSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGAWLIVRPDVCIPDNWQPWGKLEAWRERG

Query:  IRDTVCCRFHLLSEAPERGELLMSEIHINAEKGGEFFIDTDKQ-LRAASSPIPSPRSSGDFTALCQVV--GGFVMSCRVQGEGKSSKPMVQLAMRHVTCI
        IRD+VCCRFHLLS   E G++LMSEI I+AEKGGEF IDTDKQ L  A++PIPSP+SSGDF+ L Q V  GGFVMS RVQGEGKSSKP+VQLAMRHVTC+
Subjt:  IRDTVCCRFHLLSEAPERGELLMSEIHINAEKGGEFFIDTDKQ-LRAASSPIPSPRSSGDFTALCQVV--GGFVMSCRVQGEGKSSKPMVQLAMRHVTCI

Query:  EDAAIFMALAAAVDLSIEACRPFQRKIRRASRH
        EDAAIFMALAAAVDLSI AC+PF+R  RR  RH
Subjt:  EDAAIFMALAAAVDLSIEACRPFQRKIRRASRH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATCCTCAGGCTTTTATTAGGTTGTCAATTGGCTCATTGGGACTGAGAATCCCCGGAGCTGCACTAAATGCTGCAAACCCCGGAGTCAATGCTTTCTCTTCTTTGTG
TTCATGCGAAATTCGTCTTCGGGGTTTCCCTGTGCAGACATCTTCAATCCCGTTACTTCCGTCTCCTGAATCCGTACCCGATTCTCATAGCATTGCCTCAATTTTCTATC
TTGAAGAGTCTGATTTGAAAGCATTACTGGCACCCGGCTGCTTCTACAACACTCATGCTTGTCTTGAAGTATCTGTCTTCTCAGGACGGAAGGGTTCTTCCCACTGTGGT
GTTGGCATCAAAAGGCAGCTAATCGGGTCGTTTAAACTGGATGTCGGTCCCGAATGGGGTGATGGGAAGCCAGTCATTCTTTTCAATGGGTGGATAAGCATTGGCAAAAG
TAAGAATGAGAATGGAAGACATGGAGCAGAGCTTCATTTGAGAGTGAAACTAGATCCTGATCCAAGATATGTTTTTCAGTTTGAAGATGTCACGAGGTTAAGCCCGCAAA
TCGTCCAGCTTCAAGGCTCGATCAAACAGCCGATTTTCAGCTGCAAATTTAGTCGAGACAGGGTGTCCCAGGCAGATTCATTGAGCAACTATTGGTCAAGTTCTGGTGAT
GGCTCGGATCTCGAGGCTGAGCGGAGAGAAAGAAAAGGCTGGAAGGTGAAGATACATGATCTTTCTGGCTCGGCTGTTGCAGCAGCCTTCATAACTACTCCCTTTGTGCC
ATCAACAGGTTGTGATTGGGTTGCCAGGTCGAACCCCGGGGCCTGGCTGATTGTTCGTCCCGATGTTTGCATACCCGATAATTGGCAGCCATGGGGAAAGCTCGAGGCGT
GGCGCGAGCGTGGGATTAGAGACACTGTCTGCTGTCGTTTTCACCTACTCTCTGAAGCGCCAGAGAGAGGGGAACTTCTCATGTCTGAGATCCATATCAATGCCGAGAAA
GGCGGGGAGTTCTTCATAGACACTGACAAACAGTTGCGAGCAGCATCAAGTCCAATACCGAGCCCGCGGAGCAGCGGAGACTTTACAGCATTATGCCAAGTGGTTGGAGG
TTTCGTCATGAGCTGCAGAGTACAGGGGGAAGGAAAGAGCAGCAAGCCAATGGTTCAACTCGCAATGCGACATGTGACATGTATAGAGGATGCTGCCATTTTCATGGCAC
TTGCAGCTGCAGTCGATCTCAGCATCGAGGCATGTAGGCCATTCCAACGGAAGATCAGAAGAGCTTCTCGACATTCATAG
mRNA sequenceShow/hide mRNA sequence
GTTTCCACGATTTCTCTTTCTCTATCGCTTTTTCCTCCACTCCCCACTATTCCTCTCTTCGTTCAAACTCCCTAATCAGACTAAAAAAAAAAAAAAAAAATCGCTTTTAA
TTTCCCTTTCTCGAACAAAATTTCACCTCTAATTTGTCGATTTTGTTCTTGTTTCTTCTGAAAATCAAAAGGGGTTGTAGTATTTTTTGTTAGTATTGGTTTTTGTAGGT
CCAGATTTTATATAAATATAAATATATATATATATATATTAACCATTGGTACTGTTTGGTTTTGATTACAAATTAAATAAAACCCTTTTCCCCCTTAAATTTCTCCGTCT
ATTTCGTGACAAATAATGCAGATTTCTGTGAGAAGAAATGGAATATTATAAGCTTTTGTGGGTTGGGTTTATTTTGGGGTTTGTGAAGTGGCCTCGTGATCTGTGAATGA
CATTCAAGACTCTTGTTCATATGGGGATTGGGTTTTCTAAAAGCTCATTATTTGATTTGTGCTTGTTTGTTTGATGAGCCCTTTTGTTTAAAACATTCATTCTTTAATAT
TTAGGGCGAAAATTTGAGAGAAGGGGAAGTGGGGTTTGGTTTTGGTGTTTGCTTGTTGGGGGAAATGTTCTTCTTGTTCTTGTTCAGGGATTCTGGGTGCTCTGATTTCT
GTGCAGCACTGTAAAACCTTTCTACTTGGCTACTGATCACAAGGATACTTGAGCACGTCGAAATATAAACGATCACATCAAATGGGCAAAAAATTGTGATTGAAATGGAT
CCTCAGGCTTTTATTAGGTTGTCAATTGGCTCATTGGGACTGAGAATCCCCGGAGCTGCACTAAATGCTGCAAACCCCGGAGTCAATGCTTTCTCTTCTTTGTGTTCATG
CGAAATTCGTCTTCGGGGTTTCCCTGTGCAGACATCTTCAATCCCGTTACTTCCGTCTCCTGAATCCGTACCCGATTCTCATAGCATTGCCTCAATTTTCTATCTTGAAG
AGTCTGATTTGAAAGCATTACTGGCACCCGGCTGCTTCTACAACACTCATGCTTGTCTTGAAGTATCTGTCTTCTCAGGACGGAAGGGTTCTTCCCACTGTGGTGTTGGC
ATCAAAAGGCAGCTAATCGGGTCGTTTAAACTGGATGTCGGTCCCGAATGGGGTGATGGGAAGCCAGTCATTCTTTTCAATGGGTGGATAAGCATTGGCAAAAGTAAGAA
TGAGAATGGAAGACATGGAGCAGAGCTTCATTTGAGAGTGAAACTAGATCCTGATCCAAGATATGTTTTTCAGTTTGAAGATGTCACGAGGTTAAGCCCGCAAATCGTCC
AGCTTCAAGGCTCGATCAAACAGCCGATTTTCAGCTGCAAATTTAGTCGAGACAGGGTGTCCCAGGCAGATTCATTGAGCAACTATTGGTCAAGTTCTGGTGATGGCTCG
GATCTCGAGGCTGAGCGGAGAGAAAGAAAAGGCTGGAAGGTGAAGATACATGATCTTTCTGGCTCGGCTGTTGCAGCAGCCTTCATAACTACTCCCTTTGTGCCATCAAC
AGGTTGTGATTGGGTTGCCAGGTCGAACCCCGGGGCCTGGCTGATTGTTCGTCCCGATGTTTGCATACCCGATAATTGGCAGCCATGGGGAAAGCTCGAGGCGTGGCGCG
AGCGTGGGATTAGAGACACTGTCTGCTGTCGTTTTCACCTACTCTCTGAAGCGCCAGAGAGAGGGGAACTTCTCATGTCTGAGATCCATATCAATGCCGAGAAAGGCGGG
GAGTTCTTCATAGACACTGACAAACAGTTGCGAGCAGCATCAAGTCCAATACCGAGCCCGCGGAGCAGCGGAGACTTTACAGCATTATGCCAAGTGGTTGGAGGTTTCGT
CATGAGCTGCAGAGTACAGGGGGAAGGAAAGAGCAGCAAGCCAATGGTTCAACTCGCAATGCGACATGTGACATGTATAGAGGATGCTGCCATTTTCATGGCACTTGCAG
CTGCAGTCGATCTCAGCATCGAGGCATGTAGGCCATTCCAACGGAAGATCAGAAGAGCTTCTCGACATTCATAGGAAAAAAAAGTGTTAGTGTAAAATTGATGTGTAGAA
ACATAAACAAAACTGGGCATACAACTGTGAATTTATATCTGAAGTTATAAAATCTGGCATTTTGTACAGTATAAGATGATGTCCTACTTATCTAATCTTCAA
Protein sequenceShow/hide protein sequence
MDPQAFIRLSIGSLGLRIPGAALNAANPGVNAFSSLCSCEIRLRGFPVQTSSIPLLPSPESVPDSHSIASIFYLEESDLKALLAPGCFYNTHACLEVSVFSGRKGSSHCG
VGIKRQLIGSFKLDVGPEWGDGKPVILFNGWISIGKSKNENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFSRDRVSQADSLSNYWSSSGD
GSDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGAWLIVRPDVCIPDNWQPWGKLEAWRERGIRDTVCCRFHLLSEAPERGELLMSEIHINAEK
GGEFFIDTDKQLRAASSPIPSPRSSGDFTALCQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCIEDAAIFMALAAAVDLSIEACRPFQRKIRRASRHS