; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0006228 (gene) of Chayote v1 genome

Gene IDSed0006228
OrganismSechium edule (Chayote v1)
DescriptionNuclear factor 1 A-type isoform 2
Genome locationLG09:1006732..1010216
RNA-Seq ExpressionSed0006228
SyntenySed0006228
Gene Ontology termsNA
InterPro domainsIPR010410 - Protein of unknown function DUF1005


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039981.1 Nuclear factor 1 A-type isoform 2 [Cucumis melo var. makuwa]1.0e-23292.29Show/hide
Query:  MDPQAFIRLSIGSLGLRIPVTVLNSTKPGVNAFPSPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSHSIASSFYLEESDLKALLTPGCFYNTHACLEISVF
        MDPQAFIRLSIGSLGLRIP T LNSTKPGVNAF SPCSCEIRLRGFP+QTSSIPLLPSPEAIPDSH IASSFYLEESDLKALL PGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRIPVTVLNSTKPGVNAFPSPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSHSIASSFYLEESDLKALLTPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGVKRQLIGTFKLDVGPEWEFGKSVILFNGWIGIGKSKNDNGRHGVELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQSIFSCKFS
        SGRKGSHCGVG+KRQLIGTFKLDV PEW  GK VILFNGWIGIGKSKN+NGRHG ELHLRVKLDPDPRYVFQF+DVTR SPQIVQLQGSIKQ IFSCKFS
Subjt:  SGRKGSHCGVGVKRQLIGTFKLDVGPEWEFGKSVILFNGWIGIGKSKNDNGRHGVELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQSIFSCKFS

Query:  RDRMSQADTLNNYWSSSGDGLDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVAKSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
        RDR+SQAD+L+NYWS  GDG DLE ERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVA+SNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
Subjt:  RDRMSQADTLNNYWSSSGDGLDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVAKSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD

Query:  TVCCRFHLISEAQEGGEILMSEIHINAEKGGEFFIDTDKQLRAA-SPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLGMRHVTCIEDAAIF
        TVCCRFHL+SEAQEGGE+LMSEIHINAEKGGEFFIDTDKQLRAA SPIPSPQSSGDFAALGQVVGGFVMSCRVQGEG+SSKP VQL MRHVTCIEDAAIF
Subjt:  TVCCRFHLISEAQEGGEILMSEIHINAEKGGEFFIDTDKQLRAA-SPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLGMRHVTCIEDAAIF

Query:  MALAAAVDLSIEACRPFQRKIRRLSRHS
        MALAAAVDLSIEACRPF+RKIRR  RHS
Subjt:  MALAAAVDLSIEACRPFQRKIRRLSRHS

XP_004153015.1 uncharacterized protein LOC101204096 [Cucumis sativus]4.9e-23291.59Show/hide
Query:  MDPQAFIRLSIGSLGLRIPVTVLNSTKPGVNAFPSPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSHSIASSFYLEESDLKALLTPGCFYNTHACLEISVF
        MDPQAFIRLSIGSLGLRIP T LNSTKPGVNAF SPCSCEIRLRGFP+QTSSIPL+PSPEAIPDSH IASSFYLEESDLKALL PGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRIPVTVLNSTKPGVNAFPSPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSHSIASSFYLEESDLKALLTPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGVKRQLIGTFKLDVGPEWEFGKSVILFNGWIGIGKSKNDNGRHGVELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQSIFSCKFS
        SGRKGSHCGVG+KRQLIGTFKLDV PEW  GK VILFNGWIGIGKSKN+NGRHG ELHLRVKLDPDPRYVFQF+DVTR SPQ+VQL+GSIKQ IFSCKFS
Subjt:  SGRKGSHCGVGVKRQLIGTFKLDVGPEWEFGKSVILFNGWIGIGKSKNDNGRHGVELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQSIFSCKFS

Query:  RDRMSQADTLNNYWSSSGDGLDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVAKSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
        RDR+SQAD+L+NYWS  GDG DLE ERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVA+SNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
Subjt:  RDRMSQADTLNNYWSSSGDGLDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVAKSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD

Query:  TVCCRFHLISEAQEGGEILMSEIHINAEKGGEFFIDTDKQLRAA-SPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLGMRHVTCIEDAAIF
        TVCCRFHL+SEAQEGGE+LMSEIHINAEKGGEFFIDTDKQLRAA SPIPSPQSSGDFAALGQVVGGFVMSCRVQGEG+SSKP VQL MRHVTCIEDAAIF
Subjt:  TVCCRFHLISEAQEGGEILMSEIHINAEKGGEFFIDTDKQLRAA-SPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLGMRHVTCIEDAAIF

Query:  MALAAAVDLSIEACRPFQRKIRRLSRHS
        MALAAAVDLSIEACRPF+RKIRR  RHS
Subjt:  MALAAAVDLSIEACRPFQRKIRRLSRHS

XP_008460086.1 PREDICTED: uncharacterized protein LOC103499002 [Cucumis melo]2.2e-23292.06Show/hide
Query:  MDPQAFIRLSIGSLGLRIPVTVLNSTKPGVNAFPSPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSHSIASSFYLEESDLKALLTPGCFYNTHACLEISVF
        MDPQAFIRLSIGSLGLRIP T LNSTKPGVNAF SPCSCEIRLRGFP+QTSSIPL+PSPEAIPDSH IASSFYLEESDLKALL PGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRIPVTVLNSTKPGVNAFPSPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSHSIASSFYLEESDLKALLTPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGVKRQLIGTFKLDVGPEWEFGKSVILFNGWIGIGKSKNDNGRHGVELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQSIFSCKFS
        SGRKGSHCGVG+KRQLIGTFKLDV PEW  GK VILFNGWIGIGKSKN+NGRHG ELHLRVKLDPDPRYVFQF+DVTR SPQIVQLQGSIKQ IFSCKFS
Subjt:  SGRKGSHCGVGVKRQLIGTFKLDVGPEWEFGKSVILFNGWIGIGKSKNDNGRHGVELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQSIFSCKFS

Query:  RDRMSQADTLNNYWSSSGDGLDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVAKSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
        RDR+SQAD+L+NYWS  GDG DLE ERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVA+SNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
Subjt:  RDRMSQADTLNNYWSSSGDGLDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVAKSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD

Query:  TVCCRFHLISEAQEGGEILMSEIHINAEKGGEFFIDTDKQLRAA-SPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLGMRHVTCIEDAAIF
        TVCCRFHL+SEAQEGGE+LMSEIHINAEKGGEFFIDTDKQLRAA SPIPSPQSSGDFAALGQVVGGFVMSCRVQGEG+SSKP VQL MRHVTCIEDAAIF
Subjt:  TVCCRFHLISEAQEGGEILMSEIHINAEKGGEFFIDTDKQLRAA-SPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLGMRHVTCIEDAAIF

Query:  MALAAAVDLSIEACRPFQRKIRRLSRHS
        MALAAAVDLSIEACRPF+RKIRR  RHS
Subjt:  MALAAAVDLSIEACRPFQRKIRRLSRHS

XP_022155757.1 uncharacterized protein LOC111022801 [Momordica charantia]1.2e-23391.82Show/hide
Query:  MDPQAFIRLSIGSLGLRIPVTVLNSTKPGVNAFPSPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSHSIASSFYLEESDLKALLTPGCFYNTHACLEISVF
        M+PQAFIRLSIGSLGLR+P   LNSTKPG+N+F SPCSCEIRLRGFPVQ+SSIP+LPSPEAIP+SHSIASSFYLEESDLKALL PGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRIPVTVLNSTKPGVNAFPSPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSHSIASSFYLEESDLKALLTPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGVKRQLIGTFKLDVGPEWEFGKSVILFNGWIGIGKSKNDNGRHGVELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQSIFSCKFS
        SGRKGSHCGVG+KRQLIGTFKLDVGPEW  GK VILFNGWIGIGKSKN+NGRHG ELHL+VKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQ IFSCKFS
Subjt:  SGRKGSHCGVGVKRQLIGTFKLDVGPEWEFGKSVILFNGWIGIGKSKNDNGRHGVELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQSIFSCKFS

Query:  RDRMSQADTLNNYWSSSGDGLDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVAKSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
        RDR+SQAD+L+NYWS S DGLDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVA+S+PGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
Subjt:  RDRMSQADTLNNYWSSSGDGLDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVAKSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD

Query:  TVCCRFHLISEAQEGGEILMSEIHINAEKGGEFFIDTDKQLR-AASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLGMRHVTCIEDAAIF
        TVCCRFHL+SEAQEGGE+LMSEIHINAEKGGEFFIDTDKQLR AASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQL MRHVTC+EDAAIF
Subjt:  TVCCRFHLISEAQEGGEILMSEIHINAEKGGEFFIDTDKQLR-AASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLGMRHVTCIEDAAIF

Query:  MALAAAVDLSIEACRPFQRKIRRLSRHS
        MALAAAVDLSIEACRPF+RKIRR  RHS
Subjt:  MALAAAVDLSIEACRPFQRKIRRLSRHS

XP_038874344.1 uncharacterized protein LOC120067041 [Benincasa hispida]7.4e-23693.46Show/hide
Query:  MDPQAFIRLSIGSLGLRIPVTVLNSTKPGVNAFPSPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSHSIASSFYLEESDLKALLTPGCFYNTHACLEISVF
        MDPQAFIRLSIGSLGLRIP T LNSTKPGVNAF SPCSCEIRLRGFPVQTSSIPL+PSPEAIPDSHSI SSFYLEESDLKALL PGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRIPVTVLNSTKPGVNAFPSPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSHSIASSFYLEESDLKALLTPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGVKRQLIGTFKLDVGPEWEFGKSVILFNGWIGIGKSKNDNGRHGVELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQSIFSCKFS
        SGRKGSHCGVG+KRQLIGTFKLDVGPEW  GK VILFNGWIGIGKSKN+NGRHG ELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQ IFSCKFS
Subjt:  SGRKGSHCGVGVKRQLIGTFKLDVGPEWEFGKSVILFNGWIGIGKSKNDNGRHGVELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQSIFSCKFS

Query:  RDRMSQADTLNNYWSSSGDGLDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVAKSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
        RDR+SQAD+L+NYWS  GDG DLE ERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVA+SNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
Subjt:  RDRMSQADTLNNYWSSSGDGLDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVAKSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD

Query:  TVCCRFHLISEAQEGGEILMSEIHINAEKGGEFFIDTDKQLRAA-SPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLGMRHVTCIEDAAIF
        TVCCRFHL+SEAQEGGE+LMSEIHINAEKGGEFFIDTDKQLRAA SPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQL MRHVTCIEDAAIF
Subjt:  TVCCRFHLISEAQEGGEILMSEIHINAEKGGEFFIDTDKQLRAA-SPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLGMRHVTCIEDAAIF

Query:  MALAAAVDLSIEACRPFQRKIRRLSRHS
        MALAAAVDLSIEACRPF+RKIRR  RHS
Subjt:  MALAAAVDLSIEACRPFQRKIRRLSRHS

TrEMBL top hitse value%identityAlignment
A0A0A0K922 Uncharacterized protein2.4e-23291.59Show/hide
Query:  MDPQAFIRLSIGSLGLRIPVTVLNSTKPGVNAFPSPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSHSIASSFYLEESDLKALLTPGCFYNTHACLEISVF
        MDPQAFIRLSIGSLGLRIP T LNSTKPGVNAF SPCSCEIRLRGFP+QTSSIPL+PSPEAIPDSH IASSFYLEESDLKALL PGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRIPVTVLNSTKPGVNAFPSPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSHSIASSFYLEESDLKALLTPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGVKRQLIGTFKLDVGPEWEFGKSVILFNGWIGIGKSKNDNGRHGVELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQSIFSCKFS
        SGRKGSHCGVG+KRQLIGTFKLDV PEW  GK VILFNGWIGIGKSKN+NGRHG ELHLRVKLDPDPRYVFQF+DVTR SPQ+VQL+GSIKQ IFSCKFS
Subjt:  SGRKGSHCGVGVKRQLIGTFKLDVGPEWEFGKSVILFNGWIGIGKSKNDNGRHGVELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQSIFSCKFS

Query:  RDRMSQADTLNNYWSSSGDGLDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVAKSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
        RDR+SQAD+L+NYWS  GDG DLE ERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVA+SNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
Subjt:  RDRMSQADTLNNYWSSSGDGLDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVAKSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD

Query:  TVCCRFHLISEAQEGGEILMSEIHINAEKGGEFFIDTDKQLRAA-SPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLGMRHVTCIEDAAIF
        TVCCRFHL+SEAQEGGE+LMSEIHINAEKGGEFFIDTDKQLRAA SPIPSPQSSGDFAALGQVVGGFVMSCRVQGEG+SSKP VQL MRHVTCIEDAAIF
Subjt:  TVCCRFHLISEAQEGGEILMSEIHINAEKGGEFFIDTDKQLRAA-SPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLGMRHVTCIEDAAIF

Query:  MALAAAVDLSIEACRPFQRKIRRLSRHS
        MALAAAVDLSIEACRPF+RKIRR  RHS
Subjt:  MALAAAVDLSIEACRPFQRKIRRLSRHS

A0A1S3CCZ5 uncharacterized protein LOC1034990021.1e-23292.06Show/hide
Query:  MDPQAFIRLSIGSLGLRIPVTVLNSTKPGVNAFPSPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSHSIASSFYLEESDLKALLTPGCFYNTHACLEISVF
        MDPQAFIRLSIGSLGLRIP T LNSTKPGVNAF SPCSCEIRLRGFP+QTSSIPL+PSPEAIPDSH IASSFYLEESDLKALL PGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRIPVTVLNSTKPGVNAFPSPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSHSIASSFYLEESDLKALLTPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGVKRQLIGTFKLDVGPEWEFGKSVILFNGWIGIGKSKNDNGRHGVELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQSIFSCKFS
        SGRKGSHCGVG+KRQLIGTFKLDV PEW  GK VILFNGWIGIGKSKN+NGRHG ELHLRVKLDPDPRYVFQF+DVTR SPQIVQLQGSIKQ IFSCKFS
Subjt:  SGRKGSHCGVGVKRQLIGTFKLDVGPEWEFGKSVILFNGWIGIGKSKNDNGRHGVELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQSIFSCKFS

Query:  RDRMSQADTLNNYWSSSGDGLDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVAKSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
        RDR+SQAD+L+NYWS  GDG DLE ERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVA+SNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
Subjt:  RDRMSQADTLNNYWSSSGDGLDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVAKSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD

Query:  TVCCRFHLISEAQEGGEILMSEIHINAEKGGEFFIDTDKQLRAA-SPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLGMRHVTCIEDAAIF
        TVCCRFHL+SEAQEGGE+LMSEIHINAEKGGEFFIDTDKQLRAA SPIPSPQSSGDFAALGQVVGGFVMSCRVQGEG+SSKP VQL MRHVTCIEDAAIF
Subjt:  TVCCRFHLISEAQEGGEILMSEIHINAEKGGEFFIDTDKQLRAA-SPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLGMRHVTCIEDAAIF

Query:  MALAAAVDLSIEACRPFQRKIRRLSRHS
        MALAAAVDLSIEACRPF+RKIRR  RHS
Subjt:  MALAAAVDLSIEACRPFQRKIRRLSRHS

A0A5A7TE92 Nuclear factor 1 A-type isoform 24.8e-23392.29Show/hide
Query:  MDPQAFIRLSIGSLGLRIPVTVLNSTKPGVNAFPSPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSHSIASSFYLEESDLKALLTPGCFYNTHACLEISVF
        MDPQAFIRLSIGSLGLRIP T LNSTKPGVNAF SPCSCEIRLRGFP+QTSSIPLLPSPEAIPDSH IASSFYLEESDLKALL PGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRIPVTVLNSTKPGVNAFPSPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSHSIASSFYLEESDLKALLTPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGVKRQLIGTFKLDVGPEWEFGKSVILFNGWIGIGKSKNDNGRHGVELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQSIFSCKFS
        SGRKGSHCGVG+KRQLIGTFKLDV PEW  GK VILFNGWIGIGKSKN+NGRHG ELHLRVKLDPDPRYVFQF+DVTR SPQIVQLQGSIKQ IFSCKFS
Subjt:  SGRKGSHCGVGVKRQLIGTFKLDVGPEWEFGKSVILFNGWIGIGKSKNDNGRHGVELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQSIFSCKFS

Query:  RDRMSQADTLNNYWSSSGDGLDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVAKSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
        RDR+SQAD+L+NYWS  GDG DLE ERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVA+SNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
Subjt:  RDRMSQADTLNNYWSSSGDGLDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVAKSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD

Query:  TVCCRFHLISEAQEGGEILMSEIHINAEKGGEFFIDTDKQLRAA-SPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLGMRHVTCIEDAAIF
        TVCCRFHL+SEAQEGGE+LMSEIHINAEKGGEFFIDTDKQLRAA SPIPSPQSSGDFAALGQVVGGFVMSCRVQGEG+SSKP VQL MRHVTCIEDAAIF
Subjt:  TVCCRFHLISEAQEGGEILMSEIHINAEKGGEFFIDTDKQLRAA-SPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLGMRHVTCIEDAAIF

Query:  MALAAAVDLSIEACRPFQRKIRRLSRHS
        MALAAAVDLSIEACRPF+RKIRR  RHS
Subjt:  MALAAAVDLSIEACRPFQRKIRRLSRHS

A0A6J1DQ78 uncharacterized protein LOC1110228015.7e-23491.82Show/hide
Query:  MDPQAFIRLSIGSLGLRIPVTVLNSTKPGVNAFPSPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSHSIASSFYLEESDLKALLTPGCFYNTHACLEISVF
        M+PQAFIRLSIGSLGLR+P   LNSTKPG+N+F SPCSCEIRLRGFPVQ+SSIP+LPSPEAIP+SHSIASSFYLEESDLKALL PGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRIPVTVLNSTKPGVNAFPSPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSHSIASSFYLEESDLKALLTPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGVKRQLIGTFKLDVGPEWEFGKSVILFNGWIGIGKSKNDNGRHGVELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQSIFSCKFS
        SGRKGSHCGVG+KRQLIGTFKLDVGPEW  GK VILFNGWIGIGKSKN+NGRHG ELHL+VKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQ IFSCKFS
Subjt:  SGRKGSHCGVGVKRQLIGTFKLDVGPEWEFGKSVILFNGWIGIGKSKNDNGRHGVELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQSIFSCKFS

Query:  RDRMSQADTLNNYWSSSGDGLDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVAKSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
        RDR+SQAD+L+NYWS S DGLDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVA+S+PGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
Subjt:  RDRMSQADTLNNYWSSSGDGLDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVAKSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD

Query:  TVCCRFHLISEAQEGGEILMSEIHINAEKGGEFFIDTDKQLR-AASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLGMRHVTCIEDAAIF
        TVCCRFHL+SEAQEGGE+LMSEIHINAEKGGEFFIDTDKQLR AASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQL MRHVTC+EDAAIF
Subjt:  TVCCRFHLISEAQEGGEILMSEIHINAEKGGEFFIDTDKQLR-AASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLGMRHVTCIEDAAIF

Query:  MALAAAVDLSIEACRPFQRKIRRLSRHS
        MALAAAVDLSIEACRPF+RKIRR  RHS
Subjt:  MALAAAVDLSIEACRPFQRKIRRLSRHS

A0A6J1HM61 uncharacterized protein LOC1114642381.6e-23191.12Show/hide
Query:  MDPQAFIRLSIGSLGLRIPVTVLNSTKPGVNAFPSPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSHSIASSFYLEESDLKALLTPGCFYNTHACLEISVF
        MDPQAFIRLSIGSLGLRIP T +NSTKPGV+A  SPCSCEIRLRGFPVQTSSIP++ SPEAIPDSHSIASSFYLEESDLKALL PGCFYNTHACLEISVF
Subjt:  MDPQAFIRLSIGSLGLRIPVTVLNSTKPGVNAFPSPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSHSIASSFYLEESDLKALLTPGCFYNTHACLEISVF

Query:  SGRKGSHCGVGVKRQLIGTFKLDVGPEWEFGKSVILFNGWIGIGKSKNDNGRHGVELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQSIFSCKFS
        SGRKGSHCGVG+KRQLIGTFKLDVGPEW  GK V+LFNGWIGIGKSKN+NGRHG ELHLRVKLDPDPRYVFQFEDVTRLSPQIVQL+GSIKQ IFSCKFS
Subjt:  SGRKGSHCGVGVKRQLIGTFKLDVGPEWEFGKSVILFNGWIGIGKSKNDNGRHGVELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQSIFSCKFS

Query:  RDRMSQADTLNNYWSSSGDGLDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVAKSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD
        RDR+SQAD+L+NYWS SGDG D+EAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVA+SNPGSWLIVRPDV IP+SWQPWGKLEAWRERGIRD
Subjt:  RDRMSQADTLNNYWSSSGDGLDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVAKSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRD

Query:  TVCCRFHLISEAQEGGEILMSEIHINAEKGGEFFIDTDKQLRAA-SPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLGMRHVTCIEDAAIF
        TVCCRFHL+SE QEGGE+LMSEIHINAEKGGEFFIDTDKQLRAA SPIPSPQSSGDFAALGQV+GGFVMSCRVQGEGKSSKPMVQL MRHVTCIEDAAIF
Subjt:  TVCCRFHLISEAQEGGEILMSEIHINAEKGGEFFIDTDKQLRAA-SPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLGMRHVTCIEDAAIF

Query:  MALAAAVDLSIEACRPFQRKIRRLSRHS
        MA+AAAVDLSIEACRPF+RKIRR  RHS
Subjt:  MALAAAVDLSIEACRPFQRKIRRLSRHS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G10020.1 Protein of unknown function (DUF1005)3.1e-9140.56Show/hide
Query:  MDPQAFIRLSIGSLGLRIPVTVLNSTKPGVNAFPSPCSCEIRLRGFPVQTSSIPLLP-SPEAIPDSHSIASSFYLEESDLKALLTPGCFYNTHACLEISV
        MDP  FIRL+IG+L L++P+    +T   V+   SPC C+I+L+ FP QT++IP +P      P+  ++A++F+L  SD++ L +   F  +  CL+I +
Subjt:  MDPQAFIRLSIGSLGLRIPVTVLNSTKPGVNAFPSPCSCEIRLRGFPVQTSSIPLLP-SPEAIPDSHSIASSFYLEESDLKALLTPGCFYNTHACLEISV

Query:  FSGRKGSHCGVGVKRQLIGTFKLDVGPEWEFGKSVILFNGWIGIGKSKNDNGRHGVELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQSIFSCKF
        ++GR G+ CGV   R L+    + +       K  +  NGWI +GK    +     + HL VK +PDPR+VFQF+     SPQ+VQ+QG+I+Q +F+CKF
Subjt:  FSGRKGSHCGVGVKRQLIGTFKLDVGPEWEFGKSVILFNGWIGIGKSKNDNGRHGVELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQSIFSCKF

Query:  S---------RDRMSQADT-LNNYWSSSGDGLDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVAKSNPGSWLIVRPDVCIPESWQPWGK
        S         R R    +T ++  W +S  G + E   +ERKGW + +HDLSGS VA A I TPFV S G D V++SNPGSWLI+RP  C   +W+PWG+
Subjt:  S---------RDRMSQADT-LNNYWSSSGDGLDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVAKSNPGSWLIVRPDVCIPESWQPWGK

Query:  LEAWRER-GIRDTVCCRFHLISEAQEGGEILMSEIHINAEKGGEFFIDTDKQLRA-----------------------ASPIPSPQ-SSGDFAA---LGQ
        LEAWRER G  D +  RF LI +   G  I+++E  I++ +GG+F I+      +                       ASP  SP+  SGD+        
Subjt:  LEAWRER-GIRDTVCCRFHLISEAQEGGEILMSEIHINAEKGGEFFIDTDKQLRA-----------------------ASPIPSPQ-SSGDFAA---LGQ

Query:  VVGGFVMSCRVQGEGKSSKPMVQLGMRHVTCIEDAAIFMALAAAVDLSIEACRPFQRKIRR
        V  GFVMS  V+GEGK SKP V++ ++HV+C+EDAA ++AL+AA+DLS++ACR F +++R+
Subjt:  VVGGFVMSCRVQGEGKSSKPMVQLGMRHVTCIEDAAIFMALAAAVDLSIEACRPFQRKIRR

AT1G50040.1 Protein of unknown function (DUF1005)2.6e-7436.58Show/hide
Query:  MDPQAFIRLSIGSLGLRIPVTVLNSTKPGVNAFPS-------PCSCEIRLRGFPVQTSSIPLLPSPEAIPDSH-------SIASSFYLEESDLKALLTPG
        MDP +F+R+ +G+L +R P +  +S+    ++ PS        C C+I+ + FP Q  S+P+L   E+  +S        ++A+ F L +S ++  L   
Subjt:  MDPQAFIRLSIGSLGLRIPVTVLNSTKPGVNAFPS-------PCSCEIRLRGFPVQTSSIPLLPSPEAIPDSH-------SIASSFYLEESDLKALLTPG

Query:  CFYNTHACLEISVFSGRKGSHCG--VGVKRQLIGTFKLDVGPEWEFGKSVILFNGWIGIGKSKNDNGRHG--VELHLRVKLDPDPRYVFQFEDVTRLSPQ
         +    + L + V+S R+ + CG       +LIG F++ +  +    K+ +  NGW+ +G    +N + G   ELH+ V+++PD R+VFQF+     SPQ
Subjt:  CFYNTHACLEISVFSGRKGSHCG--VGVKRQLIGTFKLDVGPEWEFGKSVILFNGWIGIGKSKNDNGRHG--VELHLRVKLDPDPRYVFQFEDVTRLSPQ

Query:  IVQLQGSIKQSIFSCKFSRDRMSQADTLNNYWSSSGDGLDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVAKSNPGSWLIVRPDVCIPE
        + Q+QG+ KQ++F+CKF   R S    L+   SS   G   E   +ERKGW + IHDLSGS VA A + TPFVPS G + V++S+PG+WLI+RPD     
Subjt:  IVQLQGSIKQSIFSCKFSRDRMSQADTLNNYWSSSGDGLDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVAKSNPGSWLIVRPDVCIPE

Query:  SWQPWGKLEAWRERGIRDTVCCRFHLISEAQEGGEILMS-EIHINAEKGGEFFIDTDKQLRAASPIPSPQSSGDFAALGQVVG-----------------
        +W+PW +L+AWRE G+ D +  RF L    ++G  + +S    I+ + GG F ID        +   S + S D ++   +                   
Subjt:  SWQPWGKLEAWRERGIRDTVCCRFHLISEAQEGGEILMS-EIHINAEKGGEFFIDTDKQLRAASPIPSPQSSGDFAALGQVVG-----------------

Query:  -----GFVMSCRVQGEGKSSKPMVQLGMRHVTCIEDAAIFMALAAAVDLSIEACRPFQRKIR
             GFVMS RVQG  K SKP V++G++HVTC EDAA  +ALAAAVDLS++ACR F +K+R
Subjt:  -----GFVMSCRVQGEGKSSKPMVQLGMRHVTCIEDAAIFMALAAAVDLSIEACRPFQRKIR

AT3G19680.1 Protein of unknown function (DUF1005)3.7e-7635.63Show/hide
Query:  MDPQAFIRLSIGSLGLRIPVTVLNSTK------PGVNAFPSPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSH--------SIASSFYLEESDLKALLTPG
        MDP +F+R+ +G+L +R P +  +S+        G+N     C C+IR + FP +  S+P++   E+  ++         ++A+ F L ++ ++A L   
Subjt:  MDPQAFIRLSIGSLGLRIPVTVLNSTK------PGVNAFPSPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSH--------SIASSFYLEESDLKALLTPG

Query:  CFYNTHACLEISVFS--------GRKGSHCGVGVK-RQLIGTFKLDVGPEWEFGKSVILFNGWIGI--GKSKNDNGRHGVELHLRVKLDPDPRYVFQFED
         F    + L +  +S        G  G+ CG+     +L+G F++ +  +    KS +  NGW+ +   K+K+  G    ELH+ V+++PDPR+VFQF+ 
Subjt:  CFYNTHACLEISVFS--------GRKGSHCGVGVK-RQLIGTFKLDVGPEWEFGKSVILFNGWIGI--GKSKNDNGRHGVELHLRVKLDPDPRYVFQFED

Query:  VTRLSPQIVQLQGSIKQSIFSCKF--------------SRDRMSQADTLNNYWSSSGDGLDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCD
            SPQ+ Q+QG+ KQ++F+CKF              S   MS+  +  +  SS     + E   +ERKGW + +HDLSGS VA A + TPFVPS G +
Subjt:  VTRLSPQIVQLQGSIKQSIFSCKF--------------SRDRMSQADTLNNYWSSSGDGLDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCD

Query:  WVAKSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRDTVCCRFHLISEAQEG-GEILMSEIHINAEKGGEFFIDT--DKQLRAASPIPSPQSSGDFAA
         V +S+PG+WLI+RPD C   +W+PWG+LEAWRE G  DT+  RF L    Q+G    + +   I+ + GG F ID        A++P  SPQ S D  +
Subjt:  WVAKSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRDTVCCRFHLISEAQEG-GEILMSEIHINAEKGGEFFIDT--DKQLRAASPIPSPQSSGDFAA

Query:  LGQVVG------------------------------GFVMSCRVQGEGKSSKPMVQLGMRHVTCIEDAAIFMALAAAVDLSIEACRPFQRKIRR
         G   G                              GFVMS  V+G GK SKP V++G+ HVTC EDAA  +ALAAAVDLS++ACR F  K+R+
Subjt:  LGQVVG------------------------------GFVMSCRVQGEGKSSKPMVQLGMRHVTCIEDAAIFMALAAAVDLSIEACRPFQRKIRR

AT4G29310.1 Protein of unknown function (DUF1005)1.3e-8440.92Show/hide
Query:  MDPQAFIRLSIGSLGLRIPVTVLNSTKPG-VNAFPSPCSCEIRLRGFPVQTSSIPLLPSPEAI--PDSHSIASSFYLEESDLKALLTPGCFYNTHACLEI
        MDP  F+RL+I SL LR+P T  N    G V+   +PC C++R++ FP Q + +PL    +A   P+S + A  F+L+   ++ +            L +
Subjt:  MDPQAFIRLSIGSLGLRIPVTVLNSTKPG-VNAFPSPCSCEIRLRGFPVQTSSIPLLPSPEAI--PDSHSIASSFYLEESDLKALLTPGCFYNTHACLEI

Query:  SVFSGRKGSHCGVGVKRQLIGTFKLDVGPEWEFGKSVILFNGWIGIGKSKNDNGRHGVELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQSIFSC
        SV++GR G  CGV    +L+G  ++ V       ++V   NGW  +G    D  +    LHL V  +PDPR+VFQF      SP + Q+Q ++KQ +FSC
Subjt:  SVFSGRKGSHCGVGVKRQLIGTFKLDVGPEWEFGKSVILFNGWIGIGKSKNDNGRHGVELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQSIFSC

Query:  KFSRDRMSQADTL-------NNYW---SSSGDGLDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVAKSNPGSWLIVRPDVCIPESWQPW
        KFS DR  ++ +L       +  W   + SGD  + + + RERKGW + IHDLSGS VAAA + TPFV S G D V++SNPG+WLI+RP      SW+PW
Subjt:  KFSRDRMSQADTL-------NNYW---SSSGDGLDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVAKSNPGSWLIVRPDVCIPESWQPW

Query:  GKLEAWRERGIRDTVCCRFHLISEAQEGGEILMSEIHINAEKGGEFFIDTDKQLRAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLGM
        G+LEAWRERG  D +  +F L+ +      I ++E  ++ ++GG+F ID     R  S        G+  A+   V GFVM   V+GEGK SKP+V +G 
Subjt:  GKLEAWRERGIRDTVCCRFHLISEAQEGGEILMSEIHINAEKGGEFFIDTDKQLRAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLGM

Query:  RHVTCIEDAAIFMALAAAVDLSIEACRPFQRKIRR
        +HVTC+ DAA+F+AL+AAVDLS++AC+ F RK+R+
Subjt:  RHVTCIEDAAIFMALAAAVDLSIEACRPFQRKIRR

AT5G17640.1 Protein of unknown function (DUF1005)3.7e-19376.62Show/hide
Query:  MDPQAFIRLSIGSLGLRIPVTVLNSTKPG--VNAFPSPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSHSIASSFYLEESDLKALLTPGCFYNTHACLEIS
        MDPQAFIRLS+GSL LRIP  ++NST        F S CSCEI+LRGFPVQT+SIPL+PS +A PD HSI++SFYLEESDL+ALLTPGCFY+ HA LEIS
Subjt:  MDPQAFIRLSIGSLGLRIPVTVLNSTKPG--VNAFPSPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSHSIASSFYLEESDLKALLTPGCFYNTHACLEIS

Query:  VFSGRKGSHCGVGVKRQLIGTFKLDVGPEWEFGKSVILFNGWIGIGKSKNDNGRHGVELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQSIFSCK
        VF+G+K  +CGVG KRQ IG FKL+VGPEW  GK +ILFNGWI IGK+K D      ELHL+VKLDPDPRYVFQFEDVT LSPQIVQL+GS+KQ IFSCK
Subjt:  VFSGRKGSHCGVGVKRQLIGTFKLDVGPEWEFGKSVILFNGWIGIGKSKNDNGRHGVELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQSIFSCK

Query:  FSRDRMSQADTLNNYWSSSGDGLDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVAKSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGI
        FSRDR+SQ D LN YWSSSGDG +LE+ERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVAKSNPG+WL+VRPD   P SWQPWGKLEAWRERGI
Subjt:  FSRDRMSQADTLNNYWSSSGDGLDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVAKSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGI

Query:  RDTVCCRFHLISEAQEGGEILMSEIHINAEKGGEFFIDTDKQLR--AASPIPSPQSSGDFAALGQVV--GGFVMSCRVQGEGKSSKPMVQLGMRHVTCIE
        RD+VCCRFHL+S   E G++LMSEI I+AEKGGEF IDTDKQ+   AA+PIPSPQSSGDF+ LGQ V  GGFVMS RVQGEGKSSKP+VQL MRHVTC+E
Subjt:  RDTVCCRFHLISEAQEGGEILMSEIHINAEKGGEFFIDTDKQLR--AASPIPSPQSSGDFAALGQVV--GGFVMSCRVQGEGKSSKPMVQLGMRHVTCIE

Query:  DAAIFMALAAAVDLSIEACRPFQRKIRRLSRH
        DAAIFMALAAAVDLSI AC+PF+R  RR  RH
Subjt:  DAAIFMALAAAVDLSIEACRPFQRKIRRLSRH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATCCTCAGGCTTTTATTAGGTTGTCAATTGGCTCTTTGGGATTGAGAATCCCAGTAACTGTTCTAAATTCTACAAAACCTGGAGTCAATGCTTTCCCTTCTCCATG
TTCGTGTGAAATTCGTCTTCGCGGTTTCCCCGTGCAGACGTCTTCGATCCCGCTACTCCCGTCTCCTGAAGCCATACCGGATTCTCATAGCATCGCCTCAAGCTTCTATC
TCGAAGAGTCTGATTTGAAAGCATTACTGACACCTGGCTGCTTCTACAACACTCACGCCTGTCTTGAAATATCTGTGTTCTCTGGAAGGAAGGGATCCCACTGTGGTGTT
GGCGTGAAAAGGCAGCTGATCGGGACGTTTAAACTAGATGTCGGTCCCGAATGGGAATTCGGGAAGTCGGTCATTCTTTTCAATGGGTGGATAGGCATTGGCAAAAGTAA
GAATGACAATGGAAGACATGGAGTAGAGCTTCATTTGAGAGTGAAACTAGATCCTGATCCAAGATATGTTTTTCAGTTTGAAGATGTCACGAGGTTAAGCCCGCAAATCG
TCCAGCTTCAAGGCTCGATCAAACAGTCCATCTTCAGCTGCAAATTTAGTCGAGACAGGATGTCCCAGGCAGATACATTGAACAACTATTGGTCGAGTTCTGGTGATGGC
TTGGATCTCGAGGCCGAGCGGCGAGAAAGAAAAGGCTGGAAGGTGAAGATACATGATCTTTCTGGCTCGGCTGTTGCAGCAGCCTTCATAACTACTCCATTTGTGCCATC
AACAGGTTGCGATTGGGTTGCCAAGTCGAACCCTGGTTCCTGGCTGATTGTTCGTCCCGATGTTTGCATACCCGAAAGTTGGCAGCCATGGGGAAAGCTCGAGGCATGGC
GCGAGCGTGGGATTAGAGACACCGTCTGTTGTCGTTTTCACCTTATCTCTGAAGCGCAAGAGGGAGGGGAAATCCTCATGTCTGAAATCCATATCAATGCCGAAAAAGGC
GGGGAGTTCTTCATAGATACTGACAAACAGTTGCGAGCAGCAAGTCCGATACCGAGCCCGCAGAGCAGCGGAGACTTTGCAGCATTAGGCCAAGTGGTTGGAGGTTTCGT
AATGAGCTGCAGAGTACAAGGGGAAGGAAAGAGCAGTAAGCCAATGGTTCAACTCGGAATGCGACACGTGACATGTATAGAGGATGCTGCCATTTTCATGGCACTCGCAG
CTGCTGTCGATCTTAGCATCGAGGCGTGTAGGCCGTTTCAAAGGAAGATCAGGAGATTGTCTCGACATTCTTAG
mRNA sequenceShow/hide mRNA sequence
AAAAAAAAAAAAATCCCATAAAATTTCCCCTTCTCTGACAAATTTTCACCTCAATTTCATCACCTTTCTTTATTTTGTTCTTGTTTCTTCTCAAAATCAAAAGGGTCAGT
AGTATTTTTGTATTACTACATTACCCATCTTTTTTTAGATTAAATCCATTCCAAGAACTGTTTGGTTTTGTGAGAAGATTAAACAAAAGGGTATATATATATATTATGAA
CTTCTCTTATCGATTTGAGTGATTTAGAGAAAAAAAAAGGAGTTTCCTTTTCTGGGTTTAGGGTTTATTTTGGGGATTTGTGAATTTCTGTTCGTGACATTCGAGACAGT
TGTTCCTTTGGGGATTGGGTTTTGGGAATTGGGTTTTTTTTAAAAGAGCTCTGTATTGGATCTGTGTTTGTTTGTTTGATGAACCCTTTTTTTAAAAATCATTCATCATT
TCACATTTAGAGAGAAATTTGGAGTTGGGGAAAAGGGGAAGTGGGATTTGATTTTGGTGTTTGTTTTTAGGGGAATGTTGTTGTTCTTCAGATTCTGGTTCTGGGATTTT
GGCTGCTCTACTTTCTATTTTCTATAACAAGGACCCTTGAATTCTTCGAAAAATAAACAATCTACATCAAATGGACAGAAAACTGTGACCAAAATGGATCCTCAGGCTTT
TATTAGGTTGTCAATTGGCTCTTTGGGATTGAGAATCCCAGTAACTGTTCTAAATTCTACAAAACCTGGAGTCAATGCTTTCCCTTCTCCATGTTCGTGTGAAATTCGTC
TTCGCGGTTTCCCCGTGCAGACGTCTTCGATCCCGCTACTCCCGTCTCCTGAAGCCATACCGGATTCTCATAGCATCGCCTCAAGCTTCTATCTCGAAGAGTCTGATTTG
AAAGCATTACTGACACCTGGCTGCTTCTACAACACTCACGCCTGTCTTGAAATATCTGTGTTCTCTGGAAGGAAGGGATCCCACTGTGGTGTTGGCGTGAAAAGGCAGCT
GATCGGGACGTTTAAACTAGATGTCGGTCCCGAATGGGAATTCGGGAAGTCGGTCATTCTTTTCAATGGGTGGATAGGCATTGGCAAAAGTAAGAATGACAATGGAAGAC
ATGGAGTAGAGCTTCATTTGAGAGTGAAACTAGATCCTGATCCAAGATATGTTTTTCAGTTTGAAGATGTCACGAGGTTAAGCCCGCAAATCGTCCAGCTTCAAGGCTCG
ATCAAACAGTCCATCTTCAGCTGCAAATTTAGTCGAGACAGGATGTCCCAGGCAGATACATTGAACAACTATTGGTCGAGTTCTGGTGATGGCTTGGATCTCGAGGCCGA
GCGGCGAGAAAGAAAAGGCTGGAAGGTGAAGATACATGATCTTTCTGGCTCGGCTGTTGCAGCAGCCTTCATAACTACTCCATTTGTGCCATCAACAGGTTGCGATTGGG
TTGCCAAGTCGAACCCTGGTTCCTGGCTGATTGTTCGTCCCGATGTTTGCATACCCGAAAGTTGGCAGCCATGGGGAAAGCTCGAGGCATGGCGCGAGCGTGGGATTAGA
GACACCGTCTGTTGTCGTTTTCACCTTATCTCTGAAGCGCAAGAGGGAGGGGAAATCCTCATGTCTGAAATCCATATCAATGCCGAAAAAGGCGGGGAGTTCTTCATAGA
TACTGACAAACAGTTGCGAGCAGCAAGTCCGATACCGAGCCCGCAGAGCAGCGGAGACTTTGCAGCATTAGGCCAAGTGGTTGGAGGTTTCGTAATGAGCTGCAGAGTAC
AAGGGGAAGGAAAGAGCAGTAAGCCAATGGTTCAACTCGGAATGCGACACGTGACATGTATAGAGGATGCTGCCATTTTCATGGCACTCGCAGCTGCTGTCGATCTTAGC
ATCGAGGCGTGTAGGCCGTTTCAAAGGAAGATCAGGAGATTGTCTCGACATTCTTAGGAAGAAAAGGGGAAAAGGAAAAGAAAAATGGTATTTATAAGGTAAAATTGATG
TGTAGAAACATAAACAATGATGGGTAAACATCTGTATGAAACTAACTCCTATAATATTTTGGCATTTTGTACGGTATAAAGGTGATTGATATCCCTCCATGCTATTTTTG
GTTAGTAACAACTATTGGGGTGGGAGATTC
Protein sequenceShow/hide protein sequence
MDPQAFIRLSIGSLGLRIPVTVLNSTKPGVNAFPSPCSCEIRLRGFPVQTSSIPLLPSPEAIPDSHSIASSFYLEESDLKALLTPGCFYNTHACLEISVFSGRKGSHCGV
GVKRQLIGTFKLDVGPEWEFGKSVILFNGWIGIGKSKNDNGRHGVELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQSIFSCKFSRDRMSQADTLNNYWSSSGDG
LDLEAERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVAKSNPGSWLIVRPDVCIPESWQPWGKLEAWRERGIRDTVCCRFHLISEAQEGGEILMSEIHINAEKG
GEFFIDTDKQLRAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLGMRHVTCIEDAAIFMALAAAVDLSIEACRPFQRKIRRLSRHS