; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc05g05200 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc05g05200
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Descriptionarmadillo repeat-containing protein LFR-like
Genome locationchr5:3611862..3614148
RNA-Seq ExpressionMoc05g05200
SyntenyMoc05g05200
Gene Ontology termsGO:0006338 - chromatin remodeling (biological process)
GO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0045893 - positive regulation of transcription, DNA-templated (biological process)
GO:0048366 - leaf development (biological process)
GO:0048653 - anther development (biological process)
GO:0005654 - nucleoplasm (cellular component)
GO:0016514 - SWI/SNF complex (cellular component)
GO:0035060 - brahma complex (cellular component)
GO:0005515 - protein binding (molecular function)
GO:0031491 - nucleosome binding (molecular function)
InterPro domainsIPR000225 - Armadillo
IPR011989 - Armadillo-like helical
IPR016024 - Armadillo-type fold
IPR021906 - SWI/SNF-like complex subunit BAF250/Osa


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6582343.1 Armadillo repeat-containing protein LFR, partial [Cucurbita argyrosperma subsp. sororia]4.6e-23692.92Show/hide
Query:  MQKREQSKLGGNVGGGSAPPAKRGRPFGSSSSN-AAAAAAAETLAPSAHLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRK
        MQKREQ+KLGGNVG  SAPPAKRGRPFGS +SN AAAAAAAETLAPSA LGPSLH+HTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMR+
Subjt:  MQKREQSKLGGNVGGGSAPPAKRGRPFGSSSSN-AAAAAAAETLAPSAHLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRK

Query:  DSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVPETMSHAPKPSPRHWWLDEDGLFNLDDEGRAER
        DSTPLAKIPGLLDALLQVIDDWRD+ALP+D +KK RVRTLGANS +TGFGNEFEALGSNGLRPGSS  E   H+PKPSPRHWWLDEDGLFN DDEGRAER
Subjt:  DSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVPETMSHAPKPSPRHWWLDEDGLFNLDDEGRAER

Query:  QQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSVKVWHC
        QQCAVSASNI+RNFSFMPENE IMAQHRHTLETVFQC+EDHITEDEELVTN LETIVNLAPLLDLRIFSSSKPSYIKITEKRA EAIMGMLGS+VKVWHC
Subjt:  QQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSVKVWHC

Query:  AAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE
        AAAELLGRLIINPDNEPFLLPF PQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE
Subjt:  AAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE

Query:  PQNRALLLVYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
        PQNR LLL YENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
Subjt:  PQNRALLLVYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM

XP_022138003.1 armadillo repeat-containing protein LFR isoform X1 [Momordica charantia]6.3e-254100Show/hide
Query:  MQKREQSKLGGNVGGGSAPPAKRGRPFGSSSSNAAAAAAAETLAPSAHLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKD
        MQKREQSKLGGNVGGGSAPPAKRGRPFGSSSSNAAAAAAAETLAPSAHLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKD
Subjt:  MQKREQSKLGGNVGGGSAPPAKRGRPFGSSSSNAAAAAAAETLAPSAHLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKD

Query:  STPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVPETMSHAPKPSPRHWWLDEDGLFNLDDEGRAERQ
        STPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVPETMSHAPKPSPRHWWLDEDGLFNLDDEGRAERQ
Subjt:  STPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVPETMSHAPKPSPRHWWLDEDGLFNLDDEGRAERQ

Query:  QCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSVKVWHCA
        QCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSVKVWHCA
Subjt:  QCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSVKVWHCA

Query:  AAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSEP
        AAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSEP
Subjt:  AAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSEP

Query:  QNRALLLVYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
        QNRALLLVYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
Subjt:  QNRALLLVYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM

XP_022924723.1 armadillo repeat-containing protein LFR-like [Cucurbita moschata]2.0e-23693.14Show/hide
Query:  MQKREQSKLGGNVGGGSAPPAKRGRPFGSSSSN-AAAAAAAETLAPSAHLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRK
        MQKREQ+KLGGNVG  SAPPAKRGRPFGS +SN AAAAAAAETLAPSA LGPSLH+HTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMR+
Subjt:  MQKREQSKLGGNVGGGSAPPAKRGRPFGSSSSN-AAAAAAAETLAPSAHLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRK

Query:  DSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVPETMSHAPKPSPRHWWLDEDGLFNLDDEGRAER
        DSTPLAKIPGLLDALLQVIDDWRD+ALP+D VKK RVRTLGANS +TGFGNEFEALGSNGLRPGSS  E   H+PKPSPRHWWLDEDGLFN DDEGRAER
Subjt:  DSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVPETMSHAPKPSPRHWWLDEDGLFNLDDEGRAER

Query:  QQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSVKVWHC
        QQCAVSASNI+RNFSFMPENE IMAQHRHTLETVFQC+EDHITEDEELVTN LETIVNLAPLLDLRIFSSSKPSYIKITEKRA EAIMGMLGS+VKVWHC
Subjt:  QQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSVKVWHC

Query:  AAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE
        AAAELLGRLIINPDNEPFLLPF PQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE
Subjt:  AAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE

Query:  PQNRALLLVYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
        PQNR LLL YENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
Subjt:  PQNRALLLVYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM

XP_022979388.1 armadillo repeat-containing protein LFR-like [Cucurbita maxima]1.4e-23793.81Show/hide
Query:  MQKREQSKLGGNVGGGSAPPAKRGRPFGSSSSNAAAAAAA-ETLAPSAHLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRK
        MQKREQ+KLGGNVG  SAPPAKRGRPFGS +SNAAAAAAA ETLAPSA LGPSLH+HTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMR+
Subjt:  MQKREQSKLGGNVGGGSAPPAKRGRPFGSSSSNAAAAAAA-ETLAPSAHLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRK

Query:  DSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVPETMSHAPKPSPRHWWLDEDGLFNLDDEGRAER
        DSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKK RVRTLGANS +TGFGNEFEALGSNGLRPGSS  +   HAPKPSPRHWWLDEDGLFN DDEGRAER
Subjt:  DSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVPETMSHAPKPSPRHWWLDEDGLFNLDDEGRAER

Query:  QQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSVKVWHC
        QQCAVSASNI+RNFSFMPENE IMAQHRHTLETVFQC+EDHITEDEELVTN LETIVNLAPLLDLRIFSSSKPSYIKITEKRA EAIMGMLGS+VKVWHC
Subjt:  QQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSVKVWHC

Query:  AAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE
        AAAELLGRLIINPDNEPFLLPF PQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE
Subjt:  AAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE

Query:  PQNRALLLVYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
        PQNR LLL YENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
Subjt:  PQNRALLLVYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM

XP_023527171.1 armadillo repeat-containing protein LFR-like [Cucurbita pepo subsp. pepo]7.0e-23793.58Show/hide
Query:  MQKREQSKLGGNVGGGSAPPAKRGRPFGSSSSN-AAAAAAAETLAPSAHLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRK
        MQKREQ+KLGGNVG  SAPPAKRGRPFGS +SN AAAAAAAETLAPSA LGPSLH+HTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMR+
Subjt:  MQKREQSKLGGNVGGGSAPPAKRGRPFGSSSSN-AAAAAAAETLAPSAHLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRK

Query:  DSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVPETMSHAPKPSPRHWWLDEDGLFNLDDEGRAER
        DSTPLAKIPGLLDALLQVIDDWRDIALP+D VKK RVRTLGANS +TGFGNEFEALGSNGLRPGSS  E   HAPKPSPRHWWLDEDGLFN DDEGRAER
Subjt:  DSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVPETMSHAPKPSPRHWWLDEDGLFNLDDEGRAER

Query:  QQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSVKVWHC
        QQCAVSASNI+RNFSFMPENE IMAQHRHTLETVFQC+EDHITEDEELVTN LETIVNLAPLLDLRIFSSSKPSYIKITEKRA EAIMGMLGS+VKVWHC
Subjt:  QQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSVKVWHC

Query:  AAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE
        AAAELLGRLIINPDNEPFLLPF PQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE
Subjt:  AAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE

Query:  PQNRALLLVYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
        PQNR LLL YENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
Subjt:  PQNRALLLVYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM

TrEMBL top hitse value%identityAlignment
A0A1S3AWW3 armadillo repeat-containing protein LFR1.3e-23392.12Show/hide
Query:  MQKREQSKLGGNV-GGGSAPPAKRGRPFGSSSSNAAAAAAA-----ETLAPSAHLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEK
        MQKR+Q+KLGGNV GG SAPPAKRGRPFGS +SNAAA AAA     ETLAPS  LGPSLH+HTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEK
Subjt:  MQKREQSKLGGNV-GGGSAPPAKRGRPFGSSSSNAAAAAAA-----ETLAPSAHLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEK

Query:  DDMRKDSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVPETMSHAPKPSPRHWWLDEDGLFNLDDE
        DDMR+DSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKK RVRTLGANSSVTGFGNEFEALGS+GLRPGSS  E+  HA KPS RHWWL+EDGLFNLDDE
Subjt:  DDMRKDSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVPETMSHAPKPSPRHWWLDEDGLFNLDDE

Query:  GRAERQQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSV
        GRAERQQCAVSASNI+RNFSFMPENE IMA HRHTLETVFQCIEDH+TEDEELVTN LETIVNLAPLLDLRIFSSSKPSYIKITEKRA EAIMGMLGS+V
Subjt:  GRAERQQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSV

Query:  KVWHCAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILE
        KVWHCAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIK PHPVPEICRKAAMILE
Subjt:  KVWHCAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILE

Query:  SLVSEPQNRALLLVYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
        SLVSEPQNR LLL YENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
Subjt:  SLVSEPQNRALLLVYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM

A0A5D3D0A8 Armadillo repeat-containing protein LFR1.3e-23392.12Show/hide
Query:  MQKREQSKLGGNV-GGGSAPPAKRGRPFGSSSSNAAAAAAA-----ETLAPSAHLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEK
        MQKR+Q+KLGGNV GG SAPPAKRGRPFGS +SNAAA AAA     ETLAPS  LGPSLH+HTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEK
Subjt:  MQKREQSKLGGNV-GGGSAPPAKRGRPFGSSSSNAAAAAAA-----ETLAPSAHLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEK

Query:  DDMRKDSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVPETMSHAPKPSPRHWWLDEDGLFNLDDE
        DDMR+DSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKK RVRTLGANSSVTGFGNEFEALGS+GLRPGSS  E+  HA KPS RHWWL+EDGLFNLDDE
Subjt:  DDMRKDSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVPETMSHAPKPSPRHWWLDEDGLFNLDDE

Query:  GRAERQQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSV
        GRAERQQCAVSASNI+RNFSFMPENE IMA HRHTLETVFQCIEDH+TEDEELVTN LETIVNLAPLLDLRIFSSSKPSYIKITEKRA EAIMGMLGS+V
Subjt:  GRAERQQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSV

Query:  KVWHCAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILE
        KVWHCAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIK PHPVPEICRKAAMILE
Subjt:  KVWHCAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILE

Query:  SLVSEPQNRALLLVYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
        SLVSEPQNR LLL YENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
Subjt:  SLVSEPQNRALLLVYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM

A0A6J1CBV6 armadillo repeat-containing protein LFR isoform X13.1e-254100Show/hide
Query:  MQKREQSKLGGNVGGGSAPPAKRGRPFGSSSSNAAAAAAAETLAPSAHLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKD
        MQKREQSKLGGNVGGGSAPPAKRGRPFGSSSSNAAAAAAAETLAPSAHLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKD
Subjt:  MQKREQSKLGGNVGGGSAPPAKRGRPFGSSSSNAAAAAAAETLAPSAHLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKD

Query:  STPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVPETMSHAPKPSPRHWWLDEDGLFNLDDEGRAERQ
        STPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVPETMSHAPKPSPRHWWLDEDGLFNLDDEGRAERQ
Subjt:  STPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVPETMSHAPKPSPRHWWLDEDGLFNLDDEGRAERQ

Query:  QCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSVKVWHCA
        QCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSVKVWHCA
Subjt:  QCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSVKVWHCA

Query:  AAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSEP
        AAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSEP
Subjt:  AAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSEP

Query:  QNRALLLVYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
        QNRALLLVYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
Subjt:  QNRALLLVYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM

A0A6J1EDA6 armadillo repeat-containing protein LFR-like9.9e-23793.14Show/hide
Query:  MQKREQSKLGGNVGGGSAPPAKRGRPFGSSSSN-AAAAAAAETLAPSAHLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRK
        MQKREQ+KLGGNVG  SAPPAKRGRPFGS +SN AAAAAAAETLAPSA LGPSLH+HTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMR+
Subjt:  MQKREQSKLGGNVGGGSAPPAKRGRPFGSSSSN-AAAAAAAETLAPSAHLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRK

Query:  DSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVPETMSHAPKPSPRHWWLDEDGLFNLDDEGRAER
        DSTPLAKIPGLLDALLQVIDDWRD+ALP+D VKK RVRTLGANS +TGFGNEFEALGSNGLRPGSS  E   H+PKPSPRHWWLDEDGLFN DDEGRAER
Subjt:  DSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVPETMSHAPKPSPRHWWLDEDGLFNLDDEGRAER

Query:  QQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSVKVWHC
        QQCAVSASNI+RNFSFMPENE IMAQHRHTLETVFQC+EDHITEDEELVTN LETIVNLAPLLDLRIFSSSKPSYIKITEKRA EAIMGMLGS+VKVWHC
Subjt:  QQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSVKVWHC

Query:  AAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE
        AAAELLGRLIINPDNEPFLLPF PQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE
Subjt:  AAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE

Query:  PQNRALLLVYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
        PQNR LLL YENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
Subjt:  PQNRALLLVYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM

A0A6J1IW20 armadillo repeat-containing protein LFR-like6.9e-23893.81Show/hide
Query:  MQKREQSKLGGNVGGGSAPPAKRGRPFGSSSSNAAAAAAA-ETLAPSAHLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRK
        MQKREQ+KLGGNVG  SAPPAKRGRPFGS +SNAAAAAAA ETLAPSA LGPSLH+HTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMR+
Subjt:  MQKREQSKLGGNVGGGSAPPAKRGRPFGSSSSNAAAAAAA-ETLAPSAHLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRK

Query:  DSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVPETMSHAPKPSPRHWWLDEDGLFNLDDEGRAER
        DSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKK RVRTLGANS +TGFGNEFEALGSNGLRPGSS  +   HAPKPSPRHWWLDEDGLFN DDEGRAER
Subjt:  DSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVPETMSHAPKPSPRHWWLDEDGLFNLDDEGRAER

Query:  QQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSVKVWHC
        QQCAVSASNI+RNFSFMPENE IMAQHRHTLETVFQC+EDHITEDEELVTN LETIVNLAPLLDLRIFSSSKPSYIKITEKRA EAIMGMLGS+VKVWHC
Subjt:  QQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSVKVWHC

Query:  AAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE
        AAAELLGRLIINPDNEPFLLPF PQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE
Subjt:  AAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE

Query:  PQNRALLLVYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
        PQNR LLL YENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
Subjt:  PQNRALLLVYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM

SwissProt top hitse value%identityAlignment
E9Q7E2 AT-rich interactive domain-containing protein 21.1e-0624.36Show/hide
Query:  SAHLGPSLHVHTSFADQNN-KRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDSTPLAKIPGLLDALLQVIDD------------WRD------IAL
        S +L  S  +   F   N+  ++VL+L SGL +E+ +A+N  TLLS + K  M+ +  P  KI  LL A   V DD            WR+      +  
Subjt:  SAHLGPSLHVHTSFADQNN-KRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDSTPLAKIPGLLDALLQVIDD------------WRD------IAL

Query:  PRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVPETMSHAPKPSPRHWWLDEDGLFNLDDEGRAERQQCAVSASNIIRNFSFMPENEVIMAQH
         +D+V    VR L ++ +            ++   PG  + E++ H P+            L   D EG     Q  +  + I+RN SF   N  ++A +
Subjt:  PRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVPETMSHAPKPSPRHWWLDEDGLFNLDDEGRAERQQCAVSASNIIRNFSFMPENEVIMAQH

Query:  RHTLETVFQCIEDHITEDEELVTNTLETIVNLAP--LLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSVKVWHCAAAELLGRLIINPDNEPFLLPFVPQ
        R  L  +      H     +L    L+T+ N+A   LLD   F ++   +  +T+          L S  +       E+LG L    DN   +  +V Q
Subjt:  RHTLETVFQCIEDHITEDEELVTNTLETIVNLAP--LLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSVKVWHCAAAELLGRLIINPDNEPFLLPFVPQ

Query:  -IHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVI
          ++ ++  +++P +    + +  LY L E+      K+A    +ID L+ ++
Subjt:  -IHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVI

Q68CP9 AT-rich interactive domain-containing protein 21.9e-0624.08Show/hide
Query:  SAHLGPSLHVHTSFADQNN-KRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDSTPLAKIPGLLDALLQVIDD------------WRD------IAL
        S +L  S  +   F   N+  ++VL+L SGL +E+ +A+N  TLLS + K  M+ +  P  KI  LL A   V DD            W++      +  
Subjt:  SAHLGPSLHVHTSFADQNN-KRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDSTPLAKIPGLLDALLQVIDD------------WRD------IAL

Query:  PRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVPETMSHAPKPSPRHWWLDEDGLFNLDDEGRAERQQCAVSASNIIRNFSFMPENEVIMAQH
         +D+V    VR L ++ + +  G             G  + E++ H P+            L   D EG     Q  +  + I+RN SF   N  ++A +
Subjt:  PRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVPETMSHAPKPSPRHWWLDEDGLFNLDDEGRAERQQCAVSASNIIRNFSFMPENEVIMAQH

Query:  RHTLETVFQCIEDHITEDEELVTNTLETIVNLAP--LLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSVKVWHCAAAELLGRLIINPDNEPFLLPFVPQ
        R  L  +      H     +L    L+T+ N+A   LLD   F ++   +  +T+          L S  +       E+LG L    DN   +  +V Q
Subjt:  RHTLETVFQCIEDHITEDEELVTNTLETIVNLAP--LLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSVKVWHCAAAELLGRLIINPDNEPFLLPFVPQ

Query:  -IHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVI
          ++ ++  +++P +    + +  LY L E+      K+A    +ID L+ ++
Subjt:  -IHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVI

Q6YTW6 Armadillo repeat-containing protein LFR7.4e-18171.43Show/hide
Query:  GGNVGGGSAPPAKRGRPFGSSSSNAAAAAAA-----ETLAPSAHLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDSTPL
        G + GGG + PAKRGRPFGS++ + AAAAAA     +  AP+A +GPSL V T+ +DQNNKRIVLALQSGLKSE+ WALN LT+LSFKEKDD+R+D+TPL
Subjt:  GGNVGGGSAPPAKRGRPFGSSSSNAAAAAAA-----ETLAPSAHLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDSTPL

Query:  AKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNE-FEALGSNGLRPGSSVPETM-SHAPKPSPRHWWLDEDGLFNLDDEGRAERQQC
        AK+PGLLDALLQVIDDWRDIA+P+D  K PRVRTLG N++++GFG+E  E + S+   P     +T  S   K     +  DE+GLFN+DDEGR E+QQC
Subjt:  AKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNE-FEALGSNGLRPGSSVPETM-SHAPKPSPRHWWLDEDGLFNLDDEGRAERQQC

Query:  AVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSVKVWHCAAA
        AV+ASNIIRNFSFMPENE +M QHRH LETVFQC+ED  TED+EL+TN LET+VNLAP+LDLRIFSSSKPS+IKITEKRA +AIMGML SS++VWHCAAA
Subjt:  AVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSVKVWHCAAA

Query:  ELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSEPQN
        EL+GRLIINPDNEPFLLP +PQI+KRLVDL+S+PA+DAQAAA+ ALYN+ EVNMD R+KLASERWA+DRLLKV+KTPHPVPE+CRKA+MI+ESLVSEPQN
Subjt:  ELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSEPQN

Query:  RALLLVYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWG
        R  LLV+EN FAEIL S+G+YSDTFARILYELT+RP+NKV A Q +WG
Subjt:  RALLLVYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWG

Q9LS90 Armadillo repeat-containing protein LFR2.6e-19474.35Show/hide
Query:  MQKREQSKLGGNVGGGSAPPAKRGRPFGSSSSN----AAAAAAAETLAPSAHLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDD
        MQKRE  K GGN GG S PPAKRGRPFGS+S+N    AAAAAAA+ ++PSA LGPSL VH SF +QNN+RIVLALQSGLKSE+TWALNTLTLLSFKEK+D
Subjt:  MQKREQSKLGGNVGGGSAPPAKRGRPFGSSSSN----AAAAAAAETLAPSAHLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDD

Query:  MRKDSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGS---NGLRPGSSVPETM--SHAPKPSPRHWWLDEDGLFNL
        +R+D  PLAKI GLLDALL +IDDWRDIALP+DL +  RVRTLG N+SVTGFGNE++AL S    G   GSS  E +      K     WW++EDGLFNL
Subjt:  MRKDSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGS---NGLRPGSSVPETM--SHAPKPSPRHWWLDEDGLFNL

Query:  DDEGRAERQQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLG
        DDEGR+E+Q CA++ASN+IRNFSFMP+NEV+MAQHRH LETVFQCI DH+TEDEELVTN+LETIVNLA L+DLRIFSS K SYI I EK+A +A++G+L 
Subjt:  DDEGRAERQQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLG

Query:  SSVKVWHCAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAM
        SSVK W+CAAAELLGRLIINPDNEPF+ P +PQIHKRL+DL+SI A+DAQAAAVGALYNLVEVNMDCR+KLASERWA+DRLLKVIKTPHPVPE+CRKAAM
Subjt:  SSVKVWHCAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAM

Query:  ILESLVSEPQNRALLLVYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
        ILE+LVSEPQNR LLL YENAFAE+LF +G+YSD+FARILYELT+R N++VA+A+G+WGM
Subjt:  ILESLVSEPQNRALLLVYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM

Arabidopsis top hitse value%identityAlignment
AT3G22990.1 ARM repeat superfamily protein1.9e-19574.35Show/hide
Query:  MQKREQSKLGGNVGGGSAPPAKRGRPFGSSSSN----AAAAAAAETLAPSAHLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDD
        MQKRE  K GGN GG S PPAKRGRPFGS+S+N    AAAAAAA+ ++PSA LGPSL VH SF +QNN+RIVLALQSGLKSE+TWALNTLTLLSFKEK+D
Subjt:  MQKREQSKLGGNVGGGSAPPAKRGRPFGSSSSN----AAAAAAAETLAPSAHLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDD

Query:  MRKDSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGS---NGLRPGSSVPETM--SHAPKPSPRHWWLDEDGLFNL
        +R+D  PLAKI GLLDALL +IDDWRDIALP+DL +  RVRTLG N+SVTGFGNE++AL S    G   GSS  E +      K     WW++EDGLFNL
Subjt:  MRKDSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGS---NGLRPGSSVPETM--SHAPKPSPRHWWLDEDGLFNL

Query:  DDEGRAERQQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLG
        DDEGR+E+Q CA++ASN+IRNFSFMP+NEV+MAQHRH LETVFQCI DH+TEDEELVTN+LETIVNLA L+DLRIFSS K SYI I EK+A +A++G+L 
Subjt:  DDEGRAERQQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLG

Query:  SSVKVWHCAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAM
        SSVK W+CAAAELLGRLIINPDNEPF+ P +PQIHKRL+DL+SI A+DAQAAAVGALYNLVEVNMDCR+KLASERWA+DRLLKVIKTPHPVPE+CRKAAM
Subjt:  SSVKVWHCAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAM

Query:  ILESLVSEPQNRALLLVYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
        ILE+LVSEPQNR LLL YENAFAE+LF +G+YSD+FARILYELT+R N++VA+A+G+WGM
Subjt:  ILESLVSEPQNRALLLVYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAGAAGAGAGAGCAGAGCAAGTTGGGCGGAAATGTTGGTGGTGGCTCCGCGCCTCCGGCGAAGCGCGGCCGTCCATTCGGCAGTTCGAGCAGCAACGCCGCCGCTGC
AGCCGCTGCCGAGACCTTGGCTCCATCGGCACACCTTGGCCCTTCTCTCCATGTTCATACTTCCTTCGCAGATCAAAACAATAAAAGGATAGTGTTGGCTCTACAGAGTG
GATTGAAGAGTGAATTGACGTGGGCATTGAACACTCTCACTCTGCTCTCCTTCAAAGAGAAGGATGATATGCGCAAAGACTCTACTCCCCTGGCTAAAATTCCCGGCCTT
CTTGATGCTCTTCTACAAGTTATCGATGACTGGCGTGATATAGCGCTTCCAAGGGATCTTGTAAAGAAGCCAAGGGTGAGAACATTAGGTGCAAATTCTTCTGTAACGGG
ATTTGGGAATGAATTTGAGGCATTGGGCTCGAACGGCCTGAGACCTGGTTCTTCAGTTCCAGAGACAATGAGTCATGCTCCGAAACCATCTCCTCGACATTGGTGGCTTG
ATGAAGATGGTCTATTTAATCTGGATGACGAAGGACGAGCAGAAAGACAGCAGTGTGCTGTTTCTGCCTCAAATATCATTCGAAACTTTTCTTTCATGCCAGAGAATGAA
GTTATTATGGCCCAACATCGACATACTTTGGAAACAGTGTTTCAGTGTATAGAAGATCATATTACAGAGGATGAGGAACTTGTCACAAACACCCTAGAGACAATTGTGAA
TTTAGCTCCGTTGCTAGATCTTCGTATCTTTAGCTCATCGAAACCATCCTACATCAAAATAACTGAGAAACGAGCAGGTGAAGCCATCATGGGAATGCTGGGATCCTCTG
TCAAAGTTTGGCACTGTGCTGCCGCAGAATTACTTGGACGATTAATAATAAATCCTGATAATGAGCCTTTCCTTCTTCCCTTTGTTCCCCAGATACACAAGCGTTTAGTC
GACCTTATGAGCATCCCAGCATTAGATGCACAAGCAGCAGCTGTTGGTGCACTGTATAACCTCGTCGAAGTTAACATGGACTGCAGAATAAAGCTAGCAAGTGAGCGATG
GGCAATCGACCGGCTTCTTAAAGTAATCAAGACACCTCATCCAGTTCCAGAAATATGCAGGAAGGCCGCAATGATATTGGAGAGTCTTGTATCTGAGCCGCAGAACAGGG
CTCTACTTCTAGTATACGAAAATGCATTTGCAGAAATACTCTTTTCAGATGGCAGATATTCTGATACGTTTGCGAGGATATTGTATGAACTAACATCGAGACCTAACAAT
AAAGTTGCTGCTGCTCAAGGAGTATGGGGCATGTGA
mRNA sequenceShow/hide mRNA sequence
ATGCAGAAGAGAGAGCAGAGCAAGTTGGGCGGAAATGTTGGTGGTGGCTCCGCGCCTCCGGCGAAGCGCGGCCGTCCATTCGGCAGTTCGAGCAGCAACGCCGCCGCTGC
AGCCGCTGCCGAGACCTTGGCTCCATCGGCACACCTTGGCCCTTCTCTCCATGTTCATACTTCCTTCGCAGATCAAAACAATAAAAGGATAGTGTTGGCTCTACAGAGTG
GATTGAAGAGTGAATTGACGTGGGCATTGAACACTCTCACTCTGCTCTCCTTCAAAGAGAAGGATGATATGCGCAAAGACTCTACTCCCCTGGCTAAAATTCCCGGCCTT
CTTGATGCTCTTCTACAAGTTATCGATGACTGGCGTGATATAGCGCTTCCAAGGGATCTTGTAAAGAAGCCAAGGGTGAGAACATTAGGTGCAAATTCTTCTGTAACGGG
ATTTGGGAATGAATTTGAGGCATTGGGCTCGAACGGCCTGAGACCTGGTTCTTCAGTTCCAGAGACAATGAGTCATGCTCCGAAACCATCTCCTCGACATTGGTGGCTTG
ATGAAGATGGTCTATTTAATCTGGATGACGAAGGACGAGCAGAAAGACAGCAGTGTGCTGTTTCTGCCTCAAATATCATTCGAAACTTTTCTTTCATGCCAGAGAATGAA
GTTATTATGGCCCAACATCGACATACTTTGGAAACAGTGTTTCAGTGTATAGAAGATCATATTACAGAGGATGAGGAACTTGTCACAAACACCCTAGAGACAATTGTGAA
TTTAGCTCCGTTGCTAGATCTTCGTATCTTTAGCTCATCGAAACCATCCTACATCAAAATAACTGAGAAACGAGCAGGTGAAGCCATCATGGGAATGCTGGGATCCTCTG
TCAAAGTTTGGCACTGTGCTGCCGCAGAATTACTTGGACGATTAATAATAAATCCTGATAATGAGCCTTTCCTTCTTCCCTTTGTTCCCCAGATACACAAGCGTTTAGTC
GACCTTATGAGCATCCCAGCATTAGATGCACAAGCAGCAGCTGTTGGTGCACTGTATAACCTCGTCGAAGTTAACATGGACTGCAGAATAAAGCTAGCAAGTGAGCGATG
GGCAATCGACCGGCTTCTTAAAGTAATCAAGACACCTCATCCAGTTCCAGAAATATGCAGGAAGGCCGCAATGATATTGGAGAGTCTTGTATCTGAGCCGCAGAACAGGG
CTCTACTTCTAGTATACGAAAATGCATTTGCAGAAATACTCTTTTCAGATGGCAGATATTCTGATACGTTTGCGAGGATATTGTATGAACTAACATCGAGACCTAACAAT
AAAGTTGCTGCTGCTCAAGGAGTATGGGGCATGTGA
Protein sequenceShow/hide protein sequence
MQKREQSKLGGNVGGGSAPPAKRGRPFGSSSSNAAAAAAAETLAPSAHLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDSTPLAKIPGL
LDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVPETMSHAPKPSPRHWWLDEDGLFNLDDEGRAERQQCAVSASNIIRNFSFMPENE
VIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSVKVWHCAAAELLGRLIINPDNEPFLLPFVPQIHKRLV
DLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSEPQNRALLLVYENAFAEILFSDGRYSDTFARILYELTSRPNN
KVAAAQGVWGM