; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC05g0487 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC05g0487
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Descriptionarmadillo repeat-containing protein LFR-like
Genome locationMC05:3623190..3626078
RNA-Seq ExpressionMC05g0487
SyntenyMC05g0487
Gene Ontology termsGO:0006338 - chromatin remodeling (biological process)
GO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0045893 - positive regulation of transcription, DNA-templated (biological process)
GO:0048366 - leaf development (biological process)
GO:0048653 - anther development (biological process)
GO:0005654 - nucleoplasm (cellular component)
GO:0016514 - SWI/SNF complex (cellular component)
GO:0035060 - brahma complex (cellular component)
GO:0005515 - protein binding (molecular function)
GO:0031491 - nucleosome binding (molecular function)
InterPro domainsIPR000225 - Armadillo
IPR011989 - Armadillo-like helical
IPR016024 - Armadillo-type fold
IPR021906 - SWI/SNF-like complex subunit BAF250/Osa


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6582343.1 Armadillo repeat-containing protein LFR, partial [Cucurbita argyrosperma subsp. sororia]3.58e-30392.7Show/hide
Query:  MQKREQSKLGGNVGGGSAPPAKRGRPFGSSSSNAAAAAAA-ETLAPSAHLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRK
        MQKREQ+KLGGNVG  SAPPAKRGRPFGS +SNAAAAAAA ETLAPSA LGPSLH+HTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMR+
Subjt:  MQKREQSKLGGNVGGGSAPPAKRGRPFGSSSSNAAAAAAA-ETLAPSAHLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRK

Query:  DSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVPETMSHAPKPSPRHWWLDEDGLFNLDDEGRAER
        DSTPLAKIPGLLDALLQVIDDWRD+ALP+D +KK RVRTLGANS +TGFGNEFEALGSNGLRPGSS  E   H+PKPSPRHWWLDEDGLFN DDEGRAER
Subjt:  DSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVPETMSHAPKPSPRHWWLDEDGLFNLDDEGRAER

Query:  QQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSVKVWHC
        QQCAVSASNI+RNFSFMPENE IMAQHRHTLETVFQC+EDHITEDEELVTN LETIVNLAPLLDLRIFSSSKPSYIKITEKRA EAIMGMLGS+VKVWHC
Subjt:  QQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSVKVWHC

Query:  AAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE
        AAAELLGRLIINPDNEPFLLPF PQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE
Subjt:  AAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE

Query:  PQNRALLLVYENAFAEILFSDGRYSDTFARILYELTLRPNNKVAAAQGVWGM
        PQNR LLL YENAFAEILFSDGRYSDTFARILYELT RPNNKVAAAQGVWGM
Subjt:  PQNRALLLVYENAFAEILFSDGRYSDTFARILYELTLRPNNKVAAAQGVWGM

XP_022138003.1 armadillo repeat-containing protein LFR isoform X1 [Momordica charantia]0.099.78Show/hide
Query:  MQKREQSKLGGNVGGGSAPPAKRGRPFGSSSSNAAAAAAAETLAPSAHLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKD
        MQKREQSKLGGNVGGGSAPPAKRGRPFGSSSSNAAAAAAAETLAPSAHLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKD
Subjt:  MQKREQSKLGGNVGGGSAPPAKRGRPFGSSSSNAAAAAAAETLAPSAHLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKD

Query:  STPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVPETMSHAPKPSPRHWWLDEDGLFNLDDEGRAERQ
        STPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVPETMSHAPKPSPRHWWLDEDGLFNLDDEGRAERQ
Subjt:  STPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVPETMSHAPKPSPRHWWLDEDGLFNLDDEGRAERQ

Query:  QCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSVKVWHCA
        QCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSVKVWHCA
Subjt:  QCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSVKVWHCA

Query:  AAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSEP
        AAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSEP
Subjt:  AAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSEP

Query:  QNRALLLVYENAFAEILFSDGRYSDTFARILYELTLRPNNKVAAAQGVWGM
        QNRALLLVYENAFAEILFSDGRYSDTFARILYELT RPNNKVAAAQGVWGM
Subjt:  QNRALLLVYENAFAEILFSDGRYSDTFARILYELTLRPNNKVAAAQGVWGM

XP_022924723.1 armadillo repeat-containing protein LFR-like [Cucurbita moschata]1.25e-30392.92Show/hide
Query:  MQKREQSKLGGNVGGGSAPPAKRGRPFGSSSSNAAAAAAA-ETLAPSAHLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRK
        MQKREQ+KLGGNVG  SAPPAKRGRPFGS +SNAAAAAAA ETLAPSA LGPSLH+HTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMR+
Subjt:  MQKREQSKLGGNVGGGSAPPAKRGRPFGSSSSNAAAAAAA-ETLAPSAHLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRK

Query:  DSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVPETMSHAPKPSPRHWWLDEDGLFNLDDEGRAER
        DSTPLAKIPGLLDALLQVIDDWRD+ALP+D VKK RVRTLGANS +TGFGNEFEALGSNGLRPGSS  E   H+PKPSPRHWWLDEDGLFN DDEGRAER
Subjt:  DSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVPETMSHAPKPSPRHWWLDEDGLFNLDDEGRAER

Query:  QQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSVKVWHC
        QQCAVSASNI+RNFSFMPENE IMAQHRHTLETVFQC+EDHITEDEELVTN LETIVNLAPLLDLRIFSSSKPSYIKITEKRA EAIMGMLGS+VKVWHC
Subjt:  QQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSVKVWHC

Query:  AAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE
        AAAELLGRLIINPDNEPFLLPF PQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE
Subjt:  AAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE

Query:  PQNRALLLVYENAFAEILFSDGRYSDTFARILYELTLRPNNKVAAAQGVWGM
        PQNR LLL YENAFAEILFSDGRYSDTFARILYELT RPNNKVAAAQGVWGM
Subjt:  PQNRALLLVYENAFAEILFSDGRYSDTFARILYELTLRPNNKVAAAQGVWGM

XP_022979388.1 armadillo repeat-containing protein LFR-like [Cucurbita maxima]3.74e-30593.58Show/hide
Query:  MQKREQSKLGGNVGGGSAPPAKRGRPFGSSSSNAAAAAAA-ETLAPSAHLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRK
        MQKREQ+KLGGNVG  SAPPAKRGRPFGS +SNAAAAAAA ETLAPSA LGPSLH+HTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMR+
Subjt:  MQKREQSKLGGNVGGGSAPPAKRGRPFGSSSSNAAAAAAA-ETLAPSAHLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRK

Query:  DSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVPETMSHAPKPSPRHWWLDEDGLFNLDDEGRAER
        DSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKK RVRTLGANS +TGFGNEFEALGSNGLRPGSS  +   HAPKPSPRHWWLDEDGLFN DDEGRAER
Subjt:  DSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVPETMSHAPKPSPRHWWLDEDGLFNLDDEGRAER

Query:  QQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSVKVWHC
        QQCAVSASNI+RNFSFMPENE IMAQHRHTLETVFQC+EDHITEDEELVTN LETIVNLAPLLDLRIFSSSKPSYIKITEKRA EAIMGMLGS+VKVWHC
Subjt:  QQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSVKVWHC

Query:  AAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE
        AAAELLGRLIINPDNEPFLLPF PQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE
Subjt:  AAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE

Query:  PQNRALLLVYENAFAEILFSDGRYSDTFARILYELTLRPNNKVAAAQGVWGM
        PQNR LLL YENAFAEILFSDGRYSDTFARILYELT RPNNKVAAAQGVWGM
Subjt:  PQNRALLLVYENAFAEILFSDGRYSDTFARILYELTLRPNNKVAAAQGVWGM

XP_023527171.1 armadillo repeat-containing protein LFR-like [Cucurbita pepo subsp. pepo]3.07e-30493.36Show/hide
Query:  MQKREQSKLGGNVGGGSAPPAKRGRPFGSSSSNAAAAAAA-ETLAPSAHLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRK
        MQKREQ+KLGGNVG  SAPPAKRGRPFGS +SNAAAAAAA ETLAPSA LGPSLH+HTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMR+
Subjt:  MQKREQSKLGGNVGGGSAPPAKRGRPFGSSSSNAAAAAAA-ETLAPSAHLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRK

Query:  DSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVPETMSHAPKPSPRHWWLDEDGLFNLDDEGRAER
        DSTPLAKIPGLLDALLQVIDDWRDIALP+D VKK RVRTLGANS +TGFGNEFEALGSNGLRPGSS  E   HAPKPSPRHWWLDEDGLFN DDEGRAER
Subjt:  DSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVPETMSHAPKPSPRHWWLDEDGLFNLDDEGRAER

Query:  QQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSVKVWHC
        QQCAVSASNI+RNFSFMPENE IMAQHRHTLETVFQC+EDHITEDEELVTN LETIVNLAPLLDLRIFSSSKPSYIKITEKRA EAIMGMLGS+VKVWHC
Subjt:  QQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSVKVWHC

Query:  AAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE
        AAAELLGRLIINPDNEPFLLPF PQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE
Subjt:  AAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE

Query:  PQNRALLLVYENAFAEILFSDGRYSDTFARILYELTLRPNNKVAAAQGVWGM
        PQNR LLL YENAFAEILFSDGRYSDTFARILYELT RPNNKVAAAQGVWGM
Subjt:  PQNRALLLVYENAFAEILFSDGRYSDTFARILYELTLRPNNKVAAAQGVWGM

TrEMBL top hitse value%identityAlignment
A0A1S3AWW3 armadillo repeat-containing protein LFR9.50e-30091.9Show/hide
Query:  MQKREQSKLGGNVGGG-SAPPAKRGRPFGSSSSNAAAAAAA-----ETLAPSAHLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEK
        MQKR+Q+KLGGNV GG SAPPAKRGRPFGS +SNAAA AAA     ETLAPS  LGPSLH+HTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEK
Subjt:  MQKREQSKLGGNVGGG-SAPPAKRGRPFGSSSSNAAAAAAA-----ETLAPSAHLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEK

Query:  DDMRKDSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVPETMSHAPKPSPRHWWLDEDGLFNLDDE
        DDMR+DSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKK RVRTLGANSSVTGFGNEFEALGS+GLRPGSS  E+  HA KPS RHWWL+EDGLFNLDDE
Subjt:  DDMRKDSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVPETMSHAPKPSPRHWWLDEDGLFNLDDE

Query:  GRAERQQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSV
        GRAERQQCAVSASNI+RNFSFMPENE IMA HRHTLETVFQCIEDH+TEDEELVTN LETIVNLAPLLDLRIFSSSKPSYIKITEKRA EAIMGMLGS+V
Subjt:  GRAERQQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSV

Query:  KVWHCAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILE
        KVWHCAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIK PHPVPEICRKAAMILE
Subjt:  KVWHCAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILE

Query:  SLVSEPQNRALLLVYENAFAEILFSDGRYSDTFARILYELTLRPNNKVAAAQGVWGM
        SLVSEPQNR LLL YENAFAEILFSDGRYSDTFARILYELT RPNNKVAAAQGVWGM
Subjt:  SLVSEPQNRALLLVYENAFAEILFSDGRYSDTFARILYELTLRPNNKVAAAQGVWGM

A0A5D3D0A8 Armadillo repeat-containing protein LFR9.50e-30091.9Show/hide
Query:  MQKREQSKLGGNVGGG-SAPPAKRGRPFGSSSSNAAAAAAA-----ETLAPSAHLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEK
        MQKR+Q+KLGGNV GG SAPPAKRGRPFGS +SNAAA AAA     ETLAPS  LGPSLH+HTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEK
Subjt:  MQKREQSKLGGNVGGG-SAPPAKRGRPFGSSSSNAAAAAAA-----ETLAPSAHLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEK

Query:  DDMRKDSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVPETMSHAPKPSPRHWWLDEDGLFNLDDE
        DDMR+DSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKK RVRTLGANSSVTGFGNEFEALGS+GLRPGSS  E+  HA KPS RHWWL+EDGLFNLDDE
Subjt:  DDMRKDSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVPETMSHAPKPSPRHWWLDEDGLFNLDDE

Query:  GRAERQQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSV
        GRAERQQCAVSASNI+RNFSFMPENE IMA HRHTLETVFQCIEDH+TEDEELVTN LETIVNLAPLLDLRIFSSSKPSYIKITEKRA EAIMGMLGS+V
Subjt:  GRAERQQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSV

Query:  KVWHCAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILE
        KVWHCAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIK PHPVPEICRKAAMILE
Subjt:  KVWHCAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILE

Query:  SLVSEPQNRALLLVYENAFAEILFSDGRYSDTFARILYELTLRPNNKVAAAQGVWGM
        SLVSEPQNR LLL YENAFAEILFSDGRYSDTFARILYELT RPNNKVAAAQGVWGM
Subjt:  SLVSEPQNRALLLVYENAFAEILFSDGRYSDTFARILYELTLRPNNKVAAAQGVWGM

A0A6J1CBV6 armadillo repeat-containing protein LFR isoform X10.099.78Show/hide
Query:  MQKREQSKLGGNVGGGSAPPAKRGRPFGSSSSNAAAAAAAETLAPSAHLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKD
        MQKREQSKLGGNVGGGSAPPAKRGRPFGSSSSNAAAAAAAETLAPSAHLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKD
Subjt:  MQKREQSKLGGNVGGGSAPPAKRGRPFGSSSSNAAAAAAAETLAPSAHLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKD

Query:  STPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVPETMSHAPKPSPRHWWLDEDGLFNLDDEGRAERQ
        STPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVPETMSHAPKPSPRHWWLDEDGLFNLDDEGRAERQ
Subjt:  STPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVPETMSHAPKPSPRHWWLDEDGLFNLDDEGRAERQ

Query:  QCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSVKVWHCA
        QCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSVKVWHCA
Subjt:  QCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSVKVWHCA

Query:  AAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSEP
        AAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSEP
Subjt:  AAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSEP

Query:  QNRALLLVYENAFAEILFSDGRYSDTFARILYELTLRPNNKVAAAQGVWGM
        QNRALLLVYENAFAEILFSDGRYSDTFARILYELT RPNNKVAAAQGVWGM
Subjt:  QNRALLLVYENAFAEILFSDGRYSDTFARILYELTLRPNNKVAAAQGVWGM

A0A6J1EDA6 armadillo repeat-containing protein LFR-like6.05e-30492.92Show/hide
Query:  MQKREQSKLGGNVGGGSAPPAKRGRPFGSSSSNAAAAAAA-ETLAPSAHLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRK
        MQKREQ+KLGGNVG  SAPPAKRGRPFGS +SNAAAAAAA ETLAPSA LGPSLH+HTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMR+
Subjt:  MQKREQSKLGGNVGGGSAPPAKRGRPFGSSSSNAAAAAAA-ETLAPSAHLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRK

Query:  DSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVPETMSHAPKPSPRHWWLDEDGLFNLDDEGRAER
        DSTPLAKIPGLLDALLQVIDDWRD+ALP+D VKK RVRTLGANS +TGFGNEFEALGSNGLRPGSS  E   H+PKPSPRHWWLDEDGLFN DDEGRAER
Subjt:  DSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVPETMSHAPKPSPRHWWLDEDGLFNLDDEGRAER

Query:  QQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSVKVWHC
        QQCAVSASNI+RNFSFMPENE IMAQHRHTLETVFQC+EDHITEDEELVTN LETIVNLAPLLDLRIFSSSKPSYIKITEKRA EAIMGMLGS+VKVWHC
Subjt:  QQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSVKVWHC

Query:  AAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE
        AAAELLGRLIINPDNEPFLLPF PQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE
Subjt:  AAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE

Query:  PQNRALLLVYENAFAEILFSDGRYSDTFARILYELTLRPNNKVAAAQGVWGM
        PQNR LLL YENAFAEILFSDGRYSDTFARILYELT RPNNKVAAAQGVWGM
Subjt:  PQNRALLLVYENAFAEILFSDGRYSDTFARILYELTLRPNNKVAAAQGVWGM

A0A6J1IW20 armadillo repeat-containing protein LFR-like1.81e-30593.58Show/hide
Query:  MQKREQSKLGGNVGGGSAPPAKRGRPFGSSSSNAAAAAAA-ETLAPSAHLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRK
        MQKREQ+KLGGNVG  SAPPAKRGRPFGS +SNAAAAAAA ETLAPSA LGPSLH+HTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMR+
Subjt:  MQKREQSKLGGNVGGGSAPPAKRGRPFGSSSSNAAAAAAA-ETLAPSAHLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRK

Query:  DSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVPETMSHAPKPSPRHWWLDEDGLFNLDDEGRAER
        DSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKK RVRTLGANS +TGFGNEFEALGSNGLRPGSS  +   HAPKPSPRHWWLDEDGLFN DDEGRAER
Subjt:  DSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVPETMSHAPKPSPRHWWLDEDGLFNLDDEGRAER

Query:  QQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSVKVWHC
        QQCAVSASNI+RNFSFMPENE IMAQHRHTLETVFQC+EDHITEDEELVTN LETIVNLAPLLDLRIFSSSKPSYIKITEKRA EAIMGMLGS+VKVWHC
Subjt:  QQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSVKVWHC

Query:  AAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE
        AAAELLGRLIINPDNEPFLLPF PQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE
Subjt:  AAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE

Query:  PQNRALLLVYENAFAEILFSDGRYSDTFARILYELTLRPNNKVAAAQGVWGM
        PQNR LLL YENAFAEILFSDGRYSDTFARILYELT RPNNKVAAAQGVWGM
Subjt:  PQNRALLLVYENAFAEILFSDGRYSDTFARILYELTLRPNNKVAAAQGVWGM

SwissProt top hitse value%identityAlignment
E9Q7E2 AT-rich interactive domain-containing protein 21.1e-0624.36Show/hide
Query:  SAHLGPSLHVHTSFADQNN-KRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDSTPLAKIPGLLDALLQVIDD------------WRD------IAL
        S +L  S  +   F   N+  ++VL+L SGL +E+ +A+N  TLLS + K  M+ +  P  KI  LL A   V DD            WR+      +  
Subjt:  SAHLGPSLHVHTSFADQNN-KRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDSTPLAKIPGLLDALLQVIDD------------WRD------IAL

Query:  PRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVPETMSHAPKPSPRHWWLDEDGLFNLDDEGRAERQQCAVSASNIIRNFSFMPENEVIMAQH
         +D+V    VR L ++ +            ++   PG  + E++ H P+            L   D EG     Q  +  + I+RN SF   N  ++A +
Subjt:  PRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVPETMSHAPKPSPRHWWLDEDGLFNLDDEGRAERQQCAVSASNIIRNFSFMPENEVIMAQH

Query:  RHTLETVFQCIEDHITEDEELVTNTLETIVNLAP--LLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSVKVWHCAAAELLGRLIINPDNEPFLLPFVPQ
        R  L  +      H     +L    L+T+ N+A   LLD   F ++   +  +T+          L S  +       E+LG L    DN   +  +V Q
Subjt:  RHTLETVFQCIEDHITEDEELVTNTLETIVNLAP--LLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSVKVWHCAAAELLGRLIINPDNEPFLLPFVPQ

Query:  -IHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVI
          ++ ++  +++P +    + +  LY L E+      K+A    +ID L+ ++
Subjt:  -IHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVI

Q68CP9 AT-rich interactive domain-containing protein 21.9e-0624.08Show/hide
Query:  SAHLGPSLHVHTSFADQNN-KRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDSTPLAKIPGLLDALLQVIDD------------WRD------IAL
        S +L  S  +   F   N+  ++VL+L SGL +E+ +A+N  TLLS + K  M+ +  P  KI  LL A   V DD            W++      +  
Subjt:  SAHLGPSLHVHTSFADQNN-KRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDSTPLAKIPGLLDALLQVIDD------------WRD------IAL

Query:  PRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVPETMSHAPKPSPRHWWLDEDGLFNLDDEGRAERQQCAVSASNIIRNFSFMPENEVIMAQH
         +D+V    VR L ++ + +  G             G  + E++ H P+            L   D EG     Q  +  + I+RN SF   N  ++A +
Subjt:  PRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVPETMSHAPKPSPRHWWLDEDGLFNLDDEGRAERQQCAVSASNIIRNFSFMPENEVIMAQH

Query:  RHTLETVFQCIEDHITEDEELVTNTLETIVNLAP--LLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSVKVWHCAAAELLGRLIINPDNEPFLLPFVPQ
        R  L  +      H     +L    L+T+ N+A   LLD   F ++   +  +T+          L S  +       E+LG L    DN   +  +V Q
Subjt:  RHTLETVFQCIEDHITEDEELVTNTLETIVNLAP--LLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSVKVWHCAAAELLGRLIINPDNEPFLLPFVPQ

Query:  -IHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVI
          ++ ++  +++P +    + +  LY L E+      K+A    +ID L+ ++
Subjt:  -IHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVI

Q6YTW6 Armadillo repeat-containing protein LFR9.7e-18171.43Show/hide
Query:  GGNVGGGSAPPAKRGRPFGSSSSNAAAAAAA-----ETLAPSAHLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDSTPL
        G + GGG + PAKRGRPFGS++ + AAAAAA     +  AP+A +GPSL V T+ +DQNNKRIVLALQSGLKSE+ WALN LT+LSFKEKDD+R+D+TPL
Subjt:  GGNVGGGSAPPAKRGRPFGSSSSNAAAAAAA-----ETLAPSAHLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDSTPL

Query:  AKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNE-FEALGSNGLRPGSSVPETM-SHAPKPSPRHWWLDEDGLFNLDDEGRAERQQC
        AK+PGLLDALLQVIDDWRDIA+P+D  K PRVRTLG N++++GFG+E  E + S+   P     +T  S   K     +  DE+GLFN+DDEGR E+QQC
Subjt:  AKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNE-FEALGSNGLRPGSSVPETM-SHAPKPSPRHWWLDEDGLFNLDDEGRAERQQC

Query:  AVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSVKVWHCAAA
        AV+ASNIIRNFSFMPENE +M QHRH LETVFQC+ED  TED+EL+TN LET+VNLAP+LDLRIFSSSKPS+IKITEKRA +AIMGML SS++VWHCAAA
Subjt:  AVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSVKVWHCAAA

Query:  ELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSEPQN
        EL+GRLIINPDNEPFLLP +PQI+KRLVDL+S+PA+DAQAAA+ ALYN+ EVNMD R+KLASERWA+DRLLKV+KTPHPVPE+CRKA+MI+ESLVSEPQN
Subjt:  ELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSEPQN

Query:  RALLLVYENAFAEILFSDGRYSDTFARILYELTLRPNNKVAAAQGVWG
        R  LLV+EN FAEIL S+G+YSDTFARILYELT RP+NKV A Q +WG
Subjt:  RALLLVYENAFAEILFSDGRYSDTFARILYELTLRPNNKVAAAQGVWG

Q9LS90 Armadillo repeat-containing protein LFR4.5e-19474.35Show/hide
Query:  MQKREQSKLGGNVGGGSAPPAKRGRPFGSSSSN----AAAAAAAETLAPSAHLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDD
        MQKRE  K GGN GG S PPAKRGRPFGS+S+N    AAAAAAA+ ++PSA LGPSL VH SF +QNN+RIVLALQSGLKSE+TWALNTLTLLSFKEK+D
Subjt:  MQKREQSKLGGNVGGGSAPPAKRGRPFGSSSSN----AAAAAAAETLAPSAHLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDD

Query:  MRKDSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGS---NGLRPGSSVPETM--SHAPKPSPRHWWLDEDGLFNL
        +R+D  PLAKI GLLDALL +IDDWRDIALP+DL +  RVRTLG N+SVTGFGNE++AL S    G   GSS  E +      K     WW++EDGLFNL
Subjt:  MRKDSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGS---NGLRPGSSVPETM--SHAPKPSPRHWWLDEDGLFNL

Query:  DDEGRAERQQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLG
        DDEGR+E+Q CA++ASN+IRNFSFMP+NEV+MAQHRH LETVFQCI DH+TEDEELVTN+LETIVNLA L+DLRIFSS K SYI I EK+A +A++G+L 
Subjt:  DDEGRAERQQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLG

Query:  SSVKVWHCAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAM
        SSVK W+CAAAELLGRLIINPDNEPF+ P +PQIHKRL+DL+SI A+DAQAAAVGALYNLVEVNMDCR+KLASERWA+DRLLKVIKTPHPVPE+CRKAAM
Subjt:  SSVKVWHCAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAM

Query:  ILESLVSEPQNRALLLVYENAFAEILFSDGRYSDTFARILYELTLRPNNKVAAAQGVWGM
        ILE+LVSEPQNR LLL YENAFAE+LF +G+YSD+FARILYELT R N++VA+A+G+WGM
Subjt:  ILESLVSEPQNRALLLVYENAFAEILFSDGRYSDTFARILYELTLRPNNKVAAAQGVWGM

Arabidopsis top hitse value%identityAlignment
AT3G22990.1 ARM repeat superfamily protein3.2e-19574.35Show/hide
Query:  MQKREQSKLGGNVGGGSAPPAKRGRPFGSSSSN----AAAAAAAETLAPSAHLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDD
        MQKRE  K GGN GG S PPAKRGRPFGS+S+N    AAAAAAA+ ++PSA LGPSL VH SF +QNN+RIVLALQSGLKSE+TWALNTLTLLSFKEK+D
Subjt:  MQKREQSKLGGNVGGGSAPPAKRGRPFGSSSSN----AAAAAAAETLAPSAHLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDD

Query:  MRKDSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGS---NGLRPGSSVPETM--SHAPKPSPRHWWLDEDGLFNL
        +R+D  PLAKI GLLDALL +IDDWRDIALP+DL +  RVRTLG N+SVTGFGNE++AL S    G   GSS  E +      K     WW++EDGLFNL
Subjt:  MRKDSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGS---NGLRPGSSVPETM--SHAPKPSPRHWWLDEDGLFNL

Query:  DDEGRAERQQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLG
        DDEGR+E+Q CA++ASN+IRNFSFMP+NEV+MAQHRH LETVFQCI DH+TEDEELVTN+LETIVNLA L+DLRIFSS K SYI I EK+A +A++G+L 
Subjt:  DDEGRAERQQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLG

Query:  SSVKVWHCAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAM
        SSVK W+CAAAELLGRLIINPDNEPF+ P +PQIHKRL+DL+SI A+DAQAAAVGALYNLVEVNMDCR+KLASERWA+DRLLKVIKTPHPVPE+CRKAAM
Subjt:  SSVKVWHCAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAM

Query:  ILESLVSEPQNRALLLVYENAFAEILFSDGRYSDTFARILYELTLRPNNKVAAAQGVWGM
        ILE+LVSEPQNR LLL YENAFAE+LF +G+YSD+FARILYELT R N++VA+A+G+WGM
Subjt:  ILESLVSEPQNRALLLVYENAFAEILFSDGRYSDTFARILYELTLRPNNKVAAAQGVWGM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAGAAGAGAGAGCAGAGCAAGTTGGGCGGAAATGTTGGTGGTGGCTCCGCGCCTCCGGCGAAGCGCGGCCGTCCATTCGGCAGTTCGAGCAGCAACGCCGCCGCTGC
AGCCGCTGCCGAGACCTTGGCTCCATCGGCACACCTTGGCCCTTCTCTCCATGTTCATACTTCCTTCGCAGATCAAAACAATAAAAGGATAGTGTTGGCTCTACAGAGTG
GATTGAAGAGTGAATTGACGTGGGCATTGAACACTCTCACTCTGCTCTCCTTCAAAGAGAAGGATGATATGCGCAAAGACTCTACTCCCCTGGCTAAAATTCCCGGCCTT
CTTGATGCTCTTCTACAAGTTATCGATGACTGGCGTGATATAGCGCTTCCAAGGGATCTTGTAAAGAAGCCAAGGGTGAGAACATTAGGTGCAAATTCTTCTGTAACGGG
ATTTGGGAATGAATTTGAGGCATTGGGCTCGAACGGCCTGAGACCTGGTTCTTCAGTTCCAGAGACAATGAGTCATGCTCCGAAACCATCTCCTCGACATTGGTGGCTTG
ATGAAGATGGTCTATTTAATCTGGATGACGAAGGACGAGCAGAAAGACAGCAGTGTGCTGTTTCTGCCTCAAATATCATTCGAAACTTTTCTTTCATGCCAGAGAATGAA
GTTATTATGGCCCAACATCGACATACTTTGGAAACAGTGTTTCAGTGTATAGAAGATCATATTACAGAGGATGAGGAACTTGTCACAAACACCCTAGAGACAATTGTGAA
TTTAGCTCCGTTGCTAGATCTTCGTATCTTTAGCTCATCGAAACCATCCTACATCAAAATAACTGAGAAACGAGCAGGTGAAGCCATCATGGGAATGCTGGGATCCTCTG
TCAAAGTTTGGCACTGTGCTGCCGCAGAATTACTTGGACGATTAATAATAAATCCTGATAATGAGCCTTTCCTTCTTCCCTTTGTTCCCCAGATACACAAGCGTTTAGTC
GACCTTATGAGCATCCCAGCATTAGATGCACAAGCAGCAGCTGTTGGTGCACTGTATAACCTCGTCGAAGTTAACATGGACTGCAGAATAAAGCTAGCAAGTGAGCGATG
GGCAATCGATCGGCTTCTTAAAGTAATCAAGACACCTCATCCAGTTCCAGAAATATGCAGGAAGGCCGCAATGATATTGGAGAGTCTTGTATCTGAGCCGCAGAACAGGG
CTCTACTTCTAGTATACGAAAATGCATTTGCAGAAATACTCTTTTCAGATGGCAGATATTCTGATACGTTTGCGAGGATATTGTATGAACTAACATTGAGACCTAACAAT
AAAGTTGCTGCTGCTCAAGGAGTATGGGGCATGTGA
mRNA sequenceShow/hide mRNA sequence
ATCGGAGCATGGTTTGGGTTACTTCCGAAGCCCATTAATTTGGGCCTTACAAAAATAGGTTCATGAATTGGGCCTCCGTATATTTATGGGCCCTTCGCGGTCCACTCATT
AATCTTGGTGGGTGGCGGGTGGATTCATTCTGCTGTATCGATGCCCTAAATTTGATTTCCAGCTCCCGATTTGTTACCGGACCGGATCTCGTCGGATCGCCGTTGCCGAA
TCCGTCGAAGTTCAGTGGCAACAAAACGGGCGATGCAGAAGAGAGAGCAGAGCAAGTTGGGCGGAAATGTTGGTGGTGGCTCCGCGCCTCCGGCGAAGCGCGGCCGTCCA
TTCGGCAGTTCGAGCAGCAACGCCGCCGCTGCAGCCGCTGCCGAGACCTTGGCTCCATCGGCACACCTTGGCCCTTCTCTCCATGTTCATACTTCCTTCGCAGATCAAAA
CAATAAAAGGATAGTGTTGGCTCTACAGAGTGGATTGAAGAGTGAATTGACGTGGGCATTGAACACTCTCACTCTGCTCTCCTTCAAAGAGAAGGATGATATGCGCAAAG
ACTCTACTCCCCTGGCTAAAATTCCCGGCCTTCTTGATGCTCTTCTACAAGTTATCGATGACTGGCGTGATATAGCGCTTCCAAGGGATCTTGTAAAGAAGCCAAGGGTG
AGAACATTAGGTGCAAATTCTTCTGTAACGGGATTTGGGAATGAATTTGAGGCATTGGGCTCGAACGGCCTGAGACCTGGTTCTTCAGTTCCAGAGACAATGAGTCATGC
TCCGAAACCATCTCCTCGACATTGGTGGCTTGATGAAGATGGTCTATTTAATCTGGATGACGAAGGACGAGCAGAAAGACAGCAGTGTGCTGTTTCTGCCTCAAATATCA
TTCGAAACTTTTCTTTCATGCCAGAGAATGAAGTTATTATGGCCCAACATCGACATACTTTGGAAACAGTGTTTCAGTGTATAGAAGATCATATTACAGAGGATGAGGAA
CTTGTCACAAACACCCTAGAGACAATTGTGAATTTAGCTCCGTTGCTAGATCTTCGTATCTTTAGCTCATCGAAACCATCCTACATCAAAATAACTGAGAAACGAGCAGG
TGAAGCCATCATGGGAATGCTGGGATCCTCTGTCAAAGTTTGGCACTGTGCTGCCGCAGAATTACTTGGACGATTAATAATAAATCCTGATAATGAGCCTTTCCTTCTTC
CCTTTGTTCCCCAGATACACAAGCGTTTAGTCGACCTTATGAGCATCCCAGCATTAGATGCACAAGCAGCAGCTGTTGGTGCACTGTATAACCTCGTCGAAGTTAACATG
GACTGCAGAATAAAGCTAGCAAGTGAGCGATGGGCAATCGATCGGCTTCTTAAAGTAATCAAGACACCTCATCCAGTTCCAGAAATATGCAGGAAGGCCGCAATGATATT
GGAGAGTCTTGTATCTGAGCCGCAGAACAGGGCTCTACTTCTAGTATACGAAAATGCATTTGCAGAAATACTCTTTTCAGATGGCAGATATTCTGATACGTTTGCGAGGA
TATTGTATGAACTAACATTGAGACCTAACAATAAAGTTGCTGCTGCTCAAGGAGTATGGGGCATGTGATCTGTAGTACCCTACCATCAATCGTTGTATTCATCTATGATC
GAGATGACTAGTACTCAGCTACCGAATCAAGTGAAGAACTCGAACTTCATCATTACTATAAAATGTAAATATTAGCTTATTGGGGTATCAAATTCGAGCTATTGCACCTG
CGGAGTCCGTATTTCTCAGTCATTTGTACTGCTCATTTTGTAATTTGTTTCTATTTTCATGCTAGAGTGCTCAATATCTATGGTTGATTGCTAACATTGATATGTGAAAT
ATCATTTATAACTGGACCACTTAGTGTAGGTAGGATTTGTAATTTTTAACATTGACATTAGGGGTATAAATTTTAATAAAATAGGTCA
Protein sequenceShow/hide protein sequence
MQKREQSKLGGNVGGGSAPPAKRGRPFGSSSSNAAAAAAAETLAPSAHLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDSTPLAKIPGL
LDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVPETMSHAPKPSPRHWWLDEDGLFNLDDEGRAERQQCAVSASNIIRNFSFMPENE
VIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSVKVWHCAAAELLGRLIINPDNEPFLLPFVPQIHKRLV
DLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSEPQNRALLLVYENAFAEILFSDGRYSDTFARILYELTLRPNN
KVAAAQGVWGM