CuGenDBv2

MS005951 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS005951
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: armadillo repeat-containing protein LFR-like
Genome location: scaffold254:2652792..2655075
RNA-Seq Expression: MS005951
Synteny: MS005951
Gene Ontology terms:
GO:0006338 - chromatin remodeling (biological process)
GO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0045893 - positive regulation of transcription, DNA-templated (biological process)
GO:0048366 - leaf development (biological process)
GO:0048653 - anther development (biological process)
GO:0005654 - nucleoplasm (cellular component)
GO:0016514 - SWI/SNF complex (cellular component)
GO:0035060 - brahma complex (cellular component)
GO:0005515 - protein binding (molecular function)
GO:0031491 - nucleosome binding (molecular function)
InterPro domains:
IPR000225 - Armadillo
IPR011989 - Armadillo-like helical
IPR016024 - Armadillo-type fold
IPR021906 - SWI/SNF-like complex subunit BAF250/Osa
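
As a minimal sketch, the genome location string in this record (scaffold254:2652792..2655075) can be split into a sequence name and a start/end pair. The helper names `parse_location` and `span_length` are hypothetical, and treating the coordinates as 1-based and inclusive is an assumption; the record itself does not state the convention.

```python
import re

# Hypothetical helper: matches location strings of the form "name:start..end".
_LOCATION = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(location: str) -> tuple[str, int, int]:
    """Split 'scaffold:start..end' into (scaffold, start, end)."""
    match = _LOCATION.match(location.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start {start} is greater than end {end}")
    return match["seqid"], start, end


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("scaffold254:2652792..2655075")
print(seqid, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the gene region spans 2,284 bp on scaffold254.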


Homology
GenBank top hits (e value, % identity, alignment)
KAG6582343.1 Armadillo repeat-containing protein LFR, partial [Cucurbita argyrosperma subsp. sororia]; e value 1.2e-233; identity 92.49%
Query:  MQKREQSKLGGNVGGGSAPPAKRGRPFGSSSSN-AAAAAAAETLAPSAHLGPSLHVHTSFAADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMR
        MQKREQ+KLGGNVG  SAPPAKRGRPFGS +SN AAAAAAAETLAPSA LGPSLH+HTSF ADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMR
Subjt:  MQKREQSKLGGNVGGGSAPPAKRGRPFGSSSSN-AAAAAAAETLAPSAHLGPSLHVHTSFAADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMR

Query:  KDSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGYVPGSSVPETTSHAPKPSPRHWWLDEDGLFNLDDEGRAE
        +DSTPLAKIPGLLDALLQVIDDWRD+ALP+D +KK RVRTLGANS +TGFGNEFEALGSNG  PGSS  E T H+PKPSPRHWWLDEDGLFN DDEGRAE
Subjt:  KDSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGYVPGSSVPETTSHAPKPSPRHWWLDEDGLFNLDDEGRAE

Query:  RQQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSVKVWH
        RQQCAVSASNI+RNFSFMPENE IMAQHRHTLETVFQC+EDHITEDEELVTN LETIVNLAPLLDLRIFSSSKPSYIKITEKRA EAIMGMLGS+VKVWH
Subjt:  RQQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSVKVWH

Query:  CAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVS
        CAAAELLGRLIINPDNEPFLLPF PQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVS
Subjt:  CAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVS

Query:  EPQNRALLLVYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
        EPQNR LLL YENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
Subjt:  EPQNRALLLVYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM

XP_022138003.1 armadillo repeat-containing protein LFR isoform X1 [Momordica charantia]; e value 4.3e-250; identity 99.12%
Query:  MQKREQSKLGGNVGGGSAPPAKRGRPFGSSSSNAAAAAAAETLAPSAHLGPSLHVHTSFAADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRK
        MQKREQSKLGGNVGGGSAPPAKRGRPFGSSSSNAAAAAAAETLAPSAHLGPSLHVHTSF ADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRK
Subjt:  MQKREQSKLGGNVGGGSAPPAKRGRPFGSSSSNAAAAAAAETLAPSAHLGPSLHVHTSFAADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRK

Query:  DSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGYVPGSSVPETTSHAPKPSPRHWWLDEDGLFNLDDEGRAER
        DSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNG  PGSSVPET SHAPKPSPRHWWLDEDGLFNLDDEGRAER
Subjt:  DSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGYVPGSSVPETTSHAPKPSPRHWWLDEDGLFNLDDEGRAER

Query:  QQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSVKVWHC
        QQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSVKVWHC
Subjt:  QQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSVKVWHC

Query:  AAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE
        AAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE
Subjt:  AAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE

Query:  PQNRALLLVYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
        PQNRALLLVYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
Subjt:  PQNRALLLVYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM

XP_022924723.1 armadillo repeat-containing protein LFR-like [Cucurbita moschata]; e value 5.6e-234; identity 92.72%
Query:  MQKREQSKLGGNVGGGSAPPAKRGRPFGSSSSN-AAAAAAAETLAPSAHLGPSLHVHTSFAADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMR
        MQKREQ+KLGGNVG  SAPPAKRGRPFGS +SN AAAAAAAETLAPSA LGPSLH+HTSF ADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMR
Subjt:  MQKREQSKLGGNVGGGSAPPAKRGRPFGSSSSN-AAAAAAAETLAPSAHLGPSLHVHTSFAADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMR

Query:  KDSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGYVPGSSVPETTSHAPKPSPRHWWLDEDGLFNLDDEGRAE
        +DSTPLAKIPGLLDALLQVIDDWRD+ALP+D VKK RVRTLGANS +TGFGNEFEALGSNG  PGSS  E T H+PKPSPRHWWLDEDGLFN DDEGRAE
Subjt:  KDSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGYVPGSSVPETTSHAPKPSPRHWWLDEDGLFNLDDEGRAE

Query:  RQQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSVKVWH
        RQQCAVSASNI+RNFSFMPENE IMAQHRHTLETVFQC+EDHITEDEELVTN LETIVNLAPLLDLRIFSSSKPSYIKITEKRA EAIMGMLGS+VKVWH
Subjt:  RQQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSVKVWH

Query:  CAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVS
        CAAAELLGRLIINPDNEPFLLPF PQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVS
Subjt:  CAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVS

Query:  EPQNRALLLVYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
        EPQNR LLL YENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
Subjt:  EPQNRALLLVYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM

XP_022979388.1 armadillo repeat-containing protein LFR-like [Cucurbita maxima]; e value 3.9e-235; identity 93.38%
Query:  MQKREQSKLGGNVGGGSAPPAKRGRPFGSSSSNAAAAAAA-ETLAPSAHLGPSLHVHTSFAADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMR
        MQKREQ+KLGGNVG  SAPPAKRGRPFGS +SNAAAAAAA ETLAPSA LGPSLH+HTSF ADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMR
Subjt:  MQKREQSKLGGNVGGGSAPPAKRGRPFGSSSSNAAAAAAA-ETLAPSAHLGPSLHVHTSFAADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMR

Query:  KDSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGYVPGSSVPETTSHAPKPSPRHWWLDEDGLFNLDDEGRAE
        +DSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKK RVRTLGANS +TGFGNEFEALGSNG  PGSS  + T HAPKPSPRHWWLDEDGLFN DDEGRAE
Subjt:  KDSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGYVPGSSVPETTSHAPKPSPRHWWLDEDGLFNLDDEGRAE

Query:  RQQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSVKVWH
        RQQCAVSASNI+RNFSFMPENE IMAQHRHTLETVFQC+EDHITEDEELVTN LETIVNLAPLLDLRIFSSSKPSYIKITEKRA EAIMGMLGS+VKVWH
Subjt:  RQQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSVKVWH

Query:  CAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVS
        CAAAELLGRLIINPDNEPFLLPF PQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVS
Subjt:  CAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVS

Query:  EPQNRALLLVYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
        EPQNR LLL YENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
Subjt:  EPQNRALLLVYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM

XP_023527171.1 armadillo repeat-containing protein LFR-like [Cucurbita pepo subsp. pepo]; e value 1.9e-234; identity 93.16%
Query:  MQKREQSKLGGNVGGGSAPPAKRGRPFGSSSSN-AAAAAAAETLAPSAHLGPSLHVHTSFAADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMR
        MQKREQ+KLGGNVG  SAPPAKRGRPFGS +SN AAAAAAAETLAPSA LGPSLH+HTSF ADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMR
Subjt:  MQKREQSKLGGNVGGGSAPPAKRGRPFGSSSSN-AAAAAAAETLAPSAHLGPSLHVHTSFAADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMR

Query:  KDSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGYVPGSSVPETTSHAPKPSPRHWWLDEDGLFNLDDEGRAE
        +DSTPLAKIPGLLDALLQVIDDWRDIALP+D VKK RVRTLGANS +TGFGNEFEALGSNG  PGSS  E T HAPKPSPRHWWLDEDGLFN DDEGRAE
Subjt:  KDSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGYVPGSSVPETTSHAPKPSPRHWWLDEDGLFNLDDEGRAE

Query:  RQQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSVKVWH
        RQQCAVSASNI+RNFSFMPENE IMAQHRHTLETVFQC+EDHITEDEELVTN LETIVNLAPLLDLRIFSSSKPSYIKITEKRA EAIMGMLGS+VKVWH
Subjt:  RQQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSVKVWH

Query:  CAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVS
        CAAAELLGRLIINPDNEPFLLPF PQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVS
Subjt:  CAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVS

Query:  EPQNRALLLVYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
        EPQNR LLL YENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
Subjt:  EPQNRALLLVYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM

TrEMBL top hits (e value, % identity, alignment)
A0A1S3AWW3 armadillo repeat-containing protein LFR; e value 3.7e-231; identity 91.7%
Query:  MQKREQSKLGGNV-GGGSAPPAKRGRPFGSSSSNAAAAAAA-----ETLAPSAHLGPSLHVHTSFAADQNNKRIVLALQSGLKSELTWALNTLTLLSFKE
        MQKR+Q+KLGGNV GG SAPPAKRGRPFGS +SNAAA AAA     ETLAPS  LGPSLH+HTSF ADQNNKRIVLALQSGLKSELTWALNTLTLLSFKE
Subjt:  MQKREQSKLGGNV-GGGSAPPAKRGRPFGSSSSNAAAAAAA-----ETLAPSAHLGPSLHVHTSFAADQNNKRIVLALQSGLKSELTWALNTLTLLSFKE

Query:  KDDMRKDSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGYVPGSSVPETTSHAPKPSPRHWWLDEDGLFNLDD
        KDDMR+DSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKK RVRTLGANSSVTGFGNEFEALGS+G  PGSS  E+T HA KPS RHWWL+EDGLFNLDD
Subjt:  KDDMRKDSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGYVPGSSVPETTSHAPKPSPRHWWLDEDGLFNLDD

Query:  EGRAERQQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSS
        EGRAERQQCAVSASNI+RNFSFMPENE IMA HRHTLETVFQCIEDH+TEDEELVTN LETIVNLAPLLDLRIFSSSKPSYIKITEKRA EAIMGMLGS+
Subjt:  EGRAERQQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSS

Query:  VKVWHCAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMIL
        VKVWHCAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIK PHPVPEICRKAAMIL
Subjt:  VKVWHCAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMIL

Query:  ESLVSEPQNRALLLVYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
        ESLVSEPQNR LLL YENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
Subjt:  ESLVSEPQNRALLLVYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM

A0A5D3D0A8 Armadillo repeat-containing protein LFR; e value 3.7e-231; identity 91.7%
Query:  MQKREQSKLGGNV-GGGSAPPAKRGRPFGSSSSNAAAAAAA-----ETLAPSAHLGPSLHVHTSFAADQNNKRIVLALQSGLKSELTWALNTLTLLSFKE
        MQKR+Q+KLGGNV GG SAPPAKRGRPFGS +SNAAA AAA     ETLAPS  LGPSLH+HTSF ADQNNKRIVLALQSGLKSELTWALNTLTLLSFKE
Subjt:  MQKREQSKLGGNV-GGGSAPPAKRGRPFGSSSSNAAAAAAA-----ETLAPSAHLGPSLHVHTSFAADQNNKRIVLALQSGLKSELTWALNTLTLLSFKE

Query:  KDDMRKDSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGYVPGSSVPETTSHAPKPSPRHWWLDEDGLFNLDD
        KDDMR+DSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKK RVRTLGANSSVTGFGNEFEALGS+G  PGSS  E+T HA KPS RHWWL+EDGLFNLDD
Subjt:  KDDMRKDSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGYVPGSSVPETTSHAPKPSPRHWWLDEDGLFNLDD

Query:  EGRAERQQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSS
        EGRAERQQCAVSASNI+RNFSFMPENE IMA HRHTLETVFQCIEDH+TEDEELVTN LETIVNLAPLLDLRIFSSSKPSYIKITEKRA EAIMGMLGS+
Subjt:  EGRAERQQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSS

Query:  VKVWHCAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMIL
        VKVWHCAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIK PHPVPEICRKAAMIL
Subjt:  VKVWHCAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMIL

Query:  ESLVSEPQNRALLLVYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
        ESLVSEPQNR LLL YENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
Subjt:  ESLVSEPQNRALLLVYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM

A0A6J1CBV6 armadillo repeat-containing protein LFR isoform X1; e value 2.1e-250; identity 99.12%
Query:  MQKREQSKLGGNVGGGSAPPAKRGRPFGSSSSNAAAAAAAETLAPSAHLGPSLHVHTSFAADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRK
        MQKREQSKLGGNVGGGSAPPAKRGRPFGSSSSNAAAAAAAETLAPSAHLGPSLHVHTSF ADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRK
Subjt:  MQKREQSKLGGNVGGGSAPPAKRGRPFGSSSSNAAAAAAAETLAPSAHLGPSLHVHTSFAADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRK

Query:  DSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGYVPGSSVPETTSHAPKPSPRHWWLDEDGLFNLDDEGRAER
        DSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNG  PGSSVPET SHAPKPSPRHWWLDEDGLFNLDDEGRAER
Subjt:  DSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGYVPGSSVPETTSHAPKPSPRHWWLDEDGLFNLDDEGRAER

Query:  QQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSVKVWHC
        QQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSVKVWHC
Subjt:  QQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSVKVWHC

Query:  AAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE
        AAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE
Subjt:  AAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE

Query:  PQNRALLLVYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
        PQNRALLLVYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
Subjt:  PQNRALLLVYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM

A0A6J1EDA6 armadillo repeat-containing protein LFR-like; e value 2.7e-234; identity 92.72%
Query:  MQKREQSKLGGNVGGGSAPPAKRGRPFGSSSSN-AAAAAAAETLAPSAHLGPSLHVHTSFAADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMR
        MQKREQ+KLGGNVG  SAPPAKRGRPFGS +SN AAAAAAAETLAPSA LGPSLH+HTSF ADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMR
Subjt:  MQKREQSKLGGNVGGGSAPPAKRGRPFGSSSSN-AAAAAAAETLAPSAHLGPSLHVHTSFAADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMR

Query:  KDSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGYVPGSSVPETTSHAPKPSPRHWWLDEDGLFNLDDEGRAE
        +DSTPLAKIPGLLDALLQVIDDWRD+ALP+D VKK RVRTLGANS +TGFGNEFEALGSNG  PGSS  E T H+PKPSPRHWWLDEDGLFN DDEGRAE
Subjt:  KDSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGYVPGSSVPETTSHAPKPSPRHWWLDEDGLFNLDDEGRAE

Query:  RQQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSVKVWH
        RQQCAVSASNI+RNFSFMPENE IMAQHRHTLETVFQC+EDHITEDEELVTN LETIVNLAPLLDLRIFSSSKPSYIKITEKRA EAIMGMLGS+VKVWH
Subjt:  RQQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSVKVWH

Query:  CAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVS
        CAAAELLGRLIINPDNEPFLLPF PQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVS
Subjt:  CAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVS

Query:  EPQNRALLLVYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
        EPQNR LLL YENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
Subjt:  EPQNRALLLVYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM

A0A6J1IW20 armadillo repeat-containing protein LFR-like; e value 1.9e-235; identity 93.38%
Query:  MQKREQSKLGGNVGGGSAPPAKRGRPFGSSSSNAAAAAAA-ETLAPSAHLGPSLHVHTSFAADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMR
        MQKREQ+KLGGNVG  SAPPAKRGRPFGS +SNAAAAAAA ETLAPSA LGPSLH+HTSF ADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMR
Subjt:  MQKREQSKLGGNVGGGSAPPAKRGRPFGSSSSNAAAAAAA-ETLAPSAHLGPSLHVHTSFAADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMR

Query:  KDSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGYVPGSSVPETTSHAPKPSPRHWWLDEDGLFNLDDEGRAE
        +DSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKK RVRTLGANS +TGFGNEFEALGSNG  PGSS  + T HAPKPSPRHWWLDEDGLFN DDEGRAE
Subjt:  KDSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGYVPGSSVPETTSHAPKPSPRHWWLDEDGLFNLDDEGRAE

Query:  RQQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSVKVWH
        RQQCAVSASNI+RNFSFMPENE IMAQHRHTLETVFQC+EDHITEDEELVTN LETIVNLAPLLDLRIFSSSKPSYIKITEKRA EAIMGMLGS+VKVWH
Subjt:  RQQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSVKVWH

Query:  CAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVS
        CAAAELLGRLIINPDNEPFLLPF PQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVS
Subjt:  CAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVS

Query:  EPQNRALLLVYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
        EPQNR LLL YENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
Subjt:  EPQNRALLLVYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM

SwissProt top hits (e value, % identity, alignment)
E9Q7E2 AT-rich interactive domain-containing protein 2; e value 4.9e-07; identity 25.5%
Query:  SAHLGPSLHVHTSFAADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSS
        S +L  S  +   F +  +  ++VL+L SGL +E+ +A+N  TLLS + K  M+ +  P  KI  LL A   V DD                 TLG+ SS
Subjt:  SAHLGPSLHVHTSFAADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSS

Query:  VTGFGNEFEALGSNGY-------VPGSSVPETTS---HAPKPSPRHW-WLDEDGLFN-------LDDEGRAERQQCAVSASNIIRNFSFMPENEVIMAQH
        V  FG E+       +       V  + V +  S    A + +P  W W   + LF+        D EG     Q  +  + I+RN SF   N  ++A +
Subjt:  VTGFGNEFEALGSNGY-------VPGSSVPETTS---HAPKPSPRHW-WLDEDGLFN-------LDDEGRAERQQCAVSASNIIRNFSFMPENEVIMAQH

Query:  RHTLETVFQCIEDHITEDEELVTNTLETIVNLAP--LLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSVKVWHCAAAELLGRLIINPDNEPFLLPFVPQ
        R  L  +      H     +L    L+T+ N+A   LLD   F ++   +  +T+          L S  +       E+LG L    DN   +  +V Q
Subjt:  RHTLETVFQCIEDHITEDEELVTNTLETIVNLAP--LLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSVKVWHCAAAELLGRLIINPDNEPFLLPFVPQ

Query:  -IHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVI
          ++ ++  +++P +    + +  LY L E+      K+A    +ID L+ ++
Subjt:  -IHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVI

Q68CP9 AT-rich interactive domain-containing protein 2; e value 1.4e-06; identity 23.8%
Query:  SAHLGPSLHVHTSFAADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDSTPLAKIPGLLDALLQVIDD------------WRD------IAL
        S +L  S  +   F +  +  ++VL+L SGL +E+ +A+N  TLLS + K  M+ +  P  KI  LL A   V DD            W++      +  
Subjt:  SAHLGPSLHVHTSFAADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDSTPLAKIPGLLDALLQVIDD------------WRD------IAL

Query:  PRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGYVPGSSVPETTSHAPKPSPRHWWLDEDGLFNLDDEGRAERQQCAVSASNIIRNFSFMPENEVIMAQH
         +D+V    VR L ++ + +  G             G  + E+  H P+            L   D EG     Q  +  + I+RN SF   N  ++A +
Subjt:  PRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGYVPGSSVPETTSHAPKPSPRHWWLDEDGLFNLDDEGRAERQQCAVSASNIIRNFSFMPENEVIMAQH

Query:  RHTLETVFQCIEDHITEDEELVTNTLETIVNLAP--LLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSVKVWHCAAAELLGRLIINPDNEPFLLPFVPQ
        R  L  +      H     +L    L+T+ N+A   LLD   F ++   +  +T+          L S  +       E+LG L    DN   +  +V Q
Subjt:  RHTLETVFQCIEDHITEDEELVTNTLETIVNLAP--LLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSVKVWHCAAAELLGRLIINPDNEPFLLPFVPQ

Query:  -IHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVI
          ++ ++  +++P +    + +  LY L E+      K+A    +ID L+ ++
Subjt:  -IHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVI

Q6YTW6 Armadillo repeat-containing protein LFR; e value 1.8e-179; identity 71.27%
Query:  GGNVGGGSAPPAKRGRPFGSSSSNAAAAAAA-----ETLAPSAHLGPSLHVHTSFAADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDSTP
        G + GGG + PAKRGRPFGS++ + AAAAAA     +  AP+A +GPSL V T+  +DQNNKRIVLALQSGLKSE+ WALN LT+LSFKEKDD+R+D+TP
Subjt:  GGNVGGGSAPPAKRGRPFGSSSSNAAAAAAA-----ETLAPSAHLGPSLHVHTSFAADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDSTP

Query:  LAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNE-FEALGSNGYVPGSSVPETT-SHAPKPSPRHWWLDEDGLFNLDDEGRAERQQ
        LAK+PGLLDALLQVIDDWRDIA+P+D  K PRVRTLG N++++GFG+E  E + S+   P     +T  S   K     +  DE+GLFN+DDEGR E+QQ
Subjt:  LAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNE-FEALGSNGYVPGSSVPETT-SHAPKPSPRHWWLDEDGLFNLDDEGRAERQQ

Query:  CAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSVKVWHCAA
        CAV+ASNIIRNFSFMPENE +M QHRH LETVFQC+ED  TED+EL+TN LET+VNLAP+LDLRIFSSSKPS+IKITEKRA +AIMGML SS++VWHCAA
Subjt:  CAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSVKVWHCAA

Query:  AELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSEPQ
        AEL+GRLIINPDNEPFLLP +PQI+KRLVDL+S+PA+DAQAAA+ ALYN+ EVNMD R+KLASERWA+DRLLKV+KTPHPVPE+CRKA+MI+ESLVSEPQ
Subjt:  AELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSEPQ

Query:  NRALLLVYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWG
        NR  LLV+EN FAEIL S+G+YSDTFARILYELT+RP+NKV A Q +WG
Subjt:  NRALLLVYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWG

Q9LS90 Armadillo repeat-containing protein LFR; e value 5.0e-193; identity 74.03%
Query:  MQKREQSKLGGNVGGGSAPPAKRGRPFGSSSSN----AAAAAAAETLAPSAHLGPSLHVHTSFAADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKD
        MQKRE  K GGN GG S PPAKRGRPFGS+S+N    AAAAAAA+ ++PSA LGPSL VH SF  +QNN+RIVLALQSGLKSE+TWALNTLTLLSFKEK+
Subjt:  MQKREQSKLGGNVGGGSAPPAKRGRPFGSSSSN----AAAAAAAETLAPSAHLGPSLHVHTSFAADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKD

Query:  DMRKDSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGYVPGSSVPETTSHA--PKPSPRH----WWLDEDGLF
        D+R+D  PLAKI GLLDALL +IDDWRDIALP+DL +  RVRTLG N+SVTGFGNE++AL S    PGS +  + + A   K + +H    WW++EDGLF
Subjt:  DMRKDSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGYVPGSSVPETTSHA--PKPSPRH----WWLDEDGLF

Query:  NLDDEGRAERQQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGM
        NLDDEGR+E+Q CA++ASN+IRNFSFMP+NEV+MAQHRH LETVFQCI DH+TEDEELVTN+LETIVNLA L+DLRIFSS K SYI I EK+A +A++G+
Subjt:  NLDDEGRAERQQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGM

Query:  LGSSVKVWHCAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKA
        L SSVK W+CAAAELLGRLIINPDNEPF+ P +PQIHKRL+DL+SI A+DAQAAAVGALYNLVEVNMDCR+KLASERWA+DRLLKVIKTPHPVPE+CRKA
Subjt:  LGSSVKVWHCAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKA

Query:  AMILESLVSEPQNRALLLVYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
        AMILE+LVSEPQNR LLL YENAFAE+LF +G+YSD+FARILYELT+R N++VA+A+G+WGM
Subjt:  AMILESLVSEPQNRALLLVYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM

Arabidopsis top hits (e value, % identity, alignment)
AT3G22990.1 ARM repeat superfamily protein; e value 3.5e-194; identity 74.03%
Query:  MQKREQSKLGGNVGGGSAPPAKRGRPFGSSSSN----AAAAAAAETLAPSAHLGPSLHVHTSFAADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKD
        MQKRE  K GGN GG S PPAKRGRPFGS+S+N    AAAAAAA+ ++PSA LGPSL VH SF  +QNN+RIVLALQSGLKSE+TWALNTLTLLSFKEK+
Subjt:  MQKREQSKLGGNVGGGSAPPAKRGRPFGSSSSN----AAAAAAAETLAPSAHLGPSLHVHTSFAADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKD

Query:  DMRKDSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGYVPGSSVPETTSHA--PKPSPRH----WWLDEDGLF
        D+R+D  PLAKI GLLDALL +IDDWRDIALP+DL +  RVRTLG N+SVTGFGNE++AL S    PGS +  + + A   K + +H    WW++EDGLF
Subjt:  DMRKDSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGYVPGSSVPETTSHA--PKPSPRH----WWLDEDGLF

Query:  NLDDEGRAERQQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGM
        NLDDEGR+E+Q CA++ASN+IRNFSFMP+NEV+MAQHRH LETVFQCI DH+TEDEELVTN+LETIVNLA L+DLRIFSS K SYI I EK+A +A++G+
Subjt:  NLDDEGRAERQQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGM

Query:  LGSSVKVWHCAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKA
        L SSVK W+CAAAELLGRLIINPDNEPF+ P +PQIHKRL+DL+SI A+DAQAAAVGALYNLVEVNMDCR+KLASERWA+DRLLKVIKTPHPVPE+CRKA
Subjt:  LGSSVKVWHCAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKA

Query:  AMILESLVSEPQNRALLLVYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
        AMILE+LVSEPQNR LLL YENAFAE+LF +G+YSD+FARILYELT+R N++VA+A+G+WGM
Subjt:  AMILESLVSEPQNRALLLVYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
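
The % identity values reported for the hits above can be cross-checked against the middle line of each alignment block, where a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. The sketch below assumes that convention and assumes identity is computed as identical columns divided by alignment length; the function names `aligned_midline` and `percent_identity` are hypothetical.

```python
def aligned_midline(query_line: str, mid_line: str, prefix: str = "Query:  ") -> str:
    """Return the middle-line characters that sit under the query residues."""
    if not query_line.startswith(prefix):
        raise ValueError("query line does not start with the expected prefix")
    width = len(query_line) - len(prefix)
    # Pad on the right in case trailing spaces (mismatches) were trimmed.
    return mid_line[len(prefix):len(prefix) + width].ljust(width)


def percent_identity(midlines: list[str]) -> float:
    """Identical columns / alignment length, as a percentage (2 decimals).

    ASSUMPTION: letters in the middle line mark identities; '+' and
    spaces are counted as non-identical columns.
    """
    columns = "".join(midlines)
    if not columns:
        raise ValueError("empty alignment")
    identical = sum(ch.isalpha() for ch in columns)
    return round(100.0 * identical / len(columns), 2)


# Short excerpt in the same layout as the alignment blocks in this record.
block = aligned_midline("Query:  HVHTSFAADQ", "        HVHTSF ADQ")
print(percent_identity([block]))  # 9 identical columns out of 10 -> 90.0
```

Applied to the five middle lines of the XP_022138003.1 alignment, which appear to contain 4 non-identical columns out of 452, this calculation would give 99.12, in line with the reported value.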


Sequences
CDS sequence
ATGCAGAAGAGAGAGCAGAGCAAGTTGGGCGGAAACGTTGGTGGTGGCTCCGCGCCTCCGGCGAAGCGCGGCCGTCCATTCGGCAGTTCGAGCAGCAACGCCGCCGCTGC
AGCCGCTGCCGAGACCTTGGCTCCATCGGCACACCTTGGCCCTTCTCTCCATGTTCATACTTCCTTCGCAGCAGATCAAAACAATAAAAGGATAGTGTTGGCTCTACAGA
GTGGATTGAAGAGTGAATTGACGTGGGCATTGAACACTCTCACTCTGCTCTCCTTCAAAGAGAAGGATGATATGCGCAAAGACTCTACTCCCCTGGCTAAAATTCCCGGC
CTTCTTGATGCTCTTCTACAAGTTATCGATGACTGGCGTGATATAGCGCTTCCAAGGGATCTTGTAAAGAAGCCAAGGGTGAGAACATTAGGTGCAAATTCTTCTGTAAC
GGGATTTGGGAATGAATTTGAGGCATTGGGCTCGAACGGGTATGTACCTGGTTCTTCAGTTCCAGAGACAACGAGTCATGCTCCAAAACCATCTCCTCGACATTGGTGGC
TTGATGAAGATGGTCTATTTAATCTGGATGACGAAGGACGAGCAGAAAGACAGCAGTGTGCTGTTTCTGCCTCAAATATCATTCGAAACTTTTCTTTCATGCCAGAGAAT
GAAGTTATTATGGCCCAACATCGACATACTTTGGAAACAGTGTTTCAGTGTATAGAAGATCATATTACAGAGGATGAGGAACTTGTAACAAACACACTAGAGACAATTGT
GAATTTAGCTCCGTTGCTAGATCTTCGTATCTTTAGCTCATCAAAACCATCCTACATCAAAATAACTGAGAAACGAGCAGGTGAAGCCATCATGGGAATGCTGGGATCCT
CTGTCAAAGTTTGGCACTGTGCTGCCGCAGAATTACTTGGACGATTAATAATAAATCCTGATAATGAGCCTTTCCTTCTTCCCTTTGTTCCCCAGATACACAAGCGTTTA
GTCGACCTTATGAGCATCCCAGCATTAGATGCACAAGCAGCAGCTGTTGGTGCACTGTATAACCTCGTCGAAGTTAACATGGACTGCAGAATAAAGCTAGCAAGTGAGCG
ATGGGCAATCGATCGGCTTCTTAAAGTAATCAAGACACCTCATCCAGTTCCAGAAATATGCAGGAAGGCCGCAATGATATTGGAGAGTCTTGTATCTGAGCCACAGAACA
GGGCTCTACTTCTAGTATACGAAAATGCATTTGCAGAAATACTCTTTTCAGATGGCAGATATTCTGATACGTTTGCGAGGATATTGTATGAACTAACATCGAGACCTAAC
AATAAAGTTGCTGCTGCTCAAGGAGTATGGGGCATG
mRNA sequence
ATGCAGAAGAGAGAGCAGAGCAAGTTGGGCGGAAACGTTGGTGGTGGCTCCGCGCCTCCGGCGAAGCGCGGCCGTCCATTCGGCAGTTCGAGCAGCAACGCCGCCGCTGC
AGCCGCTGCCGAGACCTTGGCTCCATCGGCACACCTTGGCCCTTCTCTCCATGTTCATACTTCCTTCGCAGCAGATCAAAACAATAAAAGGATAGTGTTGGCTCTACAGA
GTGGATTGAAGAGTGAATTGACGTGGGCATTGAACACTCTCACTCTGCTCTCCTTCAAAGAGAAGGATGATATGCGCAAAGACTCTACTCCCCTGGCTAAAATTCCCGGC
CTTCTTGATGCTCTTCTACAAGTTATCGATGACTGGCGTGATATAGCGCTTCCAAGGGATCTTGTAAAGAAGCCAAGGGTGAGAACATTAGGTGCAAATTCTTCTGTAAC
GGGATTTGGGAATGAATTTGAGGCATTGGGCTCGAACGGGTATGTACCTGGTTCTTCAGTTCCAGAGACAACGAGTCATGCTCCAAAACCATCTCCTCGACATTGGTGGC
TTGATGAAGATGGTCTATTTAATCTGGATGACGAAGGACGAGCAGAAAGACAGCAGTGTGCTGTTTCTGCCTCAAATATCATTCGAAACTTTTCTTTCATGCCAGAGAAT
GAAGTTATTATGGCCCAACATCGACATACTTTGGAAACAGTGTTTCAGTGTATAGAAGATCATATTACAGAGGATGAGGAACTTGTAACAAACACACTAGAGACAATTGT
GAATTTAGCTCCGTTGCTAGATCTTCGTATCTTTAGCTCATCAAAACCATCCTACATCAAAATAACTGAGAAACGAGCAGGTGAAGCCATCATGGGAATGCTGGGATCCT
CTGTCAAAGTTTGGCACTGTGCTGCCGCAGAATTACTTGGACGATTAATAATAAATCCTGATAATGAGCCTTTCCTTCTTCCCTTTGTTCCCCAGATACACAAGCGTTTA
GTCGACCTTATGAGCATCCCAGCATTAGATGCACAAGCAGCAGCTGTTGGTGCACTGTATAACCTCGTCGAAGTTAACATGGACTGCAGAATAAAGCTAGCAAGTGAGCG
ATGGGCAATCGATCGGCTTCTTAAAGTAATCAAGACACCTCATCCAGTTCCAGAAATATGCAGGAAGGCCGCAATGATATTGGAGAGTCTTGTATCTGAGCCACAGAACA
GGGCTCTACTTCTAGTATACGAAAATGCATTTGCAGAAATACTCTTTTCAGATGGCAGATATTCTGATACGTTTGCGAGGATATTGTATGAACTAACATCGAGACCTAAC
AATAAAGTTGCTGCTGCTCAAGGAGTATGGGGCATG
Protein sequence
MQKREQSKLGGNVGGGSAPPAKRGRPFGSSSSNAAAAAAAETLAPSAHLGPSLHVHTSFAADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDSTPLAKIPG
LLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGYVPGSSVPETTSHAPKPSPRHWWLDEDGLFNLDDEGRAERQQCAVSASNIIRNFSFMPEN
EVIMAQHRHTLETVFQCIEDHITEDEELVTNTLETIVNLAPLLDLRIFSSSKPSYIKITEKRAGEAIMGMLGSSVKVWHCAAAELLGRLIINPDNEPFLLPFVPQIHKRL
VDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSEPQNRALLLVYENAFAEILFSDGRYSDTFARILYELTSRPN
NKVAAAQGVWGM
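
The relationship between the CDS and the protein sequence listed above can be checked by translating the CDS codon by codon. The sketch below is illustrative only: it assumes the standard nuclear genetic code, and the function name `translate` is hypothetical. The test strings are the first 24 and the last 36 nucleotides of the CDS given in this record.

```python
from itertools import product

# Standard genetic code (ASSUMPTION: nuclear, standard code), with codons
# enumerated in T, C, A, G order at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence; line breaks and spaces are ignored."""
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    # Codons with ambiguous bases (e.g. N) are reported as 'X'.
    return "".join(CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq), 3))


print(translate("ATGCAGAAGAGAGAGCAGAGCAAG"))              # MQKREQSK
print(translate("AATAAAGTTGCTGCTGCTCAAGGAGTATGGGGCATG"))  # NKVAAAQGVWGM
```

Both outputs match the start and the end of the protein sequence above. The listed CDS does not end in a stop codon, so no trailing `*` is expected when the full sequence is translated.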