CuGenDBv2

Sgr024702 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr024702
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: armadillo repeat-containing protein LFR-like
Genome location: tig00002486:2009222..2011881
RNA-Seq Expression: Sgr024702
Synteny: Sgr024702
Gene Ontology terms:
GO:0006338 - chromatin remodeling (biological process)
GO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0045893 - positive regulation of transcription, DNA-templated (biological process)
GO:0048366 - leaf development (biological process)
GO:0048653 - anther development (biological process)
GO:0005654 - nucleoplasm (cellular component)
GO:0016514 - SWI/SNF complex (cellular component)
GO:0035060 - brahma complex (cellular component)
GO:0005515 - protein binding (molecular function)
GO:0031491 - nucleosome binding (molecular function)
InterPro domains:
IPR000225 - Armadillo
IPR011989 - Armadillo-like helical
IPR016024 - Armadillo-type fold
IPR021906 - SWI/SNF-like complex subunit BAF250/Osa


Homology
GenBank top hits (hit, e-value, % identity, alignment)
KAG6582343.1 Armadillo repeat-containing protein LFR, partial [Cucurbita argyrosperma subsp. sororia] (e-value 5.4e-238, identity 93.81%)
Query:  MQKRDQNKLGGNVGGTSAPPAKRGRPFGSA-SSIAAAAAAAETLAPSALLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRK
        MQKR+QNKLGGNVG  SAPPAKRGRPFGS  S+ AAAAAAAETLAPSALLGPSLH+HTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMR+
Subjt:  MQKRDQNKLGGNVGGTSAPPAKRGRPFGSA-SSIAAAAAAAETLAPSALLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRK

Query:  DSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVSEATCHALKPSPRHWWLDEDGLFNLDDEGRAER
        DSTPLAKIPGLLDALLQVIDDWRD+ALP+D +KK RVRTLGANS +TGFGNEFEALGSNGLRPGSS SEAT H+ KPSPRHWWLDEDGLFN DDEGRAER
Subjt:  DSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVSEATCHALKPSPRHWWLDEDGLFNLDDEGRAER

Query:  QQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEEHITEDEELVTNALETIVNLAPLLDLRIFSSSKPSYIRITEKGAVEAIMGMLGSAVKVWHC
        QQCAVSASNI+RNFSFMPENE IMAQHRHTLETVFQC+E+HITEDEELVTNALETIVNLAPLLDLRIFSSSKPSYI+ITEK AVEAIMGMLGSAVKVWHC
Subjt:  QQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEEHITEDEELVTNALETIVNLAPLLDLRIFSSSKPSYIRITEKGAVEAIMGMLGSAVKVWHC

Query:  AAAELLGRLIINPDNEPFLLPFAPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE
        AAAELLGRLIINPDNEPFLLPFAPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE
Subjt:  AAAELLGRLIINPDNEPFLLPFAPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE

Query:  PQNRALLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
        PQNR LLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
Subjt:  PQNRALLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM

XP_022138003.1 armadillo repeat-containing protein LFR isoform X1 [Momordica charantia] (e-value 1.8e-241, identity 95.79%)
Query:  MQKRDQNKLGGNVGGTSAPPAKRGRPFGSASSIAAAAAAAETLAPSALLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKD
        MQKR+Q+KLGGNVGG SAPPAKRGRPFGS+SS AAAAAAAETLAPSA LGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKD
Subjt:  MQKRDQNKLGGNVGGTSAPPAKRGRPFGSASSIAAAAAAAETLAPSALLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKD

Query:  STPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVSEATCHALKPSPRHWWLDEDGLFNLDDEGRAERQ
        STPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSV E   HA KPSPRHWWLDEDGLFNLDDEGRAERQ
Subjt:  STPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVSEATCHALKPSPRHWWLDEDGLFNLDDEGRAERQ

Query:  QCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEEHITEDEELVTNALETIVNLAPLLDLRIFSSSKPSYIRITEKGAVEAIMGMLGSAVKVWHCA
        QCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIE+HITEDEELVTN LETIVNLAPLLDLRIFSSSKPSYI+ITEK A EAIMGMLGS+VKVWHCA
Subjt:  QCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEEHITEDEELVTNALETIVNLAPLLDLRIFSSSKPSYIRITEKGAVEAIMGMLGSAVKVWHCA

Query:  AAELLGRLIINPDNEPFLLPFAPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSEP
        AAELLGRLIINPDNEPFLLPF PQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSEP
Subjt:  AAELLGRLIINPDNEPFLLPFAPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSEP

Query:  QNRALLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
        QNRALLL YENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
Subjt:  QNRALLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM

XP_022924723.1 armadillo repeat-containing protein LFR-like [Cucurbita moschata] (e-value 2.4e-238, identity 94.03%)
Query:  MQKRDQNKLGGNVGGTSAPPAKRGRPFGSA-SSIAAAAAAAETLAPSALLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRK
        MQKR+QNKLGGNVG  SAPPAKRGRPFGS  S+ AAAAAAAETLAPSALLGPSLH+HTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMR+
Subjt:  MQKRDQNKLGGNVGGTSAPPAKRGRPFGSA-SSIAAAAAAAETLAPSALLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRK

Query:  DSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVSEATCHALKPSPRHWWLDEDGLFNLDDEGRAER
        DSTPLAKIPGLLDALLQVIDDWRD+ALP+D VKK RVRTLGANS +TGFGNEFEALGSNGLRPGSS SEAT H+ KPSPRHWWLDEDGLFN DDEGRAER
Subjt:  DSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVSEATCHALKPSPRHWWLDEDGLFNLDDEGRAER

Query:  QQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEEHITEDEELVTNALETIVNLAPLLDLRIFSSSKPSYIRITEKGAVEAIMGMLGSAVKVWHC
        QQCAVSASNI+RNFSFMPENE IMAQHRHTLETVFQC+E+HITEDEELVTNALETIVNLAPLLDLRIFSSSKPSYI+ITEK AVEAIMGMLGSAVKVWHC
Subjt:  QQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEEHITEDEELVTNALETIVNLAPLLDLRIFSSSKPSYIRITEKGAVEAIMGMLGSAVKVWHC

Query:  AAAELLGRLIINPDNEPFLLPFAPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE
        AAAELLGRLIINPDNEPFLLPFAPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE
Subjt:  AAAELLGRLIINPDNEPFLLPFAPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE

Query:  PQNRALLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
        PQNR LLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
Subjt:  PQNRALLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM

XP_022979388.1 armadillo repeat-containing protein LFR-like [Cucurbita maxima] (e-value 2.9e-239, identity 94.69%)
Query:  MQKRDQNKLGGNVGGTSAPPAKRGRPFGSASSIAAAAAAA-ETLAPSALLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRK
        MQKR+QNKLGGNVG  SAPPAKRGRPFGS +S AAAAAAA ETLAPSALLGPSLH+HTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMR+
Subjt:  MQKRDQNKLGGNVGGTSAPPAKRGRPFGSASSIAAAAAAA-ETLAPSALLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRK

Query:  DSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVSEATCHALKPSPRHWWLDEDGLFNLDDEGRAER
        DSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKK RVRTLGANS +TGFGNEFEALGSNGLRPGSS S+AT HA KPSPRHWWLDEDGLFN DDEGRAER
Subjt:  DSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVSEATCHALKPSPRHWWLDEDGLFNLDDEGRAER

Query:  QQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEEHITEDEELVTNALETIVNLAPLLDLRIFSSSKPSYIRITEKGAVEAIMGMLGSAVKVWHC
        QQCAVSASNI+RNFSFMPENE IMAQHRHTLETVFQC+E+HITEDEELVTNALETIVNLAPLLDLRIFSSSKPSYI+ITEK AVEAIMGMLGSAVKVWHC
Subjt:  QQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEEHITEDEELVTNALETIVNLAPLLDLRIFSSSKPSYIRITEKGAVEAIMGMLGSAVKVWHC

Query:  AAAELLGRLIINPDNEPFLLPFAPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE
        AAAELLGRLIINPDNEPFLLPFAPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE
Subjt:  AAAELLGRLIINPDNEPFLLPFAPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE

Query:  PQNRALLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
        PQNR LLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
Subjt:  PQNRALLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM

XP_023527171.1 armadillo repeat-containing protein LFR-like [Cucurbita pepo subsp. pepo] (e-value 8.4e-239, identity 94.47%)
Query:  MQKRDQNKLGGNVGGTSAPPAKRGRPFGSA-SSIAAAAAAAETLAPSALLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRK
        MQKR+QNKLGGNVG  SAPPAKRGRPFGS  S+ AAAAAAAETLAPSALLGPSLH+HTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMR+
Subjt:  MQKRDQNKLGGNVGGTSAPPAKRGRPFGSA-SSIAAAAAAAETLAPSALLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRK

Query:  DSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVSEATCHALKPSPRHWWLDEDGLFNLDDEGRAER
        DSTPLAKIPGLLDALLQVIDDWRDIALP+D VKK RVRTLGANS +TGFGNEFEALGSNGLRPGSS SEAT HA KPSPRHWWLDEDGLFN DDEGRAER
Subjt:  DSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVSEATCHALKPSPRHWWLDEDGLFNLDDEGRAER

Query:  QQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEEHITEDEELVTNALETIVNLAPLLDLRIFSSSKPSYIRITEKGAVEAIMGMLGSAVKVWHC
        QQCAVSASNI+RNFSFMPENE IMAQHRHTLETVFQC+E+HITEDEELVTNALETIVNLAPLLDLRIFSSSKPSYI+ITEK AVEAIMGMLGSAVKVWHC
Subjt:  QQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEEHITEDEELVTNALETIVNLAPLLDLRIFSSSKPSYIRITEKGAVEAIMGMLGSAVKVWHC

Query:  AAAELLGRLIINPDNEPFLLPFAPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE
        AAAELLGRLIINPDNEPFLLPFAPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE
Subjt:  AAAELLGRLIINPDNEPFLLPFAPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE

Query:  PQNRALLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
        PQNR LLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
Subjt:  PQNRALLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM

TrEMBL top hits (hit, e-value, % identity, alignment)
A0A1S3AWW3 armadillo repeat-containing protein LFR (e-value 9.3e-236, identity 93%)
Query:  MQKRDQNKLGGNV-GGTSAPPAKRGRPFGSASSIAAAAAAA-----ETLAPSALLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEK
        MQKRDQNKLGGNV GG SAPPAKRGRPFGS +S AAA AAA     ETLAPS LLGPSLH+HTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEK
Subjt:  MQKRDQNKLGGNV-GGTSAPPAKRGRPFGSASSIAAAAAAA-----ETLAPSALLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEK

Query:  DDMRKDSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVSEATCHALKPSPRHWWLDEDGLFNLDDE
        DDMR+DSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKK RVRTLGANSSVTGFGNEFEALGS+GLRPGSS SE+T HA KPS RHWWL+EDGLFNLDDE
Subjt:  DDMRKDSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVSEATCHALKPSPRHWWLDEDGLFNLDDE

Query:  GRAERQQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEEHITEDEELVTNALETIVNLAPLLDLRIFSSSKPSYIRITEKGAVEAIMGMLGSAV
        GRAERQQCAVSASNI+RNFSFMPENE IMA HRHTLETVFQCIE+H+TEDEELVTNALETIVNLAPLLDLRIFSSSKPSYI+ITEK AVEAIMGMLGSAV
Subjt:  GRAERQQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEEHITEDEELVTNALETIVNLAPLLDLRIFSSSKPSYIRITEKGAVEAIMGMLGSAV

Query:  KVWHCAAAELLGRLIINPDNEPFLLPFAPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILE
        KVWHCAAAELLGRLIINPDNEPFLLPF PQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIK PHPVPEICRKAAMILE
Subjt:  KVWHCAAAELLGRLIINPDNEPFLLPFAPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILE

Query:  SLVSEPQNRALLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
        SLVSEPQNR LLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
Subjt:  SLVSEPQNRALLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM

A0A5D3D0A8 Armadillo repeat-containing protein LFR (e-value 9.3e-236, identity 93%)
Query:  MQKRDQNKLGGNV-GGTSAPPAKRGRPFGSASSIAAAAAAA-----ETLAPSALLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEK
        MQKRDQNKLGGNV GG SAPPAKRGRPFGS +S AAA AAA     ETLAPS LLGPSLH+HTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEK
Subjt:  MQKRDQNKLGGNV-GGTSAPPAKRGRPFGSASSIAAAAAAA-----ETLAPSALLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEK

Query:  DDMRKDSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVSEATCHALKPSPRHWWLDEDGLFNLDDE
        DDMR+DSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKK RVRTLGANSSVTGFGNEFEALGS+GLRPGSS SE+T HA KPS RHWWL+EDGLFNLDDE
Subjt:  DDMRKDSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVSEATCHALKPSPRHWWLDEDGLFNLDDE

Query:  GRAERQQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEEHITEDEELVTNALETIVNLAPLLDLRIFSSSKPSYIRITEKGAVEAIMGMLGSAV
        GRAERQQCAVSASNI+RNFSFMPENE IMA HRHTLETVFQCIE+H+TEDEELVTNALETIVNLAPLLDLRIFSSSKPSYI+ITEK AVEAIMGMLGSAV
Subjt:  GRAERQQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEEHITEDEELVTNALETIVNLAPLLDLRIFSSSKPSYIRITEKGAVEAIMGMLGSAV

Query:  KVWHCAAAELLGRLIINPDNEPFLLPFAPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILE
        KVWHCAAAELLGRLIINPDNEPFLLPF PQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIK PHPVPEICRKAAMILE
Subjt:  KVWHCAAAELLGRLIINPDNEPFLLPFAPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILE

Query:  SLVSEPQNRALLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
        SLVSEPQNR LLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
Subjt:  SLVSEPQNRALLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM

A0A6J1CBV6 armadillo repeat-containing protein LFR isoform X1 (e-value 8.7e-242, identity 95.79%)
Query:  MQKRDQNKLGGNVGGTSAPPAKRGRPFGSASSIAAAAAAAETLAPSALLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKD
        MQKR+Q+KLGGNVGG SAPPAKRGRPFGS+SS AAAAAAAETLAPSA LGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKD
Subjt:  MQKRDQNKLGGNVGGTSAPPAKRGRPFGSASSIAAAAAAAETLAPSALLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKD

Query:  STPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVSEATCHALKPSPRHWWLDEDGLFNLDDEGRAERQ
        STPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSV E   HA KPSPRHWWLDEDGLFNLDDEGRAERQ
Subjt:  STPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVSEATCHALKPSPRHWWLDEDGLFNLDDEGRAERQ

Query:  QCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEEHITEDEELVTNALETIVNLAPLLDLRIFSSSKPSYIRITEKGAVEAIMGMLGSAVKVWHCA
        QCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIE+HITEDEELVTN LETIVNLAPLLDLRIFSSSKPSYI+ITEK A EAIMGMLGS+VKVWHCA
Subjt:  QCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEEHITEDEELVTNALETIVNLAPLLDLRIFSSSKPSYIRITEKGAVEAIMGMLGSAVKVWHCA

Query:  AAELLGRLIINPDNEPFLLPFAPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSEP
        AAELLGRLIINPDNEPFLLPF PQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSEP
Subjt:  AAELLGRLIINPDNEPFLLPFAPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSEP

Query:  QNRALLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
        QNRALLL YENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
Subjt:  QNRALLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM

A0A6J1EDA6 armadillo repeat-containing protein LFR-like (e-value 1.2e-238, identity 94.03%)
Query:  MQKRDQNKLGGNVGGTSAPPAKRGRPFGSA-SSIAAAAAAAETLAPSALLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRK
        MQKR+QNKLGGNVG  SAPPAKRGRPFGS  S+ AAAAAAAETLAPSALLGPSLH+HTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMR+
Subjt:  MQKRDQNKLGGNVGGTSAPPAKRGRPFGSA-SSIAAAAAAAETLAPSALLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRK

Query:  DSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVSEATCHALKPSPRHWWLDEDGLFNLDDEGRAER
        DSTPLAKIPGLLDALLQVIDDWRD+ALP+D VKK RVRTLGANS +TGFGNEFEALGSNGLRPGSS SEAT H+ KPSPRHWWLDEDGLFN DDEGRAER
Subjt:  DSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVSEATCHALKPSPRHWWLDEDGLFNLDDEGRAER

Query:  QQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEEHITEDEELVTNALETIVNLAPLLDLRIFSSSKPSYIRITEKGAVEAIMGMLGSAVKVWHC
        QQCAVSASNI+RNFSFMPENE IMAQHRHTLETVFQC+E+HITEDEELVTNALETIVNLAPLLDLRIFSSSKPSYI+ITEK AVEAIMGMLGSAVKVWHC
Subjt:  QQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEEHITEDEELVTNALETIVNLAPLLDLRIFSSSKPSYIRITEKGAVEAIMGMLGSAVKVWHC

Query:  AAAELLGRLIINPDNEPFLLPFAPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE
        AAAELLGRLIINPDNEPFLLPFAPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE
Subjt:  AAAELLGRLIINPDNEPFLLPFAPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE

Query:  PQNRALLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
        PQNR LLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
Subjt:  PQNRALLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM

A0A6J1IW20 armadillo repeat-containing protein LFR-like (e-value 1.4e-239, identity 94.69%)
Query:  MQKRDQNKLGGNVGGTSAPPAKRGRPFGSASSIAAAAAAA-ETLAPSALLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRK
        MQKR+QNKLGGNVG  SAPPAKRGRPFGS +S AAAAAAA ETLAPSALLGPSLH+HTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMR+
Subjt:  MQKRDQNKLGGNVGGTSAPPAKRGRPFGSASSIAAAAAAA-ETLAPSALLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRK

Query:  DSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVSEATCHALKPSPRHWWLDEDGLFNLDDEGRAER
        DSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKK RVRTLGANS +TGFGNEFEALGSNGLRPGSS S+AT HA KPSPRHWWLDEDGLFN DDEGRAER
Subjt:  DSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVSEATCHALKPSPRHWWLDEDGLFNLDDEGRAER

Query:  QQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEEHITEDEELVTNALETIVNLAPLLDLRIFSSSKPSYIRITEKGAVEAIMGMLGSAVKVWHC
        QQCAVSASNI+RNFSFMPENE IMAQHRHTLETVFQC+E+HITEDEELVTNALETIVNLAPLLDLRIFSSSKPSYI+ITEK AVEAIMGMLGSAVKVWHC
Subjt:  QQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEEHITEDEELVTNALETIVNLAPLLDLRIFSSSKPSYIRITEKGAVEAIMGMLGSAVKVWHC

Query:  AAAELLGRLIINPDNEPFLLPFAPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE
        AAAELLGRLIINPDNEPFLLPFAPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE
Subjt:  AAAELLGRLIINPDNEPFLLPFAPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE

Query:  PQNRALLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
        PQNR LLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
Subjt:  PQNRALLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM

SwissProt top hits (hit, e-value, % identity, alignment)
E9Q7E2 AT-rich interactive domain-containing protein 2 (e-value 1.8e-05, identity 25.37%)
Query:  RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEF-------------
        ++VL+L SGL +E+ +A+N  TLLS + K  M+ +  P  KI  LL A   V DD                 TLG+ SSV  FG E+             
Subjt:  RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEF-------------

Query:  EALGSNGLRPGSSVSEATCHALKPSPRHW-WLDEDGLFN-------LDDEGRAERQQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEEHITED
        + +  N +R    +S+    A + +P  W W   + LF+        D EG     Q  +  + I+RN SF   N  ++A +R  L  +      H    
Subjt:  EALGSNGLRPGSSVSEATCHALKPSPRHW-WLDEDGLFN-------LDDEGRAERQQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEEHITED

Query:  EELVTNALETIVNLAP--LLDLRIFSSSKPSYIRITEKGAVEAIMGMLGSAVKVWHCAAAELLGRLIINPDNEPFLLPFAPQ-IHKRLVDLMSIPALDAQ
         +L    L+T+ N+A   LLD   F ++   +  +T+          L S  +       E+LG L    DN   +  +  Q  ++ ++  +++P +   
Subjt:  EELVTNALETIVNLAP--LLDLRIFSSSKPSYIRITEKGAVEAIMGMLGSAVKVWHCAAAELLGRLIINPDNEPFLLPFAPQ-IHKRLVDLMSIPALDAQ

Query:  AAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVI
         + +  LY L E+      K+A    +ID L+ ++
Subjt:  AAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVI

Q68CP9 AT-rich interactive domain-containing protein 2 (e-value 3.9e-05, identity 23.49%)
Query:  RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDSTPLAKIPGLLDALLQVIDD------------WRD------IALPRDLVKKPRVRTLGANSSVTG
        ++VL+L SGL +E+ +A+N  TLLS + K  M+ +  P  KI  LL A   V DD            W++      +   +D+V    VR L ++ + + 
Subjt:  RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDSTPLAKIPGLLDALLQVIDD------------WRD------IALPRDLVKKPRVRTLGANSSVTG

Query:  FGNEFEALGSNGLRPGSSVSEATCHALKPSPRHWWLDEDGLFNLDDEGRAERQQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEEHITEDEEL
         G   E +  +   P               PR   ++       D EG     Q  +  + I+RN SF   N  ++A +R  L  +      H     +L
Subjt:  FGNEFEALGSNGLRPGSSVSEATCHALKPSPRHWWLDEDGLFNLDDEGRAERQQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEEHITEDEEL

Query:  VTNALETIVNLAP--LLDLRIFSSSKPSYIRITEKGAVEAIMGMLGSAVKVWHCAAAELLGRLIINPDNEPFLLPFAPQ-IHKRLVDLMSIPALDAQAAA
            L+T+ N+A   LLD   F ++   +  +T+          L S  +       E+LG L    DN   +  +  Q  ++ ++  +++P +    + 
Subjt:  VTNALETIVNLAP--LLDLRIFSSSKPSYIRITEKGAVEAIMGMLGSAVKVWHCAAAELLGRLIINPDNEPFLLPFAPQ-IHKRLVDLMSIPALDAQAAA

Query:  VGALYNLVEVNMDCRIKLASERWAIDRLLKVI
        +  LY L E+      K+A    +ID L+ ++
Subjt:  VGALYNLVEVNMDCRIKLASERWAIDRLLKVI

Q6YTW6 Armadillo repeat-containing protein LFR (e-value 2.3e-178, identity 70.31%)
Query:  GGNVGGTSAPPAKRGRPFGS-----ASSIAAAAAAAETLAPSALLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDSTPL
        G + GG  + PAKRGRPFGS     A++ AAAAA  +  AP+AL+GPSL V T+ +DQNNKRIVLALQSGLKSE+ WALN LT+LSFKEKDD+R+D+TPL
Subjt:  GGNVGGTSAPPAKRGRPFGS-----ASSIAAAAAAAETLAPSALLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDSTPL

Query:  AKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNE-FEALGSNGLRPGSSVSE-ATCHALKPSPRHWWLDEDGLFNLDDEGRAERQQC
        AK+PGLLDALLQVIDDWRDIA+P+D  K PRVRTLG N++++GFG+E  E + S+   P    ++ A     K     +  DE+GLFN+DDEGR E+QQC
Subjt:  AKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNE-FEALGSNGLRPGSSVSE-ATCHALKPSPRHWWLDEDGLFNLDDEGRAERQQC

Query:  AVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEEHITEDEELVTNALETIVNLAPLLDLRIFSSSKPSYIRITEKGAVEAIMGMLGSAVKVWHCAAA
        AV+ASNIIRNFSFMPENE +M QHRH LETVFQC+E+  TED+EL+TN LET+VNLAP+LDLRIFSSSKPS+I+ITEK AV+AIMGML S+++VWHCAAA
Subjt:  AVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEEHITEDEELVTNALETIVNLAPLLDLRIFSSSKPSYIRITEKGAVEAIMGMLGSAVKVWHCAAA

Query:  ELLGRLIINPDNEPFLLPFAPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSEPQN
        EL+GRLIINPDNEPFLLP  PQI+KRLVDL+S+PA+DAQAAA+ ALYN+ EVNMD R+KLASERWA+DRLLKV+KTPHPVPE+CRKA+MI+ESLVSEPQN
Subjt:  ELLGRLIINPDNEPFLLPFAPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSEPQN

Query:  RALLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWG
        R  LL +EN FAEIL S+G+YSDTFARILYELT+RP+NKV A Q +WG
Subjt:  RALLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWG

Q9LS90 Armadillo repeat-containing protein LFR (e-value 1.0e-194, identity 74.19%)
Query:  MQKRDQNKLGGNVGGTSAPPAKRGRPFGSAS----SIAAAAAAAETLAPSALLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDD
        MQKR+  K GGN GG+S PPAKRGRPFGS S    + AAAAAAA+ ++PSALLGPSL VH SF +QNN+RIVLALQSGLKSE+TWALNTLTLLSFKEK+D
Subjt:  MQKRDQNKLGGNVGGTSAPPAKRGRPFGSAS----SIAAAAAAAETLAPSALLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDD

Query:  MRKDSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVSEATCHAL--KPSPRH----WWLDEDGLFN
        +R+D  PLAKI GLLDALL +IDDWRDIALP+DL +  RVRTLG N+SVTGFGNE++AL S    PGS +  +   AL  K + +H    WW++EDGLFN
Subjt:  MRKDSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVSEATCHAL--KPSPRH----WWLDEDGLFN

Query:  LDDEGRAERQQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEEHITEDEELVTNALETIVNLAPLLDLRIFSSSKPSYIRITEKGAVEAIMGML
        LDDEGR+E+Q CA++ASN+IRNFSFMP+NEV+MAQHRH LETVFQCI +H+TEDEELVTN+LETIVNLA L+DLRIFSS K SYI I EK AV+A++G+L
Subjt:  LDDEGRAERQQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEEHITEDEELVTNALETIVNLAPLLDLRIFSSSKPSYIRITEKGAVEAIMGML

Query:  GSAVKVWHCAAAELLGRLIINPDNEPFLLPFAPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAA
         S+VK W+CAAAELLGRLIINPDNEPF+ P  PQIHKRL+DL+SI A+DAQAAAVGALYNLVEVNMDCR+KLASERWA+DRLLKVIKTPHPVPE+CRKAA
Subjt:  GSAVKVWHCAAAELLGRLIINPDNEPFLLPFAPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAA

Query:  MILESLVSEPQNRALLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
        MILE+LVSEPQNR LLLAYENAFAE+LF +G+YSD+FARILYELT+R N++VA+A+G+WGM
Subjt:  MILESLVSEPQNRALLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM

Arabidopsis top hits (hit, e-value, % identity, alignment)
AT3G22990.1 ARM repeat superfamily protein (e-value 7.2e-196, identity 74.19%)
Query:  MQKRDQNKLGGNVGGTSAPPAKRGRPFGSAS----SIAAAAAAAETLAPSALLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDD
        MQKR+  K GGN GG+S PPAKRGRPFGS S    + AAAAAAA+ ++PSALLGPSL VH SF +QNN+RIVLALQSGLKSE+TWALNTLTLLSFKEK+D
Subjt:  MQKRDQNKLGGNVGGTSAPPAKRGRPFGSAS----SIAAAAAAAETLAPSALLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDD

Query:  MRKDSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVSEATCHAL--KPSPRH----WWLDEDGLFN
        +R+D  PLAKI GLLDALL +IDDWRDIALP+DL +  RVRTLG N+SVTGFGNE++AL S    PGS +  +   AL  K + +H    WW++EDGLFN
Subjt:  MRKDSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVSEATCHAL--KPSPRH----WWLDEDGLFN

Query:  LDDEGRAERQQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEEHITEDEELVTNALETIVNLAPLLDLRIFSSSKPSYIRITEKGAVEAIMGML
        LDDEGR+E+Q CA++ASN+IRNFSFMP+NEV+MAQHRH LETVFQCI +H+TEDEELVTN+LETIVNLA L+DLRIFSS K SYI I EK AV+A++G+L
Subjt:  LDDEGRAERQQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEEHITEDEELVTNALETIVNLAPLLDLRIFSSSKPSYIRITEKGAVEAIMGML

Query:  GSAVKVWHCAAAELLGRLIINPDNEPFLLPFAPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAA
         S+VK W+CAAAELLGRLIINPDNEPF+ P  PQIHKRL+DL+SI A+DAQAAAVGALYNLVEVNMDCR+KLASERWA+DRLLKVIKTPHPVPE+CRKAA
Subjt:  GSAVKVWHCAAAELLGRLIINPDNEPFLLPFAPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAA

Query:  MILESLVSEPQNRALLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
        MILE+LVSEPQNR LLLAYENAFAE+LF +G+YSD+FARILYELT+R N++VA+A+G+WGM
Subjt:  MILESLVSEPQNRALLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
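The alignment blocks above follow the usual BLAST text layout: a `Query:` row, an unlabeled midline row, and a `Subjt:` row. In the midline, a letter marks an identical residue, `+` marks a similar residue, and a space marks a mismatch or gap. The sketch below tallies those columns for a single block using only the query and midline rows; `MidlineStats` and `midline_stats` are hypothetical names, and the percentages reported in the hit headers are computed over the full alignment, so a single block will not necessarily reproduce them.

```python
from typing import NamedTuple


class MidlineStats(NamedTuple):
    aligned: int    # alignment columns in the block, gap columns included
    identical: int  # columns where the midline shows a residue letter
    positive: int   # identical columns plus '+' (similar) columns


def midline_stats(query: str, midline: str) -> MidlineStats:
    """Count identities and positives from one BLAST-style alignment block."""
    # Trailing spaces in the midline are often lost in copied text,
    # so pad (or trim) it to the length of the query row.
    padded = midline.ljust(len(query))[: len(query)]
    identical = sum(1 for symbol in padded if symbol.isalpha())
    similar = padded.count("+")
    return MidlineStats(len(query), identical, identical + similar)


# Final block of the first GenBank hit, copied from the listing above.
query = "PQNRALLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM"
midline = "PQNR LLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM"
stats = midline_stats(query, midline)
print(f"{stats.identical}/{stats.aligned} identical, "
      f"{stats.positive}/{stats.aligned} positive")
```

Summing the three counts over every block of a hit and dividing identities by aligned columns gives a figure comparable to the % identity column.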


Sequences
CDS sequence
ATGAAAACAACTGCACTTGATGACAATATCGTAAATTACTGTGAAAGAAATGGCATATGTGGTAAATTCTCCATTTGGGCACGCTACCGATTGTTACCGGATCGGATCGC
CATAGCCGAATCAGCCGAAGTTCAGAGAGAACAAGACGTAGGCATGCAGAAGAGAGATCAGAACAAGTTGGGCGGAAATGTTGGCGGTACCTCTGCGCCTCCGGCTAAGC
GAGGCCGTCCGTTCGGCAGCGCAAGCAGCATCGCCGCTGCTGCCGCTGCCGCCGAGACGTTGGCTCCATCGGCTCTCCTAGGGCCTTCTCTTCATGTTCATACTTCCTTC
GCGGATCAAAACAATAAAAGGATAGTGTTGGCTCTACAGAGTGGATTGAAGAGTGAATTGACGTGGGCACTGAACACTCTCACTCTGCTCTCCTTCAAAGAGAAGGATGA
TATGCGCAAAGACTCCACTCCTCTGGCTAAAATTCCCGGCTTGCTCGACGCTCTTCTTCAAGTTATAGATGACTGGCGTGATATAGCACTTCCGAGGGATCTTGTAAAGA
AGCCAAGGGTCAGAACTCTAGGTGCAAATTCTTCTGTAACGGGATTTGGGAATGAATTTGAGGCATTGGGCTCAAATGGCCTGAGACCTGGTTCTTCAGTTTCAGAGGCA
ACATGTCATGCTCTTAAACCATCTCCTCGACATTGGTGGCTTGATGAAGATGGTCTATTTAATCTGGATGACGAAGGACGAGCAGAAAGACAGCAGTGTGCTGTTTCTGC
TTCAAATATCATCCGAAACTTCTCTTTCATGCCAGAGAATGAAGTTATTATGGCTCAACATCGACATACTCTTGAAACAGTGTTTCAGTGTATAGAAGAACATATTACAG
AGGATGAGGAACTTGTCACAAATGCACTAGAGACAATTGTGAATTTAGCTCCGCTACTCGATCTTCGTATCTTTAGCTCATCAAAACCATCCTACATCAGAATAACAGAG
AAAGGAGCAGTTGAAGCCATCATGGGTATGCTTGGATCTGCTGTCAAAGTTTGGCACTGTGCTGCTGCAGAATTACTTGGACGCTTGATAATAAATCCTGATAATGAGCC
TTTCCTTCTTCCCTTTGCTCCCCAGATACACAAGCGTTTAGTCGACCTTATGAGCATCCCAGCATTAGATGCACAAGCAGCAGCTGTTGGCGCACTGTATAACCTTGTCG
AAGTTAATATGGACTGCAGAATAAAGCTGGCAAGCGAGCGATGGGCGATCGATAGACTCCTTAAGGTAATCAAAACGCCTCACCCAGTTCCAGAAATATGCAGGAAAGCA
GCTATGATATTGGAGAGTCTTGTATCTGAGCCACAGAACAGGGCTTTACTTCTAGCATATGAAAATGCTTTTGCAGAAATACTCTTCTCGGATGGCAGATATTCCGATAC
ATTCGCAAGGATATTGTATGAACTAACATCCAGACCAAACAATAAAGTTGCTGCTGCTCAAGGGGTATGGGGCATGTGA
mRNA sequence
ATGAAAACAACTGCACTTGATGACAATATCGTAAATTACTGTGAAAGAAATGGCATATGTGGTAAATTCTCCATTTGGGCACGCTACCGATTGTTACCGGATCGGATCGC
CATAGCCGAATCAGCCGAAGTTCAGAGAGAACAAGACGTAGGCATGCAGAAGAGAGATCAGAACAAGTTGGGCGGAAATGTTGGCGGTACCTCTGCGCCTCCGGCTAAGC
GAGGCCGTCCGTTCGGCAGCGCAAGCAGCATCGCCGCTGCTGCCGCTGCCGCCGAGACGTTGGCTCCATCGGCTCTCCTAGGGCCTTCTCTTCATGTTCATACTTCCTTC
GCGGATCAAAACAATAAAAGGATAGTGTTGGCTCTACAGAGTGGATTGAAGAGTGAATTGACGTGGGCACTGAACACTCTCACTCTGCTCTCCTTCAAAGAGAAGGATGA
TATGCGCAAAGACTCCACTCCTCTGGCTAAAATTCCCGGCTTGCTCGACGCTCTTCTTCAAGTTATAGATGACTGGCGTGATATAGCACTTCCGAGGGATCTTGTAAAGA
AGCCAAGGGTCAGAACTCTAGGTGCAAATTCTTCTGTAACGGGATTTGGGAATGAATTTGAGGCATTGGGCTCAAATGGCCTGAGACCTGGTTCTTCAGTTTCAGAGGCA
ACATGTCATGCTCTTAAACCATCTCCTCGACATTGGTGGCTTGATGAAGATGGTCTATTTAATCTGGATGACGAAGGACGAGCAGAAAGACAGCAGTGTGCTGTTTCTGC
TTCAAATATCATCCGAAACTTCTCTTTCATGCCAGAGAATGAAGTTATTATGGCTCAACATCGACATACTCTTGAAACAGTGTTTCAGTGTATAGAAGAACATATTACAG
AGGATGAGGAACTTGTCACAAATGCACTAGAGACAATTGTGAATTTAGCTCCGCTACTCGATCTTCGTATCTTTAGCTCATCAAAACCATCCTACATCAGAATAACAGAG
AAAGGAGCAGTTGAAGCCATCATGGGTATGCTTGGATCTGCTGTCAAAGTTTGGCACTGTGCTGCTGCAGAATTACTTGGACGCTTGATAATAAATCCTGATAATGAGCC
TTTCCTTCTTCCCTTTGCTCCCCAGATACACAAGCGTTTAGTCGACCTTATGAGCATCCCAGCATTAGATGCACAAGCAGCAGCTGTTGGCGCACTGTATAACCTTGTCG
AAGTTAATATGGACTGCAGAATAAAGCTGGCAAGCGAGCGATGGGCGATCGATAGACTCCTTAAGGTAATCAAAACGCCTCACCCAGTTCCAGAAATATGCAGGAAAGCA
GCTATGATATTGGAGAGTCTTGTATCTGAGCCACAGAACAGGGCTTTACTTCTAGCATATGAAAATGCTTTTGCAGAAATACTCTTCTCGGATGGCAGATATTCCGATAC
ATTCGCAAGGATATTGTATGAACTAACATCCAGACCAAACAATAAAGTTGCTGCTGCTCAAGGGGTATGGGGCATGTGA
Protein sequence
MKTTALDDNIVNYCERNGICGKFSIWARYRLLPDRIAIAESAEVQREQDVGMQKRDQNKLGGNVGGTSAPPAKRGRPFGSASSIAAAAAAAETLAPSALLGPSLHVHTSF
ADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVSEA
TCHALKPSPRHWWLDEDGLFNLDDEGRAERQQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEEHITEDEELVTNALETIVNLAPLLDLRIFSSSKPSYIRITE
KGAVEAIMGMLGSAVKVWHCAAAELLGRLIINPDNEPFLLPFAPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKA
AMILESLVSEPQNRALLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
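The CDS and protein sequences above can be cross-checked by translating the CDS. The following is a minimal sketch that assumes the standard genetic code (NCBI translation table 1) and a reading frame starting at the first base; the `translate` helper is hypothetical and is not part of the database.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons ordered by T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    # Remove line breaks and spaces left over from the wrapped listing.
    sequence = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(sequence) - 2, 3):
        amino_acid = CODON_TABLE.get(sequence[i:i + 3], "X")  # X = unknown codon
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First eight codons of the CDS listed above.
print(translate("ATGAAAACAACTGCACTTGATGAC"))
```

Passing the full CDS text and comparing the result with the protein sequence, after joining its wrapped lines, is a quick consistency check on the record.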