; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0009503 (gene) of Snake gourd v1 genome

Gene IDTan0009503
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionarmadillo repeat-containing protein LFR-like
Genome locationLG10:6370359..6373102
RNA-Seq ExpressionTan0009503
SyntenyTan0009503
Gene Ontology termsGO:0006338 - chromatin remodeling (biological process)
GO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0045893 - positive regulation of transcription, DNA-templated (biological process)
GO:0048366 - leaf development (biological process)
GO:0048653 - anther development (biological process)
GO:0005654 - nucleoplasm (cellular component)
GO:0016514 - SWI/SNF complex (cellular component)
GO:0035060 - brahma complex (cellular component)
GO:0031491 - nucleosome binding (molecular function)
InterPro domainsIPR011989 - Armadillo-like helical
IPR016024 - Armadillo-type fold
IPR021906 - SWI/SNF-like complex subunit BAF250/Osa


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6582343.1 Armadillo repeat-containing protein LFR, partial [Cucurbita argyrosperma subsp. sororia]9.8e-24796.24Show/hide
Query:  MQKREQNKLGGNVGGASAPPAKRGRPFGSVNSNTAAAAAVAETLAPSALLGPSLHVHTSFVDQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRR
        MQKREQNKLGGNVG ASAPPAKRGRPFGSVNSN AAAAA AETLAPSALLGPSLH+HTSF DQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRR
Subjt:  MQKREQNKLGGNVGGASAPPAKRGRPFGSVNSNTAAAAAVAETLAPSALLGPSLHVHTSFVDQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRR

Query:  DSTPLAKIPGLLDALLQVIDDWRDVALPKDLVKKARVRTLGVNSSVTGFGNEFEALGSNGLRPGSSASEVTGHAPKPSPRHWWLDEDGLFNLDDEGRAER
        DSTPLAKIPGLLDALLQVIDDWRDVALPKD +KK RVRTLG NS +TGFGNEFEALGSNGLRPGSSASE TGH+PKPSPRHWWLDEDGLFN DDEGRAER
Subjt:  DSTPLAKIPGLLDALLQVIDDWRDVALPKDLVKKARVRTLGVNSSVTGFGNEFEALGSNGLRPGSSASEVTGHAPKPSPRHWWLDEDGLFNLDDEGRAER

Query:  QQCAVSASNIIRNFSFMPENESIMAQHRHTLETVFQCIEDHITEDEELVTNALETIVNLAPLLDLRIFSSSKPSYIKITEKRAVEAIMGMLGSAVKVWHC
        QQCAVSASNI+RNFSFMPENESIMAQHRHTLETVFQC+EDHITEDEELVTNALETIVNLAPLLDLRIFSSSKPSYIKITEKRAVEAIMGMLGSAVKVWHC
Subjt:  QQCAVSASNIIRNFSFMPENESIMAQHRHTLETVFQCIEDHITEDEELVTNALETIVNLAPLLDLRIFSSSKPSYIKITEKRAVEAIMGMLGSAVKVWHC

Query:  AAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE
        AAAELLGRLIINPDNEPFLLPF PQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE
Subjt:  AAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE

Query:  PQNRGLLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
        PQNRGLLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
Subjt:  PQNRGLLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM

XP_022924723.1 armadillo repeat-containing protein LFR-like [Cucurbita moschata]4.4e-24796.46Show/hide
Query:  MQKREQNKLGGNVGGASAPPAKRGRPFGSVNSNTAAAAAVAETLAPSALLGPSLHVHTSFVDQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRR
        MQKREQNKLGGNVG ASAPPAKRGRPFGSVNSN AAAAA AETLAPSALLGPSLH+HTSF DQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRR
Subjt:  MQKREQNKLGGNVGGASAPPAKRGRPFGSVNSNTAAAAAVAETLAPSALLGPSLHVHTSFVDQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRR

Query:  DSTPLAKIPGLLDALLQVIDDWRDVALPKDLVKKARVRTLGVNSSVTGFGNEFEALGSNGLRPGSSASEVTGHAPKPSPRHWWLDEDGLFNLDDEGRAER
        DSTPLAKIPGLLDALLQVIDDWRDVALPKD VKK RVRTLG NS +TGFGNEFEALGSNGLRPGSSASE TGH+PKPSPRHWWLDEDGLFN DDEGRAER
Subjt:  DSTPLAKIPGLLDALLQVIDDWRDVALPKDLVKKARVRTLGVNSSVTGFGNEFEALGSNGLRPGSSASEVTGHAPKPSPRHWWLDEDGLFNLDDEGRAER

Query:  QQCAVSASNIIRNFSFMPENESIMAQHRHTLETVFQCIEDHITEDEELVTNALETIVNLAPLLDLRIFSSSKPSYIKITEKRAVEAIMGMLGSAVKVWHC
        QQCAVSASNI+RNFSFMPENESIMAQHRHTLETVFQC+EDHITEDEELVTNALETIVNLAPLLDLRIFSSSKPSYIKITEKRAVEAIMGMLGSAVKVWHC
Subjt:  QQCAVSASNIIRNFSFMPENESIMAQHRHTLETVFQCIEDHITEDEELVTNALETIVNLAPLLDLRIFSSSKPSYIKITEKRAVEAIMGMLGSAVKVWHC

Query:  AAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE
        AAAELLGRLIINPDNEPFLLPF PQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE
Subjt:  AAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE

Query:  PQNRGLLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
        PQNRGLLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
Subjt:  PQNRGLLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM

XP_022979388.1 armadillo repeat-containing protein LFR-like [Cucurbita maxima]5.8e-24796.02Show/hide
Query:  MQKREQNKLGGNVGGASAPPAKRGRPFGSVNSNTAAAAAVAETLAPSALLGPSLHVHTSFVDQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRR
        MQKREQNKLGGNVG ASAPPAKRGRPFGSVNSN AAAAA +ETLAPSALLGPSLH+HTSF DQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRR
Subjt:  MQKREQNKLGGNVGGASAPPAKRGRPFGSVNSNTAAAAAVAETLAPSALLGPSLHVHTSFVDQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRR

Query:  DSTPLAKIPGLLDALLQVIDDWRDVALPKDLVKKARVRTLGVNSSVTGFGNEFEALGSNGLRPGSSASEVTGHAPKPSPRHWWLDEDGLFNLDDEGRAER
        DSTPLAKIPGLLDALLQVIDDWRD+ALP+DLVKK RVRTLG NS +TGFGNEFEALGSNGLRPGSSAS+ TGHAPKPSPRHWWLDEDGLFN DDEGRAER
Subjt:  DSTPLAKIPGLLDALLQVIDDWRDVALPKDLVKKARVRTLGVNSSVTGFGNEFEALGSNGLRPGSSASEVTGHAPKPSPRHWWLDEDGLFNLDDEGRAER

Query:  QQCAVSASNIIRNFSFMPENESIMAQHRHTLETVFQCIEDHITEDEELVTNALETIVNLAPLLDLRIFSSSKPSYIKITEKRAVEAIMGMLGSAVKVWHC
        QQCAVSASNI+RNFSFMPENESIMAQHRHTLETVFQC+EDHITEDEELVTNALETIVNLAPLLDLRIFSSSKPSYIKITEKRAVEAIMGMLGSAVKVWHC
Subjt:  QQCAVSASNIIRNFSFMPENESIMAQHRHTLETVFQCIEDHITEDEELVTNALETIVNLAPLLDLRIFSSSKPSYIKITEKRAVEAIMGMLGSAVKVWHC

Query:  AAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE
        AAAELLGRLIINPDNEPFLLPF PQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE
Subjt:  AAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE

Query:  PQNRGLLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
        PQNRGLLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
Subjt:  PQNRGLLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM

XP_023527171.1 armadillo repeat-containing protein LFR-like [Cucurbita pepo subsp. pepo]2.6e-24796.46Show/hide
Query:  MQKREQNKLGGNVGGASAPPAKRGRPFGSVNSNTAAAAAVAETLAPSALLGPSLHVHTSFVDQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRR
        MQKREQNKLGGNVG ASAPPAKRGRPFGSVNSN AAAAA AETLAPSALLGPSLH+HTSF DQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRR
Subjt:  MQKREQNKLGGNVGGASAPPAKRGRPFGSVNSNTAAAAAVAETLAPSALLGPSLHVHTSFVDQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRR

Query:  DSTPLAKIPGLLDALLQVIDDWRDVALPKDLVKKARVRTLGVNSSVTGFGNEFEALGSNGLRPGSSASEVTGHAPKPSPRHWWLDEDGLFNLDDEGRAER
        DSTPLAKIPGLLDALLQVIDDWRD+ALPKD VKK RVRTLG NS +TGFGNEFEALGSNGLRPGSSASE TGHAPKPSPRHWWLDEDGLFN DDEGRAER
Subjt:  DSTPLAKIPGLLDALLQVIDDWRDVALPKDLVKKARVRTLGVNSSVTGFGNEFEALGSNGLRPGSSASEVTGHAPKPSPRHWWLDEDGLFNLDDEGRAER

Query:  QQCAVSASNIIRNFSFMPENESIMAQHRHTLETVFQCIEDHITEDEELVTNALETIVNLAPLLDLRIFSSSKPSYIKITEKRAVEAIMGMLGSAVKVWHC
        QQCAVSASNI+RNFSFMPENESIMAQHRHTLETVFQC+EDHITEDEELVTNALETIVNLAPLLDLRIFSSSKPSYIKITEKRAVEAIMGMLGSAVKVWHC
Subjt:  QQCAVSASNIIRNFSFMPENESIMAQHRHTLETVFQCIEDHITEDEELVTNALETIVNLAPLLDLRIFSSSKPSYIKITEKRAVEAIMGMLGSAVKVWHC

Query:  AAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE
        AAAELLGRLIINPDNEPFLLPF PQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE
Subjt:  AAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE

Query:  PQNRGLLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
        PQNRGLLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
Subjt:  PQNRGLLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM

XP_038899743.1 armadillo repeat-containing protein LFR [Benincasa hispida]1.6e-24194.53Show/hide
Query:  MQKREQNKLGGNV-GGASAPPAKRGRPFGSVNSNTAAA----AAVAETLAPSALLGPSLHVHTSFVDQNNKRIVLALQSGLKSELTWALNTLTLLSFKEK
        MQKR+QNKL GNV GG SAPPAKRGRPFGSVNSN AAA    AA AETLAPSALLGPS H+HTSF DQNNKRIVLALQSGLKSELTWALNTLTLLSFKEK
Subjt:  MQKREQNKLGGNV-GGASAPPAKRGRPFGSVNSNTAAA----AAVAETLAPSALLGPSLHVHTSFVDQNNKRIVLALQSGLKSELTWALNTLTLLSFKEK

Query:  DDMRRDSTPLAKIPGLLDALLQVIDDWRDVALPKDLVKKARVRTLGVNSSVTGFGNEFEALGSNGLRPGSSASEVTGHAPKPSPRHWWLDEDGLFNLDDE
        DDMRRDSTPLAKIPGLLDALLQVIDDWRD+ALP+DLVKK RVRTLG NSSVTGFGNEFEALGSNGLRPGSSASE TGHA KPSPRHWWLDEDGLFNLDDE
Subjt:  DDMRRDSTPLAKIPGLLDALLQVIDDWRDVALPKDLVKKARVRTLGVNSSVTGFGNEFEALGSNGLRPGSSASEVTGHAPKPSPRHWWLDEDGLFNLDDE

Query:  GRAERQQCAVSASNIIRNFSFMPENESIMAQHRHTLETVFQCIEDHITEDEELVTNALETIVNLAPLLDLRIFSSSKPSYIKITEKRAVEAIMGMLGSAV
        GRAERQQCAVSASNI+RNFSFMPENESIMAQHRHTLETVFQCIEDH TEDEELVTNALETIVNLAPLLDLRIFSSSKP+YIKITEKRAV+AIMGMLGSAV
Subjt:  GRAERQQCAVSASNIIRNFSFMPENESIMAQHRHTLETVFQCIEDHITEDEELVTNALETIVNLAPLLDLRIFSSSKPSYIKITEKRAVEAIMGMLGSAV

Query:  KVWHCAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILE
        KVWHCAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDA AAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIK PHPVPEICRKAAMILE
Subjt:  KVWHCAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILE

Query:  SLVSEPQNRGLLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
        SLVSEPQNRGLLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
Subjt:  SLVSEPQNRGLLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM

TrEMBL top hitse value%identityAlignment
A0A1S3AWW3 armadillo repeat-containing protein LFR2.3e-24194.97Show/hide
Query:  MQKREQNKLGGNV-GGASAPPAKRGRPFGSVNSNTAA-AAAVA---ETLAPSALLGPSLHVHTSFVDQNNKRIVLALQSGLKSELTWALNTLTLLSFKEK
        MQKR+QNKLGGNV GGASAPPAKRGRPFGSVNSN AA AAAVA   ETLAPS LLGPSLH+HTSF DQNNKRIVLALQSGLKSELTWALNTLTLLSFKEK
Subjt:  MQKREQNKLGGNV-GGASAPPAKRGRPFGSVNSNTAA-AAAVA---ETLAPSALLGPSLHVHTSFVDQNNKRIVLALQSGLKSELTWALNTLTLLSFKEK

Query:  DDMRRDSTPLAKIPGLLDALLQVIDDWRDVALPKDLVKKARVRTLGVNSSVTGFGNEFEALGSNGLRPGSSASEVTGHAPKPSPRHWWLDEDGLFNLDDE
        DDMRRDSTPLAKIPGLLDALLQVIDDWRD+ALP+DLVKK RVRTLG NSSVTGFGNEFEALGS+GLRPGSSASE TGHA KPS RHWWL+EDGLFNLDDE
Subjt:  DDMRRDSTPLAKIPGLLDALLQVIDDWRDVALPKDLVKKARVRTLGVNSSVTGFGNEFEALGSNGLRPGSSASEVTGHAPKPSPRHWWLDEDGLFNLDDE

Query:  GRAERQQCAVSASNIIRNFSFMPENESIMAQHRHTLETVFQCIEDHITEDEELVTNALETIVNLAPLLDLRIFSSSKPSYIKITEKRAVEAIMGMLGSAV
        GRAERQQCAVSASNI+RNFSFMPENESIMA HRHTLETVFQCIEDH+TEDEELVTNALETIVNLAPLLDLRIFSSSKPSYIKITEKRAVEAIMGMLGSAV
Subjt:  GRAERQQCAVSASNIIRNFSFMPENESIMAQHRHTLETVFQCIEDHITEDEELVTNALETIVNLAPLLDLRIFSSSKPSYIKITEKRAVEAIMGMLGSAV

Query:  KVWHCAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILE
        KVWHCAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIK PHPVPEICRKAAMILE
Subjt:  KVWHCAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILE

Query:  SLVSEPQNRGLLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
        SLVSEPQNRGLLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
Subjt:  SLVSEPQNRGLLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM

A0A5D3D0A8 Armadillo repeat-containing protein LFR2.3e-24194.97Show/hide
Query:  MQKREQNKLGGNV-GGASAPPAKRGRPFGSVNSNTAA-AAAVA---ETLAPSALLGPSLHVHTSFVDQNNKRIVLALQSGLKSELTWALNTLTLLSFKEK
        MQKR+QNKLGGNV GGASAPPAKRGRPFGSVNSN AA AAAVA   ETLAPS LLGPSLH+HTSF DQNNKRIVLALQSGLKSELTWALNTLTLLSFKEK
Subjt:  MQKREQNKLGGNV-GGASAPPAKRGRPFGSVNSNTAA-AAAVA---ETLAPSALLGPSLHVHTSFVDQNNKRIVLALQSGLKSELTWALNTLTLLSFKEK

Query:  DDMRRDSTPLAKIPGLLDALLQVIDDWRDVALPKDLVKKARVRTLGVNSSVTGFGNEFEALGSNGLRPGSSASEVTGHAPKPSPRHWWLDEDGLFNLDDE
        DDMRRDSTPLAKIPGLLDALLQVIDDWRD+ALP+DLVKK RVRTLG NSSVTGFGNEFEALGS+GLRPGSSASE TGHA KPS RHWWL+EDGLFNLDDE
Subjt:  DDMRRDSTPLAKIPGLLDALLQVIDDWRDVALPKDLVKKARVRTLGVNSSVTGFGNEFEALGSNGLRPGSSASEVTGHAPKPSPRHWWLDEDGLFNLDDE

Query:  GRAERQQCAVSASNIIRNFSFMPENESIMAQHRHTLETVFQCIEDHITEDEELVTNALETIVNLAPLLDLRIFSSSKPSYIKITEKRAVEAIMGMLGSAV
        GRAERQQCAVSASNI+RNFSFMPENESIMA HRHTLETVFQCIEDH+TEDEELVTNALETIVNLAPLLDLRIFSSSKPSYIKITEKRAVEAIMGMLGSAV
Subjt:  GRAERQQCAVSASNIIRNFSFMPENESIMAQHRHTLETVFQCIEDHITEDEELVTNALETIVNLAPLLDLRIFSSSKPSYIKITEKRAVEAIMGMLGSAV

Query:  KVWHCAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILE
        KVWHCAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIK PHPVPEICRKAAMILE
Subjt:  KVWHCAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILE

Query:  SLVSEPQNRGLLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
        SLVSEPQNRGLLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
Subjt:  SLVSEPQNRGLLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM

A0A6J1CBV6 armadillo repeat-containing protein LFR isoform X11.5e-24094.69Show/hide
Query:  MQKREQNKLGGNVGGASAPPAKRGRPFGSVNSNTAAAAAVAETLAPSALLGPSLHVHTSFVDQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRR
        MQKREQ+KLGGNVGG SAPPAKRGRPFGS +SN AAAAA AETLAPSA LGPSLHVHTSF DQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMR+
Subjt:  MQKREQNKLGGNVGGASAPPAKRGRPFGSVNSNTAAAAAVAETLAPSALLGPSLHVHTSFVDQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRR

Query:  DSTPLAKIPGLLDALLQVIDDWRDVALPKDLVKKARVRTLGVNSSVTGFGNEFEALGSNGLRPGSSASEVTGHAPKPSPRHWWLDEDGLFNLDDEGRAER
        DSTPLAKIPGLLDALLQVIDDWRD+ALP+DLVKK RVRTLG NSSVTGFGNEFEALGSNGLRPGSS  E   HAPKPSPRHWWLDEDGLFNLDDEGRAER
Subjt:  DSTPLAKIPGLLDALLQVIDDWRDVALPKDLVKKARVRTLGVNSSVTGFGNEFEALGSNGLRPGSSASEVTGHAPKPSPRHWWLDEDGLFNLDDEGRAER

Query:  QQCAVSASNIIRNFSFMPENESIMAQHRHTLETVFQCIEDHITEDEELVTNALETIVNLAPLLDLRIFSSSKPSYIKITEKRAVEAIMGMLGSAVKVWHC
        QQCAVSASNIIRNFSFMPENE IMAQHRHTLETVFQCIEDHITEDEELVTN LETIVNLAPLLDLRIFSSSKPSYIKITEKRA EAIMGMLGS+VKVWHC
Subjt:  QQCAVSASNIIRNFSFMPENESIMAQHRHTLETVFQCIEDHITEDEELVTNALETIVNLAPLLDLRIFSSSKPSYIKITEKRAVEAIMGMLGSAVKVWHC

Query:  AAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE
        AAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE
Subjt:  AAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE

Query:  PQNRGLLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
        PQNR LLL YENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
Subjt:  PQNRGLLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM

A0A6J1EDA6 armadillo repeat-containing protein LFR-like2.1e-24796.46Show/hide
Query:  MQKREQNKLGGNVGGASAPPAKRGRPFGSVNSNTAAAAAVAETLAPSALLGPSLHVHTSFVDQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRR
        MQKREQNKLGGNVG ASAPPAKRGRPFGSVNSN AAAAA AETLAPSALLGPSLH+HTSF DQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRR
Subjt:  MQKREQNKLGGNVGGASAPPAKRGRPFGSVNSNTAAAAAVAETLAPSALLGPSLHVHTSFVDQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRR

Query:  DSTPLAKIPGLLDALLQVIDDWRDVALPKDLVKKARVRTLGVNSSVTGFGNEFEALGSNGLRPGSSASEVTGHAPKPSPRHWWLDEDGLFNLDDEGRAER
        DSTPLAKIPGLLDALLQVIDDWRDVALPKD VKK RVRTLG NS +TGFGNEFEALGSNGLRPGSSASE TGH+PKPSPRHWWLDEDGLFN DDEGRAER
Subjt:  DSTPLAKIPGLLDALLQVIDDWRDVALPKDLVKKARVRTLGVNSSVTGFGNEFEALGSNGLRPGSSASEVTGHAPKPSPRHWWLDEDGLFNLDDEGRAER

Query:  QQCAVSASNIIRNFSFMPENESIMAQHRHTLETVFQCIEDHITEDEELVTNALETIVNLAPLLDLRIFSSSKPSYIKITEKRAVEAIMGMLGSAVKVWHC
        QQCAVSASNI+RNFSFMPENESIMAQHRHTLETVFQC+EDHITEDEELVTNALETIVNLAPLLDLRIFSSSKPSYIKITEKRAVEAIMGMLGSAVKVWHC
Subjt:  QQCAVSASNIIRNFSFMPENESIMAQHRHTLETVFQCIEDHITEDEELVTNALETIVNLAPLLDLRIFSSSKPSYIKITEKRAVEAIMGMLGSAVKVWHC

Query:  AAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE
        AAAELLGRLIINPDNEPFLLPF PQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE
Subjt:  AAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE

Query:  PQNRGLLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
        PQNRGLLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
Subjt:  PQNRGLLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM

A0A6J1IW20 armadillo repeat-containing protein LFR-like2.8e-24796.02Show/hide
Query:  MQKREQNKLGGNVGGASAPPAKRGRPFGSVNSNTAAAAAVAETLAPSALLGPSLHVHTSFVDQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRR
        MQKREQNKLGGNVG ASAPPAKRGRPFGSVNSN AAAAA +ETLAPSALLGPSLH+HTSF DQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRR
Subjt:  MQKREQNKLGGNVGGASAPPAKRGRPFGSVNSNTAAAAAVAETLAPSALLGPSLHVHTSFVDQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRR

Query:  DSTPLAKIPGLLDALLQVIDDWRDVALPKDLVKKARVRTLGVNSSVTGFGNEFEALGSNGLRPGSSASEVTGHAPKPSPRHWWLDEDGLFNLDDEGRAER
        DSTPLAKIPGLLDALLQVIDDWRD+ALP+DLVKK RVRTLG NS +TGFGNEFEALGSNGLRPGSSAS+ TGHAPKPSPRHWWLDEDGLFN DDEGRAER
Subjt:  DSTPLAKIPGLLDALLQVIDDWRDVALPKDLVKKARVRTLGVNSSVTGFGNEFEALGSNGLRPGSSASEVTGHAPKPSPRHWWLDEDGLFNLDDEGRAER

Query:  QQCAVSASNIIRNFSFMPENESIMAQHRHTLETVFQCIEDHITEDEELVTNALETIVNLAPLLDLRIFSSSKPSYIKITEKRAVEAIMGMLGSAVKVWHC
        QQCAVSASNI+RNFSFMPENESIMAQHRHTLETVFQC+EDHITEDEELVTNALETIVNLAPLLDLRIFSSSKPSYIKITEKRAVEAIMGMLGSAVKVWHC
Subjt:  QQCAVSASNIIRNFSFMPENESIMAQHRHTLETVFQCIEDHITEDEELVTNALETIVNLAPLLDLRIFSSSKPSYIKITEKRAVEAIMGMLGSAVKVWHC

Query:  AAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE
        AAAELLGRLIINPDNEPFLLPF PQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE
Subjt:  AAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSE

Query:  PQNRGLLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
        PQNRGLLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
Subjt:  PQNRGLLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM

SwissProt top hitse value%identityAlignment
E9Q7E2 AT-rich interactive domain-containing protein 25.4e-0625.3Show/hide
Query:  RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRRDSTPLAKIPGLLDALLQVIDD------------WRD------VALPKDLVKKARVRTLGVNSSVTG
        ++VL+L SGL +E+ +A+N  TLLS + K  M+ +  P  KI  LL A   V DD            WR+      V   KD+V    VR L     ++ 
Subjt:  RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRRDSTPLAKIPGLLDALLQVIDD------------WRD------VALPKDLVKKARVRTLGVNSSVTG

Query:  FGNEFEALGSNGLRPGSSASEVTGHAPKPSPRHWWLDEDGLFNLDDEGRAERQQCAVSASNIIRNFSFMPENESIMAQHRHTLETVFQCIEDHITEDEEL
             E        PG    E   H P+            L   D EG     Q  +  + I+RN SF   N  ++A +R  L  +      H     +L
Subjt:  FGNEFEALGSNGLRPGSSASEVTGHAPKPSPRHWWLDEDGLFNLDDEGRAERQQCAVSASNIIRNFSFMPENESIMAQHRHTLETVFQCIEDHITEDEEL

Query:  VTNALETIVNLAP--LLDLRIFSSSKPSYIKITEKRAVEAIMGMLGSAVKVWHCAAAELLGRLIINPDNEPFLLPFVPQ-IHKRLVDLMSIPALDAQAAA
            L+T+ N+A   LLD   F ++   +  +T+          L S  +       E+LG L    DN   +  +V Q  ++ ++  +++P +    + 
Subjt:  VTNALETIVNLAP--LLDLRIFSSSKPSYIKITEKRAVEAIMGMLGSAVKVWHCAAAELLGRLIINPDNEPFLLPFVPQ-IHKRLVDLMSIPALDAQAAA

Query:  VGALYNLVEVNMDCRIKLASERWAIDRLLKVI
        +  LY L E+      K+A    +ID L+ ++
Subjt:  VGALYNLVEVNMDCRIKLASERWAIDRLLKVI

Q68CP9 AT-rich interactive domain-containing protein 27.1e-0624.4Show/hide
Query:  RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRRDSTPLAKIPGLLDALLQVIDD------------WRD------VALPKDLVKKARVRTLGVNSSVTG
        ++VL+L SGL +E+ +A+N  TLLS + K  M+ +  P  KI  LL A   V DD            W++      V   KD+V    VR L  + + + 
Subjt:  RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRRDSTPLAKIPGLLDALLQVIDD------------WRD------VALPKDLVKKARVRTLGVNSSVTG

Query:  FGNEFEALGSNGLRPGSSASEVTGHAPKPSPRHWWLDEDGLFNLDDEGRAERQQCAVSASNIIRNFSFMPENESIMAQHRHTLETVFQCIEDHITEDEEL
         G   E +  +   P               PR   ++       D EG     Q  +  + I+RN SF   N  ++A +R  L  +      H     +L
Subjt:  FGNEFEALGSNGLRPGSSASEVTGHAPKPSPRHWWLDEDGLFNLDDEGRAERQQCAVSASNIIRNFSFMPENESIMAQHRHTLETVFQCIEDHITEDEEL

Query:  VTNALETIVNLAP--LLDLRIFSSSKPSYIKITEKRAVEAIMGMLGSAVKVWHCAAAELLGRLIINPDNEPFLLPFVPQ-IHKRLVDLMSIPALDAQAAA
            L+T+ N+A   LLD   F ++   +  +T+          L S  +       E+LG L    DN   +  +V Q  ++ ++  +++P +    + 
Subjt:  VTNALETIVNLAP--LLDLRIFSSSKPSYIKITEKRAVEAIMGMLGSAVKVWHCAAAELLGRLIINPDNEPFLLPFVPQ-IHKRLVDLMSIPALDAQAAA

Query:  VGALYNLVEVNMDCRIKLASERWAIDRLLKVI
        +  LY L E+      K+A    +ID L+ ++
Subjt:  VGALYNLVEVNMDCRIKLASERWAIDRLLKVI

Q6YTW6 Armadillo repeat-containing protein LFR7.5e-18170.76Show/hide
Query:  GGNVGGASAPPAKRGRPFGSVNSN----TAAAAAVAETLAPSALLGPSLHVHTSFVDQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRRDSTPL
        G + GG  + PAKRGRPFGS   +     AAAAA+ +  AP+AL+GPSL V T+  DQNNKRIVLALQSGLKSE+ WALN LT+LSFKEKDD+RRD+TPL
Subjt:  GGNVGGASAPPAKRGRPFGSVNSN----TAAAAAVAETLAPSALLGPSLHVHTSFVDQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRRDSTPL

Query:  AKIPGLLDALLQVIDDWRDVALPKDLVKKARVRTLGVNSSVTGFGNE-FEALGSNGLRPGSSASEVT-GHAPKPSPRHWWLDEDGLFNLDDEGRAERQQC
        AK+PGLLDALLQVIDDWRD+A+PKD  K  RVRTLGVN++++GFG+E  E + S+   P    ++       K     +  DE+GLFN+DDEGR E+QQC
Subjt:  AKIPGLLDALLQVIDDWRDVALPKDLVKKARVRTLGVNSSVTGFGNE-FEALGSNGLRPGSSASEVT-GHAPKPSPRHWWLDEDGLFNLDDEGRAERQQC

Query:  AVSASNIIRNFSFMPENESIMAQHRHTLETVFQCIEDHITEDEELVTNALETIVNLAPLLDLRIFSSSKPSYIKITEKRAVEAIMGMLGSAVKVWHCAAA
        AV+ASNIIRNFSFMPENE++M QHRH LETVFQC+ED  TED+EL+TN LET+VNLAP+LDLRIFSSSKPS+IKITEKRAV+AIMGML S+++VWHCAAA
Subjt:  AVSASNIIRNFSFMPENESIMAQHRHTLETVFQCIEDHITEDEELVTNALETIVNLAPLLDLRIFSSSKPSYIKITEKRAVEAIMGMLGSAVKVWHCAAA

Query:  ELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSEPQN
        EL+GRLIINPDNEPFLLP +PQI+KRLVDL+S+PA+DAQAAA+ ALYN+ EVNMD R+KLASERWA+DRLLKV+KTPHPVPE+CRKA+MI+ESLVSEPQN
Subjt:  ELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSEPQN

Query:  RGLLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWG
        R  LL +EN FAEIL S+G+YSDTFARILYELT+RP+NKV A Q +WG
Subjt:  RGLLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWG

Q9LS90 Armadillo repeat-containing protein LFR1.8e-19875.22Show/hide
Query:  MQKREQNKLGGNVGGASAPPAKRGRPFGSVNSNT---AAAAAVAETLAPSALLGPSLHVHTSFVDQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDD
        MQKRE  K GGN GG+S PPAKRGRPFGS ++N+   AAAAA A+ ++PSALLGPSL VH SFV+QNN+RIVLALQSGLKSE+TWALNTLTLLSFKEK+D
Subjt:  MQKREQNKLGGNVGGASAPPAKRGRPFGSVNSNT---AAAAAVAETLAPSALLGPSLHVHTSFVDQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDD

Query:  MRRDSTPLAKIPGLLDALLQVIDDWRDVALPKDLVKKARVRTLGVNSSVTGFGNEFEALGS---NGLRPGSSASEVTG--HAPKPSPRHWWLDEDGLFNL
        +RRD  PLAKI GLLDALL +IDDWRD+ALPKDL +  RVRTLG N+SVTGFGNE++AL S    G   GSSA+E  G     K     WW++EDGLFNL
Subjt:  MRRDSTPLAKIPGLLDALLQVIDDWRDVALPKDLVKKARVRTLGVNSSVTGFGNEFEALGS---NGLRPGSSASEVTG--HAPKPSPRHWWLDEDGLFNL

Query:  DDEGRAERQQCAVSASNIIRNFSFMPENESIMAQHRHTLETVFQCIEDHITEDEELVTNALETIVNLAPLLDLRIFSSSKPSYIKITEKRAVEAIMGMLG
        DDEGR+E+Q CA++ASN+IRNFSFMP+NE +MAQHRH LETVFQCI DH+TEDEELVTN+LETIVNLA L+DLRIFSS K SYI I EK+AV+A++G+L 
Subjt:  DDEGRAERQQCAVSASNIIRNFSFMPENESIMAQHRHTLETVFQCIEDHITEDEELVTNALETIVNLAPLLDLRIFSSSKPSYIKITEKRAVEAIMGMLG

Query:  SAVKVWHCAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAM
        S+VK W+CAAAELLGRLIINPDNEPF+ P +PQIHKRL+DL+SI A+DAQAAAVGALYNLVEVNMDCR+KLASERWA+DRLLKVIKTPHPVPE+CRKAAM
Subjt:  SAVKVWHCAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAM

Query:  ILESLVSEPQNRGLLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
        ILE+LVSEPQNRGLLLAYENAFAE+LF +G+YSD+FARILYELT+R N++VA+A+G+WGM
Subjt:  ILESLVSEPQNRGLLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM

Arabidopsis top hitse value%identityAlignment
AT3G22990.1 ARM repeat superfamily protein1.3e-19975.22Show/hide
Query:  MQKREQNKLGGNVGGASAPPAKRGRPFGSVNSNT---AAAAAVAETLAPSALLGPSLHVHTSFVDQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDD
        MQKRE  K GGN GG+S PPAKRGRPFGS ++N+   AAAAA A+ ++PSALLGPSL VH SFV+QNN+RIVLALQSGLKSE+TWALNTLTLLSFKEK+D
Subjt:  MQKREQNKLGGNVGGASAPPAKRGRPFGSVNSNT---AAAAAVAETLAPSALLGPSLHVHTSFVDQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDD

Query:  MRRDSTPLAKIPGLLDALLQVIDDWRDVALPKDLVKKARVRTLGVNSSVTGFGNEFEALGS---NGLRPGSSASEVTG--HAPKPSPRHWWLDEDGLFNL
        +RRD  PLAKI GLLDALL +IDDWRD+ALPKDL +  RVRTLG N+SVTGFGNE++AL S    G   GSSA+E  G     K     WW++EDGLFNL
Subjt:  MRRDSTPLAKIPGLLDALLQVIDDWRDVALPKDLVKKARVRTLGVNSSVTGFGNEFEALGS---NGLRPGSSASEVTG--HAPKPSPRHWWLDEDGLFNL

Query:  DDEGRAERQQCAVSASNIIRNFSFMPENESIMAQHRHTLETVFQCIEDHITEDEELVTNALETIVNLAPLLDLRIFSSSKPSYIKITEKRAVEAIMGMLG
        DDEGR+E+Q CA++ASN+IRNFSFMP+NE +MAQHRH LETVFQCI DH+TEDEELVTN+LETIVNLA L+DLRIFSS K SYI I EK+AV+A++G+L 
Subjt:  DDEGRAERQQCAVSASNIIRNFSFMPENESIMAQHRHTLETVFQCIEDHITEDEELVTNALETIVNLAPLLDLRIFSSSKPSYIKITEKRAVEAIMGMLG

Query:  SAVKVWHCAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAM
        S+VK W+CAAAELLGRLIINPDNEPF+ P +PQIHKRL+DL+SI A+DAQAAAVGALYNLVEVNMDCR+KLASERWA+DRLLKVIKTPHPVPE+CRKAAM
Subjt:  SAVKVWHCAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAM

Query:  ILESLVSEPQNRGLLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
        ILE+LVSEPQNRGLLLAYENAFAE+LF +G+YSD+FARILYELT+R N++VA+A+G+WGM
Subjt:  ILESLVSEPQNRGLLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAGAAGAGAGAGCAGAACAAGTTGGGCGGAAATGTTGGCGGCGCCTCGGCGCCTCCGGCTAAGCGAGGCCGTCCATTCGGCAGCGTAAACAGCAACACCGCCGCTGC
AGCCGCAGTCGCCGAGACCTTGGCTCCATCTGCACTGCTTGGCCCTTCTCTTCATGTTCATACTTCCTTCGTCGATCAAAACAATAAAAGGATAGTGTTGGCTCTACAGA
GTGGATTGAAGAGTGAATTGACGTGGGCATTGAACACTCTTACTCTGCTCTCTTTTAAAGAGAAGGATGATATGCGCAGAGACTCGACTCCTTTGGCTAAAATTCCCGGC
TTGCTCGATGCTCTTCTTCAAGTTATAGATGATTGGCGTGATGTAGCACTTCCGAAGGATCTTGTAAAGAAGGCAAGGGTCAGAACATTAGGTGTAAACTCTTCTGTAAC
GGGATTTGGGAATGAATTTGAGGCATTGGGCTCAAATGGCCTGAGACCTGGTTCTTCAGCTTCTGAGGTAACGGGTCATGCCCCAAAACCATCTCCTCGACATTGGTGGC
TTGATGAAGATGGTCTATTTAATCTGGATGATGAAGGACGAGCAGAAAGACAGCAATGTGCTGTTTCTGCTTCAAATATCATCCGAAACTTCTCTTTCATGCCAGAGAAT
GAATCTATTATGGCTCAACATCGACATACTCTGGAAACAGTGTTTCAATGTATAGAAGATCATATTACAGAGGATGAAGAACTTGTCACAAATGCACTAGAGACAATTGT
AAATTTAGCCCCGCTCCTTGATCTTCGTATCTTTAGTTCGTCAAAACCATCCTACATCAAAATAACAGAGAAACGAGCAGTCGAAGCCATCATGGGAATGTTGGGATCTG
CTGTCAAAGTTTGGCACTGTGCTGCTGCAGAATTACTTGGACGGTTGATAATAAATCCCGATAATGAGCCTTTTCTTCTTCCCTTCGTCCCCCAGATACACAAGCGTTTA
GTTGATCTTATGAGCATCCCAGCACTAGATGCACAAGCAGCAGCTGTTGGCGCACTGTATAACCTCGTCGAAGTTAATATGGACTGCAGAATAAAGCTGGCAAGCGAAAG
ATGGGCGATCGACCGACTTCTTAAAGTAATCAAGACACCTCATCCAGTTCCAGAAATATGCAGGAAAGCAGCAATGATATTGGAGAGTCTTGTATCTGAGCCACAGAACA
GGGGTTTGCTGCTAGCATACGAAAATGCATTTGCAGAAATACTCTTCTCAGATGGCAGATATTCGGATACATTCGCTCGGATATTGTATGAATTAACATCCAGACCAAAC
AACAAAGTTGCTGCTGCTCAAGGTGTATGGGGCATGTGA
mRNA sequenceShow/hide mRNA sequence
GTTGAATTCATATTGCATCTTAGTGCCCTAATTTTGGGTTTCCAGCTTCCCGATTGTTAGAGGATCGGATCGCCATTGCTGAATCACCCGAAGTTTAGCGAGAACAAGAC
AGGCCATGCAGAAGAGAGAGCAGAACAAGTTGGGCGGAAATGTTGGCGGCGCCTCGGCGCCTCCGGCTAAGCGAGGCCGTCCATTCGGCAGCGTAAACAGCAACACCGCC
GCTGCAGCCGCAGTCGCCGAGACCTTGGCTCCATCTGCACTGCTTGGCCCTTCTCTTCATGTTCATACTTCCTTCGTCGATCAAAACAATAAAAGGATAGTGTTGGCTCT
ACAGAGTGGATTGAAGAGTGAATTGACGTGGGCATTGAACACTCTTACTCTGCTCTCTTTTAAAGAGAAGGATGATATGCGCAGAGACTCGACTCCTTTGGCTAAAATTC
CCGGCTTGCTCGATGCTCTTCTTCAAGTTATAGATGATTGGCGTGATGTAGCACTTCCGAAGGATCTTGTAAAGAAGGCAAGGGTCAGAACATTAGGTGTAAACTCTTCT
GTAACGGGATTTGGGAATGAATTTGAGGCATTGGGCTCAAATGGCCTGAGACCTGGTTCTTCAGCTTCTGAGGTAACGGGTCATGCCCCAAAACCATCTCCTCGACATTG
GTGGCTTGATGAAGATGGTCTATTTAATCTGGATGATGAAGGACGAGCAGAAAGACAGCAATGTGCTGTTTCTGCTTCAAATATCATCCGAAACTTCTCTTTCATGCCAG
AGAATGAATCTATTATGGCTCAACATCGACATACTCTGGAAACAGTGTTTCAATGTATAGAAGATCATATTACAGAGGATGAAGAACTTGTCACAAATGCACTAGAGACA
ATTGTAAATTTAGCCCCGCTCCTTGATCTTCGTATCTTTAGTTCGTCAAAACCATCCTACATCAAAATAACAGAGAAACGAGCAGTCGAAGCCATCATGGGAATGTTGGG
ATCTGCTGTCAAAGTTTGGCACTGTGCTGCTGCAGAATTACTTGGACGGTTGATAATAAATCCCGATAATGAGCCTTTTCTTCTTCCCTTCGTCCCCCAGATACACAAGC
GTTTAGTTGATCTTATGAGCATCCCAGCACTAGATGCACAAGCAGCAGCTGTTGGCGCACTGTATAACCTCGTCGAAGTTAATATGGACTGCAGAATAAAGCTGGCAAGC
GAAAGATGGGCGATCGACCGACTTCTTAAAGTAATCAAGACACCTCATCCAGTTCCAGAAATATGCAGGAAAGCAGCAATGATATTGGAGAGTCTTGTATCTGAGCCACA
GAACAGGGGTTTGCTGCTAGCATACGAAAATGCATTTGCAGAAATACTCTTCTCAGATGGCAGATATTCGGATACATTCGCTCGGATATTGTATGAATTAACATCCAGAC
CAAACAACAAAGTTGCTGCTGCTCAAGGTGTATGGGGCATGTGATCGTAAGTATCCTACAGCCAATCGTCGTATCCCTCTGCCCGAGATACTAGACTAGTACTCGCTTCC
GATTCAAACAAAGGACTCGAACCTCATTGTCGCCCGAGATACTAGACTAGTACTCGCTTCCGATTCAAACAAAGGACTCGAACCTCATTGTCACTGTAATATGTAAATAT
CAGCTATGGTTTATCGGTGATTATAAGTCTACTAACTTTCATATATATCTAATTCAAGCTATTGCATCATCGAAGGCTGTATGTCTCAGTCATTTGTACTGCTCACTTGT
AATCCTATTGTAAGTTACTTCTATTTTCAATACCAGAGTGTCGACA
Protein sequenceShow/hide protein sequence
MQKREQNKLGGNVGGASAPPAKRGRPFGSVNSNTAAAAAVAETLAPSALLGPSLHVHTSFVDQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRRDSTPLAKIPG
LLDALLQVIDDWRDVALPKDLVKKARVRTLGVNSSVTGFGNEFEALGSNGLRPGSSASEVTGHAPKPSPRHWWLDEDGLFNLDDEGRAERQQCAVSASNIIRNFSFMPEN
ESIMAQHRHTLETVFQCIEDHITEDEELVTNALETIVNLAPLLDLRIFSSSKPSYIKITEKRAVEAIMGMLGSAVKVWHCAAAELLGRLIINPDNEPFLLPFVPQIHKRL
VDLMSIPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEICRKAAMILESLVSEPQNRGLLLAYENAFAEILFSDGRYSDTFARILYELTSRPN
NKVAAAQGVWGM