; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0039144 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0039144
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionPlant protein of unknown function (DUF247)
Genome locationchr2:36955904..36957211
RNA-Seq ExpressionLag0039144
SyntenyLag0039144
Gene Ontology termsNA
InterPro domainsIPR004158 - Protein of unknown function DUF247, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0064930.1 UPF0481 protein [Cucumis melo var. makuwa]5.4e-9346.01Show/hide
Query:  SRNTKPKEIVSRVVESLKTSLKSVDPDPT----DCCIYRVPQPLRSVNPKAYTPRVISIGPFH--RNTPDLMANRRKSRYLQDILQFMNIDLYKVVS-YL
        SR+ + KEI  R+VES+  S+       T    +  IY VP+ LR+ NPKAY+P+VISIGP H  R   DL    +K  Y+ + L    +   ++++ +L
Subjt:  SRNTKPKEIVSRVVESLKTSLKSVDPDPT----DCCIYRVPQPLRSVNPKAYTPRVISIGPFH--RNTPDLMANRRKSRYLQDILQFMNIDLYKVVS-YL

Query:  GPVQRVRNCYAETIEMEDADFLELLAIDCSFVVMYLIGSKFPHFRDRDTSFLWRFRNGILCDLLLLENQLPFFLLEQLCQLCISSADELQEVSNFSELAK
           +R RN Y ETI+ME  +F++LL  D  FVVMY+IGS    FRD DTSFLWRF NGI  DLLLLENQLPFFLL  L  LC  +   L+++S F EL +
Subjt:  GPVQRVRNCYAETIEMEDADFLELLAIDCSFVVMYLIGSKFPHFRDRDTSFLWRFRNGILCDLLLLENQLPFFLLEQLCQLCISSADELQEVSNFSELAK

Query:  HYFIEVNK-MNFVDNNN-DRQGYGIKHFIDLLRMDITGSCSREQP---DVYTSLSCTIWPPSATELHDCGISFKKKSRATMHIEFQDGDGVLSLPEIEVS
         YF EV + M++V+    D     + H +D LR+ +T    R  P   D       + WP +ATELHDCGISF  + R  M++ F++ +GVL +P+I + 
Subjt:  HYFIEVNK-MNFVDNNN-DRQGYGIKHFIDLLRMDITGSCSREQP---DVYTSLSCTIWPPSATELHDCGISFKKKSRATMHIEFQDGDGVLSLPEIEVS

Query:  ETLEVQLRNLIAYERCHGRHHKTLTDDVCNFAFFLSYLINTEKDVELLINKEIIQNSLGSIEEVTSFFNNLSKNLLVRSNIYHNECESMKSYCRRRRHRW
        ++ E+  RN+IAYE CH +     + DV NF  F+ +LINT KDV LL++  IIQN LGS  E+   FN+L KN++V  N+Y  EC  MK YC+ RRHRW
Subjt:  ETLEVQLRNLIAYERCHGRHHKTLTDDVCNFAFFLSYLINTEKDVELLINKEIIQNSLGSIEEVTSFFNNLSKNLLVRSNIYHNECESMKSYCRRRRHRW

Query:  MTSLKRKYFNTPWAAVAFFFTVLITSLTILQAVMAYLNL
        MTSLKR YF TPWA ++F   VL+  LT+LQ V+A++ L
Subjt:  MTSLKRKYFNTPWAAVAFFFTVLITSLTILQAVMAYLNL

XP_004138858.1 UPF0481 protein At3g47200 [Cucumis sativus]5.3e-8843.86Show/hide
Query:  SRNTKPKEIVSRVVESLKTSL-----KSVDPDPTDCCIYRVPQPLRSVNPKAYTPRVISIGPFH-RNTPDLMANRRKSRYLQDILQFMNIDLYKVV-SYL
        SR+ + K I  R+V S+  S+     +S      +  IY VP+ LR  NPKAY+P+VISIGP H   T + +   +K  Y+ + L    +D  +++  +L
Subjt:  SRNTKPKEIVSRVVESLKTSL-----KSVDPDPTDCCIYRVPQPLRSVNPKAYTPRVISIGPFH-RNTPDLMANRRKSRYLQDILQFMNIDLYKVV-SYL

Query:  GPVQRVRNCYAETIEMEDADFLELLAIDCSFVVMYLIGSKFPHFRDRDTSFLWRFRNGILCDLLLLENQLPFFLLEQLCQLCISSADELQEVSNFSELAK
           +R RN Y ETIEM+  +F++LL  D  FVVMY+IGS    FRD DTSFLWRF NGI  DLLLLENQLPFFLL  L  LC S+   L+++S F EL +
Subjt:  GPVQRVRNCYAETIEMEDADFLELLAIDCSFVVMYLIGSKFPHFRDRDTSFLWRFRNGILCDLLLLENQLPFFLLEQLCQLCISSADELQEVSNFSELAK

Query:  HYFIEVNK-MNFV-DNNNDRQGYGIKHFIDLLRMDITGSCSREQPDVYTSLS----CTIWPPSATELHDCGISFKKKSRATMHIEFQDGDGVLSLPEIEV
         YF +V + M++V +   D     + H +D LR+ +T    R  P  +  LS     + WP +ATELH+CGISF  + +  M++ F++  GVL +P+I +
Subjt:  HYFIEVNK-MNFV-DNNNDRQGYGIKHFIDLLRMDITGSCSREQPDVYTSLS----CTIWPPSATELHDCGISFKKKSRATMHIEFQDGDGVLSLPEIEV

Query:  SETLEVQLRNLIAYERCHGRHHKTLTDDVCNFAFFLSYLINTEKDVELLINKEIIQNSLGSIEEVTSFFNNLSKNLLVRSNIYHNECESMKSYCRRRRHR
         ++ E+  RN+IAYE CH +     + D  NF  F+ +LINT +DV LL++  IIQN LGS +E+   F++L KN+++  N Y   C  MK YC+ RRHR
Subjt:  SETLEVQLRNLIAYERCHGRHHKTLTDDVCNFAFFLSYLINTEKDVELLINKEIIQNSLGSIEEVTSFFNNLSKNLLVRSNIYHNECESMKSYCRRRRHR

Query:  WMTSLKRKYFNTPWAAVAFFFTVLITSLTILQAVMAYLNL
        WMTSLKR YF TPWA ++F   VL+  LT+LQ V+A++ L
Subjt:  WMTSLKRKYFNTPWAAVAFFFTVLITSLTILQAVMAYLNL

XP_008445209.1 PREDICTED: UPF0481 protein At3g47200-like [Cucumis melo]5.4e-9346.01Show/hide
Query:  SRNTKPKEIVSRVVESLKTSLKSVDPDPT----DCCIYRVPQPLRSVNPKAYTPRVISIGPFH--RNTPDLMANRRKSRYLQDILQFMNIDLYKVVS-YL
        SR+ + KEI  R+VES+  S+       T    +  IY VP+ LR+ NPKAY+P+VISIGP H  R   DL    +K  Y+ + L    +   ++++ +L
Subjt:  SRNTKPKEIVSRVVESLKTSLKSVDPDPT----DCCIYRVPQPLRSVNPKAYTPRVISIGPFH--RNTPDLMANRRKSRYLQDILQFMNIDLYKVVS-YL

Query:  GPVQRVRNCYAETIEMEDADFLELLAIDCSFVVMYLIGSKFPHFRDRDTSFLWRFRNGILCDLLLLENQLPFFLLEQLCQLCISSADELQEVSNFSELAK
           +R RN Y ETI+ME  +F++LL  D  FVVMY+IGS    FRD DTSFLWRF NGI  DLLLLENQLPFFLL  L  LC  +   L+++S F EL +
Subjt:  GPVQRVRNCYAETIEMEDADFLELLAIDCSFVVMYLIGSKFPHFRDRDTSFLWRFRNGILCDLLLLENQLPFFLLEQLCQLCISSADELQEVSNFSELAK

Query:  HYFIEVNK-MNFVDNNN-DRQGYGIKHFIDLLRMDITGSCSREQP---DVYTSLSCTIWPPSATELHDCGISFKKKSRATMHIEFQDGDGVLSLPEIEVS
         YF EV + M++V+    D     + H +D LR+ +T    R  P   D       + WP +ATELHDCGISF  + R  M++ F++ +GVL +P+I + 
Subjt:  HYFIEVNK-MNFVDNNN-DRQGYGIKHFIDLLRMDITGSCSREQP---DVYTSLSCTIWPPSATELHDCGISFKKKSRATMHIEFQDGDGVLSLPEIEVS

Query:  ETLEVQLRNLIAYERCHGRHHKTLTDDVCNFAFFLSYLINTEKDVELLINKEIIQNSLGSIEEVTSFFNNLSKNLLVRSNIYHNECESMKSYCRRRRHRW
        ++ E+  RN+IAYE CH +     + DV NF  F+ +LINT KDV LL++  IIQN LGS  E+   FN+L KN++V  N+Y  EC  MK YC+ RRHRW
Subjt:  ETLEVQLRNLIAYERCHGRHHKTLTDDVCNFAFFLSYLINTEKDVELLINKEIIQNSLGSIEEVTSFFNNLSKNLLVRSNIYHNECESMKSYCRRRRHRW

Query:  MTSLKRKYFNTPWAAVAFFFTVLITSLTILQAVMAYLNL
        MTSLKR YF TPWA ++F   VL+  LT+LQ V+A++ L
Subjt:  MTSLKRKYFNTPWAAVAFFFTVLITSLTILQAVMAYLNL

XP_022961893.1 UPF0481 protein At3g47200-like isoform X1 [Cucurbita moschata]1.2e-8444.53Show/hide
Query:  KSVDPDPTDCCIYRVPQPLRSVNPKAYTPRVISIGPFHRNTPDLMANRRKSRYLQDILQFMNIDLYKVVSYLGP-VQRVRNCYAETIEMEDADFLELLAI
        ++V    +D  IY+VP+PLRS+ P+AYTP VISIGP H    DL AN  K  YLQ+ L    +    +V  +    +R R CYAE+IEM   +F+ELL  
Subjt:  KSVDPDPTDCCIYRVPQPLRSVNPKAYTPRVISIGPFHRNTPDLMANRRKSRYLQDILQFMNIDLYKVVSYLGP-VQRVRNCYAETIEMEDADFLELLAI

Query:  DCSFVVMYLIGSKFPHFRDRDTSFLWRFRNGILCDLLLLENQLPFFLLEQLCQLCISSADELQEVSNFSELAKHYFIEVNKMNFVDNNND----RQGYGI
        D  FVVM+LIG  F   R  D S LW+F   + CDL+LLENQLPFFLL+ L  LC SS   L+ V +F EL   YFIE +K               G  +
Subjt:  DCSFVVMYLIGSKFPHFRDRDTSFLWRFRNGILCDLLLLENQLPFFLLEQLCQLCISSADELQEVSNFSELAKHYFIEVNKMNFVDNNND----RQGYGI

Query:  KHFIDLLRMDITGSCSREQPDVYTSLSCTIWPPSATELHDCGISFKKKSRATMHIEFQDGDGVLSLPEIEVSETLEVQLRNLIAYERCHGRHHKTLTDDV
         HF+DLLR+  T + S E     TS   T WPP+AT+LH+CG+ FK        I F+D  G L LP+I + +  E ++RNLIAYE+CH      L ++V
Subjt:  KHFIDLLRMDITGSCSREQPDVYTSLSCTIWPPSATELHDCGISFKKKSRATMHIEFQDGDGVLSLPEIEVSETLEVQLRNLIAYERCHGRHHKTLTDDV

Query:  CNFAFFLSYLINTEKDVELLINKEIIQNSLGSIEEVTSFFNNLSKNLLVRSNIYHNECESMKSYCRRRRHRWMTSLKRKYFNTPWAAVAFFFTVLITSLT
         NFA F+  L+ T++DV+LLI   II N+ GSI EVT  FNNL K++    N Y+ +C+ MK YC+R RHRW++ L+R YF+TPW   +    +L+ +LT
Subjt:  CNFAFFLSYLINTEKDVELLINKEIIQNSLGSIEEVTSFFNNLSKNLLVRSNIYHNECESMKSYCRRRRHRWMTSLKRKYFNTPWAAVAFFFTVLITSLT

Query:  ILQAVMAYLNL
        ++Q ++A ++L
Subjt:  ILQAVMAYLNL

XP_022961897.1 UPF0481 protein At3g47200-like isoform X3 [Cucurbita moschata]1.2e-8444.53Show/hide
Query:  KSVDPDPTDCCIYRVPQPLRSVNPKAYTPRVISIGPFHRNTPDLMANRRKSRYLQDILQFMNIDLYKVVSYLGP-VQRVRNCYAETIEMEDADFLELLAI
        ++V    +D  IY+VP+PLRS+ P+AYTP VISIGP H    DL AN  K  YLQ+ L    +    +V  +    +R R CYAE+IEM   +F+ELL  
Subjt:  KSVDPDPTDCCIYRVPQPLRSVNPKAYTPRVISIGPFHRNTPDLMANRRKSRYLQDILQFMNIDLYKVVSYLGP-VQRVRNCYAETIEMEDADFLELLAI

Query:  DCSFVVMYLIGSKFPHFRDRDTSFLWRFRNGILCDLLLLENQLPFFLLEQLCQLCISSADELQEVSNFSELAKHYFIEVNKMNFVDNNND----RQGYGI
        D  FVVM+LIG  F   R  D S LW+F   + CDL+LLENQLPFFLL+ L  LC SS   L+ V +F EL   YFIE +K               G  +
Subjt:  DCSFVVMYLIGSKFPHFRDRDTSFLWRFRNGILCDLLLLENQLPFFLLEQLCQLCISSADELQEVSNFSELAKHYFIEVNKMNFVDNNND----RQGYGI

Query:  KHFIDLLRMDITGSCSREQPDVYTSLSCTIWPPSATELHDCGISFKKKSRATMHIEFQDGDGVLSLPEIEVSETLEVQLRNLIAYERCHGRHHKTLTDDV
         HF+DLLR+  T + S E     TS   T WPP+AT+LH+CG+ FK        I F+D  G L LP+I + +  E ++RNLIAYE+CH      L ++V
Subjt:  KHFIDLLRMDITGSCSREQPDVYTSLSCTIWPPSATELHDCGISFKKKSRATMHIEFQDGDGVLSLPEIEVSETLEVQLRNLIAYERCHGRHHKTLTDDV

Query:  CNFAFFLSYLINTEKDVELLINKEIIQNSLGSIEEVTSFFNNLSKNLLVRSNIYHNECESMKSYCRRRRHRWMTSLKRKYFNTPWAAVAFFFTVLITSLT
         NFA F+  L+ T++DV+LLI   II N+ GSI EVT  FNNL K++    N Y+ +C+ MK YC+R RHRW++ L+R YF+TPW   +    +L+ +LT
Subjt:  CNFAFFLSYLINTEKDVELLINKEIIQNSLGSIEEVTSFFNNLSKNLLVRSNIYHNECESMKSYCRRRRHRWMTSLKRKYFNTPWAAVAFFFTVLITSLT

Query:  ILQAVMAYLNL
        ++Q ++A ++L
Subjt:  ILQAVMAYLNL

TrEMBL top hitse value%identityAlignment
A0A0A0LPK8 Uncharacterized protein2.5e-8843.86Show/hide
Query:  SRNTKPKEIVSRVVESLKTSL-----KSVDPDPTDCCIYRVPQPLRSVNPKAYTPRVISIGPFH-RNTPDLMANRRKSRYLQDILQFMNIDLYKVV-SYL
        SR+ + K I  R+V S+  S+     +S      +  IY VP+ LR  NPKAY+P+VISIGP H   T + +   +K  Y+ + L    +D  +++  +L
Subjt:  SRNTKPKEIVSRVVESLKTSL-----KSVDPDPTDCCIYRVPQPLRSVNPKAYTPRVISIGPFH-RNTPDLMANRRKSRYLQDILQFMNIDLYKVV-SYL

Query:  GPVQRVRNCYAETIEMEDADFLELLAIDCSFVVMYLIGSKFPHFRDRDTSFLWRFRNGILCDLLLLENQLPFFLLEQLCQLCISSADELQEVSNFSELAK
           +R RN Y ETIEM+  +F++LL  D  FVVMY+IGS    FRD DTSFLWRF NGI  DLLLLENQLPFFLL  L  LC S+   L+++S F EL +
Subjt:  GPVQRVRNCYAETIEMEDADFLELLAIDCSFVVMYLIGSKFPHFRDRDTSFLWRFRNGILCDLLLLENQLPFFLLEQLCQLCISSADELQEVSNFSELAK

Query:  HYFIEVNK-MNFV-DNNNDRQGYGIKHFIDLLRMDITGSCSREQPDVYTSLS----CTIWPPSATELHDCGISFKKKSRATMHIEFQDGDGVLSLPEIEV
         YF +V + M++V +   D     + H +D LR+ +T    R  P  +  LS     + WP +ATELH+CGISF  + +  M++ F++  GVL +P+I +
Subjt:  HYFIEVNK-MNFV-DNNNDRQGYGIKHFIDLLRMDITGSCSREQPDVYTSLS----CTIWPPSATELHDCGISFKKKSRATMHIEFQDGDGVLSLPEIEV

Query:  SETLEVQLRNLIAYERCHGRHHKTLTDDVCNFAFFLSYLINTEKDVELLINKEIIQNSLGSIEEVTSFFNNLSKNLLVRSNIYHNECESMKSYCRRRRHR
         ++ E+  RN+IAYE CH +     + D  NF  F+ +LINT +DV LL++  IIQN LGS +E+   F++L KN+++  N Y   C  MK YC+ RRHR
Subjt:  SETLEVQLRNLIAYERCHGRHHKTLTDDVCNFAFFLSYLINTEKDVELLINKEIIQNSLGSIEEVTSFFNNLSKNLLVRSNIYHNECESMKSYCRRRRHR

Query:  WMTSLKRKYFNTPWAAVAFFFTVLITSLTILQAVMAYLNL
        WMTSLKR YF TPWA ++F   VL+  LT+LQ V+A++ L
Subjt:  WMTSLKRKYFNTPWAAVAFFFTVLITSLTILQAVMAYLNL

A0A1S3BD00 UPF0481 protein At3g47200-like2.6e-9346.01Show/hide
Query:  SRNTKPKEIVSRVVESLKTSLKSVDPDPT----DCCIYRVPQPLRSVNPKAYTPRVISIGPFH--RNTPDLMANRRKSRYLQDILQFMNIDLYKVVS-YL
        SR+ + KEI  R+VES+  S+       T    +  IY VP+ LR+ NPKAY+P+VISIGP H  R   DL    +K  Y+ + L    +   ++++ +L
Subjt:  SRNTKPKEIVSRVVESLKTSLKSVDPDPT----DCCIYRVPQPLRSVNPKAYTPRVISIGPFH--RNTPDLMANRRKSRYLQDILQFMNIDLYKVVS-YL

Query:  GPVQRVRNCYAETIEMEDADFLELLAIDCSFVVMYLIGSKFPHFRDRDTSFLWRFRNGILCDLLLLENQLPFFLLEQLCQLCISSADELQEVSNFSELAK
           +R RN Y ETI+ME  +F++LL  D  FVVMY+IGS    FRD DTSFLWRF NGI  DLLLLENQLPFFLL  L  LC  +   L+++S F EL +
Subjt:  GPVQRVRNCYAETIEMEDADFLELLAIDCSFVVMYLIGSKFPHFRDRDTSFLWRFRNGILCDLLLLENQLPFFLLEQLCQLCISSADELQEVSNFSELAK

Query:  HYFIEVNK-MNFVDNNN-DRQGYGIKHFIDLLRMDITGSCSREQP---DVYTSLSCTIWPPSATELHDCGISFKKKSRATMHIEFQDGDGVLSLPEIEVS
         YF EV + M++V+    D     + H +D LR+ +T    R  P   D       + WP +ATELHDCGISF  + R  M++ F++ +GVL +P+I + 
Subjt:  HYFIEVNK-MNFVDNNN-DRQGYGIKHFIDLLRMDITGSCSREQP---DVYTSLSCTIWPPSATELHDCGISFKKKSRATMHIEFQDGDGVLSLPEIEVS

Query:  ETLEVQLRNLIAYERCHGRHHKTLTDDVCNFAFFLSYLINTEKDVELLINKEIIQNSLGSIEEVTSFFNNLSKNLLVRSNIYHNECESMKSYCRRRRHRW
        ++ E+  RN+IAYE CH +     + DV NF  F+ +LINT KDV LL++  IIQN LGS  E+   FN+L KN++V  N+Y  EC  MK YC+ RRHRW
Subjt:  ETLEVQLRNLIAYERCHGRHHKTLTDDVCNFAFFLSYLINTEKDVELLINKEIIQNSLGSIEEVTSFFNNLSKNLLVRSNIYHNECESMKSYCRRRRHRW

Query:  MTSLKRKYFNTPWAAVAFFFTVLITSLTILQAVMAYLNL
        MTSLKR YF TPWA ++F   VL+  LT+LQ V+A++ L
Subjt:  MTSLKRKYFNTPWAAVAFFFTVLITSLTILQAVMAYLNL

A0A5A7VCL1 UPF0481 protein2.6e-9346.01Show/hide
Query:  SRNTKPKEIVSRVVESLKTSLKSVDPDPT----DCCIYRVPQPLRSVNPKAYTPRVISIGPFH--RNTPDLMANRRKSRYLQDILQFMNIDLYKVVS-YL
        SR+ + KEI  R+VES+  S+       T    +  IY VP+ LR+ NPKAY+P+VISIGP H  R   DL    +K  Y+ + L    +   ++++ +L
Subjt:  SRNTKPKEIVSRVVESLKTSLKSVDPDPT----DCCIYRVPQPLRSVNPKAYTPRVISIGPFH--RNTPDLMANRRKSRYLQDILQFMNIDLYKVVS-YL

Query:  GPVQRVRNCYAETIEMEDADFLELLAIDCSFVVMYLIGSKFPHFRDRDTSFLWRFRNGILCDLLLLENQLPFFLLEQLCQLCISSADELQEVSNFSELAK
           +R RN Y ETI+ME  +F++LL  D  FVVMY+IGS    FRD DTSFLWRF NGI  DLLLLENQLPFFLL  L  LC  +   L+++S F EL +
Subjt:  GPVQRVRNCYAETIEMEDADFLELLAIDCSFVVMYLIGSKFPHFRDRDTSFLWRFRNGILCDLLLLENQLPFFLLEQLCQLCISSADELQEVSNFSELAK

Query:  HYFIEVNK-MNFVDNNN-DRQGYGIKHFIDLLRMDITGSCSREQP---DVYTSLSCTIWPPSATELHDCGISFKKKSRATMHIEFQDGDGVLSLPEIEVS
         YF EV + M++V+    D     + H +D LR+ +T    R  P   D       + WP +ATELHDCGISF  + R  M++ F++ +GVL +P+I + 
Subjt:  HYFIEVNK-MNFVDNNN-DRQGYGIKHFIDLLRMDITGSCSREQP---DVYTSLSCTIWPPSATELHDCGISFKKKSRATMHIEFQDGDGVLSLPEIEVS

Query:  ETLEVQLRNLIAYERCHGRHHKTLTDDVCNFAFFLSYLINTEKDVELLINKEIIQNSLGSIEEVTSFFNNLSKNLLVRSNIYHNECESMKSYCRRRRHRW
        ++ E+  RN+IAYE CH +     + DV NF  F+ +LINT KDV LL++  IIQN LGS  E+   FN+L KN++V  N+Y  EC  MK YC+ RRHRW
Subjt:  ETLEVQLRNLIAYERCHGRHHKTLTDDVCNFAFFLSYLINTEKDVELLINKEIIQNSLGSIEEVTSFFNNLSKNLLVRSNIYHNECESMKSYCRRRRHRW

Query:  MTSLKRKYFNTPWAAVAFFFTVLITSLTILQAVMAYLNL
        MTSLKR YF TPWA ++F   VL+  LT+LQ V+A++ L
Subjt:  MTSLKRKYFNTPWAAVAFFFTVLITSLTILQAVMAYLNL

A0A6J1HBC3 UPF0481 protein At3g47200-like isoform X35.9e-8544.53Show/hide
Query:  KSVDPDPTDCCIYRVPQPLRSVNPKAYTPRVISIGPFHRNTPDLMANRRKSRYLQDILQFMNIDLYKVVSYLGP-VQRVRNCYAETIEMEDADFLELLAI
        ++V    +D  IY+VP+PLRS+ P+AYTP VISIGP H    DL AN  K  YLQ+ L    +    +V  +    +R R CYAE+IEM   +F+ELL  
Subjt:  KSVDPDPTDCCIYRVPQPLRSVNPKAYTPRVISIGPFHRNTPDLMANRRKSRYLQDILQFMNIDLYKVVSYLGP-VQRVRNCYAETIEMEDADFLELLAI

Query:  DCSFVVMYLIGSKFPHFRDRDTSFLWRFRNGILCDLLLLENQLPFFLLEQLCQLCISSADELQEVSNFSELAKHYFIEVNKMNFVDNNND----RQGYGI
        D  FVVM+LIG  F   R  D S LW+F   + CDL+LLENQLPFFLL+ L  LC SS   L+ V +F EL   YFIE +K               G  +
Subjt:  DCSFVVMYLIGSKFPHFRDRDTSFLWRFRNGILCDLLLLENQLPFFLLEQLCQLCISSADELQEVSNFSELAKHYFIEVNKMNFVDNNND----RQGYGI

Query:  KHFIDLLRMDITGSCSREQPDVYTSLSCTIWPPSATELHDCGISFKKKSRATMHIEFQDGDGVLSLPEIEVSETLEVQLRNLIAYERCHGRHHKTLTDDV
         HF+DLLR+  T + S E     TS   T WPP+AT+LH+CG+ FK        I F+D  G L LP+I + +  E ++RNLIAYE+CH      L ++V
Subjt:  KHFIDLLRMDITGSCSREQPDVYTSLSCTIWPPSATELHDCGISFKKKSRATMHIEFQDGDGVLSLPEIEVSETLEVQLRNLIAYERCHGRHHKTLTDDV

Query:  CNFAFFLSYLINTEKDVELLINKEIIQNSLGSIEEVTSFFNNLSKNLLVRSNIYHNECESMKSYCRRRRHRWMTSLKRKYFNTPWAAVAFFFTVLITSLT
         NFA F+  L+ T++DV+LLI   II N+ GSI EVT  FNNL K++    N Y+ +C+ MK YC+R RHRW++ L+R YF+TPW   +    +L+ +LT
Subjt:  CNFAFFLSYLINTEKDVELLINKEIIQNSLGSIEEVTSFFNNLSKNLLVRSNIYHNECESMKSYCRRRRHRWMTSLKRKYFNTPWAAVAFFFTVLITSLT

Query:  ILQAVMAYLNL
        ++Q ++A ++L
Subjt:  ILQAVMAYLNL

A0A6J1HD53 UPF0481 protein At3g47200-like isoform X15.9e-8544.53Show/hide
Query:  KSVDPDPTDCCIYRVPQPLRSVNPKAYTPRVISIGPFHRNTPDLMANRRKSRYLQDILQFMNIDLYKVVSYLGP-VQRVRNCYAETIEMEDADFLELLAI
        ++V    +D  IY+VP+PLRS+ P+AYTP VISIGP H    DL AN  K  YLQ+ L    +    +V  +    +R R CYAE+IEM   +F+ELL  
Subjt:  KSVDPDPTDCCIYRVPQPLRSVNPKAYTPRVISIGPFHRNTPDLMANRRKSRYLQDILQFMNIDLYKVVSYLGP-VQRVRNCYAETIEMEDADFLELLAI

Query:  DCSFVVMYLIGSKFPHFRDRDTSFLWRFRNGILCDLLLLENQLPFFLLEQLCQLCISSADELQEVSNFSELAKHYFIEVNKMNFVDNNND----RQGYGI
        D  FVVM+LIG  F   R  D S LW+F   + CDL+LLENQLPFFLL+ L  LC SS   L+ V +F EL   YFIE +K               G  +
Subjt:  DCSFVVMYLIGSKFPHFRDRDTSFLWRFRNGILCDLLLLENQLPFFLLEQLCQLCISSADELQEVSNFSELAKHYFIEVNKMNFVDNNND----RQGYGI

Query:  KHFIDLLRMDITGSCSREQPDVYTSLSCTIWPPSATELHDCGISFKKKSRATMHIEFQDGDGVLSLPEIEVSETLEVQLRNLIAYERCHGRHHKTLTDDV
         HF+DLLR+  T + S E     TS   T WPP+AT+LH+CG+ FK        I F+D  G L LP+I + +  E ++RNLIAYE+CH      L ++V
Subjt:  KHFIDLLRMDITGSCSREQPDVYTSLSCTIWPPSATELHDCGISFKKKSRATMHIEFQDGDGVLSLPEIEVSETLEVQLRNLIAYERCHGRHHKTLTDDV

Query:  CNFAFFLSYLINTEKDVELLINKEIIQNSLGSIEEVTSFFNNLSKNLLVRSNIYHNECESMKSYCRRRRHRWMTSLKRKYFNTPWAAVAFFFTVLITSLT
         NFA F+  L+ T++DV+LLI   II N+ GSI EVT  FNNL K++    N Y+ +C+ MK YC+R RHRW++ L+R YF+TPW   +    +L+ +LT
Subjt:  CNFAFFLSYLINTEKDVELLINKEIIQNSLGSIEEVTSFFNNLSKNLLVRSNIYHNECESMKSYCRRRRHRWMTSLKRKYFNTPWAAVAFFFTVLITSLT

Query:  ILQAVMAYLNL
        ++Q ++A ++L
Subjt:  ILQAVMAYLNL

SwissProt top hitse value%identityAlignment
P0C897 Putative UPF0481 protein At3g026457.5e-1327.93Show/hide
Query:  VSRVVESLKTSLKSVDPDPTDCCIYRVPQPLRSVNPKAYTPRVISIGPFHRNTPDLMANRRKSRYLQDIL--QFMNIDLYKVVSYLGPVQ-RVRNCYAET
        V  V +SL   L+  D +     I+ VP+ L   +P +YTP  +SIGP+H   P+L    R    +   +  Q+ +   + +V  L  ++ ++R CY + 
Subjt:  VSRVVESLKTSLKSVDPDPTDCCIYRVPQPLRSVNPKAYTPRVISIGPFHRNTPDLMANRRKSRYLQDIL--QFMNIDLYKVVSYLGPVQ-RVRNCYAET

Query:  IEMEDADFLELLAIDCSFVVMYLIGSKFPHFRDRDTSFLWRFRNGILCDLLLLENQLPFFLLEQLCQLCISSADELQEV
        I       L ++A+D SF++ +L   K   FR  +T       N IL D++++ENQ+P F+L +  +  + S +   ++
Subjt:  IEMEDADFLELLAIDCSFVVMYLIGSKFPHFRDRDTSFLWRFRNGILCDLLLLENQLPFFLLEQLCQLCISSADELQEV

Q9SD53 UPF0481 protein At3g472001.3e-2826.22Show/hide
Query:  CCIYRVPQPLRSVNPKAYTPRVISIGPFHRNTPDL-MANRRKSRYLQ---DILQFMNIDLYKVVSYLGPVQ-RVRNCYAETIEMEDADFLELLAIDCSFV
        CCI+RVP+   ++NPKAY P+V+SIGP+H     L M  + K R LQ   D  +  +++   +V  +  ++ ++R  Y+E ++    D + ++ +D  F+
Subjt:  CCIYRVPQPLRSVNPKAYTPRVISIGPFHRNTPDL-MANRRKSRYLQ---DILQFMNIDLYKVVSYLGPVQ-RVRNCYAETIEMEDADFLELLAIDCSFV

Query:  VM--YLIGSKFPHFRDRDTSFLWRFRNGILCDLLLLENQLPFFLLEQLCQLCISSADELQEVSNFSELAKHYFIEVNKMNFVDNNND----RQGYGIKHF
        +M   ++        D   S  W   + I  DLLLLENQ+PFF+L+ L         ++   S+ + +A H+F      N +D         + Y  KH 
Subjt:  VM--YLIGSKFPHFRDRDTSFLWRFRNGILCDLLLLENQLPFFLLEQLCQLCISSADELQEVSNFSELAKHYFIEVNKMNFVDNNND----RQGYGIKHF

Query:  IDLLRMDITGSCSR----EQPDVYTSL-------------SCTIWPPSATELHDCGISFK-KKSRATMHIEFQDGDGVLSLPEIEVSETLEVQLRNLIAY
        +DL+R     + S       P V   L                    SA  L   GI F+ ++S+    +  +     L +P++     +     N +A+
Subjt:  IDLLRMDITGSCSR----EQPDVYTSL-------------SCTIWPPSATELHDCGISFK-KKSRATMHIEFQDGDGVLSLPEIEVSETLEVQLRNLIAY

Query:  ERCHGRHHKTLTDDVCNFAFFLSYLINTEKDVELLIN-KEIIQNSLGSIEEVTSFFNNLSKNLL--VRSNIYHNECESMKSYCRRRRHRWMTSLKRKYFN
        E    + +   ++++  +  F+  L+N E+DV  L N K II+N  GS  EV+ FF  +SK+++  V ++  +N  + +  Y ++  +      +  +F 
Subjt:  ERCHGRHHKTLTDDVCNFAFFLSYLINTEKDVELLIN-KEIIQNSLGSIEEVTSFFNNLSKNLL--VRSNIYHNECESMKSYCRRRRHRWMTSLKRKYFN

Query:  TPW---AAVAFFFTVLITSLTILQAVMAYLN
        +PW   ++ A  F +L+T L    A+++YLN
Subjt:  TPW---AAVAFFFTVLITSLTILQAVMAYLN

Arabidopsis top hitse value%identityAlignment
AT2G28580.1 Plant protein of unknown function (DUF247)2.8e-3930.89Show/hide
Query:  PTDCCIYRVPQPLRSVNPKAYTPRVISIGPFHRNTPDLMANRRKS--RYLQDILQFMNIDLYK------VVSYLG--PVQR-----------VRNCYAE-
        P  CCIYRVP  LR VNP+AYTP+++ IGP H +       R K+  RY    L ++N++L+K      +    G  PV+            +R+ YAE 
Subjt:  PTDCCIYRVPQPLRSVNPKAYTPRVISIGPFHRNTPDLMANRRKS--RYLQDILQFMNIDLYK------VVSYLG--PVQR-----------VRNCYAE-

Query:  TIEMEDADFLELLAIDCSFVVMYLI--GSKFPHFRDRDTSF-LWRFRN--GILCDLLLLENQLPFFLLEQLCQLCISSADELQEVSNFSELAKHYFIEVN
        TI +   DF+E++  D  F++++ I  GS     +  D  F   R  N   IL DL+LLENQLP+ LLE+L +   S+ +            K  F ++ 
Subjt:  TIEMEDADFLELLAIDCSFVVMYLI--GSKFPHFRDRDTSF-LWRFRN--GILCDLLLLENQLPFFLLEQLCQLCISSADELQEVSNFSELAKHYFIEVN

Query:  KMNFVDNNNDRQGYGIKHFIDLLRMDITGSCSREQPDVYTSLSCTIWPP-------SATELHDCGISF---KKKSRATMHIEFQDGDGVLSLPEIEVSET
           F      ++    +HF DL R     + S  +  +  + +    PP       +A +L   G++F    +++  ++ I F+  DG+L +P   V + 
Subjt:  KMNFVDNNNDRQGYGIKHFIDLLRMDITGSCSREQPDVYTSLSCTIWPP-------SATELHDCGISF---KKKSRATMHIEFQDGDGVLSLPEIEVSET

Query:  LEVQLRNLIAYERCHGRHHKTLTDDVCNFAFFLSYLINTEKDVELLINKEIIQNSLGSIEEVTSFFNNLSKNLLVRSNIYHNECESMKSYCRRRRHRWMT
         E  +RNL+A E+C    H   T  VC++  FL +LINT++DV+LL  K I++N LG    VT   N L   L+   + Y +  E++  +   R +R + 
Subjt:  LEVQLRNLIAYERCHGRHHKTLTDDVCNFAFFLSYLINTEKDVELLINKEIIQNSLGSIEEVTSFFNNLSKNLLVRSNIYHNECESMKSYCRRRRHRWMT

Query:  SLKRKYFNTPWAAVAFFFTVLITSLTILQAVMAYLNL
        +L+R YF   W   A    V++  LT++Q V + L +
Subjt:  SLKRKYFNTPWAAVAFFFTVLITSLTILQAVMAYLNL

AT2G36430.1 Plant protein of unknown function (DUF247)1.2e-3429.47Show/hide
Query:  LKSVDPDPTDCCIYRVPQPLRSVNPKAYTPRVISIGPFHRNTPDL-MANRRKSRYLQDIL-QFMNIDLYKVVSYLGPVQRV-RNCYAETIEMEDADFLEL
        L S    PT C I+RVPQ +   N + Y PRV+SIGP+HR    L M    K RYL  +L +  N+ L   +  +  V+ V R CY+ETI M+  +F E+
Subjt:  LKSVDPDPTDCCIYRVPQPLRSVNPKAYTPRVISIGPFHRNTPDL-MANRRKSRYLQDIL-QFMNIDLYKVVSYLGPVQRV-RNCYAETIEMEDADFLEL

Query:  LAID-CSFVVMYLIGSKFPHFRDRD--TSFLWRFRNGILCDLLLLENQLPFFLLEQLCQLCISSADELQEVSNFSELAKHYFIEVNKMNFVDNNNDR-QG
        + +D C  + ++   +    F   D   +  W        D L LENQ+PFF+LE L  L     +E +  ++   LA  +F   N M+  + +  R + 
Subjt:  LAID-CSFVVMYLIGSKFPHFRDRD--TSFLWRFRNGILCDLLLLENQLPFFLLEQLCQLCISSADELQEVSNFSELAKHYFIEVNKMNFVDNNNDR-QG

Query:  YGIKHFIDLLRMDITGSCSREQPDVYTSLSCTIWPP----SATELHDCGISFKKKSRATMHIEFQDGDGVLSLPEIEVSETLEVQLRNLIAYERCHGRHH
           KH +DLLR           P   T+      P     S ++L   GI  ++   A   +  +   G + +P I V + +   L N +AYE+CH    
Subjt:  YGIKHFIDLLRMDITGSCSREQPDVYTSLSCTIWPP----SATELHDCGISFKKKSRATMHIEFQDGDGVLSLPEIEVSETLEVQLRNLIAYERCHGRHH

Query:  KTLTDDVCNFAFFLSYLINTEKDVELLINKEIIQNSLGSIEEVTSFFNNLSKNLL--VRSNIYHNECESMKSYCRRRRHRWMTSLKRKYFNTPWAAVAFF
           T     +A  L  L NT KDVE L ++ II+N  G+  E+  F N+L +++   +      +  E +  Y +   H    + K  YFN+PW+ V+  
Subjt:  KTLTDDVCNFAFFLSYLINTEKDVELLINKEIIQNSLGSIEEVTSFFNNLSKNLL--VRSNIYHNECESMKSYCRRRRHRWMTSLKRKYFNTPWAAVAFF

Query:  FTVLITSLTILQAV
          +++  L+++Q +
Subjt:  FTVLITSLTILQAV

AT2G44930.1 Plant protein of unknown function (DUF247)3.8e-3628.44Show/hide
Query:  CCIYRVPQPLRSVNPKAYTPRVISIGPFH--RNTPDLMANRRKSRYLQDILQFMNIDLYKVVSYLGPVQR--------------------VRNCYAETIE
        CCIYRVP  LR VNP+AYTP+++ IGP    +    L  ++   RY    L +MN++L+K   YL  +                      VR  YAE+ +
Subjt:  CCIYRVPQPLRSVNPKAYTPRVISIGPFH--RNTPDLMANRRKSRYLQDILQFMNIDLYKVVSYLGPVQR--------------------VRNCYAETIE

Query:  -MEDADFLELLAIDCSFVVMYLIGSKFPH---------FRDRDTSFLWR-FRNGILCDLLLLENQLPFFLLEQLCQLCISSADELQEVSNFSELAKHYFI
         ++  DF++++ +D  F++   I ++            F + D  F        IL DL+LLENQLP+ LLE+L +   ++              K  F 
Subjt:  -MEDADFLELLAIDCSFVVMYLIGSKFPH---------FRDRDTSFLWR-FRNGILCDLLLLENQLPFFLLEQLCQLCISSADELQEVSNFSELAKHYFI

Query:  EVNKMNFVDNNNDRQGYGIKHFIDLLR----------MDITGSCSREQPDVYTSLSCTIWPPSATELHDCGISF---KKKSRATMHIEFQDGDGVLSLPE
        ++    F      ++G   +HF DL R           +   S   + P++  SL       +A +L   G+ F   ++K+  ++ I F+   G+L +P 
Subjt:  EVNKMNFVDNNNDRQGYGIKHFIDLLR----------MDITGSCSREQPDVYTSLSCTIWPPSATELHDCGISF---KKKSRATMHIEFQDGDGVLSLPE

Query:  IEVSETLEVQLRNLIAYERCHGRHHKTLTDDVCNFAFFLSYLINTEKDVELLINKEIIQNSLGSIEEVTSFFNNLSKNLLVRSNIYHNECESMKSYCRRR
            +  E  +RNL+A E+C    H  LT  VCN+  FL +LI+T++DV+LL+ K +I+N LG    V    N L   L+   + Y+   + +  +   R
Subjt:  IEVSETLEVQLRNLIAYERCHGRHHKTLTDDVCNFAFFLSYLINTEKDVELLINKEIIQNSLGSIEEVTSFFNNLSKNLLVRSNIYHNECESMKSYCRRR

Query:  RHRWMTSLKRKYFNTPWAAVAFFFTVLITSLTILQAVMAYLNL
        R+R + +L+R YF   W   A     +I  LT++  V + L +
Subjt:  RHRWMTSLKRKYFNTPWAAVAFFFTVLITSLTILQAVMAYLNL

AT4G31980.1 unknown protein6.9e-5432.55Show/hide
Query:  VVESLKTSLKSVDPDPTDCCIYRVPQPLRSVNPKAYTPRVISIGPFHRNTPDLMA-NRRKSRYLQDILQFMNIDLYKVVSYLGP-VQRVRNCYAETIEME
        +V+S+K  L  +    T CCIY+VP  LR +NP AYTPR++S GP HR   +L A   +K RYL   +   N  L  +V       Q  R+CYAE +++ 
Subjt:  VVESLKTSLKSVDPDPTDCCIYRVPQPLRSVNPKAYTPRVISIGPFHRNTPDLMA-NRRKSRYLQDILQFMNIDLYKVVSYLGP-VQRVRNCYAETIEME

Query:  DADFLELLAIDCSFVVMYLIGSKFPHFRDRDTSFLWRFRNGILC-----DLLLLENQLPFFLLEQLCQLCISSADELQEVSNFSELAKHYFIEVNKMNFV
          +F+E+L +D SF+V  L+ S +P  R  +      F N ++      D++L+ENQLPFF+++++  L ++     Q   +  +LA+ +F       F+
Subjt:  DADFLELLAIDCSFVVMYLIGSKFPHFRDRDTSFLWRFRNGILC-----DLLLLENQLPFFLLEQLCQLCISSADELQEVSNFSELAKHYFIEVNKMNFV

Query:  DNNNDRQGY-GIKHFIDLLRMDITGSCSREQPDV---YTSLSCTIWPPSATELHDCGISFKKKSRATMHIEFQDGDGVLSLPEIEVSETLEVQLRNLIAY
           +D +     +HF+DLLR     SC   Q  +   YT++      P ATELH  G+ FK    ++  ++    DGVL +P I V +  E   +N+I +
Subjt:  DNNNDRQGY-GIKHFIDLLRMDITGSCSREQPDV---YTSLSCTIWPPSATELHDCGISFKKKSRATMHIEFQDGDGVLSLPEIEVSETLEVQLRNLIAY

Query:  ERCHGRHHKTLTDDVCNFAFFLSYLINTEKDVELLINKEIIQNSLGSIEEVTSFFNNLSKNLLVRSNIYHNE-CESMKSYCRRRRHRWMTSLKRKYFNTP
        E+C   +   L     ++   L   I +  D +LLI+  II N LG+  +V++ FN++SK ++     Y +   E++++YC    +RW   L+R YF+ P
Subjt:  ERCHGRHHKTLTDDVCNFAFFLSYLINTEKDVELLINKEIIQNSLGSIEEVTSFFNNLSKNLLVRSNIYHNE-CESMKSYCRRRRHRWMTSLKRKYFNTP

Query:  WAAVAFFFTVLITSLTILQAVMAYLNL
        WA  + F  +L+  LT +Q+V + L L
Subjt:  WAAVAFFFTVLITSLTILQAVMAYLNL

AT5G22550.2 Plant protein of unknown function (DUF247)4.2e-3527.31Show/hide
Query:  CCIYRVPQPLRSVNPKAYTPRVISIGPFHRNTPD---LMANRRKSRYLQDILQFMN------IDLYKVVSYLGPVQRVRNCYAETIEMEDADFLELLAID
        CCIYR+P  L+ VN KAY P+++SIGP+H ++      M    K RYL+  +          I L  +VS  G  Q++R+ Y+E +E      ++++ +D
Subjt:  CCIYRVPQPLRSVNPKAYTPRVISIGPFHRNTPD---LMANRRKSRYLQDILQFMN------IDLYKVVSYLGPVQRVRNCYAETIEMEDADFLELLAID

Query:  CSFVVM--YLIGSKFPHFRDRDTSFLWRFRNGIL-CDLLLLENQLPFFLLEQLCQLCISSADELQEVSNFSELAKHYF-IEVNK-MNFVDNNNDRQGYGI
          F++M   ++  K  +   +D  F  R+    L  DLLLLENQ+P FLL+ L +       +L   ++ + LA  +F   + K   F + +N+ +    
Subjt:  CSFVVM--YLIGSKFPHFRDRDTSFLWRFRNGIL-CDLLLLENQLPFFLLEQLCQLCISSADELQEVSNFSELAKHYF-IEVNK-MNFVDNNNDRQGYGI

Query:  KHFIDLLRMDITG------------------------------------SCSREQPDVYTSLSCTIWPP---------SATELHDCGISFKKKSRATMHI
        KH +DL+R                                         SCS+E     TS      PP         SA +L   GI F +K      +
Subjt:  KHFIDLLRMDITG------------------------------------SCSREQPDVYTSLSCTIWPP---------SATELHDCGISFKKKSRATMHI

Query:  EFQDGDGVLSLPEIEVSETLEVQLRNLIAYERCHGRHHKTLTDDVCNFAFFLSYLINTEKDVELLINKEIIQNSLGSIEEVTSFFNNLSKNL--LVRSNI
        +     G++ +P +   + +   L N +A+E    + + + + ++ +F  F+  LINTE D   LI K I++N  G+ EEV+ FF N+ K++   +  + 
Subjt:  EFQDGDGVLSLPEIEVSETLEVQLRNLIAYERCHGRHHKTLTDDVCNFAFFLSYLINTEKDVELLINKEIIQNSLGSIEEVTSFFNNLSKNL--LVRSNI

Query:  YHNECESMKSYCRRRRHRWMTSLKRKYFNTPWAAVAFFFTVLITSLTILQAVMA
          N  E +  Y  +  H      K  +FNTPW  ++    +++  LTI QA  A
Subjt:  YHNECESMKSYCRRRRHRWMTSLKRKYFNTPWAAVAFFFTVLITSLTILQAVMA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGCATGGTGAAACAAGTCGGAACACCAAACCAAAAGAAATTGTCAGTAGAGTGGTCGAATCCCTGAAAACAAGTCTTAAAAGTGTTGATCCAGATCCTACAGATTG
TTGCATATACAGGGTTCCACAACCTCTGCGCAGTGTCAACCCGAAAGCTTATACTCCCAGAGTCATCTCCATTGGCCCTTTTCATCGCAATACACCAGATCTGATGGCTA
ATAGACGTAAAAGTAGGTACCTTCAAGACATTCTTCAGTTTATGAATATCGACTTGTACAAGGTTGTATCCTACCTCGGACCTGTGCAAAGAGTTCGCAATTGTTATGCA
GAAACTATTGAGATGGAGGATGCAGATTTTTTGGAACTTCTAGCCATTGATTGTTCTTTCGTCGTCATGTATCTTATTGGTTCTAAGTTCCCTCACTTTCGAGATAGAGA
TACGTCGTTTTTATGGAGATTCAGGAATGGAATATTATGTGATCTACTGCTACTTGAAAACCAACTTCCTTTCTTCCTCCTTGAACAACTATGCCAGTTGTGCATCTCGT
CTGCAGATGAACTGCAAGAAGTCAGTAATTTCTCTGAACTTGCAAAACACTATTTCATTGAGGTTAACAAAATGAATTTTGTTGACAATAACAATGATCGGCAAGGTTAT
GGAATAAAACATTTTATTGATCTTTTAAGAATGGACATCACCGGTTCATGTTCTCGAGAACAACCAGATGTTTACACCTCCCTTAGTTGCACGATTTGGCCGCCTTCAGC
CACCGAGCTCCACGACTGCGGCATCTCTTTCAAGAAGAAATCAAGAGCCACTATGCACATAGAGTTTCAAGACGGTGATGGTGTTCTCAGTCTTCCAGAAATCGAAGTAT
CTGAAACTTTGGAAGTCCAACTGAGAAATCTCATAGCTTATGAGCGATGTCATGGCAGGCACCACAAGACATTGACAGACGATGTATGCAACTTCGCTTTCTTCTTGAGT
TACTTGATCAACACAGAAAAGGACGTGGAATTGCTCATCAATAAAGAGATCATACAAAACAGTTTGGGCAGCATTGAGGAAGTTACCAGCTTTTTCAACAACCTAAGTAA
AAACCTCCTCGTTCGAAGTAATATATACCATAATGAATGTGAGAGCATGAAAAGTTACTGCAGGCGCCGTCGGCATCGGTGGATGACTTCATTGAAACGCAAATATTTCA
ACACGCCCTGGGCTGCCGTCGCCTTCTTTTTCACCGTTCTCATCACTAGTCTTACCATACTGCAAGCAGTGATGGCATATCTCAACTTACGAAGGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAGCATGGTGAAACAAGTCGGAACACCAAACCAAAAGAAATTGTCAGTAGAGTGGTCGAATCCCTGAAAACAAGTCTTAAAAGTGTTGATCCAGATCCTACAGATTG
TTGCATATACAGGGTTCCACAACCTCTGCGCAGTGTCAACCCGAAAGCTTATACTCCCAGAGTCATCTCCATTGGCCCTTTTCATCGCAATACACCAGATCTGATGGCTA
ATAGACGTAAAAGTAGGTACCTTCAAGACATTCTTCAGTTTATGAATATCGACTTGTACAAGGTTGTATCCTACCTCGGACCTGTGCAAAGAGTTCGCAATTGTTATGCA
GAAACTATTGAGATGGAGGATGCAGATTTTTTGGAACTTCTAGCCATTGATTGTTCTTTCGTCGTCATGTATCTTATTGGTTCTAAGTTCCCTCACTTTCGAGATAGAGA
TACGTCGTTTTTATGGAGATTCAGGAATGGAATATTATGTGATCTACTGCTACTTGAAAACCAACTTCCTTTCTTCCTCCTTGAACAACTATGCCAGTTGTGCATCTCGT
CTGCAGATGAACTGCAAGAAGTCAGTAATTTCTCTGAACTTGCAAAACACTATTTCATTGAGGTTAACAAAATGAATTTTGTTGACAATAACAATGATCGGCAAGGTTAT
GGAATAAAACATTTTATTGATCTTTTAAGAATGGACATCACCGGTTCATGTTCTCGAGAACAACCAGATGTTTACACCTCCCTTAGTTGCACGATTTGGCCGCCTTCAGC
CACCGAGCTCCACGACTGCGGCATCTCTTTCAAGAAGAAATCAAGAGCCACTATGCACATAGAGTTTCAAGACGGTGATGGTGTTCTCAGTCTTCCAGAAATCGAAGTAT
CTGAAACTTTGGAAGTCCAACTGAGAAATCTCATAGCTTATGAGCGATGTCATGGCAGGCACCACAAGACATTGACAGACGATGTATGCAACTTCGCTTTCTTCTTGAGT
TACTTGATCAACACAGAAAAGGACGTGGAATTGCTCATCAATAAAGAGATCATACAAAACAGTTTGGGCAGCATTGAGGAAGTTACCAGCTTTTTCAACAACCTAAGTAA
AAACCTCCTCGTTCGAAGTAATATATACCATAATGAATGTGAGAGCATGAAAAGTTACTGCAGGCGCCGTCGGCATCGGTGGATGACTTCATTGAAACGCAAATATTTCA
ACACGCCCTGGGCTGCCGTCGCCTTCTTTTTCACCGTTCTCATCACTAGTCTTACCATACTGCAAGCAGTGATGGCATATCTCAACTTACGAAGGTAA
Protein sequenceShow/hide protein sequence
MEHGETSRNTKPKEIVSRVVESLKTSLKSVDPDPTDCCIYRVPQPLRSVNPKAYTPRVISIGPFHRNTPDLMANRRKSRYLQDILQFMNIDLYKVVSYLGPVQRVRNCYA
ETIEMEDADFLELLAIDCSFVVMYLIGSKFPHFRDRDTSFLWRFRNGILCDLLLLENQLPFFLLEQLCQLCISSADELQEVSNFSELAKHYFIEVNKMNFVDNNNDRQGY
GIKHFIDLLRMDITGSCSREQPDVYTSLSCTIWPPSATELHDCGISFKKKSRATMHIEFQDGDGVLSLPEIEVSETLEVQLRNLIAYERCHGRHHKTLTDDVCNFAFFLS
YLINTEKDVELLINKEIIQNSLGSIEEVTSFFNNLSKNLLVRSNIYHNECESMKSYCRRRRHRWMTSLKRKYFNTPWAAVAFFFTVLITSLTILQAVMAYLNLRR