; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0014009 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0014009
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionProtein of unknown function (DUF616)
Genome locationchr1:54392124..54396204
RNA-Seq ExpressionLag0014009
SyntenyLag0014009
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR006852 - Protein of unknown function DUF616


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008439637.1 PREDICTED: uncharacterized protein LOC103484369 isoform X3 [Cucumis melo]1.0e-19888.46Show/hide
Query:  MASTSTPFPPSAPLAPPLSFSQAMQWNSSSPSSDLRYLVANADTFAGNFTAEKRFSYFD-RPPNDSTLAVPCGFLKKFPVRDSDRIAMERCNGVVVVSAI
        MA+TSTPFP S PL  PL  SQAM  NSSSPSS+L YL+AN+D+FAGNFTA KRFS+FD R   ++T+ VPCGFLKKFPVRDSDRIAME CNGVVVVSAI
Subjt:  MASTSTPFPPSAPLAPPLSFSQAMQWNSSSPSSDLRYLVANADTFAGNFTAEKRFSYFD-RPPNDSTLAVPCGFLKKFPVRDSDRIAMERCNGVVVVSAI

Query:  FNDHDKIRQPRGLGSKTLDVVCFFMFVDETTVRGLENHKIIPRRNSSPDI-IGAWRIVRVSGKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLM
        FNDHDKIRQPRGLGSKTLD VCFFMFVDETTV+GLENHKII  +NSSPDI IGAWRIVRVS KNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLM
Subjt:  FNDHDKIRQPRGLGSKTLDVVCFFMFVDETTVRGLENHKIIPRRNSSPDI-IGAWRIVRVSGKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLM

Query:  VDPLLLIHSLIVTENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPTKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELE
        VDPLLLIHSLI+TENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSP KLPYTTDVPDSALILRRHGRGSNLFSCLLFNELE
Subjt:  VDPLLLIHSLIVTENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPTKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELE

Query:  AFNPRDQLAFAFVRDHLTPSIKINMFEEEVFEQVALEYRHNLKKKGYAEPELGPHISKPKRTKRAGPDLLYVNGTCCSKCQNYLLQMWGD
        AFNPRDQLAFAFVRDHLTPSIKINMFE EVFEQVALEYRHNLKK  Y  P+L P ISKPKRTKRAGPDLLYVNG+CCSKC NYLLQMWG+
Subjt:  AFNPRDQLAFAFVRDHLTPSIKINMFEEEVFEQVALEYRHNLKKKGYAEPELGPHISKPKRTKRAGPDLLYVNGTCCSKCQNYLLQMWGD

XP_022926657.1 uncharacterized protein LOC111433726 isoform X1 [Cucurbita moschata]8.7e-22787.61Show/hide
Query:  LWESQVGLRLCYSNQNSSVSLCFIFSPPSSSLSTLLSPLPNASSDPLPSIPSSSLSSPIPPPMASTSTPFPPSAPLAPPLSFSQAMQWNSSSPSSDLRYL
        LWES+VGLRLCYSNQ+SSVSLCF  SPP SSLSTLLSP PNASSDPLPSIPSSSLSSPIPPPMASTSTPFPPSAP APPLS SQ M WNSS  SS+LRYL
Subjt:  LWESQVGLRLCYSNQNSSVSLCFIFSPPSSSLSTLLSPLPNASSDPLPSIPSSSLSSPIPPPMASTSTPFPPSAPLAPPLSFSQAMQWNSSSPSSDLRYL

Query:  VANADTFAGNFTAEKRFSYFDRPPNDSTLAVPCGFLKKFPVRDSDRIAMERCNGVVVVSAIFNDHDKIRQPRGLGSKTLDVVCFFMFVDETTVRGLENHK
          NAD+F GNF+AEKRFSYFD   ++ ++ +PCGFLKKFPV DSD+ AME CNGVVVVSAIFNDHDKIRQPRGLGSKTLD VCFFMFVD+TTVRGLENHK
Subjt:  VANADTFAGNFTAEKRFSYFDRPPNDSTLAVPCGFLKKFPVRDSDRIAMERCNGVVVVSAIFNDHDKIRQPRGLGSKTLDVVCFFMFVDETTVRGLENHK

Query:  IIPRRNSSPDIIGAWRIVRVSGKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIVTENADMAISKHPYYIHTMEEAMATARWK
        IIP  NS PDIIGAWRIVRVS KNLY+NPAMNGVIPKYLVHRLFPN KFSIWVDAKLQLMVDPLLLIHSLIVTE+ADMAISKHPYYIHTMEEAMATARWK
Subjt:  IIPRRNSSPDIIGAWRIVRVSGKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIVTENADMAISKHPYYIHTMEEAMATARWK

Query:  KWWDVDSLKKQMETYCENGLKPWSPTKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFEEEVFEQVALEYRH
        KWWDVDSLK QMETYCENGL+PWSP+KLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFE EVFEQVALEYRH
Subjt:  KWWDVDSLKKQMETYCENGLKPWSPTKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFEEEVFEQVALEYRH

Query:  NLKKKGYAEPELGPHISKPKRTKRAGPDLLYVNGTCCSKCQNYLLQMWGDVS
        NLK K     EL P ISKP RTKRAGPDLLYVNG+CCSKCQ YLLQMWGDVS
Subjt:  NLKKKGYAEPELGPHISKPKRTKRAGPDLLYVNGTCCSKCQNYLLQMWGDVS

XP_022926659.1 uncharacterized protein LOC111433726 isoform X3 [Cucurbita moschata]4.0e-20381.42Show/hide
Query:  LWESQVGLRLCYSNQNSSVSLCFIFSPPSSSLSTLLSPLPNASSDPLPSIPSSSLSSPIPPPMASTSTPFPPSAPLAPPLSFSQAMQWNSSSPSSDLRYL
        LWES+VGLRLCYSNQ+SSVSLCF  SPP SSLSTLLSP PNASSDPLPSIPSSSLSSPIPPPMASTSTPFPPSAP APPLS SQ M WNSS  SS+LRYL
Subjt:  LWESQVGLRLCYSNQNSSVSLCFIFSPPSSSLSTLLSPLPNASSDPLPSIPSSSLSSPIPPPMASTSTPFPPSAPLAPPLSFSQAMQWNSSSPSSDLRYL

Query:  VANADTFAGNFTAEKRFSYFDRPPNDSTLAVPCGFLKKFPVRDSDRIAMERCNGVVVVSAIFNDHDKIRQPRGLGSKTLDVVCFFMFVDETTVRGLENHK
                                                   +D+ AME CNGVVVVSAIFNDHDKIRQPRGLGSKTLD VCFFMFVD+TTVRGLENHK
Subjt:  VANADTFAGNFTAEKRFSYFDRPPNDSTLAVPCGFLKKFPVRDSDRIAMERCNGVVVVSAIFNDHDKIRQPRGLGSKTLDVVCFFMFVDETTVRGLENHK

Query:  IIPRRNSSPDIIGAWRIVRVSGKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIVTENADMAISKHPYYIHTMEEAMATARWK
        IIP  NS PDIIGAWRIVRVS KNLY+NPAMNGVIPKYLVHRLFPN KFSIWVDAKLQLMVDPLLLIHSLIVTE+ADMAISKHPYYIHTMEEAMATARWK
Subjt:  IIPRRNSSPDIIGAWRIVRVSGKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIVTENADMAISKHPYYIHTMEEAMATARWK

Query:  KWWDVDSLKKQMETYCENGLKPWSPTKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFEEEVFEQVALEYRH
        KWWDVDSLK QMETYCENGL+PWSP+KLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFE EVFEQVALEYRH
Subjt:  KWWDVDSLKKQMETYCENGLKPWSPTKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFEEEVFEQVALEYRH

Query:  NLKKKGYAEPELGPHISKPKRTKRAGPDLLYVNGTCCSKCQNYLLQMWGDVS
        NLK K     EL P ISKP RTKRAGPDLLYVNG+CCSKCQ YLLQMWGDVS
Subjt:  NLKKKGYAEPELGPHISKPKRTKRAGPDLLYVNGTCCSKCQNYLLQMWGDVS

XP_023003354.1 uncharacterized protein LOC111496987 [Cucurbita maxima]1.6e-22587.17Show/hide
Query:  LWESQVGLRLCYSNQNSSVSLCFIFSPPSSSLSTLLSPLPNASSDPLPSIPSSSLSSPIPPPMASTSTPFPPSAPLAPPLSFSQAMQWNSSSPSSDLRYL
        LW S+VGLRLCYSNQ+SSVSLCF  SPP SSLSTLLSP PNASSD LPSIPSSSLSSPIPPPMASTSTPFPPSAP APPLS SQ M WNSS  SS+LRYL
Subjt:  LWESQVGLRLCYSNQNSSVSLCFIFSPPSSSLSTLLSPLPNASSDPLPSIPSSSLSSPIPPPMASTSTPFPPSAPLAPPLSFSQAMQWNSSSPSSDLRYL

Query:  VANADTFAGNFTAEKRFSYFDRPPNDSTLAVPCGFLKKFPVRDSDRIAMERCNGVVVVSAIFNDHDKIRQPRGLGSKTLDVVCFFMFVDETTVRGLENHK
          NAD+F GNF+AEKRFSYFD   ++ ++ +PCGFLK FPV DSD+ AME CNGVVVVSAIFNDHDKIRQPRGLGSKTLD VCFFMFVD+TTVRGLENHK
Subjt:  VANADTFAGNFTAEKRFSYFDRPPNDSTLAVPCGFLKKFPVRDSDRIAMERCNGVVVVSAIFNDHDKIRQPRGLGSKTLDVVCFFMFVDETTVRGLENHK

Query:  IIPRRNSSPDIIGAWRIVRVSGKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIVTENADMAISKHPYYIHTMEEAMATARWK
        IIP RNS PDIIGAWRIVRVS KNLY+NPAMNGVIPKYLVHRLFPN KFSIWVDAKLQLMVDPLLLIHSLIVT++ADMAISKHPYYIHTMEEAMATARWK
Subjt:  IIPRRNSSPDIIGAWRIVRVSGKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIVTENADMAISKHPYYIHTMEEAMATARWK

Query:  KWWDVDSLKKQMETYCENGLKPWSPTKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFEEEVFEQVALEYRH
        KWWDVDSLK QMETYCENGL+PWSP KLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFE EVFEQVALEYRH
Subjt:  KWWDVDSLKKQMETYCENGLKPWSPTKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFEEEVFEQVALEYRH

Query:  NLKKKGYAEPELGPHISKPKRTKRAGPDLLYVNGTCCSKCQNYLLQMWGDVS
        NLK K    PEL P ISKP RTKRAGPDLLYVNG+CCSKCQ YLLQMWGDVS
Subjt:  NLKKKGYAEPELGPHISKPKRTKRAGPDLLYVNGTCCSKCQNYLLQMWGDVS

XP_023517976.1 uncharacterized protein LOC111781548 isoform X1 [Cucurbita pepo subsp. pepo]2.3e-22787.61Show/hide
Query:  LWESQVGLRLCYSNQNSSVSLCFIFSPPSSSLSTLLSPLPNASSDPLPSIPSSSLSSPIPPPMASTSTPFPPSAPLAPPLSFSQAMQWNSSSPSSDLRYL
        LWES+VGLRLCYSNQ+SSVSLCF  SPP SSLSTLLSP PNASSDPLPSIPSSSLSSPIPPPMASTSTPFPPSAP APPLS SQ M WNSS  SS+LRYL
Subjt:  LWESQVGLRLCYSNQNSSVSLCFIFSPPSSSLSTLLSPLPNASSDPLPSIPSSSLSSPIPPPMASTSTPFPPSAPLAPPLSFSQAMQWNSSSPSSDLRYL

Query:  VANADTFAGNFTAEKRFSYFDRPPNDSTLAVPCGFLKKFPVRDSDRIAMERCNGVVVVSAIFNDHDKIRQPRGLGSKTLDVVCFFMFVDETTVRGLENHK
          NAD+F GNF+AEKRFSYFD   ++ ++ +PCGFLKKFPV DSD+ AME CNGVVVVSAIFNDHDKIRQPRGLGSKTLD VCFFMFVD+TTVRGLENHK
Subjt:  VANADTFAGNFTAEKRFSYFDRPPNDSTLAVPCGFLKKFPVRDSDRIAMERCNGVVVVSAIFNDHDKIRQPRGLGSKTLDVVCFFMFVDETTVRGLENHK

Query:  IIPRRNSSPDIIGAWRIVRVSGKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIVTENADMAISKHPYYIHTMEEAMATARWK
        +IP RNS PDIIGAWRIVRVS KNLY+NPAMNGVIPKYLVHRLFPN KFSIWVDAKLQLMVDPLLLIHSLIVTE+ADMAISKHPYYIHTMEEAMATARWK
Subjt:  IIPRRNSSPDIIGAWRIVRVSGKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIVTENADMAISKHPYYIHTMEEAMATARWK

Query:  KWWDVDSLKKQMETYCENGLKPWSPTKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFEEEVFEQVALEYRH
        KWWDVDSLK QMETYCENGL+PWSP+KLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFE EVFEQVALEYRH
Subjt:  KWWDVDSLKKQMETYCENGLKPWSPTKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFEEEVFEQVALEYRH

Query:  NLKKKGYAEPELGPHISKPKRTKRAGPDLLYVNGTCCSKCQNYLLQMWGDVS
        NLK K     EL P ISKP RTKRAGPDLLYVNG+CCSKCQ YLLQMWGDVS
Subjt:  NLKKKGYAEPELGPHISKPKRTKRAGPDLLYVNGTCCSKCQNYLLQMWGDVS

TrEMBL top hitse value%identityAlignment
A0A1S3AYT7 uncharacterized protein LOC103484369 isoform X34.9e-19988.46Show/hide
Query:  MASTSTPFPPSAPLAPPLSFSQAMQWNSSSPSSDLRYLVANADTFAGNFTAEKRFSYFD-RPPNDSTLAVPCGFLKKFPVRDSDRIAMERCNGVVVVSAI
        MA+TSTPFP S PL  PL  SQAM  NSSSPSS+L YL+AN+D+FAGNFTA KRFS+FD R   ++T+ VPCGFLKKFPVRDSDRIAME CNGVVVVSAI
Subjt:  MASTSTPFPPSAPLAPPLSFSQAMQWNSSSPSSDLRYLVANADTFAGNFTAEKRFSYFD-RPPNDSTLAVPCGFLKKFPVRDSDRIAMERCNGVVVVSAI

Query:  FNDHDKIRQPRGLGSKTLDVVCFFMFVDETTVRGLENHKIIPRRNSSPDI-IGAWRIVRVSGKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLM
        FNDHDKIRQPRGLGSKTLD VCFFMFVDETTV+GLENHKII  +NSSPDI IGAWRIVRVS KNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLM
Subjt:  FNDHDKIRQPRGLGSKTLDVVCFFMFVDETTVRGLENHKIIPRRNSSPDI-IGAWRIVRVSGKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLM

Query:  VDPLLLIHSLIVTENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPTKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELE
        VDPLLLIHSLI+TENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSP KLPYTTDVPDSALILRRHGRGSNLFSCLLFNELE
Subjt:  VDPLLLIHSLIVTENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPTKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELE

Query:  AFNPRDQLAFAFVRDHLTPSIKINMFEEEVFEQVALEYRHNLKKKGYAEPELGPHISKPKRTKRAGPDLLYVNGTCCSKCQNYLLQMWGD
        AFNPRDQLAFAFVRDHLTPSIKINMFE EVFEQVALEYRHNLKK  Y  P+L P ISKPKRTKRAGPDLLYVNG+CCSKC NYLLQMWG+
Subjt:  AFNPRDQLAFAFVRDHLTPSIKINMFEEEVFEQVALEYRHNLKKKGYAEPELGPHISKPKRTKRAGPDLLYVNGTCCSKCQNYLLQMWGD

A0A6J1CXF7 uncharacterized protein LOC1110157184.0e-19388.65Show/hide
Query:  QAMQWNSSSPSSDLRYLVANADTFAGNFTAEKRFSYFD-RPPNDSTLAVPCGFLKKFPVRDSDRIAMERCNGVVVVSAIFNDHDKIRQPRGLGSKTLDVV
        Q + WNSS  SS+LRYL+ANADTF GNFTA+ RFS+FD R  NDS++AVPCGFLKKFPV DSDRIAMERC+GVVVVSAIFNDHDKIRQPRGLGSKTL+ V
Subjt:  QAMQWNSSSPSSDLRYLVANADTFAGNFTAEKRFSYFD-RPPNDSTLAVPCGFLKKFPVRDSDRIAMERCNGVVVVSAIFNDHDKIRQPRGLGSKTLDVV

Query:  CFFMFVDETTVRGLENHKIIPRRNSSPDIIGAWRIVRVSGKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIVTENADMAISK
        CFFMFVDETTVRGLE+H +I  RNSSPDIIGAWRIVRVS KNLYENPAMNGVIPKYLVHRLFPNSKFSIW+DAKLQLMVDPLLLIH+LIVTENADMAISK
Subjt:  CFFMFVDETTVRGLENHKIIPRRNSSPDIIGAWRIVRVSGKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIVTENADMAISK

Query:  HPYYIHTMEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPTKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIK
        HPYYIHTMEEAMATARWKKWWDVDSLK QMETYCENGLKPWS  KLPYTTDVPDSA ILR+H R SNLFSCLLFNELEAFNPRDQLAFAFVRDHLTP IK
Subjt:  HPYYIHTMEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPTKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIK

Query:  INMFEEEVFEQVALEYRHNLKKKGYAEPELGPHISKPKRTKRAGPDLLYVNGTCCSKCQNYLLQMWGDVS
        INMFE EVFEQVALEYRHNLKKKGY  P+LGPHISKPKRTKRAGPDLLYVNGTCCSKCQ YLLQMWGD S
Subjt:  INMFEEEVFEQVALEYRHNLKKKGYAEPELGPHISKPKRTKRAGPDLLYVNGTCCSKCQNYLLQMWGDVS

A0A6J1EEZ8 uncharacterized protein LOC111433726 isoform X14.2e-22787.61Show/hide
Query:  LWESQVGLRLCYSNQNSSVSLCFIFSPPSSSLSTLLSPLPNASSDPLPSIPSSSLSSPIPPPMASTSTPFPPSAPLAPPLSFSQAMQWNSSSPSSDLRYL
        LWES+VGLRLCYSNQ+SSVSLCF  SPP SSLSTLLSP PNASSDPLPSIPSSSLSSPIPPPMASTSTPFPPSAP APPLS SQ M WNSS  SS+LRYL
Subjt:  LWESQVGLRLCYSNQNSSVSLCFIFSPPSSSLSTLLSPLPNASSDPLPSIPSSSLSSPIPPPMASTSTPFPPSAPLAPPLSFSQAMQWNSSSPSSDLRYL

Query:  VANADTFAGNFTAEKRFSYFDRPPNDSTLAVPCGFLKKFPVRDSDRIAMERCNGVVVVSAIFNDHDKIRQPRGLGSKTLDVVCFFMFVDETTVRGLENHK
          NAD+F GNF+AEKRFSYFD   ++ ++ +PCGFLKKFPV DSD+ AME CNGVVVVSAIFNDHDKIRQPRGLGSKTLD VCFFMFVD+TTVRGLENHK
Subjt:  VANADTFAGNFTAEKRFSYFDRPPNDSTLAVPCGFLKKFPVRDSDRIAMERCNGVVVVSAIFNDHDKIRQPRGLGSKTLDVVCFFMFVDETTVRGLENHK

Query:  IIPRRNSSPDIIGAWRIVRVSGKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIVTENADMAISKHPYYIHTMEEAMATARWK
        IIP  NS PDIIGAWRIVRVS KNLY+NPAMNGVIPKYLVHRLFPN KFSIWVDAKLQLMVDPLLLIHSLIVTE+ADMAISKHPYYIHTMEEAMATARWK
Subjt:  IIPRRNSSPDIIGAWRIVRVSGKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIVTENADMAISKHPYYIHTMEEAMATARWK

Query:  KWWDVDSLKKQMETYCENGLKPWSPTKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFEEEVFEQVALEYRH
        KWWDVDSLK QMETYCENGL+PWSP+KLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFE EVFEQVALEYRH
Subjt:  KWWDVDSLKKQMETYCENGLKPWSPTKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFEEEVFEQVALEYRH

Query:  NLKKKGYAEPELGPHISKPKRTKRAGPDLLYVNGTCCSKCQNYLLQMWGDVS
        NLK K     EL P ISKP RTKRAGPDLLYVNG+CCSKCQ YLLQMWGDVS
Subjt:  NLKKKGYAEPELGPHISKPKRTKRAGPDLLYVNGTCCSKCQNYLLQMWGDVS

A0A6J1EFS8 uncharacterized protein LOC111433726 isoform X31.9e-20381.42Show/hide
Query:  LWESQVGLRLCYSNQNSSVSLCFIFSPPSSSLSTLLSPLPNASSDPLPSIPSSSLSSPIPPPMASTSTPFPPSAPLAPPLSFSQAMQWNSSSPSSDLRYL
        LWES+VGLRLCYSNQ+SSVSLCF  SPP SSLSTLLSP PNASSDPLPSIPSSSLSSPIPPPMASTSTPFPPSAP APPLS SQ M WNSS  SS+LRYL
Subjt:  LWESQVGLRLCYSNQNSSVSLCFIFSPPSSSLSTLLSPLPNASSDPLPSIPSSSLSSPIPPPMASTSTPFPPSAPLAPPLSFSQAMQWNSSSPSSDLRYL

Query:  VANADTFAGNFTAEKRFSYFDRPPNDSTLAVPCGFLKKFPVRDSDRIAMERCNGVVVVSAIFNDHDKIRQPRGLGSKTLDVVCFFMFVDETTVRGLENHK
                                                   +D+ AME CNGVVVVSAIFNDHDKIRQPRGLGSKTLD VCFFMFVD+TTVRGLENHK
Subjt:  VANADTFAGNFTAEKRFSYFDRPPNDSTLAVPCGFLKKFPVRDSDRIAMERCNGVVVVSAIFNDHDKIRQPRGLGSKTLDVVCFFMFVDETTVRGLENHK

Query:  IIPRRNSSPDIIGAWRIVRVSGKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIVTENADMAISKHPYYIHTMEEAMATARWK
        IIP  NS PDIIGAWRIVRVS KNLY+NPAMNGVIPKYLVHRLFPN KFSIWVDAKLQLMVDPLLLIHSLIVTE+ADMAISKHPYYIHTMEEAMATARWK
Subjt:  IIPRRNSSPDIIGAWRIVRVSGKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIVTENADMAISKHPYYIHTMEEAMATARWK

Query:  KWWDVDSLKKQMETYCENGLKPWSPTKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFEEEVFEQVALEYRH
        KWWDVDSLK QMETYCENGL+PWSP+KLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFE EVFEQVALEYRH
Subjt:  KWWDVDSLKKQMETYCENGLKPWSPTKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFEEEVFEQVALEYRH

Query:  NLKKKGYAEPELGPHISKPKRTKRAGPDLLYVNGTCCSKCQNYLLQMWGDVS
        NLK K     EL P ISKP RTKRAGPDLLYVNG+CCSKCQ YLLQMWGDVS
Subjt:  NLKKKGYAEPELGPHISKPKRTKRAGPDLLYVNGTCCSKCQNYLLQMWGDVS

A0A6J1KT34 uncharacterized protein LOC1114969878.0e-22687.17Show/hide
Query:  LWESQVGLRLCYSNQNSSVSLCFIFSPPSSSLSTLLSPLPNASSDPLPSIPSSSLSSPIPPPMASTSTPFPPSAPLAPPLSFSQAMQWNSSSPSSDLRYL
        LW S+VGLRLCYSNQ+SSVSLCF  SPP SSLSTLLSP PNASSD LPSIPSSSLSSPIPPPMASTSTPFPPSAP APPLS SQ M WNSS  SS+LRYL
Subjt:  LWESQVGLRLCYSNQNSSVSLCFIFSPPSSSLSTLLSPLPNASSDPLPSIPSSSLSSPIPPPMASTSTPFPPSAPLAPPLSFSQAMQWNSSSPSSDLRYL

Query:  VANADTFAGNFTAEKRFSYFDRPPNDSTLAVPCGFLKKFPVRDSDRIAMERCNGVVVVSAIFNDHDKIRQPRGLGSKTLDVVCFFMFVDETTVRGLENHK
          NAD+F GNF+AEKRFSYFD   ++ ++ +PCGFLK FPV DSD+ AME CNGVVVVSAIFNDHDKIRQPRGLGSKTLD VCFFMFVD+TTVRGLENHK
Subjt:  VANADTFAGNFTAEKRFSYFDRPPNDSTLAVPCGFLKKFPVRDSDRIAMERCNGVVVVSAIFNDHDKIRQPRGLGSKTLDVVCFFMFVDETTVRGLENHK

Query:  IIPRRNSSPDIIGAWRIVRVSGKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIVTENADMAISKHPYYIHTMEEAMATARWK
        IIP RNS PDIIGAWRIVRVS KNLY+NPAMNGVIPKYLVHRLFPN KFSIWVDAKLQLMVDPLLLIHSLIVT++ADMAISKHPYYIHTMEEAMATARWK
Subjt:  IIPRRNSSPDIIGAWRIVRVSGKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIVTENADMAISKHPYYIHTMEEAMATARWK

Query:  KWWDVDSLKKQMETYCENGLKPWSPTKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFEEEVFEQVALEYRH
        KWWDVDSLK QMETYCENGL+PWSP KLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFE EVFEQVALEYRH
Subjt:  KWWDVDSLKKQMETYCENGLKPWSPTKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFEEEVFEQVALEYRH

Query:  NLKKKGYAEPELGPHISKPKRTKRAGPDLLYVNGTCCSKCQNYLLQMWGDVS
        NLK K    PEL P ISKP RTKRAGPDLLYVNG+CCSKCQ YLLQMWGDVS
Subjt:  NLKKKGYAEPELGPHISKPKRTKRAGPDLLYVNGTCCSKCQNYLLQMWGDVS

SwissProt top hitse value%identityAlignment
Q9FZ97 Probable hexosyltransferase MUCI708.9e-4934.2Show/hide
Query:  SSPIPPPMASTSTPFPPSAPLA-PPLSFSQAMQWNSSSPSSDLRYLV------ANADT------FAGNFTAEKRFSYFDRPPNDSTLAVPCGFLK-----
        S  +PPP A      P   P+   P+  + A+  N+ S S  L+ L        N +T      F G  T + R   FD      T++V CGF+K     
Subjt:  SSPIPPPMASTSTPFPPSAPLA-PPLSFSQAMQWNSSSPSSDLRYLV------ANADT------FAGNFTAEKRFSYFDRPPNDSTLAVPCGFLK-----

Query:  ---KFPVRDSDRIAMERCNGVVVVSAIFNDHDKIRQPRGLGSKTLDVVCFFMFVDETT------VRGLENHKIIPRRNSSPDIIGAWRIVRVSGKNL-YE
            F + ++D + M++C G+VV SA+F+  D ++ P+ +     + VCF+MFVDE T       RGL+ +K           +G WR+V V   NL Y 
Subjt:  ---KFPVRDSDRIAMERCNGVVVVSAIFNDHDKIRQPRGLGSKTLDVVCFFMFVDETT------VRGLENHKIIPRRNSSPDIIGAWRIVRVSGKNL-YE

Query:  NPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIVTENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPTK
        +   NG +PK LVHR+FPN+++S+W+D KL+L+VDP  ++   +  +NA  AIS+H      + EA A     K +D  S+  Q++ Y   GL P+S  K
Subjt:  NPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIVTENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPTK

Query:  LPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIK--INMFEEEVFEQVALEYRHNLKKKGYA
        LP T+DVP+  +ILR H   SNLF+CL FNE++ F  RDQ++F+ VRD +       ++MF +       ++  H  +++ +A
Subjt:  LPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIK--INMFEEEVFEQVALEYRHNLKKKGYA

Arabidopsis top hitse value%identityAlignment
AT1G28240.1 Protein of unknown function (DUF616)6.4e-5034.2Show/hide
Query:  SSPIPPPMASTSTPFPPSAPLA-PPLSFSQAMQWNSSSPSSDLRYLV------ANADT------FAGNFTAEKRFSYFDRPPNDSTLAVPCGFLK-----
        S  +PPP A      P   P+   P+  + A+  N+ S S  L+ L        N +T      F G  T + R   FD      T++V CGF+K     
Subjt:  SSPIPPPMASTSTPFPPSAPLA-PPLSFSQAMQWNSSSPSSDLRYLV------ANADT------FAGNFTAEKRFSYFDRPPNDSTLAVPCGFLK-----

Query:  ---KFPVRDSDRIAMERCNGVVVVSAIFNDHDKIRQPRGLGSKTLDVVCFFMFVDETT------VRGLENHKIIPRRNSSPDIIGAWRIVRVSGKNL-YE
            F + ++D + M++C G+VV SA+F+  D ++ P+ +     + VCF+MFVDE T       RGL+ +K           +G WR+V V   NL Y 
Subjt:  ---KFPVRDSDRIAMERCNGVVVVSAIFNDHDKIRQPRGLGSKTLDVVCFFMFVDETT------VRGLENHKIIPRRNSSPDIIGAWRIVRVSGKNL-YE

Query:  NPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIVTENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPTK
        +   NG +PK LVHR+FPN+++S+W+D KL+L+VDP  ++   +  +NA  AIS+H      + EA A     K +D  S+  Q++ Y   GL P+S  K
Subjt:  NPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIVTENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPTK

Query:  LPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIK--INMFEEEVFEQVALEYRHNLKKKGYA
        LP T+DVP+  +ILR H   SNLF+CL FNE++ F  RDQ++F+ VRD +       ++MF +       ++  H  +++ +A
Subjt:  LPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIK--INMFEEEVFEQVALEYRHNLKKKGYA

AT1G53040.1 Protein of unknown function (DUF616)1.8e-4434.44Show/hide
Query:  PSSSLSSPIPPPMASTSTPFPPSAP---LAPPLSFSQAMQWNSSSP-SSDLRYL----------VANADTFAGNFTAEKRFSYFDRPPNDSTLAVPCGFL
        P  S S P PPP      P P   P   L P  + +   ++   SP   +L Y+                F G  + E R + FD      ++ V CGF+
Subjt:  PSSSLSSPIPPPMASTSTPFPPSAP---LAPPLSFSQAMQWNSSSP-SSDLRYL----------VANADTFAGNFTAEKRFSYFDRPPNDSTLAVPCGFL

Query:  K--------KFPVRDSDRIAMERCNGVVVVSAIFNDHDKIRQPRGLGSKTLDVVCFFMFVDETTVRGLENHKIIPRRNSSPDIIGAWRIVRVSGKNLYEN
        K         F + +     +++ + V+V SAIF  +D I++P  +       + F+MFVDE T   L+N       N     +G WRI+ V     Y +
Subjt:  K--------KFPVRDSDRIAMERCNGVVVVSAIFNDHDKIRQPRGLGSKTLDVVCFFMFVDETTVRGLENHKIIPRRNSSPDIIGAWRIVRVSGKNLYEN

Query:  PAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIVTENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPTKL
           NG +PK L+HRLFPN ++SIWVDAKLQL+VDP  ++   +   N+  AIS+H        EA A    +K +D  S+  Q+E Y + GL P++  KL
Subjt:  PAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIVTENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPTKL

Query:  PYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSI--KINMF
        P T+DVP+   I+R H   +NLF+C+ FNE++ F  RDQL+FA  RD +   +   INMF
Subjt:  PYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSI--KINMF

AT1G53040.2 Protein of unknown function (DUF616)1.8e-4434.44Show/hide
Query:  PSSSLSSPIPPPMASTSTPFPPSAP---LAPPLSFSQAMQWNSSSP-SSDLRYL----------VANADTFAGNFTAEKRFSYFDRPPNDSTLAVPCGFL
        P  S S P PPP      P P   P   L P  + +   ++   SP   +L Y+                F G  + E R + FD      ++ V CGF+
Subjt:  PSSSLSSPIPPPMASTSTPFPPSAP---LAPPLSFSQAMQWNSSSP-SSDLRYL----------VANADTFAGNFTAEKRFSYFDRPPNDSTLAVPCGFL

Query:  K--------KFPVRDSDRIAMERCNGVVVVSAIFNDHDKIRQPRGLGSKTLDVVCFFMFVDETTVRGLENHKIIPRRNSSPDIIGAWRIVRVSGKNLYEN
        K         F + +     +++ + V+V SAIF  +D I++P  +       + F+MFVDE T   L+N       N     +G WRI+ V     Y +
Subjt:  K--------KFPVRDSDRIAMERCNGVVVVSAIFNDHDKIRQPRGLGSKTLDVVCFFMFVDETTVRGLENHKIIPRRNSSPDIIGAWRIVRVSGKNLYEN

Query:  PAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIVTENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPTKL
           NG +PK L+HRLFPN ++SIWVDAKLQL+VDP  ++   +   N+  AIS+H        EA A    +K +D  S+  Q+E Y + GL P++  KL
Subjt:  PAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIVTENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPTKL

Query:  PYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSI--KINMF
        P T+DVP+   I+R H   +NLF+C+ FNE++ F  RDQL+FA  RD +   +   INMF
Subjt:  PYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSI--KINMF

AT4G38500.1 Protein of unknown function (DUF616)9.5e-4634.63Show/hide
Query:  FAGNFTAEKRFSYFDRPPNDSTLAVPCGFLKKFPVRDS--DRIAMERCNGVVVVSAIFNDHDKIRQPRGLGSKTLDVVCFFMFVDETTVRGLENHKIIPR
        F GN +  +R   F   P    + V CGF+ +     S  D+  +++C   VV + IF+ +D+  QP  +  +++++ CF M VDE ++  L  +  + +
Subjt:  FAGNFTAEKRFSYFDRPPNDSTLAVPCGFLKKFPVRDS--DRIAMERCNGVVVVSAIFNDHDKIRQPRGLGSKTLDVVCFFMFVDETTVRGLENHKIIPR

Query:  RNSSPDIIGAWRIVRVSGKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIVTENADMAISKHPYYIHTMEEAMATARWKKWWD
               +G WR++ +     Y+ P  NG +PK L HRLFP +++SIW+D K++L+VDPLL++   +       AI++H ++ +  EEA A  R +K + 
Subjt:  RNSSPDIIGAWRIVRVSGKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIVTENADMAISKHPYYIHTMEEAMATARWKKWWD

Query:  VDSLKKQMETYCENGLKPWSPTKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFE
           +   M+ Y   GL+PWS  K    +DVP+ A+I+R H   +NLFSCL FNE+    PRDQL+F +V D L  + K+ MF+
Subjt:  VDSLKKQMETYCENGLKPWSPTKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFE

AT5G46220.1 Protein of unknown function (DUF616)1.9e-13462.87Show/hide
Query:  NSSSPSSDLRYLVANADTFAGNFTAEKRFSYFDRPPNDSTLAVPCGFLKKFPVRDSDRIAMERCNGVVVVSAIFNDHDKIRQPRGLGSKTLDVVCFFMFV
        + SSP  +LRY+   +++F GNF+ +KRFSYF+    D  + VPCGF + FPV +SDR+ ME+C G+VV SAIFNDHDKIRQP GLG KTL+ VCF+MF+
Subjt:  NSSSPSSDLRYLVANADTFAGNFTAEKRFSYFDRPPNDSTLAVPCGFLKKFPVRDSDRIAMERCNGVVVVSAIFNDHDKIRQPRGLGSKTLDVVCFFMFV

Query:  DETTVRGLENHKIIPRRNSSPDIIGAWRIVRVS-GKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIVTENADMAISKHPYYI
        D+ T+  L +H +I + N S   +GAWRI+++S  +NLY NPAMNGVIPKYL+HRLFPNSKFSIWVDAK+QLM+DPLLLIHS++V    DMAISKHP+++
Subjt:  DETTVRGLENHKIIPRRNSSPDIIGAWRIVRVS-GKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIVTENADMAISKHPYYI

Query:  HTMEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPTKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFE
        +TMEEAMATARWKKW DVD L+ QMETYCE+GLKPWS +KLPY TDVPD+ALILRRHG  SNLFSC +FNELEAFNPRDQLAFAFVRDH+ P +K+NMFE
Subjt:  HTMEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPTKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFE

Query:  EEVFEQVALEYRHNLKK---KGYAEPELGPHISK----PKRTKRAGPDLLYVNGTCCSKCQNYLLQMWG
         EVFEQV +EYRHNLKK     Y E E            KR K    +   +N    S C+NYL  MWG
Subjt:  EEVFEQVALEYRHNLKK---KGYAEPELGPHISK----PKRTKRAGPDLLYVNGTCCSKCQNYLLQMWG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAATCCAGAGGTTATGGGAAAGCCAGGTTGGTCTACGCCTCTGCTATTCCAATCAAAACTCCTCTGTTTCTCTCTGTTTTATCTTTTCTCCTCCATCTTCCTCGCT
CTCTACACTTCTTTCTCCTCTTCCAAATGCCTCTTCCGATCCTCTCCCTTCGATCCCATCCAGTTCCCTCTCTTCTCCTATCCCTCCTCCTATGGCGAGCACAAGTACGC
CGTTCCCACCCTCCGCTCCTCTTGCTCCTCCCCTGTCTTTTTCTCAGGCAATGCAGTGGAATTCCTCTTCGCCGTCGTCTGATTTGAGGTATCTCGTCGCGAATGCCGAT
ACTTTCGCCGGTAATTTCACTGCCGAGAAGAGGTTTTCTTACTTCGATCGTCCACCAAATGATTCTACTCTAGCGGTTCCTTGTGGATTTCTCAAGAAATTTCCTGTCAG
AGATTCTGATCGAATTGCGATGGAGCGTTGCAACGGCGTGGTCGTGGTTTCCGCGATCTTCAACGATCATGATAAAATTCGGCAACCGAGAGGCCTCGGATCGAAAACTT
TGGATGTCGTGTGCTTTTTCATGTTTGTTGATGAAACTACGGTGAGAGGACTCGAGAACCACAAAATAATTCCTAGAAGAAACTCATCTCCGGATATAATTGGCGCTTGG
AGAATTGTGAGAGTTTCGGGCAAGAATCTGTACGAAAATCCGGCCATGAATGGCGTGATACCTAAGTATTTAGTTCACAGACTGTTTCCGAACTCAAAATTCAGTATATG
GGTGGACGCGAAGCTTCAGTTAATGGTGGATCCATTGTTGTTGATTCATTCGCTGATTGTGACTGAGAATGCAGATATGGCCATTTCCAAACATCCTTATTATATTCACA
CAATGGAAGAGGCCATGGCAACTGCCAGATGGAAGAAATGGTGGGATGTTGATTCCTTGAAGAAGCAAATGGAAACTTACTGTGAAAATGGATTGAAACCATGGAGTCCC
ACTAAGCTTCCATATACCACAGATGTACCAGATAGTGCTTTAATCTTGAGGAGACATGGAAGGGGAAGCAACCTATTCTCTTGCCTTTTATTCAACGAATTGGAAGCTTT
CAACCCAAGGGACCAATTGGCTTTTGCATTTGTGAGAGATCATTTGACCCCATCCATCAAAATCAACATGTTTGAAGAAGAAGTTTTCGAGCAAGTTGCTTTGGAATATA
GGCACAATCTCAAAAAGAAAGGATATGCTGAGCCTGAACTGGGCCCCCACATCTCCAAGCCCAAACGAACCAAAAGGGCCGGCCCAGATTTGTTGTACGTCAATGGCACC
TGTTGCAGCAAGTGCCAAAACTATCTCCTCCAGATGTGGGGTGACGTTTCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAAATCCAGAGGTTATGGGAAAGCCAGGTTGGTCTACGCCTCTGCTATTCCAATCAAAACTCCTCTGTTTCTCTCTGTTTTATCTTTTCTCCTCCATCTTCCTCGCT
CTCTACACTTCTTTCTCCTCTTCCAAATGCCTCTTCCGATCCTCTCCCTTCGATCCCATCCAGTTCCCTCTCTTCTCCTATCCCTCCTCCTATGGCGAGCACAAGTACGC
CGTTCCCACCCTCCGCTCCTCTTGCTCCTCCCCTGTCTTTTTCTCAGGCAATGCAGTGGAATTCCTCTTCGCCGTCGTCTGATTTGAGGTATCTCGTCGCGAATGCCGAT
ACTTTCGCCGGTAATTTCACTGCCGAGAAGAGGTTTTCTTACTTCGATCGTCCACCAAATGATTCTACTCTAGCGGTTCCTTGTGGATTTCTCAAGAAATTTCCTGTCAG
AGATTCTGATCGAATTGCGATGGAGCGTTGCAACGGCGTGGTCGTGGTTTCCGCGATCTTCAACGATCATGATAAAATTCGGCAACCGAGAGGCCTCGGATCGAAAACTT
TGGATGTCGTGTGCTTTTTCATGTTTGTTGATGAAACTACGGTGAGAGGACTCGAGAACCACAAAATAATTCCTAGAAGAAACTCATCTCCGGATATAATTGGCGCTTGG
AGAATTGTGAGAGTTTCGGGCAAGAATCTGTACGAAAATCCGGCCATGAATGGCGTGATACCTAAGTATTTAGTTCACAGACTGTTTCCGAACTCAAAATTCAGTATATG
GGTGGACGCGAAGCTTCAGTTAATGGTGGATCCATTGTTGTTGATTCATTCGCTGATTGTGACTGAGAATGCAGATATGGCCATTTCCAAACATCCTTATTATATTCACA
CAATGGAAGAGGCCATGGCAACTGCCAGATGGAAGAAATGGTGGGATGTTGATTCCTTGAAGAAGCAAATGGAAACTTACTGTGAAAATGGATTGAAACCATGGAGTCCC
ACTAAGCTTCCATATACCACAGATGTACCAGATAGTGCTTTAATCTTGAGGAGACATGGAAGGGGAAGCAACCTATTCTCTTGCCTTTTATTCAACGAATTGGAAGCTTT
CAACCCAAGGGACCAATTGGCTTTTGCATTTGTGAGAGATCATTTGACCCCATCCATCAAAATCAACATGTTTGAAGAAGAAGTTTTCGAGCAAGTTGCTTTGGAATATA
GGCACAATCTCAAAAAGAAAGGATATGCTGAGCCTGAACTGGGCCCCCACATCTCCAAGCCCAAACGAACCAAAAGGGCCGGCCCAGATTTGTTGTACGTCAATGGCACC
TGTTGCAGCAAGTGCCAAAACTATCTCCTCCAGATGTGGGGTGACGTTTCTTGA
Protein sequenceShow/hide protein sequence
MEIQRLWESQVGLRLCYSNQNSSVSLCFIFSPPSSSLSTLLSPLPNASSDPLPSIPSSSLSSPIPPPMASTSTPFPPSAPLAPPLSFSQAMQWNSSSPSSDLRYLVANAD
TFAGNFTAEKRFSYFDRPPNDSTLAVPCGFLKKFPVRDSDRIAMERCNGVVVVSAIFNDHDKIRQPRGLGSKTLDVVCFFMFVDETTVRGLENHKIIPRRNSSPDIIGAW
RIVRVSGKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIVTENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSP
TKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFEEEVFEQVALEYRHNLKKKGYAEPELGPHISKPKRTKRAGPDLLYVNGT
CCSKCQNYLLQMWGDVS