; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0038988 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0038988
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionDUF1985 domain-containing protein
Genome locationchr2:32611608..32615370
RNA-Seq ExpressionLag0038988
SyntenyLag0038988
Gene Ontology termsGO:0110165 - cellular anatomical structure (cellular component)
InterPro domainsIPR015410 - Domain of unknown function DUF1985


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008440207.1 PREDICTED: uncharacterized protein LOC103484737 isoform X1 [Cucumis melo]6.9e-5251.79Show/hide
Query:  GRNKEYLYEDFEKITGLNCGEFPAMDMKKL-SGRFLRKYFDDEKPIKRSIVSDLFMKMDGVKKKDKVRLAKLYFLSNFLLGKQVSTGVEVDHITLLDDDE
        GR  ++  +DF  ITGLNCGE PA+DM K+  G+F ++YF  EK I+R+ + ++F +MD  + KD V++AKLY L  F+LGKQ+ TG+  ++  L+DD E
Subjt:  GRNKEYLYEDFEKITGLNCGEFPAMDMKKL-SGRFLRKYFDDEKPIKRSIVSDLFMKMDGVKKKDKVRLAKLYFLSNFLLGKQVSTGVEVDHITLLDDDE

Query:  QFENYPWGRIVYTTTINSIKNAIKNPDALVVGIAGFPHALLVWAYECIPMLSGPTIVCAEKISNGIPKMNNWLADVHPEWKDLATKVFEHRKFQI
        QF++YPWGRI Y  TI+ +K AIK+ DA  +G+ GFP AL VWAYE IP+L+  +   A +IS G P+MNNW ADVHPEWKDL+ KVF+   F +
Subjt:  QFENYPWGRIVYTTTINSIKNAIKNPDALVVGIAGFPHALLVWAYECIPMLSGPTIVCAEKISNGIPKMNNWLADVHPEWKDLATKVFEHRKFQI

XP_008440208.1 PREDICTED: uncharacterized protein LOC103484737 isoform X2 [Cucumis melo]6.9e-5251.79Show/hide
Query:  GRNKEYLYEDFEKITGLNCGEFPAMDMKKL-SGRFLRKYFDDEKPIKRSIVSDLFMKMDGVKKKDKVRLAKLYFLSNFLLGKQVSTGVEVDHITLLDDDE
        GR  ++  +DF  ITGLNCGE PA+DM K+  G+F ++YF  EK I+R+ + ++F +MD  + KD V++AKLY L  F+LGKQ+ TG+  ++  L+DD E
Subjt:  GRNKEYLYEDFEKITGLNCGEFPAMDMKKL-SGRFLRKYFDDEKPIKRSIVSDLFMKMDGVKKKDKVRLAKLYFLSNFLLGKQVSTGVEVDHITLLDDDE

Query:  QFENYPWGRIVYTTTINSIKNAIKNPDALVVGIAGFPHALLVWAYECIPMLSGPTIVCAEKISNGIPKMNNWLADVHPEWKDLATKVFEHRKFQI
        QF++YPWGRI Y  TI+ +K AIK+ DA  +G+ GFP AL VWAYE IP+L+  +   A +IS G P+MNNW ADVHPEWKDL+ KVF+   F +
Subjt:  QFENYPWGRIVYTTTINSIKNAIKNPDALVVGIAGFPHALLVWAYECIPMLSGPTIVCAEKISNGIPKMNNWLADVHPEWKDLATKVFEHRKFQI

XP_008440212.1 PREDICTED: uncharacterized protein LOC103484737 isoform X5 [Cucumis melo]6.9e-5251.79Show/hide
Query:  GRNKEYLYEDFEKITGLNCGEFPAMDMKKL-SGRFLRKYFDDEKPIKRSIVSDLFMKMDGVKKKDKVRLAKLYFLSNFLLGKQVSTGVEVDHITLLDDDE
        GR  ++  +DF  ITGLNCGE PA+DM K+  G+F ++YF  EK I+R+ + ++F +MD  + KD V++AKLY L  F+LGKQ+ TG+  ++  L+DD E
Subjt:  GRNKEYLYEDFEKITGLNCGEFPAMDMKKL-SGRFLRKYFDDEKPIKRSIVSDLFMKMDGVKKKDKVRLAKLYFLSNFLLGKQVSTGVEVDHITLLDDDE

Query:  QFENYPWGRIVYTTTINSIKNAIKNPDALVVGIAGFPHALLVWAYECIPMLSGPTIVCAEKISNGIPKMNNWLADVHPEWKDLATKVFEHRKFQI
        QF++YPWGRI Y  TI+ +K AIK+ DA  +G+ GFP AL VWAYE IP+L+  +   A +IS G P+MNNW ADVHPEWKDL+ KVF+   F +
Subjt:  QFENYPWGRIVYTTTINSIKNAIKNPDALVVGIAGFPHALLVWAYECIPMLSGPTIVCAEKISNGIPKMNNWLADVHPEWKDLATKVFEHRKFQI

XP_016899363.1 PREDICTED: uncharacterized protein LOC103484737 isoform X3 [Cucumis melo]6.9e-5251.79Show/hide
Query:  GRNKEYLYEDFEKITGLNCGEFPAMDMKKL-SGRFLRKYFDDEKPIKRSIVSDLFMKMDGVKKKDKVRLAKLYFLSNFLLGKQVSTGVEVDHITLLDDDE
        GR  ++  +DF  ITGLNCGE PA+DM K+  G+F ++YF  EK I+R+ + ++F +MD  + KD V++AKLY L  F+LGKQ+ TG+  ++  L+DD E
Subjt:  GRNKEYLYEDFEKITGLNCGEFPAMDMKKL-SGRFLRKYFDDEKPIKRSIVSDLFMKMDGVKKKDKVRLAKLYFLSNFLLGKQVSTGVEVDHITLLDDDE

Query:  QFENYPWGRIVYTTTINSIKNAIKNPDALVVGIAGFPHALLVWAYECIPMLSGPTIVCAEKISNGIPKMNNWLADVHPEWKDLATKVFEHRKFQI
        QF++YPWGRI Y  TI+ +K AIK+ DA  +G+ GFP AL VWAYE IP+L+  +   A +IS G P+MNNW ADVHPEWKDL+ KVF+   F +
Subjt:  QFENYPWGRIVYTTTINSIKNAIKNPDALVVGIAGFPHALLVWAYECIPMLSGPTIVCAEKISNGIPKMNNWLADVHPEWKDLATKVFEHRKFQI

XP_022132727.1 uncharacterized protein LOC111005524 [Momordica charantia]4.8e-5341.84Show/hide
Query:  GRNKEYLYEDFEKITGLNCGEFPAMDMKKLS-GRFLRKYFDDEKPIKRSIVSDLFMKMDGVKKKDKVRLAKLYFLSNFLLGKQVSTGVEVDHITLLDDDE
        GR  ++  +DF  ITG+NCGE P +DM K+    F ++YF  E+ IKR+ + ++FM+MD  + KD V++AKLY L  FLLGKQ++TG+  ++  L+DDDE
Subjt:  GRNKEYLYEDFEKITGLNCGEFPAMDMKKLS-GRFLRKYFDDEKPIKRSIVSDLFMKMDGVKKKDKVRLAKLYFLSNFLLGKQVSTGVEVDHITLLDDDE

Query:  QFENYPWGRIVYTTTINSIKNAIKNPDALVVGIAGFPHALLVWAYECIPMLSGPTIVCAEKISNGIPKMNNWLADVHPEWKDLATKVFEHRKFQI--YEM
        QFE YPWGR+ Y  TI+ +K AIK+ DA  +GI GFP+ALLVWAYE IP+LS  + + A KIS+G+P+MNNW+A+VHPEW+DL+ K+F    F +   E 
Subjt:  QFENYPWGRIVYTTTINSIKNAIKNPDALVVGIAGFPHALLVWAYECIPMLSGPTIVCAEKISNGIPKMNNWLADVHPEWKDLATKVFEHRKFQI--YEM

Query:  RDENVVHKRTS-PIQQIE-----------EQQLKHASTSSSNERDT-NGP--TNLEDVKDMILTSLEELNMKMTMLFREMAEMKLLLAKKQSGG
         D  +  +  S P+  +E           ++Q   A TS + + D   GP   + + V + +LT +  +   M  +  ++  MK L+ K    G
Subjt:  RDENVVHKRTS-PIQQIE-----------EQQLKHASTSSSNERDT-NGP--TNLEDVKDMILTSLEELNMKMTMLFREMAEMKLLLAKKQSGG

TrEMBL top hitse value%identityAlignment
A0A1S3B065 uncharacterized protein LOC103484737 isoform X43.3e-5251.79Show/hide
Query:  GRNKEYLYEDFEKITGLNCGEFPAMDMKKL-SGRFLRKYFDDEKPIKRSIVSDLFMKMDGVKKKDKVRLAKLYFLSNFLLGKQVSTGVEVDHITLLDDDE
        GR  ++  +DF  ITGLNCGE PA+DM K+  G+F ++YF  EK I+R+ + ++F +MD  + KD V++AKLY L  F+LGKQ+ TG+  ++  L+DD E
Subjt:  GRNKEYLYEDFEKITGLNCGEFPAMDMKKL-SGRFLRKYFDDEKPIKRSIVSDLFMKMDGVKKKDKVRLAKLYFLSNFLLGKQVSTGVEVDHITLLDDDE

Query:  QFENYPWGRIVYTTTINSIKNAIKNPDALVVGIAGFPHALLVWAYECIPMLSGPTIVCAEKISNGIPKMNNWLADVHPEWKDLATKVFEHRKFQI
        QF++YPWGRI Y  TI+ +K AIK+ DA  +G+ GFP AL VWAYE IP+L+  +   A +IS G P+MNNW ADVHPEWKDL+ KVF+   F +
Subjt:  QFENYPWGRIVYTTTINSIKNAIKNPDALVVGIAGFPHALLVWAYECIPMLSGPTIVCAEKISNGIPKMNNWLADVHPEWKDLATKVFEHRKFQI

A0A1S3B0L9 uncharacterized protein LOC103484737 isoform X53.3e-5251.79Show/hide
Query:  GRNKEYLYEDFEKITGLNCGEFPAMDMKKL-SGRFLRKYFDDEKPIKRSIVSDLFMKMDGVKKKDKVRLAKLYFLSNFLLGKQVSTGVEVDHITLLDDDE
        GR  ++  +DF  ITGLNCGE PA+DM K+  G+F ++YF  EK I+R+ + ++F +MD  + KD V++AKLY L  F+LGKQ+ TG+  ++  L+DD E
Subjt:  GRNKEYLYEDFEKITGLNCGEFPAMDMKKL-SGRFLRKYFDDEKPIKRSIVSDLFMKMDGVKKKDKVRLAKLYFLSNFLLGKQVSTGVEVDHITLLDDDE

Query:  QFENYPWGRIVYTTTINSIKNAIKNPDALVVGIAGFPHALLVWAYECIPMLSGPTIVCAEKISNGIPKMNNWLADVHPEWKDLATKVFEHRKFQI
        QF++YPWGRI Y  TI+ +K AIK+ DA  +G+ GFP AL VWAYE IP+L+  +   A +IS G P+MNNW ADVHPEWKDL+ KVF+   F +
Subjt:  QFENYPWGRIVYTTTINSIKNAIKNPDALVVGIAGFPHALLVWAYECIPMLSGPTIVCAEKISNGIPKMNNWLADVHPEWKDLATKVFEHRKFQI

A0A1S3B181 uncharacterized protein LOC103484737 isoform X73.3e-5251.79Show/hide
Query:  GRNKEYLYEDFEKITGLNCGEFPAMDMKKL-SGRFLRKYFDDEKPIKRSIVSDLFMKMDGVKKKDKVRLAKLYFLSNFLLGKQVSTGVEVDHITLLDDDE
        GR  ++  +DF  ITGLNCGE PA+DM K+  G+F ++YF  EK I+R+ + ++F +MD  + KD V++AKLY L  F+LGKQ+ TG+  ++  L+DD E
Subjt:  GRNKEYLYEDFEKITGLNCGEFPAMDMKKL-SGRFLRKYFDDEKPIKRSIVSDLFMKMDGVKKKDKVRLAKLYFLSNFLLGKQVSTGVEVDHITLLDDDE

Query:  QFENYPWGRIVYTTTINSIKNAIKNPDALVVGIAGFPHALLVWAYECIPMLSGPTIVCAEKISNGIPKMNNWLADVHPEWKDLATKVFEHRKFQI
        QF++YPWGRI Y  TI+ +K AIK+ DA  +G+ GFP AL VWAYE IP+L+  +   A +IS G P+MNNW ADVHPEWKDL+ KVF+   F +
Subjt:  QFENYPWGRIVYTTTINSIKNAIKNPDALVVGIAGFPHALLVWAYECIPMLSGPTIVCAEKISNGIPKMNNWLADVHPEWKDLATKVFEHRKFQI

A0A1S4DTS6 uncharacterized protein LOC103484737 isoform X33.3e-5251.79Show/hide
Query:  GRNKEYLYEDFEKITGLNCGEFPAMDMKKL-SGRFLRKYFDDEKPIKRSIVSDLFMKMDGVKKKDKVRLAKLYFLSNFLLGKQVSTGVEVDHITLLDDDE
        GR  ++  +DF  ITGLNCGE PA+DM K+  G+F ++YF  EK I+R+ + ++F +MD  + KD V++AKLY L  F+LGKQ+ TG+  ++  L+DD E
Subjt:  GRNKEYLYEDFEKITGLNCGEFPAMDMKKL-SGRFLRKYFDDEKPIKRSIVSDLFMKMDGVKKKDKVRLAKLYFLSNFLLGKQVSTGVEVDHITLLDDDE

Query:  QFENYPWGRIVYTTTINSIKNAIKNPDALVVGIAGFPHALLVWAYECIPMLSGPTIVCAEKISNGIPKMNNWLADVHPEWKDLATKVFEHRKFQI
        QF++YPWGRI Y  TI+ +K AIK+ DA  +G+ GFP AL VWAYE IP+L+  +   A +IS G P+MNNW ADVHPEWKDL+ KVF+   F +
Subjt:  QFENYPWGRIVYTTTINSIKNAIKNPDALVVGIAGFPHALLVWAYECIPMLSGPTIVCAEKISNGIPKMNNWLADVHPEWKDLATKVFEHRKFQI

A0A6J1BX50 uncharacterized protein LOC1110055242.3e-5341.84Show/hide
Query:  GRNKEYLYEDFEKITGLNCGEFPAMDMKKLS-GRFLRKYFDDEKPIKRSIVSDLFMKMDGVKKKDKVRLAKLYFLSNFLLGKQVSTGVEVDHITLLDDDE
        GR  ++  +DF  ITG+NCGE P +DM K+    F ++YF  E+ IKR+ + ++FM+MD  + KD V++AKLY L  FLLGKQ++TG+  ++  L+DDDE
Subjt:  GRNKEYLYEDFEKITGLNCGEFPAMDMKKLS-GRFLRKYFDDEKPIKRSIVSDLFMKMDGVKKKDKVRLAKLYFLSNFLLGKQVSTGVEVDHITLLDDDE

Query:  QFENYPWGRIVYTTTINSIKNAIKNPDALVVGIAGFPHALLVWAYECIPMLSGPTIVCAEKISNGIPKMNNWLADVHPEWKDLATKVFEHRKFQI--YEM
        QFE YPWGR+ Y  TI+ +K AIK+ DA  +GI GFP+ALLVWAYE IP+LS  + + A KIS+G+P+MNNW+A+VHPEW+DL+ K+F    F +   E 
Subjt:  QFENYPWGRIVYTTTINSIKNAIKNPDALVVGIAGFPHALLVWAYECIPMLSGPTIVCAEKISNGIPKMNNWLADVHPEWKDLATKVFEHRKFQI--YEM

Query:  RDENVVHKRTS-PIQQIE-----------EQQLKHASTSSSNERDT-NGP--TNLEDVKDMILTSLEELNMKMTMLFREMAEMKLLLAKKQSGG
         D  +  +  S P+  +E           ++Q   A TS + + D   GP   + + V + +LT +  +   M  +  ++  MK L+ K    G
Subjt:  RDENVVHKRTS-PIQQIE-----------EQQLKHASTSSSNERDT-NGP--TNLEDVKDMILTSLEELNMKMTMLFREMAEMKLLLAKKQSGG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G10260.1 FUNCTIONS IN: molecular_function unknown1.8e-0527.87Show/hide
Query:  KDKVRLAKLYFLSNFLLGKQVSTGVEVDHITLLDDDEQFENYPWGRIVYTTTINSIKNAIKNPDALVVGIAGFPHALLVWAYECIPMLSGPTIVCAEKIS
        KD + + +L  LS  + G    + + +     + D   FE YPWG + + + I S+K    + +   V I G  HALL+W YE I  L+  T        
Subjt:  KDKVRLAKLYFLSNFLLGKQVSTGVEVDHITLLDDDEQFENYPWGRIVYTTTINSIKNAIKNPDALVVGIAGFPHALLVWAYECIPMLSGPTIVCAEKIS

Query:  NGIPKMNNWLA---DVHPEWKD
         G+P + +W +   +++ +W D
Subjt:  NGIPKMNNWLA---DVHPEWKD

AT3G31910.1 Domain of unknown function (DUF1985)1.6e-0631.82Show/hide
Query:  KDKVRLAKLYFLSNFLLGKQVSTGVEVDHITLLDDDEQFENYPWGRIVYTTTINSIKNAIKNPDALVVGIAGFPHALLVWAYECIPML
        K+++ + +L  LS  + G    + + +     + D   FE YPWGR+ + + INS+K    + D+ V  I    HAL++W YE +P L
Subjt:  KDKVRLAKLYFLSNFLLGKQVSTGVEVDHITLLDDDEQFENYPWGRIVYTTTINSIKNAIKNPDALVVGIAGFPHALLVWAYECIPML

AT3G32960.1 Domain of unknown function (DUF1985)4.7e-0627.94Show/hide
Query:  DGVKKKDKVRLAKLYFLSNFLLGKQVSTGVEVDHITLLDDDEQFENYPWGRIVYTTTINSI-KNAIKNPDALVVGIAGFPHALLVWAYECIPML------
        D    +++  LA L  + +  L         ++++    D E+  NYPWG   +   ++SI KN   N       I GFP AL +W  E IPML      
Subjt:  DGVKKKDKVRLAKLYFLSNFLLGKQVSTGVEVDHITLLDDDEQFENYPWGRIVYTTTINSI-KNAIKNPDALVVGIAGFPHALLVWAYECIPML------

Query:  ----SGPTIVCAEKI----SNGIPKMNNWLADVHPE
              PT    EK     S  + ++ N  A  HP+
Subjt:  ----SGPTIVCAEKI----SNGIPKMNNWLADVHPE

AT4G08430.1 Ulp1 protease family protein2.0e-0927.55Show/hide
Query:  LYEDFEKITGLNCGEFPAMDMKKLSGRFLRKYFDDEKPIKRSIVSDLFMKMDGVKKKDKV-RLAKLYFLSNFLL------GKQVSTGVEVDHITLLDDDE
        LYE FE ITGLNC  F   D    +G    K F +E  +  S V  LF +++ V +  K   L K   +    L      G    + V +     + D  
Subjt:  LYEDFEKITGLNCGEFPAMDMKKLSGRFLRKYFDDEKPIKRSIVSDLFMKMDGVKKKDKV-RLAKLYFLSNFLL------GKQVSTGVEVDHITLLDDDE

Query:  QFENYPWGRIVYTTTINSIKNAIKNPDALVVGIAGFPHALLVWAYECIPMLSGPTIVCAEKISNGIP--------KMNNWLADVHPEWKDLATKVFEHRK
         FE YPWGR+ + + + S+K    + D+ V  I G   ALLVW YE +P + G      +    G+P        K  N+ A +  E K    +V  H++
Subjt:  QFENYPWGRIVYTTTINSIKNAIKNPDALVVGIAGFPHALLVWAYECIPMLSGPTIVCAEKISNGIP--------KMNNWLADVHPEWKDLATKVFEHRK

Query:  FQIYEMRDENVVHKRTSPIQQIEEQQLKHASTSSSNERDTNGPTNLEDVKDMILTSLEELNMKMT
         Q  E  +E+   K        +    K  +  +  + D    T+L D+++M    LE++N+ ++
Subjt:  FQIYEMRDENVVHKRTSPIQQIEEQQLKHASTSSSNERDTNGPTNLEDVKDMILTSLEELNMKMT

AT5G28810.1 Domain of unknown function (DUF1985)1.5e-0730.28Show/hide
Query:  LYEDFEKITGLNCGEFPAMDMKKLSGRFLRKYFDDEKPIKRSIVSDLFMKMDGVKKKDKVRLAKLYFLSNFLLGKQVSTGVEVDHITLLDDDEQFENYPW
        LYE FE ITGLNC  F   D +      L + F+  K                   + ++ + +L  LS  + G    + V +     + D   FE YPW
Subjt:  LYEDFEKITGLNCGEFPAMDMKKLSGRFLRKYFDDEKPIKRSIVSDLFMKMDGVKKKDKVRLAKLYFLSNFLLGKQVSTGVEVDHITLLDDDEQFENYPW

Query:  GRIVYTTTINSIKNAIKNPDALVVGIAGFPHALLVWAYECIP
        GR+ + + + S+K    + D+ V  I G   ALLVW YE +P
Subjt:  GRIVYTTTINSIKNAIKNPDALVVGIAGFPHALLVWAYECIP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATAAAAGAAGGAGAAGCCAAAGAATAATTGGTAGTGGAAGTGGATTGCCTACAAGGAAACGGAAAGAAAATGAAGTTATCGATATAGATTCCGAGGACGAAGTTGA
TAGATCAGACAATGTCGGGATACCAACAAAGTCATCGTTCTTCAATCGGTTGTTTGGCGACATGGCTTCCGAGATTCGACCACCGACACGAAAAAGGGATGAAGATGATG
ATAAACAAGAAGAACCTATACCAATCAAGATTGTAGACCCATTTGCTCATTTAGCAAACAGTCAATTAGTTCCATATGGTTATTGCTTTGGGGAAAACACAATGAATACT
GCTCAAGAGGGCTCTCATTACAAGCAACAGGATATAGCTGGAAACTTCGATGACCTAGAAGTTTCCGATCTTCACGTTGGAACAGAAGAAGATAAACATTGTAAAGACGG
AGAGAAAAGCACAACGGATAACAAGGAAGAGGGGACTCAATCCGAGGATAAAAATGGAAATCGGGATGTGGGGAGTGTGGATGTTGAAGAAAAAGAATTTGAGGGGAATG
CTATTGACCTGGAAAATACGACAACTAGAGAAGAAGTTGGGAGAAGCAAAAGAAAAAGGCCGGAGGAAGTTGCTAAGATGGAACGGAAGAGTGAATCTTCAAAAAAGGTG
ATCAAGATGGAACGATATTCCAGAGGATCGGTCTTCGGAAGAAACAAAGAATATTTGTATGAGGATTTCGAAAAAATAACAGGACTAAATTGTGGTGAATTCCCAGCAAT
GGATATGAAAAAACTGTCAGGTAGGTTTCTTAGAAAGTATTTTGACGATGAAAAACCGATAAAGAGATCAATTGTTTCTGATCTTTTCATGAAAATGGATGGAGTGAAAA
AAAAGGACAAGGTGAGGTTGGCCAAGTTATATTTCCTATCAAACTTCTTATTGGGAAAACAAGTAAGCACAGGGGTCGAAGTAGATCATATCACACTACTTGATGATGAT
GAACAATTTGAAAATTACCCATGGGGAAGGATAGTATATACCACCACCATCAACTCAATAAAGAATGCTATAAAGAATCCGGATGCATTGGTGGTTGGGATTGCTGGATT
CCCTCACGCACTACTTGTCTGGGCATACGAATGCATACCTATGCTTTCGGGACCAACCATAGTTTGCGCCGAAAAAATTTCAAATGGGATCCCGAAGATGAACAACTGGT
TGGCAGACGTTCATCCGGAATGGAAGGACTTGGCAACGAAGGTTTTTGAACACAGGAAGTTTCAGATATATGAGATGCGTGATGAGAATGTTGTCCACAAAAGAACGTCA
CCGATTCAGCAAATAGAAGAGCAACAACTTAAGCATGCATCAACTTCTAGCTCAAATGAAAGGGACACCAATGGACCTACAAACTTGGAAGACGTTAAGGACATGATCCT
TACAAGTCTGGAGGAATTAAATATGAAGATGACCATGCTTTTCCGAGAAATGGCTGAAATGAAGTTATTGTTGGCCAAAAAGCAATCGGGGGGACATCATGCCGACAATA
AGGAGAATAATGATGATGACGAAGGGGACAACAGGGAAGGAAATGTTGGAAATGAAGGCGAAGGGGACTTCAGGGGGGAGGATGAAAACAGGAATGAAGGCGAAGGCGGG
CAAAAAGATGGAAACAATGAAGGCGAAGGATATGAAAGAAAAAAAAATGATGACACGGGGGAAGACGTAGATGAAGATATTGTCAGCAATAGCTTCTTAGTGGAAGTGGA
GAAAATTGAAAAGGCTGCAATATCGAATATTAAAAAGTTGAAAGGTACTGGACCGTTCGCGGAAGGTCAAACTTTTGTAATGAGGGAAAAAAGGAAGATCATTCCATCGA
AACTAATGAAGTCTCCCTTCACATCGAAGTTTGGATCCGCTGAAGAAAAGGAACGACGAAAAAAGACAAAATCCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGATAAAAGAAGGAGAAGCCAAAGAATAATTGGTAGTGGAAGTGGATTGCCTACAAGGAAACGGAAAGAAAATGAAGTTATCGATATAGATTCCGAGGACGAAGTTGA
TAGATCAGACAATGTCGGGATACCAACAAAGTCATCGTTCTTCAATCGGTTGTTTGGCGACATGGCTTCCGAGATTCGACCACCGACACGAAAAAGGGATGAAGATGATG
ATAAACAAGAAGAACCTATACCAATCAAGATTGTAGACCCATTTGCTCATTTAGCAAACAGTCAATTAGTTCCATATGGTTATTGCTTTGGGGAAAACACAATGAATACT
GCTCAAGAGGGCTCTCATTACAAGCAACAGGATATAGCTGGAAACTTCGATGACCTAGAAGTTTCCGATCTTCACGTTGGAACAGAAGAAGATAAACATTGTAAAGACGG
AGAGAAAAGCACAACGGATAACAAGGAAGAGGGGACTCAATCCGAGGATAAAAATGGAAATCGGGATGTGGGGAGTGTGGATGTTGAAGAAAAAGAATTTGAGGGGAATG
CTATTGACCTGGAAAATACGACAACTAGAGAAGAAGTTGGGAGAAGCAAAAGAAAAAGGCCGGAGGAAGTTGCTAAGATGGAACGGAAGAGTGAATCTTCAAAAAAGGTG
ATCAAGATGGAACGATATTCCAGAGGATCGGTCTTCGGAAGAAACAAAGAATATTTGTATGAGGATTTCGAAAAAATAACAGGACTAAATTGTGGTGAATTCCCAGCAAT
GGATATGAAAAAACTGTCAGGTAGGTTTCTTAGAAAGTATTTTGACGATGAAAAACCGATAAAGAGATCAATTGTTTCTGATCTTTTCATGAAAATGGATGGAGTGAAAA
AAAAGGACAAGGTGAGGTTGGCCAAGTTATATTTCCTATCAAACTTCTTATTGGGAAAACAAGTAAGCACAGGGGTCGAAGTAGATCATATCACACTACTTGATGATGAT
GAACAATTTGAAAATTACCCATGGGGAAGGATAGTATATACCACCACCATCAACTCAATAAAGAATGCTATAAAGAATCCGGATGCATTGGTGGTTGGGATTGCTGGATT
CCCTCACGCACTACTTGTCTGGGCATACGAATGCATACCTATGCTTTCGGGACCAACCATAGTTTGCGCCGAAAAAATTTCAAATGGGATCCCGAAGATGAACAACTGGT
TGGCAGACGTTCATCCGGAATGGAAGGACTTGGCAACGAAGGTTTTTGAACACAGGAAGTTTCAGATATATGAGATGCGTGATGAGAATGTTGTCCACAAAAGAACGTCA
CCGATTCAGCAAATAGAAGAGCAACAACTTAAGCATGCATCAACTTCTAGCTCAAATGAAAGGGACACCAATGGACCTACAAACTTGGAAGACGTTAAGGACATGATCCT
TACAAGTCTGGAGGAATTAAATATGAAGATGACCATGCTTTTCCGAGAAATGGCTGAAATGAAGTTATTGTTGGCCAAAAAGCAATCGGGGGGACATCATGCCGACAATA
AGGAGAATAATGATGATGACGAAGGGGACAACAGGGAAGGAAATGTTGGAAATGAAGGCGAAGGGGACTTCAGGGGGGAGGATGAAAACAGGAATGAAGGCGAAGGCGGG
CAAAAAGATGGAAACAATGAAGGCGAAGGATATGAAAGAAAAAAAAATGATGACACGGGGGAAGACGTAGATGAAGATATTGTCAGCAATAGCTTCTTAGTGGAAGTGGA
GAAAATTGAAAAGGCTGCAATATCGAATATTAAAAAGTTGAAAGGTACTGGACCGTTCGCGGAAGGTCAAACTTTTGTAATGAGGGAAAAAAGGAAGATCATTCCATCGA
AACTAATGAAGTCTCCCTTCACATCGAAGTTTGGATCCGCTGAAGAAAAGGAACGACGAAAAAAGACAAAATCCTAA
Protein sequenceShow/hide protein sequence
MDKRRRSQRIIGSGSGLPTRKRKENEVIDIDSEDEVDRSDNVGIPTKSSFFNRLFGDMASEIRPPTRKRDEDDDKQEEPIPIKIVDPFAHLANSQLVPYGYCFGENTMNT
AQEGSHYKQQDIAGNFDDLEVSDLHVGTEEDKHCKDGEKSTTDNKEEGTQSEDKNGNRDVGSVDVEEKEFEGNAIDLENTTTREEVGRSKRKRPEEVAKMERKSESSKKV
IKMERYSRGSVFGRNKEYLYEDFEKITGLNCGEFPAMDMKKLSGRFLRKYFDDEKPIKRSIVSDLFMKMDGVKKKDKVRLAKLYFLSNFLLGKQVSTGVEVDHITLLDDD
EQFENYPWGRIVYTTTINSIKNAIKNPDALVVGIAGFPHALLVWAYECIPMLSGPTIVCAEKISNGIPKMNNWLADVHPEWKDLATKVFEHRKFQIYEMRDENVVHKRTS
PIQQIEEQQLKHASTSSSNERDTNGPTNLEDVKDMILTSLEELNMKMTMLFREMAEMKLLLAKKQSGGHHADNKENNDDDEGDNREGNVGNEGEGDFRGEDENRNEGEGG
QKDGNNEGEGYERKKNDDTGEDVDEDIVSNSFLVEVEKIEKAAISNIKKLKGTGPFAEGQTFVMREKRKIIPSKLMKSPFTSKFGSAEEKERRKKTKS