; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0031908 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0031908
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionH15 domain-containing protein
Genome locationchr11:18429087..18431674
RNA-Seq ExpressionLag0031908
SyntenyLag0031908
Gene Ontology termsGO:0003677 - DNA binding (molecular function)
InterPro domainsIPR017956 - AT hook, DNA-binding motif


Homology Show/hide homology
GenBank top hitse value%identityAlignment
BAK05521.1 predicted protein, partial [Hordeum vulgare subsp. vulgare]7.8e-0831.09Show/hide
Query:  HSHPRGVYTLLGKSLGRPRKFSSLSSSCHSHPRGAYTLLGKGLGRPRKFPSLSSS--CHGHPRGAYTLLGKGLGRPRKFQSLS-SSCHGHPR----GAYT
        +SH   +     +  GR R    L S+      GA +L  KG GRPRK   L SS     + +G ++   +G GRPRK  S       G PR    G  T
Subjt:  HSHPRGVYTLLGKSLGRPRKFSSLSSSCHSHPRGAYTLLGKGLGRPRKFPSLSSS--CHGHPRGAYTLLGKGLGRPRKFQSLS-SSCHGHPR----GAYT

Query:  LLRKGLGRPRKFQSLSSSCHGHPRGAYTLLGKGLGRPRKFPSLSSSCHGHPRGAYTLLGKGLGRPRKFPSLSSSCHGHPRGAYTLLGKGLGRPRKFQSLS
         +++G GRPRK +S         +G  T +  G GRPRK  S          G  T L  G GRPRK P   S+       + T             SL+
Subjt:  LLRKGLGRPRKFQSLSSSCHGHPRGAYTLLGKGLGRPRKFPSLSSSCHGHPRGAYTLLGKGLGRPRKFPSLSSSCHGHPRGAYTLLGKGLGRPRKFQSLS

Query:  SSC--HGHPRGAYTLLGKGLGRPRKFQSLSSSCHGHPRGAYTLLGKGLGRP---RKFPSLSSSYHGHPRGAYTLLGKGLGRPRKFPSLSSSCHGHPRGAY
         S    G P      L +G G P    +  ++     RG    L  G  RP   R  P   S+       A T + K LGRPRK  S  +       G  
Subjt:  SSC--HGHPRGAYTLLGKGLGRPRKFQSLSSSCHGHPRGAYTLLGKGLGRP---RKFPSLSSSYHGHPRGAYTLLGKGLGRPRKFPSLSSSCHGHPRGAY

Query:  TLLGKGLGRPRK
        T + K  GRPRK
Subjt:  TLLGKGLGRPRK

KAF6998140.1 hypothetical protein CFC21_014287 [Triticum aestivum]2.3e-0730.59Show/hide
Query:  PRGAYTLL--GKGLGRPRKFPSLSSSCHGHPR-----GAYTLLGKGLGRPRKFQSLSSSC----HGHPRGAYTLLRKGLGRPRKFQSLSSSCHGHPRGAY
        P  AY L+  G   GRPRK         G PR        ++  KG GRPRK Q L SS     +G P G ++  ++G GRPRK            +G  
Subjt:  PRGAYTLL--GKGLGRPRKFPSLSSSCHGHPR-----GAYTLLGKGLGRPRKFQSLSSSC----HGHPRGAYTLLRKGLGRPRKFQSLSSSCHGHPRGAY

Query:  TLLGKGLGRPRKFPSLSSSCHGHPRGAYTLLGKGLGRPRKFPSLSSSCHGHPRGAYTLLGKGLGRPRKF------QSLSSSCHGHPRGAYTLLG------
        T + +G GRPRK            +G  T + KG GRPRK             G  T +  G GRPRK       ++ SS          +L G      
Subjt:  TLLGKGLGRPRKFPSLSSSCHGHPRGAYTLLGKGLGRPRKFPSLSSSCHGHPRGAYTLLGKGLGRPRKF------QSLSSSCHGHPRGAYTLLG------

Query:  ---------KGLGRPRKFQSLSSSCHGHPRGAYTLLGKGLGRPR-KFPSLSSSYHGH--PRGAYTLLGKGLGRPRKFPSLSSSCHGHPRGAYTLLGKGLG
                 +G G P    +  ++     RG    L  G  RP  + P  ++S  G      A T + K LGRPRK  S  +       GA T + K  G
Subjt:  ---------KGLGRPRKFQSLSSSCHGHPRGAYTLLGKGLGRPR-KFPSLSSSYHGH--PRGAYTLLGKGLGRPRKFPSLSSSCHGHPRGAYTLLGKGLG

Query:  RPRK
        RPRK
Subjt:  RPRK

PWV06054.1 hypothetical protein C3747_120g5 [Trypanosoma cruzi]7.0e-0925.55Show/hide
Query:  SARLSSSASCPSVWSKITYNIKPPLLNWDSDVQAYVLKSKVTVLKSRCLECVPDAWPWSWDAWPWHDDAYTLLGKSLGRPRKFMSLSSSCHSHLRGAYTL
        SA  SS + C + +S    ++      + +   +  L S      +  L     A+  +  +  +   A++    SL       S ++S  S    A   
Subjt:  SARLSSSASCPSVWSKITYNIKPPLLNWDSDVQAYVLKSKVTVLKSRCLECVPDAWPWSWDAWPWHDDAYTLLGKSLGRPRKFMSLSSSCHSHLRGAYTL

Query:  LGKSLGRPRKFMSLSSSCHSHPRGAYTLLGKSLGR------PRKFPSLSSSCHSHPRGVYTLLGKSLGRPRKFSSLSSSCHSHPRGAYTLLGKGLGRPRK
           SL       S ++S  S    A++    SL        PR+ PS S++ HS PR   +L      + R+  S S++ HS               PR+
Subjt:  LGKSLGRPRKFMSLSSSCHSHPRGAYTLLGKSLGR------PRKFPSLSSSCHSHPRGVYTLLGKSLGRPRKFSSLSSSCHSHPRGAYTLLGKGLGRPRK

Query:  FPSLSSSCHGHPRGAYTLLGKGLGRPRKFQSLSSSCHGHPRGAYTLLRKGLGRPRKFQSLSSSCHGHPRGAYTLLGKGLGRPRKFPSLSSSCHGHPRGAY
         PS S++ H  PR   +L   G  + R+  SLS++ H  PR   +    G   PR+  S S++ H  PR   +L   G   PR+ PS S++ H  PR   
Subjt:  FPSLSSSCHGHPRGAYTLLGKGLGRPRKFQSLSSSCHGHPRGAYTLLRKGLGRPRKFQSLSSSCHGHPRGAYTLLGKGLGRPRKFPSLSSSCHGHPRGAY

Query:  TLLGKGLGRPRKFPSLS
        +        PR+ PSLS
Subjt:  TLLGKGLGRPRKFPSLS

XP_020199045.1 collagen alpha-1(I) chain [Aegilops tauschii subsp. strangulata]2.3e-0730.59Show/hide
Query:  PRGAYTLL--GKGLGRPRKFPSLSSSCHGHPR-----GAYTLLGKGLGRPRKFQSLSSSC----HGHPRGAYTLLRKGLGRPRKFQSLSSSCHGHPRGAY
        P  AY L+  G   GRPRK         G PR        ++  KG GRPRK Q L SS     +G P G ++  ++G GRPRK            +G  
Subjt:  PRGAYTLL--GKGLGRPRKFPSLSSSCHGHPR-----GAYTLLGKGLGRPRKFQSLSSSC----HGHPRGAYTLLRKGLGRPRKFQSLSSSCHGHPRGAY

Query:  TLLGKGLGRPRKFPSLSSSCHGHPRGAYTLLGKGLGRPRKFPSLSSSCHGHPRGAYTLLGKGLGRPRKF------QSLSSSCHGHPRGAYTLLG------
        T + +G GRPRK            +G  T + KG GRPRK             G  T +  G GRPRK       ++ SS          +L G      
Subjt:  TLLGKGLGRPRKFPSLSSSCHGHPRGAYTLLGKGLGRPRKFPSLSSSCHGHPRGAYTLLGKGLGRPRKF------QSLSSSCHGHPRGAYTLLG------

Query:  ---------KGLGRPRKFQSLSSSCHGHPRGAYTLLGKGLGRPR-KFPSLSSSYHGH--PRGAYTLLGKGLGRPRKFPSLSSSCHGHPRGAYTLLGKGLG
                 +G G P    +  ++     RG    L  G  RP  + P  ++S  G      A T + K LGRPRK  S  +       GA T + K  G
Subjt:  ---------KGLGRPRKFQSLSSSCHGHPRGAYTLLGKGLGRPR-KFPSLSSSYHGH--PRGAYTLLGKGLGRPRKFPSLSSSCHGHPRGAYTLLGKGLG

Query:  RPRK
        RPRK
Subjt:  RPRK

XP_033736132.1 uncharacterized protein LOC117324396 [Pecten maximus]5.9e-1625.07Show/hide
Query:  RKFMSLSSSCHSHLRGAYTLLGKSLGRPRKFMSLSSSCHSHPRGAYTLLGKSLGRPRKFPSLSSSCHSHPRGVYTLLGKSLGRPRKFSSLSSSCHSHPRG
        R   + + +CH+ +RG YT     L   R   + + +CH+  RG YT               + +CH+  RG+YT     L   R   + + +CH+  RG
Subjt:  RKFMSLSSSCHSHLRGAYTLLGKSLGRPRKFMSLSSSCHSHPRGAYTLLGKSLGRPRKFPSLSSSCHSHPRGVYTLLGKSLGRPRKFSSLSSSCHSHPRG

Query:  AYTLLGKGLGRPRKFPSLSSSCHGHPRGAYTLLGKGLGRPRKFQSLSSSCHGHPRGAYTLLRKGLGRPRKFQSLSSSCHGHPRGAYTLLGKGLGRPRKFP
         YT               + +CH   RG YT     L   R   + + +CH   RG YT     L   R     + +C    RG YT         R   
Subjt:  AYTLLGKGLGRPRKFPSLSSSCHGHPRGAYTLLGKGLGRPRKFQSLSSSCHGHPRGAYTLLRKGLGRPRKFQSLSSSCHGHPRGAYTLLGKGLGRPRKFP

Query:  SLSSSCHGHPRGAYTLLGKGLGRPRKFPSLSSSCHGHPRGAYTLLGKGLGRPRKFQSLSSSCHGHPRGAYTLLGKGLGRPRKFQSLSSSCHGHPRGAYTL
          + +C    RG YT     L   R   + + +C     G YT     L   R   + + +C     G YT     L   R   + + +C    RG YT 
Subjt:  SLSSSCHGHPRGAYTLLGKGLGRPRKFPSLSSSCHGHPRGAYTLLGKGLGRPRKFQSLSSSCHGHPRGAYTLLGKGLGRPRKFQSLSSSCHGHPRGAYTL

Query:  LGKGLGRPRKFPSLSSSYHGHPRGAYTLLGKGLGRPRKFPSLSSSCHGHPRGAYT
            L   R   + + + H   RG YT     L   R   + + +C    RG YT
Subjt:  LGKGLGRPRKFPSLSSSYHGHPRGAYTLLGKGLGRPRKFPSLSSSCHGHPRGAYT

TrEMBL top hitse value%identityAlignment
A0A2V2WH94 Uncharacterized protein3.4e-0925.55Show/hide
Query:  SARLSSSASCPSVWSKITYNIKPPLLNWDSDVQAYVLKSKVTVLKSRCLECVPDAWPWSWDAWPWHDDAYTLLGKSLGRPRKFMSLSSSCHSHLRGAYTL
        SA  SS + C + +S    ++      + +   +  L S      +  L     A+  +  +  +   A++    SL       S ++S  S    A   
Subjt:  SARLSSSASCPSVWSKITYNIKPPLLNWDSDVQAYVLKSKVTVLKSRCLECVPDAWPWSWDAWPWHDDAYTLLGKSLGRPRKFMSLSSSCHSHLRGAYTL

Query:  LGKSLGRPRKFMSLSSSCHSHPRGAYTLLGKSLGR------PRKFPSLSSSCHSHPRGVYTLLGKSLGRPRKFSSLSSSCHSHPRGAYTLLGKGLGRPRK
           SL       S ++S  S    A++    SL        PR+ PS S++ HS PR   +L      + R+  S S++ HS               PR+
Subjt:  LGKSLGRPRKFMSLSSSCHSHPRGAYTLLGKSLGR------PRKFPSLSSSCHSHPRGVYTLLGKSLGRPRKFSSLSSSCHSHPRGAYTLLGKGLGRPRK

Query:  FPSLSSSCHGHPRGAYTLLGKGLGRPRKFQSLSSSCHGHPRGAYTLLRKGLGRPRKFQSLSSSCHGHPRGAYTLLGKGLGRPRKFPSLSSSCHGHPRGAY
         PS S++ H  PR   +L   G  + R+  SLS++ H  PR   +    G   PR+  S S++ H  PR   +L   G   PR+ PS S++ H  PR   
Subjt:  FPSLSSSCHGHPRGAYTLLGKGLGRPRKFQSLSSSCHGHPRGAYTLLRKGLGRPRKFQSLSSSCHGHPRGAYTLLGKGLGRPRKFPSLSSSCHGHPRGAY

Query:  TLLGKGLGRPRKFPSLS
        +        PR+ PSLS
Subjt:  TLLGKGLGRPRKFPSLS

A0A3P8P8U8 Uncharacterized protein6.6e-1329.67Show/hide
Query:  SLGRPRKFP--SLSSSCHSHPRGVYTLLGKSLGRPRKFSSLSSSCHSHPRGAYTLLGKGLGRPRKFPSLSSSCHGHPRGAYTLLGKGLGRPRKFQSLSSS
        S  RP   P  S  + C      +  +L +    P   +         PR   +L+ + LG      SL     G PR    L+ + LG PR   SL   
Subjt:  SLGRPRKFP--SLSSSCHSHPRGVYTLLGKSLGRPRKFSSLSSSCHSHPRGAYTLLGKGLGRPRKFPSLSSSCHGHPRGAYTLLGKGLGRPRKFQSLSSS

Query:  CHGHPRGAYTLLRKGLGRPRKFQSLSSSCHGHPRGAYTLLGKGLGRPRKFPSLSSSCHGHPRGAYTLLGKGLGRPRKFPSLSSSCHGHPRGAYTLLGKGL
          G PR    L+ + LG PR    L     G PR    L+ + LG PR    L     G PR    L+ + LG PR    L     G PR    L+ + L
Subjt:  CHGHPRGAYTLLRKGLGRPRKFQSLSSSCHGHPRGAYTLLGKGLGRPRKFPSLSSSCHGHPRGAYTLLGKGLGRPRKFPSLSSSCHGHPRGAYTLLGKGL

Query:  GRPRKFQSLSSSCHGHPRGAYTLLGKGLGRPRKFQSLSSSCHGHPR
        G PR    L     G PR    L+ + LG PR    L     G PR
Subjt:  GRPRKFQSLSSSCHGHPRGAYTLLGKGLGRPRKFQSLSSSCHGHPR

A0A673XVP5 Uncharacterized protein3.4e-0932.34Show/hide
Query:  MSLSSSCHSHLR-GAYTLLGKSLGRPRKFMSLSSSCHSHPRGAYTLLGKSLGRPRK----FPSLSSSCHSHPRGVYTLLGKSLGRPRK---FSSLSSSCH
        M LS      LR G    L K LGRP +            +G    + K LGRP +     P          +G+   L K LGRP +    S L     
Subjt:  MSLSSSCHSHLR-GAYTLLGKSLGRPRKFMSLSSSCHSHPRGAYTLLGKSLGRPRK----FPSLSSSCHSHPRGVYTLLGKSLGRPRK---FSSLSSSCH

Query:  SHPR-GAYTLLGKGLGRPRK----FPSLSSSCHGHPRGAYTLLGKGLGRPRKFQSLSSSCHGHP--RGAYTLLRKGLGRPRKFQSLSSSCHGHPRGAYTL
        S  R G  +LL KGLGRP++    +P           G      KGLGRP + +       G P  +G   LLRKGLGR  + + L  +     +G   L
Subjt:  SHPR-GAYTLLGKGLGRPRK----FPSLSSSCHGHPRGAYTLLGKGLGRPRKFQSLSSSCHGHP--RGAYTLLRKGLGRPRKFQSLSSSCHGHPRGAYTL

Query:  LGKGLGRPRKFPSLSSSCHGHP--RGAYTLLGKGLGRPRKFPSLSSSCH---GHP----------RGAYTLLGKGLGRPRKFQSLSSSCH---GHP--RG
        L KGLG P +         G P  +G    L KGLGRP++   L        G P          +G   LL KGLGR  + + L S      G P  +G
Subjt:  LGKGLGRPRKFPSLSSSCHGHP--RGAYTLLGKGLGRPRKFPSLSSSCH---GHP----------RGAYTLLGKGLGRPRKFQSLSSSCH---GHP--RG

Query:  AYTLLGKGLGRPRKFQSLSSSCHGHP--RGAYTLLGKGLGRPRKFPSLSSSYHGHP--RGAYTLLGKGLGRPRKFPSLSSSCHGHPRGAYTLLGKGLGRP
          +LL KGLG P +         G P  +G   LL KGLGR  +         G P  +G   LL K LGRP +   L S      +G  +LL  GLGRP
Subjt:  AYTLLGKGLGRPRKFQSLSSSCHGHP--RGAYTLLGKGLGRPRKFPSLSSSYHGHP--RGAYTLLGKGLGRPRKFPSLSSSCHGHPRGAYTLLGKGLGRP

Query:  RK
        ++
Subjt:  RK

A0A6Q2ZHZ9 Uncharacterized protein4.6e-1431.62Show/hide
Query:  KSLGRPRKFMSLSSSCHSHLRGAYTLLGKSLGRPRKFMSLSSSCHSHPRGAYTLLGKSLGRPRKFPSLSSSCHSHPRGVYTLLGKSLGRPRKFSSLSSSC
        K +   +K +SLS S    L  + +   +  GRP K  SLS S       + T   +  GRP K  SLS S         T   +  GRP K  SLS S 
Subjt:  KSLGRPRKFMSLSSSCHSHLRGAYTLLGKSLGRPRKFMSLSSSCHSHPRGAYTLLGKSLGRPRKFPSLSSSCHSHPRGVYTLLGKSLGRPRKFSSLSSSC

Query:  HSHPRGAYTLLGKGLGRPRKFPSLSSSCHGHPRGAYTLLGKGLGRPRKFQSLSSSCHGHPRGAYTLLRKGLGRPRKFQSLSSSCHGHPRGAYTLLGKGLG
              + T   +G GRP K  SLS S       + T   +G GRP K +SLS S      G+ + L++G GR  K +S+S S       + T   KG G
Subjt:  HSHPRGAYTLLGKGLGRPRKFPSLSSSCHGHPRGAYTLLGKGLGRPRKFQSLSSSCHGHPRGAYTLLRKGLGRPRKFQSLSSSCHGHPRGAYTLLGKGLG

Query:  RPRKFPSLSSSCHGHPRGAYTLLGKGLGRPRKFPSLSSSCHGHPRGAYTLLGKGLGRPRKFQSLSSSC-------HGHPRGAYTLLGKGLGRPRKF----
        RP K  S+S S       + T L +G GRP K  S+S S    P G  +   KG GRP+   S   +         G P+ +     K  GRP+K     
Subjt:  RPRKFPSLSSSCHGHPRGAYTLLGKGLGRPRKFPSLSSSCHGHPRGAYTLLGKGLGRPRKFQSLSSSC-------HGHPRGAYTLLGKGLGRPRKF----

Query:  -QSLSSSCHGHPRG---AYTLLGKGLGRPRKFPSLSSSYHGHPRGAYTLLGKGLGRPRKFPSLSSSCHGHPRGAYTLLGKGLGRPRKFP
         + L  +C  +P G      L  +  GRP K   LS+   G P      L    GRP+K  +         R +   + K LGRPR  P
Subjt:  -QSLSSSCHGHPRG---AYTLLGKGLGRPRKFPSLSSSYHGHPRGAYTLLGKGLGRPRKFPSLSSSCHGHPRGAYTLLGKGLGRPRKFP

F2EDU9 Predicted protein (Fragment)3.8e-0831.09Show/hide
Query:  HSHPRGVYTLLGKSLGRPRKFSSLSSSCHSHPRGAYTLLGKGLGRPRKFPSLSSS--CHGHPRGAYTLLGKGLGRPRKFQSLS-SSCHGHPR----GAYT
        +SH   +     +  GR R    L S+      GA +L  KG GRPRK   L SS     + +G ++   +G GRPRK  S       G PR    G  T
Subjt:  HSHPRGVYTLLGKSLGRPRKFSSLSSSCHSHPRGAYTLLGKGLGRPRKFPSLSSS--CHGHPRGAYTLLGKGLGRPRKFQSLS-SSCHGHPR----GAYT

Query:  LLRKGLGRPRKFQSLSSSCHGHPRGAYTLLGKGLGRPRKFPSLSSSCHGHPRGAYTLLGKGLGRPRKFPSLSSSCHGHPRGAYTLLGKGLGRPRKFQSLS
         +++G GRPRK +S         +G  T +  G GRPRK  S          G  T L  G GRPRK P   S+       + T             SL+
Subjt:  LLRKGLGRPRKFQSLSSSCHGHPRGAYTLLGKGLGRPRKFPSLSSSCHGHPRGAYTLLGKGLGRPRKFPSLSSSCHGHPRGAYTLLGKGLGRPRKFQSLS

Query:  SSC--HGHPRGAYTLLGKGLGRPRKFQSLSSSCHGHPRGAYTLLGKGLGRP---RKFPSLSSSYHGHPRGAYTLLGKGLGRPRKFPSLSSSCHGHPRGAY
         S    G P      L +G G P    +  ++     RG    L  G  RP   R  P   S+       A T + K LGRPRK  S  +       G  
Subjt:  SSC--HGHPRGAYTLLGKGLGRPRKFQSLSSSCHGHPRGAYTLLGKGLGRP---RKFPSLSSSYHGHPRGAYTLLGKGLGRPRKFPSLSSSCHGHPRGAY

Query:  TLLGKGLGRPRK
        T + K  GRPRK
Subjt:  TLLGKGLGRPRK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCTAAGTCAGAAAACCGAGGAGAAGCTACAGATAGTAAAGGGAAGAGTCAAGAATCGAGTTTCGGAACCCTTCTTCCATGGAATTCGGTGGTGTTTCGGGGCGAACC
AGGCGAAACCGGGGCGATTCGGGGCATCAGGGACCGAAAAGAGGTCACCGGGCTCGGCCCGCGCAAACGGGCCGAATGGTCGGCCTCGGCCTTTTGCCGAGGCCGACCAT
ACGGGTCGGGTCATTTTGGCCCGACCCTTTGGTCCGGTCTTCCTCTGGGTCGGTCTTTTGGTCCTACCTCTGCCCGATTGTCCTCGTCAGCTTCTTGTCCATCTGTGTGG
TCCAAAATCACCTATAACATTAAGCCCCCACTCTTGAATTGGGATTCAGACGTCCAGGCTTATGTGCTAAAAAGTAAAGTGACTGTGCTAAAGTCACGCTGCCTTGAATG
TGTGCCTGATGCTTGGCCTTGGAGTTGGGATGCTTGGCCTTGGCATGACGATGCGTACACTCTTCTGGGGAAAAGCTTGGGGAGGCCTAGGAAATTCATGAGCCTCTCTT
CAAGCTGTCATAGTCATCTCAGGGGTGCGTACACTCTTCTGGGGAAAAGCTTGGGGAGGCCTAGGAAATTCATGAGCCTCTCTTCAAGCTGTCATAGTCATCCCAGGGGT
GCGTACACTCTTCTGGGGAAAAGCTTGGGGAGGCCGAGGAAATTCCCGAGCCTCTCTTCAAGCTGTCATAGTCATCCTAGGGGTGTGTACACTCTTCTGGGGAAAAGCTT
GGGGAGGCCGAGGAAATTCTCGAGCCTCTCTTCAAGCTGTCATAGTCATCCCAGGGGTGCGTACACTCTTCTGGGGAAAGGCTTGGGGAGGCCTAGGAAATTCCCGAGCC
TCTCTTCAAGCTGTCATGGTCATCCCAGGGGTGCGTACACTCTTCTGGGGAAAGGCTTGGGGAGGCCAAGGAAATTCCAGAGCCTCTCTTCAAGCTGTCATGGTCATCCC
AGGGGTGCGTACACTCTTCTGAGGAAAGGCTTGGGGAGGCCAAGGAAATTCCAGAGCCTCTCTTCAAGCTGTCATGGTCATCCCAGGGGTGCGTACACTCTTCTGGGGAA
AGGCTTGGGGAGGCCAAGGAAATTCCCGAGCCTCTCTTCAAGCTGTCATGGTCATCCCAGGGGTGCGTACACTCTTCTAGGGAAAGGCTTGGGGAGGCCTAGGAAATTCC
CGAGCCTCTCTTCAAGCTGTCATGGTCATCCCAGGGGTGCGTACACTCTTCTGGGGAAAGGCTTGGGGAGGCCTAGGAAATTCCAGAGCCTCTCTTCAAGCTGTCATGGT
CATCCCAGGGGTGCGTACACTCTTCTGGGGAAAGGCTTGGGGAGGCCAAGGAAATTCCAGAGCCTCTCTTCAAGCTGTCATGGTCATCCCAGGGGTGCGTACACTCTTCT
GGGGAAAGGCTTGGGGAGGCCAAGGAAATTCCCGAGCCTCTCTTCAAGCTATCATGGTCATCCCAGGGGTGCGTACACTCTTCTGGGGAAAGGCTTGGGGAGGCCAAGGA
AATTCCCGAGCCTCTCTTCAAGCTGTCATGGTCATCCCAGGGGTGCGTACACTCTTCTGGGGAAAGGCTTGGGGAGGCCTAGGAAATTCCCGAGCCTCTCTTCAAGCTGC
TGTATTCATCCCGAGGGTGCGTACACTCAGTCGGGGGAAGTGCAGAAGTCTGGCCTCTACAAGGATCATTCTCGTCTGGCCACACGCGTTACATCCACGAAGAAACGCCA
CGGACATGTGTTATACAACACGAAAGGAAAATCAAAACTGTCGCCACAAGACCGTGGACTCGACCTGTCCTTACCCCTACCCCCCACTCAAAATGTGCATTCAATATCTT
GGACATGCATTTTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGCCTAAGTCAGAAAACCGAGGAGAAGCTACAGATAGTAAAGGGAAGAGTCAAGAATCGAGTTTCGGAACCCTTCTTCCATGGAATTCGGTGGTGTTTCGGGGCGAACC
AGGCGAAACCGGGGCGATTCGGGGCATCAGGGACCGAAAAGAGGTCACCGGGCTCGGCCCGCGCAAACGGGCCGAATGGTCGGCCTCGGCCTTTTGCCGAGGCCGACCAT
ACGGGTCGGGTCATTTTGGCCCGACCCTTTGGTCCGGTCTTCCTCTGGGTCGGTCTTTTGGTCCTACCTCTGCCCGATTGTCCTCGTCAGCTTCTTGTCCATCTGTGTGG
TCCAAAATCACCTATAACATTAAGCCCCCACTCTTGAATTGGGATTCAGACGTCCAGGCTTATGTGCTAAAAAGTAAAGTGACTGTGCTAAAGTCACGCTGCCTTGAATG
TGTGCCTGATGCTTGGCCTTGGAGTTGGGATGCTTGGCCTTGGCATGACGATGCGTACACTCTTCTGGGGAAAAGCTTGGGGAGGCCTAGGAAATTCATGAGCCTCTCTT
CAAGCTGTCATAGTCATCTCAGGGGTGCGTACACTCTTCTGGGGAAAAGCTTGGGGAGGCCTAGGAAATTCATGAGCCTCTCTTCAAGCTGTCATAGTCATCCCAGGGGT
GCGTACACTCTTCTGGGGAAAAGCTTGGGGAGGCCGAGGAAATTCCCGAGCCTCTCTTCAAGCTGTCATAGTCATCCTAGGGGTGTGTACACTCTTCTGGGGAAAAGCTT
GGGGAGGCCGAGGAAATTCTCGAGCCTCTCTTCAAGCTGTCATAGTCATCCCAGGGGTGCGTACACTCTTCTGGGGAAAGGCTTGGGGAGGCCTAGGAAATTCCCGAGCC
TCTCTTCAAGCTGTCATGGTCATCCCAGGGGTGCGTACACTCTTCTGGGGAAAGGCTTGGGGAGGCCAAGGAAATTCCAGAGCCTCTCTTCAAGCTGTCATGGTCATCCC
AGGGGTGCGTACACTCTTCTGAGGAAAGGCTTGGGGAGGCCAAGGAAATTCCAGAGCCTCTCTTCAAGCTGTCATGGTCATCCCAGGGGTGCGTACACTCTTCTGGGGAA
AGGCTTGGGGAGGCCAAGGAAATTCCCGAGCCTCTCTTCAAGCTGTCATGGTCATCCCAGGGGTGCGTACACTCTTCTAGGGAAAGGCTTGGGGAGGCCTAGGAAATTCC
CGAGCCTCTCTTCAAGCTGTCATGGTCATCCCAGGGGTGCGTACACTCTTCTGGGGAAAGGCTTGGGGAGGCCTAGGAAATTCCAGAGCCTCTCTTCAAGCTGTCATGGT
CATCCCAGGGGTGCGTACACTCTTCTGGGGAAAGGCTTGGGGAGGCCAAGGAAATTCCAGAGCCTCTCTTCAAGCTGTCATGGTCATCCCAGGGGTGCGTACACTCTTCT
GGGGAAAGGCTTGGGGAGGCCAAGGAAATTCCCGAGCCTCTCTTCAAGCTATCATGGTCATCCCAGGGGTGCGTACACTCTTCTGGGGAAAGGCTTGGGGAGGCCAAGGA
AATTCCCGAGCCTCTCTTCAAGCTGTCATGGTCATCCCAGGGGTGCGTACACTCTTCTGGGGAAAGGCTTGGGGAGGCCTAGGAAATTCCCGAGCCTCTCTTCAAGCTGC
TGTATTCATCCCGAGGGTGCGTACACTCAGTCGGGGGAAGTGCAGAAGTCTGGCCTCTACAAGGATCATTCTCGTCTGGCCACACGCGTTACATCCACGAAGAAACGCCA
CGGACATGTGTTATACAACACGAAAGGAAAATCAAAACTGTCGCCACAAGACCGTGGACTCGACCTGTCCTTACCCCTACCCCCCACTCAAAATGTGCATTCAATATCTT
GGACATGCATTTTTTGA
Protein sequenceShow/hide protein sequence
MPKSENRGEATDSKGKSQESSFGTLLPWNSVVFRGEPGETGAIRGIRDRKEVTGLGPRKRAEWSASAFCRGRPYGSGHFGPTLWSGLPLGRSFGPTSARLSSSASCPSVW
SKITYNIKPPLLNWDSDVQAYVLKSKVTVLKSRCLECVPDAWPWSWDAWPWHDDAYTLLGKSLGRPRKFMSLSSSCHSHLRGAYTLLGKSLGRPRKFMSLSSSCHSHPRG
AYTLLGKSLGRPRKFPSLSSSCHSHPRGVYTLLGKSLGRPRKFSSLSSSCHSHPRGAYTLLGKGLGRPRKFPSLSSSCHGHPRGAYTLLGKGLGRPRKFQSLSSSCHGHP
RGAYTLLRKGLGRPRKFQSLSSSCHGHPRGAYTLLGKGLGRPRKFPSLSSSCHGHPRGAYTLLGKGLGRPRKFPSLSSSCHGHPRGAYTLLGKGLGRPRKFQSLSSSCHG
HPRGAYTLLGKGLGRPRKFQSLSSSCHGHPRGAYTLLGKGLGRPRKFPSLSSSYHGHPRGAYTLLGKGLGRPRKFPSLSSSCHGHPRGAYTLLGKGLGRPRKFPSLSSSC
CIHPEGAYTQSGEVQKSGLYKDHSRLATRVTSTKKRHGHVLYNTKGKSKLSPQDRGLDLSLPLPPTQNVHSISWTCIF