; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0022687 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0022687
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionCCHC-type domain-containing protein
Genome locationchr7:35548021..35549041
RNA-Seq ExpressionLag0022687
SyntenyLag0022687
Gene Ontology termsNA
InterPro domainsIPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR040256 - Uncharacterized protein At4g02000-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022132681.1 uncharacterized protein LOC111005481 [Momordica charantia]4.5e-4137.85Show/hide
Query:  SLRLSSEEEEKSVSADRDAIERSDVLLGFCLLGKLLCRRPLGSEVMRRNFRSAWKIE-NGLQADRLGKNLFIFRFVNEMDRVRVVRQGPWLFEKSLLVLV
        + +L+SEE++ +V  D  A+E +   L   L+ KLL +R +   V++   + AWK++      D +G N+F+F F    DR R++R GPW F+++L+++ 
Subjt:  SLRLSSEEEEKSVSADRDAIERSDVLLGFCLLGKLLCRRPLGSEVMRRNFRSAWKIE-NGLQADRLGKNLFIFRFVNEMDRVRVVRQGPWLFEKSLLVLV

Query:  LPIHKLKPADPPFSFVSFLIHIFDLPLDWFNQAMAERTGNAMGVYEDVDSRTGFLFWGASLRIRVRIDLDHPIRRGLRIYPYGPLSGLWVSVRYECLLEL
         P+   KP D  F  VS  +H FDL L   N+ MA R GNA+G++EDV+S      WG+ LR+RVR D+  P+ RG+++   GP+ G W+ ++YE L + 
Subjt:  LPIHKLKPADPPFSFVSFLIHIFDLPLDWFNQAMAERTGNAMGVYEDVDSRTGFLFWGASLRIRVRIDLDHPIRRGLRIYPYGPLSGLWVSVRYECLLEL

Query:  CSFCGIIGHVLRDC
           CG + H+L+DC
Subjt:  CSFCGIIGHVLRDC

XP_022156185.1 uncharacterized protein LOC111023135 [Momordica charantia]6.9e-5045.02Show/hide
Query:  RLSSEEEEKSVSADRDAIERSDVLLGFCLLGKLLCRRPLGSEVMRRNFRSAWKIENGLQADRLGKNLFIFRFVNEMDRVRVVRQGPWLFEKSLLVLVLPI
        +L+SEE+E ++  D DA++ ++  L + L+GKLL +R + ++V+ R    AWK+E+ L  + +GKNLF+F F  E D  RV++ GPW F+K+L+VL  P 
Subjt:  RLSSEEEEKSVSADRDAIERSDVLLGFCLLGKLLCRRPLGSEVMRRNFRSAWKIENGLQADRLGKNLFIFRFVNEMDRVRVVRQGPWLFEKSLLVLVLPI

Query:  HKLKPADPPFSFVSFLIHIFDLPLDWFNQAMAERTGNAMGVYEDVDSRTGFLFWGASLRIRVRIDLDHPIRRGLRIYPYGPLSGLWVSVRYECLLELCSF
             ++  F+ V+F IH+FDLP+ W N+ MA R GNA+G + DVD       WGASLRIRV ID+  P+RRG++I   GP+ G W+ ++YE L + C F
Subjt:  HKLKPADPPFSFVSFLIHIFDLPLDWFNQAMAERTGNAMGVYEDVDSRTGFLFWGASLRIRVRIDLDHPIRRGLRIYPYGPLSGLWVSVRYECLLELCSF

Query:  CGIIGHVLRDC
        CG+IGH   DC
Subjt:  CGIIGHVLRDC

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]3.0e-3736.15Show/hide
Query:  DSLSTVASLRLSSEEEEKSVSADRDAIERSDVLLGFCLLGKLLCRRPLGSEVMRRNFRSAWKIE-NGLQADRLGKNLFIFRFVNEMDRVRVVRQGPWLFE
        D L    + +L+SEEEE ++  D  A   +   L   L+GKL  +RP+   VM+   R+AWK+E N  +   LG NLF+F F   +DR ++ + GPW F+
Subjt:  DSLSTVASLRLSSEEEEKSVSADRDAIERSDVLLGFCLLGKLLCRRPLGSEVMRRNFRSAWKIE-NGLQADRLGKNLFIFRFVNEMDRVRVVRQGPWLFE

Query:  KSLLVLVLPIHKLKPADPPFSFVSFLIHIFDLPLDWFNQAMAERTGNAMGVYEDVDSRTGFLFWGASLRIRVRIDLDHPIRRGLRIYPYGPLSGLWVSVR
        ++L+++  P+  + P++  F+ +   +  FDLPL    + MA R GNA+G +E+ D       WG++LR+RV +D+  P+RRG+++   GP+ G W+ ++
Subjt:  KSLLVLVLPIHKLKPADPPFSFVSFLIHIFDLPLDWFNQAMAERTGNAMGVYEDVDSRTGFLFWGASLRIRVRIDLDHPIRRGLRIYPYGPLSGLWVSVR

Query:  YECLLELCSFCGI
        YE L + C  CG+
Subjt:  YECLLELCSFCGI

XP_028122006.1 uncharacterized protein LOC114319195 [Camellia sinensis]9.6e-3636.4Show/hide
Query:  LSTVASLRLSSEEEEKSVSADRDAIERSDVLLG---FCLLGKLLCRRPLGSEVMRRNFRSAWKIENGLQADRLGKNLFIFRFVNEMDRVRVVRQGPWLFE
        L    +L L+SEE+    +  R   E + +++G    CL+GKLL RRP   E M+    S W+   G+Q   +G NLF+F F + +D+ RV+  GPW F+
Subjt:  LSTVASLRLSSEEEEKSVSADRDAIERSDVLLG---FCLLGKLLCRRPLGSEVMRRNFRSAWKIENGLQADRLGKNLFIFRFVNEMDRVRVVRQGPWLFE

Query:  KSLLVLVLPIHKLKPADPPFSFVSFLIHIFDLPLDWFNQAMAERTGNAMGVYEDVDSRTGFLFWGASLRIRVRIDLDHPIRRGLRIYPYGPLSGLWVSVR
        K LL+L      ++P+D   + V F +H+ +LPL   N+ + E  GNA+G + D+D   G + WG ++RIRV +D+  P+RRG+++        +WV  +
Subjt:  KSLLVLVLPIHKLKPADPPFSFVSFLIHIFDLPLDWFNQAMAERTGNAMGVYEDVDSRTGFLFWGASLRIRVRIDLDHPIRRGLRIYPYGPLSGLWVSVR

Query:  YECLLELCSFCGIIGHVLRDC-VQFNSTEQGLVPPPQYG
        YE L   C FCG +GH  R+C  + +S +   V   QYG
Subjt:  YECLLELCSFCGIIGHVLRDC-VQFNSTEQGLVPPPQYG

XP_028124075.1 uncharacterized protein LOC114321128 [Camellia sinensis]6.2e-3536.84Show/hide
Query:  SVSADRDAIER-----SDVLLG---FCLLGKLLCRRPLGSEVMRRNFRSAWKIENGLQADRLGKNLFIFRFVNEMDRVRVVRQGPWLFEKSLLVLVLPIH
        S++++ DA+ R     + +++G    CL+GKLL RRP   E M+    S W+   G+Q   +G NLF+F F + +D+ RV+  GPW F+K LL+L     
Subjt:  SVSADRDAIER-----SDVLLG---FCLLGKLLCRRPLGSEVMRRNFRSAWKIENGLQADRLGKNLFIFRFVNEMDRVRVVRQGPWLFEKSLLVLVLPIH

Query:  KLKPADPPFSFVSFLIHIFDLPLDWFNQAMAERTGNAMGVYEDVDSRTGFLFWGASLRIRVRIDLDHPIRRGLRIYPYGPLSGLWVSVRYECLLELCSFC
         ++P+D   + V F +H+ +LPL   N+ + +  GNA+G + D+D   G + WG ++RIRV ID+  P+RRG+++        +WV  +YE L   C FC
Subjt:  KLKPADPPFSFVSFLIHIFDLPLDWFNQAMAERTGNAMGVYEDVDSRTGFLFWGASLRIRVRIDLDHPIRRGLRIYPYGPLSGLWVSVRYECLLELCSFC

Query:  GIIGHVLRDCVQFNSTEQGL-VPPPQYG
        G +GH  R+C    S   G  V   QYG
Subjt:  GIIGHVLRDCVQFNSTEQGL-VPPPQYG

TrEMBL top hitse value%identityAlignment
A0A2N9E6P9 CCHC-type domain-containing protein3.7e-3337.61Show/hide
Query:  SEEEEKSVSADRDAIERSDVLLGFCLLGKLLCRRPLGSEVMRRNFRSAWKIENGLQADRLGKNLFIFRFVNEMDRVRVVRQGPWLFEKSLLVLVLPIHKL
        +E+E    + + DA+  S V    CLLGKL+  +P     ++      W +  G +A  +G NLFIF+F +E +R RV+   PWLF   LL L       
Subjt:  SEEEEKSVSADRDAIERSDVLLGFCLLGKLLCRRPLGSEVMRRNFRSAWKIENGLQADRLGKNLFIFRFVNEMDRVRVVRQGPWLFEKSLLVLVLPIHKL

Query:  KPADPPFSFVSFLIHIFDLPLDWFNQAMAERTGNAMGVYEDVDSRTGFLFWGASLRIRVRIDLDHPIRRGLRIYPYGPLSGLWVSVRYECLLELCSFCGI
          A   FS   F + +  +PL++  +   +R GNAMG    VD   G + WG  LRIR+ +D   PI RG R+  +     LWVS +YE L  LC FCG+
Subjt:  KPADPPFSFVSFLIHIFDLPLDWFNQAMAERTGNAMGVYEDVDSRTGFLFWGASLRIRVRIDLDHPIRRGLRIYPYGPLSGLWVSVRYECLLELCSFCGI

Query:  IGHVLRDCV-QFNSTEQGLVPPPQYG
        +GH  RDCV    S  QG     QYG
Subjt:  IGHVLRDCV-QFNSTEQGLVPPPQYG

A0A6J1BSZ1 uncharacterized protein LOC1110054812.2e-4137.85Show/hide
Query:  SLRLSSEEEEKSVSADRDAIERSDVLLGFCLLGKLLCRRPLGSEVMRRNFRSAWKIE-NGLQADRLGKNLFIFRFVNEMDRVRVVRQGPWLFEKSLLVLV
        + +L+SEE++ +V  D  A+E +   L   L+ KLL +R +   V++   + AWK++      D +G N+F+F F    DR R++R GPW F+++L+++ 
Subjt:  SLRLSSEEEEKSVSADRDAIERSDVLLGFCLLGKLLCRRPLGSEVMRRNFRSAWKIE-NGLQADRLGKNLFIFRFVNEMDRVRVVRQGPWLFEKSLLVLV

Query:  LPIHKLKPADPPFSFVSFLIHIFDLPLDWFNQAMAERTGNAMGVYEDVDSRTGFLFWGASLRIRVRIDLDHPIRRGLRIYPYGPLSGLWVSVRYECLLEL
         P+   KP D  F  VS  +H FDL L   N+ MA R GNA+G++EDV+S      WG+ LR+RVR D+  P+ RG+++   GP+ G W+ ++YE L + 
Subjt:  LPIHKLKPADPPFSFVSFLIHIFDLPLDWFNQAMAERTGNAMGVYEDVDSRTGFLFWGASLRIRVRIDLDHPIRRGLRIYPYGPLSGLWVSVRYECLLEL

Query:  CSFCGIIGHVLRDC
           CG + H+L+DC
Subjt:  CSFCGIIGHVLRDC

A0A6J1DU55 uncharacterized protein LOC1110231353.3e-5045.02Show/hide
Query:  RLSSEEEEKSVSADRDAIERSDVLLGFCLLGKLLCRRPLGSEVMRRNFRSAWKIENGLQADRLGKNLFIFRFVNEMDRVRVVRQGPWLFEKSLLVLVLPI
        +L+SEE+E ++  D DA++ ++  L + L+GKLL +R + ++V+ R    AWK+E+ L  + +GKNLF+F F  E D  RV++ GPW F+K+L+VL  P 
Subjt:  RLSSEEEEKSVSADRDAIERSDVLLGFCLLGKLLCRRPLGSEVMRRNFRSAWKIENGLQADRLGKNLFIFRFVNEMDRVRVVRQGPWLFEKSLLVLVLPI

Query:  HKLKPADPPFSFVSFLIHIFDLPLDWFNQAMAERTGNAMGVYEDVDSRTGFLFWGASLRIRVRIDLDHPIRRGLRIYPYGPLSGLWVSVRYECLLELCSF
             ++  F+ V+F IH+FDLP+ W N+ MA R GNA+G + DVD       WGASLRIRV ID+  P+RRG++I   GP+ G W+ ++YE L + C F
Subjt:  HKLKPADPPFSFVSFLIHIFDLPLDWFNQAMAERTGNAMGVYEDVDSRTGFLFWGASLRIRVRIDLDHPIRRGLRIYPYGPLSGLWVSVRYECLLELCSF

Query:  CGIIGHVLRDC
        CG+IGH   DC
Subjt:  CGIIGHVLRDC

A0A6J1DX30 uncharacterized protein LOC1110248741.5e-3736.15Show/hide
Query:  DSLSTVASLRLSSEEEEKSVSADRDAIERSDVLLGFCLLGKLLCRRPLGSEVMRRNFRSAWKIE-NGLQADRLGKNLFIFRFVNEMDRVRVVRQGPWLFE
        D L    + +L+SEEEE ++  D  A   +   L   L+GKL  +RP+   VM+   R+AWK+E N  +   LG NLF+F F   +DR ++ + GPW F+
Subjt:  DSLSTVASLRLSSEEEEKSVSADRDAIERSDVLLGFCLLGKLLCRRPLGSEVMRRNFRSAWKIE-NGLQADRLGKNLFIFRFVNEMDRVRVVRQGPWLFE

Query:  KSLLVLVLPIHKLKPADPPFSFVSFLIHIFDLPLDWFNQAMAERTGNAMGVYEDVDSRTGFLFWGASLRIRVRIDLDHPIRRGLRIYPYGPLSGLWVSVR
        ++L+++  P+  + P++  F+ +   +  FDLPL    + MA R GNA+G +E+ D       WG++LR+RV +D+  P+RRG+++   GP+ G W+ ++
Subjt:  KSLLVLVLPIHKLKPADPPFSFVSFLIHIFDLPLDWFNQAMAERTGNAMGVYEDVDSRTGFLFWGASLRIRVRIDLDHPIRRGLRIYPYGPLSGLWVSVR

Query:  YECLLELCSFCGI
        YE L + C  CG+
Subjt:  YECLLELCSFCGI

A0A6P6S3G2 uncharacterized protein LOC1136872933.7e-3336.09Show/hide
Query:  SLRLSSEEEEKSVSADRDAIERSDVLLGFCLLGKLLCRRPLGSEVMRRNFRSAWKIENGLQADRLGKNLFIFRFVNEMDRVRVVRQGPWLFEKSLLVLVL
        +LRL  +EEE  +    DA +  D L   C+LGKL  R+    E +    +  W    GL    LG NLF+F+F + +D+ +V   GPW F+ +LLV+  
Subjt:  SLRLSSEEEEKSVSADRDAIERSDVLLGFCLLGKLLCRRPLGSEVMRRNFRSAWKIENGLQADRLGKNLFIFRFVNEMDRVRVVRQGPWLFEKSLLVLVL

Query:  PIHKLKPADPPFSFVSFLIHIFDLPLDWFNQAMAERTGNAMGVYEDVDSRTGFLFWGASLRIRVRIDLDHPIRRGLRIYPYGPLSGLWVSVRYECLLELC
         I  ++  +      SF + +++LPL W N   AE  GN +GVYE  + R     WG  LRIRV+I L+ P++R + ++  G +    V  +YE L  LC
Subjt:  PIHKLKPADPPFSFVSFLIHIFDLPLDWFNQAMAERTGNAMGVYEDVDSRTGFLFWGASLRIRVRIDLDHPIRRGLRIYPYGPLSGLWVSVRYECLLELC

Query:  SFCGIIGHVLRDCVQFNSTEQGLVPPPQYG
         +CG IGH  RDC             PQYG
Subjt:  SFCGIIGHVLRDCVQFNSTEQGLVPPPQYG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G01050.1 zinc ion binding;nucleic acid binding3.6e-1222.22Show/hide
Query:  EEEEKSVSADRDAIERSDVLLGFCLLGKLLCRRPLGSEVMRRNFRSAWKIENGLQADRLGKNLFIFRFVNEMDRVRVVRQGPWLFEKSLLVLVLPIHKLK
        E+EE  ++   + +E  + L   C++ K+L  + +   V+ R  R  WK    +    L +  F+ RF  E + +  +  GPW    + L++     +  
Subjt:  EEEEKSVSADRDAIERSDVLLGFCLLGKLLCRRPLGSEVMRRNFRSAWKIENGLQADRLGKNLFIFRFVNEMDRVRVVRQGPWLFEKSLLVLVLPIHKLK

Query:  PADPPFSFVSFLIHIFDLPLDWFNQAMAERTGNAMGVYEDVDSRTGFLFWGASLRIRVRIDLDHPIRRGLRIYPYGPLSGLWVSVRYECLLELCSFCGII
        P           + + ++P +++++ +       +G    VD  T     G   R+ + ++L  P++  + I      +G    V YE L ++CS CGI 
Subjt:  PADPPFSFVSFLIHIFDLPLDWFNQAMAERTGNAMGVYEDVDSRTGFLFWGASLRIRVRIDLDHPIRRGLRIYPYGPLSGLWVSVRYECLLELCSFCGII

Query:  GHVLRDC
        GH++  C
Subjt:  GHVLRDC

AT2G13450.1 unknown protein3.8e-0623.35Show/hide
Query:  AWKIENGLQADRLGKNLFIFRFVNEMDRVRVVRQGPWLFEKSLLV-----LVLPIHKLKPADPPFSFVSFLIHIFDLPLDWFNQAMAERTGNAMGVYEDV
        AW + N +    L      F F +E+D + V+R+ PWL+    +      + L  H L       + +   + +  +PL +  +  A    + +G    +
Subjt:  AWKIENGLQADRLGKNLFIFRFVNEMDRVRVVRQGPWLFEKSLLV-----LVLPIHKLKPADPPFSFVSFLIHIFDLPLDWFNQAMAERTGNAMGVYEDV

Query:  DSRTGFLFWGASLRIRVRIDLDHPIRRGLRIYPYGPLSGLWVSVRYECLLELCSFCGIIGHVLRDCV
        D         A +R+R+R  +   +R  LRI      + L +S +YE L  +CS C  + H    C+
Subjt:  DSRTGFLFWGASLRIRVRIDLDHPIRRGLRIYPYGPLSGLWVSVRYECLLELCSFCGIIGHVLRDCV

AT3G31430.1 unknown protein5.5e-1327.63Show/hide
Query:  FIFRFVNEMDRVRVVRQGPWLFEKSLLVLVLPIHKLKPADPPFSFVSFLIHIFDLPLDWFNQAMAERTGNAMGVYEDVDSRTGFLFWGASLRIRVRIDLD
        F F F  E     V+R+GPW F   +++L     + +P  P F F+ F + I  +P  + N+ + E  G A+G   D D     +      R+ +  D+ 
Subjt:  FIFRFVNEMDRVRVVRQGPWLFEKSLLVLVLPIHKLKPADPPFSFVSFLIHIFDLPLDWFNQAMAERTGNAMGVYEDVDSRTGFLFWGASLRIRVRIDLD

Query:  HPIRRGLRIYPYGPLSGLWVSVRYECLLELCSFCGIIGHVLRDCVQFNSTEQ
        HP+ R  R + +       +  RYE L   C  CG++ H    C+  N  E+
Subjt:  HPIRRGLRIYPYGPLSGLWVSVRYECLLELCSFCGIIGHVLRDCVQFNSTEQ

AT5G36228.1 nucleic acid binding;zinc ion binding1.5e-1021.74Show/hide
Query:  LLGKLLCRRPLGSEVMRRNFRSAWKIENGLQADRLGKNLFIFRFVNEMDRVRVVRQGPWLFEKSLLVLVLPIHKLKPADPPFSFVSFLIHIFDLPLDWFN
        LLG++L  +    E         W +   +    L    F  RF +E+D +  +R+ PW+F +    + L   +  P +   +F+   +HI  +PL + +
Subjt:  LLGKLLCRRPLGSEVMRRNFRSAWKIENGLQADRLGKNLFIFRFVNEMDRVRVVRQGPWLFEKSLLVLVLPIHKLKPADPPFSFVSFLIHIFDLPLDWFN

Query:  QAMAERTGNAMGVYEDVDSRTGFLFWGASLRIRVRIDLDHPIRRGLRIYPYGPLSGLWVSVRYECLLELCSFCGIIGHVLRDC------VQFNSTEQGLV
        +   E   + +G    +D           +R++VR+D   P+R   R+  +       +   YE L  +C+ C  + H +  C       + ++    LV
Subjt:  QAMAERTGNAMGVYEDVDSRTGFLFWGASLRIRVRIDLDHPIRRGLRIYPYGPLSGLWVSVRYECLLELCSFCGIIGHVLRDC------VQFNSTEQGLV

Query:  PPPQYGD
         P +Y D
Subjt:  PPPQYGD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGGGATTTCAGGCGAAGATTAAGGGGCCTCTGTCTGAAGGGGCTTTAATCAGCGTGATCAGGGCCGATTGTGTCAGGCTGGTTTATGCTATTTCGGTAATGGAGCG
GATGTTTTTAATCAATCCAGAGCCGTTTTCAAATCTGGAGGATCCCTCCAGCGGATTTTTAGCTTTGAGGCTACTATTTCAGACAGTGCGATGGAGGCTGAGAGATTCAC
TGTCAACAGTGGCTAGCTTGAGGCTCTCTTCGGAGGAGGAAGAGAAATCGGTGTCGGCTGATCGTGATGCAATCGAAAGATCAGACGTGCTGCTTGGCTTTTGTTTGTTG
GGAAAGTTGTTATGTCGTAGACCTCTGGGTTCTGAGGTGATGAGACGCAATTTTCGGTCAGCTTGGAAGATTGAAAATGGCCTTCAAGCAGACCGTTTGGGGAAAAATTT
GTTCATTTTTCGGTTCGTGAATGAGATGGATCGTGTTCGTGTAGTTCGACAAGGACCATGGCTTTTTGAGAAATCCCTTCTAGTGTTGGTGCTTCCAATCCACAAATTAA
AGCCTGCTGATCCTCCCTTTTCCTTCGTGTCTTTTTTGATTCATATTTTCGATCTACCTCTTGACTGGTTTAATCAGGCTATGGCGGAAAGAACCGGCAATGCCATGGGT
GTGTACGAAGATGTTGATAGCCGGACTGGATTTCTTTTCTGGGGTGCAAGCTTGAGAATCCGAGTACGGATTGATTTGGATCATCCTATTCGGAGAGGACTTCGAATTTA
TCCATATGGGCCCCTTAGCGGACTATGGGTTTCGGTCAGGTATGAGTGTTTGCTGGAGCTATGCTCCTTTTGTGGTATTATTGGTCATGTATTGCGTGATTGTGTTCAAT
TTAACAGTACAGAGCAAGGTCTGGTTCCTCCCCCTCAATATGGTGATTAG
mRNA sequenceShow/hide mRNA sequence
ATGGTGGGATTTCAGGCGAAGATTAAGGGGCCTCTGTCTGAAGGGGCTTTAATCAGCGTGATCAGGGCCGATTGTGTCAGGCTGGTTTATGCTATTTCGGTAATGGAGCG
GATGTTTTTAATCAATCCAGAGCCGTTTTCAAATCTGGAGGATCCCTCCAGCGGATTTTTAGCTTTGAGGCTACTATTTCAGACAGTGCGATGGAGGCTGAGAGATTCAC
TGTCAACAGTGGCTAGCTTGAGGCTCTCTTCGGAGGAGGAAGAGAAATCGGTGTCGGCTGATCGTGATGCAATCGAAAGATCAGACGTGCTGCTTGGCTTTTGTTTGTTG
GGAAAGTTGTTATGTCGTAGACCTCTGGGTTCTGAGGTGATGAGACGCAATTTTCGGTCAGCTTGGAAGATTGAAAATGGCCTTCAAGCAGACCGTTTGGGGAAAAATTT
GTTCATTTTTCGGTTCGTGAATGAGATGGATCGTGTTCGTGTAGTTCGACAAGGACCATGGCTTTTTGAGAAATCCCTTCTAGTGTTGGTGCTTCCAATCCACAAATTAA
AGCCTGCTGATCCTCCCTTTTCCTTCGTGTCTTTTTTGATTCATATTTTCGATCTACCTCTTGACTGGTTTAATCAGGCTATGGCGGAAAGAACCGGCAATGCCATGGGT
GTGTACGAAGATGTTGATAGCCGGACTGGATTTCTTTTCTGGGGTGCAAGCTTGAGAATCCGAGTACGGATTGATTTGGATCATCCTATTCGGAGAGGACTTCGAATTTA
TCCATATGGGCCCCTTAGCGGACTATGGGTTTCGGTCAGGTATGAGTGTTTGCTGGAGCTATGCTCCTTTTGTGGTATTATTGGTCATGTATTGCGTGATTGTGTTCAAT
TTAACAGTACAGAGCAAGGTCTGGTTCCTCCCCCTCAATATGGTGATTAG
Protein sequenceShow/hide protein sequence
MVGFQAKIKGPLSEGALISVIRADCVRLVYAISVMERMFLINPEPFSNLEDPSSGFLALRLLFQTVRWRLRDSLSTVASLRLSSEEEEKSVSADRDAIERSDVLLGFCLL
GKLLCRRPLGSEVMRRNFRSAWKIENGLQADRLGKNLFIFRFVNEMDRVRVVRQGPWLFEKSLLVLVLPIHKLKPADPPFSFVSFLIHIFDLPLDWFNQAMAERTGNAMG
VYEDVDSRTGFLFWGASLRIRVRIDLDHPIRRGLRIYPYGPLSGLWVSVRYECLLELCSFCGIIGHVLRDCVQFNSTEQGLVPPPQYGD