; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0016444 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0016444
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionCCHC-type domain-containing protein
Genome locationchr12:37860773..37862572
RNA-Seq ExpressionLag0016444
SyntenyLag0016444
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR040256 - Uncharacterized protein At4g02000-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022132681.1 uncharacterized protein LOC111005481 [Momordica charantia]3.5e-5842.17Show/hide
Query:  MDPSSLVDDWSRLSLTTDEEEISVVADHEAVDRTGLLLGFCLLGKLLCHRPMAAEVMRRNFRAAWKLD-NGLLVDRLGRNLFIFRFNNEADRIRVIRQGP
        M  S+L+++W    LT++E++I+V  D  A++ TG  L   L+ KLL  R ++  V++   + AWKLD     VD +G N+F+F FN  +DR R++R GP
Subjt:  MDPSSLVDDWSRLSLTTDEEEISVVADHEAVDRTGLLLGFCLLGKLLCHRPMAAEVMRRNFRAAWKLD-NGLLVDRLGRNLFIFRFNNEADRIRVIRQGP

Query:  WTFEKFLLVLVFPIRGIRPSDHPFSSVCFWIHVFELPLDWFNRSMAERLGNALGIFEDVDSRNNFLFWGASLRINVRIDLNRPIRRGVRIYPDGPLSGLW
        WTF++ L+++  P+   +P D  F +V  W+H F+L L   N++MA RLGNA+G+FEDV+S  N   WG+ LR+ VR D+ +P+ RG+++  DGP+ G W
Subjt:  WTFEKFLLVLVFPIRGIRPSDHPFSSVCFWIHVFELPLDWFNRSMAERLGNALGIFEDVDSRNNFLFWGASLRINVRIDLNRPIRRGVRIYPDGPLSGLW

Query:  VPIRYERLPELCSHCGIIGHLSRDCIRVLRMEREFSSSPQYGDWLRFSG
        +PI+YERLP+   HCG + H+ +DC          S + QYG WLRF G
Subjt:  VPIRYERLPELCSHCGIIGHLSRDCIRVLRMEREFSSSPQYGDWLRFSG

XP_022156185.1 uncharacterized protein LOC111023135 [Momordica charantia]8.6e-6544.69Show/hide
Query:  MDPSSLVDDWSRLSLTTDEEEISVVADHEAVDRTGLLLGFCLLGKLLCHRPMAAEVMRRNFRAAWKLDNGLLVDRLGRNLFIFRFNNEADRIRVIRQGPW
        MD  +L+ DW +  LT++E+EI++  D +AV      L + L+GKLL  R ++A+V+ R    AWK+++ L V+ +G+NLF+F F  E D  RV++ GPW
Subjt:  MDPSSLVDDWSRLSLTTDEEEISVVADHEAVDRTGLLLGFCLLGKLLCHRPMAAEVMRRNFRAAWKLDNGLLVDRLGRNLFIFRFNNEADRIRVIRQGPW

Query:  TFEKFLLVLVFPIRGIRPSDHPFSSVCFWIHVFELPLDWFNRSMAERLGNALGIFEDVDSRNNFLFWGASLRINVRIDLNRPIRRGVRIYPDGPLSGLWV
         F+K L+VL  P      S+  F+ V FWIH+F+LP+ W N++MA RLGNA+G F DVD       WGASLRI V ID+ +P+RRG++I  DGP+ G W+
Subjt:  TFEKFLLVLVFPIRGIRPSDHPFSSVCFWIHVFELPLDWFNRSMAERLGNALGIFEDVDSRNNFLFWGASLRINVRIDLNRPIRRGVRIYPDGPLSGLWV

Query:  PIRYERLPELCSHCGIIGHLSRDC-IRVLRMEREFSSSPQYGDWLRFSGKDMFTASVAVRGENPNREQETSRS
        PI+YERLP+ C  CG+IGH S DC  R L  + +  ++ +YG WLRF G     A    +G++P RE     S
Subjt:  PIRYERLPELCSHCGIIGHLSRDC-IRVLRMEREFSSSPQYGDWLRFSGKDMFTASVAVRGENPNREQETSRS

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]1.6e-5034.76Show/hide
Query:  LVDDWSRLSLTTDEEEISVVADHEAVDRTGLLLGFCLLGKLLCHRPMAAEVMRRNFRAAWKLDNGLL-VDRLGRNLFIFRFNNEADRIRVIRQGPWTFEK
        L+++W    LT++EEE ++  D  A   TG  L   L+GKL   RP+   VM+   R AWKL+N    V  LG NLF+F F    DR ++ + GPWTF++
Subjt:  LVDDWSRLSLTTDEEEISVVADHEAVDRTGLLLGFCLLGKLLCHRPMAAEVMRRNFRAAWKLDNGLL-VDRLGRNLFIFRFNNEADRIRVIRQGPWTFEK

Query:  FLLVLVFPIRGIRPSDHPFSSVCFWIHVFELPLDWFNRSMAERLGNALGIFEDVDSRNNFLFWGASLRINVRIDLNRPIRRGVRIYPDGPLSGLWVPIRY
         L+++  P+  I PS+  F+ +  W+  F+LPL    R MA RLGNALG FE+ D  +    WG++LR+ V +D+++P+RRG+++  DGP+ G W+PI+Y
Subjt:  FLLVLVFPIRGIRPSDHPFSSVCFWIHVFELPLDWFNRSMAERLGNALGIFEDVDSRNNFLFWGASLRINVRIDLNRPIRRGVRIYPDGPLSGLWVPIRY

Query:  ERLPELCSHCGIIGHLSRDCIRVLRMEREFSSSPQYGDWLRFSGKDMFTASVAVRGENPNREQ------ETSRSHRLAIAVSP---------SLEPISPV
        ERLP+ C HCG+     +                QYG WLR+ G         V+   P  +Q      + S ++  + + SP         S     P+
Subjt:  ERLPELCSHCGIIGHLSRDCIRVLRMEREFSSSPQYGDWLRFSGKDMFTASVAVRGENPNREQ------ETSRSHRLAIAVSP---------SLEPISPV

Query:  ELPYRRPSGIRINEPAEGGRLQSQNAKS
         +P   P    + E  + G   SQ  KS
Subjt:  ELPYRRPSGIRINEPAEGGRLQSQNAKS

XP_028122006.1 uncharacterized protein LOC114319195 [Camellia sinensis]2.7e-4237.96Show/hide
Query:  SLVDDWSRLSLTTDEEEISVVADHEAVDRTGLLLG---FCLLGKLLCHRPMAAEVMRRNFRAAWKLDNGLLVDRLGRNLFIFRFNNEADRIRVIRQGPWT
        SL+D    LSLT++E+ +  +      + T L++G    CL+GKLL  RP   E M+    + W+   G+ V  +G NLF+F F +  D+ RV+  GPWT
Subjt:  SLVDDWSRLSLTTDEEEISVVADHEAVDRTGLLLG---FCLLGKLLCHRPMAAEVMRRNFRAAWKLDNGLLVDRLGRNLFIFRFNNEADRIRVIRQGPWT

Query:  FEKFLLVLVFPIRGIRPSDHPFSSVCFWIHVFELPLDWFNRSMAERLGNALGIFEDVDSRNNFLFWGASLRINVRIDLNRPIRRGVRIYPDGPLSGLWVP
        F+K LL+L      ++PSD   + V FW+HV  LPL   N+ + E +GNA+G F D+D  +  + WG ++RI V +D+ +P+RRG+++        +WV 
Subjt:  FEKFLLVLVFPIRGIRPSDHPFSSVCFWIHVFELPLDWFNRSMAERLGNALGIFEDVDSRNNFLFWGASLRINVRIDLNRPIRRGVRIYPDGPLSGLWVP

Query:  IRYERLPELCSHCGIIGHLSRDC-IRVLRMEREFSSSPQYGDWLR
         +YERLP  C  CG +GH  R+C  ++   +     S QYG WLR
Subjt:  IRYERLPELCSHCGIIGHLSRDC-IRVLRMEREFSSSPQYGDWLR

XP_028124075.1 uncharacterized protein LOC114321128 [Camellia sinensis]7.1e-4339.18Show/hide
Query:  SLVDDWSRLSLTTDEEEISVVADHEAVDRTGLLLG---FCLLGKLLCHRPMAAEVMRRNFRAAWKLDNGLLVDRLGRNLFIFRFNNEADRIRVIRQGPWT
        SLVD    LSLT++E+ +  +      D T L++G    CL+GKLL  RP   E M+    + W+   G+ V  +G NLF+F F +  D+ RV+  GPWT
Subjt:  SLVDDWSRLSLTTDEEEISVVADHEAVDRTGLLLG---FCLLGKLLCHRPMAAEVMRRNFRAAWKLDNGLLVDRLGRNLFIFRFNNEADRIRVIRQGPWT

Query:  FEKFLLVLVFPIRGIRPSDHPFSSVCFWIHVFELPLDWFNRSMAERLGNALGIFEDVDSRNNFLFWGASLRINVRIDLNRPIRRGVRIYPDGPLSGLWVP
        F+K LL+L      ++PSD   + V FW+HV  LPL   N+ + + +GNA+G F D+D  +  + WG ++RI V ID+ +P+RRG+++        +WV 
Subjt:  FEKFLLVLVFPIRGIRPSDHPFSSVCFWIHVFELPLDWFNRSMAERLGNALGIFEDVDSRNNFLFWGASLRINVRIDLNRPIRRGVRIYPDGPLSGLWVP

Query:  IRYERLPELCSHCGIIGHLSRDCIRVLRM-EREFSSSPQYGDWLR
         +YERLP  C  CG +GH  R+C   L   +     S QYG WLR
Subjt:  IRYERLPELCSHCGIIGHLSRDCIRVLRM-EREFSSSPQYGDWLR

TrEMBL top hitse value%identityAlignment
A0A2N9GF83 CCHC-type domain-containing protein9.3e-4137.19Show/hide
Query:  LVDDWSRLSLTTDEEEISVVADHEAVDRTGLLLGFCLLGKLLCHRPMAAEVMRRNFRAAWKLDNGLLVDRLGRNLFIFRFNNEADRIRVIRQGPWTFEKF
        LV+DW R SLT DE       D EA+         CLLGKLL  +      ++      W +  G++   +G NLF+F+F NE+D  RV +  PW F+  
Subjt:  LVDDWSRLSLTTDEEEISVVADHEAVDRTGLLLGFCLLGKLLCHRPMAAEVMRRNFRAAWKLDNGLLVDRLGRNLFIFRFNNEADRIRVIRQGPWTFEKF

Query:  LLVLVFPIRGIRPSDH-PFSSVCFWIHVFELPLDWFNRSMAERLGNALGIFEDVDSRNNFLFWGASLRINVRIDLNRPIRRGVRIYPDGPLSGLWVPIRY
        LLVL     G  P++   F+S CFW+ +  +PL +  +   ER+G A+GI E VD   + + WG  LR+ + +D+ +PI+RG R+   G    +W+  +Y
Subjt:  LLVLVFPIRGIRPSDH-PFSSVCFWIHVFELPLDWFNRSMAERLGNALGIFEDVDSRNNFLFWGASLRINVRIDLNRPIRRGVRIYPDGPLSGLWVPIRY

Query:  ERLPELCSHCGIIGHLSRDC-IRVLRMEREFSSSPQYGDWLR
        ERLP  C HCG +GH  R+C +++       S   QYG WLR
Subjt:  ERLPELCSHCGIIGHLSRDC-IRVLRMEREFSSSPQYGDWLR

A0A6J1BSZ1 uncharacterized protein LOC1110054811.7e-5842.17Show/hide
Query:  MDPSSLVDDWSRLSLTTDEEEISVVADHEAVDRTGLLLGFCLLGKLLCHRPMAAEVMRRNFRAAWKLD-NGLLVDRLGRNLFIFRFNNEADRIRVIRQGP
        M  S+L+++W    LT++E++I+V  D  A++ TG  L   L+ KLL  R ++  V++   + AWKLD     VD +G N+F+F FN  +DR R++R GP
Subjt:  MDPSSLVDDWSRLSLTTDEEEISVVADHEAVDRTGLLLGFCLLGKLLCHRPMAAEVMRRNFRAAWKLD-NGLLVDRLGRNLFIFRFNNEADRIRVIRQGP

Query:  WTFEKFLLVLVFPIRGIRPSDHPFSSVCFWIHVFELPLDWFNRSMAERLGNALGIFEDVDSRNNFLFWGASLRINVRIDLNRPIRRGVRIYPDGPLSGLW
        WTF++ L+++  P+   +P D  F +V  W+H F+L L   N++MA RLGNA+G+FEDV+S  N   WG+ LR+ VR D+ +P+ RG+++  DGP+ G W
Subjt:  WTFEKFLLVLVFPIRGIRPSDHPFSSVCFWIHVFELPLDWFNRSMAERLGNALGIFEDVDSRNNFLFWGASLRINVRIDLNRPIRRGVRIYPDGPLSGLW

Query:  VPIRYERLPELCSHCGIIGHLSRDCIRVLRMEREFSSSPQYGDWLRFSG
        +PI+YERLP+   HCG + H+ +DC          S + QYG WLRF G
Subjt:  VPIRYERLPELCSHCGIIGHLSRDCIRVLRMEREFSSSPQYGDWLRFSG

A0A6J1DU55 uncharacterized protein LOC1110231354.2e-6544.69Show/hide
Query:  MDPSSLVDDWSRLSLTTDEEEISVVADHEAVDRTGLLLGFCLLGKLLCHRPMAAEVMRRNFRAAWKLDNGLLVDRLGRNLFIFRFNNEADRIRVIRQGPW
        MD  +L+ DW +  LT++E+EI++  D +AV      L + L+GKLL  R ++A+V+ R    AWK+++ L V+ +G+NLF+F F  E D  RV++ GPW
Subjt:  MDPSSLVDDWSRLSLTTDEEEISVVADHEAVDRTGLLLGFCLLGKLLCHRPMAAEVMRRNFRAAWKLDNGLLVDRLGRNLFIFRFNNEADRIRVIRQGPW

Query:  TFEKFLLVLVFPIRGIRPSDHPFSSVCFWIHVFELPLDWFNRSMAERLGNALGIFEDVDSRNNFLFWGASLRINVRIDLNRPIRRGVRIYPDGPLSGLWV
         F+K L+VL  P      S+  F+ V FWIH+F+LP+ W N++MA RLGNA+G F DVD       WGASLRI V ID+ +P+RRG++I  DGP+ G W+
Subjt:  TFEKFLLVLVFPIRGIRPSDHPFSSVCFWIHVFELPLDWFNRSMAERLGNALGIFEDVDSRNNFLFWGASLRINVRIDLNRPIRRGVRIYPDGPLSGLWV

Query:  PIRYERLPELCSHCGIIGHLSRDC-IRVLRMEREFSSSPQYGDWLRFSGKDMFTASVAVRGENPNREQETSRS
        PI+YERLP+ C  CG+IGH S DC  R L  + +  ++ +YG WLRF G     A    +G++P RE     S
Subjt:  PIRYERLPELCSHCGIIGHLSRDC-IRVLRMEREFSSSPQYGDWLRFSGKDMFTASVAVRGENPNREQETSRS

A0A6J1DX30 uncharacterized protein LOC1110248747.6e-5134.76Show/hide
Query:  LVDDWSRLSLTTDEEEISVVADHEAVDRTGLLLGFCLLGKLLCHRPMAAEVMRRNFRAAWKLDNGLL-VDRLGRNLFIFRFNNEADRIRVIRQGPWTFEK
        L+++W    LT++EEE ++  D  A   TG  L   L+GKL   RP+   VM+   R AWKL+N    V  LG NLF+F F    DR ++ + GPWTF++
Subjt:  LVDDWSRLSLTTDEEEISVVADHEAVDRTGLLLGFCLLGKLLCHRPMAAEVMRRNFRAAWKLDNGLL-VDRLGRNLFIFRFNNEADRIRVIRQGPWTFEK

Query:  FLLVLVFPIRGIRPSDHPFSSVCFWIHVFELPLDWFNRSMAERLGNALGIFEDVDSRNNFLFWGASLRINVRIDLNRPIRRGVRIYPDGPLSGLWVPIRY
         L+++  P+  I PS+  F+ +  W+  F+LPL    R MA RLGNALG FE+ D  +    WG++LR+ V +D+++P+RRG+++  DGP+ G W+PI+Y
Subjt:  FLLVLVFPIRGIRPSDHPFSSVCFWIHVFELPLDWFNRSMAERLGNALGIFEDVDSRNNFLFWGASLRINVRIDLNRPIRRGVRIYPDGPLSGLWVPIRY

Query:  ERLPELCSHCGIIGHLSRDCIRVLRMEREFSSSPQYGDWLRFSGKDMFTASVAVRGENPNREQ------ETSRSHRLAIAVSP---------SLEPISPV
        ERLP+ C HCG+     +                QYG WLR+ G         V+   P  +Q      + S ++  + + SP         S     P+
Subjt:  ERLPELCSHCGIIGHLSRDCIRVLRMEREFSSSPQYGDWLRFSGKDMFTASVAVRGENPNREQ------ETSRSHRLAIAVSP---------SLEPISPV

Query:  ELPYRRPSGIRINEPAEGGRLQSQNAKS
         +P   P    + E  + G   SQ  KS
Subjt:  ELPYRRPSGIRINEPAEGGRLQSQNAKS

A0A6P6S3G2 uncharacterized protein LOC1136872932.9e-4236.23Show/hide
Query:  LSLTTDEEEISVVADHEAVDRTGLLLGFCLLGKLLCHRPMAAEVMRRNFRAAWKLDNGLLVDRLGRNLFIFRFNNEADRIRVIRQGPWTFEKFLLVLVFP
        L L  DEEE+ ++   +A D    L   C+LGKL   +    E +    +  W    GL    LG NLF+F+FN+  D+ +V   GPW F+  LLV+   
Subjt:  LSLTTDEEEISVVADHEAVDRTGLLLGFCLLGKLLCHRPMAAEVMRRNFRAAWKLDNGLLVDRLGRNLFIFRFNNEADRIRVIRQGPWTFEKFLLVLVFP

Query:  IRGIRPSDHPFSSVCFWIHVFELPLDWFNRSMAERLGNALGIFEDVDSRNNFLFWGASLRINVRIDLNRPIRRGVRIYPDGPLSGLWVPIRYERLPELCS
        I  ++ ++    +  FW+ V+ LPL W N   AE LGN LG++E  + R +   WG  LRI V+I L  P++R + ++ +G +    V  +YERLP LC 
Subjt:  IRGIRPSDHPFSSVCFWIHVFELPLDWFNRSMAERLGNALGIFEDVDSRNNFLFWGASLRINVRIDLNRPIRRGVRIYPDGPLSGLWVPIRYERLPELCS

Query:  HCGIIGHLSRDCIRVLRMEREFSSSPQYGDWLR------FSGKDMFTASVAVRGENPNR-EQETSRSHRLAIAVSP
        +CG IGH  RDC   L      +  PQYG WLR      FSG+     +V +  +N NR  Q T +  R   + SP
Subjt:  HCGIIGHLSRDCIRVLRMEREFSSSPQYGDWLR------FSGKDMFTASVAVRGENPNR-EQETSRSHRLAIAVSP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G01050.1 zinc ion binding;nucleic acid binding7.9e-1625.71Show/hide
Query:  DEEEISVVADHEAVDRTGLLLGFCLLGKLLCHRPMAAEVMRRNFRAAWKLDNGLLVDRLGRNLFIFRFNNEADRIRVIRQGPW-TFEKFLLVLVFPIRGI
        ++EE  +    E ++    L   C++ K+L  + +   V+ R  R  WK    + V  L R  F+ RF  E + +  +  GPW     +LLV  +  R  
Subjt:  DEEEISVVADHEAVDRTGLLLGFCLLGKLLCHRPMAAEVMRRNFRAAWKLDNGLLVDRLGRNLFIFRFNNEADRIRVIRQGPW-TFEKFLLVLVFPIRGI

Query:  RPSDHPFSSVCFWIHVFELPLDWFNRSMAERLGNALGIFEDVDSRNNFLFWGASLRINVRIDLNRPIRRGVRIYPDGPLSGLWVPIRYERLPELCSHCGI
         P      +   W+ +  +P ++++R +   +   LG    VD        G   R+ + ++L +P++  V I  D         + YE L ++CS CGI
Subjt:  RPSDHPFSSVCFWIHVFELPLDWFNRSMAERLGNALGIFEDVDSRNNFLFWGASLRINVRIDLNRPIRRGVRIYPDGPLSGLWVPIRYERLPELCSHCGI

Query:  IGHLSRDCIR
         GHL   C R
Subjt:  IGHLSRDCIR

AT3G31430.1 unknown protein1.8e-1226.6Show/hide
Query:  FCLLGKLLCHRPMAAEVMRRNFRAAWKLDNGLLVDRL--GRNLFIFRFNNEADRIRVIRQGPWTFEKFLLVLVFPIRGIRPSDHPFSSVCFWIHVFELPL
        F L G+ +  R      +  +    W   +GL+  R+  GR  F F F  E     V+R+GPW F  ++++L    +   P    F  + FW+ +  +P 
Subjt:  FCLLGKLLCHRPMAAEVMRRNFRAAWKLDNGLLVDRL--GRNLFIFRFNNEADRIRVIRQGPWTFEKFLLVLVFPIRGIRPSDHPFSSVCFWIHVFELPL

Query:  DWFNRSMAERLGNALGIFEDVDSRNNFLFWGASLRINVRIDLNRPIRRGVRIYPDGPLSGLWVPIRYERLPELCSHCGIIGHLSRDCI
         + NR + E +G ALG   D D     +      R+ +  D+  P+R          ++ L +  RYERL   C  CG++ H    C+
Subjt:  DWFNRSMAERLGNALGIFEDVDSRNNFLFWGASLRINVRIDLNRPIRRGVRIYPDGPLSGLWVPIRYERLPELCSHCGIIGHLSRDCI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATCCCTCATCCTTGGTTGATGATTGGTCTCGTTTGAGTTTAACTACGGATGAGGAAGAAATTTCTGTGGTCGCCGACCATGAAGCGGTGGATCGGACTGGTCTACT
TTTGGGTTTTTGTCTTCTGGGCAAATTGCTTTGCCACCGACCTATGGCTGCTGAAGTAATGAGAAGGAACTTTCGTGCGGCGTGGAAGCTTGACAACGGTCTGCTTGTTG
ATCGATTGGGGAGAAATTTATTCATTTTCCGTTTCAACAACGAGGCAGATCGAATTCGAGTCATTCGCCAAGGTCCTTGGACTTTTGAAAAATTTCTGTTGGTCCTGGTA
TTTCCAATTCGCGGAATTCGTCCGTCAGATCATCCTTTTTCATCGGTGTGTTTCTGGATTCATGTTTTCGAACTACCCCTGGATTGGTTCAATCGCAGTATGGCAGAACG
GTTAGGTAATGCTTTGGGAATTTTTGAGGATGTCGACAGCAGGAATAATTTTCTCTTTTGGGGAGCAAGTTTGCGGATCAACGTCCGGATTGATCTTAACAGGCCTATTC
GTCGTGGTGTCCGGATTTACCCGGATGGTCCTCTCAGTGGACTTTGGGTTCCGATAAGGTATGAGCGTTTGCCGGAGCTTTGTTCTCATTGTGGTATAATCGGCCATTTA
TCTCGTGACTGTATTCGTGTACTGAGGATGGAACGTGAATTTAGTTCTTCACCTCAGTACGGTGATTGGTTGAGGTTCTCTGGAAAAGATATGTTTACAGCATCGGTGGC
GGTGAGGGGTGAAAATCCAAATCGGGAGCAAGAAACATCTCGCAGCCATCGATTGGCAATTGCGGTTTCGCCGTCGTTGGAGCCAATTTCCCCGGTCGAATTACCTTACC
GTAGACCGTCTGGCATTCGAATCAATGAACCTGCTGAAGGAGGAAGGTTGCAGAGCCAAAATGCCAAATCCTCGAATCGCTATTCTCTGTCCGGTGAGGATTTGATAAAA
GCGAAAGGAAAGGCGAAGGTGGATGATTCGACTTCAAGGATCCATTGCGGTTTTTCTACAGCGGTGACGGATGGATGGCCGGAAAGTGCAAATTGGAGATCGGAATGTTC
CAGCCGGCGTGCGTTACAAGAGGCTGAGCCGTTGGGCTCTCTTTCATCAGACCCTCGAGACGTTACAGAGAAATTAAAGGAAGTTGGGGAGGCTTCGGGAAAAGATTCTG
GATTTTTAAAGCAGCCGTTTCACAACATGGCGGTTTTTGGGCCAGAAGAGTATAAAGAGAAGTATAAAGAGGAATTGAGGGCCAGTAATAGACGGTTGTCCAAAACTTTA
CTATCTATTTTTAATGCTGCCGAAAATCATGCAGTGGGTCACGAGAAGCCGCAAGCTGTGGAAACCAACAAAGAAGGAGACATGGAGAGTGATCCAGAAGAGATGGAGGA
GATGGGCTTTCAGTCTCATGAAGGAAGTAATTGGGCTCACCCAAAGGAGCAGACGGGCTTGCAAGTGGTTGCTGAAAATTACCAGCTAAATTCAATTCCAAATGAGAAGG
CTGAGAATGGAGGTGAAAATAGTGACGTAGCTAGTTTTGGTTTTTCTTCCAAGAAAATTCAATCAATGAAGAAAGGAGGAATGTGGAAAAAGCGAGCACGAGCGGGTTTT
GTTCCTTTTGGTGTTAATGTGGATGCCGGGGAAGAAGCTCAAAAGCGAAAGGATGGGCCGTTGTTGTTCTCTCCGGGGAATATTAAGCGTCCCAAAGTTGACGATGGTGG
ACAGGCGGGGACTGCTGAGCAGCCCCGCCAATCATTATGA
mRNA sequenceShow/hide mRNA sequence
ATGGATCCCTCATCCTTGGTTGATGATTGGTCTCGTTTGAGTTTAACTACGGATGAGGAAGAAATTTCTGTGGTCGCCGACCATGAAGCGGTGGATCGGACTGGTCTACT
TTTGGGTTTTTGTCTTCTGGGCAAATTGCTTTGCCACCGACCTATGGCTGCTGAAGTAATGAGAAGGAACTTTCGTGCGGCGTGGAAGCTTGACAACGGTCTGCTTGTTG
ATCGATTGGGGAGAAATTTATTCATTTTCCGTTTCAACAACGAGGCAGATCGAATTCGAGTCATTCGCCAAGGTCCTTGGACTTTTGAAAAATTTCTGTTGGTCCTGGTA
TTTCCAATTCGCGGAATTCGTCCGTCAGATCATCCTTTTTCATCGGTGTGTTTCTGGATTCATGTTTTCGAACTACCCCTGGATTGGTTCAATCGCAGTATGGCAGAACG
GTTAGGTAATGCTTTGGGAATTTTTGAGGATGTCGACAGCAGGAATAATTTTCTCTTTTGGGGAGCAAGTTTGCGGATCAACGTCCGGATTGATCTTAACAGGCCTATTC
GTCGTGGTGTCCGGATTTACCCGGATGGTCCTCTCAGTGGACTTTGGGTTCCGATAAGGTATGAGCGTTTGCCGGAGCTTTGTTCTCATTGTGGTATAATCGGCCATTTA
TCTCGTGACTGTATTCGTGTACTGAGGATGGAACGTGAATTTAGTTCTTCACCTCAGTACGGTGATTGGTTGAGGTTCTCTGGAAAAGATATGTTTACAGCATCGGTGGC
GGTGAGGGGTGAAAATCCAAATCGGGAGCAAGAAACATCTCGCAGCCATCGATTGGCAATTGCGGTTTCGCCGTCGTTGGAGCCAATTTCCCCGGTCGAATTACCTTACC
GTAGACCGTCTGGCATTCGAATCAATGAACCTGCTGAAGGAGGAAGGTTGCAGAGCCAAAATGCCAAATCCTCGAATCGCTATTCTCTGTCCGGTGAGGATTTGATAAAA
GCGAAAGGAAAGGCGAAGGTGGATGATTCGACTTCAAGGATCCATTGCGGTTTTTCTACAGCGGTGACGGATGGATGGCCGGAAAGTGCAAATTGGAGATCGGAATGTTC
CAGCCGGCGTGCGTTACAAGAGGCTGAGCCGTTGGGCTCTCTTTCATCAGACCCTCGAGACGTTACAGAGAAATTAAAGGAAGTTGGGGAGGCTTCGGGAAAAGATTCTG
GATTTTTAAAGCAGCCGTTTCACAACATGGCGGTTTTTGGGCCAGAAGAGTATAAAGAGAAGTATAAAGAGGAATTGAGGGCCAGTAATAGACGGTTGTCCAAAACTTTA
CTATCTATTTTTAATGCTGCCGAAAATCATGCAGTGGGTCACGAGAAGCCGCAAGCTGTGGAAACCAACAAAGAAGGAGACATGGAGAGTGATCCAGAAGAGATGGAGGA
GATGGGCTTTCAGTCTCATGAAGGAAGTAATTGGGCTCACCCAAAGGAGCAGACGGGCTTGCAAGTGGTTGCTGAAAATTACCAGCTAAATTCAATTCCAAATGAGAAGG
CTGAGAATGGAGGTGAAAATAGTGACGTAGCTAGTTTTGGTTTTTCTTCCAAGAAAATTCAATCAATGAAGAAAGGAGGAATGTGGAAAAAGCGAGCACGAGCGGGTTTT
GTTCCTTTTGGTGTTAATGTGGATGCCGGGGAAGAAGCTCAAAAGCGAAAGGATGGGCCGTTGTTGTTCTCTCCGGGGAATATTAAGCGTCCCAAAGTTGACGATGGTGG
ACAGGCGGGGACTGCTGAGCAGCCCCGCCAATCATTATGA
Protein sequenceShow/hide protein sequence
MDPSSLVDDWSRLSLTTDEEEISVVADHEAVDRTGLLLGFCLLGKLLCHRPMAAEVMRRNFRAAWKLDNGLLVDRLGRNLFIFRFNNEADRIRVIRQGPWTFEKFLLVLV
FPIRGIRPSDHPFSSVCFWIHVFELPLDWFNRSMAERLGNALGIFEDVDSRNNFLFWGASLRINVRIDLNRPIRRGVRIYPDGPLSGLWVPIRYERLPELCSHCGIIGHL
SRDCIRVLRMEREFSSSPQYGDWLRFSGKDMFTASVAVRGENPNREQETSRSHRLAIAVSPSLEPISPVELPYRRPSGIRINEPAEGGRLQSQNAKSSNRYSLSGEDLIK
AKGKAKVDDSTSRIHCGFSTAVTDGWPESANWRSECSSRRALQEAEPLGSLSSDPRDVTEKLKEVGEASGKDSGFLKQPFHNMAVFGPEEYKEKYKEELRASNRRLSKTL
LSIFNAAENHAVGHEKPQAVETNKEGDMESDPEEMEEMGFQSHEGSNWAHPKEQTGLQVVAENYQLNSIPNEKAENGGENSDVASFGFSSKKIQSMKKGGMWKKRARAGF
VPFGVNVDAGEEAQKRKDGPLLFSPGNIKRPKVDDGGQAGTAEQPRQSL