; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0008161 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0008161
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionCCHC-type domain-containing protein
Genome locationchr9:13578311..13580023
RNA-Seq ExpressionLag0008161
SyntenyLag0008161
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR040256 - Uncharacterized protein At4g02000-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
OMP03234.1 hypothetical protein COLO4_10561 [Corchorus olitorius]7.0e-4036.62Show/hide
Query:  LVCKVATNRIINTDAFRRVMCKAWNFCRDLTIDNVGDNIFLVNPQSAQGRDRIIEESPWCFDRCLILLATPSTFDQPEEIVFDTTTYWVQFHNVPIGLRN
        L+ K+ + R +N D  R V+   W     L +  +G+ +++   +S   ++R+ ++ PW F++ L++L     F   EEI  +   +W+Q H++P+G   
Subjt:  LVCKVATNRIINTDAFRRVMCKAWNFCRDLTIDNVGDNIFLVNPQSAQGRDRIIEESPWCFDRCLILLATPSTFDQPEEIVFDTTTYWVQFHNVPIGLRN

Query:  EKVARILGGRIGEVLRIDTNEVDECCGRFLRVRVKLQFNEPLCRGLTLTGGPTGKVLVGIKYERLPDFCYICGMLGHIDGSCPE----NKKSEKPKKQFG
        E V + +G   GEV+ IDT       G+FLR+R  L  N+PL RG+ LT    GK+LV  +YE+LPDFCY+CG L H +  C +     + S K KK++G
Subjt:  EKVARILGGRIGEVLRIDTNEVDECCGRFLRVRVKLQFNEPLCRGLTLTGGPTGKVLVGIKYERLPDFCYICGMLGHIDGSCPE----NKKSEKPKKQFG

Query:  SWLRAETYHMPRS
         WLRAE   +PRS
Subjt:  SWLRAETYHMPRS

TXG54301.1 hypothetical protein EZV62_019557 [Acer yangbiense]1.2e-3633.57Show/hide
Query:  NEELGIQGNSLNMENRLVCKVATNRIINTDAFRRVMCKAWNFCRDLTIDNVGDNIFLVNPQSAQGRDRIIEESPWCFDRCLILLATPSTFDQPEEIVFDT
        +EE+G  G   ++ + LV K+ + R +N +AFR V+   W+   ++ I+ V +N+FL      + R+++ +  PW FD+CL++L  P    +  ++ F+ 
Subjt:  NEELGIQGNSLNMENRLVCKVATNRIINTDAFRRVMCKAWNFCRDLTIDNVGDNIFLVNPQSAQGRDRIIEESPWCFDRCLILLATPSTFDQPEEIVFDT

Query:  TTYWVQFHNVPIGLRNEKVARILGGRIGEVLRIDTNEVDECCGRFLRVRVKLQFNEPLCRGLTLTGGPTGK-VLVGIKYERLPDFCYICGMLGHIDGSCP
          +W+Q H++PI   N + A+ L  +IGEV+ I T+  D C G+FLRV+V++  ++PL R L L    +G  V+VG+KYERLP+FCY CG LGH    CP
Subjt:  TTYWVQFHNVPIGLRNEKVARILGGRIGEVLRIDTNEVDECCGRFLRVRVKLQFNEPLCRGLTLTGGPTGK-VLVGIKYERLPDFCYICGMLGHIDGSCP

Query:  ENKKSEKPKK----QFGSWLRA---ETYHMPRSPFTG-------GRSYDSGRRGRGR----FRYRPMGRQEWGSDES
        + +  ++  +    +FG W+RA       +   P TG       G ++ + R G+ R    FR++     E G D +
Subjt:  ENKKSEKPKK----QFGSWLRA---ETYHMPRSPFTG-------GRSYDSGRRGRGR----FRYRPMGRQEWGSDES

VVA32948.1 PREDICTED: DUF4283 domain-containing [Prunus dulcis]2.1e-3632.52Show/hide
Query:  VEKLNSLKLSQKESCGISNEELGIQGNSLNMENRLVCKVATNRIINTDAFRRVMCKAWNFCRDLTIDNVGDNIFLVNPQSAQGRDRIIEESPWCFDRCLI
        +EK   L   ++++  +   ++   G  L M   LV KV T    N +AF++ M K W+  R++ + ++G+N+FL    +   R R++   PW FD+ L+
Subjt:  VEKLNSLKLSQKESCGISNEELGIQGNSLNMENRLVCKVATNRIINTDAFRRVMCKAWNFCRDLTIDNVGDNIFLVNPQSAQGRDRIIEESPWCFDRCLI

Query:  LLATPSTFDQPEEIVFDTTTYWVQFHNVPIGLRNEKVARILGGRIGEVLRIDTNEVDECCGRFLRVRVKLQFNEPLCRGLTLTGGPTGKVLVGIKYERLP
        LL TP     P +++     +W+Q HNVP+   +  + R +G   G  L +      +C GRFLR+RV +  ++PL RG  +T      V V  +YERLP
Subjt:  LLATPSTFDQPEEIVFDTTTYWVQFHNVPIGLRNEKVARILGGRIGEVLRIDTNEVDECCGRFLRVRVKLQFNEPLCRGLTLTGGPTGKVLVGIKYERLP

Query:  DFCYICGMLGHIDGSCP--ENKKSEKPKKQFGSWLRA-ETYHMPRS
        +FC+ CG LGH+   C   +    +  +K +GSWL+A + +H  R+
Subjt:  DFCYICGMLGHIDGSCP--ENKKSEKPKKQFGSWLRA-ETYHMPRS

XP_006485824.1 uncharacterized protein LOC102613298 [Citrus sinensis]7.2e-3732.5Show/hide
Query:  DIVEKLNSLKLSQKESCGIS-NEELGIQGNSLNMENRLVCKVATNRIINTDAFRRVMCKAWNFCRDLTIDNVGDNIFLVNPQSAQGRDRIIEESPWCFDR
        +++ K  ++ L  +E   IS    + ++G  L + + LV K+   R +  +  R  M +AW   ++  ++++GDNIFL    S   + RI+   PW FDR
Subjt:  DIVEKLNSLKLSQKESCGIS-NEELGIQGNSLNMENRLVCKVATNRIINTDAFRRVMCKAWNFCRDLTIDNVGDNIFLVNPQSAQGRDRIIEESPWCFDR

Query:  CLILLATPSTFDQPEEIVFDTTTYWVQFHNVPIGLRNEKVARILGGRIGEVLRIDTNEVDECCGRFLRVRVKLQFNEPLCRGLTLTGGPTGKVLVGIKYE
         L+++  P      ++  F    +WVQ HN+PI   ++++ + +GGRIG+V  ++T++  EC G F+RVR+ +    PL + L L     G++ + I YE
Subjt:  CLILLATPSTFDQPEEIVFDTTTYWVQFHNVPIGLRNEKVARILGGRIGEVLRIDTNEVDECCGRFLRVRVKLQFNEPLCRGLTLTGGPTGKVLVGIKYE

Query:  RLPDFCYICGMLGHIDGSCPENKKSEKPKKQFGSWLRAET
        +LPDFC+ CG++GH    C + K   K    +G+W+RA+T
Subjt:  RLPDFCYICGMLGHIDGSCPENKKSEKPKKQFGSWLRAET

XP_024035600.1 uncharacterized protein LOC112096408 [Citrus clementina]5.5e-3732.5Show/hide
Query:  DIVEKLNSLKLSQKESCGIS-NEELGIQGNSLNMENRLVCKVATNRIINTDAFRRVMCKAWNFCRDLTIDNVGDNIFLVNPQSAQGRDRIIEESPWCFDR
        +++ K  ++ L  +E   IS    + ++G  L + + LV K+   R +  +  R  M +AW   ++  ++++GDNIFL    S   + RI+   PW FDR
Subjt:  DIVEKLNSLKLSQKESCGIS-NEELGIQGNSLNMENRLVCKVATNRIINTDAFRRVMCKAWNFCRDLTIDNVGDNIFLVNPQSAQGRDRIIEESPWCFDR

Query:  CLILLATPSTFDQPEEIVFDTTTYWVQFHNVPIGLRNEKVARILGGRIGEVLRIDTNEVDECCGRFLRVRVKLQFNEPLCRGLTLTGGPTGKVLVGIKYE
         L+++  P      ++  F    +WVQ HN+PI   ++++ + +GGRIG+V  ++T++  EC G F+RVR+ +    PL + L L     G++ + I YE
Subjt:  CLILLATPSTFDQPEEIVFDTTTYWVQFHNVPIGLRNEKVARILGGRIGEVLRIDTNEVDECCGRFLRVRVKLQFNEPLCRGLTLTGGPTGKVLVGIKYE

Query:  RLPDFCYICGMLGHIDGSCPENKKSEKPKKQFGSWLRAET
        +LPDFC+ CG++GH    C + K   K    +G+W+RA+T
Subjt:  RLPDFCYICGMLGHIDGSCPENKKSEKPKKQFGSWLRAET

TrEMBL top hitse value%identityAlignment
A0A1R3GTB5 Reverse transcriptase5.0e-3627.79Show/hide
Query:  DIVEKLNSLKLSQKESCGISNEELGIQGNSLNMENRLVCKVATNRIINTDAFRRVMCKAWNFCRDLTIDNVGDNIFLVNPQSAQGRDRIIEESPWCFDRC
        D+ E  N   L+++E+  ++ +   +        N L+ K+ + R +N +  R VM   W     L +  +G+N+F+   +S   ++R+ +++PW F++ 
Subjt:  DIVEKLNSLKLSQKESCGISNEELGIQGNSLNMENRLVCKVATNRIINTDAFRRVMCKAWNFCRDLTIDNVGDNIFLVNPQSAQGRDRIIEESPWCFDRC

Query:  LILLATPSTFDQPEEIVFDTTTYWVQFHNVPIGLRNEKVARILGGRIGEVLRIDTNEVDECCGRFLRVRVKLQFNEPLCRGLTLTGGPTGKVLVGIKYER
        L++L +   +D  E+I  D  ++W Q H++P+G  NE + R++G   G V  IDT       G+FLR R +L   +PL RG+ LT    GK+L+  +YE+
Subjt:  LILLATPSTFDQPEEIVFDTTTYWVQFHNVPIGLRNEKVARILGGRIGEVLRIDTNEVDECCGRFLRVRVKLQFNEPLCRGLTLTGGPTGKVLVGIKYER

Query:  LPDFCYICGMLGHIDGSCPENKKSEKPK----KQFGSWLRAETYHMPRSPFTG--------GRSYD--------------SGRRGRGRFRYRPMGRQEWG
        LPDFCY+CG L H++  C +     + K    K++G WLRAE        F G        GR  +               GR+G+   + +     E  
Subjt:  LPDFCYICGMLGHIDGSCPENKKSEKPK----KQFGSWLRAETYHMPRSPFTG--------GRSYD--------------SGRRGRGRFRYRPMGRQEWG

Query:  SDESEDEENEASTPERTGRSDAQNMLAEEDQNRPEKDENQRSPESQRRK
        SD  +D +  ++T +   +   ++ + +  Q   EK     SP   ++K
Subjt:  SDESEDEENEASTPERTGRSDAQNMLAEEDQNRPEKDENQRSPESQRRK

A0A1R3K847 Uncharacterized protein3.4e-4036.62Show/hide
Query:  LVCKVATNRIINTDAFRRVMCKAWNFCRDLTIDNVGDNIFLVNPQSAQGRDRIIEESPWCFDRCLILLATPSTFDQPEEIVFDTTTYWVQFHNVPIGLRN
        L+ K+ + R +N D  R V+   W     L +  +G+ +++   +S   ++R+ ++ PW F++ L++L     F   EEI  +   +W+Q H++P+G   
Subjt:  LVCKVATNRIINTDAFRRVMCKAWNFCRDLTIDNVGDNIFLVNPQSAQGRDRIIEESPWCFDRCLILLATPSTFDQPEEIVFDTTTYWVQFHNVPIGLRN

Query:  EKVARILGGRIGEVLRIDTNEVDECCGRFLRVRVKLQFNEPLCRGLTLTGGPTGKVLVGIKYERLPDFCYICGMLGHIDGSCPE----NKKSEKPKKQFG
        E V + +G   GEV+ IDT       G+FLR+R  L  N+PL RG+ LT    GK+LV  +YE+LPDFCY+CG L H +  C +     + S K KK++G
Subjt:  EKVARILGGRIGEVLRIDTNEVDECCGRFLRVRVKLQFNEPLCRGLTLTGGPTGKVLVGIKYERLPDFCYICGMLGHIDGSCPE----NKKSEKPKKQFG

Query:  SWLRAETYHMPRS
         WLRAE   +PRS
Subjt:  SWLRAETYHMPRS

A0A5C7H9Y2 CCHC-type domain-containing protein5.0e-3633.87Show/hide
Query:  DIVEKLNSLKLSQK----ESCGISNEELGIQGNSLNMENRLVCKVATNRIINTDAFRRVMCKAWNFCRDLTIDNVGDNIFLVNPQSAQGRDRIIEESPWC
        DI  K   L L       E    S +E G Q  SL+    L+ K  TN++IN +AF+  +   W    ++T++ +G NIF    Q+   R RI+E  PW 
Subjt:  DIVEKLNSLKLSQK----ESCGISNEELGIQGNSLNMENRLVCKVATNRIINTDAFRRVMCKAWNFCRDLTIDNVGDNIFLVNPQSAQGRDRIIEESPWC

Query:  FDRCLILLATPSTFDQPEEIVFDTTTYWVQFHNVPIGLRNEKVARILGGRIGEVLRIDTNEVDECCGRFLRVRVKLQFNEPLCRGLTLTGGPTGKV-LVG
        FD+ L++L   S  ++  ++ F    +W+Q HN+P+   N ++   LGG +G+V  ID  E  EC G+F+R+RV +    PL RGL +  G   KV  V 
Subjt:  FDRCLILLATPSTFDQPEEIVFDTTTYWVQFHNVPIGLRNEKVARILGGRIGEVLRIDTNEVDECCGRFLRVRVKLQFNEPLCRGLTLTGGPTGKV-LVG

Query:  IKYERLPDFCYICGMLGHIDGSCPENKK--SEKPKKQFGSWLRAETYHMPRSPFTGGRSYDSGRRGRGRFRYRPMGRQEWG-SDESEDEENEASTPERTG
        I YERLP+FCY CG +GH+   CP N K  +     +FG W+RA +               +  +G G  +  P G +E G SD  E+   + ST    G
Subjt:  IKYERLPDFCYICGMLGHIDGSCPENKK--SEKPKKQFGSWLRAETYHMPRSPFTGGRSYDSGRRGRGRFRYRPMGRQEWG-SDESEDEENEASTPERTG

Query:  RSDAQNMLAEEDQ
        + D+  +L + +Q
Subjt:  RSDAQNMLAEEDQ

A0A5C7HBM7 CCHC-type domain-containing protein6.0e-3733.57Show/hide
Query:  NEELGIQGNSLNMENRLVCKVATNRIINTDAFRRVMCKAWNFCRDLTIDNVGDNIFLVNPQSAQGRDRIIEESPWCFDRCLILLATPSTFDQPEEIVFDT
        +EE+G  G   ++ + LV K+ + R +N +AFR V+   W+   ++ I+ V +N+FL      + R+++ +  PW FD+CL++L  P    +  ++ F+ 
Subjt:  NEELGIQGNSLNMENRLVCKVATNRIINTDAFRRVMCKAWNFCRDLTIDNVGDNIFLVNPQSAQGRDRIIEESPWCFDRCLILLATPSTFDQPEEIVFDT

Query:  TTYWVQFHNVPIGLRNEKVARILGGRIGEVLRIDTNEVDECCGRFLRVRVKLQFNEPLCRGLTLTGGPTGK-VLVGIKYERLPDFCYICGMLGHIDGSCP
          +W+Q H++PI   N + A+ L  +IGEV+ I T+  D C G+FLRV+V++  ++PL R L L    +G  V+VG+KYERLP+FCY CG LGH    CP
Subjt:  TTYWVQFHNVPIGLRNEKVARILGGRIGEVLRIDTNEVDECCGRFLRVRVKLQFNEPLCRGLTLTGGPTGK-VLVGIKYERLPDFCYICGMLGHIDGSCP

Query:  ENKKSEKPKK----QFGSWLRA---ETYHMPRSPFTG-------GRSYDSGRRGRGR----FRYRPMGRQEWGSDES
        + +  ++  +    +FG W+RA       +   P TG       G ++ + R G+ R    FR++     E G D +
Subjt:  ENKKSEKPKK----QFGSWLRA---ETYHMPRSPFTG-------GRSYDSGRRGRGR----FRYRPMGRQEWGSDES

A0A5E4G034 PREDICTED: DUF4283 domain-containing1.0e-3632.52Show/hide
Query:  VEKLNSLKLSQKESCGISNEELGIQGNSLNMENRLVCKVATNRIINTDAFRRVMCKAWNFCRDLTIDNVGDNIFLVNPQSAQGRDRIIEESPWCFDRCLI
        +EK   L   ++++  +   ++   G  L M   LV KV T    N +AF++ M K W+  R++ + ++G+N+FL    +   R R++   PW FD+ L+
Subjt:  VEKLNSLKLSQKESCGISNEELGIQGNSLNMENRLVCKVATNRIINTDAFRRVMCKAWNFCRDLTIDNVGDNIFLVNPQSAQGRDRIIEESPWCFDRCLI

Query:  LLATPSTFDQPEEIVFDTTTYWVQFHNVPIGLRNEKVARILGGRIGEVLRIDTNEVDECCGRFLRVRVKLQFNEPLCRGLTLTGGPTGKVLVGIKYERLP
        LL TP     P +++     +W+Q HNVP+   +  + R +G   G  L +      +C GRFLR+RV +  ++PL RG  +T      V V  +YERLP
Subjt:  LLATPSTFDQPEEIVFDTTTYWVQFHNVPIGLRNEKVARILGGRIGEVLRIDTNEVDECCGRFLRVRVKLQFNEPLCRGLTLTGGPTGKVLVGIKYERLP

Query:  DFCYICGMLGHIDGSCP--ENKKSEKPKKQFGSWLRA-ETYHMPRS
        +FC+ CG LGH+   C   +    +  +K +GSWL+A + +H  R+
Subjt:  DFCYICGMLGHIDGSCP--ENKKSEKPKKQFGSWLRA-ETYHMPRS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G01050.1 zinc ion binding;nucleic acid binding2.3e-1224.43Show/hide
Query:  INTDAFRRVMCKAWNFCRDLTIDNVGDNIFLVNPQSAQGRDRIIEESPW-CFDRCLILLATPSTFDQPEEIVFDTTTYWVQFHNVPIGLRNEKVARILGG
        I      R + + W     +T+ ++    F++  +  +     +   PW      L++    S FD   + +  TT  WV+  N+P    +  +   +  
Subjt:  INTDAFRRVMCKAWNFCRDLTIDNVGDNIFLVNPQSAQGRDRIIEESPW-CFDRCLILLATPSTFDQPEEIVFDTTTYWVQFHNVPIGLRNEKVARILGG

Query:  RIGEVLRIDTNEVDECCGRFLRVRVKLQFNEPLCRGLTLTGGPTGKVLVGIKYERLPDFCYICGMLGHIDGSCPEN
         +G  L++D N ++   GRF RV +++   +PL +G  L  G        + YE L   C  CG+ GH+  SCP N
Subjt:  RIGEVLRIDTNEVDECCGRFLRVRVKLQFNEPLCRGLTLTGGPTGKVLVGIKYERLPDFCYICGMLGHIDGSCPEN

AT3G31430.1 unknown protein4.5e-1329.75Show/hide
Query:  DRIIEESPWCFDRCLILLATPSTFDQPEEIVFDTTTYWVQFHNVPIGLRNEKVARILGGRIGEVLRIDTNEVDECCGRFLRVRVKLQFNEPL--CRGLTL
        + ++   PW F+  +ILL       +P+  +F    +WVQ   +P    N  V   +G  +G+VL  D N        F RV +      PL   R    
Subjt:  DRIIEESPWCFDRCLILLATPSTFDQPEEIVFDTTTYWVQFHNVPIGLRNEKVARILGGRIGEVLRIDTNEVDECCGRFLRVRVKLQFNEPL--CRGLTL

Query:  TGGPTGKVLVGIKYERLPDFCYICGMLGHIDGSC-PENKKSEKPKKQFGSWLRAETYH
        T G     L+  +YERL  FC +CGML H  G+C  +N   E+           +TYH
Subjt:  TGGPTGKVLVGIKYERLPDFCYICGMLGHIDGSC-PENKKSEKPKKQFGSWLRAETYH

AT5G36228.1 nucleic acid binding;zinc ion binding7.7e-1327.69Show/hide
Query:  IEESPWCFDRCLILLATPSTFDQPEEIVFDTTTYWVQFHNVPIGLRNEKVARILGGRIGEVLRIDTNEVDECCGRFLRVRVKLQFNEPLCRGLTLTGGPT
        +  +PW F+   I L     F  P E        WV    +P+   +E+   I+   +GEV+ +D NE       F+RV+V++ F EPL     +     
Subjt:  IEESPWCFDRCLILLATPSTFDQPEEIVFDTTTYWVQFHNVPIGLRNEKVARILGGRIGEVLRIDTNEVDECCGRFLRVRVKLQFNEPLCRGLTLTGGPT

Query:  GKVLVGIKYERLPDFCYICGMLGHIDGSCP
         + ++G +YE+L   C  C  + H    CP
Subjt:  GKVLVGIKYERLPDFCYICGMLGHIDGSCP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATATTGTTGAGAAACTAAACTCACTCAAGCTCTCCCAAAAGGAGTCTTGTGGGATTAGTAATGAAGAACTAGGCATTCAAGGGAATAGTTTGAATATGGAAAATCG
TTTGGTTTGTAAAGTTGCTACGAACCGGATTATCAACACAGATGCGTTCCGAAGAGTAATGTGCAAGGCCTGGAATTTTTGTCGAGATTTAACGATCGATAACGTTGGTG
ATAATATCTTCCTTGTTAATCCGCAATCCGCTCAAGGGCGTGACAGAATCATCGAGGAAAGTCCTTGGTGTTTTGATAGATGCTTAATTCTCCTTGCAACACCATCAACA
TTTGACCAACCAGAAGAGATTGTGTTCGATACGACAACATATTGGGTGCAATTCCACAATGTTCCTATTGGTCTGAGAAACGAAAAGGTGGCCCGAATCTTAGGTGGTCG
GATAGGCGAGGTCCTAAGAATTGATACCAACGAAGTTGACGAATGTTGCGGTCGTTTCCTCCGTGTGAGAGTCAAATTACAATTCAATGAACCACTTTGTCGTGGATTAA
CTTTAACCGGCGGCCCAACAGGTAAAGTTCTAGTTGGAATAAAATATGAACGCTTACCCGACTTCTGCTATATTTGTGGAATGCTAGGGCACATCGATGGATCCTGTCCG
GAGAACAAAAAATCGGAGAAACCGAAAAAACAGTTTGGGTCTTGGCTTCGAGCAGAAACATACCATATGCCAAGAAGCCCATTCACCGGCGGCCGTTCTTACGATTCAGG
ACGAAGAGGACGTGGACGATTCAGGTATCGTCCAATGGGGAGGCAAGAATGGGGATCTGATGAATCTGAGGATGAAGAAAATGAAGCATCCACACCGGAAAGAACAGGTC
GTTCCGATGCCCAGAACATGCTCGCTGAAGAAGACCAAAACCGGCCGGAAAAAGATGAAAACCAGAGGTCACCGGAATCCCAGCGACGGAAATGGAATGAAACCGACCAT
CAACCACAATCAATGTCGCCTGCGAAATCTATGGAAACTGAATCACCCAAAATCGGTACAAAATCATTTAATGCAGGTTTATTTCCAAAAACGACCGAACGTGATGCTGA
AGGGAAGAAATCGTTCTTCAAGGAAGCCAAATCCCACGCTCCATTAATGGTTGGAGATTTATTCCCAAATATAGAGGAGAGAGAAGAGAACCCACACAAGAGGAATCTAA
CGGATCAAATTGAAGGGAGGATGCATCAACAAGAAATTGGGCCTAGCAACCCTTTGGGCCAAATGGAAGATGAAGCCCACAAACAGAACACTTATTACATTGGGCCCTCC
AACGATTTTTACAACATGGATGTGGAAATTAGCAACCCCACGCCTAATACATTACAAAATAAATTTGAGGCAGTGGATGGGCAATTTTTAAATATCCGTCCAATGGATCC
CAACCAAAGCAAGACTGGAGAATTACCCATTGCTGAACAAGCAGTTGATCCAAACGTGGGGAGGAAAATTAAAAAGTTGTGGAAGAGAGTGAACAGAACGGGAGGTCAAA
GTGTTACTAAAAATTCCACCTCACAATCTCTCACGAGCTCTATTGGAAAGAAAAGAGATGCAGATGAACCATATTTAGAAGACAATGAAGGAAAGAAAATCAAATGTACC
AAACCAGAGATTGACACGATTATATCGGCGGAGCCTGTTAAACAGGCCCGCCGAGAGCCATGA
mRNA sequenceShow/hide mRNA sequence
ATGGATATTGTTGAGAAACTAAACTCACTCAAGCTCTCCCAAAAGGAGTCTTGTGGGATTAGTAATGAAGAACTAGGCATTCAAGGGAATAGTTTGAATATGGAAAATCG
TTTGGTTTGTAAAGTTGCTACGAACCGGATTATCAACACAGATGCGTTCCGAAGAGTAATGTGCAAGGCCTGGAATTTTTGTCGAGATTTAACGATCGATAACGTTGGTG
ATAATATCTTCCTTGTTAATCCGCAATCCGCTCAAGGGCGTGACAGAATCATCGAGGAAAGTCCTTGGTGTTTTGATAGATGCTTAATTCTCCTTGCAACACCATCAACA
TTTGACCAACCAGAAGAGATTGTGTTCGATACGACAACATATTGGGTGCAATTCCACAATGTTCCTATTGGTCTGAGAAACGAAAAGGTGGCCCGAATCTTAGGTGGTCG
GATAGGCGAGGTCCTAAGAATTGATACCAACGAAGTTGACGAATGTTGCGGTCGTTTCCTCCGTGTGAGAGTCAAATTACAATTCAATGAACCACTTTGTCGTGGATTAA
CTTTAACCGGCGGCCCAACAGGTAAAGTTCTAGTTGGAATAAAATATGAACGCTTACCCGACTTCTGCTATATTTGTGGAATGCTAGGGCACATCGATGGATCCTGTCCG
GAGAACAAAAAATCGGAGAAACCGAAAAAACAGTTTGGGTCTTGGCTTCGAGCAGAAACATACCATATGCCAAGAAGCCCATTCACCGGCGGCCGTTCTTACGATTCAGG
ACGAAGAGGACGTGGACGATTCAGGTATCGTCCAATGGGGAGGCAAGAATGGGGATCTGATGAATCTGAGGATGAAGAAAATGAAGCATCCACACCGGAAAGAACAGGTC
GTTCCGATGCCCAGAACATGCTCGCTGAAGAAGACCAAAACCGGCCGGAAAAAGATGAAAACCAGAGGTCACCGGAATCCCAGCGACGGAAATGGAATGAAACCGACCAT
CAACCACAATCAATGTCGCCTGCGAAATCTATGGAAACTGAATCACCCAAAATCGGTACAAAATCATTTAATGCAGGTTTATTTCCAAAAACGACCGAACGTGATGCTGA
AGGGAAGAAATCGTTCTTCAAGGAAGCCAAATCCCACGCTCCATTAATGGTTGGAGATTTATTCCCAAATATAGAGGAGAGAGAAGAGAACCCACACAAGAGGAATCTAA
CGGATCAAATTGAAGGGAGGATGCATCAACAAGAAATTGGGCCTAGCAACCCTTTGGGCCAAATGGAAGATGAAGCCCACAAACAGAACACTTATTACATTGGGCCCTCC
AACGATTTTTACAACATGGATGTGGAAATTAGCAACCCCACGCCTAATACATTACAAAATAAATTTGAGGCAGTGGATGGGCAATTTTTAAATATCCGTCCAATGGATCC
CAACCAAAGCAAGACTGGAGAATTACCCATTGCTGAACAAGCAGTTGATCCAAACGTGGGGAGGAAAATTAAAAAGTTGTGGAAGAGAGTGAACAGAACGGGAGGTCAAA
GTGTTACTAAAAATTCCACCTCACAATCTCTCACGAGCTCTATTGGAAAGAAAAGAGATGCAGATGAACCATATTTAGAAGACAATGAAGGAAAGAAAATCAAATGTACC
AAACCAGAGATTGACACGATTATATCGGCGGAGCCTGTTAAACAGGCCCGCCGAGAGCCATGA
Protein sequenceShow/hide protein sequence
MDIVEKLNSLKLSQKESCGISNEELGIQGNSLNMENRLVCKVATNRIINTDAFRRVMCKAWNFCRDLTIDNVGDNIFLVNPQSAQGRDRIIEESPWCFDRCLILLATPST
FDQPEEIVFDTTTYWVQFHNVPIGLRNEKVARILGGRIGEVLRIDTNEVDECCGRFLRVRVKLQFNEPLCRGLTLTGGPTGKVLVGIKYERLPDFCYICGMLGHIDGSCP
ENKKSEKPKKQFGSWLRAETYHMPRSPFTGGRSYDSGRRGRGRFRYRPMGRQEWGSDESEDEENEASTPERTGRSDAQNMLAEEDQNRPEKDENQRSPESQRRKWNETDH
QPQSMSPAKSMETESPKIGTKSFNAGLFPKTTERDAEGKKSFFKEAKSHAPLMVGDLFPNIEEREENPHKRNLTDQIEGRMHQQEIGPSNPLGQMEDEAHKQNTYYIGPS
NDFYNMDVEISNPTPNTLQNKFEAVDGQFLNIRPMDPNQSKTGELPIAEQAVDPNVGRKIKKLWKRVNRTGGQSVTKNSTSQSLTSSIGKKRDADEPYLEDNEGKKIKCT
KPEIDTIISAEPVKQARREP