; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0002697 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0002697
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionCCHC-type domain-containing protein
Genome locationchr4:44916273..44917504
RNA-Seq ExpressionLag0002697
SyntenyLag0002697
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR036875 - Zinc finger, CCHC-type superfamily
IPR040256 - Uncharacterized protein At4g02000-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TXG69259.1 hypothetical protein EZV62_004194 [Acer yangbiense]2.3e-4236.44Show/hide
Query:  KKFSLKEIEREAHFSFEATVAEEVVDQVNHCLLGKLLANRFITVSAIKNAMNGAWRTRQKFDIEVAGKNMFAFKFQSQEDRDWVFNNGPWTFDKNLVVLE
        +  S+K+ +RE H      V EE V  V+HCL+GK+L+ + +   A  + +   W    K +IE  G+N+F F FQ+ EDR  V+  GPW FDK+L+VLE
Subjt:  KKFSLKEIEREAHFSFEATVAEEVVDQVNHCLLGKLLANRFITVSAIKNAMNGAWRTRQKFDIEVAGKNMFAFKFQSQEDRDWVFNNGPWTFDKNLVVLE

Query:  KPKLNQRISDMRFNKTTFWLRLLNLPVGFQNKHMAEQIGNRLGKFVEVDYDKDGLHWGDNMRIQVRLNIFNPLPRGFMLKAEGIEGECWITIRYEKLPDL
         P+    I+ +RFNK  FW+++ ++P+   N+  A+ +  ++G+ +E+  +     WG  +R++VR++I  PL R   L  +       + ++YE++P+ 
Subjt:  KPKLNQRISDMRFNKTTFWLRLLNLPVGFQNKHMAEQIGNRLGKFVEVDYDKDGLHWGDNMRIQVRLNIFNPLPRGFMLKAEGIEGECWITIRYEKLPDL

Query:  CFNCGRVGHIAKECLDLEGKEEP-NWNNMEFGGWLK
        CF CGRVGH   EC D+E K+E    NN  FG W++
Subjt:  CFNCGRVGHIAKECLDLEGKEEP-NWNNMEFGGWLK

XP_022132681.1 uncharacterized protein LOC111005481 [Momordica charantia]6.0e-4334.69Show/hide
Query:  DLIDEWKKFSLKEIEREAHFSFEATVAEEVVDQVNHCLLGKLLANRFITVSAIKNAMNGAWRTRQK-FDIEVAGKNMFAFKFQSQEDRDWVFNNGPWTFD
        +L++EWK F L   E +     +++  E     +   L+ KLL+ R I+ + +KN +  AW+   K F +++ G N+F F F    DR+ +   GPWTFD
Subjt:  DLIDEWKKFSLKEIEREAHFSFEATVAEEVVDQVNHCLLGKLLANRFITVSAIKNAMNGAWRTRQK-FDIEVAGKNMFAFKFQSQEDRDWVFNNGPWTFD

Query:  KNLVVLEKPKLNQRISDMRFNKTTFWLRLLNLPVGFQNKHMAEQIGNRLGKFVEVDYDKDGLHWGDNMRIQVRLNIFNPLPRGFMLKAEGIEGECWITIR
        + L++++ P    +  DM F   + W+   +L +   NK MA ++GN +G F +V+ + +   WG  +R++VR ++  PL RG  L  +G  G CWI I+
Subjt:  KNLVVLEKPKLNQRISDMRFNKTTFWLRLLNLPVGFQNKHMAEQIGNRLGKFVEVDYDKDGLHWGDNMRIQVRLNIFNPLPRGFMLKAEGIEGECWITIR

Query:  YEKLPDLCFNCGRVGHIAKECLDLEGKEEPNWNNMEFGGWLKFQG
        YE+LPD  ++CGR+ HI K+C D     +    N+++G WL+FQG
Subjt:  YEKLPDLCFNCGRVGHIAKECLDLEGKEEPNWNNMEFGGWLKFQG

XP_022156185.1 uncharacterized protein LOC111023135 [Momordica charantia]4.3e-4935.56Show/hide
Query:  EDLIDEWKKFSLKEIEREAHFSFEATVAEEVVDQVNHCLLGKLLANRFITVSAIKNAMNGAWRTRQKFDIEVAGKNMFAFKFQSQEDRDWVFNNGPWTFD
        E+L+ +W+KF L   E E     +    +     + + L+GKLLA R I+   +   +  AW+   +  +E  GKN+F F F  + D + V   GPW FD
Subjt:  EDLIDEWKKFSLKEIEREAHFSFEATVAEEVVDQVNHCLLGKLLANRFITVSAIKNAMNGAWRTRQKFDIEVAGKNMFAFKFQSQEDRDWVFNNGPWTFD

Query:  KNLVVLEKPKLNQRISDMRFNKTTFWLRLLNLPVGFQNKHMAEQIGNRLGKFVEVDYDKDGLHWGDNMRIQVRLNIFNPLPRGFMLKAEGIEGECWITIR
        K L+VL+KP  ++ IS++ FN+  FW+ L +LP+ + NK MA ++GN +G FV+VD ++ G  WG ++RI+V ++I  PL RG  +  +G  G CWI I+
Subjt:  KNLVVLEKPKLNQRISDMRFNKTTFWLRLLNLPVGFQNKHMAEQIGNRLGKFVEVDYDKDGLHWGDNMRIQVRLNIFNPLPRGFMLKAEGIEGECWITIR

Query:  YEKLPDLCFNCGRVGHIAKEC-LDLEGKEEPNWNNMEFGGWLKFQGSGWMGRKDSPKRNEQATGENDDCQAGTSKINMEKEMVD
        YE+LPD C+ CG +GH + +C       ++ +    E+G WL+F GS    +K    R  ++    D C  G+S +N ++  V+
Subjt:  YEKLPDLCFNCGRVGHIAKEC-LDLEGKEEPNWNNMEFGGWLKFQGSGWMGRKDSPKRNEQATGENDDCQAGTSKINMEKEMVD

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]9.5e-4132.57Show/hide
Query:  DLIDEWKKFSLKEIEREAHFSFEATVAEEVVDQVNHCLLGKLLANRFITVSAIKNAMNGAWRTRQK-FDIEVAGKNMFAFKFQSQEDRDWVFNNGPWTFD
        DL++EWK F L   E E     +A+       ++   L+GKL   R IT   +KN M  AW+     F+++  G N+F F F    DR+ ++ +GPWTFD
Subjt:  DLIDEWKKFSLKEIEREAHFSFEATVAEEVVDQVNHCLLGKLLANRFITVSAIKNAMNGAWRTRQK-FDIEVAGKNMFAFKFQSQEDRDWVFNNGPWTFD

Query:  KNLVVLEKPKLNQRISDMRFNKTTFWLRLLNLPVGFQNKHMAEQIGNRLGKFVEVDYDKDGLHWGDNMRIQVRLNIFNPLPRGFMLKAEGIEGECWITIR
        + LV++ KP      S++ F K   W+R  +LP+G   + MA ++GN LG F E D D     WG N+R++V L+I  PL RG  L  +G  G  WI I+
Subjt:  KNLVVLEKPKLNQRISDMRFNKTTFWLRLLNLPVGFQNKHMAEQIGNRLGKFVEVDYDKDGLHWGDNMRIQVRLNIFNPLPRGFMLKAEGIEGECWITIR

Query:  YEKLPDLCFNCGRVGHIAKECLDLEGKEEPNWNNMEFGGWLKFQGS--GWMGRKDSPKRNEQATGENDDCQAGTSKINMEKEMVDVRAEGKGKLVDLNES
        YE+LPD C++CG      K                ++G WL++QG+    M +   P+ +      N+   + TS +    + V   A   G +    ES
Subjt:  YEKLPDLCFNCGRVGHIAKECLDLEGKEEPNWNNMEFGGWLKFQGS--GWMGRKDSPKRNEQATGENDDCQAGTSKINMEKEMVDVRAEGKGKLVDLNES

Query:  SIND
         + +
Subjt:  SIND

XP_024033132.1 uncharacterized protein LOC112095437 [Citrus clementina]1.2e-4037.6Show/hide
Query:  EDLIDEWKKFSLKEIEREAHFSFEATVAEEVVDQVNHCLLGKLLANRFITVSAIKNAMNGAWRTRQKFDIEVAGKNMFAFKFQSQEDRDWVFNNGPWTFD
        E+LI + +  +L E E    F F   + E+    V  CL+GK+L  R +    +K A++ AWRT   F +E  G N+F FKF S+ D+  VFN GPW FD
Subjt:  EDLIDEWKKFSLKEIEREAHFSFEATVAEEVVDQVNHCLLGKLLANRFITVSAIKNAMNGAWRTRQKFDIEVAGKNMFAFKFQSQEDRDWVFNNGPWTFD

Query:  KNLVVLEKPKLNQRISDMRFNKTTFWLRLLNLPVGFQNKHMAEQIGNRLGKFVEVDYDKDGLHWGDNMRIQVRLNIFNPLPRGFMLKAEGIEGECWITIR
        + L+VL++PK    I    F+  +FW+R+ N+P+   +  +  ++G+R+GK  ++  D  G  +G+ +RIQV +NI  PL +  +LK E  E +  + + 
Subjt:  KNLVVLEKPKLNQRISDMRFNKTTFWLRLLNLPVGFQNKHMAEQIGNRLGKFVEVDYDKDGLHWGDNMRIQVRLNIFNPLPRGFMLKAEGIEGECWITIR

Query:  YEKLPDLCFNCGRVGHIAKECLDLEGKEEPNWNNMEFGGWLK
        YE+LPD CF CG +GH  +EC++ +G+++    N+ FG WLK
Subjt:  YEKLPDLCFNCGRVGHIAKECLDLEGKEEPNWNNMEFGGWLK

TrEMBL top hitse value%identityAlignment
A0A5C7IHI0 CCHC-type domain-containing protein1.5e-3934.71Show/hide
Query:  DLIDEWKKFSLKEIEREAHFSFEATVAEEVVDQVNHCLLGKLLANRFITVSAIKNAMNGAWRTRQKFDIEVAGKNMFAFKFQSQEDRDWVFNNGPWTFDK
        DL    +  S+K+ + E     E  +  + V+ V+HCL+GK+L+ + +   A K  +   W      +IEV G N F F F + EDRD ++  GPW FD+
Subjt:  DLIDEWKKFSLKEIEREAHFSFEATVAEEVVDQVNHCLLGKLLANRFITVSAIKNAMNGAWRTRQKFDIEVAGKNMFAFKFQSQEDRDWVFNNGPWTFDK

Query:  NLVVLEKPKLNQRISDMRFNKTTFWLRLLNLPVGFQNKHMAEQIGNRLGKFVEVDYDKDGLHWGDNMRIQVRLNIFNPLPRGFMLKAEGIEGECWITIRY
        +L+VLEKP+    IS + FNK  FW+++ ++P+   NK MA+ +  ++G+ VE+  +     WG  +R++V ++I  PL R   LK +  +    + ++Y
Subjt:  NLVVLEKPKLNQRISDMRFNKTTFWLRLLNLPVGFQNKHMAEQIGNRLGKFVEVDYDKDGLHWGDNMRIQVRLNIFNPLPRGFMLKAEGIEGECWITIRY

Query:  EKLPDLCFNCGRVGHIAKECLDLEGKEEP-NWNNMEFGGWLK
        E+LP+ C+ CG+VGH   +C D E K+E       +FG WL+
Subjt:  EKLPDLCFNCGRVGHIAKECLDLEGKEEP-NWNNMEFGGWLK

A0A5C7IJL3 CCHC-type domain-containing protein1.1e-4236.44Show/hide
Query:  KKFSLKEIEREAHFSFEATVAEEVVDQVNHCLLGKLLANRFITVSAIKNAMNGAWRTRQKFDIEVAGKNMFAFKFQSQEDRDWVFNNGPWTFDKNLVVLE
        +  S+K+ +RE H      V EE V  V+HCL+GK+L+ + +   A  + +   W    K +IE  G+N+F F FQ+ EDR  V+  GPW FDK+L+VLE
Subjt:  KKFSLKEIEREAHFSFEATVAEEVVDQVNHCLLGKLLANRFITVSAIKNAMNGAWRTRQKFDIEVAGKNMFAFKFQSQEDRDWVFNNGPWTFDKNLVVLE

Query:  KPKLNQRISDMRFNKTTFWLRLLNLPVGFQNKHMAEQIGNRLGKFVEVDYDKDGLHWGDNMRIQVRLNIFNPLPRGFMLKAEGIEGECWITIRYEKLPDL
         P+    I+ +RFNK  FW+++ ++P+   N+  A+ +  ++G+ +E+  +     WG  +R++VR++I  PL R   L  +       + ++YE++P+ 
Subjt:  KPKLNQRISDMRFNKTTFWLRLLNLPVGFQNKHMAEQIGNRLGKFVEVDYDKDGLHWGDNMRIQVRLNIFNPLPRGFMLKAEGIEGECWITIRYEKLPDL

Query:  CFNCGRVGHIAKECLDLEGKEEP-NWNNMEFGGWLK
        CF CGRVGH   EC D+E K+E    NN  FG W++
Subjt:  CFNCGRVGHIAKECLDLEGKEEP-NWNNMEFGGWLK

A0A6J1BSZ1 uncharacterized protein LOC1110054812.9e-4334.69Show/hide
Query:  DLIDEWKKFSLKEIEREAHFSFEATVAEEVVDQVNHCLLGKLLANRFITVSAIKNAMNGAWRTRQK-FDIEVAGKNMFAFKFQSQEDRDWVFNNGPWTFD
        +L++EWK F L   E +     +++  E     +   L+ KLL+ R I+ + +KN +  AW+   K F +++ G N+F F F    DR+ +   GPWTFD
Subjt:  DLIDEWKKFSLKEIEREAHFSFEATVAEEVVDQVNHCLLGKLLANRFITVSAIKNAMNGAWRTRQK-FDIEVAGKNMFAFKFQSQEDRDWVFNNGPWTFD

Query:  KNLVVLEKPKLNQRISDMRFNKTTFWLRLLNLPVGFQNKHMAEQIGNRLGKFVEVDYDKDGLHWGDNMRIQVRLNIFNPLPRGFMLKAEGIEGECWITIR
        + L++++ P    +  DM F   + W+   +L +   NK MA ++GN +G F +V+ + +   WG  +R++VR ++  PL RG  L  +G  G CWI I+
Subjt:  KNLVVLEKPKLNQRISDMRFNKTTFWLRLLNLPVGFQNKHMAEQIGNRLGKFVEVDYDKDGLHWGDNMRIQVRLNIFNPLPRGFMLKAEGIEGECWITIR

Query:  YEKLPDLCFNCGRVGHIAKECLDLEGKEEPNWNNMEFGGWLKFQG
        YE+LPD  ++CGR+ HI K+C D     +    N+++G WL+FQG
Subjt:  YEKLPDLCFNCGRVGHIAKECLDLEGKEEPNWNNMEFGGWLKFQG

A0A6J1DU55 uncharacterized protein LOC1110231352.1e-4935.56Show/hide
Query:  EDLIDEWKKFSLKEIEREAHFSFEATVAEEVVDQVNHCLLGKLLANRFITVSAIKNAMNGAWRTRQKFDIEVAGKNMFAFKFQSQEDRDWVFNNGPWTFD
        E+L+ +W+KF L   E E     +    +     + + L+GKLLA R I+   +   +  AW+   +  +E  GKN+F F F  + D + V   GPW FD
Subjt:  EDLIDEWKKFSLKEIEREAHFSFEATVAEEVVDQVNHCLLGKLLANRFITVSAIKNAMNGAWRTRQKFDIEVAGKNMFAFKFQSQEDRDWVFNNGPWTFD

Query:  KNLVVLEKPKLNQRISDMRFNKTTFWLRLLNLPVGFQNKHMAEQIGNRLGKFVEVDYDKDGLHWGDNMRIQVRLNIFNPLPRGFMLKAEGIEGECWITIR
        K L+VL+KP  ++ IS++ FN+  FW+ L +LP+ + NK MA ++GN +G FV+VD ++ G  WG ++RI+V ++I  PL RG  +  +G  G CWI I+
Subjt:  KNLVVLEKPKLNQRISDMRFNKTTFWLRLLNLPVGFQNKHMAEQIGNRLGKFVEVDYDKDGLHWGDNMRIQVRLNIFNPLPRGFMLKAEGIEGECWITIR

Query:  YEKLPDLCFNCGRVGHIAKEC-LDLEGKEEPNWNNMEFGGWLKFQGSGWMGRKDSPKRNEQATGENDDCQAGTSKINMEKEMVD
        YE+LPD C+ CG +GH + +C       ++ +    E+G WL+F GS    +K    R  ++    D C  G+S +N ++  V+
Subjt:  YEKLPDLCFNCGRVGHIAKEC-LDLEGKEEPNWNNMEFGGWLKFQGSGWMGRKDSPKRNEQATGENDDCQAGTSKINMEKEMVD

A0A6J1DX30 uncharacterized protein LOC1110248744.6e-4132.57Show/hide
Query:  DLIDEWKKFSLKEIEREAHFSFEATVAEEVVDQVNHCLLGKLLANRFITVSAIKNAMNGAWRTRQK-FDIEVAGKNMFAFKFQSQEDRDWVFNNGPWTFD
        DL++EWK F L   E E     +A+       ++   L+GKL   R IT   +KN M  AW+     F+++  G N+F F F    DR+ ++ +GPWTFD
Subjt:  DLIDEWKKFSLKEIEREAHFSFEATVAEEVVDQVNHCLLGKLLANRFITVSAIKNAMNGAWRTRQK-FDIEVAGKNMFAFKFQSQEDRDWVFNNGPWTFD

Query:  KNLVVLEKPKLNQRISDMRFNKTTFWLRLLNLPVGFQNKHMAEQIGNRLGKFVEVDYDKDGLHWGDNMRIQVRLNIFNPLPRGFMLKAEGIEGECWITIR
        + LV++ KP      S++ F K   W+R  +LP+G   + MA ++GN LG F E D D     WG N+R++V L+I  PL RG  L  +G  G  WI I+
Subjt:  KNLVVLEKPKLNQRISDMRFNKTTFWLRLLNLPVGFQNKHMAEQIGNRLGKFVEVDYDKDGLHWGDNMRIQVRLNIFNPLPRGFMLKAEGIEGECWITIR

Query:  YEKLPDLCFNCGRVGHIAKECLDLEGKEEPNWNNMEFGGWLKFQGS--GWMGRKDSPKRNEQATGENDDCQAGTSKINMEKEMVDVRAEGKGKLVDLNES
        YE+LPD C++CG      K                ++G WL++QG+    M +   P+ +      N+   + TS +    + V   A   G +    ES
Subjt:  YEKLPDLCFNCGRVGHIAKECLDLEGKEEPNWNNMEFGGWLKFQGS--GWMGRKDSPKRNEQATGENDDCQAGTSKINMEKEMVDVRAEGKGKLVDLNES

Query:  SIND
         + +
Subjt:  SIND

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G42140.1 zinc ion binding;nucleic acid binding9.9e-1226.22Show/hide
Query:  RTRQKFDIEVAGK----NMFAFKFQSQEDRDWVFNNGPWTFDKNLVVLEK-PKLNQRISDMRFNKTTFWLRLLNLPVGFQNKHMAEQIGNRLGKFVEVDY
        R  +  D EV G+    +   F FQS+E    +   GPW+F+  + V+++  KL+   SD  F +  FW+++  +P+ F    +   IG R+G F+E + 
Subjt:  RTRQKFDIEVAGK----NMFAFKFQSQEDRDWVFNNGPWTFDKNLVVLEK-PKLNQRISDMRFNKTTFWLRLLNLPVGFQNKHMAEQIGNRLGKFVEVDY

Query:  DKDGLHWGDNMRIQVRLNIFNPLPRGFMLKAEGIEGECWITIRYEKLPDLCFNCGRVGHIAKEC
         +D         + V                        +  +YEKL + C  CG + H A EC
Subjt:  DKDGLHWGDNMRIQVRLNIFNPLPRGFMLKAEGIEGECWITIRYEKLPDLCFNCGRVGHIAKEC

AT5G32613.1 Zinc knuckle (CCHC-type) family protein5.3e-0523.71Show/hide
Query:  WLRLLNLPVGFQNKHMAEQIGNRLGKFVEVDYDKDGLHWGDNMRIQVRLNIFNPLPRGFMLKAEGIEGE-CWITIRYEKLPDLCFNCGRVGHIAKEC
        W  L N+P    +      I + +G+ +  +  + G       +++V  N+  PLP   +++   ++G    + + Y + P  C NCGR GH+   C
Subjt:  WLRLLNLPVGFQNKHMAEQIGNRLGKFVEVDYDKDGLHWGDNMRIQVRLNIFNPLPRGFMLKAEGIEGE-CWITIRYEKLPDLCFNCGRVGHIAKEC

AT5G36228.1 nucleic acid binding;zinc ion binding2.1e-0920.53Show/hide
Query:  EDLIDEWKKFSLKEIEREAHFSFEATVAEEVVDQVNHCLLGKLLANRFITVSAIKNAMNGAWRTRQKFDIEVAGKNMFAFKFQSQEDRDWVFNNGPWTFD
        ++L +  +   L   E E +  + A V     ++++  LLG++L  +  +V      +   W    +    +     F  +F+S+ D        PW F+
Subjt:  EDLIDEWKKFSLKEIEREAHFSFEATVAEEVVDQVNHCLLGKLLANRFITVSAIKNAMNGAWRTRQKFDIEVAGKNMFAFKFQSQEDRDWVFNNGPWTFD

Query:  KNLVVLEKPKLNQRISDMRFNKTTFWLRLLNLPVGFQNKHMAEQIGNRLGKFVEVDYDKDGLHWGDNMRIQVRLNIFNPLPRGFMLKAEGIEGECWITIR
        +  + L++ +       + F     W+ +  +P+ + ++   E I + LG+ V +D++++       +R++VR++   PL R F            I   
Subjt:  KNLVVLEKPKLNQRISDMRFNKTTFWLRLLNLPVGFQNKHMAEQIGNRLGKFVEVDYDKDGLHWGDNMRIQVRLNIFNPLPRGFMLKAEGIEGECWITIR

Query:  YEKLPDLCFNCGRVGHIAKECLDLEGKEEPNWNNMEFGGWLKFQGSGWMGRKDSPKR--NEQATGENDDCQAGTSKINMEKEMVDVRAEGKGKLVDLNES
        YEKL  +C NC RV H    C  +  +EE + N  +                 SP+R  +E +  + D  +   S +      +   +     +V+ N++
Subjt:  YEKLPDLCFNCGRVGHIAKECLDLEGKEEPNWNNMEFGGWLKFQGSGWMGRKDSPKR--NEQATGENDDCQAGTSKINMEKEMVDVRAEGKGKLVDLNES

Query:  SINDMEDGFPEINLLS---------CAERSASKDS---PCGSSTNGISGGMMLTDKKRRTWKRRVRGGQQGDPVN
         I ++   FP  ++ S          A     KD      G S+    G  +L   +R   +RR+  G +  PVN
Subjt:  SINDMEDGFPEINLLS---------CAERSASKDS---PCGSSTNGISGGMMLTDKKRRTWKRRVRGGQQGDPVN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGATCTGATCGATGAGTGGAAAAAATTCAGTTTGAAGGAAATTGAAAGAGAGGCGCATTTTTCGTTTGAAGCTACGGTGGCCGAAGAAGTTGTTGATCAAGTGAA
CCATTGTCTGCTAGGCAAACTCTTGGCAAACAGATTCATAACAGTGTCCGCTATCAAGAATGCTATGAATGGCGCATGGAGAACGAGGCAAAAATTCGATATTGAAGTGG
CGGGAAAGAATATGTTTGCCTTCAAATTCCAGAGTCAAGAAGACAGGGACTGGGTGTTCAATAATGGTCCATGGACGTTTGACAAGAATCTTGTGGTTCTTGAAAAGCCA
AAACTGAACCAGCGTATTTCTGATATGAGATTCAATAAAACGACATTCTGGCTGCGATTACTTAACCTTCCAGTGGGCTTTCAGAACAAACACATGGCTGAACAGATTGG
AAACCGTCTAGGGAAATTTGTAGAGGTGGATTATGACAAAGATGGGTTGCATTGGGGAGACAATATGAGAATTCAAGTTCGCCTAAATATCTTTAACCCTTTACCTCGCG
GCTTCATGCTAAAGGCAGAGGGGATCGAAGGTGAATGCTGGATCACAATCAGGTATGAAAAGCTACCTGATCTTTGCTTTAACTGCGGACGAGTTGGCCACATAGCGAAA
GAGTGCTTGGATCTGGAGGGGAAAGAAGAACCAAATTGGAATAATATGGAATTTGGAGGATGGCTTAAATTCCAAGGTTCGGGTTGGATGGGAAGGAAAGATTCACCCAA
GAGAAATGAGCAGGCGACGGGGGAAAATGATGATTGTCAGGCAGGAACGTCAAAGATTAATATGGAAAAAGAAATGGTTGATGTCAGGGCTGAGGGTAAAGGGAAGTTGG
TGGATCTAAATGAGTCAAGTATCAATGATATGGAAGATGGTTTCCCTGAGATCAACTTACTTTCCTGCGCCGAAAGATCGGCTAGCAAAGATAGTCCTTGTGGTTCATCG
ACAAATGGTATCTCTGGGGGTATGATGTTGACTGACAAGAAGCGCAGAACCTGGAAGAGAAGAGTTCGAGGGGGACAGCAGGGTGATCCTGTGAATGAAGTTGAGGCAAG
TGATGTGGGTTTGAAAAAAGAAAGAACAAGGATGACTCATCCAACAATGAAAAGCGTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGGATCTGATCGATGAGTGGAAAAAATTCAGTTTGAAGGAAATTGAAAGAGAGGCGCATTTTTCGTTTGAAGCTACGGTGGCCGAAGAAGTTGTTGATCAAGTGAA
CCATTGTCTGCTAGGCAAACTCTTGGCAAACAGATTCATAACAGTGTCCGCTATCAAGAATGCTATGAATGGCGCATGGAGAACGAGGCAAAAATTCGATATTGAAGTGG
CGGGAAAGAATATGTTTGCCTTCAAATTCCAGAGTCAAGAAGACAGGGACTGGGTGTTCAATAATGGTCCATGGACGTTTGACAAGAATCTTGTGGTTCTTGAAAAGCCA
AAACTGAACCAGCGTATTTCTGATATGAGATTCAATAAAACGACATTCTGGCTGCGATTACTTAACCTTCCAGTGGGCTTTCAGAACAAACACATGGCTGAACAGATTGG
AAACCGTCTAGGGAAATTTGTAGAGGTGGATTATGACAAAGATGGGTTGCATTGGGGAGACAATATGAGAATTCAAGTTCGCCTAAATATCTTTAACCCTTTACCTCGCG
GCTTCATGCTAAAGGCAGAGGGGATCGAAGGTGAATGCTGGATCACAATCAGGTATGAAAAGCTACCTGATCTTTGCTTTAACTGCGGACGAGTTGGCCACATAGCGAAA
GAGTGCTTGGATCTGGAGGGGAAAGAAGAACCAAATTGGAATAATATGGAATTTGGAGGATGGCTTAAATTCCAAGGTTCGGGTTGGATGGGAAGGAAAGATTCACCCAA
GAGAAATGAGCAGGCGACGGGGGAAAATGATGATTGTCAGGCAGGAACGTCAAAGATTAATATGGAAAAAGAAATGGTTGATGTCAGGGCTGAGGGTAAAGGGAAGTTGG
TGGATCTAAATGAGTCAAGTATCAATGATATGGAAGATGGTTTCCCTGAGATCAACTTACTTTCCTGCGCCGAAAGATCGGCTAGCAAAGATAGTCCTTGTGGTTCATCG
ACAAATGGTATCTCTGGGGGTATGATGTTGACTGACAAGAAGCGCAGAACCTGGAAGAGAAGAGTTCGAGGGGGACAGCAGGGTGATCCTGTGAATGAAGTTGAGGCAAG
TGATGTGGGTTTGAAAAAAGAAAGAACAAGGATGACTCATCCAACAATGAAAAGCGTTTGA
Protein sequenceShow/hide protein sequence
MEDLIDEWKKFSLKEIEREAHFSFEATVAEEVVDQVNHCLLGKLLANRFITVSAIKNAMNGAWRTRQKFDIEVAGKNMFAFKFQSQEDRDWVFNNGPWTFDKNLVVLEKP
KLNQRISDMRFNKTTFWLRLLNLPVGFQNKHMAEQIGNRLGKFVEVDYDKDGLHWGDNMRIQVRLNIFNPLPRGFMLKAEGIEGECWITIRYEKLPDLCFNCGRVGHIAK
ECLDLEGKEEPNWNNMEFGGWLKFQGSGWMGRKDSPKRNEQATGENDDCQAGTSKINMEKEMVDVRAEGKGKLVDLNESSINDMEDGFPEINLLSCAERSASKDSPCGSS
TNGISGGMMLTDKKRRTWKRRVRGGQQGDPVNEVEASDVGLKKERTRMTHPTMKSV