CuGenDBv2

Lag0008566 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0008566
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: CCHC-type domain-containing protein
Genome location: chr9:25516933..25518510
RNA-Seq Expression: Lag0008566
Synteny: Lag0008566
Gene Ontology terms:
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
  IPR001878 - Zinc finger, CCHC-type
  IPR025558 - Domain of unknown function DUF4283
  IPR025836 - Zinc knuckle CX2CX4HX4C
  IPR040256 - Uncharacterized protein At4g02000-like
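
The zinc knuckle entry above is named after its spacing pattern, CX2CX4HX4C. As a rough illustration only, the pattern can be searched for with a regular expression. The function name `find_zinc_knuckles` and the choice of fragment are hypothetical; the fragment is copied from the protein sequence listed on this page, and a regex hit is not equivalent to an InterPro domain call.

```python
import re

# Spacing pattern taken from the InterPro entry name: C-X2-C-X4-H-X4-C.
# This is a plain text match, not the profile InterPro actually uses.
ZINC_KNUCKLE = re.compile(r"C.{2}C.{4}H.{4}C")


def find_zinc_knuckles(protein):
    """Return (1-based start position, matched text) for each pattern hit."""
    return [(m.start() + 1, m.group()) for m in ZINC_KNUCKLE.finditer(protein)]


# Fragment of the Lag0008566 protein sequence (copied from this record).
fragment = "WIDMRYEKLPEFCYGCGRIGHLARECTDLEAMNKEELDYGPWLRKEG"

for position, motif in find_zinc_knuckles(fragment):
    print(position, motif)
```

Positions are relative to the string passed in, so running the function on the full protein sequence reports coordinates in the full-length protein.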


Homology
GenBank top hits | e value | % identity | Alignment
TXG54013.1 hypothetical protein EZV62_019269 [Acer yangbiense] | e value: 8.7e-37 | % identity: 29.81
Query:  EETSISKQLEKLNITAEERGNIVRMKDEDLKKKVQDMENILVCKILTEKSINPEIFKSMIPRIWGMEGKVTIKKAGENMFECNFDSKYRKRRIMEASPWI
        E   IS++ EKL++  ++ G I R+K    ++  Q +   L+ K +T K IN E FKS I  IW  + +VT++  G N+F+  F + + ++RI+E  PW+
Subjt:  EETSISKQLEKLNITAEERGNIVRMKDEDLKKKVQDMENILVCKILTEKSINPEIFKSMIPRIWGMEGKVTIKKAGENMFECNFDSKYRKRRIMEASPWI

Query:  YDNALLAFEEIKGTERYSRIQFRHVPFWVHFLDLPRICFNRKWAEDLGNAVGVFERVDFDKDEYEVGESMRVRVKIDVQKPLRRGTVITLGTNAEEEWID
        +D  LL   E  G+E+ + +QFR+VPFW+   +LP  C NR+    LG  VG  + +D  +    VG+ +R+RV IDV  PL+RG  + LG + +   + 
Subjt:  YDNALLAFEEIKGTERYSRIQFRHVPFWVHFLDLPRICFNRKWAEDLGNAVGVFERVDFDKDEYEVGESMRVRVKIDVQKPLRRGTVITLGTNAEEEWID

Query:  MRYEKLPEFCYGCGRIGHLAREC--TDLEAMNKEELDYGPWLRKEGTARGK--------PRQKRGGGG-----------------GKHQR---EDKDQKQ
        + YE+LP FCY CG+IGHL R+C     E  +     +GPW+R     R K        P   R GG                  GK       D +Q  
Subjt:  MRYEKLPEFCYGCGRIGHLAREC--TDLEAMNKEELDYGPWLRKEGTARGK--------PRQKRGGGG-----------------GKHQR---EDKDQKQ

Query:  RWIEKKNGNKAEEK---------ENQE------AHRPEKPPE-------KME----VQSPVEPRNTTNQTVSSKSQRTEDAMMVDQTPRNTENQMVEKEK
           E K+GN  E K           QE      +H  EK  E       +ME    V +PV   N +NQ  +  S+        D   + T  +  ++  
Subjt:  RWIEKKNGNKAEEK---------ENQE------AHRPEKPPE-------KME----VQSPVEPRNTTNQTVSSKSQRTEDAMMVDQTPRNTENQMVEKEK

Query:  RKVCERGHQTNKHVG-QEGDNETQIKGPRCEGQLADLGQTSQSGQLKGKLYPDMEKNLSTSTQ
        R+ C   ++ N+ +G ++GD + +    R +  +  + +  Q G L      D+E ++ + T+
Subjt:  RKVCERGHQTNKHVG-QEGDNETQIKGPRCEGQLADLGQTSQSGQLKGKLYPDMEKNLSTSTQ

TXG57113.1 hypothetical protein EZV62_018426 [Acer yangbiense] | e value: 2.1e-38 | % identity: 35.71
Query:  ETSISKQLEKLNITAEERGNIVRMKDEDLKKKVQDMENILVCKILTEKSINPEIFKSMIPRIWGMEGKVTIKKAGENMFECNFDSKYRKRRIMEASPWIY
        E  I+K  E L I  E+R  I+ M +E     V+D++  LV K+L+ K +N E FK +I +IW   G+V ++  G+N F   F +   + R+    PW +
Subjt:  ETSISKQLEKLNITAEERGNIVRMKDEDLKKKVQDMENILVCKILTEKSINPEIFKSMIPRIWGMEGKVTIKKAGENMFECNFDSKYRKRRIMEASPWIY

Query:  DNALLAFEEIKGTERYSRIQFRHVPFWVHFLDLPRICFNRKWAEDLGNAVGVFERVDFDKDEYEVGESMRVRVKIDVQKPLRRGTVITLGTNAEEEWIDM
         N+L+  E+  G    S++ F    FWV   D+P +C NR+ A+ L   +G    +  +  E   G+ MRV+V+ID+ KPL+R   + LG   E   + +
Subjt:  DNALLAFEEIKGTERYSRIQFRHVPFWVHFLDLPRICFNRKWAEDLGNAVGVFERVDFDKDEYEVGESMRVRVKIDVQKPLRRGTVITLGTNAEEEWIDM

Query:  RYEKLPEFCYGCGRIGHLARECTDLE----AMNKEELDYGPWLRKEGTARGKPRQKRGGGGGKHQR
        +YE+LPEFCY CG+IGH  +EC D E    A+      +G WL+   T R KPR    G G    R
Subjt:  RYEKLPEFCYGCGRIGHLARECTDLE----AMNKEELDYGPWLRKEGTARGKPRQKRGGGGGKHQR

XP_022132681.1 uncharacterized protein LOC111005481 [Momordica charantia] | e value: 1.9e-36 | % identity: 33.88
Query:  TSISKQLEKLNITAEERGNIVRMKDEDLKKKVQDMENILVCKILTEKSINPEIFKSMIPRIWGMEGKV-TIKKAGENMFECNFDSKYRKRRIMEASPWIY
        +++ ++ +   +T+EE    V +    L+   + +E  L+CK+L+++SI+  + K+ +   W ++ K  ++   G N+F  NF+    + RI+   PW +
Subjt:  TSISKQLEKLNITAEERGNIVRMKDEDLKKKVQDMENILVCKILTEKSINPEIFKSMIPRIWGMEGKV-TIKKAGENMFECNFDSKYRKRRIMEASPWIY

Query:  DNALLAFEEIKGTERYSRIQFRHVPFWVHFLDLPRICFNRKWAEDLGNAVGVFERVDFDKDEYEVGESMRVRVKIDVQKPLRRGTVITLGTNAEEEWIDM
        D AL+  +      +   + FR+V  WVHF DL   C N+  A  LGNA+G+FE V+ + + +  G  +RVRV+ DV KPL RG  + L       WI +
Subjt:  DNALLAFEEIKGTERYSRIQFRHVPFWVHFLDLPRICFNRKWAEDLGNAVGVFERVDFDKDEYEVGESMRVRVKIDVQKPLRRGTVITLGTNAEEEWIDM

Query:  RYEKLPEFCYGCGRIGHLARECTD--LEAMNKEELDYGPWLRKEG
        +YE+LP+F Y CGR+ H+ ++C+D  +++++K  L YGPWLR +G
Subjt:  RYEKLPEFCYGCGRIGHLARECTD--LEAMNKEELDYGPWLRKEG

XP_028071384.1 uncharacterized protein LOC114273772 [Camellia sinensis] | e value: 4.3e-36 | % identity: 35.8
Query:  LNITAEERGNIVRMKDEDLKKKVQDMENILVCKILTEKSINPEIFKSMIPRIWGMEGKVTIKKAGENMFECNFDSKYRKRRIMEASPWIYDNALLAFEEI
        L++T+EE   +VR+  +     +   +  LV K+LT +  N E  KS +  +W     + ++  G+N+F   F     KRRI+   PW +D  LL   E+
Subjt:  LNITAEERGNIVRMKDEDLKKKVQDMENILVCKILTEKSINPEIFKSMIPRIWGMEGKVTIKKAGENMFECNFDSKYRKRRIMEASPWIYDNALLAFEEI

Query:  KGTERYSRIQFRHVPFWVHFLDLPRICFNRKWAEDLGNAVGVFERVDFDKDEYEVGESMRVRVKIDVQKPLRRGTVITLGTNAEEEWIDMRYEKLPEFCY
            + S IQ   V FWVH  +LP I  N+K  + +GNAVG F  +D++      G +M +RV +DV+KPLRRG  + L ++ E  W+D +YE+LP +CY
Subjt:  KGTERYSRIQFRHVPFWVHFLDLPRICFNRKWAEDLGNAVGVFERVDFDKDEYEVGESMRVRVKIDVQKPLRRGTVITLGTNAEEEWIDMRYEKLPEFCY

Query:  GCGRIGHLAREC----TDLEAMNKEELDYGPWLRKEGTARGKPRQKRG---GGGGKH
         CGR+GH  REC    +  +    + L YG WLR +      PR+      G GG H
Subjt:  GCGRIGHLAREC----TDLEAMNKEELDYGPWLRKEGTARGKPRQKRG---GGGGKH

XP_028122006.1 uncharacterized protein LOC114319195 [Camellia sinensis] | e value: 1.7e-37 | % identity: 36.43
Query:  LNITAEERGNIVRMKDEDLKKKVQDMENILVCKILTEKSINPEIFKSMIPRIWGMEGKVTIKKAGENMFECNFDSKYRKRRIMEASPWIYDNALLAFEEI
        L++T+EE   +VR+  E     +   +  LV K+LT +  N E  K+ +  +W     + ++  G+N+F   F     KRR++   PW +D  LL   E+
Subjt:  LNITAEERGNIVRMKDEDLKKKVQDMENILVCKILTEKSINPEIFKSMIPRIWGMEGKVTIKKAGENMFECNFDSKYRKRRIMEASPWIYDNALLAFEEI

Query:  KGTERYSRIQFRHVPFWVHFLDLPRICFNRKWAEDLGNAVGVFERVDFDKDEYEVGESMRVRVKIDVQKPLRRGTVITLGTNAEEEWIDMRYEKLPEFCY
            + S IQ   V FWVH  +LP +  N+K  E +GNAVG F  +D++      G +MR+RV +DV+KPLRRG  + L ++AE  W+D +YE+LP +CY
Subjt:  KGTERYSRIQFRHVPFWVHFLDLPRICFNRKWAEDLGNAVGVFERVDFDKDEYEVGESMRVRVKIDVQKPLRRGTVITLGTNAEEEWIDMRYEKLPEFCY

Query:  GCGRIGHLARECTD----LEAMNKEELDYGPWLRKEGTARGKPRQKRG----GGGGKH
         CGR+GH  REC D     +    + L YG WLR +   + K  ++ G    G GG H
Subjt:  GCGRIGHLARECTD----LEAMNKEELDYGPWLRKEGTARGKPRQKRG----GGGGKH

TrEMBL top hits | e value | % identity | Alignment
A0A1S8AC25 CCHC-type domain-containing protein (Fragment) | e value: 2.3e-35 | % identity: 36.68
Query:  PEETSISKQLEKLNITAEERGNIVRMKDEDLKKKVQDMENILVCKILTEKSINPEIFKSMIPRIWGMEGKVTIKKAGENMFECNFDSKYRKRRIMEASPW
        PEE  + K+   + ++ EE G  V  K+       + +   LV K+L  + ++ E  K  + R+W    +V I+K GEN+F   F S+  KR IM   PW
Subjt:  PEETSISKQLEKLNITAEERGNIVRMKDEDLKKKVQDMENILVCKILTEKSINPEIFKSMIPRIWGMEGKVTIKKAGENMFECNFDSKYRKRRIMEASPW

Query:  IYDNALLAFEEIKGTERYSRIQFRHVPFWVHFLDLPRICFNRKWAEDLGNAVGVFERVDFDKDEYEVGESMRVRVKIDVQKPLRRGTVITLGTNAEE-EW
         +D AL+   E  G     +  F HV FWV   D+P +C ++  A +LG  +G  E V+ D      G+ +R+R+ +D+ KPL++  +I L    E+ + 
Subjt:  IYDNALLAFEEIKGTERYSRIQFRHVPFWVHFLDLPRICFNRKWAEDLGNAVGVFERVDFDKDEYEVGESMRVRVKIDVQKPLRRGTVITLGTNAEE-EW

Query:  IDMR--YEKLPEFCYGCGRIGHLARECTDLEAMNKEELDYGPWLRKEGTARGKPRQKRG
        I MR  YE+LP+FC+ CGRIGH  REC   ++ +K+EL YGPWL K  T   K +Q RG
Subjt:  IDMR--YEKLPEFCYGCGRIGHLARECTDLEAMNKEELDYGPWLRKEGTARGKPRQKRG

A0A5C7H9Y2 CCHC-type domain-containing protein | e value: 4.2e-37 | % identity: 29.81
Query:  EETSISKQLEKLNITAEERGNIVRMKDEDLKKKVQDMENILVCKILTEKSINPEIFKSMIPRIWGMEGKVTIKKAGENMFECNFDSKYRKRRIMEASPWI
        E   IS++ EKL++  ++ G I R+K    ++  Q +   L+ K +T K IN E FKS I  IW  + +VT++  G N+F+  F + + ++RI+E  PW+
Subjt:  EETSISKQLEKLNITAEERGNIVRMKDEDLKKKVQDMENILVCKILTEKSINPEIFKSMIPRIWGMEGKVTIKKAGENMFECNFDSKYRKRRIMEASPWI

Query:  YDNALLAFEEIKGTERYSRIQFRHVPFWVHFLDLPRICFNRKWAEDLGNAVGVFERVDFDKDEYEVGESMRVRVKIDVQKPLRRGTVITLGTNAEEEWID
        +D  LL   E  G+E+ + +QFR+VPFW+   +LP  C NR+    LG  VG  + +D  +    VG+ +R+RV IDV  PL+RG  + LG + +   + 
Subjt:  YDNALLAFEEIKGTERYSRIQFRHVPFWVHFLDLPRICFNRKWAEDLGNAVGVFERVDFDKDEYEVGESMRVRVKIDVQKPLRRGTVITLGTNAEEEWID

Query:  MRYEKLPEFCYGCGRIGHLAREC--TDLEAMNKEELDYGPWLRKEGTARGK--------PRQKRGGGG-----------------GKHQR---EDKDQKQ
        + YE+LP FCY CG+IGHL R+C     E  +     +GPW+R     R K        P   R GG                  GK       D +Q  
Subjt:  MRYEKLPEFCYGCGRIGHLAREC--TDLEAMNKEELDYGPWLRKEGTARGK--------PRQKRGGGG-----------------GKHQR---EDKDQKQ

Query:  RWIEKKNGNKAEEK---------ENQE------AHRPEKPPE-------KME----VQSPVEPRNTTNQTVSSKSQRTEDAMMVDQTPRNTENQMVEKEK
           E K+GN  E K           QE      +H  EK  E       +ME    V +PV   N +NQ  +  S+        D   + T  +  ++  
Subjt:  RWIEKKNGNKAEEK---------ENQE------AHRPEKPPE-------KME----VQSPVEPRNTTNQTVSSKSQRTEDAMMVDQTPRNTENQMVEKEK

Query:  RKVCERGHQTNKHVG-QEGDNETQIKGPRCEGQLADLGQTSQSGQLKGKLYPDMEKNLSTSTQ
        R+ C   ++ N+ +G ++GD + +    R +  +  + +  Q G L      D+E ++ + T+
Subjt:  RKVCERGHQTNKHVG-QEGDNETQIKGPRCEGQLADLGQTSQSGQLKGKLYPDMEKNLSTSTQ

A0A5C7HB59 Uncharacterized protein | e value: 3.9e-35 | % identity: 31.07
Query:  ISKQLEKLNITAEERGNIVRMKDEDLKKKVQDMENILVCKILTEKSINPEIFKSMIPRIWGMEGKVTIKKAGENMFECNFDSKYRKRRIMEASPWIYDNA
        I++  E L++ AEE G ++ M +E     V+D+   LV K+L+ K +N E FK +I +IW   G+V ++  G+N+F   F++   + R+++  PW + N+
Subjt:  ISKQLEKLNITAEERGNIVRMKDEDLKKKVQDMENILVCKILTEKSINPEIFKSMIPRIWGMEGKVTIKKAGENMFECNFDSKYRKRRIMEASPWIYDNA

Query:  LLAFEEIKGTERYSRIQFRHVPFWVHFLDLPRICFNRKWAEDLGNAVGVFERVDFDKDEYEV-GESMRVRVKIDVQKPLRRGTVITLGTNAEEEWIDMRY
        L+  E+  G    S++ F    FWV   D+P +C NR+ A+ L   +G  E V+   +  E  G+ M V+V+ID+ KPL+R   + +G   E   + ++Y
Subjt:  LLAFEEIKGTERYSRIQFRHVPFWVHFLDLPRICFNRKWAEDLGNAVGVFERVDFDKDEYEV-GESMRVRVKIDVQKPLRRGTVITLGTNAEEEWIDMRY

Query:  EKLPEFCYGCGRIGHLARECTDLEAMNKEELD-----YGPWLRKEGTARGKPRQKRGGGGGKHQREDKDQKQRWIEKKNGNKAE-EKENQEAHRP-EKPP
        EKLP+FCY CGR+G+  +EC D EA  K  L+     +G WL+     + KPR    G G    R           + +G   + E +N  +H+P    P
Subjt:  EKLPEFCYGCGRIGHLARECTDLEAMNKEELD-----YGPWLRKEGTARGKPRQKRGGGGGKHQREDKDQKQRWIEKKNGNKAE-EKENQEAHRP-EKPP

Query:  EKMEVQSPVEPRNTTNQTVSSKSQRTEDAMMVDQTPRNTENQMVEKEKRKVCER
        ++ EV SPV  +    + +  K  +T   +  D  P+  +  +    + KV  R
Subjt:  EKMEVQSPVEPRNTTNQTVSSKSQRTEDAMMVDQTPRNTENQMVEKEKRKVCER

A0A5C7HJ97 CCHC-type domain-containing protein | e value: 1.0e-38 | % identity: 35.71
Query:  ETSISKQLEKLNITAEERGNIVRMKDEDLKKKVQDMENILVCKILTEKSINPEIFKSMIPRIWGMEGKVTIKKAGENMFECNFDSKYRKRRIMEASPWIY
        E  I+K  E L I  E+R  I+ M +E     V+D++  LV K+L+ K +N E FK +I +IW   G+V ++  G+N F   F +   + R+    PW +
Subjt:  ETSISKQLEKLNITAEERGNIVRMKDEDLKKKVQDMENILVCKILTEKSINPEIFKSMIPRIWGMEGKVTIKKAGENMFECNFDSKYRKRRIMEASPWIY

Query:  DNALLAFEEIKGTERYSRIQFRHVPFWVHFLDLPRICFNRKWAEDLGNAVGVFERVDFDKDEYEVGESMRVRVKIDVQKPLRRGTVITLGTNAEEEWIDM
         N+L+  E+  G    S++ F    FWV   D+P +C NR+ A+ L   +G    +  +  E   G+ MRV+V+ID+ KPL+R   + LG   E   + +
Subjt:  DNALLAFEEIKGTERYSRIQFRHVPFWVHFLDLPRICFNRKWAEDLGNAVGVFERVDFDKDEYEVGESMRVRVKIDVQKPLRRGTVITLGTNAEEEWIDM

Query:  RYEKLPEFCYGCGRIGHLARECTDLE----AMNKEELDYGPWLRKEGTARGKPRQKRGGGGGKHQR
        +YE+LPEFCY CG+IGH  +EC D E    A+      +G WL+   T R KPR    G G    R
Subjt:  RYEKLPEFCYGCGRIGHLARECTDLE----AMNKEELDYGPWLRKEGTARGKPRQKRGGGGGKHQR

A0A6J1BSZ1 uncharacterized protein LOC111005481 | e value: 9.4e-37 | % identity: 33.88
Query:  TSISKQLEKLNITAEERGNIVRMKDEDLKKKVQDMENILVCKILTEKSINPEIFKSMIPRIWGMEGKV-TIKKAGENMFECNFDSKYRKRRIMEASPWIY
        +++ ++ +   +T+EE    V +    L+   + +E  L+CK+L+++SI+  + K+ +   W ++ K  ++   G N+F  NF+    + RI+   PW +
Subjt:  TSISKQLEKLNITAEERGNIVRMKDEDLKKKVQDMENILVCKILTEKSINPEIFKSMIPRIWGMEGKV-TIKKAGENMFECNFDSKYRKRRIMEASPWIY

Query:  DNALLAFEEIKGTERYSRIQFRHVPFWVHFLDLPRICFNRKWAEDLGNAVGVFERVDFDKDEYEVGESMRVRVKIDVQKPLRRGTVITLGTNAEEEWIDM
        D AL+  +      +   + FR+V  WVHF DL   C N+  A  LGNA+G+FE V+ + + +  G  +RVRV+ DV KPL RG  + L       WI +
Subjt:  DNALLAFEEIKGTERYSRIQFRHVPFWVHFLDLPRICFNRKWAEDLGNAVGVFERVDFDKDEYEVGESMRVRVKIDVQKPLRRGTVITLGTNAEEEWIDM

Query:  RYEKLPEFCYGCGRIGHLARECTD--LEAMNKEELDYGPWLRKEG
        +YE+LP+F Y CGR+ H+ ++C+D  +++++K  L YGPWLR +G
Subjt:  RYEKLPEFCYGCGRIGHLARECTD--LEAMNKEELDYGPWLRKEG

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
AT3G42140.1 zinc ion binding;nucleic acid binding | e value: 3.8e-06 | % identity: 21.43
Query:  FDSKYRKRRIMEASPWIYDNALLAFEEIKGTERYSRIQFRHVPFWVHFLDLPRICFNRKWAEDLGNAVGVFERVDFDKDEYEVGESMRVRVKIDVQKPLR
        F S+     I+   PW +++ +   +  + T+ +S  +F+ +PFW+    +P      +    +G  +G+F   +  +D             + V K   
Subjt:  FDSKYRKRRIMEASPWIYDNALLAFEEIKGTERYSRIQFRHVPFWVHFLDLPRICFNRKWAEDLGNAVGVFERVDFDKDEYEVGESMRVRVKIDVQKPLR

Query:  RGTVITLGTNAEEEWIDMRYEKLPEFCYGCGRIGHLAREC
                          +YEKL  FC  CG + H A EC
Subjt:  RGTVITLGTNAEEEWIDMRYEKLPEFCYGCGRIGHLAREC


Sequences
CDS sequence
ATGGAAACCCAGGTTGAAACACCAGGGAAGACACAAACCCACGAGAAAGAGCAGCCGAGCAAACAACCTGAAGAAACCTCGATAAGTAAGCAATTGGAAAAACTGAACAT
TACAGCTGAAGAAAGGGGAAACATAGTTAGGATGAAAGATGAAGATCTGAAGAAGAAAGTTCAAGACATGGAAAATATATTGGTCTGTAAGATATTAACGGAGAAAAGCA
TTAACCCGGAGATTTTTAAGTCCATGATCCCACGGATTTGGGGAATGGAAGGCAAAGTAACCATCAAAAAAGCAGGTGAAAACATGTTTGAATGCAATTTTGACTCAAAA
TATAGAAAAAGAAGGATTATGGAAGCAAGCCCATGGATTTACGACAATGCGCTTCTGGCCTTTGAGGAAATTAAAGGAACAGAGAGGTATTCAAGAATCCAATTCAGACA
CGTCCCTTTTTGGGTTCACTTCCTAGATCTACCAAGAATCTGTTTTAATAGGAAGTGGGCAGAGGATCTGGGAAATGCAGTAGGAGTTTTTGAAAGAGTGGATTTCGACA
AAGACGAATATGAAGTAGGGGAATCCATGAGAGTTAGAGTCAAAATCGACGTCCAGAAACCTTTACGAAGAGGAACTGTGATCACTCTGGGGACCAACGCCGAAGAGGAA
TGGATCGACATGAGGTACGAGAAGCTACCAGAATTCTGTTATGGATGTGGTAGAATAGGCCATCTGGCGAGGGAATGCACCGATCTAGAAGCTATGAATAAAGAAGAGCT
GGATTATGGCCCTTGGCTGAGGAAAGAAGGCACAGCAAGGGGGAAACCTCGACAGAAAAGAGGAGGAGGAGGGGGAAAACACCAACGAGAAGACAAAGATCAGAAACAGA
GATGGATAGAAAAAAAGAACGGCAACAAAGCAGAAGAAAAGGAGAATCAGGAAGCACACCGGCCGGAAAAACCACCGGAAAAGATGGAAGTTCAAAGTCCGGTGGAACCA
AGGAACACGACAAATCAAACGGTTAGTTCAAAATCCCAACGTACAGAAGACGCAATGATGGTGGACCAAACCCCGAGGAACACAGAAAATCAGATGGTGGAAAAGGAAAA
AAGGAAAGTATGTGAAAGAGGCCACCAGACCAACAAACATGTGGGCCAAGAAGGTGACAACGAAACCCAAATAAAGGGGCCCAGGTGTGAAGGCCAATTAGCAGACCTGG
GCCAAACAAGTCAATCTGGCCAACTAAAAGGAAAATTATACCCAGACATGGAAAAGAACTTAAGTACATCCACTCAGAAAATAGAGTTACCACGCAAAACCAAAGCAGAA
GATATACTTCAAACTGGAGAAGAGCAACAACTAAACACCGACCCTAAAAGGACTAGAAAGTCATGGAAAAGAAGAGCTCGAGAAGGCCTCAAGGGATACGAGAACAGTGG
ATATACTAGTAAGGTGGGAGAAAACAGAAAACATGAAAGGGAAAGTCAAGAAGAAGCCAGAGAAGGGAAGAAAGCGTGCAAGGAATACTTCGGGTCAACCGTTGGGATAT
CGGCGGAGGCTGAATTTCAGCCCCGCCGGATGCCATGA
mRNA sequence
ATGGAAACCCAGGTTGAAACACCAGGGAAGACACAAACCCACGAGAAAGAGCAGCCGAGCAAACAACCTGAAGAAACCTCGATAAGTAAGCAATTGGAAAAACTGAACAT
TACAGCTGAAGAAAGGGGAAACATAGTTAGGATGAAAGATGAAGATCTGAAGAAGAAAGTTCAAGACATGGAAAATATATTGGTCTGTAAGATATTAACGGAGAAAAGCA
TTAACCCGGAGATTTTTAAGTCCATGATCCCACGGATTTGGGGAATGGAAGGCAAAGTAACCATCAAAAAAGCAGGTGAAAACATGTTTGAATGCAATTTTGACTCAAAA
TATAGAAAAAGAAGGATTATGGAAGCAAGCCCATGGATTTACGACAATGCGCTTCTGGCCTTTGAGGAAATTAAAGGAACAGAGAGGTATTCAAGAATCCAATTCAGACA
CGTCCCTTTTTGGGTTCACTTCCTAGATCTACCAAGAATCTGTTTTAATAGGAAGTGGGCAGAGGATCTGGGAAATGCAGTAGGAGTTTTTGAAAGAGTGGATTTCGACA
AAGACGAATATGAAGTAGGGGAATCCATGAGAGTTAGAGTCAAAATCGACGTCCAGAAACCTTTACGAAGAGGAACTGTGATCACTCTGGGGACCAACGCCGAAGAGGAA
TGGATCGACATGAGGTACGAGAAGCTACCAGAATTCTGTTATGGATGTGGTAGAATAGGCCATCTGGCGAGGGAATGCACCGATCTAGAAGCTATGAATAAAGAAGAGCT
GGATTATGGCCCTTGGCTGAGGAAAGAAGGCACAGCAAGGGGGAAACCTCGACAGAAAAGAGGAGGAGGAGGGGGAAAACACCAACGAGAAGACAAAGATCAGAAACAGA
GATGGATAGAAAAAAAGAACGGCAACAAAGCAGAAGAAAAGGAGAATCAGGAAGCACACCGGCCGGAAAAACCACCGGAAAAGATGGAAGTTCAAAGTCCGGTGGAACCA
AGGAACACGACAAATCAAACGGTTAGTTCAAAATCCCAACGTACAGAAGACGCAATGATGGTGGACCAAACCCCGAGGAACACAGAAAATCAGATGGTGGAAAAGGAAAA
AAGGAAAGTATGTGAAAGAGGCCACCAGACCAACAAACATGTGGGCCAAGAAGGTGACAACGAAACCCAAATAAAGGGGCCCAGGTGTGAAGGCCAATTAGCAGACCTGG
GCCAAACAAGTCAATCTGGCCAACTAAAAGGAAAATTATACCCAGACATGGAAAAGAACTTAAGTACATCCACTCAGAAAATAGAGTTACCACGCAAAACCAAAGCAGAA
GATATACTTCAAACTGGAGAAGAGCAACAACTAAACACCGACCCTAAAAGGACTAGAAAGTCATGGAAAAGAAGAGCTCGAGAAGGCCTCAAGGGATACGAGAACAGTGG
ATATACTAGTAAGGTGGGAGAAAACAGAAAACATGAAAGGGAAAGTCAAGAAGAAGCCAGAGAAGGGAAGAAAGCGTGCAAGGAATACTTCGGGTCAACCGTTGGGATAT
CGGCGGAGGCTGAATTTCAGCCCCGCCGGATGCCATGA
Protein sequence
METQVETPGKTQTHEKEQPSKQPEETSISKQLEKLNITAEERGNIVRMKDEDLKKKVQDMENILVCKILTEKSINPEIFKSMIPRIWGMEGKVTIKKAGENMFECNFDSK
YRKRRIMEASPWIYDNALLAFEEIKGTERYSRIQFRHVPFWVHFLDLPRICFNRKWAEDLGNAVGVFERVDFDKDEYEVGESMRVRVKIDVQKPLRRGTVITLGTNAEEE
WIDMRYEKLPEFCYGCGRIGHLARECTDLEAMNKEELDYGPWLRKEGTARGKPRQKRGGGGGKHQREDKDQKQRWIEKKNGNKAEEKENQEAHRPEKPPEKMEVQSPVEP
RNTTNQTVSSKSQRTEDAMMVDQTPRNTENQMVEKEKRKVCERGHQTNKHVGQEGDNETQIKGPRCEGQLADLGQTSQSGQLKGKLYPDMEKNLSTSTQKIELPRKTKAE
DILQTGEEQQLNTDPKRTRKSWKRRAREGLKGYENSGYTSKVGENRKHERESQEEAREGKKACKEYFGSTVGISAEAEFQPRRMP
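
The coordinates and sequences in this record can be cross-checked with simple arithmetic. The sketch below assumes a single-exon gene with no untranslated regions, which seems plausible here because the CDS and mRNA listings appear identical, and it uses a residue count of 525 obtained by counting the protein sequence above. The function name `span_matches_protein` is hypothetical.

```python
def span_matches_protein(start, end, protein_length):
    """Check that an inclusive genomic span equals the coding length.

    Assumes one exon and no UTRs: the span must hold one codon per
    residue plus one stop codon.
    """
    span = end - start + 1                 # coordinates are inclusive
    return span == 3 * (protein_length + 1)


# Genome location chr9:25516933..25518510; 525 residues counted from
# the protein sequence in this record (an assumption worth re-checking).
print(span_matches_protein(25516933, 25518510, 525))
```

For genes with introns or UTRs the check would fail by design, so a mismatch on another record does not by itself indicate an annotation error.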