; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg001684 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg001684
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionCCHC-type domain-containing protein
Genome locationscaffold8:33402448..33403371
RNA-Seq ExpressionSpg001684
SyntenySpg001684
Gene Ontology termsNA
InterPro domainsIPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR040256 - Uncharacterized protein At4g02000-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF3952870.1 hypothetical protein CMV_021626 [Castanea mollissima]6.2e-3533.22Show/hide
Query:  MEEAIEDVLARFRISESECDSVTISSEIKSIDLGERKWCLIGELITNRKYNRAAFRKTMVDAWQCKS-VKFVEIEENVFMFCFADAASMLYVQKQGPWLF
        M+  +   L +F++++ E + + I++   S    E    L G L+++R  N  A + T+  AW+  S ++ VE+  NV  F F+ +  + +V+K GPW F
Subjt:  MEEAIEDVLARFRISESECDSVTISSEIKSIDLGERKWCLIGELITNRKYNRAAFRKTMVDAWQCKS-VKFVEIEENVFMFCFADAASMLYVQKQGPWLF

Query:  EECLLILTKWEPRIKSKADIPKVCDFWLQIHGLPFDCRGQEAARVVAQKVGMITDEEGEVEQRGVQQRKFIRFKVELDIMKPLARGLIVAN-EGVKKWVW
        E  LL+L +W   + SK        FW+QI GLPF+   +E  + +  K+G + + +    Q G  Q KF+R +VE+ I KPL RG  V N EG + WV 
Subjt:  EECLLILTKWEPRIKSKADIPKVCDFWLQIHGLPFDCRGQEAARVVAQKVGMITDEEGEVEQRGVQQRKFIRFKVELDIMKPLARGLIVAN-EGVKKWVW

Query:  LKYERLPKFCSACGMIGHSVHWCTSMKQVGPQTGQSQVMFGEWLRAGPLLSRQKEDRSGPWRRGNQDSEKTTVDQVPESETNCPDISGKNLGEGGSSKPP
         +YERLP FC  CG++GH    C     + P    ++  +GEWL+AG           G ++ G    EK  V + P +E      S  +L EGG +   
Subjt:  LKYERLPKFCSACGMIGHSVHWCTSMKQVGPQTGQSQVMFGEWLRAGPLLSRQKEDRSGPWRRGNQDSEKTTVDQVPESETNCPDISGKNLGEGGSSKPP

Query:  TEVGGLR
         EVG +R
Subjt:  TEVGGLR

PQQ04601.1 uncharacterized protein Pyn_40319 [Prunus yedoensis var. nudiflora]1.0e-3736Show/hide
Query:  TISSEIKSIDLGERKWCLIGELITNRKYNRAAFRKTMVDAWQCKS-VKFVEIEENVFMFCFADAASMLYVQKQGPWLFEECLLILTKWEPRIKSKADIPK
        T+ + +K  D+   + CL+G+L+T+R +NR+A  +T+  +W+ +  V+F+E+EENVF   F     +  V + GPWLF++ + +L +W   ++  +    
Subjt:  TISSEIKSIDLGERKWCLIGELITNRKYNRAAFRKTMVDAWQCKS-VKFVEIEENVFMFCFADAASMLYVQKQGPWLFEECLLILTKWEPRIKSKADIPK

Query:  VCDFWLQIHGLPFDCRGQEAARVVAQKVGMITDEEGEVEQRGVQQRKFIRFKVELDIMKPLARGLIVANEGVKKWVWLKYERLPKFCSACGMIGHSVHWC
         C FW+Q H LP DCRG++   ++  ++G ++  +   EQ G  ++++IR +V +D  KPLARG+++ + G K W+  KYERLP FC+ CG++GH    C
Subjt:  VCDFWLQIHGLPFDCRGQEAARVVAQKVGMITDEEGEVEQRGVQQRKFIRFKVELDIMKPLARGLIVANEGVKKWVWLKYERLPKFCSACGMIGHSVHWC

PQQ05662.1 uncharacterized protein Pyn_36355 [Prunus yedoensis var. nudiflora]1.7e-4034.68Show/hide
Query:  MEEAIEDVLARFRISESECDSVTISSEIKSIDLGERKWCLIGELITNRKYNRAAFRKTMVDAWQCKS-VKFVEIEENVFMFCFADAASMLYVQKQGPWLF
        M+  +  +L R ++ + E +++ I  E+K  D+   + CL+G+L+T+R +NR+A  +T+  +W+ +  V+F+E+EENVF   F     +  V + GPWLF
Subjt:  MEEAIEDVLARFRISESECDSVTISSEIKSIDLGERKWCLIGELITNRKYNRAAFRKTMVDAWQCKS-VKFVEIEENVFMFCFADAASMLYVQKQGPWLF

Query:  EECLLILTKWEPRIKSKADIPKVCDFWLQIHGLPFDCRGQEAARVVAQKVGMITDEEGEVEQRGVQQRKFIRFKVELDIMKPLARGLIVANEGVKKWVWL
        ++ + +L +W   ++  +     C FW+Q H LP DCRG++   ++  ++G ++  +   EQ G  ++++IR +V +D  KPLARG+++ + G K W+  
Subjt:  EECLLILTKWEPRIKSKADIPKVCDFWLQIHGLPFDCRGQEAARVVAQKVGMITDEEGEVEQRGVQQRKFIRFKVELDIMKPLARGLIVANEGVKKWVWL

Query:  KYERLPKFCSACGMIGHSVHWC
        KYERLP FC+ CG++GH    C
Subjt:  KYERLPKFCSACGMIGHSVHWC

XP_023874702.1 uncharacterized protein LOC111987231 [Quercus suber]2.1e-3535.32Show/hide
Query:  MEEAIEDVLARFRISESECDSVTISSEIKSIDLGERKWCLIGELITNRKYNRAAFRKTMVDAWQCKS-VKFVEIEENVFMFCFADAASMLYVQKQGPWLF
        M++ +   L + ++++ E + + I++   +  L E    L G L++NR  N  A + T+  AW+  S ++ VE+  NV  F F     + +V+K GPW F
Subjt:  MEEAIEDVLARFRISESECDSVTISSEIKSIDLGERKWCLIGELITNRKYNRAAFRKTMVDAWQCKS-VKFVEIEENVFMFCFADAASMLYVQKQGPWLF

Query:  EECLLILTKWEPRIKSKADIPKVCDFWLQIHGLPFDCRGQEAARVVAQKVGMITDEEGEVEQRGVQ--QRKFIRFKVELDIMKPLARGLIVAN-EGVKKW
        E  LL+L +W   + SK        FW+QI GLPF+   ++  + + +K+G +     EV++R +Q  Q KF+R +VE+ I KPL RG  V N EG + W
Subjt:  EECLLILTKWEPRIKSKADIPKVCDFWLQIHGLPFDCRGQEAARVVAQKVGMITDEEGEVEQRGVQ--QRKFIRFKVELDIMKPLARGLIVAN-EGVKKW

Query:  VWLKYERLPKFCSACGMIGHSVHWCTSMKQVGPQTGQSQVMFGEWLRAGPLL
        V  +YERLP FC  CG++GH    C     V P   Q    +GEWL+AG ++
Subjt:  VWLKYERLPKFCSACGMIGHSVHWCTSMKQVGPQTGQSQVMFGEWLRAGPLL

XP_030936538.1 uncharacterized protein LOC115961754 [Quercus lobata]3.6e-3533.07Show/hide
Query:  MEEAIEDVLARFRISESECDSVTISSEIKSIDLGERKWCLIGELITNRKYNRAAFRKTMVDAWQCKS-VKFVEIEENVFMFCFADAASMLYVQKQGPWLF
        ME+ + D L + +++  + D + I++  +S  + E    L G L+TNR+ N+ AF+ T+  AW+  S +K VE+  N+  F F     + +V+K GPW F
Subjt:  MEEAIEDVLARFRISESECDSVTISSEIKSIDLGERKWCLIGELITNRKYNRAAFRKTMVDAWQCKS-VKFVEIEENVFMFCFADAASMLYVQKQGPWLF

Query:  EECLLILTKWEPRIKSKADIPKVCDFWLQIHGLPFDCRGQEAARVVAQKVGMITDEEGEVEQRG--VQQRKFIRFKVELDIMKPLARGLIVAN-EGVKKW
        +  LL+L++W   + +         FW+Q+ GLPF+   ++A + +  K+G +     EV++R   V+Q KF+R +VE+ I KPL +G  + N +G + W
Subjt:  EECLLILTKWEPRIKSKADIPKVCDFWLQIHGLPFDCRGQEAARVVAQKVGMITDEEGEVEQRG--VQQRKFIRFKVELDIMKPLARGLIVAN-EGVKKW

Query:  VWLKYERLPKFCSACGMIGHSVHWCTSMKQVGPQTGQSQVMFGEWLRAGPLLSRQKE
        +  +YERLP FC  CG++GH    C    Q+    G  +  + EWL+AG ++    E
Subjt:  VWLKYERLPKFCSACGMIGHSVHWCTSMKQVGPQTGQSQVMFGEWLRAGPLLSRQKE

TrEMBL top hitse value%identityAlignment
A0A314YDX0 CCHC-type domain-containing protein8.2e-4134.68Show/hide
Query:  MEEAIEDVLARFRISESECDSVTISSEIKSIDLGERKWCLIGELITNRKYNRAAFRKTMVDAWQCKS-VKFVEIEENVFMFCFADAASMLYVQKQGPWLF
        M+  +  +L R ++ + E +++ I  E+K  D+   + CL+G+L+T+R +NR+A  +T+  +W+ +  V+F+E+EENVF   F     +  V + GPWLF
Subjt:  MEEAIEDVLARFRISESECDSVTISSEIKSIDLGERKWCLIGELITNRKYNRAAFRKTMVDAWQCKS-VKFVEIEENVFMFCFADAASMLYVQKQGPWLF

Query:  EECLLILTKWEPRIKSKADIPKVCDFWLQIHGLPFDCRGQEAARVVAQKVGMITDEEGEVEQRGVQQRKFIRFKVELDIMKPLARGLIVANEGVKKWVWL
        ++ + +L +W   ++  +     C FW+Q H LP DCRG++   ++  ++G ++  +   EQ G  ++++IR +V +D  KPLARG+++ + G K W+  
Subjt:  EECLLILTKWEPRIKSKADIPKVCDFWLQIHGLPFDCRGQEAARVVAQKVGMITDEEGEVEQRGVQQRKFIRFKVELDIMKPLARGLIVANEGVKKWVWL

Query:  KYERLPKFCSACGMIGHSVHWC
        KYERLP FC+ CG++GH    C
Subjt:  KYERLPKFCSACGMIGHSVHWC

A0A314YHJ0 CCHC-type domain-containing protein5.0e-3836Show/hide
Query:  TISSEIKSIDLGERKWCLIGELITNRKYNRAAFRKTMVDAWQCKS-VKFVEIEENVFMFCFADAASMLYVQKQGPWLFEECLLILTKWEPRIKSKADIPK
        T+ + +K  D+   + CL+G+L+T+R +NR+A  +T+  +W+ +  V+F+E+EENVF   F     +  V + GPWLF++ + +L +W   ++  +    
Subjt:  TISSEIKSIDLGERKWCLIGELITNRKYNRAAFRKTMVDAWQCKS-VKFVEIEENVFMFCFADAASMLYVQKQGPWLFEECLLILTKWEPRIKSKADIPK

Query:  VCDFWLQIHGLPFDCRGQEAARVVAQKVGMITDEEGEVEQRGVQQRKFIRFKVELDIMKPLARGLIVANEGVKKWVWLKYERLPKFCSACGMIGHSVHWC
         C FW+Q H LP DCRG++   ++  ++G ++  +   EQ G  ++++IR +V +D  KPLARG+++ + G K W+  KYERLP FC+ CG++GH    C
Subjt:  VCDFWLQIHGLPFDCRGQEAARVVAQKVGMITDEEGEVEQRGVQQRKFIRFKVELDIMKPLARGLIVANEGVKKWVWLKYERLPKFCSACGMIGHSVHWC

A0A7N2N3A8 CCHC-type domain-containing protein4.6e-3636.55Show/hide
Query:  MEEAIEDVLARFRISESECDSVTISSEIKSIDLGERKWCLIGELITNRKYNRAAFRKTMVDAWQCKS-VKFVEIEENVFMFCFADAASMLYVQKQGPWLF
        ME+ I D L   ++++ E + + ISS      L E    L G+L+ +R  N+ A + T+   W+  S ++ VE+  N+  F F+    + +V++ GPW F
Subjt:  MEEAIEDVLARFRISESECDSVTISSEIKSIDLGERKWCLIGELITNRKYNRAAFRKTMVDAWQCKS-VKFVEIEENVFMFCFADAASMLYVQKQGPWLF

Query:  EECLLILTKWEPRIKSKADIPKVCDFWLQIHGLPFDCRGQEAARVVAQKVGMITDEEGEVEQRGVQ--QRKFIRFKVELDIMKPLARGLIVAN-EGVKKW
        E  LL+L +W   + S   +     FW+QI GLPF+   +EA R +  K+G +     EV++R  Q  Q KF+R +++L I KPL RG  V N +G + W
Subjt:  EECLLILTKWEPRIKSKADIPKVCDFWLQIHGLPFDCRGQEAARVVAQKVGMITDEEGEVEQRGVQ--QRKFIRFKVELDIMKPLARGLIVAN-EGVKKW

Query:  VWLKYERLPKFCSACGMIGHSVHWCTSMKQVGPQTGQSQVMFGEWLRAG
        V  KYERLP FC +CG IGH    C+ + +  P   Q    +G+WLRAG
Subjt:  VWLKYERLPKFCSACGMIGHSVHWCTSMKQVGPQTGQSQVMFGEWLRAG

A0A7N2RAY6 Uncharacterized protein1.8e-3533.07Show/hide
Query:  MEEAIEDVLARFRISESECDSVTISSEIKSIDLGERKWCLIGELITNRKYNRAAFRKTMVDAWQCKS-VKFVEIEENVFMFCFADAASMLYVQKQGPWLF
        ME+ + D L + +++  + D + I++  +S  + E    L G L+TNR+ N+ AF+ T+  AW+  S +K VE+  N+  F F     + +V+K GPW F
Subjt:  MEEAIEDVLARFRISESECDSVTISSEIKSIDLGERKWCLIGELITNRKYNRAAFRKTMVDAWQCKS-VKFVEIEENVFMFCFADAASMLYVQKQGPWLF

Query:  EECLLILTKWEPRIKSKADIPKVCDFWLQIHGLPFDCRGQEAARVVAQKVGMITDEEGEVEQRG--VQQRKFIRFKVELDIMKPLARGLIVAN-EGVKKW
        +  LL+L++W   + +         FW+Q+ GLPF+   ++A + +  K+G +     EV++R   V+Q KF+R +VE+ I KPL +G  + N +G + W
Subjt:  EECLLILTKWEPRIKSKADIPKVCDFWLQIHGLPFDCRGQEAARVVAQKVGMITDEEGEVEQRG--VQQRKFIRFKVELDIMKPLARGLIVAN-EGVKKW

Query:  VWLKYERLPKFCSACGMIGHSVHWCTSMKQVGPQTGQSQVMFGEWLRAGPLLSRQKE
        +  +YERLP FC  CG++GH    C    Q+    G  +  + EWL+AG ++    E
Subjt:  VWLKYERLPKFCSACGMIGHSVHWCTSMKQVGPQTGQSQVMFGEWLRAGPLLSRQKE

A0A7N2RFJ0 CCHC-type domain-containing protein4.6e-3636.55Show/hide
Query:  MEEAIEDVLARFRISESECDSVTISSEIKSIDLGERKWCLIGELITNRKYNRAAFRKTMVDAWQCKS-VKFVEIEENVFMFCFADAASMLYVQKQGPWLF
        ME+ I D L   ++++ E + + ISS      L E    L G+L+ +R  N+ A + T+   W+  S ++ VE+  N+  F F+    + +V++ GPW F
Subjt:  MEEAIEDVLARFRISESECDSVTISSEIKSIDLGERKWCLIGELITNRKYNRAAFRKTMVDAWQCKS-VKFVEIEENVFMFCFADAASMLYVQKQGPWLF

Query:  EECLLILTKWEPRIKSKADIPKVCDFWLQIHGLPFDCRGQEAARVVAQKVGMITDEEGEVEQRGVQ--QRKFIRFKVELDIMKPLARGLIVAN-EGVKKW
        E  LL+L +W   + S   +     FW+QI GLPF+   +EA R +  K+G +     EV++R  Q  Q KF+R +++L I KPL RG  V N +G + W
Subjt:  EECLLILTKWEPRIKSKADIPKVCDFWLQIHGLPFDCRGQEAARVVAQKVGMITDEEGEVEQRGVQ--QRKFIRFKVELDIMKPLARGLIVAN-EGVKKW

Query:  VWLKYERLPKFCSACGMIGHSVHWCTSMKQVGPQTGQSQVMFGEWLRAG
        V  KYERLP FC +CG IGH    C+ + +  P   Q    +G+WLRAG
Subjt:  VWLKYERLPKFCSACGMIGHSVHWCTSMKQVGPQTGQSQVMFGEWLRAG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G02103.1 unknown protein1.0e-1126Show/hide
Query:  IEENVFMFCFADAASMLYVQKQGPWLFEECLLILTKWEPRIKSKADIPKVCDFWLQIHGLPFDCRGQEAARVVAQKVGMITDEEGEVEQRGVQQRKFIRF
        ++     F FA+   +L VQ++ PWLF    +  T+W+  +    ++    D W+QI G+P     +E    +A  +G I     +  +    Q  FIR 
Subjt:  IEENVFMFCFADAASMLYVQKQGPWLFEECLLILTKWEPRIKSKADIPKVCDFWLQIHGLPFDCRGQEAARVVAQKVGMITDEEGEVEQRGVQQRKFIRF

Query:  KVELDIMKPLA-RGLIVANEGVKKWVWLKYERLPKFCSACGMIGHSVHWC
        +V   I   L     ++ + G    +  +YERL + CS+C    H+  +C
Subjt:  KVELDIMKPLA-RGLIVANEGVKKWVWLKYERLPKFCSACGMIGHSVHWC

AT2G17920.1 nucleic acid binding;zinc ion binding2.4e-1324.28Show/hide
Query:  ECDSVTISSEIKSIDLGERKWCLIGELITNRKYNRAAFRKTMVDAWQCKS-VKFVEIEENVFMFCFADAASMLYVQKQGPWLFEECLLILTKWEPRIKSK
        E  S+ I +E   +  G  +  +I   +  R  N  A    +  AW   + V    I++    F F     +L VQ++ PWLF    +   +W+P     
Subjt:  ECDSVTISSEIKSIDLGERKWCLIGELITNRKYNRAAFRKTMVDAWQCKS-VKFVEIEENVFMFCFADAASMLYVQKQGPWLFEECLLILTKWEPRIKSK

Query:  ADIPKVCDFWLQIHGLPFDCRGQEAARVVAQKVGMITDEEGEVEQRGVQQRKFIRFKVELDIMKPLA-RGLIVANEGVKKWVWLKYERLPKFCSACGMIG
         +     D W+Q+ G+PF    +E A  +AQ++G I     +       Q  +IR +V + I   L     I    G    +  +YERL + CS C    
Subjt:  ADIPKVCDFWLQIHGLPFDCRGQEAARVVAQKVGMITDEEGEVEQRGVQQRKFIRFKVELDIMKPLA-RGLIVANEGVKKWVWLKYERLPKFCSACGMIG

Query:  HSVHWCTSMKQVGPQTGQSQVMFGEWLRAGPLLSRQKEDRSGP
        H+ ++C    +V     +   +    LR+      Q  + S P
Subjt:  HSVHWCTSMKQVGPQTGQSQVMFGEWLRAGPLLSRQKEDRSGP

AT3G31430.1 unknown protein4.3e-1829Show/hide
Query:  ISSEIKSIDLGERKWCLIGELITNRKYNRAAFRKTMVDAW-QCKSVKFVEIEENVFMFCFADAASMLYVQKQGPWLFEECLLILTKWEPRIKSKADIPKV
        +  +I +  + E ++ L G  +  R+ N  +   +M   W Q   V    +E   F F F    S+  V ++GPW F + +++L +WEP+I     IP  
Subjt:  ISSEIKSIDLGERKWCLIGELITNRKYNRAAFRKTMVDAW-QCKSVKFVEIEENVFMFCFADAASMLYVQKQGPWLFEECLLILTKWEPRIKSKADIPKV

Query:  CDFWLQIHGLPFDCRGQEAARVVAQKVGMITDEEGEVEQRGVQQRKFIRFKVELDIMKPLA-RGLIVANEGVKKWVWLKYERLPKFCSACGMIGHSVHWC
          FW+QI G+PF    +     + + +G + D +  VE   V +  F R  +  DI  PL  +       GV   +  +YERL  FC  CGM+ H    C
Subjt:  CDFWLQIHGLPFDCRGQEAARVVAQKVGMITDEEGEVEQRGVQQRKFIRFKVELDIMKPLA-RGLIVANEGVKKWVWLKYERLPKFCSACGMIGHSVHWC

AT5G18636.1 unknown protein4.6e-1226.67Show/hide
Query:  IEENVFMFCFADAASMLYVQKQGPWLFEECLLILTKWEPRIKSKADIPKVCDFWLQIHGLPFDCRGQEAARVVAQKVGMITDEEGEVEQRGVQQRKFIRF
        ++     F FA+   ++ VQ++ PWLF    +  T+W+  +    ++    D W+QI G+P     +E    +AQ +G I     +  +    Q  FIR 
Subjt:  IEENVFMFCFADAASMLYVQKQGPWLFEECLLILTKWEPRIKSKADIPKVCDFWLQIHGLPFDCRGQEAARVVAQKVGMITDEEGEVEQRGVQQRKFIRF

Query:  KVELDIMKPLA-RGLIVANEGVKKWVWLKYERLPKFCSACGMIGHSVHWC
        +V   I   L     I+ + G    +  +YERL + CS+C    H+  +C
Subjt:  KVELDIMKPLA-RGLIVANEGVKKWVWLKYERLPKFCSACGMIGHSVHWC

AT5G25200.1 unknown protein4.6e-1226.67Show/hide
Query:  IEENVFMFCFADAASMLYVQKQGPWLFEECLLILTKWEPRIKSKADIPKVCDFWLQIHGLPFDCRGQEAARVVAQKVGMITDEEGEVEQRGVQQRKFIRF
        ++     F FA+   ++ VQ++ PWLF    +  T+W+  +    ++    D W+QI G+P     +E    +AQ +G I     +  +    Q  FIR 
Subjt:  IEENVFMFCFADAASMLYVQKQGPWLFEECLLILTKWEPRIKSKADIPKVCDFWLQIHGLPFDCRGQEAARVVAQKVGMITDEEGEVEQRGVQQRKFIRF

Query:  KVELDIMKPLA-RGLIVANEGVKKWVWLKYERLPKFCSACGMIGHSVHWC
        +V   I   L     I+ + G    +  +YERL + CS+C    H+  +C
Subjt:  KVELDIMKPLA-RGLIVANEGVKKWVWLKYERLPKFCSACGMIGHSVHWC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGAAGCTATTGAGGATGTCCTGGCTAGGTTCAGGATATCAGAGAGTGAGTGTGACTCAGTGACGATTTCGAGTGAGATCAAATCTATTGATCTCGGGGAGAGGAA
ATGGTGCCTGATCGGCGAACTGATAACTAATCGGAAGTATAATAGAGCTGCCTTCCGGAAAACGATGGTAGATGCGTGGCAATGCAAATCGGTAAAGTTCGTGGAGATTG
AGGAAAATGTATTCATGTTCTGCTTTGCAGATGCTGCATCGATGCTGTATGTGCAAAAGCAAGGACCATGGCTCTTTGAGGAATGTCTGCTGATCCTTACCAAATGGGAA
CCGAGAATTAAATCAAAGGCGGACATTCCAAAGGTATGTGATTTCTGGTTACAAATTCACGGGCTCCCTTTTGATTGTAGAGGCCAGGAAGCGGCAAGAGTTGTGGCACA
GAAAGTAGGGATGATAACTGATGAGGAAGGGGAGGTGGAACAGCGGGGTGTGCAGCAACGCAAGTTCATCCGATTCAAAGTTGAGCTTGATATTATGAAACCTTTAGCGA
GGGGTTTAATTGTGGCTAATGAAGGAGTGAAAAAATGGGTGTGGCTCAAATATGAACGCCTGCCCAAATTTTGTTCAGCCTGTGGAATGATCGGACACTCCGTTCACTGG
TGCACTTCTATGAAGCAAGTTGGGCCTCAAACTGGTCAATCGCAAGTTATGTTTGGTGAATGGTTGCGGGCGGGGCCACTGCTGTCAAGGCAGAAGGAGGATCGTTCGGG
GCCTTGGCGACGGGGAAACCAGGACTCAGAGAAGACGACGGTGGACCAGGTGCCGGAATCGGAAACCAATTGTCCAGATATATCGGGCAAGAATCTCGGGGAGGGTGGTT
CCTCTAAACCACCCACGGAAGTAGGGGGATTGAGACTCAAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGGAAGCTATTGAGGATGTCCTGGCTAGGTTCAGGATATCAGAGAGTGAGTGTGACTCAGTGACGATTTCGAGTGAGATCAAATCTATTGATCTCGGGGAGAGGAA
ATGGTGCCTGATCGGCGAACTGATAACTAATCGGAAGTATAATAGAGCTGCCTTCCGGAAAACGATGGTAGATGCGTGGCAATGCAAATCGGTAAAGTTCGTGGAGATTG
AGGAAAATGTATTCATGTTCTGCTTTGCAGATGCTGCATCGATGCTGTATGTGCAAAAGCAAGGACCATGGCTCTTTGAGGAATGTCTGCTGATCCTTACCAAATGGGAA
CCGAGAATTAAATCAAAGGCGGACATTCCAAAGGTATGTGATTTCTGGTTACAAATTCACGGGCTCCCTTTTGATTGTAGAGGCCAGGAAGCGGCAAGAGTTGTGGCACA
GAAAGTAGGGATGATAACTGATGAGGAAGGGGAGGTGGAACAGCGGGGTGTGCAGCAACGCAAGTTCATCCGATTCAAAGTTGAGCTTGATATTATGAAACCTTTAGCGA
GGGGTTTAATTGTGGCTAATGAAGGAGTGAAAAAATGGGTGTGGCTCAAATATGAACGCCTGCCCAAATTTTGTTCAGCCTGTGGAATGATCGGACACTCCGTTCACTGG
TGCACTTCTATGAAGCAAGTTGGGCCTCAAACTGGTCAATCGCAAGTTATGTTTGGTGAATGGTTGCGGGCGGGGCCACTGCTGTCAAGGCAGAAGGAGGATCGTTCGGG
GCCTTGGCGACGGGGAAACCAGGACTCAGAGAAGACGACGGTGGACCAGGTGCCGGAATCGGAAACCAATTGTCCAGATATATCGGGCAAGAATCTCGGGGAGGGTGGTT
CCTCTAAACCACCCACGGAAGTAGGGGGATTGAGACTCAAATGA
Protein sequenceShow/hide protein sequence
MEEAIEDVLARFRISESECDSVTISSEIKSIDLGERKWCLIGELITNRKYNRAAFRKTMVDAWQCKSVKFVEIEENVFMFCFADAASMLYVQKQGPWLFEECLLILTKWE
PRIKSKADIPKVCDFWLQIHGLPFDCRGQEAARVVAQKVGMITDEEGEVEQRGVQQRKFIRFKVELDIMKPLARGLIVANEGVKKWVWLKYERLPKFCSACGMIGHSVHW
CTSMKQVGPQTGQSQVMFGEWLRAGPLLSRQKEDRSGPWRRGNQDSEKTTVDQVPESETNCPDISGKNLGEGGSSKPPTEVGGLRLK