CuGenDBv2

MS028629 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS028629
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: CCHC-type domain-containing protein
Genome location: scaffold101:85986..87317
RNA-Seq Expression: MS028629
Synteny: MS028629
Gene Ontology terms: NA
InterPro domains:
  IPR025558 - Domain of unknown function DUF4283
  IPR025836 - Zinc knuckle CX2CX4HX4C
  IPR040256 - Uncharacterized protein At4g02000-like
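
The genome location above is given as scaffold:start..end. As a minimal sketch, the span length can be derived from it as follows. The helper name `parse_location` is hypothetical, and the 1-based inclusive coordinate convention is an assumption rather than something this record states.

```python
import re

def parse_location(loc):
    """Split 'scaffold:start..end' into (scaffold, start, end)."""
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognized location: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))

scaffold, start, end = parse_location("scaffold101:85986..87317")

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(scaffold, start, end, span)
```

Under that assumption the gene spans 1332 bases on scaffold101.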


Homology
GenBank top hits (columns: e value, % identity, alignment)
XP_022132681.1 uncharacterized protein LOC111005481 [Momordica charantia] | e value: 2.0e-63 | % identity: 49
Query:  MNADGLVEEWENLSLTPEEEETLVDVDGSAAIDVERRLETSLIGKIFVKRIIPRDVLIKELRLVWHVE-EGFDVEVLGKNIFLFSFDKAAVCNRILRGAP
        M A  L+EEW+N  LT EE++  VD+D SA     + LE SLI K+  KR I   VL   L++ W ++ + F V+++G NIFLF+F++++  NRILR  P
Subjt:  MNADGLVEEWENLSLTPEEEETLVDVDGSAAIDVERRLETSLIGKIFVKRIIPRDVLIKELRLVWHVE-EGFDVEVLGKNIFLFSFDKAAVCNRILRGAP

Query:  WAFDRVLIVLQKPVANLKLSEVHFNKASFWVHFCDVPLVCLNASMAQRLGNAIGKFEYVDCSDDKFFWGASLRVRVSIDINKPLRRGVKVNLEGPIGGCW
        W FDR LI++  PV+  K  ++ F   S WVHF D+ L C+N +MA RLGNAIG FE V+ + + F WG+ LRVRV  D+ KPL RG+K+NL+GP+GGCW
Subjt:  WAFDRVLIVLQKPVANLKLSEVHFNKASFWVHFCDVPLVCLNASMAQRLGNAIGKFEYVDCSDDKFFWGASLRVRVSIDINKPLRRGVKVNLEGPIGGCW

Query:  TPIRYERLPDFCSYCGMIGHTKGECEMEHIQGGAWQVQKN-QYGPWLKFQG
         PI+YERLPDF  +CG + H   +C    +      V KN QYGPWL+FQG
Subjt:  TPIRYERLPDFCSYCGMIGHTKGECEMEHIQGGAWQVQKN-QYGPWLKFQG

XP_022156185.1 uncharacterized protein LOC111023135 [Momordica charantia] | e value: 2.4e-72 | % identity: 44.59
Query:  MNADGLVEEWENLSLTPEEEETLVDVDGSAAIDVERRLETSLIGKIFVKRIIPRDVLIKELRLVWHVEEGFDVEVLGKNIFLFSFDKAAVCNRILRGAPW
        M+ + L+ +W+   LT EE+E  +DVD  A    E+ L  SL+GK+  KRII  DVL + L L W VE    VE +GKN+FLF F +    NR+++  PW
Subjt:  MNADGLVEEWENLSLTPEEEETLVDVDGSAAIDVERRLETSLIGKIFVKRIIPRDVLIKELRLVWHVEEGFDVEVLGKNIFLFSFDKAAVCNRILRGAPW

Query:  AFDRVLIVLQKPVANLKLSEVHFNKASFWVHFCDVPLVCLNASMAQRLGNAIGKFEYVDCSDDKFFWGASLRVRVSIDINKPLRRGVKVNLEGPIGGCWT
         FD+ LIVLQKP ++  +SE+ FN+ +FW+H  D+P+  LN +MA RLGNAIG F  VDC++  F WGASLR+RV IDI KPLRRG+K+N++GP+GGCW 
Subjt:  AFDRVLIVLQKPVANLKLSEVHFNKASFWVHFCDVPLVCLNASMAQRLGNAIGKFEYVDCSDDKFFWGASLRVRVSIDINKPLRRGVKVNLEGPIGGCWT

Query:  PIRYERLPDFCSYCGMIGHTKGECEMEHIQGGAWQVQKNQYGPWLKFQGRGRGGEKERVQGDVYRGSKKGGEAESSRDRSLGERQREGSSDRVYSPTERV
        PI+YERLPDFC +CG+IGH+  +C+  ++         ++YGPWL+F G   G +K R      R    G  + +S++R + E +++ S        +  
Subjt:  PIRYERLPDFCSYCGMIGHTKGECEMEHIQGGAWQVQKNQYGPWLKFQGRGRGGEKERVQGDVYRGSKKGGEAESSRDRSLGERQREGSSDRVYSPTERV

Query:  SISEQ
        + +EQ
Subjt:  SISEQ

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia] | e value: 2.3e-62 | % identity: 46.4
Query:  LVEEWENLSLTPEEEETLVDVDGSAAIDVERRLETSLIGKIFVKRIIPRDVLIKELRLVWHVE-EGFDVEVLGKNIFLFSFDKAAVCNRILRGAPWAFDR
        L+EEW+N  LT EEEET +DVD SA      RLE  L+GK+F+KR I   V+   +R  W +E   F+V+ LG N+FLFSF +A   N+I +  PW FDR
Subjt:  LVEEWENLSLTPEEEETLVDVDGSAAIDVERRLETSLIGKIFVKRIIPRDVLIKELRLVWHVE-EGFDVEVLGKNIFLFSFDKAAVCNRILRGAPWAFDR

Query:  VLIVLQKPVANLKLSEVHFNKASFWVHFCDVPLVCLNASMAQRLGNAIGKFEYVDCSDDKFFWGASLRVRVSIDINKPLRRGVKVNLEGPIGGCWTPIRY
         L+++ KPVA +  SE+ F K   WV F D+PL C+   MA RLGNA+G FE  DC D    WG++LRVRV +DI+KPLRRG+K+NL+GPIGG W PI+Y
Subjt:  VLIVLQKPVANLKLSEVHFNKASFWVHFCDVPLVCLNASMAQRLGNAIGKFEYVDCSDDKFFWGASLRVRVSIDINKPLRRGVKVNLEGPIGGCWTPIRY

Query:  ERLPDFCSYCGMIGHTKGECEMEHIQGGAWQVQKNQYGPWLKFQGRGRGGEKERVQGDVYRGSKKGGEAESSRDRSLG
        ERLPDFC +CG+    K                K+QYG WL++QG  +    +  Q       K G  + SS    +G
Subjt:  ERLPDFCSYCGMIGHTKGECEMEHIQGGAWQVQKNQYGPWLKFQGRGRGGEKERVQGDVYRGSKKGGEAESSRDRSLG

XP_028122006.1 uncharacterized protein LOC114319195 [Camellia sinensis] | e value: 1.7e-41 | % identity: 29.44
Query:  ADGLVEEWENLSLTPEEEETLVDVDGSAAIDVERRLETSLIGKIFVKRIIPRDVLIKELRLVWHVEEGFDVEVLGKNIFLFSFDKAAVCNRILRGAPWAF
        AD L++    LSLT  EE+ +V + G +   +  + +  L+GK+  +R    + +   L  VW   +G  V V+G N+F+F F       R+L   PW F
Subjt:  ADGLVEEWENLSLTPEEEETLVDVDGSAAIDVERRLETSLIGKIFVKRIIPRDVLIKELRLVWHVEEGFDVEVLGKNIFLFSFDKAAVCNRILRGAPWAF

Query:  DRVLIVLQKPVANLKLSEVHFNKASFWVHFCDVPLVCLNASMAQRLGNAIGKFEYVDCSDDKFFWGASLRVRVSIDINKPLRRGVKVNLEG--PIGGCWT
        D+ L++L +   N++ S++   +  FWVH C++PLV +N  + + +GNA+G+F  +D  D    WG ++R+RV++D+ KPLRRG+K+ L    PI   W 
Subjt:  DRVLIVLQKPVANLKLSEVHFNKASFWVHFCDVPLVCLNASMAQRLGNAIGKFEYVDCSDDKFFWGASLRVRVSIDINKPLRRGVKVNLEG--PIGGCWT

Query:  PIRYERLPDFCSYCGMIGHTKGECEMEHIQGGAWQVQKNQYGPWL-----KFQGRGRGGEKERVQGDVYRGSKKGGEAESSRDRSLGERQREGSSDRVYS
          +YERLP +C +CG +GH++ EC+ +       +V   QYG WL     K +G  R G  ++  G    G+  GG+    R       Q EG+  R  S
Subjt:  PIRYERLPDFCSYCGMIGHTKGECEMEHIQGGAWQVQKNQYGPWL-----KFQGRGRGGEKERVQGDVYRGSKKGGEAESSRDRSLGERQREGSSDRVYS

Query:  PTERVSISEQVETSECLPEGHKVVECVDLQEVDVAAGRHSEIGGRGSEQ---ELRFIDGVLMDRVEDNGI------QQMGKEGMERKMVCA---------
        PT RV +            G  +   ++  +   A    +  G   S        ++ G   + +E+ G+       ++G +G +  ++ +         
Subjt:  PTERVSISEQVETSECLPEGHKVVECVDLQEVDVAAGRHSEIGGRGSEQ---ELRFIDGVLMDRVEDNGI------QQMGKEGMERKMVCA---------

Query:  LPVLGSKRE--SLAGVVTNVRGWKRKAR
         P+L  + E  SL+G V   + WKR AR
Subjt:  LPVLGSKRE--SLAGVVTNVRGWKRKAR

XP_028124075.1 uncharacterized protein LOC114321128 [Camellia sinensis] | e value: 7.6e-42 | % identity: 35.1
Query:  ADGLVEEWENLSLTPEEEETLVDVDGSAAIDVERRLETSLIGKIFVKRIIPRDVLIKELRLVWHVEEGFDVEVLGKNIFLFSFDKAAVCNRILRGAPWAF
        AD LV+    LSLT  EE+ +V + G +   +  + +  L+GK+  +R    + +   L  VW   +G  V V+G N+F+F F       R+L   PW F
Subjt:  ADGLVEEWENLSLTPEEEETLVDVDGSAAIDVERRLETSLIGKIFVKRIIPRDVLIKELRLVWHVEEGFDVEVLGKNIFLFSFDKAAVCNRILRGAPWAF

Query:  DRVLIVLQKPVANLKLSEVHFNKASFWVHFCDVPLVCLNASMAQRLGNAIGKFEYVDCSDDKFFWGASLRVRVSIDINKPLRRGVKVNLEG--PIGGCWT
        D+ L++L +   N++ S++      FWVH C++PLV +N  + Q +GNA+G+F  +D  D    WG ++R+RV+ID+ KPLRRG+K+ L    PI   W 
Subjt:  DRVLIVLQKPVANLKLSEVHFNKASFWVHFCDVPLVCLNASMAQRLGNAIGKFEYVDCSDDKFFWGASLRVRVSIDINKPLRRGVKVNLEG--PIGGCWT

Query:  PIRYERLPDFCSYCGMIGHTKGECEMEHIQGGAWQVQKNQYGPWL-----KFQGRGRGGEKERVQGDVYRGSKKGGEAESSRDRSLGERQREGSSDRVYS
          +YERLP +C +CG +GH+  EC+ +       +V   QYG WL     K +G  R G  ++  G    G+  GG+    R       Q EG+  R  S
Subjt:  PIRYERLPDFCSYCGMIGHTKGECEMEHIQGGAWQVQKNQYGPWL-----KFQGRGRGGEKERVQGDVYRGSKKGGEAESSRDRSLGERQREGSSDRVYS

Query:  PT
        PT
Subjt:  PT

TrEMBL top hits (columns: e value, % identity, alignment)
A0A1R3K847 Uncharacterized protein | e value: 9.3e-38 | % identity: 34.29
Query:  ADGLVEEWENLSLTPEEEETLVDVDGSAAIDVERRLETSLIGKIFVKRIIPRDVLIKELRLVWHVEEGFDVEVLGKNIFLFSFDKAAVCNRILRGAPWAF
        A+GL   WEN +LT EE   +  V+ +   +     +  LIGK+  +R +  DV+   L +VW +  G  V  +G+ +++F F+      R+ +  PW F
Subjt:  ADGLVEEWENLSLTPEEEETLVDVDGSAAIDVERRLETSLIGKIFVKRIIPRDVLIKELRLVWHVEEGFDVEVLGKNIFLFSFDKAAVCNRILRGAPWAF

Query:  DRVLIVLQKPVANLKLSEVHFNKASFWVHFCDVPLVCLNASMAQRLGNAIGKFEYVDCSDDKFFWGASLRVRVSIDINKPLRRGVKVNLEGPIGG-CWTP
        ++ L+VL+   A + + E+     +FW+   D+PL  +  S+ + +G++ G+   +D   DK  WG  LR+R  +++NKPLRRG  + L  P GG     
Subjt:  DRVLIVLQKPVANLKLSEVHFNKASFWVHFCDVPLVCLNASMAQRLGNAIGKFEYVDCSDDKFFWGASLRVRVSIDINKPLRRGVKVNLEGPIGG-CWTP

Query:  IRYERLPDFCSYCGMIGHTKGECEMEHIQGGAWQVQKNQYGPWLK
         RYE+LPDFC  CG + HT+ ECE   I        K +YGPWL+
Subjt:  IRYERLPDFCSYCGMIGHTKGECEMEHIQGGAWQVQKNQYGPWLK

A0A2N9FJK9 CCHC-type domain-containing protein | e value: 4.5e-40 | % identity: 35.71
Query:  ADGLVEEWENLSLTPEEEETLVDVDGSAAIDVERRLETSLIGKIFVKRIIPRDVLIKELRLVWHVEEGFDVEVLGKNIFLFSFDKAAVCNRILRGAPWAF
        AD LVE+W   SLT +E    + +   A  D E      L+GK+   +   +  L   +  +W    G   + +G+N+FLF F   A C R+L G+PW F
Subjt:  ADGLVEEWENLSLTPEEEETLVDVDGSAAIDVERRLETSLIGKIFVKRIIPRDVLIKELRLVWHVEEGFDVEVLGKNIFLFSFDKAAVCNRILRGAPWAF

Query:  DRVLIVLQKPVANLKLSEVHFNKASFWVHFCDVPLVCLNASMAQRLGNAIGKFEYVDCSDDKFFWGASLRVRVSIDINKPLRRGVKVNLEGPIGGCWTPI
        D  L+VL     +   +++ FN   FWV F  +PL  +     +RLG AIG  E VD S +   WG  LRVR+S+D+ KPL+RG  +   G  G  W   
Subjt:  DRVLIVLQKPVANLKLSEVHFNKASFWVHFCDVPLVCLNASMAQRLGNAIGKFEYVDCSDDKFFWGASLRVRVSIDINKPLRRGVKVNLEGPIGGCWTPI

Query:  RYERLPDFCSYCGMIGHTKGECEMEHIQGGAWQVQKNQYGPWLK-----FQGRGRGGEKERVQGDVYRGSKKGGEAESSR
        +YERLP  C +CG +GH + EC ++   GGA   +   YGPWL+     F+ R   G+ +R  G     +  GG  E+ R
Subjt:  RYERLPDFCSYCGMIGHTKGECEMEHIQGGAWQVQKNQYGPWLK-----FQGRGRGGEKERVQGDVYRGSKKGGEAESSR

A0A6J1BSZ1 uncharacterized protein LOC111005481 | e value: 9.9e-64 | % identity: 49
Query:  MNADGLVEEWENLSLTPEEEETLVDVDGSAAIDVERRLETSLIGKIFVKRIIPRDVLIKELRLVWHVE-EGFDVEVLGKNIFLFSFDKAAVCNRILRGAP
        M A  L+EEW+N  LT EE++  VD+D SA     + LE SLI K+  KR I   VL   L++ W ++ + F V+++G NIFLF+F++++  NRILR  P
Subjt:  MNADGLVEEWENLSLTPEEEETLVDVDGSAAIDVERRLETSLIGKIFVKRIIPRDVLIKELRLVWHVE-EGFDVEVLGKNIFLFSFDKAAVCNRILRGAP

Query:  WAFDRVLIVLQKPVANLKLSEVHFNKASFWVHFCDVPLVCLNASMAQRLGNAIGKFEYVDCSDDKFFWGASLRVRVSIDINKPLRRGVKVNLEGPIGGCW
        W FDR LI++  PV+  K  ++ F   S WVHF D+ L C+N +MA RLGNAIG FE V+ + + F WG+ LRVRV  D+ KPL RG+K+NL+GP+GGCW
Subjt:  WAFDRVLIVLQKPVANLKLSEVHFNKASFWVHFCDVPLVCLNASMAQRLGNAIGKFEYVDCSDDKFFWGASLRVRVSIDINKPLRRGVKVNLEGPIGGCW

Query:  TPIRYERLPDFCSYCGMIGHTKGECEMEHIQGGAWQVQKN-QYGPWLKFQG
         PI+YERLPDF  +CG + H   +C    +      V KN QYGPWL+FQG
Subjt:  TPIRYERLPDFCSYCGMIGHTKGECEMEHIQGGAWQVQKN-QYGPWLKFQG

A0A6J1DU55 uncharacterized protein LOC111023135 | e value: 1.2e-72 | % identity: 44.59
Query:  MNADGLVEEWENLSLTPEEEETLVDVDGSAAIDVERRLETSLIGKIFVKRIIPRDVLIKELRLVWHVEEGFDVEVLGKNIFLFSFDKAAVCNRILRGAPW
        M+ + L+ +W+   LT EE+E  +DVD  A    E+ L  SL+GK+  KRII  DVL + L L W VE    VE +GKN+FLF F +    NR+++  PW
Subjt:  MNADGLVEEWENLSLTPEEEETLVDVDGSAAIDVERRLETSLIGKIFVKRIIPRDVLIKELRLVWHVEEGFDVEVLGKNIFLFSFDKAAVCNRILRGAPW

Query:  AFDRVLIVLQKPVANLKLSEVHFNKASFWVHFCDVPLVCLNASMAQRLGNAIGKFEYVDCSDDKFFWGASLRVRVSIDINKPLRRGVKVNLEGPIGGCWT
         FD+ LIVLQKP ++  +SE+ FN+ +FW+H  D+P+  LN +MA RLGNAIG F  VDC++  F WGASLR+RV IDI KPLRRG+K+N++GP+GGCW 
Subjt:  AFDRVLIVLQKPVANLKLSEVHFNKASFWVHFCDVPLVCLNASMAQRLGNAIGKFEYVDCSDDKFFWGASLRVRVSIDINKPLRRGVKVNLEGPIGGCWT

Query:  PIRYERLPDFCSYCGMIGHTKGECEMEHIQGGAWQVQKNQYGPWLKFQGRGRGGEKERVQGDVYRGSKKGGEAESSRDRSLGERQREGSSDRVYSPTERV
        PI+YERLPDFC +CG+IGH+  +C+  ++         ++YGPWL+F G   G +K R      R    G  + +S++R + E +++ S        +  
Subjt:  PIRYERLPDFCSYCGMIGHTKGECEMEHIQGGAWQVQKNQYGPWLKFQGRGRGGEKERVQGDVYRGSKKGGEAESSRDRSLGERQREGSSDRVYSPTERV

Query:  SISEQ
        + +EQ
Subjt:  SISEQ

A0A6J1DX30 uncharacterized protein LOC111024874 | e value: 1.1e-62 | % identity: 46.4
Query:  LVEEWENLSLTPEEEETLVDVDGSAAIDVERRLETSLIGKIFVKRIIPRDVLIKELRLVWHVE-EGFDVEVLGKNIFLFSFDKAAVCNRILRGAPWAFDR
        L+EEW+N  LT EEEET +DVD SA      RLE  L+GK+F+KR I   V+   +R  W +E   F+V+ LG N+FLFSF +A   N+I +  PW FDR
Subjt:  LVEEWENLSLTPEEEETLVDVDGSAAIDVERRLETSLIGKIFVKRIIPRDVLIKELRLVWHVE-EGFDVEVLGKNIFLFSFDKAAVCNRILRGAPWAFDR

Query:  VLIVLQKPVANLKLSEVHFNKASFWVHFCDVPLVCLNASMAQRLGNAIGKFEYVDCSDDKFFWGASLRVRVSIDINKPLRRGVKVNLEGPIGGCWTPIRY
         L+++ KPVA +  SE+ F K   WV F D+PL C+   MA RLGNA+G FE  DC D    WG++LRVRV +DI+KPLRRG+K+NL+GPIGG W PI+Y
Subjt:  VLIVLQKPVANLKLSEVHFNKASFWVHFCDVPLVCLNASMAQRLGNAIGKFEYVDCSDDKFFWGASLRVRVSIDINKPLRRGVKVNLEGPIGGCWTPIRY

Query:  ERLPDFCSYCGMIGHTKGECEMEHIQGGAWQVQKNQYGPWLKFQGRGRGGEKERVQGDVYRGSKKGGEAESSRDRSLG
        ERLPDFC +CG+    K                K+QYG WL++QG  +    +  Q       K G  + SS    +G
Subjt:  ERLPDFCSYCGMIGHTKGECEMEHIQGGAWQVQKNQYGPWLKFQGRGRGGEKERVQGDVYRGSKKGGEAESSRDRSLG

SwissProt top hits (columns: e value, % identity, alignment)
No hits found

Arabidopsis top hits (columns: e value, % identity, alignment)
AT3G31430.1 unknown protein | e value: 2.3e-12 | % identity: 25.67
Query:  IFVKRIIPR----DVLIKELRLVWHVEEGFDVEVLGKNIFLFSFDKAAVCNRILRGAPWAFDRVLIVLQKPVANLKLSEVHFNKASFWVHFCDVPLVCLN
        +F + ++PR      ++  +  +W         ++    F F F        +LR  PWAF+  +I+LQ+    + L    F    FWV    +P   LN
Subjt:  IFVKRIIPR----DVLIKELRLVWHVEEGFDVEVLGKNIFLFSFDKAAVCNRILRGAPWAFDRVLIVLQKPVANLKLSEVHFNKASFWVHFCDVPLVCLN

Query:  ASMAQRLGNAIGKFEYVDCSDDKFFWGASLRVRVSIDINKPLRRGVKVNLEGPIGGCWTPIRYERLPDFCSYCGMIGHTKGECEMEH
          + + +G A+G+    D + +        RV +  DI  PLR          +       RYERL  FC  CGM+ H  G C +++
Subjt:  ASMAQRLGNAIGKFEYVDCSDDKFFWGASLRVRVSIDINKPLRRGVKVNLEGPIGGCWTPIRYERLPDFCSYCGMIGHTKGECEMEH

AT3G42140.1 zinc ion binding;nucleic acid binding | e value: 5.3e-09 | % identity: 26.67
Query:  DVEVLGKNIFL----FSFDKAAVCNRILRGAPWAFDRVLIVLQKPVANLKL-SEVHFNKASFWVHFCDVPLVCLNASMAQRLGNAIGKFEYVDCSDDKFF
        D EV+G+ + +    F F        ILR  PW+F+  + V+Q+     KL S+  F +  FW+    +PL  L A +   +G  +G F           
Subjt:  DVEVLGKNIFL----FSFDKAAVCNRILRGAPWAFDRVLIVLQKPVANLKL-SEVHFNKASFWVHFCDVPLVCLNASMAQRLGNAIGKFEYVDCSDDKFF

Query:  WGASLRVRVSIDINKPLRRGVKVNLEGPIGGCWTPIRYERLPDFCSYCGMIGHTKGECEMEHIQG
                    +   L R V V             +YE+L +FC+ CGM+ H   EC     QG
Subjt:  WGASLRVRVSIDINKPLRRGVKVNLEGPIGGCWTPIRYERLPDFCSYCGMIGHTKGECEMEHIQG


Sequences
CDS sequence
ATGGGTGGGGTTTGGAACATGAATGCTGATGGACTAGTAGAAGAGTGGGAAAATTTATCGTTGACTCCCGAAGAGGAGGAGACATTGGTGGATGTGGATGGTTCGGCTGC
CATTGATGTGGAAAGGAGGTTGGAAACCAGCCTAATTGGGAAAATTTTTGTGAAGAGAATTATTCCTCGGGATGTTCTGATTAAAGAATTACGGCTGGTGTGGCATGTGG
AGGAAGGGTTTGACGTCGAAGTTTTGGGGAAGAATATTTTTCTGTTCAGTTTTGATAAAGCGGCAGTATGCAATAGAATTTTGAGAGGTGCTCCGTGGGCCTTTGACAGG
GTGTTGATTGTTTTACAGAAGCCAGTTGCAAATTTGAAACTATCTGAGGTTCACTTCAATAAGGCTTCGTTTTGGGTGCACTTCTGCGATGTACCTTTGGTTTGCTTGAA
TGCATCTATGGCACAACGACTAGGCAATGCAATTGGGAAGTTTGAATATGTTGATTGTTCGGATGATAAGTTTTTTTGGGGTGCAAGTCTGAGAGTTCGAGTTTCCATTG
ACATAAATAAACCGCTTAGAAGGGGAGTTAAAGTTAACTTGGAAGGACCTATTGGGGGTTGTTGGACTCCGATCAGGTATGAGAGGCTTCCAGACTTTTGTTCCTACTGT
GGAATGATTGGTCATACGAAAGGTGAATGTGAGATGGAGCATATTCAAGGTGGTGCATGGCAAGTACAGAAGAACCAATATGGGCCGTGGCTCAAGTTTCAAGGGAGGGG
CAGAGGTGGCGAGAAGGAACGGGTGCAGGGAGATGTGTATAGAGGAAGCAAAAAAGGTGGTGAAGCTGAGTCTTCGAGGGATAGGAGTTTGGGTGAAAGACAAAGGGAGG
GTTCATCGGATCGGGTTTATTCTCCTACTGAGCGTGTTAGTATCTCTGAGCAGGTGGAGACTTCCGAGTGTTTACCCGAAGGACACAAAGTGGTGGAATGTGTGGACCTT
CAGGAGGTTGATGTTGCGGCTGGCCGTCATAGTGAAATTGGTGGGAGGGGCTCTGAACAAGAGTTGAGATTTATAGATGGGGTGTTAATGGATAGGGTGGAGGATAATGG
TATACAACAAATGGGTAAGGAGGGCATGGAGAGGAAGATGGTATGCGCTTTGCCAGTATTGGGTAGCAAGAGGGAGTCATTGGCAGGGGTGGTGACAAATGTCCGGGGCT
GGAAAAGGAAGGCACGTATGAAAAAATTACTCAGAATGTGCCAGACCTTTTGTGTGGTAAGCGAAAGGGGGTGGTCGAGCAAGTTGAGATGGGAGGGGTTGAAAAGAGAA
GGTGTGTTGTGA
mRNA sequence
ATGGGTGGGGTTTGGAACATGAATGCTGATGGACTAGTAGAAGAGTGGGAAAATTTATCGTTGACTCCCGAAGAGGAGGAGACATTGGTGGATGTGGATGGTTCGGCTGC
CATTGATGTGGAAAGGAGGTTGGAAACCAGCCTAATTGGGAAAATTTTTGTGAAGAGAATTATTCCTCGGGATGTTCTGATTAAAGAATTACGGCTGGTGTGGCATGTGG
AGGAAGGGTTTGACGTCGAAGTTTTGGGGAAGAATATTTTTCTGTTCAGTTTTGATAAAGCGGCAGTATGCAATAGAATTTTGAGAGGTGCTCCGTGGGCCTTTGACAGG
GTGTTGATTGTTTTACAGAAGCCAGTTGCAAATTTGAAACTATCTGAGGTTCACTTCAATAAGGCTTCGTTTTGGGTGCACTTCTGCGATGTACCTTTGGTTTGCTTGAA
TGCATCTATGGCACAACGACTAGGCAATGCAATTGGGAAGTTTGAATATGTTGATTGTTCGGATGATAAGTTTTTTTGGGGTGCAAGTCTGAGAGTTCGAGTTTCCATTG
ACATAAATAAACCGCTTAGAAGGGGAGTTAAAGTTAACTTGGAAGGACCTATTGGGGGTTGTTGGACTCCGATCAGGTATGAGAGGCTTCCAGACTTTTGTTCCTACTGT
GGAATGATTGGTCATACGAAAGGTGAATGTGAGATGGAGCATATTCAAGGTGGTGCATGGCAAGTACAGAAGAACCAATATGGGCCGTGGCTCAAGTTTCAAGGGAGGGG
CAGAGGTGGCGAGAAGGAACGGGTGCAGGGAGATGTGTATAGAGGAAGCAAAAAAGGTGGTGAAGCTGAGTCTTCGAGGGATAGGAGTTTGGGTGAAAGACAAAGGGAGG
GTTCATCGGATCGGGTTTATTCTCCTACTGAGCGTGTTAGTATCTCTGAGCAGGTGGAGACTTCCGAGTGTTTACCCGAAGGACACAAAGTGGTGGAATGTGTGGACCTT
CAGGAGGTTGATGTTGCGGCTGGCCGTCATAGTGAAATTGGTGGGAGGGGCTCTGAACAAGAGTTGAGATTTATAGATGGGGTGTTAATGGATAGGGTGGAGGATAATGG
TATACAACAAATGGGTAAGGAGGGCATGGAGAGGAAGATGGTATGCGCTTTGCCAGTATTGGGTAGCAAGAGGGAGTCATTGGCAGGGGTGGTGACAAATGTCCGGGGCT
GGAAAAGGAAGGCACGTATGAAAAAATTACTCAGAATGTGCCAGACCTTTTGTGTGGTAAGCGAAAGGGGGTGGTCGAGCAAGTTGAGATGGGAGGGGTTGAAAAGAGAA
GGTGTGTTGTGA
Protein sequence
MGGVWNMNADGLVEEWENLSLTPEEEETLVDVDGSAAIDVERRLETSLIGKIFVKRIIPRDVLIKELRLVWHVEEGFDVEVLGKNIFLFSFDKAAVCNRILRGAPWAFDR
VLIVLQKPVANLKLSEVHFNKASFWVHFCDVPLVCLNASMAQRLGNAIGKFEYVDCSDDKFFWGASLRVRVSIDINKPLRRGVKVNLEGPIGGCWTPIRYERLPDFCSYC
GMIGHTKGECEMEHIQGGAWQVQKNQYGPWLKFQGRGRGGEKERVQGDVYRGSKKGGEAESSRDRSLGERQREGSSDRVYSPTERVSISEQVETSECLPEGHKVVECVDL
QEVDVAAGRHSEIGGRGSEQELRFIDGVLMDRVEDNGIQQMGKEGMERKMVCALPVLGSKRESLAGVVTNVRGWKRKARMKKLLRMCQTFCVVSERGWSSKLRWEGLKRE
GVL
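
The InterPro annotation for this gene names a zinc knuckle with the spacing CX2CX4HX4C. As a rough illustration only, the sketch below reads that notation literally as a regular expression and scans a short fragment copied from the protein sequence above (the fragment spans a line break there). The literal regex reading and the helper name `find_motif` are assumptions made for this example; InterPro assigns domains with its own signature models, not with this pattern.

```python
import re

# Fragment copied from the protein sequence in this record.
fragment = "ERLPDFCSYCGMIGHTKGECEMEHIQGG"

# Assumed literal reading of "CX2CX4HX4C":
# C, any 2 residues, C, any 4 residues, H, any 4 residues, C.
ZINC_KNUCKLE = re.compile(r"C.{2}C.{4}H.{4}C")

def find_motif(seq):
    """Return (0-based offset, matched text) for each non-overlapping match."""
    return [(m.start(), m.group()) for m in ZINC_KNUCKLE.finditer(seq)]

hits = find_motif(fragment)
print(hits)
```

On this fragment the pattern matches once, at CSYCGMIGHTKGEC, which is in keeping with the CCHC-type description given for the gene.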