; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g07140 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g07140
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionCCHC-type domain-containing protein
Genome locationchr4:5044231..5053342
RNA-Seq ExpressionMoc04g07140
SyntenyMoc04g07140
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR040256 - Uncharacterized protein At4g02000-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022132681.1 uncharacterized protein LOC111005481 [Momordica charantia]2.1e-4138.78Show/hide
Query:  LLEDWNKLQLTFTEEETTVDVDLKVSKQMEQRLELCLVGKLLSHRFISC---------------------------------------------------
        LLE+W   +LT  E++  VD+D    +   + LEL L+ KLLS R ISC                                                   
Subjt:  LLEDWNKLQLTFTEEETTVDVDLKVSKQMEQRLELCLVGKLLSHRFISC---------------------------------------------------

Query:  -----EKPLNLIKPSELEFKLVAFWSHFYNLPMPFMNKTMASHLGNAIGVCGDVDCDSDECCLGESLSVKVRFDITKPLRRGVKINIDGPMGSCWVPIKY
             + P++L KP +++F+ V+ W HF++L +  MNKTMA+ LGNAIG+  DV+ +++  C G  L V+VRFD+ KPL RG+K+N+DGPMG CW+PI+Y
Subjt:  -----EKPLNLIKPSELEFKLVAFWSHFYNLPMPFMNKTMASHLGNAIGVCGDVDCDSDECCLGESLSVKVRFDITKPLRRGVKINIDGPMGSCWVPIKY

Query:  ERLPDFCYFCGMIGHLAKTCDSNVDDSEKVKAEYLQYGAWLRFQG
        ERLPDF Y CG + H+ K C     DS    ++ LQYG WLRFQG
Subjt:  ERLPDFCYFCGMIGHLAKTCDSNVDDSEKVKAEYLQYGAWLRFQG

XP_022156185.1 uncharacterized protein LOC111023135 [Momordica charantia]1.3e-4641.82Show/hide
Query:  EVLLEDWNKLQLTFTEEETTVDVDLKVSKQMEQRLELCLVGKLLSHRFISC-------------------------------------------------
        E LL DW K +LT  E+E  +DVD+   K  EQ L   LVGKLL+ R IS                                                  
Subjt:  EVLLEDWNKLQLTFTEEETTVDVDLKVSKQMEQRLELCLVGKLLSHRFISC-------------------------------------------------

Query:  ------EKPLNLIKPSELEFKLVAFWSHFYNLPMPFMNKTMASHLGNAIGVCGDVDCDSDECCLGESLSVKVRFDITKPLRRGVKINIDGPMGSCWVPIK
              +KP +    SELEF  VAFW H ++LPM ++NKTMA  LGNAIG   DVDC+      G SL ++V  DITKPLRRG+KINIDGPMG CW+PI+
Subjt:  ------EKPLNLIKPSELEFKLVAFWSHFYNLPMPFMNKTMASHLGNAIGVCGDVDCDSDECCLGESLSVKVRFDITKPLRRGVKINIDGPMGSCWVPIK

Query:  YERLPDFCYFCGMIGHLAKTCDSNVDDSEKVKAEYLQYGAWLRFQGQGKEGLSKPRKRESFSDRSSKQSLYINHR
        YERLPDFCYFCG+IGH +  CD+    ++       +YG WLRF G  K G  K RK +S +   S  S  +N +
Subjt:  YERLPDFCYFCGMIGHLAKTCDSNVDDSEKVKAEYLQYGAWLRFQGQGKEGLSKPRKRESFSDRSSKQSLYINHR

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]1.6e-3334.57Show/hide
Query:  LLEDWNKLQLTFTEEETTVDVDLKVSKQMEQRLELCLVGKLLSHRFISC---------------------------------------------------
        LLE+W   +LT  EEET +DVD         RLE  LVGKL   R I+C                                                   
Subjt:  LLEDWNKLQLTFTEEETTVDVDLKVSKQMEQRLELCLVGKLLSHRFISC---------------------------------------------------

Query:  -----EKPLNLIKPSELEFKLVAFWSHFYNLPMPFMNKTMASHLGNAIGVCGDVDCDSDECCLGESLSVKVRFDITKPLRRGVKINIDGPMGSCWVPIKY
              KP+ LI PSEL+F  +  W  F++LP+  + + MA  LGNA+G   + DCD      G +L V+V  DI+KPLRRG+K+N+DGP+G  W+PI+Y
Subjt:  -----EKPLNLIKPSELEFKLVAFWSHFYNLPMPFMNKTMASHLGNAIGVCGDVDCDSDECCLGESLSVKVRFDITKPLRRGVKINIDGPMGSCWVPIKY

Query:  ERLPDFCYFCGMIGHLAKTCDSNVDDSEKVKAEYLQYGAWLRFQGQGKEGLSKPRK-RESFSDRSSKQS
        ERLPDFCY CG+     K                 QYG+WLR+QG  K  + + ++ +E   D+S   S
Subjt:  ERLPDFCYFCGMIGHLAKTCDSNVDDSEKVKAEYLQYGAWLRFQGQGKEGLSKPRK-RESFSDRSSKQS

XP_028071384.1 uncharacterized protein LOC114273772 [Camellia sinensis]4.7e-2538.51Show/hide
Query:  IKPSELEFKLVAFWSHFYNLPMPFMNKTMASHLGNAIGVCGDVDCDSDECCLGESLSVKVRFDITKPLRRGVKINIDGPMGSCWVPIKYERLPDFCYFCG
        ++PS+++   V FW H  NLP+  MNK +   +GNA+G   D+D +      G ++ ++V  D+ KPLRRG+K+ +   +   WV  KYERLP +CYFCG
Subjt:  IKPSELEFKLVAFWSHFYNLPMPFMNKTMASHLGNAIGVCGDVDCDSDECCLGESLSVKVRFDITKPLRRGVKINIDGPMGSCWVPIKYERLPDFCYFCG

Query:  MIGHLAKTCDSNVDDSEKVKAEYLQYGAWLRFQGQGKEGLSKPRKRES
         +GH  + CD+ +  ++  + + LQYGAWLR      +G   PR+  S
Subjt:  MIGHLAKTCDSNVDDSEKVKAEYLQYGAWLRFQGQGKEGLSKPRKRES

XP_028122006.1 uncharacterized protein LOC114319195 [Camellia sinensis]8.1e-2538.85Show/hide
Query:  IKPSELEFKLVAFWSHFYNLPMPFMNKTMASHLGNAIGVCGDVDCDSDECCLGESLSVKVRFDITKPLRRGVKINIDGPMGSCWVPIKYERLPDFCYFCG
        ++PS+++   V FW H  NLP+  MNK +   +GNA+G   D+D +      G ++ ++V  D+ KPLRRG+K+ +       WV  KYERLP +CYFCG
Subjt:  IKPSELEFKLVAFWSHFYNLPMPFMNKTMASHLGNAIGVCGDVDCDSDECCLGESLSVKVRFDITKPLRRGVKINIDGPMGSCWVPIKYERLPDFCYFCG

Query:  MIGHLAKTCDSNVDDSEKVKAEYLQYGAWLRFQGQGKEG
         +GH  + CD  +  ++  + + LQYGAWLR      +G
Subjt:  MIGHLAKTCDSNVDDSEKVKAEYLQYGAWLRFQGQGKEG

TrEMBL top hitse value%identityAlignment
A0A5C7GPP6 CCHC-type domain-containing protein1.1e-2238.22Show/hide
Query:  HRFISCEKPLNLIKPSELEFKLVAFWSHFYNLPMPFMNKTMASHLGNAIGVCGDVDCDSDECCLGESLSVKVRFDITKPLRRGVKINIDGPMGSCWVPIK
        +  I  EKP+   + SEL F  V  W   +NLP+ +MN+  A  L   IG   ++  D+ E C G+ L VKVR D+ KPL+R +K+ +D        P+ 
Subjt:  HRFISCEKPLNLIKPSELEFKLVAFWSHFYNLPMPFMNKTMASHLGNAIGVCGDVDCDSDECCLGESLSVKVRFDITKPLRRGVKINIDGPMGSCWVPIK

Query:  YERLPDFCYFCGMIGHLAKTCDSNVDDSEKVKAEYLQYGAWLRFQGQGKEGLSKPRK
        YERLP FCY CG +GH+ + C ++    + ++    Q+GAWLR  G    G  KP+K
Subjt:  YERLPDFCYFCGMIGHLAKTCDSNVDDSEKVKAEYLQYGAWLRFQGQGKEGLSKPRK

A0A6J1BSZ1 uncharacterized protein LOC1110054811.0e-4138.78Show/hide
Query:  LLEDWNKLQLTFTEEETTVDVDLKVSKQMEQRLELCLVGKLLSHRFISC---------------------------------------------------
        LLE+W   +LT  E++  VD+D    +   + LEL L+ KLLS R ISC                                                   
Subjt:  LLEDWNKLQLTFTEEETTVDVDLKVSKQMEQRLELCLVGKLLSHRFISC---------------------------------------------------

Query:  -----EKPLNLIKPSELEFKLVAFWSHFYNLPMPFMNKTMASHLGNAIGVCGDVDCDSDECCLGESLSVKVRFDITKPLRRGVKINIDGPMGSCWVPIKY
             + P++L KP +++F+ V+ W HF++L +  MNKTMA+ LGNAIG+  DV+ +++  C G  L V+VRFD+ KPL RG+K+N+DGPMG CW+PI+Y
Subjt:  -----EKPLNLIKPSELEFKLVAFWSHFYNLPMPFMNKTMASHLGNAIGVCGDVDCDSDECCLGESLSVKVRFDITKPLRRGVKINIDGPMGSCWVPIKY

Query:  ERLPDFCYFCGMIGHLAKTCDSNVDDSEKVKAEYLQYGAWLRFQG
        ERLPDF Y CG + H+ K C     DS    ++ LQYG WLRFQG
Subjt:  ERLPDFCYFCGMIGHLAKTCDSNVDDSEKVKAEYLQYGAWLRFQG

A0A6J1DU55 uncharacterized protein LOC1110231356.2e-4741.82Show/hide
Query:  EVLLEDWNKLQLTFTEEETTVDVDLKVSKQMEQRLELCLVGKLLSHRFISC-------------------------------------------------
        E LL DW K +LT  E+E  +DVD+   K  EQ L   LVGKLL+ R IS                                                  
Subjt:  EVLLEDWNKLQLTFTEEETTVDVDLKVSKQMEQRLELCLVGKLLSHRFISC-------------------------------------------------

Query:  ------EKPLNLIKPSELEFKLVAFWSHFYNLPMPFMNKTMASHLGNAIGVCGDVDCDSDECCLGESLSVKVRFDITKPLRRGVKINIDGPMGSCWVPIK
              +KP +    SELEF  VAFW H ++LPM ++NKTMA  LGNAIG   DVDC+      G SL ++V  DITKPLRRG+KINIDGPMG CW+PI+
Subjt:  ------EKPLNLIKPSELEFKLVAFWSHFYNLPMPFMNKTMASHLGNAIGVCGDVDCDSDECCLGESLSVKVRFDITKPLRRGVKINIDGPMGSCWVPIK

Query:  YERLPDFCYFCGMIGHLAKTCDSNVDDSEKVKAEYLQYGAWLRFQGQGKEGLSKPRKRESFSDRSSKQSLYINHR
        YERLPDFCYFCG+IGH +  CD+    ++       +YG WLRF G  K G  K RK +S +   S  S  +N +
Subjt:  YERLPDFCYFCGMIGHLAKTCDSNVDDSEKVKAEYLQYGAWLRFQGQGKEGLSKPRKRESFSDRSSKQSLYINHR

A0A6J1DX30 uncharacterized protein LOC1110248747.8e-3434.57Show/hide
Query:  LLEDWNKLQLTFTEEETTVDVDLKVSKQMEQRLELCLVGKLLSHRFISC---------------------------------------------------
        LLE+W   +LT  EEET +DVD         RLE  LVGKL   R I+C                                                   
Subjt:  LLEDWNKLQLTFTEEETTVDVDLKVSKQMEQRLELCLVGKLLSHRFISC---------------------------------------------------

Query:  -----EKPLNLIKPSELEFKLVAFWSHFYNLPMPFMNKTMASHLGNAIGVCGDVDCDSDECCLGESLSVKVRFDITKPLRRGVKINIDGPMGSCWVPIKY
              KP+ LI PSEL+F  +  W  F++LP+  + + MA  LGNA+G   + DCD      G +L V+V  DI+KPLRRG+K+N+DGP+G  W+PI+Y
Subjt:  -----EKPLNLIKPSELEFKLVAFWSHFYNLPMPFMNKTMASHLGNAIGVCGDVDCDSDECCLGESLSVKVRFDITKPLRRGVKINIDGPMGSCWVPIKY

Query:  ERLPDFCYFCGMIGHLAKTCDSNVDDSEKVKAEYLQYGAWLRFQGQGKEGLSKPRK-RESFSDRSSKQS
        ERLPDFCY CG+     K                 QYG+WLR+QG  K  + + ++ +E   D+S   S
Subjt:  ERLPDFCYFCGMIGHLAKTCDSNVDDSEKVKAEYLQYGAWLRFQGQGKEGLSKPRK-RESFSDRSSKQS

A0A6J5WHB3 CCHC-type domain-containing protein1.1e-2233.19Show/hide
Query:  QRRRTEEVLLEDWNKLQLTFTEEETTVDVDLKVSKQMEQRLELCLVGKLL-----SHRFI------------SCEKPLNLIKPSELEFKLVAFWSHFYNL
        ++   EEV+      L LT  EE+  + ++   +  M +RL LCLVG +L     +H                 E P     P+ +  +   FW   +NL
Subjt:  QRRRTEEVLLEDWNKLQLTFTEEETTVDVDLKVSKQMEQRLELCLVGKLL-----SHRFI------------SCEKPLNLIKPSELEFKLVAFWSHFYNL

Query:  PMPFMNKTMASHLGNAIGVCGDVDCDSDECCLGESLSVKVRFDITKPLRRGVKINIDGPMGSCWVPIKYERLPDFCYFCGMIGHLAKTCDSNVDDSEKVK
        P+  M       +GN +G C DV   +D  CLG  L ++VR D+TKPLRR +K+  D  + +  +  KYERLPDFCY CG IGH+ K C   +  + + +
Subjt:  PMPFMNKTMASHLGNAIGVCGDVDCDSDECCLGESLSVKVRFDITKPLRRGVKINIDGPMGSCWVPIKYERLPDFCYFCGMIGHLAKTCDSNVDDSEKVK

Query:  AEYLQYGAWLRFQGQ-GKEGLSKPRKRESFSDRSSKQS
        A+   YG+WL  + + G+     PRK+E   +R+S++S
Subjt:  AEYLQYGAWLRFQGQ-GKEGLSKPRKRESFSDRSSKQS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G01050.1 zinc ion binding;nucleic acid binding8.9e-0623.08Show/hide
Query:  WSHFYNLPMPFMNKTMASHLGNAIGVCGDVDCDSDECCLGESLSVKVRFDITKPLRRGVKINIDGPMGSCWVPIKYERLPDFCYFCGMIGHLAKTCDSNV
        W    N+P  + ++ +   +   +G    VD ++     G    V +  ++ KPL+  V IN D         + YE L   C  CG+ GHL  +C  NV
Subjt:  WSHFYNLPMPFMNKTMASHLGNAIGVCGDVDCDSDECCLGESLSVKVRFDITKPLRRGVKINIDGPMGSCWVPIKYERLPDFCYFCGMIGHLAKTCDSNV

Query:  DDSEKVKAEYLQYGAWLRFQGQGKEGLSKPRKRESFSDRSSKQSLYINHRLMRWLKEFPRRCRQTKRQSQLAEFWIWGKRYKERRGATN-KELFEATINE
               AE +   A +    +G +G        +   R++++      +++  +     R +Q  R+    +      R+    G  +  +L E  I E
Subjt:  DDSEKVKAEYLQYGAWLRFQGQGKEGLSKPRKRESFSDRSSKQSLYINHRLMRWLKEFPRRCRQTKRQSQLAEFWIWGKRYKERRGATN-KELFEATINE

Query:  GPGGSNEI-TRSVGDDLGLKL
        GP   NE   R+VG  +G+ L
Subjt:  GPGGSNEI-TRSVGDDLGLKL

AT3G31430.1 unknown protein3.9e-0929.41Show/hide
Query:  FKLVAFWSHFYNLPMPFMNKTMASHLGNAIGVCGDVDCDSDECCLGESLSVKVRFDITKPLRRGVKINIDGPMGSCWVPIKYERLPDFCYFCGMIGHLAK
        F  + FW     +P  F+N+ +  H+G A+G   D D + +     +   V + +DIT PLR          + +  +  +YERL  FC  CGM+ H   
Subjt:  FKLVAFWSHFYNLPMPFMNKTMASHLGNAIGVCGDVDCDSDECCLGESLSVKVRFDITKPLRRGVKINIDGPMGSCWVPIKYERLPDFCYFCGMIGHLAK

Query:  TC
         C
Subjt:  TC

AT3G42140.1 zinc ion binding;nucleic acid binding6.4e-0422.94Show/hide
Query:  SELEFKLVAFWSHFYNLPMPFMNKTMASHLGNAIGVCGDVDCDSDECCLGESLSVKVRFDITKPLRRGVKINIDGPMGSCWVPIKYERLPDFCYFCGMIG
        S+ EFK + FW     +P+ F+   + + +G  +G+  + +       LG  +SV                          +  +YE+L +FC  CGM+ 
Subjt:  SELEFKLVAFWSHFYNLPMPFMNKTMASHLGNAIGVCGDVDCDSDECCLGESLSVKVRFDITKPLRRGVKINIDGPMGSCWVPIKYERLPDFCYFCGMIG

Query:  HLAKTCDSN
        H A  C ++
Subjt:  HLAKTCDSN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGATCGATATGTTCCGAAATTGGTGTAGAATTGAGCGGCCGATTGCCCTGTGGTTGCCGTGGGGGTGTAAGAAGAGCAAGATTGGACCCAGAGAATGGAACAGGGGA
TTCCGAAATCGTGAGCCACGTCGACGACCCAAGGCAGGAAGGGGTTTCCGATGAGGCAGGACACGGGGCGGTTTCGGAGGATCTGGGCCAGCGCGGGCTTTCCGGTGAGC
TCGAGCTGCGGCAAATAGACGTCGAAGTCTCCACGCCTGGGGTCGTCGTGTACACAGCCGTCGTCCCAAAACTGGAAGTCGATGGAACCGGCGCCGACGGGGGAGGGGCG
GTCGGCGATGCTGCCGGCGTTTATGAGACAGTGGCCGAAGTGGAGGGCAGTGAAGATGGTAGTGCGAAGGCCGTTGGAAGCGAGGGTTTTGGCGAGGCGGAGCGTGGGGT
TCATGTGGCCTTGCGCCGGGAAGCTTTGAACTCTGAAAATGGCGATTTGAAGCTTGGGGGACTATGCTCGTTATCTTTGGGGACCTCTCTTGCTGTTATGGGGGCGATTT
TCAGGGAGGATCTTTCTTCCCCGCTGGTTTCTGTGATGGTGGATGATGGTCGGGTGTTGGGGATTTCCCCGTTGGTTGTTGAGCATATTGTTGTGGCTGATAGAATCAAA
TGCAACATTGTTATACAACTTCCATGGTCCAAGGGCAAGAAAGACGGGGTAGCAGGTGAGCAACGGCGGAGGACAGAAGAGGTTCTTTTAGAGGATTGGAATAAGCTGCA
GCTTACTTTTACAGAGGAAGAAACTACAGTGGATGTTGATTTAAAGGTGTCCAAACAGATGGAACAACGACTAGAACTTTGCTTGGTAGGAAAGCTTCTGTCCCATCGTT
TCATCTCATGTGAGAAGCCACTTAATTTGATTAAACCATCAGAATTGGAATTCAAATTAGTGGCATTTTGGTCACACTTCTACAATTTGCCGATGCCTTTCATGAACAAA
ACGATGGCTTCTCATCTCGGCAATGCTATTGGGGTGTGTGGGGATGTTGACTGTGATTCTGACGAGTGTTGTTTGGGGGAAAGTCTGAGTGTCAAAGTGAGATTTGACAT
AACAAAACCCTTGAGACGAGGTGTGAAGATCAATATTGATGGTCCTATGGGTAGCTGTTGGGTTCCAATCAAGTATGAGCGTTTACCAGATTTTTGTTATTTCTGTGGGA
TGATTGGCCATCTAGCTAAAACTTGTGATTCAAATGTTGATGATTCTGAGAAAGTGAAAGCAGAATATTTGCAGTATGGGGCTTGGCTTCGTTTTCAAGGGCAGGGGAAA
GAAGGTCTATCTAAGCCGCGAAAGCGGGAATCCTTTTCAGACCGTTCCTCCAAGCAAAGTCTGTACATCAACCACCGGCTCATGAGGTGGCTGAAGGAGTTTCCAAGGAG
GTGCCGGCAAACCAAGCGGCAGAGCCAGTTGGCAGAATTCTGGATTTGGGGAAAACGTTACAAAGAACGGAGAGGTGCTACAAATAAGGAGTTATTTGAGGCAACAATTA
ATGAGGGGCCAGGTGGATCAAATGAGATAACTCGTTCAGTTGGGGATGATTTGGGCCTGAAGCTTGGGGGTCTTTCGCAGCCCACTTCTGTTAGAAACTGGAAAAGACTG
GCCCGTGAGAAGGCTTTGGCCTCCTCATTGTCAACGGTCGTGGGACAGAAACGAAAGAAGACTGTCACGAAGGATGTAGCATCGGTGGCAGCTGATTGTGATACAATTCA
CAAGCTGAAAGGTGATATGGATGACTTACTTGAAACAGAGGAAATTTATTGGAAGCAACGCTCCCGTGAGAATTGCCTACAGTGGGGTGACAGAAACACTAAATGGTTCC
ACAAACGTACCTCTATGCGTAGGCAACAGAACATGATTAGAGGGATTGTGATTGATCGGAACATTTGGACCGAGGATTCGAGGGATTTACTTGGTCATGGGCTGCGGAAA
GTGGTTGGGGATAGCACTGATATTGATTTTTTTATGGATGCTTGGGTCCCCAGAGAGAGGAAAAATGAAGAAGAAAGCAGCAATGCACCTGCTGATGACATAGAGGCTAT
CTTAAAACTAGCAGCGATGCGTCTGGCCCATCGCCAACACTCCAGAAACTTCAGCAATGCATCTGGCGCATTCTCCACCTACAAATAA
mRNA sequenceShow/hide mRNA sequence
ATGGGATCGATATGTTCCGAAATTGGTGTAGAATTGAGCGGCCGATTGCCCTGTGGTTGCCGTGGGGGTGTAAGAAGAGCAAGATTGGACCCAGAGAATGGAACAGGGGA
TTCCGAAATCGTGAGCCACGTCGACGACCCAAGGCAGGAAGGGGTTTCCGATGAGGCAGGACACGGGGCGGTTTCGGAGGATCTGGGCCAGCGCGGGCTTTCCGGTGAGC
TCGAGCTGCGGCAAATAGACGTCGAAGTCTCCACGCCTGGGGTCGTCGTGTACACAGCCGTCGTCCCAAAACTGGAAGTCGATGGAACCGGCGCCGACGGGGGAGGGGCG
GTCGGCGATGCTGCCGGCGTTTATGAGACAGTGGCCGAAGTGGAGGGCAGTGAAGATGGTAGTGCGAAGGCCGTTGGAAGCGAGGGTTTTGGCGAGGCGGAGCGTGGGGT
TCATGTGGCCTTGCGCCGGGAAGCTTTGAACTCTGAAAATGGCGATTTGAAGCTTGGGGGACTATGCTCGTTATCTTTGGGGACCTCTCTTGCTGTTATGGGGGCGATTT
TCAGGGAGGATCTTTCTTCCCCGCTGGTTTCTGTGATGGTGGATGATGGTCGGGTGTTGGGGATTTCCCCGTTGGTTGTTGAGCATATTGTTGTGGCTGATAGAATCAAA
TGCAACATTGTTATACAACTTCCATGGTCCAAGGGCAAGAAAGACGGGGTAGCAGGTGAGCAACGGCGGAGGACAGAAGAGGTTCTTTTAGAGGATTGGAATAAGCTGCA
GCTTACTTTTACAGAGGAAGAAACTACAGTGGATGTTGATTTAAAGGTGTCCAAACAGATGGAACAACGACTAGAACTTTGCTTGGTAGGAAAGCTTCTGTCCCATCGTT
TCATCTCATGTGAGAAGCCACTTAATTTGATTAAACCATCAGAATTGGAATTCAAATTAGTGGCATTTTGGTCACACTTCTACAATTTGCCGATGCCTTTCATGAACAAA
ACGATGGCTTCTCATCTCGGCAATGCTATTGGGGTGTGTGGGGATGTTGACTGTGATTCTGACGAGTGTTGTTTGGGGGAAAGTCTGAGTGTCAAAGTGAGATTTGACAT
AACAAAACCCTTGAGACGAGGTGTGAAGATCAATATTGATGGTCCTATGGGTAGCTGTTGGGTTCCAATCAAGTATGAGCGTTTACCAGATTTTTGTTATTTCTGTGGGA
TGATTGGCCATCTAGCTAAAACTTGTGATTCAAATGTTGATGATTCTGAGAAAGTGAAAGCAGAATATTTGCAGTATGGGGCTTGGCTTCGTTTTCAAGGGCAGGGGAAA
GAAGGTCTATCTAAGCCGCGAAAGCGGGAATCCTTTTCAGACCGTTCCTCCAAGCAAAGTCTGTACATCAACCACCGGCTCATGAGGTGGCTGAAGGAGTTTCCAAGGAG
GTGCCGGCAAACCAAGCGGCAGAGCCAGTTGGCAGAATTCTGGATTTGGGGAAAACGTTACAAAGAACGGAGAGGTGCTACAAATAAGGAGTTATTTGAGGCAACAATTA
ATGAGGGGCCAGGTGGATCAAATGAGATAACTCGTTCAGTTGGGGATGATTTGGGCCTGAAGCTTGGGGGTCTTTCGCAGCCCACTTCTGTTAGAAACTGGAAAAGACTG
GCCCGTGAGAAGGCTTTGGCCTCCTCATTGTCAACGGTCGTGGGACAGAAACGAAAGAAGACTGTCACGAAGGATGTAGCATCGGTGGCAGCTGATTGTGATACAATTCA
CAAGCTGAAAGGTGATATGGATGACTTACTTGAAACAGAGGAAATTTATTGGAAGCAACGCTCCCGTGAGAATTGCCTACAGTGGGGTGACAGAAACACTAAATGGTTCC
ACAAACGTACCTCTATGCGTAGGCAACAGAACATGATTAGAGGGATTGTGATTGATCGGAACATTTGGACCGAGGATTCGAGGGATTTACTTGGTCATGGGCTGCGGAAA
GTGGTTGGGGATAGCACTGATATTGATTTTTTTATGGATGCTTGGGTCCCCAGAGAGAGGAAAAATGAAGAAGAAAGCAGCAATGCACCTGCTGATGACATAGAGGCTAT
CTTAAAACTAGCAGCGATGCGTCTGGCCCATCGCCAACACTCCAGAAACTTCAGCAATGCATCTGGCGCATTCTCCACCTACAAATAA
Protein sequenceShow/hide protein sequence
MGSICSEIGVELSGRLPCGCRGGVRRARLDPENGTGDSEIVSHVDDPRQEGVSDEAGHGAVSEDLGQRGLSGELELRQIDVEVSTPGVVVYTAVVPKLEVDGTGADGGGA
VGDAAGVYETVAEVEGSEDGSAKAVGSEGFGEAERGVHVALRREALNSENGDLKLGGLCSLSLGTSLAVMGAIFREDLSSPLVSVMVDDGRVLGISPLVVEHIVVADRIK
CNIVIQLPWSKGKKDGVAGEQRRRTEEVLLEDWNKLQLTFTEEETTVDVDLKVSKQMEQRLELCLVGKLLSHRFISCEKPLNLIKPSELEFKLVAFWSHFYNLPMPFMNK
TMASHLGNAIGVCGDVDCDSDECCLGESLSVKVRFDITKPLRRGVKINIDGPMGSCWVPIKYERLPDFCYFCGMIGHLAKTCDSNVDDSEKVKAEYLQYGAWLRFQGQGK
EGLSKPRKRESFSDRSSKQSLYINHRLMRWLKEFPRRCRQTKRQSQLAEFWIWGKRYKERRGATNKELFEATINEGPGGSNEITRSVGDDLGLKLGGLSQPTSVRNWKRL
AREKALASSLSTVVGQKRKKTVTKDVASVAADCDTIHKLKGDMDDLLETEEIYWKQRSRENCLQWGDRNTKWFHKRTSMRRQQNMIRGIVIDRNIWTEDSRDLLGHGLRK
VVGDSTDIDFFMDAWVPRERKNEEESSNAPADDIEAILKLAAMRLAHRQHSRNFSNASGAFSTYK