CuGenDBv2

Moc09g10730 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc09g10730
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: CCHC-type domain-containing protein
Genome location: chr9:9107532..9108930
RNA-Seq Expression: Moc09g10730
Synteny: Moc09g10730
Gene Ontology terms:
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
  IPR001878 - Zinc finger, CCHC-type
  IPR025558 - Domain of unknown function DUF4283
  IPR025836 - Zinc knuckle CX2CX4HX4C
  IPR040256 - Uncharacterized protein At4g02000-like
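The genome location field above follows a `seqid:start..end` pattern. As a minimal sketch, the snippet below splits such a string and computes the span length. It assumes the coordinates are 1-based and inclusive, which this record does not state; the helper names `parse_location` and `span_length` are hypothetical and not part of any CuGenDB tooling.

```python
import re


def parse_location(loc):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("chr9:9107532..9108930")
length = span_length(start, end)
print(seqid, start, end, length)
```

If the database turned out to use 0-based half-open coordinates, the length would be `end - start` instead.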


Homology
GenBank top hits (e value, % identity, alignment)
XP_022149484.1 uncharacterized protein LOC111017902 [Momordica charantia]; e value 7.5e-100; identity 67.44%
Query:  SEENETVTIDEGKPIMTSENVQLCTVGKIHTNKRISVEAFKSVMKSVWNVHDSTRIETISMNIFVFLFKSLREKIRVLNSGPWTFNKSFLVLSSPTATDQ
        ++ENETVTID GKPI+T++NV+LC V K+HT+KRIS EA +SVMKSVW VH+STR E + MNI+V LFKSL EK RVL+SGPWTFNKS LVL+SPTAT+Q
Subjt:  SEENETVTIDEGKPIMTSENVQLCTVGKIHTNKRISVEAFKSVMKSVWNVHDSTRIETISMNIFVFLFKSLREKIRVLNSGPWTFNKSFLVLSSPTATDQ

Query:  PLEMNFNICAFWVQVHNIPFECMLKEMAVLLGGKLGEVEEVECEGPGGWAGPFLRTRVKIDISKPLRRGMKIRSSEGKYLWCPIRYEKLLDFCYDCGMIG
        PL+MNFN CAFW+Q+HNIPFEC+  EMA +LG KLG+VEE+E +G  GWAGPF+R RVKID+SKPLRRG+K+++S+GK +WCP+RYEKL DFCY+CG IG
Subjt:  PLEMNFNICAFWVQVHNIPFECMLKEMAVLLGGKLGEVEEVECEGPGGWAGPFLRTRVKIDISKPLRRGMKIRSSEGKYLWCPIRYEKLLDFCYDCGMIG

Query:  HSSRECDNRSTV----HQNQYDDWLHATLLKKSTRYQEDGFGIRGGRFKRGQRSNGGR
        HS REC+ RS V       QY DWL ATLLKKS  + E+    RGGRF RG + NGGR
Subjt:  HSSRECDNRSTV----HQNQYDDWLHATLLKKSTRYQEDGFGIRGGRFKRGQRSNGGR

XP_022155933.1 uncharacterized protein LOC111022932 [Momordica charantia]; e value 3.7e-67; identity 59.7%
Query:  MEDIIAGCGNLNLRSEENETVTIDEGKPIMTSENVQLCTVGKIHTNKRISVEAFKSVMKSVWNVHDSTRIETISMNIFVFLFKSLREKIRVLNSGPWTFN
        M++I     N  L  E+ ET+ ID  KP+MT ENVQ   VGK+HT+KRISVEAF+SVMKS+W VH+ST IET  MN++V +FKS+ EK RVL SGPW+F 
Subjt:  MEDIIAGCGNLNLRSEENETVTIDEGKPIMTSENVQLCTVGKIHTNKRISVEAFKSVMKSVWNVHDSTRIETISMNIFVFLFKSLREKIRVLNSGPWTFN

Query:  KSFLVLSSPTATDQPLEMNFNICAFWVQVHNIPFECMLKEMAVLLGGKLGEVEEVECEGPGGWAGPFLRTRVKIDISKPLRRGMKIRSSEGKYLWCPIRY
         S LVL+SPTATDQP +MNFN  A W+Q+H IPF+CM+K+MA  LG ++GEVEE++C G   W GPF+R RV+IDISKP +RG+K+R+ + K  WCP+RY
Subjt:  KSFLVLSSPTATDQPLEMNFNICAFWVQVHNIPFECMLKEMAVLLGGKLGEVEEVECEGPGGWAGPFLRTRVKIDISKPLRRGMKIRSSEGKYLWCPIRY

Query:  E
        E
Subjt:  E

XP_022156711.1 uncharacterized protein LOC111023555 [Momordica charantia]; e value 4.3e-71; identity 57.08%
Query:  MEDIIAGCGNLNLRSEENETVTIDEGKPIMTSENVQLCTVGKIHTNKRISVEAFKSVMKSVWNVHDSTRIETISMNIFVFLFKSLREKIRVLNSGPWTFN
        M+DI     N  L  EE +T+ ID  KPI+T +N+QLC VGK+H +KRI+VEAF SVMK VW +H+STRIET  +NI+V  FK++ EKIRV + GPWTF+
Subjt:  MEDIIAGCGNLNLRSEENETVTIDEGKPIMTSENVQLCTVGKIHTNKRISVEAFKSVMKSVWNVHDSTRIETISMNIFVFLFKSLREKIRVLNSGPWTFN

Query:  KSFLVLSSPTATDQPLEMNFNICAFWVQVHNIPFECMLKEMAVLLGGKLGEVEEVECEGPGGWAGPFLRTRVKIDISKPLRRGMKIRSSEGKYLWCPIRY
        KS L+L   TA ++PL+++ ++CAFWVQ+H I FECM K+MA  LG +LGEVEEV+      W  PFL  RVKI++ KPLRRG+K+++S+GK +WCP+RY
Subjt:  KSFLVLSSPTATDQPLEMNFNICAFWVQVHNIPFECMLKEMAVLLGGKLGEVEEVECEGPGGWAGPFLRTRVKIDISKPLRRGMKIRSSEGKYLWCPIRY

Query:  EKLLDFCYDCGMIGHSSRE
        E+L DFCY CG +GHS RE
Subjt:  EKLLDFCYDCGMIGHSSRE

XP_022158119.1 uncharacterized protein LOC111024676, partial [Momordica charantia]; e value 1.1e-45; identity 57.69%
Query:  MEDIIAGCGNLNLRSEENETVTIDEGKPIMTSENVQLCTVGKIHTNKRISVEAFKSVMKSVWNVHDSTRIETISMNIFVFLFKSLREKIRVLNSGPWTFN
        M++I     N NL +EE ET  +D  +PI+T+EN+QLC VGK+HT+KRIS +A  SVMK VW +H+STRIE   +NI+V  FK++ EK RVL+SGPWTF+
Subjt:  MEDIIAGCGNLNLRSEENETVTIDEGKPIMTSENVQLCTVGKIHTNKRISVEAFKSVMKSVWNVHDSTRIETISMNIFVFLFKSLREKIRVLNSGPWTFN

Query:  KSFLVLSSPTATDQPLEMNFNICAFWVQVHNIPFECMLKEMAVLLGGKLGEVEEVE
        KS  VL SPTA D+PL+++F  CAFWVQ+H IPFE + ++MA LLG +LG+VEEV+
Subjt:  KSFLVLSSPTATDQPLEMNFNICAFWVQVHNIPFECMLKEMAVLLGGKLGEVEEVE

XP_028102454.1 uncharacterized protein LOC114301689 [Camellia sinensis]; e value 2.1e-41; identity 39.26%
Query:  LNLRSEENETVTIDEGKPIMTSENVQLCTVGKIHTNKRISVEAFKSVMKSVWNVHDSTRIETISMNIFVFLFKSLREKIRVLNSGPWTFNKSFLVLSSPT
        L L ++E +TV +DE       E   L  VGK+ T +  +VEA KS + +VW          I MN+F+F F  L +K RVL +GPW+F+K  ++L+   
Subjt:  LNLRSEENETVTIDEGKPIMTSENVQLCTVGKIHTNKRISVEAFKSVMKSVWNVHDSTRIETISMNIFVFLFKSLREKIRVLNSGPWTFNKSFLVLSSPT

Query:  ATDQPLEMNFNICAFWVQVHNIPFECMLKEMAVLLGGKLGEVEEVECEGPGGWA-GPFLRTRVKIDISKPLRRGMKIRSSEGKYLWCPIRYEKLLDFCYD
           QP ++  +   FW+QVHN+P   M KE+  L+G K+G + +VE  GPGG A G +LR RV I+++KPL RGMK+     + +W   +YE+L +FCY 
Subjt:  ATDQPLEMNFNICAFWVQVHNIPFECMLKEMAVLLGGKLGEVEEVECEGPGGWA-GPFLRTRVKIDISKPLRRGMKIRSSEGKYLWCPIRYEKLLDFCYD

Query:  CGMIGHSSRECD------NRSTVHQNQYDDWLHATLLKKSTR
        CG++GHS +EC+         T    QY  WL A + K   R
Subjt:  CGMIGHSSRECD------NRSTVHQNQYDDWLHATLLKKSTR

TrEMBL top hits (e value, % identity, alignment)
A0A5C7IW83 CCHC-type domain-containing protein; e value 2.5e-40; identity 34.41%
Query:  NLNLRSEENETVTIDEGKPIMTSENVQLCTVGKIHTNKRISVEAFKSVMKSVWNVHDSTRIETISMNIFVFLFKSLREKIRVLNSGPWTFNKSFLVLSSP
        NL++  E+   +   E +    +++V  C VGK+ T K+++ EAF+S+++ +W+      +E ++ NIF+F F    ++ RV   GPW F KS +VL  P
Subjt:  NLNLRSEENETVTIDEGKPIMTSENVQLCTVGKIHTNKRISVEAFKSVMKSVWNVHDSTRIETISMNIFVFLFKSLREKIRVLNSGPWTFNKSFLVLSSP

Query:  TATDQPLEMNFNICAFWVQVHNIPFECMLKEMAVLLGGKLGEVEEVECEGPGGWAGPFLRTRVKIDISKPLRRGMKIRSSEGKYL-WCPIRYEKLLDFCY
          T    ++ FN  AFWVQ+H+ P  CM + MA  +  ++GEV E+  +    W G F+R +V IDISKPLRR ++++  + + +    ++YE+L +FCY
Subjt:  TATDQPLEMNFNICAFWVQVHNIPFECMLKEMAVLLGGKLGEVEEVECEGPGGWAGPFLRTRVKIDISKPLRRGMKIRSSEGKYL-WCPIRYEKLLDFCY

Query:  DCGMIGHSSREC---DNRSTVHQ---NQYDDWLHA-TLLKKSTRYQEDGFGIRGGRFKRGQRSNGGRREGEGIG-----GGVLIVMKIRKFGWQVTG---
         CG +GH   EC   D R    +    +Y  WL A T  K  +R    G+G    R     RS  G REGEG G     GG L+ MK       VTG   
Subjt:  DCGMIGHSSREC---DNRSTVHQ---NQYDDWLHA-TLLKKSTRYQEDGFGIRGGRFKRGQRSNGGRREGEGIG-----GGVLIVMKIRKFGWQVTG---

Query:  -ENKVNNEVPMDFEFNGNPTE---NHQNKGPKISSHDIGL
           K+   V         P E   +   KGP I   D+GL
Subjt:  -ENKVNNEVPMDFEFNGNPTE---NHQNKGPKISSHDIGL

A0A6J1D765 uncharacterized protein LOC111017902; e value 3.6e-100; identity 67.44%
Query:  SEENETVTIDEGKPIMTSENVQLCTVGKIHTNKRISVEAFKSVMKSVWNVHDSTRIETISMNIFVFLFKSLREKIRVLNSGPWTFNKSFLVLSSPTATDQ
        ++ENETVTID GKPI+T++NV+LC V K+HT+KRIS EA +SVMKSVW VH+STR E + MNI+V LFKSL EK RVL+SGPWTFNKS LVL+SPTAT+Q
Subjt:  SEENETVTIDEGKPIMTSENVQLCTVGKIHTNKRISVEAFKSVMKSVWNVHDSTRIETISMNIFVFLFKSLREKIRVLNSGPWTFNKSFLVLSSPTATDQ

Query:  PLEMNFNICAFWVQVHNIPFECMLKEMAVLLGGKLGEVEEVECEGPGGWAGPFLRTRVKIDISKPLRRGMKIRSSEGKYLWCPIRYEKLLDFCYDCGMIG
        PL+MNFN CAFW+Q+HNIPFEC+  EMA +LG KLG+VEE+E +G  GWAGPF+R RVKID+SKPLRRG+K+++S+GK +WCP+RYEKL DFCY+CG IG
Subjt:  PLEMNFNICAFWVQVHNIPFECMLKEMAVLLGGKLGEVEEVECEGPGGWAGPFLRTRVKIDISKPLRRGMKIRSSEGKYLWCPIRYEKLLDFCYDCGMIG

Query:  HSSRECDNRSTV----HQNQYDDWLHATLLKKSTRYQEDGFGIRGGRFKRGQRSNGGR
        HS REC+ RS V       QY DWL ATLLKKS  + E+    RGGRF RG + NGGR
Subjt:  HSSRECDNRSTV----HQNQYDDWLHATLLKKSTRYQEDGFGIRGGRFKRGQRSNGGR

A0A6J1DP89 uncharacterized protein LOC111022932; e value 1.8e-67; identity 59.7%
Query:  MEDIIAGCGNLNLRSEENETVTIDEGKPIMTSENVQLCTVGKIHTNKRISVEAFKSVMKSVWNVHDSTRIETISMNIFVFLFKSLREKIRVLNSGPWTFN
        M++I     N  L  E+ ET+ ID  KP+MT ENVQ   VGK+HT+KRISVEAF+SVMKS+W VH+ST IET  MN++V +FKS+ EK RVL SGPW+F 
Subjt:  MEDIIAGCGNLNLRSEENETVTIDEGKPIMTSENVQLCTVGKIHTNKRISVEAFKSVMKSVWNVHDSTRIETISMNIFVFLFKSLREKIRVLNSGPWTFN

Query:  KSFLVLSSPTATDQPLEMNFNICAFWVQVHNIPFECMLKEMAVLLGGKLGEVEEVECEGPGGWAGPFLRTRVKIDISKPLRRGMKIRSSEGKYLWCPIRY
         S LVL+SPTATDQP +MNFN  A W+Q+H IPF+CM+K+MA  LG ++GEVEE++C G   W GPF+R RV+IDISKP +RG+K+R+ + K  WCP+RY
Subjt:  KSFLVLSSPTATDQPLEMNFNICAFWVQVHNIPFECMLKEMAVLLGGKLGEVEEVECEGPGGWAGPFLRTRVKIDISKPLRRGMKIRSSEGKYLWCPIRY

Query:  E
        E
Subjt:  E

A0A6J1DVS4 uncharacterized protein LOC111023555; e value 2.1e-71; identity 57.08%
Query:  MEDIIAGCGNLNLRSEENETVTIDEGKPIMTSENVQLCTVGKIHTNKRISVEAFKSVMKSVWNVHDSTRIETISMNIFVFLFKSLREKIRVLNSGPWTFN
        M+DI     N  L  EE +T+ ID  KPI+T +N+QLC VGK+H +KRI+VEAF SVMK VW +H+STRIET  +NI+V  FK++ EKIRV + GPWTF+
Subjt:  MEDIIAGCGNLNLRSEENETVTIDEGKPIMTSENVQLCTVGKIHTNKRISVEAFKSVMKSVWNVHDSTRIETISMNIFVFLFKSLREKIRVLNSGPWTFN

Query:  KSFLVLSSPTATDQPLEMNFNICAFWVQVHNIPFECMLKEMAVLLGGKLGEVEEVECEGPGGWAGPFLRTRVKIDISKPLRRGMKIRSSEGKYLWCPIRY
        KS L+L   TA ++PL+++ ++CAFWVQ+H I FECM K+MA  LG +LGEVEEV+      W  PFL  RVKI++ KPLRRG+K+++S+GK +WCP+RY
Subjt:  KSFLVLSSPTATDQPLEMNFNICAFWVQVHNIPFECMLKEMAVLLGGKLGEVEEVECEGPGGWAGPFLRTRVKIDISKPLRRGMKIRSSEGKYLWCPIRY

Query:  EKLLDFCYDCGMIGHSSRE
        E+L DFCY CG +GHS RE
Subjt:  EKLLDFCYDCGMIGHSSRE

A0A6J1DYG3 uncharacterized protein LOC111024676; e value 5.1e-46; identity 57.69%
Query:  MEDIIAGCGNLNLRSEENETVTIDEGKPIMTSENVQLCTVGKIHTNKRISVEAFKSVMKSVWNVHDSTRIETISMNIFVFLFKSLREKIRVLNSGPWTFN
        M++I     N NL +EE ET  +D  +PI+T+EN+QLC VGK+HT+KRIS +A  SVMK VW +H+STRIE   +NI+V  FK++ EK RVL+SGPWTF+
Subjt:  MEDIIAGCGNLNLRSEENETVTIDEGKPIMTSENVQLCTVGKIHTNKRISVEAFKSVMKSVWNVHDSTRIETISMNIFVFLFKSLREKIRVLNSGPWTFN

Query:  KSFLVLSSPTATDQPLEMNFNICAFWVQVHNIPFECMLKEMAVLLGGKLGEVEEVE
        KS  VL SPTA D+PL+++F  CAFWVQ+H IPFE + ++MA LLG +LG+VEEV+
Subjt:  KSFLVLSSPTATDQPLEMNFNICAFWVQVHNIPFECMLKEMAVLLGGKLGEVEEVE

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT3G42140.1 zinc ion binding;nucleic acid binding; e value 1.0e-09; identity 24.56%
Query:  NVHDSTRIETISMNIFVFLFKSLREKIRVLNSGPWTFNKSFLVLSSPTATDQPLEMNFNICAFWVQVHNIPFECMLKEMAVLLGGKLGEVEEVECEGPGG
        NV +      + ++   FLF+S      +L  GPW+FN    V+   T      E  F    FW+Q+  IP   +   +   +G ++             
Subjt:  NVHDSTRIETISMNIFVFLFKSLREKIRVLNSGPWTFNKSFLVLSSPTATDQPLEMNFNICAFWVQVHNIPFECMLKEMAVLLGGKLGEVEEVECEGPGG

Query:  WAGPFLRTRVKIDISKPLRRGMKIRSSEGKYLWCPIRYEKLLDFCYDCGMIGHSSRECDNRSTVHQNQYDD
          G FL T +  D+S                     +YEKL +FC  CGM+ H + EC        +  DD
Subjt:  WAGPFLRTRVKIDISKPLRRGMKIRSSEGKYLWCPIRYEKLLDFCYDCGMIGHSSRECDNRSTVHQNQYDD

AT5G36228.1 nucleic acid binding;zinc ion binding; e value 7.4e-13; identity 23.83%
Query:  QLCTVGKIHTNKRISVEAFKSVMKSVWNVHDSTRIETISMNIFVFLFKSLREKIRVLNSGPWTFNKSFLVLSSPTATDQPLEMNFNICAFWVQVHNIPFE
        +L  +G+I   +  SVE     +   W +        +    F   F+S  + +  L   PW FN+ F+ L      D P E        WV +  IP  
Subjt:  QLCTVGKIHTNKRISVEAFKSVMKSVWNVHDSTRIETISMNIFVFLFKSLREKIRVLNSGPWTFNKSFLVLSSPTATDQPLEMNFNICAFWVQVHNIPFE

Query:  CMLKEMAVLLGGKLGEVEEVECEGPGGWAGPFLRTRVKIDISKPLRRGMKIRSSEGKYLWCPIRYEKLLDFCYDCGMIGHSSRECDNRSTVHQNQYDDWL
         + +    ++   LGEV  ++          F+R +V++D ++PLR   ++R +  +       YEKL   C +C  + H    C     VHQ + D+  
Subjt:  CMLKEMAVLLGGKLGEVEEVECEGPGGWAGPFLRTRVKIDISKPLRRGMKIRSSEGKYLWCPIRYEKLLDFCYDCGMIGHSSRECDNRSTVHQNQYDDWL

Query:  HATLLKKSTRYQED
           +L    RY ++
Subjt:  HATLLKKSTRYQED


Sequences
CDS sequence
ATGGAAGACATCATAGCAGGTTGTGGAAATCTCAACTTGAGAAGCGAGGAAAATGAGACTGTTACAATCGACGAAGGCAAGCCAATCATGACCTCGGAGAATGTG
CAACTTTGTACTGTGGGCAAAATCCACACCAATAAAAGAATTAGTGTGGAAGCCTTTAAGTCGGTGATGAAATCAGTGTGGAATGTTCATGATTCCACAAGGATA
GAGACTATCAGCATGAATATTTTTGTGTTTCTGTTCAAGTCTCTACGTGAGAAAATTCGTGTTCTAAATTCAGGACCTTGGACTTTTAACAAATCCTTTCTCGTT
CTCTCATCTCCAACAGCAACAGATCAACCTCTTGAAATGAATTTTAATATCTGCGCTTTTTGGGTTCAAGTTCATAACATTCCTTTTGAATGTATGTTAAAAGAG
ATGGCGGTGCTTTTAGGAGGCAAATTGGGGGAAGTGGAGGAAGTTGAATGTGAAGGCCCTGGTGGCTGGGCAGGACCCTTCTTACGAACGCGGGTGAAGATAGAC
ATTTCAAAACCCCTGCGAAGAGGCATGAAGATTAGAAGTAGTGAAGGAAAATACCTATGGTGTCCTATCCGATATGAAAAGCTCCTAGATTTTTGTTATGATTGT
GGAATGATTGGCCATTCGAGTAGGGAATGTGATAACCGAAGCACAGTTCATCAGAATCAATATGACGATTGGCTTCATGCGACTTTACTGAAGAAGAGCACAAGA
TATCAAGAAGATGGTTTTGGAATAAGGGGCGGTCGGTTTAAAAGAGGACAACGAAGCAATGGTGGTCGTCGGGAGGGCGAGGGGATTGGCGGAGGAGTACTGATA
GTCATGAAGATCAGGAAATTTGGCTGGCAAGTCACCGGAGAAAATAAGGTTAATAATGAGGTGCCGATGGATTTTGAATTTAATGGAAATCCTACAGAAAATCAC
CAGAATAAAGGACCTAAAATTTCCTCCCATGACATTGGGCTACCTGATATAGTTAGGCTCAAAGAGAGTGAAATTAAAGGATCCAAACAGGGCTGGAAACGACTA
AACAGAGCTGAAAAGGGTAAGAGCGTGATCGTGGATTTATTAAACCCAGCGTCAGGGAAAAGGAAAGAAGCCCAAGTGGAATCTTTGAATGATTTGGATAGTGGA
AAACAGAAACTTCAAAAGCTTGAGAAGGGAGAAAGTTTGCTCCTAAATCAAAAGCCTGCAACTGAGATTACTATTGTGGCGGTGGCTGACCCTTAG
mRNA sequence
ATGGAAGACATCATAGCAGGTTGTGGAAATCTCAACTTGAGAAGCGAGGAAAATGAGACTGTTACAATCGACGAAGGCAAGCCAATCATGACCTCGGAGAATGTG
CAACTTTGTACTGTGGGCAAAATCCACACCAATAAAAGAATTAGTGTGGAAGCCTTTAAGTCGGTGATGAAATCAGTGTGGAATGTTCATGATTCCACAAGGATA
GAGACTATCAGCATGAATATTTTTGTGTTTCTGTTCAAGTCTCTACGTGAGAAAATTCGTGTTCTAAATTCAGGACCTTGGACTTTTAACAAATCCTTTCTCGTT
CTCTCATCTCCAACAGCAACAGATCAACCTCTTGAAATGAATTTTAATATCTGCGCTTTTTGGGTTCAAGTTCATAACATTCCTTTTGAATGTATGTTAAAAGAG
ATGGCGGTGCTTTTAGGAGGCAAATTGGGGGAAGTGGAGGAAGTTGAATGTGAAGGCCCTGGTGGCTGGGCAGGACCCTTCTTACGAACGCGGGTGAAGATAGAC
ATTTCAAAACCCCTGCGAAGAGGCATGAAGATTAGAAGTAGTGAAGGAAAATACCTATGGTGTCCTATCCGATATGAAAAGCTCCTAGATTTTTGTTATGATTGT
GGAATGATTGGCCATTCGAGTAGGGAATGTGATAACCGAAGCACAGTTCATCAGAATCAATATGACGATTGGCTTCATGCGACTTTACTGAAGAAGAGCACAAGA
TATCAAGAAGATGGTTTTGGAATAAGGGGCGGTCGGTTTAAAAGAGGACAACGAAGCAATGGTGGTCGTCGGGAGGGCGAGGGGATTGGCGGAGGAGTACTGATA
GTCATGAAGATCAGGAAATTTGGCTGGCAAGTCACCGGAGAAAATAAGGTTAATAATGAGGTGCCGATGGATTTTGAATTTAATGGAAATCCTACAGAAAATCAC
CAGAATAAAGGACCTAAAATTTCCTCCCATGACATTGGGCTACCTGATATAGTTAGGCTCAAAGAGAGTGAAATTAAAGGATCCAAACAGGGCTGGAAACGACTA
AACAGAGCTGAAAAGGGTAAGAGCGTGATCGTGGATTTATTAAACCCAGCGTCAGGGAAAAGGAAAGAAGCCCAAGTGGAATCTTTGAATGATTTGGATAGTGGA
AAACAGAAACTTCAAAAGCTTGAGAAGGGAGAAAGTTTGCTCCTAAATCAAAAGCCTGCAACTGAGATTACTATTGTGGCGGTGGCTGACCCTTAG
Protein sequence
MEDIIAGCGNLNLRSEENETVTIDEGKPIMTSENVQLCTVGKIHTNKRISVEAFKSVMKSVWNVHDSTRIETISMNIFVFLFKSLREKIRVLNSGPWTFNKSFLV
LSSPTATDQPLEMNFNICAFWVQVHNIPFECMLKEMAVLLGGKLGEVEEVECEGPGGWAGPFLRTRVKIDISKPLRRGMKIRSSEGKYLWCPIRYEKLLDFCYDC
GMIGHSSRECDNRSTVHQNQYDDWLHATLLKKSTRYQEDGFGIRGGRFKRGQRSNGGRREGEGIGGGVLIVMKIRKFGWQVTGENKVNNEVPMDFEFNGNPTENH
QNKGPKISSHDIGLPDIVRLKESEIKGSKQGWKRLNRAEKGKSVIVDLLNPASGKRKEAQVESLNDLDSGKQKLQKLEKGESLLLNQKPATEITIVAVADP
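The CDS and protein sequences above can be checked against each other by translating the CDS. The sketch below is one way to do that in plain Python, assuming the standard nuclear genetic code applies to this gene; the `translate` helper is hypothetical, and the two short inputs are the first 21 and last 12 nucleotides of the CDS listed in this record.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order
# (assumption: the standard nuclear code is the right one here).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate a CDS to protein, dropping one trailing stop codon."""
    cds = "".join(cds.split()).upper()  # remove line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(
        CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        for i in range(0, len(cds), 3)
    )
    return protein[:-1] if protein.endswith("*") else protein


start_fragment = translate("ATGGAAGACATCATAGCAGGT")  # first 21 nt of the CDS
end_fragment = translate("GCTGACCCTTAG")              # last 12 nt of the CDS
print(start_fragment, end_fragment)
```

Passing the full CDS (all lines joined) to `translate` and comparing the result with the protein sequence gives a quick consistency check for the record.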