; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc01g03990 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc01g03990
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr1:2586113..2587618
RNA-Seq ExpressionMoc01g03990
SyntenyMoc01g03990
Gene Ontology termsGO:0016310 - phosphorylation (biological process)
GO:0016301 - kinase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022153147.1 probable serine/threonine-protein kinase PBL11 [Momordica charantia]4.1e-5468.64Show/hide
Query:  EMVAPAAAPPHNAILLVNDGERAIRAYVAPAFHGFHPVIAGLEIEAERFELKSVMFQMLQTVGQFFGNPSEDPHLHLRYFLEIETFYNGLSDATHLVIDA
        E+V PAAAP  N ILL +DGERAIRAY A   H FHPVIAG EIEAERFELK VMFQMLQTVG+FFGNP ED HLHLRYFLEI+ FYNGLS+AT LV+DA
Subjt:  EMVAPAAAPPHNAILLVNDGERAIRAYVAPAFHGFHPVIAGLEIEAERFELKSVMFQMLQTVGQFFGNPSEDPHLHLRYFLEIETFYNGLSDATHLVIDA

Query:  SDNGALSSKSYVEAVDILERIYANNYHWSDSKAVTERNNHGANDNEAMAALMNQIANLTNMLKHMSTAT
        S N AL SKSYVEA+DILERI ANNYHWSDS+A  +R+NHG NDNE  ++        T  +  + T T
Subjt:  SDNGALSSKSYVEAVDILERIYANNYHWSDSKAVTERNNHGANDNEAMAALMNQIANLTNMLKHMSTAT

XP_022158490.1 uncharacterized protein LOC111024970 [Momordica charantia]5.5e-8353.41Show/hide
Query:  MVERINEEEMVAPAAAPPHNAILLVNDGERAIRAYVAPAFHGFHPVIAGLEIEAERFELKSVMFQMLQTVGQFFGNPSEDPHLHLRYFLEIETFYN----
        +VE INE E+V  AAAPP N ILLV+DGER IRAY APA HGFHPVIAG  IEAERFELKS+MFQMLQTVGQFFGNPSEDPHLHLRYFLE+   +N    
Subjt:  MVERINEEEMVAPAAAPPHNAILLVNDGERAIRAYVAPAFHGFHPVIAGLEIEAERFELKSVMFQMLQTVGQFFGNPSEDPHLHLRYFLEIETFYN----

Query:  -----GLSDATHLVIDASDN--GALSSKSYVEAVDILERIYANNYH-----WSDSKAVTERNNHGANDNEAMAALMNQIANLTNMLKHMSTATTSSNNPT
              L    +L+ D + +   ++ ++S     D+ E+ +   +       SDSKAV ERNNH ANDNEAMAAL +QIANLTNM+K+M+TATTSSN+P 
Subjt:  -----GLSDATHLVIDASDN--GALSSKSYVEAVDILERIYANNYH-----WSDSKAVTERNNHGANDNEAMAALMNQIANLTNMLKHMSTATTSSNNPT

Query:  TSKVMSIFCSYCEGEHHYDSCPGNPTSVFYLENPNRKNRLLMPSQQQGTRPAATSSNSMEAMMREYMARNDALIQSQAATLRNLEVQMGQKASKLKSRPQ
          + +                P N  +  Y +   ++  LLMP+QQQGTRPAA SSNSME MMREYM RNDALIQSQAA  RNLEVQ+GQ A+KLK+RP 
Subjt:  TSKVMSIFCSYCEGEHHYDSCPGNPTSVFYLENPNRKNRLLMPSQQQGTRPAATSSNSMEAMMREYMARNDALIQSQAATLRNLEVQMGQKASKLKSRPQ

Query:  GILPSDIENLKREGKEQCQALTLRCGKALPPPYPVQRQTDKVEHPAEPQLESPEDDAMENMPINEDK
                                           + QTDK+EH AEPQLE+ E+D  ENMPI+EDK
Subjt:  GILPSDIENLKREGKEQCQALTLRCGKALPPPYPVQRQTDKVEHPAEPQLESPEDDAMENMPINEDK

XP_024046666.1 uncharacterized protein LOC112101006 [Citrus clementina]1.0e-5237.11Show/hide
Query:  HNAILLVNDGERAIRAYVAPAFHGFHPVIAGLEIEAERFELKSVMFQMLQTVGQFFGNPSEDPHLHLRYFLEI---------------------------
        +N I + +D +RAIR Y        HP I   E+ A  FELK VMFQMLQTVGQF G P+E PHLHL+ FLE+                           
Subjt:  HNAILLVNDGERAIRAYVAPAFHGFHPVIAGLEIEAERFELKSVMFQMLQTVGQFFGNPSEDPHLHLRYFLEI---------------------------

Query:  -------------------------------------------ETFYNGLSDATHLVIDASDNGALSSKSYVEAVDILERIYANNYHWSDSKAVTERNNH
                                                   ETFYNGL+ +T L++DAS NGAL SKSY EA +ILERI  NNY W  ++    R + 
Subjt:  -------------------------------------------ETFYNGLSDATHLVIDASDNGALSSKSYVEAVDILERIYANNYHWSDSKAVTERNNH

Query:  GANDNEAMAALMNQIANLTNMLKHMSTATTSSNNPTTSKVMSIFCSYCEGEHHYDSCPGNPTSVFYLENPNRKNRLLMP------SQQQGTRPAATSS-N
          +  +A+  L +Q+ +LTNM+K M+ A       T  +V  + C YC  +H +D+CPGN  S       N  +R   P       Q QG +  +     
Subjt:  GANDNEAMAALMNQIANLTNMLKHMSTATTSSNNPTTSKVMSIFCSYCEGEHHYDSCPGNPTSVFYLENPNRKNRLLMP------SQQQGTRPAATSS-N

Query:  SMEAMMREYMARNDALIQSQAATLRNLEVQMGQKASKLKSRPQGILPSDIENLKREGKEQCQALTLRCGKALPPPYPVQR
        S+E +++EY+A+N+A++QSQA +LRNLE QMGQ A+ + SR QG LPS+IE+ +RE KE C+ ++LR GK +  P+ V +
Subjt:  SMEAMMREYMARNDALIQSQAATLRNLEVQMGQKASKLKSRPQGILPSDIENLKREGKEQCQALTLRCGKALPPPYPVQR

XP_030497803.1 uncharacterized protein LOC115713460 [Cannabis sativa]1.0e-5735.14Show/hide
Query:  NAILLVNDGERAIRAYVAPAFHGFHPVIAGLEIEAERFELKSVMFQMLQTVGQFFGNPSEDPHLHLRYFLEI----------------------------
        N I L +D  RAIR Y AP F+  +P I   EI+A  FELK VMFQMLQTVGQF G+P+EDPHLH+R FLE+                            
Subjt:  NAILLVNDGERAIRAYVAPAFHGFHPVIAGLEIEAERFELKSVMFQMLQTVGQFFGNPSEDPHLHLRYFLEI----------------------------

Query:  --------------------------------------------------------------------------ETFYNGLSDATHLVIDASDNGALSSK
                                                                                  ETFYNGL+ A+ +V+DAS NGA+ SK
Subjt:  --------------------------------------------------------------------------ETFYNGLSDATHLVIDASDNGALSSK

Query:  SYVEAVDILERIYANNYHWSDSKAVTERNNHGANDNEAMAALMNQIANLTNMLKHMSTATTSSNNPTTSKVMSIFCSYCEGEHHYDSCPGNPTSVFYL--
        SY EA +ILERI +NNY WS ++A T R   G  + +A+ AL  Q+A++TN+LK+M+   +        +  +  C YC   H +++CP N  SV Y+  
Subjt:  SYVEAVDILERIYANNYHWSDSKAVTERNNHGANDNEAMAALMNQIANLTNMLKHMSTATTSSNNPTTSKVMSIFCSYCEGEHHYDSCPGNPTSVFYL--

Query:  ENPNRKNR------------------------------LLMPSQQQGTRPAATSSNSMEAMMREYMARNDALIQSQAATLRNLEVQMGQKASKLKSRPQG
        +N NR N                                  P  QQ  +P  + ++S+E++MR+YMA+ND +IQSQAA+LRNLEVQ+GQ A+ LK+RPQG
Subjt:  ENPNRKNR------------------------------LLMPSQQQGTRPAATSSNSMEAMMREYMARNDALIQSQAATLRNLEVQMGQKASKLKSRPQG

Query:  ILPSDIENLKREGKEQCQALTLRCGKAL---------PPPYPVQRQTDKVEHPAEPQLESP
         LPSD EN +R+GKE C+A+TLR GK +           P  +Q++ +  + PA   +E P
Subjt:  ILPSDIENLKREGKEQCQALTLRCGKAL---------PPPYPVQRQTDKVEHPAEPQLESP

XP_030505184.1 uncharacterized protein LOC115720166 [Cannabis sativa]2.4e-5434.36Show/hide
Query:  MVERINEEEMVAPAAAPPHNAILLVNDGERAIRAYVAPAFHGFHPVIAGLEIEAERFELKSVMFQMLQTVGQFFGNPSEDPHLHLRYFLE----------
        M E ++++ +     +P    I+LV+D  RAIR Y AP F+  +P I   EI+A +FELK VMFQMLQTVGQF   P+EDPHLHLR FLE          
Subjt:  MVERINEEEMVAPAAAPPHNAILLVNDGERAIRAYVAPAFHGFHPVIAGLEIEAERFELKSVMFQMLQTVGQFFGNPSEDPHLHLRYFLE----------

Query:  --------------------------------------------------------------------------------------------IETFYNGL
                                                                                                    +ETFYNGL
Subjt:  --------------------------------------------------------------------------------------------IETFYNGL

Query:  SDATHLVIDASDNGALSSKSYVEAVDILERIYANNYHWSDSKAVTERNNHGANDNEAMAALMNQIANLTNMLKHMSTATTSSNNPTTS-KVMSIFCSYCE
        +  + +V+DAS NGA+ SKSY EA +ILE I +NNY WS+++A   R   G  + +A+ AL  Q+A++TN+LK++S   + +  P  + +   + C +C 
Subjt:  SDATHLVIDASDNGALSSKSYVEAVDILERIYANNYHWSDSKAVTERNNHGANDNEAMAALMNQIANLTNMLKHMSTATTSSNNPTTS-KVMSIFCSYCE

Query:  GEHHYDSCPGNPTSVFYL--ENPNRKNRLLMPSQQQ----------GTRPAA--------------------------TSSNSMEAMMREYMARNDALIQ
          H ++ CP NP SV Y+  +N NR N     S  Q          G+R                             +  +S+E++MR+YMA+NDA+IQ
Subjt:  GEHHYDSCPGNPTSVFYL--ENPNRKNRLLMPSQQQ----------GTRPAA--------------------------TSSNSMEAMMREYMARNDALIQ

Query:  SQAATLRNLEVQMGQKASKLKSRPQGILPSDIENLKREGKEQCQALTLRCGKAL
        SQAA LRNLE+Q+G  A++LK+RPQG LPSD EN +R+GKEQC+++ LR GK L
Subjt:  SQAATLRNLEVQMGQKASKLKSRPQGILPSDIENLKREGKEQCQALTLRCGKAL

TrEMBL top hitse value%identityAlignment
A0A5B6VWJ0 Retroelement pol polyprotein-like3.0e-4236.96Show/hide
Query:  PHLHLRYFLEIETFYNGLSDATHLVIDASDNGALSSKSYVEAVDILERIYANNYHWSDSKAVTERNNHGANDNEAMAALMNQIANLTNMLKHMST-ATTS
        PH  + + +++ETFYNGL   T +V+DAS NGAL SKSY EA +I+ERI +NNY W  S+A + R   G ++ +A+ +L +Q++++++M K+++T  + S
Subjt:  PHLHLRYFLEIETFYNGLSDATHLVIDASDNGALSSKSYVEAVDILERIYANNYHWSDSKAVTERNNHGANDNEAMAALMNQIANLTNMLKHMST-ATTS

Query:  SNNPTTSKVMSIFCSYCEGEHHYDSCPGNPTSVFYLENPNR----------------KNRLLMPSQQQG-------TRPAAT----------------SS
              ++  +I   YC   H  + CP NP SV+Y+ N N+                +N L      QG       T+P  T                +S
Subjt:  SNNPTTSKVMSIFCSYCEGEHHYDSCPGNPTSVFYLENPNR----------------KNRLLMPSQQQG-------TRPAAT----------------SS

Query:  NSMEAMMREYMARNDALIQSQAATLRNLEVQMGQKASKLKSRPQGILPSDIENLKREGKEQCQALTLRCGKALPP-----------PYPVQRQTDKVEHP
        NS+E++++ YMA+NDALIQSQAATL+NLE Q+GQ A++L++R QG LPSD EN +  GKE C+ALTLR  K + P               +     VE P
Subjt:  NSMEAMMREYMARNDALIQSQAATLRNLEVQMGQKASKLKSRPQGILPSDIENLKREGKEQCQALTLRCGKALPP-----------PYPVQRQTDKVEHP

Query:  AEPQLESPEDDAMENMPINEDK
          P+ +S + D + + P+N D+
Subjt:  AEPQLESPEDDAMENMPINEDK

A0A6J1DI54 probable serine/threonine-protein kinase PBL112.0e-5468.64Show/hide
Query:  EMVAPAAAPPHNAILLVNDGERAIRAYVAPAFHGFHPVIAGLEIEAERFELKSVMFQMLQTVGQFFGNPSEDPHLHLRYFLEIETFYNGLSDATHLVIDA
        E+V PAAAP  N ILL +DGERAIRAY A   H FHPVIAG EIEAERFELK VMFQMLQTVG+FFGNP ED HLHLRYFLEI+ FYNGLS+AT LV+DA
Subjt:  EMVAPAAAPPHNAILLVNDGERAIRAYVAPAFHGFHPVIAGLEIEAERFELKSVMFQMLQTVGQFFGNPSEDPHLHLRYFLEIETFYNGLSDATHLVIDA

Query:  SDNGALSSKSYVEAVDILERIYANNYHWSDSKAVTERNNHGANDNEAMAALMNQIANLTNMLKHMSTAT
        S N AL SKSYVEA+DILERI ANNYHWSDS+A  +R+NHG NDNE  ++        T  +  + T T
Subjt:  SDNGALSSKSYVEAVDILERIYANNYHWSDSKAVTERNNHGANDNEAMAALMNQIANLTNMLKHMSTAT

A0A6J1DVZ9 uncharacterized protein LOC1110249702.6e-8353.41Show/hide
Query:  MVERINEEEMVAPAAAPPHNAILLVNDGERAIRAYVAPAFHGFHPVIAGLEIEAERFELKSVMFQMLQTVGQFFGNPSEDPHLHLRYFLEIETFYN----
        +VE INE E+V  AAAPP N ILLV+DGER IRAY APA HGFHPVIAG  IEAERFELKS+MFQMLQTVGQFFGNPSEDPHLHLRYFLE+   +N    
Subjt:  MVERINEEEMVAPAAAPPHNAILLVNDGERAIRAYVAPAFHGFHPVIAGLEIEAERFELKSVMFQMLQTVGQFFGNPSEDPHLHLRYFLEIETFYN----

Query:  -----GLSDATHLVIDASDN--GALSSKSYVEAVDILERIYANNYH-----WSDSKAVTERNNHGANDNEAMAALMNQIANLTNMLKHMSTATTSSNNPT
              L    +L+ D + +   ++ ++S     D+ E+ +   +       SDSKAV ERNNH ANDNEAMAAL +QIANLTNM+K+M+TATTSSN+P 
Subjt:  -----GLSDATHLVIDASDN--GALSSKSYVEAVDILERIYANNYH-----WSDSKAVTERNNHGANDNEAMAALMNQIANLTNMLKHMSTATTSSNNPT

Query:  TSKVMSIFCSYCEGEHHYDSCPGNPTSVFYLENPNRKNRLLMPSQQQGTRPAATSSNSMEAMMREYMARNDALIQSQAATLRNLEVQMGQKASKLKSRPQ
          + +                P N  +  Y +   ++  LLMP+QQQGTRPAA SSNSME MMREYM RNDALIQSQAA  RNLEVQ+GQ A+KLK+RP 
Subjt:  TSKVMSIFCSYCEGEHHYDSCPGNPTSVFYLENPNRKNRLLMPSQQQGTRPAATSSNSMEAMMREYMARNDALIQSQAATLRNLEVQMGQKASKLKSRPQ

Query:  GILPSDIENLKREGKEQCQALTLRCGKALPPPYPVQRQTDKVEHPAEPQLESPEDDAMENMPINEDK
                                           + QTDK+EH AEPQLE+ E+D  ENMPI+EDK
Subjt:  GILPSDIENLKREGKEQCQALTLRCGKALPPPYPVQRQTDKVEHPAEPQLESPEDDAMENMPINEDK

A0A6J1DXK5 uncharacterized protein LOC1110255001.2e-4848.1Show/hide
Query:  LEIETFYNGLSDATHLVIDASDNGALSSKSYVEAVDILERIYANNYHWSDSKAVTERNNHGANDNEAMAALMNQIANLTNM----LKHMSTATTSSNNPT
        ++I+T+YNGL DAT LVIDAS NGAL +K Y EA +ILERI +NN  WSD +A+  + + G N++E+  AL  +I NLT++    + H ST   S+    
Subjt:  LEIETFYNGLSDATHLVIDASDNGALSSKSYVEAVDILERIYANNYHWSDSKAVTERNNHGANDNEAMAALMNQIANLTNM----LKHMSTATTSSNNPT

Query:  TSKVMSIFCSYCEGEHHYDSCPGNPTSVFYLEN-PNRKNR--LLMPSQQQGTRPAATSSNSMEAMMREYMARNDALIQSQAATLRNLEVQMGQKASKLKS
         S +  I CS+C GE+ Y++CPGNP SV YL N  N +N    +  +   GT          +  M +YM  ND  +QSQA +LRNLE+Q+GQ A+ LKS
Subjt:  TSKVMSIFCSYCEGEHHYDSCPGNPTSVFYLEN-PNRKNR--LLMPSQQQGTRPAATSSNSMEAMMREYMARNDALIQSQAATLRNLEVQMGQKASKLKS

Query:  RPQGILPSDIENLKREGKEQCQALTLRCGKALPPPYP
        +P+G+LPSDI+  KR+GKEQC ALTLR GK LP  +P
Subjt:  RPQGILPSDIENLKREGKEQCQALTLRCGKALPPPYP

A0A6J1G7Q6 uncharacterized protein LOC1114515981.5e-4131.7Show/hide
Query:  NAILLVNDGERAIRAYVAPAFHGFHPVIAGLEIEAERFELKSVMFQMLQTVGQFFGNPSEDPHLHLRYFL------------------------------
        NAI + +D ERAIRAY  PA    +P I   E++A  FELK VMFQMLQT+GQF G  S+DPHLHL+ FL                              
Subjt:  NAILLVNDGERAIRAYVAPAFHGFHPVIAGLEIEAERFELKSVMFQMLQTVGQFFGNPSEDPHLHLRYFL------------------------------

Query:  ------------------------------------------------------------------------EIETFYNGLSDATHLVIDASDNGALSSK
                                                                                +IETFYNGL+ AT  V+DAS NG + SK
Subjt:  ------------------------------------------------------------------------EIETFYNGLSDATHLVIDASDNGALSSK

Query:  SYVEAVDILERIYANNYHWSDSKAVTERNNHGANDNEAMAALMNQIANLTNMLKHMS------------TATTSSNNPTTSKVMSIFCSYCEGEHHYDSC
        +Y EA +ILERI +NN  W D ++   +      + +A++++  Q+A++TN+L++++            TAT      T S      C YC  +H +D C
Subjt:  SYVEAVDILERIYANNYHWSDSKAVTERNNHGANDNEAMAALMNQIANLTNMLKHMS------------TATTSSNNPTTSKVMSIFCSYCEGEHHYDSC

Query:  PGNPTSVFYLEN-----------------------PN------------------------RKNRLLMPSQQQGTRPAAT------SSNSMEAMMREYMA
        P NP S+FY+ N                       PN                         +N+L   SQQ  T+   T      S   +E++++EYMA
Subjt:  PGNPTSVFYLEN-----------------------PN------------------------RKNRLLMPSQQQGTRPAAT------SSNSMEAMMREYMA

Query:  RNDALIQSQAATLRNLEVQMGQKASKLKSRPQGILPSDIENLKREGKE
        RNDA+IQSQ  +LRNLEVQ+GQ A++L++RP G LP+D E  KREG E
Subjt:  RNDALIQSQAATLRNLEVQMGQKASKLKSRPQGILPSDIENLKREGKE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTGAAAGAATTAACGAGGAAGAGATGGTTGCGCCAGCAGCTGCTCCCCCTCATAACGCCATTCTGTTGGTAAATGATGGAGAAAGAGCCATCAGGGCCTAT
GTTGCACCAGCATTTCATGGGTTTCATCCAGTAATAGCAGGCCTAGAGATAGAAGCTGAGAGGTTTGAATTAAAATCGGTTATGTTCCAGATGCTCCAAACAGTG
GGGCAATTCTTTGGAAACCCATCTGAGGACCCTCATCTACATTTGAGGTATTTTCTGGAAATAGAGACCTTCTACAATGGACTGAGTGATGCAACACATCTGGTA
ATTGACGCATCAGATAATGGAGCATTGTCATCTAAGTCGTACGTAGAAGCAGTTGATATATTGGAAAGAATTTATGCTAATAATTATCACTGGTCAGATTCCAAA
GCGGTAACTGAGAGGAACAATCATGGAGCTAATGATAATGAGGCAATGGCTGCACTGATGAATCAAATTGCCAATTTAACCAACATGTTAAAGCACATGAGCACA
GCTACAACATCATCGAATAATCCTACAACGAGTAAAGTGATGTCCATCTTTTGCTCTTATTGCGAGGGTGAACACCATTATGATTCATGCCCCGGGAACCCAACA
TCTGTTTTCTATCTAGAAAACCCAAACAGAAAAAATCGATTGTTAATGCCCAGTCAGCAACAAGGAACAAGACCAGCAGCAACTTCCTCCAACTCAATGGAAGCC
ATGATGAGAGAATATATGGCCAGAAACGATGCACTGATTCAGAGTCAAGCTGCAACGCTAAGAAATTTAGAAGTTCAAATGGGGCAAAAAGCTAGCAAGCTAAAG
AGCAGACCTCAGGGTATCTTGCCGAGCGATATTGAAAACCTAAAAAGGGAAGGGAAAGAGCAGTGTCAAGCCTTGACCCTGCGTTGTGGAAAAGCGTTACCACCA
CCATATCCTGTGCAAAGGCAAACAGACAAAGTAGAACATCCTGCGGAACCACAGTTAGAAAGTCCAGAAGATGATGCTATGGAGAATATGCCAATCAATGAAGAC
AAAGCTGACGGTGCTGGAGCATCGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGTTGAAAGAATTAACGAGGAAGAGATGGTTGCGCCAGCAGCTGCTCCCCCTCATAACGCCATTCTGTTGGTAAATGATGGAGAAAGAGCCATCAGGGCCTAT
GTTGCACCAGCATTTCATGGGTTTCATCCAGTAATAGCAGGCCTAGAGATAGAAGCTGAGAGGTTTGAATTAAAATCGGTTATGTTCCAGATGCTCCAAACAGTG
GGGCAATTCTTTGGAAACCCATCTGAGGACCCTCATCTACATTTGAGGTATTTTCTGGAAATAGAGACCTTCTACAATGGACTGAGTGATGCAACACATCTGGTA
ATTGACGCATCAGATAATGGAGCATTGTCATCTAAGTCGTACGTAGAAGCAGTTGATATATTGGAAAGAATTTATGCTAATAATTATCACTGGTCAGATTCCAAA
GCGGTAACTGAGAGGAACAATCATGGAGCTAATGATAATGAGGCAATGGCTGCACTGATGAATCAAATTGCCAATTTAACCAACATGTTAAAGCACATGAGCACA
GCTACAACATCATCGAATAATCCTACAACGAGTAAAGTGATGTCCATCTTTTGCTCTTATTGCGAGGGTGAACACCATTATGATTCATGCCCCGGGAACCCAACA
TCTGTTTTCTATCTAGAAAACCCAAACAGAAAAAATCGATTGTTAATGCCCAGTCAGCAACAAGGAACAAGACCAGCAGCAACTTCCTCCAACTCAATGGAAGCC
ATGATGAGAGAATATATGGCCAGAAACGATGCACTGATTCAGAGTCAAGCTGCAACGCTAAGAAATTTAGAAGTTCAAATGGGGCAAAAAGCTAGCAAGCTAAAG
AGCAGACCTCAGGGTATCTTGCCGAGCGATATTGAAAACCTAAAAAGGGAAGGGAAAGAGCAGTGTCAAGCCTTGACCCTGCGTTGTGGAAAAGCGTTACCACCA
CCATATCCTGTGCAAAGGCAAACAGACAAAGTAGAACATCCTGCGGAACCACAGTTAGAAAGTCCAGAAGATGATGCTATGGAGAATATGCCAATCAATGAAGAC
AAAGCTGACGGTGCTGGAGCATCGTAA
Protein sequenceShow/hide protein sequence
MVERINEEEMVAPAAAPPHNAILLVNDGERAIRAYVAPAFHGFHPVIAGLEIEAERFELKSVMFQMLQTVGQFFGNPSEDPHLHLRYFLEIETFYNGLSDATHLV
IDASDNGALSSKSYVEAVDILERIYANNYHWSDSKAVTERNNHGANDNEAMAALMNQIANLTNMLKHMSTATTSSNNPTTSKVMSIFCSYCEGEHHYDSCPGNPT
SVFYLENPNRKNRLLMPSQQQGTRPAATSSNSMEAMMREYMARNDALIQSQAATLRNLEVQMGQKASKLKSRPQGILPSDIENLKREGKEQCQALTLRCGKALPP
PYPVQRQTDKVEHPAEPQLESPEDDAMENMPINEDKADGAGAS