; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr019523 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr019523
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Descriptiontranscription factor bHLH162-like
Genome locationtig00153348:468282..469617
RNA-Seq ExpressionSgr019523
SyntenySgr019523
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0046983 - protein dimerization activity (molecular function)
InterPro domainsIPR015660 - Achaete-scute transcription factor-related
IPR036638 - Helix-loop-helix DNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6573268.1 Transcription factor basic helix-loop-helix 162, partial [Cucurbita argyrosperma subsp. sororia]2.0e-4353.6Show/hide
Query:  MANNPTPHGSSSAPTDRKLVERNRRNQMKALYSKLYSLVPHQISRVSLSLSLSFIRVWLVRKCLNNPKKIFELQEAKTLPDQLEDATNYITMLKANLEKL
        MANN   H  SSA  DRKL+ERNRR +MKAL+S+L+SLVP+Q S                             +   TL DQLE+ATNYI  LK N+EKL
Subjt:  MANNPTPHGSSSAPTDRKLVERNRRNQMKALYSKLYSLVPHQISRVSLSLSLSFIRVWLVRKCLNNPKKIFELQEAKTLPDQLEDATNYITMLKANLEKL

Query:  KEKREKL--LGMEEPRGIIRRTCSESKSRLL-QVEVHQVGSSMEVVLVTGSDYHFVFRQTLLLLQEQGAEIVNVNQSMFADRVFHKIIAEVVGNGMAPGG
        KEK+EKL  LG E  R       SE K+RLL QVE HQVGSS+E +L TGSDYH V +Q L LLQE G +IV +N S   DRVFHKIIAE+VG G     
Subjt:  KEKREKL--LGMEEPRGIIRRTCSESKSRLL-QVEVHQVGSSMEVVLVTGSDYHFVFRQTLLLLQEQGAEIVNVNQSMFADRVFHKIIAEVVGNGMAPGG

Query:  SDGARICERVKKCVSQYKDGQH
         +G RICE VKK VSQYKD Q+
Subjt:  SDGARICERVKKCVSQYKDGQH

XP_004146605.1 transcription factor bHLH162 isoform X1 [Cucumis sativus]8.9e-4450.91Show/hide
Query:  PTPHGSSSAPTDRKLVERNRRNQMKALYSKLYSLVPHQISRVSLSLSLSFIRVWLVRKCLNNPKKIFELQEAKTLPDQLEDATNYITMLKANLEKLKEKR
        PTP       ++RK VERNRR +MKAL+S L SL+P+Q S                            ++  +T+PDQLEDATNYI  L+ N++KLKEK+
Subjt:  PTPHGSSSAPTDRKLVERNRRNQMKALYSKLYSLVPHQISRVSLSLSLSFIRVWLVRKCLNNPKKIFELQEAKTLPDQLEDATNYITMLKANLEKLKEKR

Query:  EKLLGM---EEPRGIIRRTCSESKSR---LLQVEVHQVGSSMEVVLVTGSDYHFVFRQTLLLLQEQGAEIVNVNQSMFADRVFHKIIAEVVGNGMAPGGS
        EKL+GM   EE  G  RR   E +++    + V+ HQ+GSS+EV L TGSDYHF  +Q L LLQ+ GAEI+NVNQSMF DRVFHKI A+V G G+  GG 
Subjt:  EKLLGM---EEPRGIIRRTCSESKSR---LLQVEVHQVGSSMEVVLVTGSDYHFVFRQTLLLLQEQGAEIVNVNQSMFADRVFHKIIAEVVGNGMAPGGS

Query:  DGARICERVKKCVSQYKDGQ
        DG RICE VKK VS+YKDG+
Subjt:  DGARICERVKKCVSQYKDGQ

XP_008442634.2 PREDICTED: transcription factor bHLH118-like [Cucumis melo]8.9e-4451.35Show/hide
Query:  PTPHGSSSAPTDRKLVERNRRNQMKALYSKLYSLVPHQISRVSLSLSLSFIRVWLVRKCLNNPKKIFELQEAKTLPDQLEDATNYITMLKANLEKLKEKR
        PTP       +DRK VERNRR +MK+L+S L SL+P+  SR                            +  +T+PDQLEDATNYI  L+ N++KLKEK+
Subjt:  PTPHGSSSAPTDRKLVERNRRNQMKALYSKLYSLVPHQISRVSLSLSLSFIRVWLVRKCLNNPKKIFELQEAKTLPDQLEDATNYITMLKANLEKLKEKR

Query:  EKLLGM------EEPRGIIRRTCS-ESKSR-LLQVEVHQVGSSMEVVLVTGSDYHFVFRQTLLLLQEQGAEIVNVNQSMFADRVFHKIIAEVVGNGMAPG
        E+L+G       EE  G  RR    E+K + LLQV+ HQ+GSS+EV L TGSDYHF+ +Q L LLQ+ GAEI+NVNQSMF DRVFHKI A+V G G+ PG
Subjt:  EKLLGM------EEPRGIIRRTCS-ESKSR-LLQVEVHQVGSSMEVVLVTGSDYHFVFRQTLLLLQEQGAEIVNVNQSMFADRVFHKIIAEVVGNGMAPG

Query:  GSDGARICERVKKCVSQYKDGQ
          DG RIC+ VKK VSQYKD Q
Subjt:  GSDGARICERVKKCVSQYKDGQ

XP_022994108.1 transcription factor bHLH162-like [Cucurbita maxima]8.1e-4552.38Show/hide
Query:  SSAPTDRKLVERNRRNQMKALYSKLYSLVPHQISRVSLSLSLSFIRVWLVRKCLNNPKKIFELQEAKTLPDQLEDATNYITMLKANLEKLKEKREKLLGM
        SSA TD++++ERNRR++MKAL S+L+SLVP+Q S                             +   TLPDQLE+ATNYI  LK N+EKLKEKREKL+G+
Subjt:  SSAPTDRKLVERNRRNQMKALYSKLYSLVPHQISRVSLSLSLSFIRVWLVRKCLNNPKKIFELQEAKTLPDQLEDATNYITMLKANLEKLKEKREKLLGM

Query:  EEPRGIIRRTCSESKSR-LLQVEVHQVGSSMEVVLVTGSDYHFVFRQTLLLLQEQGAEIVNVNQSMFADRVFHKIIAEVVGNGMAPGGSDGARICERVKK
         E   I     SE+K+R ++QVE H VGSS+E++L TGSDYH V RQ + LLQE G EIV++NQS  A+R FHKIIA++ G G    G+ G RICERVKK
Subjt:  EEPRGIIRRTCSESKSR-LLQVEVHQVGSSMEVVLVTGSDYHFVFRQTLLLLQEQGAEIVNVNQSMFADRVFHKIIAEVVGNGMAPGGSDGARICERVKK

Query:  CVSQYKDGQH
         VS YKD Q+
Subjt:  CVSQYKDGQH

XP_022994474.1 transcription factor bHLH167-like [Cucurbita maxima]8.9e-4453.18Show/hide
Query:  MANNPTPHGSSSAPTDRKLVERNRRNQMKALYSKLYSLVPHQISRVSLSLSLSFIRVWLVRKCLNNPKKIFELQEAKTLPDQLEDATNYITMLKANLEKL
        MANN   H  SS   DRKL+ERNRR +MKAL+S+L+SLVP+Q S                             +   TL DQLE+ATNYI  LK N+EKL
Subjt:  MANNPTPHGSSSAPTDRKLVERNRRNQMKALYSKLYSLVPHQISRVSLSLSLSFIRVWLVRKCLNNPKKIFELQEAKTLPDQLEDATNYITMLKANLEKL

Query:  KEKREKLLGMEEPRGIIRRTCSESKSRLL-QVEVHQVGSSMEVVLVTGSDYHFVFRQTLLLLQEQGAEIVNVNQSMFADRVFHKIIAEVVGNGMAPGGSD
        KEKREKL+G  E          E+K+RLL QVE HQVGSS+EV+L TGSDY FV  Q L LLQE G +IV +N S   DRVFHKIIAE+VG G     + 
Subjt:  KEKREKLLGMEEPRGIIRRTCSESKSRLL-QVEVHQVGSSMEVVLVTGSDYHFVFRQTLLLLQEQGAEIVNVNQSMFADRVFHKIIAEVVGNGMAPGGSD

Query:  GARICERVKKCVSQYKDGQH
        G RICE VKK VSQYKD Q+
Subjt:  GARICERVKKCVSQYKDGQH

TrEMBL top hitse value%identityAlignment
A0A0A0LUL3 BHLH domain-containing protein4.3e-4450.91Show/hide
Query:  PTPHGSSSAPTDRKLVERNRRNQMKALYSKLYSLVPHQISRVSLSLSLSFIRVWLVRKCLNNPKKIFELQEAKTLPDQLEDATNYITMLKANLEKLKEKR
        PTP       ++RK VERNRR +MKAL+S L SL+P+Q S                            ++  +T+PDQLEDATNYI  L+ N++KLKEK+
Subjt:  PTPHGSSSAPTDRKLVERNRRNQMKALYSKLYSLVPHQISRVSLSLSLSFIRVWLVRKCLNNPKKIFELQEAKTLPDQLEDATNYITMLKANLEKLKEKR

Query:  EKLLGM---EEPRGIIRRTCSESKSR---LLQVEVHQVGSSMEVVLVTGSDYHFVFRQTLLLLQEQGAEIVNVNQSMFADRVFHKIIAEVVGNGMAPGGS
        EKL+GM   EE  G  RR   E +++    + V+ HQ+GSS+EV L TGSDYHF  +Q L LLQ+ GAEI+NVNQSMF DRVFHKI A+V G G+  GG 
Subjt:  EKLLGM---EEPRGIIRRTCSESKSR---LLQVEVHQVGSSMEVVLVTGSDYHFVFRQTLLLLQEQGAEIVNVNQSMFADRVFHKIIAEVVGNGMAPGGS

Query:  DGARICERVKKCVSQYKDGQ
        DG RICE VKK VS+YKDG+
Subjt:  DGARICERVKKCVSQYKDGQ

A0A1S3B660 transcription factor bHLH118-like4.3e-4451.35Show/hide
Query:  PTPHGSSSAPTDRKLVERNRRNQMKALYSKLYSLVPHQISRVSLSLSLSFIRVWLVRKCLNNPKKIFELQEAKTLPDQLEDATNYITMLKANLEKLKEKR
        PTP       +DRK VERNRR +MK+L+S L SL+P+  SR                            +  +T+PDQLEDATNYI  L+ N++KLKEK+
Subjt:  PTPHGSSSAPTDRKLVERNRRNQMKALYSKLYSLVPHQISRVSLSLSLSFIRVWLVRKCLNNPKKIFELQEAKTLPDQLEDATNYITMLKANLEKLKEKR

Query:  EKLLGM------EEPRGIIRRTCS-ESKSR-LLQVEVHQVGSSMEVVLVTGSDYHFVFRQTLLLLQEQGAEIVNVNQSMFADRVFHKIIAEVVGNGMAPG
        E+L+G       EE  G  RR    E+K + LLQV+ HQ+GSS+EV L TGSDYHF+ +Q L LLQ+ GAEI+NVNQSMF DRVFHKI A+V G G+ PG
Subjt:  EKLLGM------EEPRGIIRRTCS-ESKSR-LLQVEVHQVGSSMEVVLVTGSDYHFVFRQTLLLLQEQGAEIVNVNQSMFADRVFHKIIAEVVGNGMAPG

Query:  GSDGARICERVKKCVSQYKDGQ
          DG RIC+ VKK VSQYKD Q
Subjt:  GSDGARICERVKKCVSQYKDGQ

A0A6J1GT92 uncharacterized protein LOC1114572291.6e-4353.15Show/hide
Query:  MANNPTPHGSSSAPTDRKLVERNRRNQMKALYSKLYSLVPHQISRVSLSLSLSFIRVWLVRKCLNNPKKIFELQEAKTLPDQLEDATNYITMLKANLEKL
        MANN   H  SSA  DRKL+ERNRR +MKAL+S+L+SLVP+Q S                            ++   TL DQLE+ATNYI  LK N+EKL
Subjt:  MANNPTPHGSSSAPTDRKLVERNRRNQMKALYSKLYSLVPHQISRVSLSLSLSFIRVWLVRKCLNNPKKIFELQEAKTLPDQLEDATNYITMLKANLEKL

Query:  KEKREKL--LGMEEPRGIIRRTCSESKSRLL-QVEVHQVGSSMEVVLVTGSDYHFVFRQTLLLLQEQGAEIVNVNQSMFADRVFHKIIAEVVGNGMAPGG
        KEK+EKL  LG E  R       SE K+RLL QVE HQVGSS+E +L T SDYH V +Q L LLQE G +IV +N S   DRVFHKIIAE+VG G     
Subjt:  KEKREKL--LGMEEPRGIIRRTCSESKSRLL-QVEVHQVGSSMEVVLVTGSDYHFVFRQTLLLLQEQGAEIVNVNQSMFADRVFHKIIAEVVGNGMAPGG

Query:  SDGARICERVKKCVSQYKDGQH
         +G RICE VKK VSQYKD Q+
Subjt:  SDGARICERVKKCVSQYKDGQH

A0A6J1K2Y3 transcription factor bHLH167-like4.3e-4453.18Show/hide
Query:  MANNPTPHGSSSAPTDRKLVERNRRNQMKALYSKLYSLVPHQISRVSLSLSLSFIRVWLVRKCLNNPKKIFELQEAKTLPDQLEDATNYITMLKANLEKL
        MANN   H  SS   DRKL+ERNRR +MKAL+S+L+SLVP+Q S                             +   TL DQLE+ATNYI  LK N+EKL
Subjt:  MANNPTPHGSSSAPTDRKLVERNRRNQMKALYSKLYSLVPHQISRVSLSLSLSFIRVWLVRKCLNNPKKIFELQEAKTLPDQLEDATNYITMLKANLEKL

Query:  KEKREKLLGMEEPRGIIRRTCSESKSRLL-QVEVHQVGSSMEVVLVTGSDYHFVFRQTLLLLQEQGAEIVNVNQSMFADRVFHKIIAEVVGNGMAPGGSD
        KEKREKL+G  E          E+K+RLL QVE HQVGSS+EV+L TGSDY FV  Q L LLQE G +IV +N S   DRVFHKIIAE+VG G     + 
Subjt:  KEKREKLLGMEEPRGIIRRTCSESKSRLL-QVEVHQVGSSMEVVLVTGSDYHFVFRQTLLLLQEQGAEIVNVNQSMFADRVFHKIIAEVVGNGMAPGGSD

Query:  GARICERVKKCVSQYKDGQH
        G RICE VKK VSQYKD Q+
Subjt:  GARICERVKKCVSQYKDGQH

A0A6J1K483 transcription factor bHLH162-like3.9e-4552.38Show/hide
Query:  SSAPTDRKLVERNRRNQMKALYSKLYSLVPHQISRVSLSLSLSFIRVWLVRKCLNNPKKIFELQEAKTLPDQLEDATNYITMLKANLEKLKEKREKLLGM
        SSA TD++++ERNRR++MKAL S+L+SLVP+Q S                             +   TLPDQLE+ATNYI  LK N+EKLKEKREKL+G+
Subjt:  SSAPTDRKLVERNRRNQMKALYSKLYSLVPHQISRVSLSLSLSFIRVWLVRKCLNNPKKIFELQEAKTLPDQLEDATNYITMLKANLEKLKEKREKLLGM

Query:  EEPRGIIRRTCSESKSR-LLQVEVHQVGSSMEVVLVTGSDYHFVFRQTLLLLQEQGAEIVNVNQSMFADRVFHKIIAEVVGNGMAPGGSDGARICERVKK
         E   I     SE+K+R ++QVE H VGSS+E++L TGSDYH V RQ + LLQE G EIV++NQS  A+R FHKIIA++ G G    G+ G RICERVKK
Subjt:  EEPRGIIRRTCSESKSR-LLQVEVHQVGSSMEVVLVTGSDYHFVFRQTLLLLQEQGAEIVNVNQSMFADRVFHKIIAEVVGNGMAPGGSDGARICERVKK

Query:  CVSQYKDGQH
         VS YKD Q+
Subjt:  CVSQYKDGQH

SwissProt top hitse value%identityAlignment
F4I4E1 Transcription factor bHLH1671.0e-0526.7Show/hide
Query:  SSSAPTDRKLVERNRRNQMKALYSKLYSLVPHQISRVSLSLSLSFIRVWLVRKCLNNPKKIFELQEAKTLPDQLEDATNYITMLKANLEKLKEKREKLLG
        SSS    R L E++RR +MK L+S L S V                         +  +K+        +P  ++ AT+Y+  LK N+  LKEK+  LL 
Subjt:  SSSAPTDRKLVERNRRNQMKALYSKLYSLVPHQISRVSLSLSLSFIRVWLVRKCLNNPKKIFELQEAKTLPDQLEDATNYITMLKANLEKLKEKREKLLG

Query:  MEEPRGIIRRTCSESKSRLLQVEVHQVGSSMEVVLVTGSDYHFVFRQTLL-LLQEQGAEIVNVNQSMFADRVFHKIIAEVVGNGMAPGGSDGARICERVK
         E           E    L ++ +    S++E+ L+   +   V    L+ + +E+GA++++ N     DR  + IIA+ +   ++  G D +RI ERV+
Subjt:  MEEPRGIIRRTCSESKSRLLQVEVHQVGSSMEVVLVTGSDYHFVFRQTLL-LLQEQGAEIVNVNQSMFADRVFHKIIAEVVGNGMAPGGSDGARICERVK

Query:  KCVSQY
        K +  Y
Subjt:  KCVSQY

F4JIJ7 Transcription factor bHLH1621.9e-1733.02Show/hide
Query:  SAPTDRKLVERNRRNQMKALYSKLYSLVPHQISRVSLSLSLSFIRVWLVRKCLNNPKKIFELQEAKTLPDQLEDATNYITMLKANLEKLKEKREKLLG--
        S   DRK VE+NRR QMK+LYS+L SL+PH  S                              E  TLPDQL++A NYI  L+ N+EK +E++  L+   
Subjt:  SAPTDRKLVERNRRNQMKALYSKLYSLVPHQISRVSLSLSLSFIRVWLVRKCLNNPKKIFELQEAKTLPDQLEDATNYITMLKANLEKLKEKREKLLG--

Query:  -MEEPRGIIRRTCSES-----KSRLLQVEVHQVGSSMEVVLVTGSDYHFVFRQTL-LLLQEQGAEIVNVNQSMFADRVFHKIIAEVVGNGMAPGGSDGAR
         +E+   +   + S S       +L ++E+ + GS   + LVT  ++ F+F + + +L +E GAEI +   S+  D VFH +  +V  +        GAR
Subjt:  -MEEPRGIIRRTCSES-----KSRLLQVEVHQVGSSMEVVLVTGSDYHFVFRQTL-LLLQEQGAEIVNVNQSMFADRVFHKIIAEVVGNGMAPGGSDGAR

Query:  --ICERVKKCVS
          I ER++K V+
Subjt:  --ICERVKKCVS

Arabidopsis top hitse value%identityAlignment
AT1G10585.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein7.1e-0726.7Show/hide
Query:  SSSAPTDRKLVERNRRNQMKALYSKLYSLVPHQISRVSLSLSLSFIRVWLVRKCLNNPKKIFELQEAKTLPDQLEDATNYITMLKANLEKLKEKREKLLG
        SSS    R L E++RR +MK L+S L S V                         +  +K+        +P  ++ AT+Y+  LK N+  LKEK+  LL 
Subjt:  SSSAPTDRKLVERNRRNQMKALYSKLYSLVPHQISRVSLSLSLSFIRVWLVRKCLNNPKKIFELQEAKTLPDQLEDATNYITMLKANLEKLKEKREKLLG

Query:  MEEPRGIIRRTCSESKSRLLQVEVHQVGSSMEVVLVTGSDYHFVFRQTLL-LLQEQGAEIVNVNQSMFADRVFHKIIAEVVGNGMAPGGSDGARICERVK
         E           E    L ++ +    S++E+ L+   +   V    L+ + +E+GA++++ N     DR  + IIA+ +   ++  G D +RI ERV+
Subjt:  MEEPRGIIRRTCSESKSRLLQVEVHQVGSSMEVVLVTGSDYHFVFRQTLL-LLQEQGAEIVNVNQSMFADRVFHKIIAEVVGNGMAPGGSDGARICERVK

Query:  KCVSQY
        K +  Y
Subjt:  KCVSQY

AT4G20970.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein1.4e-1833.02Show/hide
Query:  SAPTDRKLVERNRRNQMKALYSKLYSLVPHQISRVSLSLSLSFIRVWLVRKCLNNPKKIFELQEAKTLPDQLEDATNYITMLKANLEKLKEKREKLLG--
        S   DRK VE+NRR QMK+LYS+L SL+PH  S                              E  TLPDQL++A NYI  L+ N+EK +E++  L+   
Subjt:  SAPTDRKLVERNRRNQMKALYSKLYSLVPHQISRVSLSLSLSFIRVWLVRKCLNNPKKIFELQEAKTLPDQLEDATNYITMLKANLEKLKEKREKLLG--

Query:  -MEEPRGIIRRTCSES-----KSRLLQVEVHQVGSSMEVVLVTGSDYHFVFRQTL-LLLQEQGAEIVNVNQSMFADRVFHKIIAEVVGNGMAPGGSDGAR
         +E+   +   + S S       +L ++E+ + GS   + LVT  ++ F+F + + +L +E GAEI +   S+  D VFH +  +V  +        GAR
Subjt:  -MEEPRGIIRRTCSES-----KSRLLQVEVHQVGSSMEVVLVTGSDYHFVFRQTL-LLLQEQGAEIVNVNQSMFADRVFHKIIAEVVGNGMAPGGSDGAR

Query:  --ICERVKKCVS
          I ER++K V+
Subjt:  --ICERVKKCVS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCAACAACCCCACCCCCCATGGTTCATCATCTGCACCTACCGACCGGAAACTCGTCGAGAGAAATAGAAGAAATCAAATGAAGGCCCTCTACTCCAAGCTCTATTC
TCTGGTTCCCCATCAAATCTCAAGGGTTTCTCTCTCTCTCTCTCTGTCTTTTATCCGTGTTTGGTTGGTGAGAAAGTGTTTGAATAACCCCAAGAAAATTTTCGAGCTGC
AGGAAGCGAAAACGTTGCCGGATCAGCTAGAAGACGCCACGAATTACATAACAATGCTGAAGGCGAACTTGGAGAAACTGAAAGAGAAGAGAGAGAAGCTACTGGGAATG
GAAGAACCTAGAGGAATAATAAGAAGAACATGCAGCGAAAGCAAGTCGAGATTGCTGCAAGTTGAAGTTCATCAAGTGGGTTCTTCAATGGAGGTCGTTTTGGTCACCGG
CTCTGATTATCACTTCGTTTTCAGACAAACCCTTCTGCTGCTTCAAGAACAGGGAGCTGAGATCGTCAATGTCAACCAGTCCATGTTCGCCGATCGAGTTTTCCACAAGA
TAATAGCTGAGGTGGTGGGAAATGGGATGGCCCCTGGAGGCAGTGATGGTGCAAGGATTTGTGAGAGAGTGAAGAAGTGTGTTTCACAGTACAAAGATGGCCAACATTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCCAACAACCCCACCCCCCATGGTTCATCATCTGCACCTACCGACCGGAAACTCGTCGAGAGAAATAGAAGAAATCAAATGAAGGCCCTCTACTCCAAGCTCTATTC
TCTGGTTCCCCATCAAATCTCAAGGGTTTCTCTCTCTCTCTCTCTGTCTTTTATCCGTGTTTGGTTGGTGAGAAAGTGTTTGAATAACCCCAAGAAAATTTTCGAGCTGC
AGGAAGCGAAAACGTTGCCGGATCAGCTAGAAGACGCCACGAATTACATAACAATGCTGAAGGCGAACTTGGAGAAACTGAAAGAGAAGAGAGAGAAGCTACTGGGAATG
GAAGAACCTAGAGGAATAATAAGAAGAACATGCAGCGAAAGCAAGTCGAGATTGCTGCAAGTTGAAGTTCATCAAGTGGGTTCTTCAATGGAGGTCGTTTTGGTCACCGG
CTCTGATTATCACTTCGTTTTCAGACAAACCCTTCTGCTGCTTCAAGAACAGGGAGCTGAGATCGTCAATGTCAACCAGTCCATGTTCGCCGATCGAGTTTTCCACAAGA
TAATAGCTGAGGTGGTGGGAAATGGGATGGCCCCTGGAGGCAGTGATGGTGCAAGGATTTGTGAGAGAGTGAAGAAGTGTGTTTCACAGTACAAAGATGGCCAACATTAA
Protein sequenceShow/hide protein sequence
MANNPTPHGSSSAPTDRKLVERNRRNQMKALYSKLYSLVPHQISRVSLSLSLSFIRVWLVRKCLNNPKKIFELQEAKTLPDQLEDATNYITMLKANLEKLKEKREKLLGM
EEPRGIIRRTCSESKSRLLQVEVHQVGSSMEVVLVTGSDYHFVFRQTLLLLQEQGAEIVNVNQSMFADRVFHKIIAEVVGNGMAPGGSDGARICERVKKCVSQYKDGQH