; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr023927 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr023927
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Descriptiontranscription factor SCREAM2-like isoform X1
Genome locationtig00001047:1539965..1541623
RNA-Seq ExpressionSgr023927
SyntenySgr023927
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
GO:0046983 - protein dimerization activity (molecular function)
InterPro domainsIPR036638 - Helix-loop-helix DNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022134038.1 uncharacterized protein LOC111006410, partial [Momordica charantia]9.5e-6490.32Show/hide
Query:  MVSREHKKEGLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLPMQVTVETLVKGFSINVFSEKSCAGLLVSVLEAF
        MVSREHKK  LHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQD   VQNSIHPNPLPM VTVETLVKGFSINVFSEKSC GLLVS+LE F
Subjt:  MVSREHKKEGLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLPMQVTVETLVKGFSINVFSEKSCAGLLVSVLEAF

Query:  EELGLNVLEARVSCTDSFQLQAIGELDEQGEETIDAQAVKEAVVQAIRNCSENSE
        EELGLNVLEARVSCTDSFQLQAIGE+DEQGEE IDAQAVKEAVVQAI++ SE+SE
Subjt:  EELGLNVLEARVSCTDSFQLQAIGELDEQGEETIDAQAVKEAVVQAIRNCSENSE

XP_022964712.1 uncharacterized protein LOC111464709 isoform X1 [Cucurbita moschata]4.3e-6486.62Show/hide
Query:  MVSREHKKEGLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLPMQVTVETLVKGFSINVFSEKSCAGLLVSVLEAF
        MVSREHKK  LHEKLQLLRSITNSHALNK SIIVDASKYIEELKQKVERLNQDIATVQNSIHPN  PMQVTVE LVKGFSINVFSEKSC GLLVS+LEAF
Subjt:  MVSREHKKEGLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLPMQVTVETLVKGFSINVFSEKSCAGLLVSVLEAF

Query:  EELGLNVLEARVSCTDSFQLQAIGELDEQGEETIDAQAVKEAVVQAIRNCSENSEQD
        EELGLNVLEARVSCTD+FQLQA  E++EQGEE +DAQAVKEAVV+AI++ S+N EQD
Subjt:  EELGLNVLEARVSCTDSFQLQAIGELDEQGEETIDAQAVKEAVVQAIRNCSENSEQD

XP_038884496.1 uncharacterized protein LOC120075304 isoform X1 [Benincasa hispida]2.3e-6283.44Show/hide
Query:  MVSREHKKEGLHEKLQLLRSITNSHA-LNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPL-----PMQVTVETLVKGFSINVFSEKSCAGLLV
        MVSREHKK  LHEKLQLLRSITNSHA LNKASIIVDASKYIEELKQKVERLNQDI+TVQNSIHPNPL     PMQVTVE LVKGFSINVFSEKSC GLLV
Subjt:  MVSREHKKEGLHEKLQLLRSITNSHA-LNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPL-----PMQVTVETLVKGFSINVFSEKSCAGLLV

Query:  SVLEAFEELGLNVLEARVSCTDSFQLQAIGELDEQGEETIDAQAVKEAVVQAIRNCSENSEQD
        S+LE FEELGLNV+EARVSCTD+FQLQAI E++E+GEE IDAQAVKEAVVQAI++  ++ EQD
Subjt:  SVLEAFEELGLNVLEARVSCTDSFQLQAIGELDEQGEETIDAQAVKEAVVQAIRNCSENSEQD

XP_038884497.1 uncharacterized protein LOC120075304 isoform X2 [Benincasa hispida]9.5e-6483.95Show/hide
Query:  MVSREHKKEGLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPL-----PMQVTVETLVKGFSINVFSEKSCAGLLVS
        MVSREHKK  LHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDI+TVQNSIHPNPL     PMQVTVE LVKGFSINVFSEKSC GLLVS
Subjt:  MVSREHKKEGLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPL-----PMQVTVETLVKGFSINVFSEKSCAGLLVS

Query:  VLEAFEELGLNVLEARVSCTDSFQLQAIGELDEQGEETIDAQAVKEAVVQAIRNCSENSEQD
        +LE FEELGLNV+EARVSCTD+FQLQAI E++E+GEE IDAQAVKEAVVQAI++  ++ EQD
Subjt:  VLEAFEELGLNVLEARVSCTDSFQLQAIGELDEQGEETIDAQAVKEAVVQAIRNCSENSEQD

XP_038884499.1 uncharacterized protein LOC120075304 isoform X4 [Benincasa hispida]4.0e-6283.23Show/hide
Query:  MVSREHKKEGLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLPMQ----VTVETLVKGFSINVFSEKSCAGLLVSV
        MVSREHKK  LHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDI+TVQNSIHPNPL  Q    VTVE LVKGFSINVFSEKSC GLLVS+
Subjt:  MVSREHKKEGLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLPMQ----VTVETLVKGFSINVFSEKSCAGLLVSV

Query:  LEAFEELGLNVLEARVSCTDSFQLQAIGELDEQGEETIDAQAVKEAVVQAIRNCSENSEQD
        LE FEELGLNV+EARVSCTD+FQLQAI E++E+GEE IDAQAVKEAVVQAI++  ++ EQD
Subjt:  LEAFEELGLNVLEARVSCTDSFQLQAIGELDEQGEETIDAQAVKEAVVQAIRNCSENSEQD

TrEMBL top hitse value%identityAlignment
A0A6J1C0X4 uncharacterized protein LOC1110064104.6e-6490.32Show/hide
Query:  MVSREHKKEGLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLPMQVTVETLVKGFSINVFSEKSCAGLLVSVLEAF
        MVSREHKK  LHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQD   VQNSIHPNPLPM VTVETLVKGFSINVFSEKSC GLLVS+LE F
Subjt:  MVSREHKKEGLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLPMQVTVETLVKGFSINVFSEKSCAGLLVSVLEAF

Query:  EELGLNVLEARVSCTDSFQLQAIGELDEQGEETIDAQAVKEAVVQAIRNCSENSE
        EELGLNVLEARVSCTDSFQLQAIGE+DEQGEE IDAQAVKEAVVQAI++ SE+SE
Subjt:  EELGLNVLEARVSCTDSFQLQAIGELDEQGEETIDAQAVKEAVVQAIRNCSENSE

A0A6J1HIH6 uncharacterized protein LOC111464709 isoform X12.1e-6486.62Show/hide
Query:  MVSREHKKEGLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLPMQVTVETLVKGFSINVFSEKSCAGLLVSVLEAF
        MVSREHKK  LHEKLQLLRSITNSHALNK SIIVDASKYIEELKQKVERLNQDIATVQNSIHPN  PMQVTVE LVKGFSINVFSEKSC GLLVS+LEAF
Subjt:  MVSREHKKEGLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLPMQVTVETLVKGFSINVFSEKSCAGLLVSVLEAF

Query:  EELGLNVLEARVSCTDSFQLQAIGELDEQGEETIDAQAVKEAVVQAIRNCSENSEQD
        EELGLNVLEARVSCTD+FQLQA  E++EQGEE +DAQAVKEAVV+AI++ S+N EQD
Subjt:  EELGLNVLEARVSCTDSFQLQAIGELDEQGEETIDAQAVKEAVVQAIRNCSENSEQD

A0A6J1HJR0 uncharacterized protein LOC111464709 isoform X22.5e-6285.99Show/hide
Query:  MVSREHKKEGLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLPMQVTVETLVKGFSINVFSEKSCAGLLVSVLEAF
        MVSREHKK  LHEKLQLLRSITNSHALNK SIIVDASKYIEELKQKVERLNQDIATVQNSIHPN  PM VTVE LVKGFSINVFSEKSC GLLVS+LEAF
Subjt:  MVSREHKKEGLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLPMQVTVETLVKGFSINVFSEKSCAGLLVSVLEAF

Query:  EELGLNVLEARVSCTDSFQLQAIGELDEQGEETIDAQAVKEAVVQAIRNCSENSEQD
        EELGLNVLEARVSCTD+FQLQA  E++EQGEE +DAQAVKEAVV+AI++ S+N EQD
Subjt:  EELGLNVLEARVSCTDSFQLQAIGELDEQGEETIDAQAVKEAVVQAIRNCSENSEQD

A0A6J1JTS8 uncharacterized protein LOC111487778 isoform X22.5e-6285.99Show/hide
Query:  MVSREHKKEGLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLPMQVTVETLVKGFSINVFSEKSCAGLLVSVLEAF
        MVSREHKK  LHEKLQLLRSITNSHALNK SIIVDASKYIEELKQKVERLNQDIATVQNSIHPN  PM VTVE LVKGFSINVFSEKSC GLLVS+LEAF
Subjt:  MVSREHKKEGLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLPMQVTVETLVKGFSINVFSEKSCAGLLVSVLEAF

Query:  EELGLNVLEARVSCTDSFQLQAIGELDEQGEETIDAQAVKEAVVQAIRNCSENSEQD
        EELGLNVLEARVSCTD+FQLQA  E++EQGEE +DAQAVKEAVV+AI++ S+N EQD
Subjt:  EELGLNVLEARVSCTDSFQLQAIGELDEQGEETIDAQAVKEAVVQAIRNCSENSEQD

A0A6J1JV46 uncharacterized protein LOC111487778 isoform X12.1e-6486.62Show/hide
Query:  MVSREHKKEGLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLPMQVTVETLVKGFSINVFSEKSCAGLLVSVLEAF
        MVSREHKK  LHEKLQLLRSITNSHALNK SIIVDASKYIEELKQKVERLNQDIATVQNSIHPN  PMQVTVE LVKGFSINVFSEKSC GLLVS+LEAF
Subjt:  MVSREHKKEGLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLPMQVTVETLVKGFSINVFSEKSCAGLLVSVLEAF

Query:  EELGLNVLEARVSCTDSFQLQAIGELDEQGEETIDAQAVKEAVVQAIRNCSENSEQD
        EELGLNVLEARVSCTD+FQLQA  E++EQGEE +DAQAVKEAVV+AI++ S+N EQD
Subjt:  EELGLNVLEARVSCTDSFQLQAIGELDEQGEETIDAQAVKEAVVQAIRNCSENSEQD

SwissProt top hitse value%identityAlignment
Q9LPW3 Transcription factor SCREAM21.0e-0423.56Show/hide
Query:  MVSREHKKEGLHEKLQLLRSIT-NSHALNKASIIVDASKYIEELKQKVERLNQDIATV---QNSIH-----------------------PNPLPMQVTVE
        +++   +++ L+++L +LRS+      +++ASI+ DA  Y++EL Q++  L+ ++ +     +S+H                       P+P   Q  VE
Subjt:  MVSREHKKEGLHEKLQLLRSIT-NSHALNKASIIVDASKYIEELKQKVERLNQDIATV---QNSIH-----------------------PNPLPMQVTVE

Query:  TLV---KGFSINVFSEKSCAGLLVSVLEAFEELGLNVLEARVSCTDSFQLQAIGELDEQGEETIDAQAVKEAVV
          +   K  +I++F  +   GLL+S + A + LGL+V +A +SC + F L        Q +  +  + +K  ++
Subjt:  TLV---KGFSINVFSEKSCAGLLVSVLEAFEELGLNVLEARVSCTDSFQLQAIGELDEQGEETIDAQAVKEAVV

Q9LSE2 Transcription factor ICE11.0e-0424.86Show/hide
Query:  MVSREHKKEGLHEKLQLLRSIT-NSHALNKASIIVDASKYIEELKQKVERLNQDIAT--------VQNSIH----------------------PNPLPMQ
        +++   +++ L+++L +LRS+      +++ASI+ DA  Y++EL Q++  L+ ++ +          +S H                      P+P   Q
Subjt:  MVSREHKKEGLHEKLQLLRSIT-NSHALNKASIIVDASKYIEELKQKVERLNQDIAT--------VQNSIH----------------------PNPLPMQ

Query:  VTVET-LVKGFSINVFSEKSCA---GLLVSVLEAFEELGLNVLEARVSCTDSFQLQAI-GELDEQGEETIDAQ
          VE  L +G ++N+     C    GLL++ ++A + LGL+V +A +SC + F L     E  ++G+E +  Q
Subjt:  VTVET-LVKGFSINVFSEKSCA---GLLVSVLEAFEELGLNVLEARVSCTDSFQLQAI-GELDEQGEETIDAQ

Q9LSL1 Transcription factor bHLH931.4e-0427.33Show/hide
Query:  MVSREHKKEGLHEKLQLLRSIT-NSHALNKASIIVDASKYIEELKQKVERL---NQDIATVQNSIHP---NPLPMQVTVETLVKG---FSINVFSEKS--
        +++   +++ L+++L +LRSI      +++ SI+ DA  Y++EL  K+ +L    Q++    NS H      L      E LV+    F I+   E +  
Subjt:  MVSREHKKEGLHEKLQLLRSIT-NSHALNKASIIVDASKYIEELKQKVERL---NQDIATVQNSIHP---NPLPMQVTVETLVKG---FSINVFSEKS--

Query:  ---CA---GLLVSVLEAFEELGLNVLEARVSCTDSFQLQAIGELDEQGEETIDAQAVKEAV
           C+   GLL+S +   E LGL + +  +SC   F LQA      +  + I ++ +K+A+
Subjt:  ---CA---GLLVSVLEAFEELGLNVLEARVSCTDSFQLQAIGELDEQGEETIDAQAVKEAV

Q9LXA9 Transcription factor bHLH616.5e-0727.67Show/hide
Query:  MVSREHKKEGLHEKLQLLRSIT-NSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLPMQVTVETLVKGF--------SINVFSEKSC--
        +++   +++ L+++L LLRSI      +++ SI+ DA  Y++EL  K+ +L +D   + ++ H + L   +T E++V+           +N   +  C  
Subjt:  MVSREHKKEGLHEKLQLLRSIT-NSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLPMQVTVETLVKGF--------SINVFSEKSC--

Query:  -AGLLVSVLEAFEELGLNVLEARVSCTDSFQLQAIGELDEQGEE--TIDAQAVKEAVVQ
          GL+VS +   E LGL + +  +SC   F LQA     E GE+   + ++A K+A+++
Subjt:  -AGLLVSVLEAFEELGLNVLEARVSCTDSFQLQAIGELDEQGEE--TIDAQAVKEAVVQ

Arabidopsis top hitse value%identityAlignment
AT1G12860.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein7.4e-0623.56Show/hide
Query:  MVSREHKKEGLHEKLQLLRSIT-NSHALNKASIIVDASKYIEELKQKVERLNQDIATV---QNSIH-----------------------PNPLPMQVTVE
        +++   +++ L+++L +LRS+      +++ASI+ DA  Y++EL Q++  L+ ++ +     +S+H                       P+P   Q  VE
Subjt:  MVSREHKKEGLHEKLQLLRSIT-NSHALNKASIIVDASKYIEELKQKVERLNQDIATV---QNSIH-----------------------PNPLPMQVTVE

Query:  TLV---KGFSINVFSEKSCAGLLVSVLEAFEELGLNVLEARVSCTDSFQLQAIGELDEQGEETIDAQAVKEAVV
          +   K  +I++F  +   GLL+S + A + LGL+V +A +SC + F L        Q +  +  + +K  ++
Subjt:  TLV---KGFSINVFSEKSCAGLLVSVLEAFEELGLNVLEARVSCTDSFQLQAIGELDEQGEETIDAQAVKEAVV

AT1G29270.1 unknown protein1.3e-1034.88Show/hide
Query:  MVSREHKKEGLHEKLQLLRSITN-SHALNKASIIV-DASKYIEELKQKVERLNQDI----ATVQNSIHPNPLPMQVTVETLVKGFSINVFSEKSCAGLLV
        MV+ E KK     K   L+++T+   ++++ S+++ +A  YI  LK ++E L ++      T + S+H      +V VE + + F + + S +     LV
Subjt:  MVSREHKKEGLHEKLQLLRSITN-SHALNKASIIV-DASKYIEELKQKVERLNQDI----ATVQNSIHPNPLPMQVTVETLVKGFSINVFSEKSCAGLLV

Query:  SVLEAFEELGLNVLEARVSCTDSFQLQAI
        ++LEAFEE+GLNV +AR SC DSF ++AI
Subjt:  SVLEAFEELGLNVLEARVSCTDSFQLQAI

AT2G40435.1 BEST Arabidopsis thaliana protein match is: transcription regulators (TAIR:AT3G56220.1)3.4e-4365.36Show/hide
Query:  MVSREHKKEGLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHP-NPLPMQVTVETLVKGFSINVFSEKSCAGLLVSVLEA
        MVSRE K+  L EK QLLRSITNSHA N  SII+DASKYI++LKQKVER NQD    Q+S  P +P    VTVETL KGF INVFS K+  G+LVSVLEA
Subjt:  MVSREHKKEGLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHP-NPLPMQVTVETLVKGFSINVFSEKSCAGLLVSVLEA

Query:  FEELGLNVLEARVSCTDSFQLQAIGELDEQGEETIDAQAVKEAVVQAIRNCSE
        FE++GLNVLEAR SCTDSF L A+G  +E G E +DA+AVK+AV  AIR+  E
Subjt:  FEELGLNVLEARVSCTDSFQLQAIGELDEQGEETIDAQAVKEAVVQAIRNCSE

AT3G56220.1 transcription regulators1.9e-3858.6Show/hide
Query:  MVSREHKK-EGLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDIATVQN---SIHPNPLPMQVTVETLVKGFSINVFSEKSCAGLLVSV
        MVSREHK+   L EK  LLRSIT+SHA ++ SIIVDASKYI++LKQKVE++N    + Q+   S  PNP+   VTVETL KGF I V S K+ AG+LV V
Subjt:  MVSREHKK-EGLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDIATVQN---SIHPNPLPMQVTVETLVKGFSINVFSEKSCAGLLVSV

Query:  LEAFEELGLNVLEARVSCTDSFQLQAIGELDEQGEETIDAQAVKEAVVQAIRNCSEN
        LE FE+LGL+V+EARVSCTD+F L AIG  +    + IDA+AVK+AV +AIR  S++
Subjt:  LEAFEELGLNVLEARVSCTDSFQLQAIGELDEQGEETIDAQAVKEAVVQAIRNCSEN

AT5G10570.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein4.6e-0827.67Show/hide
Query:  MVSREHKKEGLHEKLQLLRSIT-NSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLPMQVTVETLVKGF--------SINVFSEKSC--
        +++   +++ L+++L LLRSI      +++ SI+ DA  Y++EL  K+ +L +D   + ++ H + L   +T E++V+           +N   +  C  
Subjt:  MVSREHKKEGLHEKLQLLRSIT-NSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLPMQVTVETLVKGF--------SINVFSEKSC--

Query:  -AGLLVSVLEAFEELGLNVLEARVSCTDSFQLQAIGELDEQGEE--TIDAQAVKEAVVQ
          GL+VS +   E LGL + +  +SC   F LQA     E GE+   + ++A K+A+++
Subjt:  -AGLLVSVLEAFEELGLNVLEARVSCTDSFQLQAIGELDEQGEE--TIDAQAVKEAVVQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTTCTAGGGAGCACAAGAAGGAAGGCCTGCATGAGAAGCTCCAATTACTTCGTTCTATTACCAACTCTCATGCTCTAAACAAGGCCTCGATTATAGTGGATGCGTC
AAAATATATCGAGGAGCTAAAACAGAAAGTAGAAAGATTGAATCAAGATATAGCAACCGTTCAAAATTCAATCCACCCAAATCCACTTCCCATGCAGGTTACAGTGGAAA
CCCTAGTGAAGGGATTTTCTATAAATGTATTTTCAGAAAAGAGCTGTGCAGGCCTCCTTGTCTCCGTATTAGAAGCCTTTGAAGAGCTGGGGCTTAATGTTCTTGAAGCT
AGGGTTTCCTGTACTGATAGTTTCCAATTACAAGCTATTGGAGAACTTGACGAACAAGGTGAAGAAACCATTGATGCTCAAGCTGTGAAAGAAGCAGTAGTCCAAGCTAT
AAGGAACTGCAGCGAAAACAGCGAACAAGACTAA
mRNA sequenceShow/hide mRNA sequence
ATGGTTTCTAGGGAGCACAAGAAGGAAGGCCTGCATGAGAAGCTCCAATTACTTCGTTCTATTACCAACTCTCATGCTCTAAACAAGGCCTCGATTATAGTGGATGCGTC
AAAATATATCGAGGAGCTAAAACAGAAAGTAGAAAGATTGAATCAAGATATAGCAACCGTTCAAAATTCAATCCACCCAAATCCACTTCCCATGCAGGTTACAGTGGAAA
CCCTAGTGAAGGGATTTTCTATAAATGTATTTTCAGAAAAGAGCTGTGCAGGCCTCCTTGTCTCCGTATTAGAAGCCTTTGAAGAGCTGGGGCTTAATGTTCTTGAAGCT
AGGGTTTCCTGTACTGATAGTTTCCAATTACAAGCTATTGGAGAACTTGACGAACAAGGTGAAGAAACCATTGATGCTCAAGCTGTGAAAGAAGCAGTAGTCCAAGCTAT
AAGGAACTGCAGCGAAAACAGCGAACAAGACTAA
Protein sequenceShow/hide protein sequence
MVSREHKKEGLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLPMQVTVETLVKGFSINVFSEKSCAGLLVSVLEAFEELGLNVLEA
RVSCTDSFQLQAIGELDEQGEETIDAQAVKEAVVQAIRNCSENSEQD