; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr014321 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr014321
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionSegmentation polarity homeobox protein engrailed
Genome locationtig00000289:482266..483090
RNA-Seq ExpressionSgr014321
SyntenySgr014321
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008441600.1 PREDICTED: putative protein TPRXL [Cucumis melo]2.9e-4454.64Show/hide
Query:  MIKQPARLDLNS-VQDKLVVISQP--PL----------TTPSLSLSNKISPYPPSPSSASTSSVSSFTCVSTSTANSSLSSATSRFKERERSPAGFSD-E
        M++QP   D N+ VQDKLVVI QP  PL           TPSLSL NKISPYPPSP S S+SS+SSFTC+S++T+    SS  + F     SP+  S   
Subjt:  MIKQPARLDLNS-VQDKLVVISQP--PL----------TTPSLSLSNKISPYPPSPSSASTSSVSSFTCVSTSTANSSLSSATSRFKERERSPAGFSD-E

Query:  FLWSCYKENPHVVRINSLKANALSSPGKVLRLNSPAKSVVRQPSPQRVLRSTPQKRVRPASPSPASNLTRQKSFKRE--ERPPPPPGPPSR--REKCRLA
        +  S Y +NPH+ RINSLKA+A  SP K +   SP   VVR PSPQRV RSTPQKR+RPASPSP     RQKSF++E  +RP   P P  R  REKC++A
Subjt:  FLWSCYKENPHVVRINSLKANALSSPGKVLRLNSPAKSVVRQPSPQRVLRSTPQKRVRPASPSPASNLTRQKSFKRE--ERPPPPPGPPSR--REKCRLA

Query:  PTTTAPAKERNDLRRKDHTTKAAGPNSHNNRSPAAARACAMKKE-TFIHRISSKIDGVAVGEAVSNQDFDSVVAMEDIDNPLISLDCFIFL
            AP    N +R K             +RSP   R  AMKKE T IHRISSKID VAV EAV   D DSVVAMED+DNPLISLDCFIFL
Subjt:  PTTTAPAKERNDLRRKDHTTKAAGPNSHNNRSPAAARACAMKKE-TFIHRISSKIDGVAVGEAVSNQDFDSVVAMEDIDNPLISLDCFIFL

XP_011657327.1 putative protein TPRXL [Cucumis sativus]1.8e-4153.98Show/hide
Query:  MIKQPARLDLNS--VQDKLVVISQP--PL-------TTPSLSLSNKISPYPPSPSSASTSSVSSFTCVSTSTANSSLSSATSRFKERERSPAGFSD-EFL
        M+KQP   D N+  VQDKLVVI QP  PL        TPSLSL NKISPYPPSP S S+SS+SSFTC+S++T     SS  + F     SP+  S   + 
Subjt:  MIKQPARLDLNS--VQDKLVVISQP--PL-------TTPSLSLSNKISPYPPSPSSASTSSVSSFTCVSTSTANSSLSSATSRFKERERSPAGFSD-EFL

Query:  WSCYKENPHVVRINSLKANALSSPGKVLRLNSPAKSVVRQPSPQRVLRSTPQKRVRPASPSPASNLTRQKSFKRE--ERPPPPPGPPSR--REKCRLAPT
         S Y +NPH+  INSLKA+A   P K +   SP   ++R PSPQRV RS PQKR RPASPSP     RQKSF++E  +RP   P P  R  REKC++A  
Subjt:  WSCYKENPHVVRINSLKANALSSPGKVLRLNSPAKSVVRQPSPQRVLRSTPQKRVRPASPSPASNLTRQKSFKRE--ERPPPPPGPPSR--REKCRLAPT

Query:  TTAPAKERNDLRRKDHTTKAAGPNSHNNRSPAAARACAMKKE-TFIHRISSKIDGVAVGEAVSNQDFDSVVAMEDIDNPLISLDCFIFL
          AP    N +R K             +RSP   R  AMKKE T IHRISSKID VAV EAV   D DSVVAMEDIDNPLISLDCFIFL
Subjt:  TTAPAKERNDLRRKDHTTKAAGPNSHNNRSPAAARACAMKKE-TFIHRISSKIDGVAVGEAVSNQDFDSVVAMEDIDNPLISLDCFIFL

XP_022939152.1 uncharacterized protein LOC111445147 [Cucurbita moschata]6.8e-4151.89Show/hide
Query:  PARLDLNS-VQDKLVVISQPP----------LTTPSLSLSNKISPYPPSPSSASTSSVSSFTCVSTSTANSSLSSATSRFKERERSPAGFSDEFLWSCYK
        P   D N+ VQDKLVVI QPP           + PSLSLSNKISPYPPSPS +S+S+  S +  +T+T NSS S+A+S      RSP     ++ WS Y 
Subjt:  PARLDLNS-VQDKLVVISQPP----------LTTPSLSLSNKISPYPPSPSSASTSSVSSFTCVSTSTANSSLSSATSRFKERERSPAGFSDEFLWSCYK

Query:  ENPHVVRINSLKANALSSPGKVLRLNSP-AKSVVRQPSPQRVLR------STPQKRVRPASPSPASNLTRQKSFKREERPPPPPGPPSR--REKCRLAPT
        +NPHVVRINSLKA+  S P     L SP  +  +R PSPQRV R      STPQKRVR ASPSP     RQKSF++E + P  P P  R   EKCR+A  
Subjt:  ENPHVVRINSLKANALSSPGKVLRLNSP-AKSVVRQPSPQRVLR------STPQKRVRPASPSPASNLTRQKSFKREERPPPPPGPPSR--REKCRLAPT

Query:  TTAPAKERNDLRRKDHTTKAAGPNSHNNRSPAAARACAMKKE--TFIHRISSKIDGVAVGEAVSNQ-DFDSVVAMEDIDNPLISLDCFIFL
            A           + K  G  S   RSP  AR C MKKE  T IHRISSKID  A  EAV N+ D DS  AMEDIDNPLISLDCFIFL
Subjt:  TTAPAKERNDLRRKDHTTKAAGPNSHNNRSPAAARACAMKKE--TFIHRISSKIDGVAVGEAVSNQ-DFDSVVAMEDIDNPLISLDCFIFL

XP_023550659.1 proline-rich receptor-like protein kinase PERK2 [Cucurbita pepo subsp. pepo]3.8e-4454.2Show/hide
Query:  PARLDLNS-VQDKLVVISQPP--------LTTPSLSLSNKISPYPPSPSSASTSSVSSFTCVSTSTANSSLSSATSRFKERERSPAGFSDEFLWSCYKEN
        P   D N+ VQDKLVVI QPP           PSLSLSNKISPYPPSPS +S+S+  S T  +T+T NSS S+A+S      RSP   S ++ WS Y +N
Subjt:  PARLDLNS-VQDKLVVISQPP--------LTTPSLSLSNKISPYPPSPSSASTSSVSSFTCVSTSTANSSLSSATSRFKERERSPAGFSDEFLWSCYKEN

Query:  PHVVRINSLKANALSSPGKVLRLNSPAKSVVRQ----PSPQRVLRSTPQKRVRPASPSPASNLTRQKSFKREERPPPPPGPPSR--REKCRLAPTTTAPA
        PHVVRINSLKA+A S P       +P   VVRQ    PSPQRV RSTPQKRVR ASPSP     RQKSF++E + P  P P  R   EKCR+A      A
Subjt:  PHVVRINSLKANALSSPGKVLRLNSPAKSVVRQ----PSPQRVLRSTPQKRVRPASPSPASNLTRQKSFKREERPPPPPGPPSR--REKCRLAPTTTAPA

Query:  KERNDLRRKDHTTKAAGPNSHNNRSPAAARACAMKKE--TFIHRISSKIDGVAVGEAVSNQ-DFDSVVAMEDIDNPLISLDCFIFL
                   + K  G  S   RSP  AR C MKKE  T IHRISSKID  A  EAV N+ D DS  AMEDIDNPLISLDCFIFL
Subjt:  KERNDLRRKDHTTKAAGPNSHNNRSPAAARACAMKKE--TFIHRISSKIDGVAVGEAVSNQ-DFDSVVAMEDIDNPLISLDCFIFL

XP_038886331.1 proline-rich receptor-like protein kinase PERK2 [Benincasa hispida]3.6e-4254.83Show/hide
Query:  MIKQPARLDLNS--VQDKLVVISQP--PL--------TTPSLSLSNKISPYPPSPSSASTSSVSSFTCVSTSTANSSLSSATSRFKERERSPAGFSDEFL
        M+KQP   D N+  VQDKLVVI QP  PL        T PSLSL+NKISPYPPSPS    SS+SSFTC+S+ST N+S S+A+S       SP+  S    
Subjt:  MIKQPARLDLNS--VQDKLVVISQP--PL--------TTPSLSLSNKISPYPPSPSSASTSSVSSFTCVSTSTANSSLSSATSRFKERERSPAGFSDEFL

Query:  W-SCYKENPHVVRINSLKANALSSPGKVLRLNSPAKSVVRQPSPQRVLRSTPQKRVRPASPSPASNLTRQKSFKREERPPPPPGP-PSR---REKCRLAP
        + S Y +   ++RINSLKA A   P K      P   +VR PSPQRVLRSTPQKRVRPASPSP     RQKSF++E  P P P P PSR   REKCR+A 
Subjt:  W-SCYKENPHVVRINSLKANALSSPGKVLRLNSPAKSVVRQPSPQRVLRSTPQKRVRPASPSPASNLTRQKSFKREERPPPPPGP-PSR---REKCRLAP

Query:  TTTAPAKERNDLRRKDHTTKAAGPNSHNNRSPAAARACAMKKE-TFIHRISSKIDGVAVGEAVSNQDFDSVVAMEDIDNPLISLDCFIFL
           AP                       +RSP  AR+  MKKE T IHRISSKID VAV EAV   D DSVVAMEDIDNPLISLDCFIFL
Subjt:  TTTAPAKERNDLRRKDHTTKAAGPNSHNNRSPAAARACAMKKE-TFIHRISSKIDGVAVGEAVSNQDFDSVVAMEDIDNPLISLDCFIFL

TrEMBL top hitse value%identityAlignment
A0A0A0KIF4 Uncharacterized protein8.6e-4253.98Show/hide
Query:  MIKQPARLDLNS--VQDKLVVISQP--PL-------TTPSLSLSNKISPYPPSPSSASTSSVSSFTCVSTSTANSSLSSATSRFKERERSPAGFSD-EFL
        M+KQP   D N+  VQDKLVVI QP  PL        TPSLSL NKISPYPPSP S S+SS+SSFTC+S++T     SS  + F     SP+  S   + 
Subjt:  MIKQPARLDLNS--VQDKLVVISQP--PL-------TTPSLSLSNKISPYPPSPSSASTSSVSSFTCVSTSTANSSLSSATSRFKERERSPAGFSD-EFL

Query:  WSCYKENPHVVRINSLKANALSSPGKVLRLNSPAKSVVRQPSPQRVLRSTPQKRVRPASPSPASNLTRQKSFKRE--ERPPPPPGPPSR--REKCRLAPT
         S Y +NPH+  INSLKA+A   P K +   SP   ++R PSPQRV RS PQKR RPASPSP     RQKSF++E  +RP   P P  R  REKC++A  
Subjt:  WSCYKENPHVVRINSLKANALSSPGKVLRLNSPAKSVVRQPSPQRVLRSTPQKRVRPASPSPASNLTRQKSFKRE--ERPPPPPGPPSR--REKCRLAPT

Query:  TTAPAKERNDLRRKDHTTKAAGPNSHNNRSPAAARACAMKKE-TFIHRISSKIDGVAVGEAVSNQDFDSVVAMEDIDNPLISLDCFIFL
          AP    N +R K             +RSP   R  AMKKE T IHRISSKID VAV EAV   D DSVVAMEDIDNPLISLDCFIFL
Subjt:  TTAPAKERNDLRRKDHTTKAAGPNSHNNRSPAAARACAMKKE-TFIHRISSKIDGVAVGEAVSNQDFDSVVAMEDIDNPLISLDCFIFL

A0A1S3B4I5 Uncharacterized protein1.4e-4454.64Show/hide
Query:  MIKQPARLDLNS-VQDKLVVISQP--PL----------TTPSLSLSNKISPYPPSPSSASTSSVSSFTCVSTSTANSSLSSATSRFKERERSPAGFSD-E
        M++QP   D N+ VQDKLVVI QP  PL           TPSLSL NKISPYPPSP S S+SS+SSFTC+S++T+    SS  + F     SP+  S   
Subjt:  MIKQPARLDLNS-VQDKLVVISQP--PL----------TTPSLSLSNKISPYPPSPSSASTSSVSSFTCVSTSTANSSLSSATSRFKERERSPAGFSD-E

Query:  FLWSCYKENPHVVRINSLKANALSSPGKVLRLNSPAKSVVRQPSPQRVLRSTPQKRVRPASPSPASNLTRQKSFKRE--ERPPPPPGPPSR--REKCRLA
        +  S Y +NPH+ RINSLKA+A  SP K +   SP   VVR PSPQRV RSTPQKR+RPASPSP     RQKSF++E  +RP   P P  R  REKC++A
Subjt:  FLWSCYKENPHVVRINSLKANALSSPGKVLRLNSPAKSVVRQPSPQRVLRSTPQKRVRPASPSPASNLTRQKSFKRE--ERPPPPPGPPSR--REKCRLA

Query:  PTTTAPAKERNDLRRKDHTTKAAGPNSHNNRSPAAARACAMKKE-TFIHRISSKIDGVAVGEAVSNQDFDSVVAMEDIDNPLISLDCFIFL
            AP    N +R K             +RSP   R  AMKKE T IHRISSKID VAV EAV   D DSVVAMED+DNPLISLDCFIFL
Subjt:  PTTTAPAKERNDLRRKDHTTKAAGPNSHNNRSPAAARACAMKKE-TFIHRISSKIDGVAVGEAVSNQDFDSVVAMEDIDNPLISLDCFIFL

A0A5D3D583 TPRXL protein1.4e-4454.64Show/hide
Query:  MIKQPARLDLNS-VQDKLVVISQP--PL----------TTPSLSLSNKISPYPPSPSSASTSSVSSFTCVSTSTANSSLSSATSRFKERERSPAGFSD-E
        M++QP   D N+ VQDKLVVI QP  PL           TPSLSL NKISPYPPSP S S+SS+SSFTC+S++T+    SS  + F     SP+  S   
Subjt:  MIKQPARLDLNS-VQDKLVVISQP--PL----------TTPSLSLSNKISPYPPSPSSASTSSVSSFTCVSTSTANSSLSSATSRFKERERSPAGFSD-E

Query:  FLWSCYKENPHVVRINSLKANALSSPGKVLRLNSPAKSVVRQPSPQRVLRSTPQKRVRPASPSPASNLTRQKSFKRE--ERPPPPPGPPSR--REKCRLA
        +  S Y +NPH+ RINSLKA+A  SP K +   SP   VVR PSPQRV RSTPQKR+RPASPSP     RQKSF++E  +RP   P P  R  REKC++A
Subjt:  FLWSCYKENPHVVRINSLKANALSSPGKVLRLNSPAKSVVRQPSPQRVLRSTPQKRVRPASPSPASNLTRQKSFKRE--ERPPPPPGPPSR--REKCRLA

Query:  PTTTAPAKERNDLRRKDHTTKAAGPNSHNNRSPAAARACAMKKE-TFIHRISSKIDGVAVGEAVSNQDFDSVVAMEDIDNPLISLDCFIFL
            AP    N +R K             +RSP   R  AMKKE T IHRISSKID VAV EAV   D DSVVAMED+DNPLISLDCFIFL
Subjt:  PTTTAPAKERNDLRRKDHTTKAAGPNSHNNRSPAAARACAMKKE-TFIHRISSKIDGVAVGEAVSNQDFDSVVAMEDIDNPLISLDCFIFL

A0A6J1FG04 uncharacterized protein LOC1114451473.3e-4151.89Show/hide
Query:  PARLDLNS-VQDKLVVISQPP----------LTTPSLSLSNKISPYPPSPSSASTSSVSSFTCVSTSTANSSLSSATSRFKERERSPAGFSDEFLWSCYK
        P   D N+ VQDKLVVI QPP           + PSLSLSNKISPYPPSPS +S+S+  S +  +T+T NSS S+A+S      RSP     ++ WS Y 
Subjt:  PARLDLNS-VQDKLVVISQPP----------LTTPSLSLSNKISPYPPSPSSASTSSVSSFTCVSTSTANSSLSSATSRFKERERSPAGFSDEFLWSCYK

Query:  ENPHVVRINSLKANALSSPGKVLRLNSP-AKSVVRQPSPQRVLR------STPQKRVRPASPSPASNLTRQKSFKREERPPPPPGPPSR--REKCRLAPT
        +NPHVVRINSLKA+  S P     L SP  +  +R PSPQRV R      STPQKRVR ASPSP     RQKSF++E + P  P P  R   EKCR+A  
Subjt:  ENPHVVRINSLKANALSSPGKVLRLNSP-AKSVVRQPSPQRVLR------STPQKRVRPASPSPASNLTRQKSFKREERPPPPPGPPSR--REKCRLAPT

Query:  TTAPAKERNDLRRKDHTTKAAGPNSHNNRSPAAARACAMKKE--TFIHRISSKIDGVAVGEAVSNQ-DFDSVVAMEDIDNPLISLDCFIFL
            A           + K  G  S   RSP  AR C MKKE  T IHRISSKID  A  EAV N+ D DS  AMEDIDNPLISLDCFIFL
Subjt:  TTAPAKERNDLRRKDHTTKAAGPNSHNNRSPAAARACAMKKE--TFIHRISSKIDGVAVGEAVSNQ-DFDSVVAMEDIDNPLISLDCFIFL

V4T7U2 Uncharacterized protein2.6e-3846.94Show/hide
Query:  NSVQDKLVVISQPPLTTPSLSLSNKISPYPPSPSSASTSSVSSFTCVSTSTA---NSSLSSATSRFKERERSPAGFSDEFLWSCYKENPHVVRINSLKAN
        + VQDKLV+  Q P  TP++ LSN+ISP P SP S STSS+SSFTC +++T+   +S  SSA+S    ++RS   FS+EFLWSC KENPH++RINS+K  
Subjt:  NSVQDKLVVISQPPLTTPSLSLSNKISPYPPSPSSASTSSVSSFTCVSTSTA---NSSLSSATSRFKERERSPAGFSDEFLWSCYKENPHVVRINSLKAN

Query:  ALS---SPGKVLRLNSPAKSV---VRQPSPQRVLRSTPQKRVRPASPSPASNLTRQKSFKRE-ERPPPP---------PGPPSRR---EKCRLAPTTTAP
        +LS   +  +  +L+SP KS+   V+Q  P R+  STPQKRVR +SP+P   L+RQKSF+RE ER   P            PSRR   +  R   T T  
Subjt:  ALS---SPGKVLRLNSPAKSV---VRQPSPQRVLRSTPQKRVRPASPSPASNLTRQKSFKRE-ERPPPP---------PGPPSRR---EKCRLAPTTTAP

Query:  AKERNDLRRKDHTTKAAG----------PNSHNNRSPAAARACAMKKETFIHRISSKIDGVAVGEAVSNQDFDSVVAMEDIDNPLISLDCFIFL
            N +  K H                P+  NN + A  R C   KETF HRI SKID VAV EA+++      V MEDIDNPLISLDCFIFL
Subjt:  AKERNDLRRKDHTTKAAG----------PNSHNNRSPAAARACAMKKETFIHRISSKIDGVAVGEAVSNQDFDSVVAMEDIDNPLISLDCFIFL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21510.1 unknown protein1.8e-1231.87Show/hide
Query:  PLTTPSLSLSNKISPYPPSP---SSASTSSVSSFTCVSTSTANSSLSSATSRFKERERSPAGFSDEFLWSCYKENPHVVRINSLKANALS----SPGKVL
        P+    +++  K+ P PPSP   +S S    S+ +  S S++NSSLS+A+S    +ERS   FS++FL +CY+EN HV RINSL+  +LS     P    
Subjt:  PLTTPSLSLSNKISPYPPSP---SSASTSSVSSFTCVSTSTANSSLSSATSRFKERERSPAGFSDEFLWSCYKENPHVVRINSLKANALS----SPGKVL

Query:  RLNSPAKSVVRQPSPQRVLR-----STPQKRVRPASPSPASNLTRQKSFKREERPPPPPGPPSRREKCRLAPTTTAPAKERNDLRRKDHTTKAAGPNS--
        R +SP        +P R        S   KR R  SP+  S LTRQKSF++++         +   K +   + +   +   +  +    ++  G  +  
Subjt:  RLNSPAKSVVRQPSPQRVLR-----STPQKRVRPASPSPASNLTRQKSFKREERPPPPPGPPSRREKCRLAPTTTAPAKERNDLRRKDHTTKAAGPNS--

Query:  --------HNNRSPAAARACAM--KKETFIHRISSKIDGVAVGEAVSNQDFDSVVAMEDIDNPLISLDCFIFL
                 ++   +  + C M  + E  IHRISSKID   + E ++      V   E++ NPLI LDCFIFL
Subjt:  --------HNNRSPAAARACAM--KKETFIHRISSKIDGVAVGEAVSNQDFDSVVAMEDIDNPLISLDCFIFL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGATGATCAAACAACCGGCTCGTTTAGATCTCAATTCTGTTCAAGACAAGCTCGTTGTGATCTCCCAACCGCCATTGACAACACCTTCTCTCTCGCTCTCAAACAA
GATTTCTCCATATCCACCGTCTCCTTCATCGGCTTCTACTTCTTCCGTCTCTTCCTTCACCTGCGTTTCCACCAGCACTGCTAACAGCTCGCTGTCGAGTGCAACTTCGC
GTTTTAAGGAAAGGGAGAGGTCGCCGGCGGGTTTCTCCGATGAGTTCTTGTGGTCCTGTTACAAGGAGAACCCCCACGTCGTCCGGATTAACTCCCTTAAAGCTAACGCT
CTGTCGTCCCCTGGAAAAGTTTTGAGGCTGAACTCGCCGGCGAAGTCGGTGGTCCGGCAGCCATCCCCACAGAGGGTCTTGAGATCGACACCCCAGAAGAGAGTCCGTCC
GGCGTCGCCCTCACCGGCGTCAAACCTGACGCGGCAGAAGAGCTTCAAGAGGGAGGAGCGGCCTCCTCCTCCGCCTGGCCCTCCTAGCAGAAGGGAGAAATGCCGGCTCG
CTCCGACCACCACAGCGCCGGCGAAAGAGCGAAACGATTTAAGGCGGAAGGACCATACGACGAAAGCAGCAGGTCCGAACAGCCATAACAATAGAAGTCCAGCCGCTGCT
CGAGCTTGTGCGATGAAGAAGGAAACTTTCATCCACCGGATCAGTTCAAAGATAGACGGAGTTGCAGTGGGAGAAGCAGTTTCAAATCAAGATTTTGATTCGGTCGTGGC
TATGGAGGATATTGATAATCCCTTAATCTCATTGGATTGCTTTATCTTTCTATAG
mRNA sequenceShow/hide mRNA sequence
ATGATGATGATCAAACAACCGGCTCGTTTAGATCTCAATTCTGTTCAAGACAAGCTCGTTGTGATCTCCCAACCGCCATTGACAACACCTTCTCTCTCGCTCTCAAACAA
GATTTCTCCATATCCACCGTCTCCTTCATCGGCTTCTACTTCTTCCGTCTCTTCCTTCACCTGCGTTTCCACCAGCACTGCTAACAGCTCGCTGTCGAGTGCAACTTCGC
GTTTTAAGGAAAGGGAGAGGTCGCCGGCGGGTTTCTCCGATGAGTTCTTGTGGTCCTGTTACAAGGAGAACCCCCACGTCGTCCGGATTAACTCCCTTAAAGCTAACGCT
CTGTCGTCCCCTGGAAAAGTTTTGAGGCTGAACTCGCCGGCGAAGTCGGTGGTCCGGCAGCCATCCCCACAGAGGGTCTTGAGATCGACACCCCAGAAGAGAGTCCGTCC
GGCGTCGCCCTCACCGGCGTCAAACCTGACGCGGCAGAAGAGCTTCAAGAGGGAGGAGCGGCCTCCTCCTCCGCCTGGCCCTCCTAGCAGAAGGGAGAAATGCCGGCTCG
CTCCGACCACCACAGCGCCGGCGAAAGAGCGAAACGATTTAAGGCGGAAGGACCATACGACGAAAGCAGCAGGTCCGAACAGCCATAACAATAGAAGTCCAGCCGCTGCT
CGAGCTTGTGCGATGAAGAAGGAAACTTTCATCCACCGGATCAGTTCAAAGATAGACGGAGTTGCAGTGGGAGAAGCAGTTTCAAATCAAGATTTTGATTCGGTCGTGGC
TATGGAGGATATTGATAATCCCTTAATCTCATTGGATTGCTTTATCTTTCTATAG
Protein sequenceShow/hide protein sequence
MMMIKQPARLDLNSVQDKLVVISQPPLTTPSLSLSNKISPYPPSPSSASTSSVSSFTCVSTSTANSSLSSATSRFKERERSPAGFSDEFLWSCYKENPHVVRINSLKANA
LSSPGKVLRLNSPAKSVVRQPSPQRVLRSTPQKRVRPASPSPASNLTRQKSFKREERPPPPPGPPSRREKCRLAPTTTAPAKERNDLRRKDHTTKAAGPNSHNNRSPAAA
RACAMKKETFIHRISSKIDGVAVGEAVSNQDFDSVVAMEDIDNPLISLDCFIFL