; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS015725 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS015725
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
Descriptionsegmentation polarity homeobox protein engrailed
Genome locationscaffold983:624329..625111
RNA-Seq ExpressionMS015725
SyntenyMS015725
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008441600.1 PREDICTED: putative protein TPRXL [Cucumis melo]4.2e-5659.03Show/hide
Query:  MGSCISKCKPKMMIKQHPSCNLDLNNLVHDKLLIIPQ--SPL-----TNPPCNSLSLDLSNKISPYPPSPSPSTSSLSSFTCLSSAT---TLSSASTATS
        MGSCISKCKPKMM +Q P    D NNLV DKL++IPQ  SPL     T     + SL L NKISPYPPSPSPS+SS+SSFTCLSS T   T +S STA+S
Subjt:  MGSCISKCKPKMMIKQHPSCNLDLNNLVHDKLLIIPQ--SPL-----TNPPCNSLSLDLSNKISPYPPSPSPSTSSLSSFTCLSSAT---TLSSASTATS

Query:  RSAPFSD-DYLWSCYKQNPHVARINSLKANALSSPGRALRLNSPVKPVSPVFRHRPRQPSPQRVSRSTPLKRVRPCSPSPSPTPPRQKSFRKE--QRP-P
          +P S   Y  S Y QNPH+ RINSLKA+A           SPVKP+SP+     R PSPQRVSRSTP KR+RP SPSP     RQKSFRKE  QRP  
Subjt:  RSAPFSD-DYLWSCYKQNPHVARINSLKANALSSPGRALRLNSPVKPVSPVFRHRPRQPSPQRVSRSTPLKRVRPCSPSPSPTPPRQKSFRKE--QRP-P

Query:  SPSPTRR---------VAPAKG-NSYLHSP---APMKKEISCIHRISSKIDEVAAREAASEDIQDFDTVVAMEDIDNPLISLDCFIFL
        SPSPTRR         +AP  G      SP   + MKKEI+CIHRISSKID+VA +EA    + D D+VVAMED+DNPLISLDCFIFL
Subjt:  SPSPTRR---------VAPAKG-NSYLHSP---APMKKEISCIHRISSKIDEVAAREAASEDIQDFDTVVAMEDIDNPLISLDCFIFL

XP_011657327.1 putative protein TPRXL [Cucumis sativus]5.1e-5459.44Show/hide
Query:  MGSCISKCKPKMMIKQHPSCNLDLNNL-VHDKLLIIPQ--SPLTNPPCNSL--SLDLSNKISPYPPSPSPSTSSLSSFTCLSSAT---TLSSASTATSRS
        MGSCISKCKPKMM KQ P    D NNL V DKL++IPQ  SPL      S   SL L NKISPYPPSPSPS+SS+SSFTCLSS T   T +S STA+S  
Subjt:  MGSCISKCKPKMMIKQHPSCNLDLNNL-VHDKLLIIPQ--SPLTNPPCNSL--SLDLSNKISPYPPSPSPSTSSLSSFTCLSSAT---TLSSASTATSRS

Query:  APFSD-DYLWSCYKQNPHVARINSLKANALSSPGRALRLNSPVKPVSPVFRHRPRQPSPQRVSRSTPLKRVRPCSPSPSPTPPRQKSFRKE--QRP-PSP
        +P S   Y  S Y QNPH+  INSLKA+A            PVKP+SP+     R PSPQRVSRS P KR RP SPSP     RQKSFRKE  QRP  SP
Subjt:  APFSD-DYLWSCYKQNPHVARINSLKANALSSPGRALRLNSPVKPVSPVFRHRPRQPSPQRVSRSTPLKRVRPCSPSPSPTPPRQKSFRKE--QRP-PSP

Query:  SPTRR---------VAPAKG-NSYLHSP---APMKKEISCIHRISSKIDEVAAREAASEDIQDFDTVVAMEDIDNPLISLDCFIFL
        SPTRR         +AP  G      SP   + MKKEI+CIHRISSKIDEVA +EA    + D D+VVAMEDIDNPLISLDCFIFL
Subjt:  SPTRR---------VAPAKG-NSYLHSP---APMKKEISCIHRISSKIDEVAAREAASEDIQDFDTVVAMEDIDNPLISLDCFIFL

XP_022939152.1 uncharacterized protein LOC111445147 [Cucurbita moschata]1.7e-5457.09Show/hide
Query:  MGSCISKCKPKMMIKQHPSCNLDLNNLVHDKLLIIPQSP-----LTNPPCNSLSLDLSNKISPYPPSPSPSTSSLSSFTCLSSATTL----SSASTATSR
        MGSCISKCKPK  IK  P    D NN+V DKL++IPQ P     +T    ++ SL LSNKISPYPPSPSPS+SS    TCLSS+TT     SS STA+SR
Subjt:  MGSCISKCKPKMMIKQHPSCNLDLNNLVHDKLLIIPQSP-----LTNPPCNSLSLDLSNKISPYPPSPSPSTSSLSSFTCLSSATTL----SSASTATSR

Query:  SAPFSDDYLWSCYKQNPHVARINSLKANALSSPGRALRLNSPVKPVSPVFRHRPRQPSPQRVSR------STPLKRVRPCSPSPSPTPPRQKSFRKE-QR
        S     DY WS Y QNPHV RINSLKA+  S          P   VSPV R R R PSPQRVSR      STP KRVR  SPS    P RQKSFRKE QR
Subjt:  SAPFSDDYLWSCYKQNPHVARINSLKANALSSPGRALRLNSPVKPVSPVFRHRPRQPSPQRVSR------STPLKRVRPCSPSPSPTPPRQKSFRKE-QR

Query:  PPSPSPTRRVAPAK---------GNSY------LHSPA---PMKKE-ISCIHRISSKIDEVAAREAASEDIQDFDTVVAMEDIDNPLISLDCFIFL
        P SPSP+RR++  K         G S         SPA    MKKE I+CIHRISSKIDE AAREA   +  D D+  AMEDIDNPLISLDCFIFL
Subjt:  PPSPSPTRRVAPAK---------GNSY------LHSPA---PMKKE-ISCIHRISSKIDEVAAREAASEDIQDFDTVVAMEDIDNPLISLDCFIFL

XP_023550659.1 proline-rich receptor-like protein kinase PERK2 [Cucurbita pepo subsp. pepo]1.9e-5658.62Show/hide
Query:  MGSCISKCKPKMMIKQHPSCN--LDLNNLVHDKLLIIPQSPLTNPPCNSL---SLDLSNKISPYPPSPSPSTSSLSSFTCLSSATTL----SSASTATSR
        MGSCISKCKPK +    P      D NN+V DKL++IPQ P       S    SL LSNKISPYPPSPSPS+SS    TCLSS TT     SS STA+SR
Subjt:  MGSCISKCKPKMMIKQHPSCN--LDLNNLVHDKLLIIPQSPLTNPPCNSL---SLDLSNKISPYPPSPSPSTSSLSSFTCLSSATTL----SSASTATSR

Query:  SAPFSDDYLWSCYKQNPHVARINSLKANALSSPGRALRLNSPVKPVSPVFRHRPRQPSPQRVSRSTPLKRVRPCSPSPSPTPPRQKSFRKE-QRPPSPSP
        S   S DY WS Y QNPHV RINSLKA+A S          P  PVSPV R R R PSPQRVSRSTP KRVR  SPS    P RQKSFRKE QRP SPSP
Subjt:  SAPFSDDYLWSCYKQNPHVARINSLKANALSSPGRALRLNSPVKPVSPVFRHRPRQPSPQRVSRSTPLKRVRPCSPSPSPTPPRQKSFRKE-QRPPSPSP

Query:  TRRVAPAK---------GNSY------LHSPA---PMKKE-ISCIHRISSKIDEVAAREAASEDIQDFDTVVAMEDIDNPLISLDCFIFL
        +RR++  K         G S         SPA    MKKE I+CIHRISSKIDE AAREA   +  D D+  AMEDIDNPLISLDCFIFL
Subjt:  TRRVAPAK---------GNSY------LHSPA---PMKKE-ISCIHRISSKIDEVAAREAASEDIQDFDTVVAMEDIDNPLISLDCFIFL

XP_038886331.1 proline-rich receptor-like protein kinase PERK2 [Benincasa hispida]9.0e-5158.12Show/hide
Query:  MGSCISKCKPKMMIKQHPSCNLDLNNLVHDKLLIIPQ--SPLTNPPCNSL---SLDLSNKISPYPPSPSPSTSSLSSFTCLSSATTLSSASTATSRSAPF
        MGSCISKCKPKMM KQ P  + + NNLV DKL++IPQ  SPL      +    SL L+NKISPYPPSPS   SS+SSFTCLSS+T  +S STA+S  +P 
Subjt:  MGSCISKCKPKMMIKQHPSCNLDLNNLVHDKLLIIPQ--SPLTNPPCNSL---SLDLSNKISPYPPSPSPSTSSLSSFTCLSSATTLSSASTATSRSAPF

Query:  SDDYLW-SCYKQNPHVARINSLKANALSSPGRALRLNSPVKPVSPVFRHRPRQPSPQRVSRSTPLKRVRPCSPSPSPTPPRQKSFRKEQRP---PSPSPT
        S  + + S Y Q   + RINSLKA A            P+KPVSP+ RH    PSPQRV RSTP KRVRP SPSP     RQKSFRKE  P   PSPSP+
Subjt:  SDDYLW-SCYKQNPHVARINSLKANALSSPGRALRLNSPVKPVSPVFRHRPRQPSPQRVSRSTPLKRVRPCSPSPSPTPPRQKSFRKEQRP---PSPSPT

Query:  RRVAPAKGNSYL----HSPA---PMKKEISCIHRISSKIDEVAAREAASEDIQDFDTVVAMEDIDNPLISLDCFIFL
        RR +  K    +     SPA    MKKEI+CIHRISSKIDEVA +EA    + D D+VVAMEDIDNPLISLDCFIFL
Subjt:  RRVAPAKGNSYL----HSPA---PMKKEISCIHRISSKIDEVAAREAASEDIQDFDTVVAMEDIDNPLISLDCFIFL

TrEMBL top hitse value%identityAlignment
A0A0A0KIF4 Uncharacterized protein2.5e-5459.44Show/hide
Query:  MGSCISKCKPKMMIKQHPSCNLDLNNL-VHDKLLIIPQ--SPLTNPPCNSL--SLDLSNKISPYPPSPSPSTSSLSSFTCLSSAT---TLSSASTATSRS
        MGSCISKCKPKMM KQ P    D NNL V DKL++IPQ  SPL      S   SL L NKISPYPPSPSPS+SS+SSFTCLSS T   T +S STA+S  
Subjt:  MGSCISKCKPKMMIKQHPSCNLDLNNL-VHDKLLIIPQ--SPLTNPPCNSL--SLDLSNKISPYPPSPSPSTSSLSSFTCLSSAT---TLSSASTATSRS

Query:  APFSD-DYLWSCYKQNPHVARINSLKANALSSPGRALRLNSPVKPVSPVFRHRPRQPSPQRVSRSTPLKRVRPCSPSPSPTPPRQKSFRKE--QRP-PSP
        +P S   Y  S Y QNPH+  INSLKA+A            PVKP+SP+     R PSPQRVSRS P KR RP SPSP     RQKSFRKE  QRP  SP
Subjt:  APFSD-DYLWSCYKQNPHVARINSLKANALSSPGRALRLNSPVKPVSPVFRHRPRQPSPQRVSRSTPLKRVRPCSPSPSPTPPRQKSFRKE--QRP-PSP

Query:  SPTRR---------VAPAKG-NSYLHSP---APMKKEISCIHRISSKIDEVAAREAASEDIQDFDTVVAMEDIDNPLISLDCFIFL
        SPTRR         +AP  G      SP   + MKKEI+CIHRISSKIDEVA +EA    + D D+VVAMEDIDNPLISLDCFIFL
Subjt:  SPTRR---------VAPAKG-NSYLHSP---APMKKEISCIHRISSKIDEVAAREAASEDIQDFDTVVAMEDIDNPLISLDCFIFL

A0A1S3B4I5 Uncharacterized protein2.0e-5659.03Show/hide
Query:  MGSCISKCKPKMMIKQHPSCNLDLNNLVHDKLLIIPQ--SPL-----TNPPCNSLSLDLSNKISPYPPSPSPSTSSLSSFTCLSSAT---TLSSASTATS
        MGSCISKCKPKMM +Q P    D NNLV DKL++IPQ  SPL     T     + SL L NKISPYPPSPSPS+SS+SSFTCLSS T   T +S STA+S
Subjt:  MGSCISKCKPKMMIKQHPSCNLDLNNLVHDKLLIIPQ--SPL-----TNPPCNSLSLDLSNKISPYPPSPSPSTSSLSSFTCLSSAT---TLSSASTATS

Query:  RSAPFSD-DYLWSCYKQNPHVARINSLKANALSSPGRALRLNSPVKPVSPVFRHRPRQPSPQRVSRSTPLKRVRPCSPSPSPTPPRQKSFRKE--QRP-P
          +P S   Y  S Y QNPH+ RINSLKA+A           SPVKP+SP+     R PSPQRVSRSTP KR+RP SPSP     RQKSFRKE  QRP  
Subjt:  RSAPFSD-DYLWSCYKQNPHVARINSLKANALSSPGRALRLNSPVKPVSPVFRHRPRQPSPQRVSRSTPLKRVRPCSPSPSPTPPRQKSFRKE--QRP-P

Query:  SPSPTRR---------VAPAKG-NSYLHSP---APMKKEISCIHRISSKIDEVAAREAASEDIQDFDTVVAMEDIDNPLISLDCFIFL
        SPSPTRR         +AP  G      SP   + MKKEI+CIHRISSKID+VA +EA    + D D+VVAMED+DNPLISLDCFIFL
Subjt:  SPSPTRR---------VAPAKG-NSYLHSP---APMKKEISCIHRISSKIDEVAAREAASEDIQDFDTVVAMEDIDNPLISLDCFIFL

A0A5D3D583 TPRXL protein2.0e-5659.03Show/hide
Query:  MGSCISKCKPKMMIKQHPSCNLDLNNLVHDKLLIIPQ--SPL-----TNPPCNSLSLDLSNKISPYPPSPSPSTSSLSSFTCLSSAT---TLSSASTATS
        MGSCISKCKPKMM +Q P    D NNLV DKL++IPQ  SPL     T     + SL L NKISPYPPSPSPS+SS+SSFTCLSS T   T +S STA+S
Subjt:  MGSCISKCKPKMMIKQHPSCNLDLNNLVHDKLLIIPQ--SPL-----TNPPCNSLSLDLSNKISPYPPSPSPSTSSLSSFTCLSSAT---TLSSASTATS

Query:  RSAPFSD-DYLWSCYKQNPHVARINSLKANALSSPGRALRLNSPVKPVSPVFRHRPRQPSPQRVSRSTPLKRVRPCSPSPSPTPPRQKSFRKE--QRP-P
          +P S   Y  S Y QNPH+ RINSLKA+A           SPVKP+SP+     R PSPQRVSRSTP KR+RP SPSP     RQKSFRKE  QRP  
Subjt:  RSAPFSD-DYLWSCYKQNPHVARINSLKANALSSPGRALRLNSPVKPVSPVFRHRPRQPSPQRVSRSTPLKRVRPCSPSPSPTPPRQKSFRKE--QRP-P

Query:  SPSPTRR---------VAPAKG-NSYLHSP---APMKKEISCIHRISSKIDEVAAREAASEDIQDFDTVVAMEDIDNPLISLDCFIFL
        SPSPTRR         +AP  G      SP   + MKKEI+CIHRISSKID+VA +EA    + D D+VVAMED+DNPLISLDCFIFL
Subjt:  SPSPTRR---------VAPAKG-NSYLHSP---APMKKEISCIHRISSKIDEVAAREAASEDIQDFDTVVAMEDIDNPLISLDCFIFL

A0A6J1FG04 uncharacterized protein LOC1114451478.4e-5557.09Show/hide
Query:  MGSCISKCKPKMMIKQHPSCNLDLNNLVHDKLLIIPQSP-----LTNPPCNSLSLDLSNKISPYPPSPSPSTSSLSSFTCLSSATTL----SSASTATSR
        MGSCISKCKPK  IK  P    D NN+V DKL++IPQ P     +T    ++ SL LSNKISPYPPSPSPS+SS    TCLSS+TT     SS STA+SR
Subjt:  MGSCISKCKPKMMIKQHPSCNLDLNNLVHDKLLIIPQSP-----LTNPPCNSLSLDLSNKISPYPPSPSPSTSSLSSFTCLSSATTL----SSASTATSR

Query:  SAPFSDDYLWSCYKQNPHVARINSLKANALSSPGRALRLNSPVKPVSPVFRHRPRQPSPQRVSR------STPLKRVRPCSPSPSPTPPRQKSFRKE-QR
        S     DY WS Y QNPHV RINSLKA+  S          P   VSPV R R R PSPQRVSR      STP KRVR  SPS    P RQKSFRKE QR
Subjt:  SAPFSDDYLWSCYKQNPHVARINSLKANALSSPGRALRLNSPVKPVSPVFRHRPRQPSPQRVSR------STPLKRVRPCSPSPSPTPPRQKSFRKE-QR

Query:  PPSPSPTRRVAPAK---------GNSY------LHSPA---PMKKE-ISCIHRISSKIDEVAAREAASEDIQDFDTVVAMEDIDNPLISLDCFIFL
        P SPSP+RR++  K         G S         SPA    MKKE I+CIHRISSKIDE AAREA   +  D D+  AMEDIDNPLISLDCFIFL
Subjt:  PPSPSPTRRVAPAK---------GNSY------LHSPA---PMKKE-ISCIHRISSKIDEVAAREAASEDIQDFDTVVAMEDIDNPLISLDCFIFL

M5XHQ0 Uncharacterized protein9.1e-3343.49Show/hide
Query:  MGSCISKCKPKMMIKQHPSCNLDLNNLVHDKLLIIPQSP--LTNPPCNSLSLDLSNKISPYPPSPSPSTSSLSSFTCL----------SSATTLSSASTA
        MGSCISKC+P+  +       +D  N V DK L+I Q+P  L  PP     +  SNKISP PPSPS STSS SSFTC           S  +TLSSAS+ 
Subjt:  MGSCISKCKPKMMIKQHPSCNLDLNNLVHDKLLIIPQSP--LTNPPCNSLSLDLSNKISPYPPSPSPSTSSLSSFTCL----------SSATTLSSASTA

Query:  TSR--SAPFSDDYLWSCYKQNPHVARINSLKANALSSPGRALRLNSPVKPVSPVFRHRPRQPSPQRVSRS-TPLKRVRPCSPSPSPTPPRQKSFRKE-QR
         S      FS+++LWSCYK+NPHV RINSLK  + SS       + P KP+ P    + +QP+ +  + S TP KRVR  SP+P     RQKSFRKE +R
Subjt:  TSR--SAPFSDDYLWSCYKQNPHVARINSLKANALSSPGRALRLNSPVKPVSPVFRHRPRQPSPQRVSRS-TPLKRVRPCSPSPSPTPPRQKSFRKE-QR

Query:  PP-------------SPSPTRR-------------------VAPAKGNSYLHSPAPMKKEI------SCIHRISSKIDEVAAREAASEDIQDFDTVVAME
        PP             SPSP+RR                   + PA  ++Y +S   ++  +      + IHRISSKIDEVA  EA    + D+   +  E
Subjt:  PP-------------SPSPTRR-------------------VAPAKGNSYLHSPAPMKKEI------SCIHRISSKIDEVAAREAASEDIQDFDTVVAME

Query:  DIDNPLISLDCFIFL
        DIDNPLISLDCFIFL
Subjt:  DIDNPLISLDCFIFL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21510.1 unknown protein2.0e-1633.04Show/hide
Query:  MGSCISKCKPKMMIKQHPSCNLDLNNLVHDKLLIIPQ-----------SPLTNP--------PCNSLSLDLSNKISPYPPSP---------SPSTSSLSS
        MG CISKC PK       S +       H+K   +P+           SPL  P        P  +  +++  K+ P PPSP          PST+S SS
Subjt:  MGSCISKCKPKMMIKQHPSCNLDLNNLVHDKLLIIPQ-----------SPLTNP--------PCNSLSLDLSNKISPYPPSP---------SPSTSSLSS

Query:  FTCLSSATTLSSAST-ATSRSAPFSDDYLWSCYKQNPHVARINSLKANALS----SPGRALRLNSPVKPVSPVFRHRPRQPSPQRVSRSTPLKRVRPCSP
         +  SS ++LS+AS+ + S+   FS+D+L +CY++N HVARINSL+  +LS     P    R +SPV P        P + +      S   KR R  SP
Subjt:  FTCLSSATTLSSAST-ATSRSAPFSDDYLWSCYKQNPHVARINSLKANALS----SPGRALRLNSPVKPVSPVFRHRPRQPSPQRVSRSTPLKRVRPCSP

Query:  SPSPTPPRQKSFRKEQRP----------------PSPSPTRRVAPAKGNSYLHSPAPMKK---------EISC-----------------------IHRI
        +   +  RQKSFR++Q                   SPSP+RR    +GN +L SP+P ++           SC                       IHRI
Subjt:  SPSPTPPRQKSFRKEQRP----------------PSPSPTRRVAPAKGNSYLHSPAPMKK---------EISC-----------------------IHRI

Query:  SSKIDEVAAREAASEDIQDFDTVVAMEDIDNPLISLDCFIFL
        SSKID+   RE  ++D +    V   E++ NPLI LDCFIFL
Subjt:  SSKIDEVAAREAASEDIQDFDTVVAMEDIDNPLISLDCFIFL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTTCTTGCATTAGCAAATGCAAACCCAAGATGATGATCAAACAACACCCATCTTGTAACTTGGATCTCAACAATCTCGTCCACGACAAGCTCCTCATAATCCCCCA
ATCCCCATTAACAAATCCTCCTTGTAATTCTCTCTCTTTAGATCTCTCCAACAAAATCTCTCCCTATCCTCCTTCTCCTTCCCCTTCTACTTCTTCCCTCTCTTCCTTCA
CTTGTCTCTCTTCCGCCACCACCCTCAGCTCGGCCTCGACCGCCACGTCGCGGTCTGCGCCTTTCTCCGACGACTACTTGTGGTCGTGTTATAAACAAAACCCTCACGTT
GCACGTATCAATTCCCTTAAAGCCAACGCCTTGTCGTCGCCGGGGAGAGCTTTGCGCCTCAACTCCCCCGTGAAGCCGGTTTCACCGGTGTTCCGCCACCGCCCGCGCCA
GCCTTCCCCGCAGAGGGTCTCGAGGTCCACACCCCTGAAGAGAGTCCGTCCATGCTCGCCCTCACCCTCGCCAACGCCCCCGCGCCAGAAGAGCTTCAGGAAGGAGCAGC
GGCCTCCGTCGCCGTCTCCGACCAGACGGGTGGCGCCGGCGAAAGGAAATAGTTACTTACATTCACCTGCTCCGATGAAGAAGGAAATTAGTTGCATTCATCGGATCAGT
TCGAAGATAGACGAGGTGGCGGCGAGAGAAGCAGCTTCGGAGGATATTCAAGATTTTGATACGGTGGTGGCTATGGAGGATATTGATAATCCCTTAATCTCGTTGGATTG
CTTTATCTTTCTA
mRNA sequenceShow/hide mRNA sequence
ATGGGTTCTTGCATTAGCAAATGCAAACCCAAGATGATGATCAAACAACACCCATCTTGTAACTTGGATCTCAACAATCTCGTCCACGACAAGCTCCTCATAATCCCCCA
ATCCCCATTAACAAATCCTCCTTGTAATTCTCTCTCTTTAGATCTCTCCAACAAAATCTCTCCCTATCCTCCTTCTCCTTCCCCTTCTACTTCTTCCCTCTCTTCCTTCA
CTTGTCTCTCTTCCGCCACCACCCTCAGCTCGGCCTCGACCGCCACGTCGCGGTCTGCGCCTTTCTCCGACGACTACTTGTGGTCGTGTTATAAACAAAACCCTCACGTT
GCACGTATCAATTCCCTTAAAGCCAACGCCTTGTCGTCGCCGGGGAGAGCTTTGCGCCTCAACTCCCCCGTGAAGCCGGTTTCACCGGTGTTCCGCCACCGCCCGCGCCA
GCCTTCCCCGCAGAGGGTCTCGAGGTCCACACCCCTGAAGAGAGTCCGTCCATGCTCGCCCTCACCCTCGCCAACGCCCCCGCGCCAGAAGAGCTTCAGGAAGGAGCAGC
GGCCTCCGTCGCCGTCTCCGACCAGACGGGTGGCGCCGGCGAAAGGAAATAGTTACTTACATTCACCTGCTCCGATGAAGAAGGAAATTAGTTGCATTCATCGGATCAGT
TCGAAGATAGACGAGGTGGCGGCGAGAGAAGCAGCTTCGGAGGATATTCAAGATTTTGATACGGTGGTGGCTATGGAGGATATTGATAATCCCTTAATCTCGTTGGATTG
CTTTATCTTTCTA
Protein sequenceShow/hide protein sequence
MGSCISKCKPKMMIKQHPSCNLDLNNLVHDKLLIIPQSPLTNPPCNSLSLDLSNKISPYPPSPSPSTSSLSSFTCLSSATTLSSASTATSRSAPFSDDYLWSCYKQNPHV
ARINSLKANALSSPGRALRLNSPVKPVSPVFRHRPRQPSPQRVSRSTPLKRVRPCSPSPSPTPPRQKSFRKEQRPPSPSPTRRVAPAKGNSYLHSPAPMKKEISCIHRIS
SKIDEVAAREAASEDIQDFDTVVAMEDIDNPLISLDCFIFL