; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg008745 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg008745
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
Descriptionsegmentation polarity homeobox protein engrailed
Genome locationscaffold10:35513199..35513984
RNA-Seq ExpressionSpg008745
SyntenySpg008745
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008441600.1 PREDICTED: putative protein TPRXL [Cucumis melo]6.8e-8372.73Show/hide
Query:  MGSCISKCKPKTFKQPPRFDFNNLVQDKLVVIPQP--PPLTTTAT-----TPSLSLSNKISPYPPSPSPSSSSISSFTCLSSSTTTTTNTSFSTASSR-S
        MGSCISKCKPK  +QPP FDFNNLVQDKLVVIPQP  P LT+TAT     TPSLSL NKISPYPPSPSPSSSSISSFTCLSS+T+++TNTSFSTASS  S
Subjt:  MGSCISKCKPKTFKQPPRFDFNNLVQDKLVVIPQP--PPLTTTAT-----TPSLSLSNKISPYPPSPSPSSSSISSFTCLSSSTTTTTNTSFSTASSR-S

Query:  PIFSDEFLWSCYKQNPHVVRINSLKANAF-SPIKPVSPLARQRQQRQPSPQRVSTLRSTPQKRVRQASPSPVRQKSFRKE--ERPL-SPSPSRRFGGEKC
        PI S  +  S Y QNPH+ RINSLKA+AF SP+KP+SPL      R PSPQRVS  RSTPQKR+R ASPSP+RQKSFRKE  +RPL SPSP+RRF  EKC
Subjt:  PIFSDEFLWSCYKQNPHVVRINSLKANAF-SPIKPVSPLARQRQQRQPSPQRVSTLRSTPQKRVRQASPSPVRQKSFRKE--ERPL-SPSPSRRFGGEKC

Query:  RV--APTKPASPRRPSLPARGCGMKKEITCIHRISSKIDEVAAREAVLNDGDLDSVVAMEDIDNPLISLDCFIFL
        +V  AP     P+  S P RG  MKKEITCIHRISSKID+VA +EAV   GDLDSVVAMED+DNPLISLDCFIFL
Subjt:  RV--APTKPASPRRPSLPARGCGMKKEITCIHRISSKIDEVAAREAVLNDGDLDSVVAMEDIDNPLISLDCFIFL

XP_011657327.1 putative protein TPRXL [Cucumis sativus]7.8e-7972.53Show/hide
Query:  MGSCISKCKPKTFKQPPRFDFNNL-VQDKLVVIPQP--PPLT--TTATTPSLSLSNKISPYPPSPSPSSSSISSFTCLSSSTTTTTNTSFSTASSR-SPI
        MGSCISKCKPK  KQPP FDFNNL VQDKLVVIPQP  P LT  TT+ TPSLSL NKISPYPPSPSPSSSSISSFTCLSS+T ++TNTSFSTASS  SPI
Subjt:  MGSCISKCKPKTFKQPPRFDFNNL-VQDKLVVIPQP--PPLT--TTATTPSLSLSNKISPYPPSPSPSSSSISSFTCLSSSTTTTTNTSFSTASSR-SPI

Query:  FSDEFLWSCYKQNPHVVRINSLKANAF-SPIKPVSPLARQRQQRQPSPQRVSTLRSTPQKRVRQASPSPVRQKSFRKE--ERPL-SPSPSRRFGGEKCRV
         S  +  S Y QNPH+  INSLKA+AF  P+KP+SPL      R PSPQRVS  RS PQKR R ASPSP+RQKSFRKE  +RPL SPSP+RRF  EKC+V
Subjt:  FSDEFLWSCYKQNPHVVRINSLKANAF-SPIKPVSPLARQRQQRQPSPQRVSTLRSTPQKRVRQASPSPVRQKSFRKE--ERPL-SPSPSRRFGGEKCRV

Query:  --APTKPASPRRPSLPARGCGMKKEITCIHRISSKIDEVAAREAVLNDGDLDSVVAMEDIDNPLISLDCFIFL
          AP     P+  S P R   MKKEITCIHRISSKIDEVA +EAV   GDLDSVVAMEDIDNPLISLDCFIFL
Subjt:  --APTKPASPRRPSLPARGCGMKKEITCIHRISSKIDEVAAREAVLNDGDLDSVVAMEDIDNPLISLDCFIFL

XP_022939152.1 uncharacterized protein LOC111445147 [Cucurbita moschata]3.5e-8773.4Show/hide
Query:  MGSCISKCKPKTFK--QPPRFDFNNLVQDKLVVIPQPPPL-----TTTATTPSLSLSNKISPYPPSPSPSSSSISSFTCLSSS-TTTTTNTSFSTASSRS
        MGSCISKCKPK  K   PP FDFNN+VQDKLVVIPQPPPL       +A+ PSLSLSNKISPYPPSPSPSSSS    TCLSSS TTTTTN+SFSTASSRS
Subjt:  MGSCISKCKPKTFK--QPPRFDFNNLVQDKLVVIPQPPPL-----TTTATTPSLSLSNKISPYPPSPSPSSSSISSFTCLSSS-TTTTTNTSFSTASSRS

Query:  PIFSDEFLWSCYKQNPHVVRINSLKANAFS-PIKPVSPLARQRQQRQPSPQRV----STLRSTPQKRVRQASPSPVRQKSFRKE-ERPLSPSPSRRFGGE
        PI   ++ WS Y QNPHVVRINSLKA+ FS P   VSP+ RQR  R PSPQRV    ST  STPQKRVRQASPSPVRQKSFRKE +RPLSPSPSRR  GE
Subjt:  PIFSDEFLWSCYKQNPHVVRINSLKANAFS-PIKPVSPLARQRQQRQPSPQRV----STLRSTPQKRVRQASPSPVRQKSFRKE-ERPLSPSPSRRFGGE

Query:  KCRVA------PTKPASPRRPSLPARGCGMKKE-ITCIHRISSKIDEVAAREAVLNDGDLDSVVAMEDIDNPLISLDCFIFL
        KCRVA       +     R+   PARGC MKKE ITCIHRISSKIDE AAREAVLN+GDLDS  AMEDIDNPLISLDCFIFL
Subjt:  KCRVA------PTKPASPRRPSLPARGCGMKKE-ITCIHRISSKIDEVAAREAVLNDGDLDSVVAMEDIDNPLISLDCFIFL

XP_023550659.1 proline-rich receptor-like protein kinase PERK2 [Cucurbita pepo subsp. pepo]7.5e-9075.27Show/hide
Query:  MGSCISKCKPKTFK-----QPPRFDFNNLVQDKLVVIPQPPPL---TTTATTPSLSLSNKISPYPPSPSPSSSSISSFTCLSS-STTTTTNTSFSTASSR
        MGSCISKCKPK  K      PP FDFNN+VQDKLVVIPQPPPL    T+A  PSLSLSNKISPYPPSPSPSSSS    TCLSS +TTTTTN+SFSTASSR
Subjt:  MGSCISKCKPKTFK-----QPPRFDFNNLVQDKLVVIPQPPPL---TTTATTPSLSLSNKISPYPPSPSPSSSSISSFTCLSS-STTTTTNTSFSTASSR

Query:  SPIFSDEFLWSCYKQNPHVVRINSLKANAFS-PIKPVSPLARQRQQRQPSPQRVSTLRSTPQKRVRQASPSPVRQKSFRKE-ERPLSPSPSRRFGGEKCR
        SPI S ++ WS Y QNPHVVRINSLKA+AFS P  PVSP+ RQR  R PSPQRVS  RSTPQKRVRQASPSPVRQKSFRKE +RPLSPSPSRR  GEKCR
Subjt:  SPIFSDEFLWSCYKQNPHVVRINSLKANAFS-PIKPVSPLARQRQQRQPSPQRVSTLRSTPQKRVRQASPSPVRQKSFRKE-ERPLSPSPSRRFGGEKCR

Query:  VA------PTKPASPRRPSLPARGCGMKKE-ITCIHRISSKIDEVAAREAVLNDGDLDSVVAMEDIDNPLISLDCFIFL
        VA       +     R+   PARGC MKKE ITCIHRISSKIDE AAREAVLN+GDLDS  AMEDIDNPLISLDCFIFL
Subjt:  VA------PTKPASPRRPSLPARGCGMKKE-ITCIHRISSKIDEVAAREAVLNDGDLDSVVAMEDIDNPLISLDCFIFL

XP_038886331.1 proline-rich receptor-like protein kinase PERK2 [Benincasa hispida]8.3e-7372.43Show/hide
Query:  MGSCISKCKPKTFKQPPRFDF-NNLVQDKLVVIPQP-PPL----TTTATTPSLSLSNKISPYPPSPSPSSSSISSFTCLSSSTTTTTNTSFSTASSR-SP
        MGSCISKCKPK  KQPP FDF NNLVQDKLVVIPQP  PL    TTT T PSLSL+NKISPYPPSP   SSSISSFTCLSSS    TNTSFSTASS  SP
Subjt:  MGSCISKCKPKTFKQPPRFDF-NNLVQDKLVVIPQP-PPL----TTTATTPSLSLSNKISPYPPSPSPSSSSISSFTCLSSSTTTTTNTSFSTASSR-SP

Query:  IFSDEFLWSCYKQNPHVVRINSLKANAF-SPIKPVSPLARQRQQRQPSPQRVSTLRSTPQKRVRQASPSPVRQKSFRKEERPL---SPSPSRRFGGEKCR
        I S     S Y Q   ++RINSLKA AF  PIKPVSPL      R PSPQRV  LRSTPQKRVR ASPSP+RQKSFRKE  P    SPSPSRRF  EKCR
Subjt:  IFSDEFLWSCYKQNPHVVRINSLKANAF-SPIKPVSPLARQRQQRQPSPQRVSTLRSTPQKRVRQASPSPVRQKSFRKEERPL---SPSPSRRFGGEKCR

Query:  VAPTKPASPRRPSLPARGCGMKKEITCIHRISSKIDEVAAREAVLNDGDLDSVVAMEDIDNPLISLDCFIFL
        VA     +P+  S PAR   MKKEITCIHRISSKIDEVA +EAV   GDLDSVVAMEDIDNPLISLDCFIFL
Subjt:  VAPTKPASPRRPSLPARGCGMKKEITCIHRISSKIDEVAAREAVLNDGDLDSVVAMEDIDNPLISLDCFIFL

TrEMBL top hitse value%identityAlignment
A0A0A0KIF4 Uncharacterized protein3.8e-7972.53Show/hide
Query:  MGSCISKCKPKTFKQPPRFDFNNL-VQDKLVVIPQP--PPLT--TTATTPSLSLSNKISPYPPSPSPSSSSISSFTCLSSSTTTTTNTSFSTASSR-SPI
        MGSCISKCKPK  KQPP FDFNNL VQDKLVVIPQP  P LT  TT+ TPSLSL NKISPYPPSPSPSSSSISSFTCLSS+T ++TNTSFSTASS  SPI
Subjt:  MGSCISKCKPKTFKQPPRFDFNNL-VQDKLVVIPQP--PPLT--TTATTPSLSLSNKISPYPPSPSPSSSSISSFTCLSSSTTTTTNTSFSTASSR-SPI

Query:  FSDEFLWSCYKQNPHVVRINSLKANAF-SPIKPVSPLARQRQQRQPSPQRVSTLRSTPQKRVRQASPSPVRQKSFRKE--ERPL-SPSPSRRFGGEKCRV
         S  +  S Y QNPH+  INSLKA+AF  P+KP+SPL      R PSPQRVS  RS PQKR R ASPSP+RQKSFRKE  +RPL SPSP+RRF  EKC+V
Subjt:  FSDEFLWSCYKQNPHVVRINSLKANAF-SPIKPVSPLARQRQQRQPSPQRVSTLRSTPQKRVRQASPSPVRQKSFRKE--ERPL-SPSPSRRFGGEKCRV

Query:  --APTKPASPRRPSLPARGCGMKKEITCIHRISSKIDEVAAREAVLNDGDLDSVVAMEDIDNPLISLDCFIFL
          AP     P+  S P R   MKKEITCIHRISSKIDEVA +EAV   GDLDSVVAMEDIDNPLISLDCFIFL
Subjt:  --APTKPASPRRPSLPARGCGMKKEITCIHRISSKIDEVAAREAVLNDGDLDSVVAMEDIDNPLISLDCFIFL

A0A1S3B4I5 Uncharacterized protein3.3e-8372.73Show/hide
Query:  MGSCISKCKPKTFKQPPRFDFNNLVQDKLVVIPQP--PPLTTTAT-----TPSLSLSNKISPYPPSPSPSSSSISSFTCLSSSTTTTTNTSFSTASSR-S
        MGSCISKCKPK  +QPP FDFNNLVQDKLVVIPQP  P LT+TAT     TPSLSL NKISPYPPSPSPSSSSISSFTCLSS+T+++TNTSFSTASS  S
Subjt:  MGSCISKCKPKTFKQPPRFDFNNLVQDKLVVIPQP--PPLTTTAT-----TPSLSLSNKISPYPPSPSPSSSSISSFTCLSSSTTTTTNTSFSTASSR-S

Query:  PIFSDEFLWSCYKQNPHVVRINSLKANAF-SPIKPVSPLARQRQQRQPSPQRVSTLRSTPQKRVRQASPSPVRQKSFRKE--ERPL-SPSPSRRFGGEKC
        PI S  +  S Y QNPH+ RINSLKA+AF SP+KP+SPL      R PSPQRVS  RSTPQKR+R ASPSP+RQKSFRKE  +RPL SPSP+RRF  EKC
Subjt:  PIFSDEFLWSCYKQNPHVVRINSLKANAF-SPIKPVSPLARQRQQRQPSPQRVSTLRSTPQKRVRQASPSPVRQKSFRKE--ERPL-SPSPSRRFGGEKC

Query:  RV--APTKPASPRRPSLPARGCGMKKEITCIHRISSKIDEVAAREAVLNDGDLDSVVAMEDIDNPLISLDCFIFL
        +V  AP     P+  S P RG  MKKEITCIHRISSKID+VA +EAV   GDLDSVVAMED+DNPLISLDCFIFL
Subjt:  RV--APTKPASPRRPSLPARGCGMKKEITCIHRISSKIDEVAAREAVLNDGDLDSVVAMEDIDNPLISLDCFIFL

A0A5D3D583 TPRXL protein3.3e-8372.73Show/hide
Query:  MGSCISKCKPKTFKQPPRFDFNNLVQDKLVVIPQP--PPLTTTAT-----TPSLSLSNKISPYPPSPSPSSSSISSFTCLSSSTTTTTNTSFSTASSR-S
        MGSCISKCKPK  +QPP FDFNNLVQDKLVVIPQP  P LT+TAT     TPSLSL NKISPYPPSPSPSSSSISSFTCLSS+T+++TNTSFSTASS  S
Subjt:  MGSCISKCKPKTFKQPPRFDFNNLVQDKLVVIPQP--PPLTTTAT-----TPSLSLSNKISPYPPSPSPSSSSISSFTCLSSSTTTTTNTSFSTASSR-S

Query:  PIFSDEFLWSCYKQNPHVVRINSLKANAF-SPIKPVSPLARQRQQRQPSPQRVSTLRSTPQKRVRQASPSPVRQKSFRKE--ERPL-SPSPSRRFGGEKC
        PI S  +  S Y QNPH+ RINSLKA+AF SP+KP+SPL      R PSPQRVS  RSTPQKR+R ASPSP+RQKSFRKE  +RPL SPSP+RRF  EKC
Subjt:  PIFSDEFLWSCYKQNPHVVRINSLKANAF-SPIKPVSPLARQRQQRQPSPQRVSTLRSTPQKRVRQASPSPVRQKSFRKE--ERPL-SPSPSRRFGGEKC

Query:  RV--APTKPASPRRPSLPARGCGMKKEITCIHRISSKIDEVAAREAVLNDGDLDSVVAMEDIDNPLISLDCFIFL
        +V  AP     P+  S P RG  MKKEITCIHRISSKID+VA +EAV   GDLDSVVAMED+DNPLISLDCFIFL
Subjt:  RV--APTKPASPRRPSLPARGCGMKKEITCIHRISSKIDEVAAREAVLNDGDLDSVVAMEDIDNPLISLDCFIFL

A0A6J1FG04 uncharacterized protein LOC1114451471.7e-8773.4Show/hide
Query:  MGSCISKCKPKTFK--QPPRFDFNNLVQDKLVVIPQPPPL-----TTTATTPSLSLSNKISPYPPSPSPSSSSISSFTCLSSS-TTTTTNTSFSTASSRS
        MGSCISKCKPK  K   PP FDFNN+VQDKLVVIPQPPPL       +A+ PSLSLSNKISPYPPSPSPSSSS    TCLSSS TTTTTN+SFSTASSRS
Subjt:  MGSCISKCKPKTFK--QPPRFDFNNLVQDKLVVIPQPPPL-----TTTATTPSLSLSNKISPYPPSPSPSSSSISSFTCLSSS-TTTTTNTSFSTASSRS

Query:  PIFSDEFLWSCYKQNPHVVRINSLKANAFS-PIKPVSPLARQRQQRQPSPQRV----STLRSTPQKRVRQASPSPVRQKSFRKE-ERPLSPSPSRRFGGE
        PI   ++ WS Y QNPHVVRINSLKA+ FS P   VSP+ RQR  R PSPQRV    ST  STPQKRVRQASPSPVRQKSFRKE +RPLSPSPSRR  GE
Subjt:  PIFSDEFLWSCYKQNPHVVRINSLKANAFS-PIKPVSPLARQRQQRQPSPQRV----STLRSTPQKRVRQASPSPVRQKSFRKE-ERPLSPSPSRRFGGE

Query:  KCRVA------PTKPASPRRPSLPARGCGMKKE-ITCIHRISSKIDEVAAREAVLNDGDLDSVVAMEDIDNPLISLDCFIFL
        KCRVA       +     R+   PARGC MKKE ITCIHRISSKIDE AAREAVLN+GDLDS  AMEDIDNPLISLDCFIFL
Subjt:  KCRVA------PTKPASPRRPSLPARGCGMKKE-ITCIHRISSKIDEVAAREAVLNDGDLDSVVAMEDIDNPLISLDCFIFL

A0A6J5TQX1 Uncharacterized protein2.0e-4048.04Show/hide
Query:  MGSCISKCKPKTFKQPPRFDFNNLVQDKLVVIPQPPPLTTTATTPSLSLSNKISPYPPSPSPSSSSISSFTCLSSSTTTTTNTSF-STASSRSPI-----
        MGSCISKC+P+        D  N VQDKLV+   P  L      P +S SNKISP PPSPS S+SS SSFTC ++++T+ T++S  ST SS S +     
Subjt:  MGSCISKCKPKTFKQPPRFDFNNLVQDKLVVIPQPPPLTTTATTPSLSLSNKISPYPPSPSPSSSSISSFTCLSSSTTTTTNTSF-STASSRSPI-----

Query:  ---FSDEFLWSCYKQNPHVVRINSLKANAFS----PIKPVSPLARQRQQRQPSPQRVSTLRSTPQKRVRQASPSPV-RQKSFRKE-ERP-----------
           FS+EFLWSCYK+NPHVVRINSLK  +FS    P KP+ P A +++Q        S    TPQKRVR +SP+P+ RQKSFRKE ERP           
Subjt:  ---FSDEFLWSCYKQNPHVVRINSLKANAFS----PIKPVSPLARQRQQRQPSPQRVSTLRSTPQKRVRQASPSPV-RQKSFRKE-ERP-----------

Query:  --LSPSPSRRFGGEKCRVAPTKPASPRRPSL----PARGCG-------------MKKEITCIHRISSKIDEVAAREAVLNDGDLDSVVAMEDIDNPLISL
           SPSPSRRF        P K +S  +P+     PA                  ++  T IHRISSKIDEVA  EA+ +  ++DS+ A EDIDNPLISL
Subjt:  --LSPSPSRRFGGEKCRVAPTKPASPRRPSL----PARGCG-------------MKKEITCIHRISSKIDEVAAREAVLNDGDLDSVVAMEDIDNPLISL

Query:  DCFIFL
        DCFIFL
Subjt:  DCFIFL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21510.1 unknown protein4.1e-1731.79Show/hide
Query:  MGSCISKCKPKT--FKQPPRFDFNNLVQDKLVVIPQPPPLTTTATTPS------LSLSNKISPYPPS--PSPSSSSISSFTCLSSSTT-----TTTNTSF
        MG CISKC PK+  FK+            + + + + P   +    P       + + NK    P    P PS   ++SF+ +  STT     +++N+S 
Subjt:  MGSCISKCKPKT--FKQPPRFDFNNLVQDKLVVIPQPPPLTTTATTPS------LSLSNKISPYPPS--PSPSSSSISSFTCLSSSTT-----TTTNTSF

Query:  STAS----SRSPIFSDEFLWSCYKQNPHVVRINSLKANAFS--------PIKPVSPLARQRQQRQPSPQRVSTLR-STPQKRVRQASP---SPVRQKSFR
        STAS    S+   FS++FL +CY++N HV RINSL+  + S        P +  SP+   R    P+     + R S   KR R+ SP   S  RQKSFR
Subjt:  STAS----SRSPIFSDEFLWSCYKQNPHVVRINSLKANAFS--------PIKPVSPLARQRQQRQPSPQRVSTLR-STPQKRVRQASP---SPVRQKSFR

Query:  KEERPL----------------SPSPSRRFGGEKCR-VAPTKPASPRRPSLPARGCGMKKEITC---------------IHRISSKIDEVAAREAVLNDG
        +++  +                SPSPSRR+ G   +  +P++       SL    C  K  +                 IHRISSKID+   RE +  D 
Subjt:  KEERPL----------------SPSPSRRFGGEKCR-VAPTKPASPRRPSLPARGCGMKKEITC---------------IHRISSKIDEVAAREAVLNDG

Query:  DLDSVVAMEDIDNPLISLDCFIFL
        +   V   E++ NPLI LDCFIFL
Subjt:  DLDSVVAMEDIDNPLISLDCFIFL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGATCTTGCATTAGCAAATGCAAACCCAAGACATTCAAACAACCACCTCGTTTCGATTTCAACAACCTTGTTCAAGACAAGCTCGTTGTGATTCCTCAGCCGCCGCC
ATTGACGACGACAGCAACAACTCCTTCTCTCTCTCTCAGCAACAAAATCTCTCCTTATCCTCCTTCCCCTTCGCCTTCCTCTTCTTCCATCTCTTCTTTCACTTGTCTCT
CTTCATCCACAACCACAACCACCAACACCTCTTTCTCCACTGCATCTTCTCGCTCGCCCATCTTCTCTGACGAGTTCTTGTGGTCTTGCTACAAGCAAAACCCTCACGTC
GTTCGAATCAATTCCCTTAAAGCTAACGCCTTTTCGCCGATCAAGCCGGTTTCCCCGCTCGCCCGCCAGCGCCAACAACGACAGCCGTCCCCTCAGAGAGTGTCGACGTT
GAGGTCGACACCCCAGAAGAGAGTTCGACAGGCATCGCCGTCGCCCGTTCGACAGAAGAGCTTCAGGAAGGAGGAGCGGCCTCTGTCGCCGTCGCCGAGTAGACGGTTTG
GCGGAGAGAAATGCCGGGTGGCTCCGACCAAGCCTGCCAGTCCCAGAAGGCCTTCATTGCCGGCGAGGGGTTGTGGGATGAAGAAGGAAATTACTTGCATTCATAGGATC
AGTTCGAAGATTGACGAAGTTGCTGCGAGAGAGGCGGTTTTGAATGATGGAGATTTAGATTCGGTGGTGGCTATGGAGGATATTGACAATCCTTTAATCTCGTTGGATTG
CTTTATCTTTCTGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGATCTTGCATTAGCAAATGCAAACCCAAGACATTCAAACAACCACCTCGTTTCGATTTCAACAACCTTGTTCAAGACAAGCTCGTTGTGATTCCTCAGCCGCCGCC
ATTGACGACGACAGCAACAACTCCTTCTCTCTCTCTCAGCAACAAAATCTCTCCTTATCCTCCTTCCCCTTCGCCTTCCTCTTCTTCCATCTCTTCTTTCACTTGTCTCT
CTTCATCCACAACCACAACCACCAACACCTCTTTCTCCACTGCATCTTCTCGCTCGCCCATCTTCTCTGACGAGTTCTTGTGGTCTTGCTACAAGCAAAACCCTCACGTC
GTTCGAATCAATTCCCTTAAAGCTAACGCCTTTTCGCCGATCAAGCCGGTTTCCCCGCTCGCCCGCCAGCGCCAACAACGACAGCCGTCCCCTCAGAGAGTGTCGACGTT
GAGGTCGACACCCCAGAAGAGAGTTCGACAGGCATCGCCGTCGCCCGTTCGACAGAAGAGCTTCAGGAAGGAGGAGCGGCCTCTGTCGCCGTCGCCGAGTAGACGGTTTG
GCGGAGAGAAATGCCGGGTGGCTCCGACCAAGCCTGCCAGTCCCAGAAGGCCTTCATTGCCGGCGAGGGGTTGTGGGATGAAGAAGGAAATTACTTGCATTCATAGGATC
AGTTCGAAGATTGACGAAGTTGCTGCGAGAGAGGCGGTTTTGAATGATGGAGATTTAGATTCGGTGGTGGCTATGGAGGATATTGACAATCCTTTAATCTCGTTGGATTG
CTTTATCTTTCTGTAG
Protein sequenceShow/hide protein sequence
MGSCISKCKPKTFKQPPRFDFNNLVQDKLVVIPQPPPLTTTATTPSLSLSNKISPYPPSPSPSSSSISSFTCLSSSTTTTTNTSFSTASSRSPIFSDEFLWSCYKQNPHV
VRINSLKANAFSPIKPVSPLARQRQQRQPSPQRVSTLRSTPQKRVRQASPSPVRQKSFRKEERPLSPSPSRRFGGEKCRVAPTKPASPRRPSLPARGCGMKKEITCIHRI
SSKIDEVAAREAVLNDGDLDSVVAMEDIDNPLISLDCFIFL