; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0029174 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0029174
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Descriptionsegmentation polarity homeobox protein engrailed
Genome locationchr8:36047657..36048439
RNA-Seq ExpressionLag0029174
SyntenyLag0029174
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008441600.1 PREDICTED: putative protein TPRXL [Cucumis melo]1.8e-8371.9Show/hide
Query:  MGSCISKCKPKTFKQPPRFDFNNLVQDKLVVIPQP--PPLT-----TTTTTPSLSLRNKISPYPPSPSPSSSSISSFTCLSSSTTTTTNTSFSTASSR-S
        MGSCISKCKPK  +QPP FDFNNLVQDKLVVIPQP  P LT     TT+ TPSLSL NKISPYPPSPSPSSSSISSFTCLSS+T+++TNTSFSTASS  S
Subjt:  MGSCISKCKPKTFKQPPRFDFNNLVQDKLVVIPQP--PPLT-----TTTTTPSLSLRNKISPYPPSPSPSSSSISSFTCLSSSTTTTTNTSFSTASSR-S

Query:  PIFSDEFLWSCYKQNPHVVRINSLKANAF-SPIKPVSPLVRQRQPRQPSPQRVSTLRSTPQKRVRPASPSPVRQKSFRKE--ERPL-SPSPSRRFGGEKC
        PI S  +  S Y QNPH+ RINSLKA+AF SP+KP+SPLV     R PSPQRVS  RSTPQKR+RPASPSP+RQKSFRKE  +RPL SPSP+RRF  EKC
Subjt:  PIFSDEFLWSCYKQNPHVVRINSLKANAF-SPIKPVSPLVRQRQPRQPSPQRVSTLRSTPQKRVRPASPSPVRQKSFRKE--ERPL-SPSPSRRFGGEKC

Query:  RV--APTKPASLRRQLPARGCGMKKEITCIHRISSKIEEAAAREAVLNDGDLDSVVAMEDIDNPLISLDCFIFL
        +V  AP      + + P RG  MKKEITCIHRISSKI++ A +EAV   GDLDSVVAMED+DNPLISLDCFIFL
Subjt:  RV--APTKPASLRRQLPARGCGMKKEITCIHRISSKIEEAAAREAVLNDGDLDSVVAMEDIDNPLISLDCFIFL

XP_011657327.1 putative protein TPRXL [Cucumis sativus]5.4e-8072.06Show/hide
Query:  MGSCISKCKPKTFKQPPRFDFNNL-VQDKLVVIPQP--PPLTTTTT--TPSLSLRNKISPYPPSPSPSSSSISSFTCLSSSTTTTTNTSFSTASSR-SPI
        MGSCISKCKPK  KQPP FDFNNL VQDKLVVIPQP  P LTT TT  TPSLSL NKISPYPPSPSPSSSSISSFTCLSS+T ++TNTSFSTASS  SPI
Subjt:  MGSCISKCKPKTFKQPPRFDFNNL-VQDKLVVIPQP--PPLTTTTT--TPSLSLRNKISPYPPSPSPSSSSISSFTCLSSSTTTTTNTSFSTASSR-SPI

Query:  FSDEFLWSCYKQNPHVVRINSLKANAF-SPIKPVSPLVRQRQPRQPSPQRVSTLRSTPQKRVRPASPSPVRQKSFRKE--ERPL-SPSPSRRFGGEKCRV
         S  +  S Y QNPH+  INSLKA+AF  P+KP+SPL+     R PSPQRVS  RS PQKR RPASPSP+RQKSFRKE  +RPL SPSP+RRF  EKC+V
Subjt:  FSDEFLWSCYKQNPHVVRINSLKANAF-SPIKPVSPLVRQRQPRQPSPQRVSTLRSTPQKRVRPASPSPVRQKSFRKE--ERPL-SPSPSRRFGGEKCRV

Query:  --APTKPASLRRQLPARGCGMKKEITCIHRISSKIEEAAAREAVLNDGDLDSVVAMEDIDNPLISLDCFIFL
          AP      + + P R   MKKEITCIHRISSKI+E A +EAV   GDLDSVVAMEDIDNPLISLDCFIFL
Subjt:  --APTKPASLRRQLPARGCGMKKEITCIHRISSKIEEAAAREAVLNDGDLDSVVAMEDIDNPLISLDCFIFL

XP_022939152.1 uncharacterized protein LOC111445147 [Cucurbita moschata]4.5e-8773.05Show/hide
Query:  MGSCISKCKPKTFK--QPPRFDFNNLVQDKLVVIPQPPPLTTTTT-----TPSLSLRNKISPYPPSPSPSSSSISSFTCLSSS-TTTTTNTSFSTASSRS
        MGSCISKCKPK  K   PP FDFNN+VQDKLVVIPQPPPL    T      PSLSL NKISPYPPSPSPSSSS    TCLSSS TTTTTN+SFSTASSRS
Subjt:  MGSCISKCKPKTFK--QPPRFDFNNLVQDKLVVIPQPPPLTTTTT-----TPSLSLRNKISPYPPSPSPSSSSISSFTCLSSS-TTTTTNTSFSTASSRS

Query:  PIFSDEFLWSCYKQNPHVVRINSLKANAFS-PIKPVSPLVRQRQPRQPSPQRV----STLRSTPQKRVRPASPSPVRQKSFRKE-ERPLSPSPSRRFGGE
        PI   ++ WS Y QNPHVVRINSLKA+ FS P   VSP+VRQR  R PSPQRV    ST  STPQKRVR ASPSPVRQKSFRKE +RPLSPSPSRR  GE
Subjt:  PIFSDEFLWSCYKQNPHVVRINSLKANAFS-PIKPVSPLVRQRQPRQPSPQRV----STLRSTPQKRVRPASPSPVRQKSFRKE-ERPLSPSPSRRFGGE

Query:  KCRV-------APTKPASLRRQLPARGCGMKKE-ITCIHRISSKIEEAAAREAVLNDGDLDSVVAMEDIDNPLISLDCFIFL
        KCRV       A  K    + + PARGC MKKE ITCIHRISSKI+EAAAREAVLN+GDLDS  AMEDIDNPLISLDCFIFL
Subjt:  KCRV-------APTKPASLRRQLPARGCGMKKE-ITCIHRISSKIEEAAAREAVLNDGDLDSVVAMEDIDNPLISLDCFIFL

XP_023550659.1 proline-rich receptor-like protein kinase PERK2 [Cucurbita pepo subsp. pepo]1.7e-8974.55Show/hide
Query:  MGSCISKCKPKTFK-----QPPRFDFNNLVQDKLVVIPQPPPL---TTTTTTPSLSLRNKISPYPPSPSPSSSSISSFTCLSS-STTTTTNTSFSTASSR
        MGSCISKCKPK  K      PP FDFNN+VQDKLVVIPQPPPL    T+   PSLSL NKISPYPPSPSPSSSS    TCLSS +TTTTTN+SFSTASSR
Subjt:  MGSCISKCKPKTFK-----QPPRFDFNNLVQDKLVVIPQPPPL---TTTTTTPSLSLRNKISPYPPSPSPSSSSISSFTCLSS-STTTTTNTSFSTASSR

Query:  SPIFSDEFLWSCYKQNPHVVRINSLKANAFS-PIKPVSPLVRQRQPRQPSPQRVSTLRSTPQKRVRPASPSPVRQKSFRKE-ERPLSPSPSRRFGGEKCR
        SPI S ++ WS Y QNPHVVRINSLKA+AFS P  PVSP+VRQR  R PSPQRVS  RSTPQKRVR ASPSPVRQKSFRKE +RPLSPSPSRR  GEKCR
Subjt:  SPIFSDEFLWSCYKQNPHVVRINSLKANAFS-PIKPVSPLVRQRQPRQPSPQRVSTLRSTPQKRVRPASPSPVRQKSFRKE-ERPLSPSPSRRFGGEKCR

Query:  V-------APTKPASLRRQLPARGCGMKKE-ITCIHRISSKIEEAAAREAVLNDGDLDSVVAMEDIDNPLISLDCFIFL
        V       A  K    + + PARGC MKKE ITCIHRISSKI+EAAAREAVLN+GDLDS  AMEDIDNPLISLDCFIFL
Subjt:  V-------APTKPASLRRQLPARGCGMKKE-ITCIHRISSKIEEAAAREAVLNDGDLDSVVAMEDIDNPLISLDCFIFL

XP_038886331.1 proline-rich receptor-like protein kinase PERK2 [Benincasa hispida]4.4e-7472.32Show/hide
Query:  MGSCISKCKPKTFKQPPRFDF-NNLVQDKLVVIPQP-PPL----TTTTTTPSLSLRNKISPYPPSPSPSSSSISSFTCLSSSTTTTTNTSFSTASSR-SP
        MGSCISKCKPK  KQPP FDF NNLVQDKLVVIPQP  PL    TTTTT PSLSL NKISPYPPSP   SSSISSFTCLSSS    TNTSFSTASS  SP
Subjt:  MGSCISKCKPKTFKQPPRFDF-NNLVQDKLVVIPQP-PPL----TTTTTTPSLSLRNKISPYPPSPSPSSSSISSFTCLSSSTTTTTNTSFSTASSR-SP

Query:  IFSDEFLWSCYKQNPHVVRINSLKANAF-SPIKPVSPLVRQRQPRQPSPQRVSTLRSTPQKRVRPASPSPVRQKSFRKEERPL---SPSPSRRFGGEKCR
        I S     S Y Q   ++RINSLKA AF  PIKPVSPLV     R PSPQRV  LRSTPQKRVRPASPSP+RQKSFRKE  P    SPSPSRRF  EKCR
Subjt:  IFSDEFLWSCYKQNPHVVRINSLKANAF-SPIKPVSPLVRQRQPRQPSPQRVSTLRSTPQKRVRPASPSPVRQKSFRKEERPL---SPSPSRRFGGEKCR

Query:  VAPTKPASLRRQLPARGCGMKKEITCIHRISSKIEEAAAREAVLNDGDLDSVVAMEDIDNPLISLDCFIFL
        VA     + + + PAR   MKKEITCIHRISSKI+E A +EAV   GDLDSVVAMEDIDNPLISLDCFIFL
Subjt:  VAPTKPASLRRQLPARGCGMKKEITCIHRISSKIEEAAAREAVLNDGDLDSVVAMEDIDNPLISLDCFIFL

TrEMBL top hitse value%identityAlignment
A0A0A0KIF4 Uncharacterized protein2.6e-8072.06Show/hide
Query:  MGSCISKCKPKTFKQPPRFDFNNL-VQDKLVVIPQP--PPLTTTTT--TPSLSLRNKISPYPPSPSPSSSSISSFTCLSSSTTTTTNTSFSTASSR-SPI
        MGSCISKCKPK  KQPP FDFNNL VQDKLVVIPQP  P LTT TT  TPSLSL NKISPYPPSPSPSSSSISSFTCLSS+T ++TNTSFSTASS  SPI
Subjt:  MGSCISKCKPKTFKQPPRFDFNNL-VQDKLVVIPQP--PPLTTTTT--TPSLSLRNKISPYPPSPSPSSSSISSFTCLSSSTTTTTNTSFSTASSR-SPI

Query:  FSDEFLWSCYKQNPHVVRINSLKANAF-SPIKPVSPLVRQRQPRQPSPQRVSTLRSTPQKRVRPASPSPVRQKSFRKE--ERPL-SPSPSRRFGGEKCRV
         S  +  S Y QNPH+  INSLKA+AF  P+KP+SPL+     R PSPQRVS  RS PQKR RPASPSP+RQKSFRKE  +RPL SPSP+RRF  EKC+V
Subjt:  FSDEFLWSCYKQNPHVVRINSLKANAF-SPIKPVSPLVRQRQPRQPSPQRVSTLRSTPQKRVRPASPSPVRQKSFRKE--ERPL-SPSPSRRFGGEKCRV

Query:  --APTKPASLRRQLPARGCGMKKEITCIHRISSKIEEAAAREAVLNDGDLDSVVAMEDIDNPLISLDCFIFL
          AP      + + P R   MKKEITCIHRISSKI+E A +EAV   GDLDSVVAMEDIDNPLISLDCFIFL
Subjt:  --APTKPASLRRQLPARGCGMKKEITCIHRISSKIEEAAAREAVLNDGDLDSVVAMEDIDNPLISLDCFIFL

A0A1S3B4I5 Uncharacterized protein8.6e-8471.9Show/hide
Query:  MGSCISKCKPKTFKQPPRFDFNNLVQDKLVVIPQP--PPLT-----TTTTTPSLSLRNKISPYPPSPSPSSSSISSFTCLSSSTTTTTNTSFSTASSR-S
        MGSCISKCKPK  +QPP FDFNNLVQDKLVVIPQP  P LT     TT+ TPSLSL NKISPYPPSPSPSSSSISSFTCLSS+T+++TNTSFSTASS  S
Subjt:  MGSCISKCKPKTFKQPPRFDFNNLVQDKLVVIPQP--PPLT-----TTTTTPSLSLRNKISPYPPSPSPSSSSISSFTCLSSSTTTTTNTSFSTASSR-S

Query:  PIFSDEFLWSCYKQNPHVVRINSLKANAF-SPIKPVSPLVRQRQPRQPSPQRVSTLRSTPQKRVRPASPSPVRQKSFRKE--ERPL-SPSPSRRFGGEKC
        PI S  +  S Y QNPH+ RINSLKA+AF SP+KP+SPLV     R PSPQRVS  RSTPQKR+RPASPSP+RQKSFRKE  +RPL SPSP+RRF  EKC
Subjt:  PIFSDEFLWSCYKQNPHVVRINSLKANAF-SPIKPVSPLVRQRQPRQPSPQRVSTLRSTPQKRVRPASPSPVRQKSFRKE--ERPL-SPSPSRRFGGEKC

Query:  RV--APTKPASLRRQLPARGCGMKKEITCIHRISSKIEEAAAREAVLNDGDLDSVVAMEDIDNPLISLDCFIFL
        +V  AP      + + P RG  MKKEITCIHRISSKI++ A +EAV   GDLDSVVAMED+DNPLISLDCFIFL
Subjt:  RV--APTKPASLRRQLPARGCGMKKEITCIHRISSKIEEAAAREAVLNDGDLDSVVAMEDIDNPLISLDCFIFL

A0A5D3D583 TPRXL protein8.6e-8471.9Show/hide
Query:  MGSCISKCKPKTFKQPPRFDFNNLVQDKLVVIPQP--PPLT-----TTTTTPSLSLRNKISPYPPSPSPSSSSISSFTCLSSSTTTTTNTSFSTASSR-S
        MGSCISKCKPK  +QPP FDFNNLVQDKLVVIPQP  P LT     TT+ TPSLSL NKISPYPPSPSPSSSSISSFTCLSS+T+++TNTSFSTASS  S
Subjt:  MGSCISKCKPKTFKQPPRFDFNNLVQDKLVVIPQP--PPLT-----TTTTTPSLSLRNKISPYPPSPSPSSSSISSFTCLSSSTTTTTNTSFSTASSR-S

Query:  PIFSDEFLWSCYKQNPHVVRINSLKANAF-SPIKPVSPLVRQRQPRQPSPQRVSTLRSTPQKRVRPASPSPVRQKSFRKE--ERPL-SPSPSRRFGGEKC
        PI S  +  S Y QNPH+ RINSLKA+AF SP+KP+SPLV     R PSPQRVS  RSTPQKR+RPASPSP+RQKSFRKE  +RPL SPSP+RRF  EKC
Subjt:  PIFSDEFLWSCYKQNPHVVRINSLKANAF-SPIKPVSPLVRQRQPRQPSPQRVSTLRSTPQKRVRPASPSPVRQKSFRKE--ERPL-SPSPSRRFGGEKC

Query:  RV--APTKPASLRRQLPARGCGMKKEITCIHRISSKIEEAAAREAVLNDGDLDSVVAMEDIDNPLISLDCFIFL
        +V  AP      + + P RG  MKKEITCIHRISSKI++ A +EAV   GDLDSVVAMED+DNPLISLDCFIFL
Subjt:  RV--APTKPASLRRQLPARGCGMKKEITCIHRISSKIEEAAAREAVLNDGDLDSVVAMEDIDNPLISLDCFIFL

A0A6J1FG04 uncharacterized protein LOC1114451472.2e-8773.05Show/hide
Query:  MGSCISKCKPKTFK--QPPRFDFNNLVQDKLVVIPQPPPLTTTTT-----TPSLSLRNKISPYPPSPSPSSSSISSFTCLSSS-TTTTTNTSFSTASSRS
        MGSCISKCKPK  K   PP FDFNN+VQDKLVVIPQPPPL    T      PSLSL NKISPYPPSPSPSSSS    TCLSSS TTTTTN+SFSTASSRS
Subjt:  MGSCISKCKPKTFK--QPPRFDFNNLVQDKLVVIPQPPPLTTTTT-----TPSLSLRNKISPYPPSPSPSSSSISSFTCLSSS-TTTTTNTSFSTASSRS

Query:  PIFSDEFLWSCYKQNPHVVRINSLKANAFS-PIKPVSPLVRQRQPRQPSPQRV----STLRSTPQKRVRPASPSPVRQKSFRKE-ERPLSPSPSRRFGGE
        PI   ++ WS Y QNPHVVRINSLKA+ FS P   VSP+VRQR  R PSPQRV    ST  STPQKRVR ASPSPVRQKSFRKE +RPLSPSPSRR  GE
Subjt:  PIFSDEFLWSCYKQNPHVVRINSLKANAFS-PIKPVSPLVRQRQPRQPSPQRV----STLRSTPQKRVRPASPSPVRQKSFRKE-ERPLSPSPSRRFGGE

Query:  KCRV-------APTKPASLRRQLPARGCGMKKE-ITCIHRISSKIEEAAAREAVLNDGDLDSVVAMEDIDNPLISLDCFIFL
        KCRV       A  K    + + PARGC MKKE ITCIHRISSKI+EAAAREAVLN+GDLDS  AMEDIDNPLISLDCFIFL
Subjt:  KCRV-------APTKPASLRRQLPARGCGMKKE-ITCIHRISSKIEEAAAREAVLNDGDLDSVVAMEDIDNPLISLDCFIFL

A0A6J5TQX1 Uncharacterized protein1.4e-3846.86Show/hide
Query:  MGSCISKCKPKTFKQPPRFDFNNLVQDKLVVIPQPPPLTTTTTTPSLSLRNKISPYPPSPSPSSSSISSFTCLSSSTTTTTNTSF-STASSRSPI-----
        MGSCISKC+P+        D  N VQDKLV+   P  L      P +S  NKISP PPSPS S+SS SSFTC ++++T+ T++S  ST SS S +     
Subjt:  MGSCISKCKPKTFKQPPRFDFNNLVQDKLVVIPQPPPLTTTTTTPSLSLRNKISPYPPSPSPSSSSISSFTCLSSSTTTTTNTSF-STASSRSPI-----

Query:  ---FSDEFLWSCYKQNPHVVRINSLKANAFS----PIKPVSP-LVRQRQPRQPSPQRVSTLRSTPQKRVRPASPSPV-RQKSFRKE-ERP----------
           FS+EFLWSCYK+NPHVVRINSLK  +FS    P KP+ P  V+++QP   +    +    TPQKRVR +SP+P+ RQKSFRKE ERP          
Subjt:  ---FSDEFLWSCYKQNPHVVRINSLKANAFS----PIKPVSP-LVRQRQPRQPSPQRVSTLRSTPQKRVRPASPSPV-RQKSFRKE-ERP----------

Query:  ---LSPSPSRRF--GGEKCRVAPTKPASLRRQLPARG------------CGMKKEITCIHRISSKIEEAAAREAVLNDGDLDSVVAMEDIDNPLISLDCF
            SPSPSRRF         + +KP +L  +  A                 ++  T IHRISSKI+E A  EA+ +  ++DS+ A EDIDNPLISLDCF
Subjt:  ---LSPSPSRRF--GGEKCRVAPTKPASLRRQLPARG------------CGMKKEITCIHRISSKIEEAAAREAVLNDGDLDSVVAMEDIDNPLISLDCF

Query:  IFL
        IFL
Subjt:  IFL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21510.1 unknown protein2.6e-1631.74Show/hide
Query:  MGSCISKCKPKT--FKQPPRFDFNNLVQDKLVVIPQPPPLTTTTTTPS---LSLRNKISPYP-PS----------PSPSSSSISSFTCLSSSTT-----T
        MG CISKC PK+  FK+           +K   +P+   ++   +  S   L  +N+  P P P+          P PS   ++SF+ +  STT     +
Subjt:  MGSCISKCKPKT--FKQPPRFDFNNLVQDKLVVIPQPPPLTTTTTTPS---LSLRNKISPYP-PS----------PSPSSSSISSFTCLSSSTT-----T

Query:  TTNTSFSTAS----SRSPIFSDEFLWSCYKQNPHVVRINSLKANAFS--------PIKPVSPLVRQRQPRQPSPQRVSTLR-STPQKRVRPASP---SPV
        ++N+S STAS    S+   FS++FL +CY++N HV RINSL+  + S        P +  SP++  R    P+     + R S   KR R  SP   S  
Subjt:  TTNTSFSTAS----SRSPIFSDEFLWSCYKQNPHVVRINSLKANAFS--------PIKPVSPLVRQRQPRQPSPQRVSTLR-STPQKRVRPASP---SPV

Query:  RQKSFRKEERPL----------------SPSPSRRFGGEKCRVAPTKPASLRR------QLPARGCGMKKEITC---------------IHRISSKIEEA
        RQKSFR+++  +                SPSPSRR+ G   +     P+  RR       L    C  K  +                 IHRISSKI++ 
Subjt:  RQKSFRKEERPL----------------SPSPSRRFGGEKCRVAPTKPASLRR------QLPARGCGMKKEITC---------------IHRISSKIEEA

Query:  AAREAVLNDGDLDSVVAMEDIDNPLISLDCFIFL
          RE +  D +   V   E++ NPLI LDCFIFL
Subjt:  AAREAVLNDGDLDSVVAMEDIDNPLISLDCFIFL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGATCTTGCATTAGCAAATGCAAACCCAAGACATTCAAACAACCACCTCGTTTCGATTTCAACAACCTTGTTCAAGACAAGCTCGTTGTGATTCCTCAGCCGCCGCC
ATTAACAACCACAACAACAACTCCTTCTCTCTCTCTCAGAAACAAAATCTCTCCTTATCCTCCTTCCCCTTCGCCTTCCTCTTCTTCCATCTCTTCTTTCACTTGTCTCT
CTTCATCCACAACCACAACCACCAACACCTCCTTCTCCACTGCATCTTCTCGCTCGCCCATTTTCTCCGACGAGTTCTTGTGGTCTTGCTACAAGCAAAACCCTCACGTC
GTTCGAATCAATTCCCTTAAAGCTAACGCCTTTTCGCCCATCAAGCCGGTTTCCCCGCTCGTCCGCCAGCGCCAGCCACGGCAGCCGTCCCCTCAGAGGGTGTCGACGTT
GAGGTCGACACCCCAGAAGAGAGTTCGACCGGCGTCGCCGTCGCCCGTTCGACAGAAGAGCTTCAGGAAGGAGGAGCGGCCTCTGTCGCCATCGCCGAGTAGACGGTTTG
GCGGAGAGAAATGCCGGGTGGCTCCGACCAAGCCTGCCAGTCTGAGAAGGCAATTGCCGGCGAGGGGTTGTGGGATGAAGAAGGAAATTACTTGCATTCATAGGATCAGT
TCGAAGATTGAAGAAGCTGCTGCGAGAGAGGCGGTTTTAAATGATGGAGATTTGGATTCGGTGGTGGCTATGGAGGATATTGACAATCCTTTAATCTCGTTGGATTGCTT
TATCTTTCTGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGATCTTGCATTAGCAAATGCAAACCCAAGACATTCAAACAACCACCTCGTTTCGATTTCAACAACCTTGTTCAAGACAAGCTCGTTGTGATTCCTCAGCCGCCGCC
ATTAACAACCACAACAACAACTCCTTCTCTCTCTCTCAGAAACAAAATCTCTCCTTATCCTCCTTCCCCTTCGCCTTCCTCTTCTTCCATCTCTTCTTTCACTTGTCTCT
CTTCATCCACAACCACAACCACCAACACCTCCTTCTCCACTGCATCTTCTCGCTCGCCCATTTTCTCCGACGAGTTCTTGTGGTCTTGCTACAAGCAAAACCCTCACGTC
GTTCGAATCAATTCCCTTAAAGCTAACGCCTTTTCGCCCATCAAGCCGGTTTCCCCGCTCGTCCGCCAGCGCCAGCCACGGCAGCCGTCCCCTCAGAGGGTGTCGACGTT
GAGGTCGACACCCCAGAAGAGAGTTCGACCGGCGTCGCCGTCGCCCGTTCGACAGAAGAGCTTCAGGAAGGAGGAGCGGCCTCTGTCGCCATCGCCGAGTAGACGGTTTG
GCGGAGAGAAATGCCGGGTGGCTCCGACCAAGCCTGCCAGTCTGAGAAGGCAATTGCCGGCGAGGGGTTGTGGGATGAAGAAGGAAATTACTTGCATTCATAGGATCAGT
TCGAAGATTGAAGAAGCTGCTGCGAGAGAGGCGGTTTTAAATGATGGAGATTTGGATTCGGTGGTGGCTATGGAGGATATTGACAATCCTTTAATCTCGTTGGATTGCTT
TATCTTTCTGTAG
Protein sequenceShow/hide protein sequence
MGSCISKCKPKTFKQPPRFDFNNLVQDKLVVIPQPPPLTTTTTTPSLSLRNKISPYPPSPSPSSSSISSFTCLSSSTTTTTNTSFSTASSRSPIFSDEFLWSCYKQNPHV
VRINSLKANAFSPIKPVSPLVRQRQPRQPSPQRVSTLRSTPQKRVRPASPSPVRQKSFRKEERPLSPSPSRRFGGEKCRVAPTKPASLRRQLPARGCGMKKEITCIHRIS
SKIEEAAAREAVLNDGDLDSVVAMEDIDNPLISLDCFIFL