; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0010125 (gene) of Snake gourd v1 genome

Gene IDTan0010125
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionsegmentation polarity homeobox protein engrailed
Genome locationLG11:10263698..10264495
RNA-Seq ExpressionTan0010125
SyntenyTan0010125
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008441600.1 PREDICTED: putative protein TPRXL [Cucumis melo]6.7e-7869.96Show/hide
Query:  MGSCISKCKPKTFKQQPRFDFNNLIVQDKLVVIPQP-PPPLTTTSTSTTT-TPSLSLNNKISPYPPSPSPSTSSISSFTCLSSTTTTTTTNSSFSTATSR
        MGSCISKCKPK  +Q P FDFNNL VQDKLVVIPQP  P LT+T+T TT+ TPSLSL+NKISPYPPSPSPS+SSISSFTCLSS  T+++TN+SFSTA+S 
Subjt:  MGSCISKCKPKTFKQQPRFDFNNLIVQDKLVVIPQP-PPPLTTTSTSTTT-TPSLSLNNKISPYPPSPSPSTSSISSFTCLSSTTTTTTTNSSFSTATSR

Query:  -SPIYSDQYLWSCYKQNPHVVRINSLKANPFSPPVKPISPILRQQPRHPSPQRVPRSTPQKRVRLASPSPVRQKSFRKDD-QRP---PSPSRRLSGEKCR
         SPI S  Y  S Y QNPH+ RINSLKA+ F  PVKPISP++    RHPSPQRV RSTPQKR+R ASPSP+RQKSFRK+  QRP   PSP+RR S EKC+
Subjt:  -SPIYSDQYLWSCYKQNPHVVRINSLKANPFSPPVKPISPILRQQPRHPSPQRVPRSTPQKRVRLASPSPVRQKSFRKDD-QRP---PSPSRRLSGEKCR

Query:  AA-STRKAAGLKSQLPVRACGMKKEITCIHRISSKIDEVAAREAVLNDGDYDSVVAMEDIDNPLISMDCFIFL
         A +       KS+ PVR   MKKEITCIHRISSKID+VA +EAV   GD DSVVAMED+DNPLIS+DCFIFL
Subjt:  AA-STRKAAGLKSQLPVRACGMKKEITCIHRISSKIDEVAAREAVLNDGDYDSVVAMEDIDNPLISMDCFIFL

XP_011657327.1 putative protein TPRXL [Cucumis sativus]2.5e-8071.22Show/hide
Query:  MGSCISKCKPKTFKQQPRFDFNNLIVQDKLVVIPQPPPPLTTTSTSTTTTPSLSLNNKISPYPPSPSPSTSSISSFTCLSSTTTTTTTNSSFSTATSR-S
        MGSCISKCKPK  KQ P FDFNNL+VQDKLVVIPQP  PL TT T T+ TPSLSL+NKISPYPPSPSPS+SSISSFTCLSS  T ++TN+SFSTA+S  S
Subjt:  MGSCISKCKPKTFKQQPRFDFNNLIVQDKLVVIPQPPPPLTTTSTSTTTTPSLSLNNKISPYPPSPSPSTSSISSFTCLSSTTTTTTTNSSFSTATSR-S

Query:  PIYSDQYLWSCYKQNPHVVRINSLKANPFSPPVKPISPILRQQPRHPSPQRVPRSTPQKRVRLASPSPVRQKSFRKDD-QRP---PSPSRRLSGEKCRAA
        PI S  Y  S Y QNPH+  INSLKA+ F PPVKPISP+L    RHPSPQRV RS PQKR R ASPSP+RQKSFRK+  QRP   PSP+RR S EKC+ A
Subjt:  PIYSDQYLWSCYKQNPHVVRINSLKANPFSPPVKPISPILRQQPRHPSPQRVPRSTPQKRVRLASPSPVRQKSFRKDD-QRP---PSPSRRLSGEKCRAA

Query:  -STRKAAGLKSQLPVRACGMKKEITCIHRISSKIDEVAAREAVLNDGDYDSVVAMEDIDNPLISMDCFIFL
         +       KS+ PVR   MKKEITCIHRISSKIDEVA +EAV   GD DSVVAMEDIDNPLIS+DCFIFL
Subjt:  -STRKAAGLKSQLPVRACGMKKEITCIHRISSKIDEVAAREAVLNDGDYDSVVAMEDIDNPLISMDCFIFL

XP_022939152.1 uncharacterized protein LOC111445147 [Cucurbita moschata]9.3e-8870.92Show/hide
Query:  MGSCISKCKPKTFK--QQPRFDFNNLIVQDKLVVIPQPPPPLTTTSTSTTTTPSLSLNNKISPYPPSPSPSTSSISSFTCLSSTTTTTTTNSSFSTATSR
        MGSCISKCKPK  K    P FDFNN IVQDKLVVIPQPPP     + ++ + PSLSL+NKISPYPPSPSPS+SS    TCLSS+TTTTTTNSSFSTA+SR
Subjt:  MGSCISKCKPKTFK--QQPRFDFNNLIVQDKLVVIPQPPPPLTTTSTSTTTTPSLSLNNKISPYPPSPSPSTSSISSFTCLSSTTTTTTTNSSFSTATSR

Query:  SPIYSDQYLWSCYKQNPHVVRINSLKANPFSPPVKPISPILRQQPRHPSPQRVPR------STPQKRVRLASPSPVRQKSFRKDDQRP--PSPSRRLSGE
        SPI    Y WS Y QNPHVVRINSLKA+ FSPP   +SP++RQ+ RHPSPQRV R      STPQKRVR ASPSPVRQKSFRK+ QRP  PSPSRRLSGE
Subjt:  SPIYSDQYLWSCYKQNPHVVRINSLKANPFSPPVKPISPILRQQPRHPSPQRVPR------STPQKRVRLASPSPVRQKSFRKDDQRP--PSPSRRLSGE

Query:  KCRAA------STRKAAGLKSQLPVRACGMKKE-ITCIHRISSKIDEVAAREAVLNDGDYDSVVAMEDIDNPLISMDCFIFL
        KCR A      ++ K  G KS+ P R C MKKE ITCIHRISSKIDE AAREAVLN+GD DS  AMEDIDNPLIS+DCFIFL
Subjt:  KCRAA------STRKAAGLKSQLPVRACGMKKE-ITCIHRISSKIDEVAAREAVLNDGDYDSVVAMEDIDNPLISMDCFIFL

XP_023550659.1 proline-rich receptor-like protein kinase PERK2 [Cucurbita pepo subsp. pepo]2.0e-9072.76Show/hide
Query:  MGSCISKCKPKTFK-----QQPRFDFNNLIVQDKLVVIPQPPPPLTTTSTSTTTTPSLSLNNKISPYPPSPSPSTSSISSFTCLSSTTTTTTTNSSFSTA
        MGSCISKCKPK  K       P FDFNN IVQDKLVVIPQPPP     + ++   PSLSL+NKISPYPPSPSPS+SS    TCLSSTTTTTTTNSSFSTA
Subjt:  MGSCISKCKPKTFK-----QQPRFDFNNLIVQDKLVVIPQPPPPLTTTSTSTTTTPSLSLNNKISPYPPSPSPSTSSISSFTCLSSTTTTTTTNSSFSTA

Query:  TSRSPIYSDQYLWSCYKQNPHVVRINSLKANPFSPPVKPISPILRQQPRHPSPQRVPRSTPQKRVRLASPSPVRQKSFRKDDQRP--PSPSRRLSGEKCR
        +SRSPI S  Y WS Y QNPHVVRINSLKA+ FSPP  P+SP++RQ+ RHPSPQRV RSTPQKRVR ASPSPVRQKSFRK+ QRP  PSPSRRLSGEKCR
Subjt:  TSRSPIYSDQYLWSCYKQNPHVVRINSLKANPFSPPVKPISPILRQQPRHPSPQRVPRSTPQKRVRLASPSPVRQKSFRKDDQRP--PSPSRRLSGEKCR

Query:  AA------STRKAAGLKSQLPVRACGMKKE-ITCIHRISSKIDEVAAREAVLNDGDYDSVVAMEDIDNPLISMDCFIFL
         A      ++ K  G KS+ P R C MKKE ITCIHRISSKIDE AAREAVLN+GD DS  AMEDIDNPLIS+DCFIFL
Subjt:  AA------STRKAAGLKSQLPVRACGMKKE-ITCIHRISSKIDEVAAREAVLNDGDYDSVVAMEDIDNPLISMDCFIFL

XP_038886331.1 proline-rich receptor-like protein kinase PERK2 [Benincasa hispida]4.1e-7568.52Show/hide
Query:  MGSCISKCKPKTFKQQPRFDFNNLIVQDKLVVIPQPPPPLTTTSTSTTTTPSLSLNNKISPYPPSPSPSTSSISSFTCLSSTTTTTTTNSSFSTATSR-S
        MGSCISKCKPK  KQ P FDFNN +VQDKLVVIPQP  PL TT T+TTT PSLSLNNKISPYPPSPS   SSISSFTCLSS     +TN+SFSTA+S  S
Subjt:  MGSCISKCKPKTFKQQPRFDFNNLIVQDKLVVIPQPPPPLTTTSTSTTTTPSLSLNNKISPYPPSPSPSTSSISSFTCLSSTTTTTTTNSSFSTATSR-S

Query:  PIYSDQYLWSCYKQNPHVVRINSLKANPFSPPVKPISPILRQQPRHPSPQRVPRSTPQKRVRLASPSPVRQKSFRKD----DQRPPSPSRRLSGEKCRAA
        PI S     S Y Q   ++RINSLKA  F PP+KP+SP++    RHPSPQRV RSTPQKRVR ASPSP+RQKSFRK+        PSPSRR S EKCR  
Subjt:  PIYSDQYLWSCYKQNPHVVRINSLKANPFSPPVKPISPILRQQPRHPSPQRVPRSTPQKRVRLASPSPVRQKSFRKD----DQRPPSPSRRLSGEKCRAA

Query:  STRKAAGLKSQLPVRACGMKKEITCIHRISSKIDEVAAREAVLNDGDYDSVVAMEDIDNPLISMDCFIFL
            A   KS+ P R+  MKKEITCIHRISSKIDEVA +EAV   GD DSVVAMEDIDNPLIS+DCFIFL
Subjt:  STRKAAGLKSQLPVRACGMKKEITCIHRISSKIDEVAAREAVLNDGDYDSVVAMEDIDNPLISMDCFIFL

TrEMBL top hitse value%identityAlignment
A0A0A0KIF4 Uncharacterized protein1.2e-8071.22Show/hide
Query:  MGSCISKCKPKTFKQQPRFDFNNLIVQDKLVVIPQPPPPLTTTSTSTTTTPSLSLNNKISPYPPSPSPSTSSISSFTCLSSTTTTTTTNSSFSTATSR-S
        MGSCISKCKPK  KQ P FDFNNL+VQDKLVVIPQP  PL TT T T+ TPSLSL+NKISPYPPSPSPS+SSISSFTCLSS  T ++TN+SFSTA+S  S
Subjt:  MGSCISKCKPKTFKQQPRFDFNNLIVQDKLVVIPQPPPPLTTTSTSTTTTPSLSLNNKISPYPPSPSPSTSSISSFTCLSSTTTTTTTNSSFSTATSR-S

Query:  PIYSDQYLWSCYKQNPHVVRINSLKANPFSPPVKPISPILRQQPRHPSPQRVPRSTPQKRVRLASPSPVRQKSFRKDD-QRP---PSPSRRLSGEKCRAA
        PI S  Y  S Y QNPH+  INSLKA+ F PPVKPISP+L    RHPSPQRV RS PQKR R ASPSP+RQKSFRK+  QRP   PSP+RR S EKC+ A
Subjt:  PIYSDQYLWSCYKQNPHVVRINSLKANPFSPPVKPISPILRQQPRHPSPQRVPRSTPQKRVRLASPSPVRQKSFRKDD-QRP---PSPSRRLSGEKCRAA

Query:  -STRKAAGLKSQLPVRACGMKKEITCIHRISSKIDEVAAREAVLNDGDYDSVVAMEDIDNPLISMDCFIFL
         +       KS+ PVR   MKKEITCIHRISSKIDEVA +EAV   GD DSVVAMEDIDNPLIS+DCFIFL
Subjt:  -STRKAAGLKSQLPVRACGMKKEITCIHRISSKIDEVAAREAVLNDGDYDSVVAMEDIDNPLISMDCFIFL

A0A1S3B4I5 Uncharacterized protein3.2e-7869.96Show/hide
Query:  MGSCISKCKPKTFKQQPRFDFNNLIVQDKLVVIPQP-PPPLTTTSTSTTT-TPSLSLNNKISPYPPSPSPSTSSISSFTCLSSTTTTTTTNSSFSTATSR
        MGSCISKCKPK  +Q P FDFNNL VQDKLVVIPQP  P LT+T+T TT+ TPSLSL+NKISPYPPSPSPS+SSISSFTCLSS  T+++TN+SFSTA+S 
Subjt:  MGSCISKCKPKTFKQQPRFDFNNLIVQDKLVVIPQP-PPPLTTTSTSTTT-TPSLSLNNKISPYPPSPSPSTSSISSFTCLSSTTTTTTTNSSFSTATSR

Query:  -SPIYSDQYLWSCYKQNPHVVRINSLKANPFSPPVKPISPILRQQPRHPSPQRVPRSTPQKRVRLASPSPVRQKSFRKDD-QRP---PSPSRRLSGEKCR
         SPI S  Y  S Y QNPH+ RINSLKA+ F  PVKPISP++    RHPSPQRV RSTPQKR+R ASPSP+RQKSFRK+  QRP   PSP+RR S EKC+
Subjt:  -SPIYSDQYLWSCYKQNPHVVRINSLKANPFSPPVKPISPILRQQPRHPSPQRVPRSTPQKRVRLASPSPVRQKSFRKDD-QRP---PSPSRRLSGEKCR

Query:  AA-STRKAAGLKSQLPVRACGMKKEITCIHRISSKIDEVAAREAVLNDGDYDSVVAMEDIDNPLISMDCFIFL
         A +       KS+ PVR   MKKEITCIHRISSKID+VA +EAV   GD DSVVAMED+DNPLIS+DCFIFL
Subjt:  AA-STRKAAGLKSQLPVRACGMKKEITCIHRISSKIDEVAAREAVLNDGDYDSVVAMEDIDNPLISMDCFIFL

A0A5D3D583 TPRXL protein3.2e-7869.96Show/hide
Query:  MGSCISKCKPKTFKQQPRFDFNNLIVQDKLVVIPQP-PPPLTTTSTSTTT-TPSLSLNNKISPYPPSPSPSTSSISSFTCLSSTTTTTTTNSSFSTATSR
        MGSCISKCKPK  +Q P FDFNNL VQDKLVVIPQP  P LT+T+T TT+ TPSLSL+NKISPYPPSPSPS+SSISSFTCLSS  T+++TN+SFSTA+S 
Subjt:  MGSCISKCKPKTFKQQPRFDFNNLIVQDKLVVIPQP-PPPLTTTSTSTTT-TPSLSLNNKISPYPPSPSPSTSSISSFTCLSSTTTTTTTNSSFSTATSR

Query:  -SPIYSDQYLWSCYKQNPHVVRINSLKANPFSPPVKPISPILRQQPRHPSPQRVPRSTPQKRVRLASPSPVRQKSFRKDD-QRP---PSPSRRLSGEKCR
         SPI S  Y  S Y QNPH+ RINSLKA+ F  PVKPISP++    RHPSPQRV RSTPQKR+R ASPSP+RQKSFRK+  QRP   PSP+RR S EKC+
Subjt:  -SPIYSDQYLWSCYKQNPHVVRINSLKANPFSPPVKPISPILRQQPRHPSPQRVPRSTPQKRVRLASPSPVRQKSFRKDD-QRP---PSPSRRLSGEKCR

Query:  AA-STRKAAGLKSQLPVRACGMKKEITCIHRISSKIDEVAAREAVLNDGDYDSVVAMEDIDNPLISMDCFIFL
         A +       KS+ PVR   MKKEITCIHRISSKID+VA +EAV   GD DSVVAMED+DNPLIS+DCFIFL
Subjt:  AA-STRKAAGLKSQLPVRACGMKKEITCIHRISSKIDEVAAREAVLNDGDYDSVVAMEDIDNPLISMDCFIFL

A0A6J1FG04 uncharacterized protein LOC1114451474.5e-8870.92Show/hide
Query:  MGSCISKCKPKTFK--QQPRFDFNNLIVQDKLVVIPQPPPPLTTTSTSTTTTPSLSLNNKISPYPPSPSPSTSSISSFTCLSSTTTTTTTNSSFSTATSR
        MGSCISKCKPK  K    P FDFNN IVQDKLVVIPQPPP     + ++ + PSLSL+NKISPYPPSPSPS+SS    TCLSS+TTTTTTNSSFSTA+SR
Subjt:  MGSCISKCKPKTFK--QQPRFDFNNLIVQDKLVVIPQPPPPLTTTSTSTTTTPSLSLNNKISPYPPSPSPSTSSISSFTCLSSTTTTTTTNSSFSTATSR

Query:  SPIYSDQYLWSCYKQNPHVVRINSLKANPFSPPVKPISPILRQQPRHPSPQRVPR------STPQKRVRLASPSPVRQKSFRKDDQRP--PSPSRRLSGE
        SPI    Y WS Y QNPHVVRINSLKA+ FSPP   +SP++RQ+ RHPSPQRV R      STPQKRVR ASPSPVRQKSFRK+ QRP  PSPSRRLSGE
Subjt:  SPIYSDQYLWSCYKQNPHVVRINSLKANPFSPPVKPISPILRQQPRHPSPQRVPR------STPQKRVRLASPSPVRQKSFRKDDQRP--PSPSRRLSGE

Query:  KCRAA------STRKAAGLKSQLPVRACGMKKE-ITCIHRISSKIDEVAAREAVLNDGDYDSVVAMEDIDNPLISMDCFIFL
        KCR A      ++ K  G KS+ P R C MKKE ITCIHRISSKIDE AAREAVLN+GD DS  AMEDIDNPLIS+DCFIFL
Subjt:  KCRAA------STRKAAGLKSQLPVRACGMKKE-ITCIHRISSKIDEVAAREAVLNDGDYDSVVAMEDIDNPLISMDCFIFL

A0A6P5SQY2 segmentation polarity homeobox protein engrailed9.5e-3845.25Show/hide
Query:  MGSCISKCKPKTFKQQPRFDFNNLIVQDKLVVIPQPPPPLTTTSTSTTTTPSLSLNNKISPYPPSPSPSTSSISSFTCLSSTTTTTTTNSSFSTATSRSP
        MGSCISKC+P+        + N+  VQDKLV+   P         S    P +S +NKISP PPSPS STSS SSFTC ++T+T+ T++S  ST +S S 
Subjt:  MGSCISKCKPKTFKQQPRFDFNNLIVQDKLVVIPQPPPPLTTTSTSTTTTPSLSLNNKISPYPPSPSPSTSSISSFTCLSSTTTTTTTNSSFSTATSRSP

Query:  I--------YSDQYLWSCYKQNPHVVRINSLKANPFSPPVKPISPIL--RQQPRHPSPQRVPRS-TPQKRVRLASPSPV-RQKSFRKDDQRP--------
        +        +S+++LWSCYK+NPHVVRINSLK   FS    P  P+L    + + P+ +    S TPQKR+R +SP+P+ RQKSFRK+ +RP        
Subjt:  I--------YSDQYLWSCYKQNPHVVRINSLKANPFSPPVKPISPIL--RQQPRHPSPQRVPRS-TPQKRVRLASPSPV-RQKSFRKDDQRP--------

Query:  -------PSPSRRL---------SGEKCRAASTRKAAG---LKSQLPVRACGMKKEI-TCIHRISSKIDEVAAREAVLNDGDYDSVVAMEDIDNPLISMD
               PSPSRR          S  K  A + R AA      S   +R C   +E  T IHRISSKIDEVA  EA+    DY   +  EDIDNPLIS+D
Subjt:  -------PSPSRRL---------SGEKCRAASTRKAAG---LKSQLPVRACGMKKEI-TCIHRISSKIDEVAAREAVLNDGDYDSVVAMEDIDNPLISMD

Query:  CFIFL
        CFIFL
Subjt:  CFIFL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21510.1 unknown protein5.9e-1633.03Show/hide
Query:  MGSCISKCKPKT--FKQQPRFDFNNLIVQDKLVV--IPQPPPPLTTTSTS-----TTTTPSLSLNNKISPYPPSP------SPSTSSISSFTCLSSTTTT
        MG CISKC PK+  FK+          V +K+ +   P    PL     +           +++  K+ P PPSP      SP   S +S + LSS+ ++
Subjt:  MGSCISKCKPKT--FKQQPRFDFNNLIVQDKLVV--IPQPPPPLTTTSTS-----TTTTPSLSLNNKISPYPPSP------SPSTSSISSFTCLSSTTTT

Query:  TTTNSSFSTATSRSPIYSDQYLWSCYKQNPHVVRINSLKANPFS-PPVKPISPILRQQPRHPS-----PQRVPR-----STPQKRVRLASP---SPVRQK
         +T SS S +  RS  +S+ +L +CY++N HV RINSL+    S    KP  P     P  PS     P R        S   KR R  SP   S  RQK
Subjt:  TTTNSSFSTATSRSPIYSDQYLWSCYKQNPHVVRINSLKANPFS-PPVKPISPILRQQPRHPS-----PQRVPR-----STPQKRVRLASP---SPVRQK

Query:  SFRKDDQR-----------------PPSPSRRLSGEKCRAASTRKAAGL-KSQLPVRACGMKKEITC---------------IHRISSKIDEVAAREAVL
        SFR+D +R                  PSPSRR  G   ++ S  +  G+  + L V +C  K  +                 IHRISSKID+   RE + 
Subjt:  SFRKDDQR-----------------PPSPSRRLSGEKCRAASTRKAAGL-KSQLPVRACGMKKEITC---------------IHRISSKIDEVAAREAVL

Query:  NDGDYDSVVAMEDIDNPLISMDCFIFL
         D +   V   E++ NPLI +DCFIFL
Subjt:  NDGDYDSVVAMEDIDNPLISMDCFIFL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGATCTTGCATTAGCAAATGCAAACCCAAGACCTTCAAACAACAACCTCGTTTTGATTTCAACAACCTTATTGTCCAAGACAAGCTCGTTGTAATCCCTCAACCGCC
GCCGCCATTAACAACAACATCAACATCCACAACAACAACTCCTTCTCTCTCTCTCAATAACAAAATTTCTCCTTATCCTCCTTCTCCTTCTCCTTCAACTTCTTCCATTT
CTTCTTTCACTTGTCTTTCTTCAACTACAACAACCACAACAACCAACAGCTCTTTCTCAACTGCAACTTCCCGTTCGCCAATTTACTCAGACCAGTACTTGTGGTCGTGT
TACAAGCAAAACCCTCATGTCGTTCGGATCAATTCCCTTAAAGCTAATCCCTTTTCGCCGCCGGTGAAGCCGATTTCCCCGATTCTCCGGCAGCAACCTCGGCACCCGTC
TCCGCAAAGGGTGCCGAGATCTACACCTCAGAAGAGAGTCCGACTGGCATCGCCATCGCCCGTTCGGCAGAAGAGCTTCAGGAAGGACGATCAGCGGCCTCCATCGCCGA
GTAGACGGTTGAGCGGAGAGAAATGCCGGGCGGCTTCGACGAGGAAGGCTGCTGGTCTTAAAAGCCAATTGCCGGTGAGGGCTTGCGGGATGAAGAAGGAAATTACTTGC
ATTCATAGGATAAGTTCGAAGATTGATGAAGTTGCTGCGAGAGAAGCGGTTTTAAATGATGGAGATTATGATTCGGTGGTGGCTATGGAGGATATTGATAATCCTTTAAT
CTCGATGGATTGCTTTATCTTTCTGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGATCTTGCATTAGCAAATGCAAACCCAAGACCTTCAAACAACAACCTCGTTTTGATTTCAACAACCTTATTGTCCAAGACAAGCTCGTTGTAATCCCTCAACCGCC
GCCGCCATTAACAACAACATCAACATCCACAACAACAACTCCTTCTCTCTCTCTCAATAACAAAATTTCTCCTTATCCTCCTTCTCCTTCTCCTTCAACTTCTTCCATTT
CTTCTTTCACTTGTCTTTCTTCAACTACAACAACCACAACAACCAACAGCTCTTTCTCAACTGCAACTTCCCGTTCGCCAATTTACTCAGACCAGTACTTGTGGTCGTGT
TACAAGCAAAACCCTCATGTCGTTCGGATCAATTCCCTTAAAGCTAATCCCTTTTCGCCGCCGGTGAAGCCGATTTCCCCGATTCTCCGGCAGCAACCTCGGCACCCGTC
TCCGCAAAGGGTGCCGAGATCTACACCTCAGAAGAGAGTCCGACTGGCATCGCCATCGCCCGTTCGGCAGAAGAGCTTCAGGAAGGACGATCAGCGGCCTCCATCGCCGA
GTAGACGGTTGAGCGGAGAGAAATGCCGGGCGGCTTCGACGAGGAAGGCTGCTGGTCTTAAAAGCCAATTGCCGGTGAGGGCTTGCGGGATGAAGAAGGAAATTACTTGC
ATTCATAGGATAAGTTCGAAGATTGATGAAGTTGCTGCGAGAGAAGCGGTTTTAAATGATGGAGATTATGATTCGGTGGTGGCTATGGAGGATATTGATAATCCTTTAAT
CTCGATGGATTGCTTTATCTTTCTGTAG
Protein sequenceShow/hide protein sequence
MGSCISKCKPKTFKQQPRFDFNNLIVQDKLVVIPQPPPPLTTTSTSTTTTPSLSLNNKISPYPPSPSPSTSSISSFTCLSSTTTTTTTNSSFSTATSRSPIYSDQYLWSC
YKQNPHVVRINSLKANPFSPPVKPISPILRQQPRHPSPQRVPRSTPQKRVRLASPSPVRQKSFRKDDQRPPSPSRRLSGEKCRAASTRKAAGLKSQLPVRACGMKKEITC
IHRISSKIDEVAAREAVLNDGDYDSVVAMEDIDNPLISMDCFIFL