; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0020625 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0020625
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionDNA-directed RNA polymerase II subunit 1-like
Genome locationchr7:892738..895126
RNA-Seq ExpressionLag0020625
SyntenyLag0020625
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6597060.1 hypothetical protein SDJN03_10240, partial [Cucurbita argyrosperma subsp. sororia]1.7e-7759.81Show/hide
Query:  MANLPRFG-RRQRVPVAAPPVPAAAQPAAEPKPEILTFALGTQTLQPLE-PNPAPATAKAPASPLITSPRRLPSPAKNATSPSASPKYGASVTRVAIP--
        M+N PRFG  RQR  + APPV   + PAA               L P+E P P   + K+P       PR L SP +  TSP ASP+YG+SVTRV  P  
Subjt:  MANLPRFG-RRQRVPVAAPPVPAAAQPAAEPKPEILTFALGTQTLQPLE-PNPAPATAKAPASPLITSPRRLPSPAKNATSPSASPKYGASVTRVAIP--

Query:  ------PPVSPANKYSDRIIGQKITAQSPAKSGRATSPPLSPLALPRTQVTSGNGTAAQPRIQPEVEQKSIVYNKTVEKPAKADRPSEYGSGKPPQKQQ-
              PPVSPA KY DR I Q    QSP++S R TSPP  P ALP TQ T+ NGT  QP+IQPEVE+KS VYNKTVEKPAK+D  SEYGSGKP +KQ+ 
Subjt:  ------PPVSPANKYSDRIIGQKITAQSPAKSGRATSPPLSPLALPRTQVTSGNGTAAQPRIQPEVEQKSIVYNKTVEKPAKADRPSEYGSGKPPQKQQ-

Query:  AEVINLAGHNVGAVMEINQSSAGHRLGGETIKKNETEGGGDGGVSHGNEEKKTGPKKETPMTAFMNSNFQSVNNSVLYDSACNHRDPGLHLAFSDAADGD
         E INLAGHNVGAVMEI++S+ GHRLGGET++         GGV  GNEEKK   KKE PMTAFMNSNFQSVNNSVLY S+CNHRDPGLHL F+DAADGD
Subjt:  AEVINLAGHNVGAVMEINQSSAGHRLGGETIKKNETEGGGDGGVSHGNEEKKTGPKKETPMTAFMNSNFQSVNNSVLYDSACNHRDPGLHLAFSDAADGD

Query:  GATADGRKDYQ
        GAT DGRK+Y+
Subjt:  GATADGRKDYQ

XP_008437851.1 PREDICTED: zyxin-like [Cucumis melo]1.4e-8463.26Show/hide
Query:  MANLPRFGR-RQRVPVAAPPVPAAAQPAAEPKPEILTFALGTQTLQPLEPNPAPATAKAPASPLITSPRRLPSPAKNATSPSASPKYGASVTRV------
        MANLPRFGR RQR+P   PPVPAAAQPA EP+ +IL FA                T  APASP   SPR L SP K ATSP ASPKYG S TR+      
Subjt:  MANLPRFGR-RQRVPVAAPPVPAAAQPAAEPKPEILTFALGTQTLQPLEPNPAPATAKAPASPLITSPRRLPSPAKNATSPSASPKYGASVTRV------

Query:  --AIPPPVSPANKYSDRIIGQKITAQSPAKSGRATSPPLSPLALPRTQVTSGNGTAAQPRIQPEVEQKSIVYNKTVEKPAKADRPS-EYGSGKPPQ-KQQ
           + PP S  +KY +R  G+     SPAKS RA +PPLSPLALPR QV +GNGT AQPR+QPEVE K IVYNKTVEKP+K++R S EYGS K  Q KQ+
Subjt:  --AIPPPVSPANKYSDRIIGQKITAQSPAKSGRATSPPLSPLALPRTQVTSGNGTAAQPRIQPEVEQKSIVYNKTVEKPAKADRPS-EYGSGKPPQ-KQQ

Query:  AEVINLAGHNVGAVMEINQSSAGHRLGGETIKKNETEGGGDGGVSHGNEEKKT-GPKKETPMTAFMNSNFQSVNNSVLYDSACNHRDPGLHLAFSDAADG
         EVI L GHNVGAVMEIN+SS G+RLGGET+KKNETE  GD    +G+E+KKT   KKE P+TAFMNSNFQSVNNS+L+DS+CNHRDPGLHLAF DA DG
Subjt:  AEVINLAGHNVGAVMEINQSSAGHRLGGETIKKNETEGGGDGGVSHGNEEKKT-GPKKETPMTAFMNSNFQSVNNSVLYDSACNHRDPGLHLAFSDAADG

Query:  DGATADGRKDYQP
        DGA  DG+K Y+P
Subjt:  DGATADGRKDYQP

XP_022951357.1 sulfated surface glycoprotein 185-like [Cucurbita moschata]3.4e-7860.51Show/hide
Query:  MANLPRFG-RRQRVPVAAPPVPAAAQPAAEPKPEILTFALGTQTLQPLEPNPAPATAKAPASPLITSPRRLPSPAKNATSPSASPKYGASVTRVAIP---
        M+N PRFG  RQR  +AAPPV     PAAE KPE   F   +Q+LQ                           PA+  TSP ASPKYG SVTRV  P   
Subjt:  MANLPRFG-RRQRVPVAAPPVPAAAQPAAEPKPEILTFALGTQTLQPLEPNPAPATAKAPASPLITSPRRLPSPAKNATSPSASPKYGASVTRVAIP---

Query:  -----PPVSPANKYSDRIIGQKITAQSPAKSGRAT-SPPLSPLALPRTQVTSGNGTAAQPRIQPEVEQKSIVYNKTVEKPAKADRPSEYGSGKPPQKQQA
             PPVSPA KY DR + Q    QSP++S R +  PP  PLALP TQ T+ N T  QPRIQPEVE+KSIVYNKTVEK  K+DRPSEYGSGKP +KQ+A
Subjt:  -----PPVSPANKYSDRIIGQKITAQSPAKSGRAT-SPPLSPLALPRTQVTSGNGTAAQPRIQPEVEQKSIVYNKTVEKPAKADRPSEYGSGKPPQKQQA

Query:  -EVINLAGHNVGAVMEINQSSAGHRLGGETIKKNETEGGG-DGGVSHGNEEKKTGPKKE--TPMTAFMNSNFQSVNNSVLYDSACNHRDPGLHLAFSDAA
         E INLAGHNVGAVMEI++SS GHRLGGET++KNETEGGG DG      EEKK   KK+   PMTAFMNSNFQSVNNSVLYDS+C+HRDPGLHL F+DAA
Subjt:  -EVINLAGHNVGAVMEINQSSAGHRLGGETIKKNETEGGG-DGGVSHGNEEKKTGPKKE--TPMTAFMNSNFQSVNNSVLYDSACNHRDPGLHLAFSDAA

Query:  DGDGATADGRKDYQ
        DGDGA  DGRK Y+
Subjt:  DGDGATADGRKDYQ

XP_022974816.1 wiskott-Aldrich syndrome protein family member 2-like [Cucurbita maxima]1.2e-7860.32Show/hide
Query:  MANLPRFGR-RQRVPVAAPPVPAAAQPAAEPKPEILTFALGTQTLQPLEPNPAPATAKAPASPLITSPRRLPSPAKNATSPSASPKYGASVTRVAIP---
        M+N PRFGR RQR  VAAPPV     PAAE KPE L F   +QTLQ                           PA+  TSP ASPKYG SVT V  P   
Subjt:  MANLPRFGR-RQRVPVAAPPVPAAAQPAAEPKPEILTFALGTQTLQPLEPNPAPATAKAPASPLITSPRRLPSPAKNATSPSASPKYGASVTRVAIP---

Query:  -----PPVSPANKYSDRIIGQKITAQSPAKSGRATSPPLSPLALPRTQVTSGNGTAAQPRIQPEVEQKSIVYNKTVEKPAKADRPSEYGSGKPPQKQQ-A
             PPVSPA KY DR + Q    QSP++S R + PP  PLALP T  T+ +G   QPRIQ EVE+KSIVYNKTVEKP K+DRP EYGSGK  +KQ+ A
Subjt:  -----PPVSPANKYSDRIIGQKITAQSPAKSGRATSPPLSPLALPRTQVTSGNGTAAQPRIQPEVEQKSIVYNKTVEKPAKADRPSEYGSGKPPQKQQ-A

Query:  EVINLAGHNVGAVMEINQSSAGHRLGGETIKKNETEGGGDGGVSHGNEEKKTGPKKE-----TPMTAFMNSNFQSVNNSVLYDSACNHRDPGLHLAFSDA
        E INLAGHNVGAVMEI++ SA HRLGGET++KN+TE GGDGG   GNEEKK   KK+      PMTAFMNSNFQSVNNSVLYDS+CNHRDPGLHL F+DA
Subjt:  EVINLAGHNVGAVMEINQSSAGHRLGGETIKKNETEGGGDGGVSHGNEEKKTGPKKE-----TPMTAFMNSNFQSVNNSVLYDSACNHRDPGLHLAFSDA

Query:  ADGDGATADGRKDYQ
        ADGDGA  DGRK+Y+
Subjt:  ADGDGATADGRKDYQ

XP_038879417.1 proline-rich receptor-like protein kinase PERK8 [Benincasa hispida]5.8e-8662.18Show/hide
Query:  MANLPRFGR-RQRVPVAAPPVPAAAQPAAEPKPEILTFALGTQTLQPLEPNPAPATAKAPASPLITSPRRLPSPAKNATSPSASPKYGASVTRVAIP---
        MANLPR GR RQR    APP+ AA QP AEPKPEI  FA  +   QP+EP     T  APASPL  SPR + SP K ATSP ASPKYG S+TRV  P   
Subjt:  MANLPRFGR-RQRVPVAAPPVPAAAQPAAEPKPEILTFALGTQTLQPLEPNPAPATAKAPASPLITSPRRLPSPAKNATSPSASPKYGASVTRVAIP---

Query:  -----PPVSPANKYSDRIIGQKITAQSPAKSGRATSPPLSPLALPRTQVTSGNGTAAQPRIQPEVEQKSIVYNKTVEKPAKADRPSEYGSGKPPQKQQ--
             PP SP NKY +   G+     SPAKS R  +PPLSPL LPRT V S + T A PR QP VE K IVYNK VEKP K DRPSEYGSGKP QKQQ  
Subjt:  -----PPVSPANKYSDRIIGQKITAQSPAKSGRATSPPLSPLALPRTQVTSGNGTAAQPRIQPEVEQKSIVYNKTVEKPAKADRPSEYGSGKPPQKQQ--

Query:  AEVINLAGHNVGAVMEINQSSAGHRLGGETIKKNETEGGGDGGVSHGNEEKKTGPKKETPMTAFMNSNFQSVNNSVLYDSACNHRDPGLHLAFSDAADGD
        AEVINL GHNVGAVMEIN+SS G+RLGGET+K  ET+G   GGV HG++EK  G K   P+TAFMN+NFQS+NNS+LYDS+CNH DPGLHL+  ++ DGD
Subjt:  AEVINLAGHNVGAVMEINQSSAGHRLGGETIKKNETEGGGDGGVSHGNEEKKTGPKKETPMTAFMNSNFQSVNNSVLYDSACNHRDPGLHLAFSDAADGD

Query:  GATADGRKDYQP
        GAT  G K Y+P
Subjt:  GATADGRKDYQP

TrEMBL top hitse value%identityAlignment
A0A1S3AV38 zyxin-like6.9e-8563.26Show/hide
Query:  MANLPRFGR-RQRVPVAAPPVPAAAQPAAEPKPEILTFALGTQTLQPLEPNPAPATAKAPASPLITSPRRLPSPAKNATSPSASPKYGASVTRV------
        MANLPRFGR RQR+P   PPVPAAAQPA EP+ +IL FA                T  APASP   SPR L SP K ATSP ASPKYG S TR+      
Subjt:  MANLPRFGR-RQRVPVAAPPVPAAAQPAAEPKPEILTFALGTQTLQPLEPNPAPATAKAPASPLITSPRRLPSPAKNATSPSASPKYGASVTRV------

Query:  --AIPPPVSPANKYSDRIIGQKITAQSPAKSGRATSPPLSPLALPRTQVTSGNGTAAQPRIQPEVEQKSIVYNKTVEKPAKADRPS-EYGSGKPPQ-KQQ
           + PP S  +KY +R  G+     SPAKS RA +PPLSPLALPR QV +GNGT AQPR+QPEVE K IVYNKTVEKP+K++R S EYGS K  Q KQ+
Subjt:  --AIPPPVSPANKYSDRIIGQKITAQSPAKSGRATSPPLSPLALPRTQVTSGNGTAAQPRIQPEVEQKSIVYNKTVEKPAKADRPS-EYGSGKPPQ-KQQ

Query:  AEVINLAGHNVGAVMEINQSSAGHRLGGETIKKNETEGGGDGGVSHGNEEKKT-GPKKETPMTAFMNSNFQSVNNSVLYDSACNHRDPGLHLAFSDAADG
         EVI L GHNVGAVMEIN+SS G+RLGGET+KKNETE  GD    +G+E+KKT   KKE P+TAFMNSNFQSVNNS+L+DS+CNHRDPGLHLAF DA DG
Subjt:  AEVINLAGHNVGAVMEINQSSAGHRLGGETIKKNETEGGGDGGVSHGNEEKKT-GPKKETPMTAFMNSNFQSVNNSVLYDSACNHRDPGLHLAFSDAADG

Query:  DGATADGRKDYQP
        DGA  DG+K Y+P
Subjt:  DGATADGRKDYQP

A0A5A7TZ24 Zyxin-like6.9e-8563.26Show/hide
Query:  MANLPRFGR-RQRVPVAAPPVPAAAQPAAEPKPEILTFALGTQTLQPLEPNPAPATAKAPASPLITSPRRLPSPAKNATSPSASPKYGASVTRV------
        MANLPRFGR RQR+P   PPVPAAAQPA EP+ +IL FA                T  APASP   SPR L SP K ATSP ASPKYG S TR+      
Subjt:  MANLPRFGR-RQRVPVAAPPVPAAAQPAAEPKPEILTFALGTQTLQPLEPNPAPATAKAPASPLITSPRRLPSPAKNATSPSASPKYGASVTRV------

Query:  --AIPPPVSPANKYSDRIIGQKITAQSPAKSGRATSPPLSPLALPRTQVTSGNGTAAQPRIQPEVEQKSIVYNKTVEKPAKADRPS-EYGSGKPPQ-KQQ
           + PP S  +KY +R  G+     SPAKS RA +PPLSPLALPR QV +GNGT AQPR+QPEVE K IVYNKTVEKP+K++R S EYGS K  Q KQ+
Subjt:  --AIPPPVSPANKYSDRIIGQKITAQSPAKSGRATSPPLSPLALPRTQVTSGNGTAAQPRIQPEVEQKSIVYNKTVEKPAKADRPS-EYGSGKPPQ-KQQ

Query:  AEVINLAGHNVGAVMEINQSSAGHRLGGETIKKNETEGGGDGGVSHGNEEKKT-GPKKETPMTAFMNSNFQSVNNSVLYDSACNHRDPGLHLAFSDAADG
         EVI L GHNVGAVMEIN+SS G+RLGGET+KKNETE  GD    +G+E+KKT   KKE P+TAFMNSNFQSVNNS+L+DS+CNHRDPGLHLAF DA DG
Subjt:  AEVINLAGHNVGAVMEINQSSAGHRLGGETIKKNETEGGGDGGVSHGNEEKKT-GPKKETPMTAFMNSNFQSVNNSVLYDSACNHRDPGLHLAFSDAADG

Query:  DGATADGRKDYQP
        DGA  DG+K Y+P
Subjt:  DGATADGRKDYQP

A0A6J1GDX3 wiskott-Aldrich syndrome protein family member 2-like8.2e-7866.42Show/hide
Query:  LEPNPAPATAKAPASPLITSPRRLPSPAKNATSPSASPKYGASVTRVAIP---------PPVSPANKYSDRIIGQKITAQSPAKSGRAT-SPPLSPLALP
        L+  PAP     PASPL +SP  LPSPAK   SP ASPKYG+SVTRV  P         PPVSPA KY DR I Q    QSP++S R +  PP  PLALP
Subjt:  LEPNPAPATAKAPASPLITSPRRLPSPAKNATSPSASPKYGASVTRVAIP---------PPVSPANKYSDRIIGQKITAQSPAKSGRAT-SPPLSPLALP

Query:  RTQVTSGNGTAAQPRIQPEVEQKSIVYNKTVEKPAKADRPSEYGSGKPPQKQQ-AEVINLAGHNVGAVMEINQSSAGHRLGGETIKKNETEGGGDGGVSH
         TQ T+ NGT  QP+IQPE+EQKSIVYNKTVEKPAK DR SEYGSGKP +KQ+ AE INLAGHNVGAVMEI++SS GHRLGGET++         G V  
Subjt:  RTQVTSGNGTAAQPRIQPEVEQKSIVYNKTVEKPAKADRPSEYGSGKPPQKQQ-AEVINLAGHNVGAVMEINQSSAGHRLGGETIKKNETEGGGDGGVSH

Query:  GNEEKKTGPKKETPMTAFMNSNFQSVNNSVLYDSACNHRDPGLHLAFSDAADGDGATADGRKDYQ
        GNEEKK   KKE PMTAF NSNFQSVNNSVLY S+CNHRDPGLHL F+DAADGDGA ADGRK+Y+
Subjt:  GNEEKKTGPKKETPMTAFMNSNFQSVNNSVLYDSACNHRDPGLHLAFSDAADGDGATADGRKDYQ

A0A6J1GHE5 sulfated surface glycoprotein 185-like1.7e-7860.51Show/hide
Query:  MANLPRFG-RRQRVPVAAPPVPAAAQPAAEPKPEILTFALGTQTLQPLEPNPAPATAKAPASPLITSPRRLPSPAKNATSPSASPKYGASVTRVAIP---
        M+N PRFG  RQR  +AAPPV     PAAE KPE   F   +Q+LQ                           PA+  TSP ASPKYG SVTRV  P   
Subjt:  MANLPRFG-RRQRVPVAAPPVPAAAQPAAEPKPEILTFALGTQTLQPLEPNPAPATAKAPASPLITSPRRLPSPAKNATSPSASPKYGASVTRVAIP---

Query:  -----PPVSPANKYSDRIIGQKITAQSPAKSGRAT-SPPLSPLALPRTQVTSGNGTAAQPRIQPEVEQKSIVYNKTVEKPAKADRPSEYGSGKPPQKQQA
             PPVSPA KY DR + Q    QSP++S R +  PP  PLALP TQ T+ N T  QPRIQPEVE+KSIVYNKTVEK  K+DRPSEYGSGKP +KQ+A
Subjt:  -----PPVSPANKYSDRIIGQKITAQSPAKSGRAT-SPPLSPLALPRTQVTSGNGTAAQPRIQPEVEQKSIVYNKTVEKPAKADRPSEYGSGKPPQKQQA

Query:  -EVINLAGHNVGAVMEINQSSAGHRLGGETIKKNETEGGG-DGGVSHGNEEKKTGPKKE--TPMTAFMNSNFQSVNNSVLYDSACNHRDPGLHLAFSDAA
         E INLAGHNVGAVMEI++SS GHRLGGET++KNETEGGG DG      EEKK   KK+   PMTAFMNSNFQSVNNSVLYDS+C+HRDPGLHL F+DAA
Subjt:  -EVINLAGHNVGAVMEINQSSAGHRLGGETIKKNETEGGG-DGGVSHGNEEKKTGPKKE--TPMTAFMNSNFQSVNNSVLYDSACNHRDPGLHLAFSDAA

Query:  DGDGATADGRKDYQ
        DGDGA  DGRK Y+
Subjt:  DGDGATADGRKDYQ

A0A6J1ICG8 wiskott-Aldrich syndrome protein family member 2-like5.7e-7960.32Show/hide
Query:  MANLPRFGR-RQRVPVAAPPVPAAAQPAAEPKPEILTFALGTQTLQPLEPNPAPATAKAPASPLITSPRRLPSPAKNATSPSASPKYGASVTRVAIP---
        M+N PRFGR RQR  VAAPPV     PAAE KPE L F   +QTLQ                           PA+  TSP ASPKYG SVT V  P   
Subjt:  MANLPRFGR-RQRVPVAAPPVPAAAQPAAEPKPEILTFALGTQTLQPLEPNPAPATAKAPASPLITSPRRLPSPAKNATSPSASPKYGASVTRVAIP---

Query:  -----PPVSPANKYSDRIIGQKITAQSPAKSGRATSPPLSPLALPRTQVTSGNGTAAQPRIQPEVEQKSIVYNKTVEKPAKADRPSEYGSGKPPQKQQ-A
             PPVSPA KY DR + Q    QSP++S R + PP  PLALP T  T+ +G   QPRIQ EVE+KSIVYNKTVEKP K+DRP EYGSGK  +KQ+ A
Subjt:  -----PPVSPANKYSDRIIGQKITAQSPAKSGRATSPPLSPLALPRTQVTSGNGTAAQPRIQPEVEQKSIVYNKTVEKPAKADRPSEYGSGKPPQKQQ-A

Query:  EVINLAGHNVGAVMEINQSSAGHRLGGETIKKNETEGGGDGGVSHGNEEKKTGPKKE-----TPMTAFMNSNFQSVNNSVLYDSACNHRDPGLHLAFSDA
        E INLAGHNVGAVMEI++ SA HRLGGET++KN+TE GGDGG   GNEEKK   KK+      PMTAFMNSNFQSVNNSVLYDS+CNHRDPGLHL F+DA
Subjt:  EVINLAGHNVGAVMEINQSSAGHRLGGETIKKNETEGGGDGGVSHGNEEKKTGPKKE-----TPMTAFMNSNFQSVNNSVLYDSACNHRDPGLHLAFSDA

Query:  ADGDGATADGRKDYQ
        ADGDGA  DGRK+Y+
Subjt:  ADGDGATADGRKDYQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G46630.1 unknown protein1.1e-1029.33Show/hide
Query:  RVPVAAPPVPAAAQPAAEPKPEILTFALGTQTLQPLEPNPAPATAKAP---ASPLITSPRR---LPSPAKNATSPSASPKY------GASVTRVAIPP--
        R P   P  P   QP + P+ +    +   Q  QPL P    A   +P    SP  + P R    P+P K AT P   P+            + A+PP  
Subjt:  RVPVAAPPVPAAAQPAAEPKPEILTFALGTQTLQPLEPNPAPATAKAP---ASPLITSPRR---LPSPAKNATSPSASPKY------GASVTRVAIPP--

Query:  ---PVSPANKYSDRIIGQKITAQSPAKS---GRATSP-PLSPLALPRT------QVTSGNGTAAQPRIQP-EVEQKSIVYNKTVEKPAKADRPSEYG---
           P SPA+  S     + +  +SP++S    +A SP  LSP +LP +      + T  N   A+   Q  E    +  +N    +    ++   Y    
Subjt:  ---PVSPANKYSDRIIGQKITAQSPAKS---GRATSP-PLSPLALPRT------QVTSGNGTAAQPRIQP-EVEQKSIVYNKTVEKPAKADRPSEYG---

Query:  --SGKPPQKQQAE-------------VINLAGHNVGAVMEINQSSAGHRLGGE-TIKKNETEGGGDGG-----VSHGNEEKKTGPKKET-----------
           G  P+K   +             VI +AG N GAVMEI +S  G++ GG  T     + G G+ G      S  + ++  G KK T           
Subjt:  --SGKPPQKQQAE-------------VINLAGHNVGAVMEINQSSAGHRLGGE-TIKKNETEGGGDGG-----VSHGNEEKKTGPKKET-----------

Query:  PMTAFMNSNFQSVNNSVLYDSACNHRDPGLHLAFSDAADGD
        PM AFMNSN Q +NNS++Y+S  +H DPG+HL  S     D
Subjt:  PMTAFMNSNFQSVNNSVLYDSACNHRDPGLHLAFSDAADGD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAAATCTTCCTCGCTTTGGCCGACGGCAACGTGTTCCGGTGGCGGCGCCGCCAGTTCCCGCCGCCGCACAGCCTGCTGCAGAACCGAAGCCTGAAATTCTAACGTT
TGCTCTGGGCACCCAAACTCTTCAACCTTTAGAACCCAACCCGGCGCCGGCGACGGCGAAGGCGCCGGCTTCTCCTCTGATAACATCACCCCGCCGGCTTCCCTCTCCGG
CGAAGAATGCCACGTCACCCTCTGCGTCGCCAAAATATGGAGCTTCTGTCACACGTGTGGCCATCCCGCCGCCGGTTTCGCCTGCAAACAAATATTCCGACCGGATAATT
GGGCAAAAAATCACAGCACAGTCGCCGGCGAAGTCCGGGCGAGCAACCTCGCCGCCGCTTTCTCCTCTTGCTCTGCCTCGTACTCAAGTGACTTCCGGGAATGGAACTGC
GGCTCAACCCAGGATTCAGCCGGAGGTGGAGCAGAAAAGCATTGTGTACAACAAGACCGTCGAGAAGCCGGCGAAGGCTGACCGCCCATCGGAGTACGGCTCCGGCAAGC
CACCGCAGAAGCAGCAGGCGGAGGTTATAAACCTCGCCGGACATAACGTCGGCGCCGTCATGGAAATAAATCAGTCCTCCGCCGGCCACCGTTTGGGCGGAGAAACCATA
AAAAAGAACGAAACAGAAGGCGGCGGCGACGGCGGCGTCAGCCATGGAAATGAAGAGAAGAAAACGGGGCCAAAGAAGGAAACGCCGATGACGGCATTTATGAACAGCAA
TTTCCAGAGTGTAAACAATTCGGTTCTGTACGACTCGGCGTGCAACCACCGTGATCCCGGCCTGCATCTCGCGTTTTCCGATGCGGCGGACGGCGACGGAGCCACTGCTG
ACGGCCGTAAGGATTACCAGCCATAG
mRNA sequenceShow/hide mRNA sequence
ATGGCAAATCTTCCTCGCTTTGGCCGACGGCAACGTGTTCCGGTGGCGGCGCCGCCAGTTCCCGCCGCCGCACAGCCTGCTGCAGAACCGAAGCCTGAAATTCTAACGTT
TGCTCTGGGCACCCAAACTCTTCAACCTTTAGAACCCAACCCGGCGCCGGCGACGGCGAAGGCGCCGGCTTCTCCTCTGATAACATCACCCCGCCGGCTTCCCTCTCCGG
CGAAGAATGCCACGTCACCCTCTGCGTCGCCAAAATATGGAGCTTCTGTCACACGTGTGGCCATCCCGCCGCCGGTTTCGCCTGCAAACAAATATTCCGACCGGATAATT
GGGCAAAAAATCACAGCACAGTCGCCGGCGAAGTCCGGGCGAGCAACCTCGCCGCCGCTTTCTCCTCTTGCTCTGCCTCGTACTCAAGTGACTTCCGGGAATGGAACTGC
GGCTCAACCCAGGATTCAGCCGGAGGTGGAGCAGAAAAGCATTGTGTACAACAAGACCGTCGAGAAGCCGGCGAAGGCTGACCGCCCATCGGAGTACGGCTCCGGCAAGC
CACCGCAGAAGCAGCAGGCGGAGGTTATAAACCTCGCCGGACATAACGTCGGCGCCGTCATGGAAATAAATCAGTCCTCCGCCGGCCACCGTTTGGGCGGAGAAACCATA
AAAAAGAACGAAACAGAAGGCGGCGGCGACGGCGGCGTCAGCCATGGAAATGAAGAGAAGAAAACGGGGCCAAAGAAGGAAACGCCGATGACGGCATTTATGAACAGCAA
TTTCCAGAGTGTAAACAATTCGGTTCTGTACGACTCGGCGTGCAACCACCGTGATCCCGGCCTGCATCTCGCGTTTTCCGATGCGGCGGACGGCGACGGAGCCACTGCTG
ACGGCCGTAAGGATTACCAGCCATAG
Protein sequenceShow/hide protein sequence
MANLPRFGRRQRVPVAAPPVPAAAQPAAEPKPEILTFALGTQTLQPLEPNPAPATAKAPASPLITSPRRLPSPAKNATSPSASPKYGASVTRVAIPPPVSPANKYSDRII
GQKITAQSPAKSGRATSPPLSPLALPRTQVTSGNGTAAQPRIQPEVEQKSIVYNKTVEKPAKADRPSEYGSGKPPQKQQAEVINLAGHNVGAVMEINQSSAGHRLGGETI
KKNETEGGGDGGVSHGNEEKKTGPKKETPMTAFMNSNFQSVNNSVLYDSACNHRDPGLHLAFSDAADGDGATADGRKDYQP