; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0012901 (gene) of Snake gourd v1 genome

Gene IDTan0012901
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionDNA-directed RNA polymerase II subunit 1-like
Genome locationLG10:1146572..1147423
RNA-Seq ExpressionTan0012901
SyntenyTan0012901
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0048864.1 WW domain-binding protein 11-like [Cucumis melo var. makuwa]1.9e-6758.13Show/hide
Query:  MANLPRFGRAWQRFSAPPRPAPAA------PQPETLPLAPTTQSLQPFEPNPPPVASLPSSPPRQLTPWISSPVKKATSPAASPKYYATVTHVA------
        MANLPR+GR WQR S+ PR APAA      P+PE LPLAPT Q+LQ FEP PPPVA+ P S     TP   +P  + + PAASPKY ATV   A      
Subjt:  MANLPRFGRAWQRFSAPPRPAPAA------PQPETLPLAPTTQSLQPFEPNPPPVASLPSSPPRQLTPWISSPVKKATSPAASPKYYATVTHVA------

Query:  --SSPVSPSRKPYDRRHPISPNSYQKTVKPTTPPLSPLVLPKSANVNTIQSIVQPEVERIADLYKKPAKPDRKTEYGSGKLPQ--KAEAINLSGHNVGAV
          S PVSP RK  + R+ ISPNSYQ+ +KPTTP LS LVLPKS++V TI S ++PEVE+     K   K D + EY SG  P   +A+AINL+G N+GAV
Subjt:  --SSPVSPSRKPYDRRHPISPNSYQKTVKPTTPPLSPLVLPKSANVNTIQSIVQPEVERIADLYKKPAKPDRKTEYGSGKLPQ--KAEAINLSGHNVGAV

Query:  MEVKQFSDKRSGGEVIKKIETETGVRHGNDDGKMGAKEKSHRATGFPMTQFMNSNFQDVNNSVMYNSSCSGRDPGLHLDFSGKSKDDGA
        M++ QFSDK SGGEV +KIETETGV+  ND+      EKS R T FPMT   NSNFQ+VNNSV+YNSSC+ RDPGLHLDFSGK KD+ A
Subjt:  MEVKQFSDKRSGGEVIKKIETETGVRHGNDDGKMGAKEKSHRATGFPMTQFMNSNFQDVNNSVMYNSSCSGRDPGLHLDFSGKSKDDGA

XP_008437853.1 PREDICTED: WW domain-binding protein 11-like [Cucumis melo]4.1e-4956.07Show/hide
Query:  NPPPVASLPSSPPRQLTPWISSPVKKATSPAASPKYYATVTHVA--------SSPVSPSRKPYDRRHPISPNSYQKTVKPTTPPLSPLVLPKSANVNTIQ
        NPPP       PP   TP   +P  + + PAASPKY ATV   A        S PVSP RK  + R+ ISPNSYQ+ +KPTTP LS LVLPKS++V TI 
Subjt:  NPPPVASLPSSPPRQLTPWISSPVKKATSPAASPKYYATVTHVA--------SSPVSPSRKPYDRRHPISPNSYQKTVKPTTPPLSPLVLPKSANVNTIQ

Query:  SIVQPEVERIADLYKKPAKPDRKTEYGSGKLPQ--KAEAINLSGHNVGAVMEVKQFSDKRSGGEVIKKIETETGVRHGNDDGKMGAKEKSHRATGFPMTQ
        S ++PEVE+     K   K D + EY SG  P   +A+AINL+G N+GAVM++ QFSDK SGGEV +KIETETGV+  ND+      EKS R T FPMT 
Subjt:  SIVQPEVERIADLYKKPAKPDRKTEYGSGKLPQ--KAEAINLSGHNVGAVMEVKQFSDKRSGGEVIKKIETETGVRHGNDDGKMGAKEKSHRATGFPMTQ

Query:  FMNSNFQDVNNSVMYNSSCSGRDPGLHLDFSGKSKDDGA
          NSNFQ+VNNSV+YNSSC+ RDPGLHLDFSGK KD+ A
Subjt:  FMNSNFQDVNNSVMYNSSCSGRDPGLHLDFSGKSKDDGA

XP_011650663.1 gibberellin-regulated protein 14 [Cucumis sativus]2.5e-7058.19Show/hide
Query:  MANLPRFGRAWQRFSAPPRPAPAA------PQPETLPLAPTTQSLQPFEPNPPPVASLPSSPPRQLTPWISSPVKKATSPAASPKYYATVTHVA------
        MANLPR+GR W R S   R AP A      P+PE LPLAPT Q+LQPFEP PP  A+ PSS P + TP IS        PAASPKY ATV   A      
Subjt:  MANLPRFGRAWQRFSAPPRPAPAA------PQPETLPLAPTTQSLQPFEPNPPPVASLPSSPPRQLTPWISSPVKKATSPAASPKYYATVTHVA------

Query:  --SSPVSPSRKPYDRRHPISPNSYQKTVKPTTPPLSPLVLPKSANVNTIQSIVQPEVERIADLYKKPAKPDRKTEYGSGKLPQ--KAEAINLSGHNVGAV
          S PVSP RK  D RH ISPNSYQ+ +KPT P LS L LPKSA+V T+ S ++PEVE+  D +K   K DR+ +  S K P   +A+AINL+G N+GAV
Subjt:  --SSPVSPSRKPYDRRHPISPNSYQKTVKPTTPPLSPLVLPKSANVNTIQSIVQPEVERIADLYKKPAKPDRKTEYGSGKLPQ--KAEAINLSGHNVGAV

Query:  MEVKQFSDKRSGGEVIKKIETETGVRHGNDDGKMGAKEKSHRATGFPMTQFMNSNFQDVNNSVMYNSSCSGRDPGLHLDFSGKSKDDGAIFDGGEKSKY
        M++ QFSDK SGGEV +KIET+TGV+  ND       EKS R T FPMT   NSNFQ+VNNSVMYNSSCSGRDPGLHLDFSG+ KD+ A  DG +KSKY
Subjt:  MEVKQFSDKRSGGEVIKKIETETGVRHGNDDGKMGAKEKSHRATGFPMTQFMNSNFQDVNNSVMYNSSCSGRDPGLHLDFSGKSKDDGAIFDGGEKSKY

XP_022147458.1 vegetative cell wall protein gp1-like [Momordica charantia]5.0e-5552.65Show/hide
Query:  MANLPRFGRAWQRFSAPPRPAPAAPQPETLPLAPTTQSLQPFEPNPPPVASLPSSPPRQLTPWISSPVKKATSPAASPKYYATVTHVASS--------PV
        MANLPRFGR WQR +    PAPAA  P      PTTQ+LQPFEPN PP AS  SSP +      +SP ++   P+ASPKY A    V SS        P 
Subjt:  MANLPRFGRAWQRFSAPPRPAPAAPQPETLPLAPTTQSLQPFEPNPPPVASLPSSPPRQLTPWISSPVKKATSPAASPKYYATVTHVASS--------PV

Query:  SPSRKPYDRRH-----PISPNSYQKTVKPTTPPLSPLVLPKSANVNTIQSIVQPEVERIADLY----KKPAKPDRKTEYGSGKLPQKAEAINLSGHNVGA
        SP+ K  DR H     P+SP   ++T    TPPLSPL LP++AN   + S V PEVE+ + LY    +KPAK  R +E+GSGK PQ AE INL+GHNVGA
Subjt:  SPSRKPYDRRH-----PISPNSYQKTVKPTTPPLSPLVLPKSANVNTIQSIVQPEVERIADLY----KKPAKPDRKTEYGSGKLPQKAEAINLSGHNVGA

Query:  VMEVKQFSDKRSGGEVIKKIETET---GVRHGNDDGKMGAKEKSHRATGFPMTQFMNSNFQDVNNSVMYNSSCSGRDPGLHLDFSGKSK-DDGAIFDGGE
        VME+ Q S K S GE+IKK E+ET   G  HGND+ K GAK++       P T FMNSNFQ VNNSV+Y+SSCS RDPGLHL FS  +    GAI DG  
Subjt:  VMEVKQFSDKRSGGEVIKKIETET---GVRHGNDDGKMGAKEKSHRATGFPMTQFMNSNFQDVNNSVMYNSSCSGRDPGLHLDFSGKSK-DDGAIFDGGE

Query:  KS
        K+
Subjt:  KS

XP_038879417.1 proline-rich receptor-like protein kinase PERK8 [Benincasa hispida]8.2e-4244.94Show/hide
Query:  MANLPRFGRAWQRFSAPPRPAPAA------PQPETLPLAPTTQSLQPFEPNPPPVASLPSSPPRQLTPWISSPVKKATSPAASPKYYATVTHVASSPV--
        MANLPR GR  QR SA   P  AA      P+PE  P APT+   QP EP  P     P+SP RQ    I+SP KKATSP ASPKY  ++T V S P   
Subjt:  MANLPRFGRAWQRFSAPPRPAPAA------PQPETLPLAPTTQSLQPFEPNPPPVASLPSSPPRQLTPWISSPVKKATSPAASPKYYATVTHVASSPV--

Query:  -------SPSRKPYDRRH-----PISPNSYQKTVKPTTPPLSPLVLPKSANVN----TIQSIVQPEVERIADLY----KKPAKPDRKTEYGSGKLPQK--
               SP+ K  + R+     P+SP   ++T    TPPLSPL LP++  ++    T     QP VE    +Y    +KP K DR +EYGSGK  QK  
Subjt:  -------SPSRKPYDRRH-----PISPNSYQKTVKPTTPPLSPLVLPKSANVN----TIQSIVQPEVERIADLY----KKPAKPDRKTEYGSGKLPQK--

Query:  --AEAINLSGHNVGAVMEVKQFSD-KRSGGEVIKKIETE-TGVRHGNDDGKMGAKEKSHRATGFPMTQFMNSNFQDVNNSVMYNSSCSGRDPGLHLDFSG
          AE INL+GHNVGAVME+ + SD  R GGE +K  ET+  GV HG+ +   GAK         P+T FMN+NFQ +NNS++Y+SSC+  DPGLHL    
Subjt:  --AEAINLSGHNVGAVMEVKQFSD-KRSGGEVIKKIETE-TGVRHGNDDGKMGAKEKSHRATGFPMTQFMNSNFQDVNNSVMYNSSCSGRDPGLHLDFSG

Query:  KSKDDGAIFDGGEKSK
            DGA   G +  K
Subjt:  KSKDDGAIFDGGEKSK

TrEMBL top hitse value%identityAlignment
A0A0A0L8G8 Uncharacterized protein1.2e-7058.19Show/hide
Query:  MANLPRFGRAWQRFSAPPRPAPAA------PQPETLPLAPTTQSLQPFEPNPPPVASLPSSPPRQLTPWISSPVKKATSPAASPKYYATVTHVA------
        MANLPR+GR W R S   R AP A      P+PE LPLAPT Q+LQPFEP PP  A+ PSS P + TP IS        PAASPKY ATV   A      
Subjt:  MANLPRFGRAWQRFSAPPRPAPAA------PQPETLPLAPTTQSLQPFEPNPPPVASLPSSPPRQLTPWISSPVKKATSPAASPKYYATVTHVA------

Query:  --SSPVSPSRKPYDRRHPISPNSYQKTVKPTTPPLSPLVLPKSANVNTIQSIVQPEVERIADLYKKPAKPDRKTEYGSGKLPQ--KAEAINLSGHNVGAV
          S PVSP RK  D RH ISPNSYQ+ +KPT P LS L LPKSA+V T+ S ++PEVE+  D +K   K DR+ +  S K P   +A+AINL+G N+GAV
Subjt:  --SSPVSPSRKPYDRRHPISPNSYQKTVKPTTPPLSPLVLPKSANVNTIQSIVQPEVERIADLYKKPAKPDRKTEYGSGKLPQ--KAEAINLSGHNVGAV

Query:  MEVKQFSDKRSGGEVIKKIETETGVRHGNDDGKMGAKEKSHRATGFPMTQFMNSNFQDVNNSVMYNSSCSGRDPGLHLDFSGKSKDDGAIFDGGEKSKY
        M++ QFSDK SGGEV +KIET+TGV+  ND       EKS R T FPMT   NSNFQ+VNNSVMYNSSCSGRDPGLHLDFSG+ KD+ A  DG +KSKY
Subjt:  MEVKQFSDKRSGGEVIKKIETETGVRHGNDDGKMGAKEKSHRATGFPMTQFMNSNFQDVNNSVMYNSSCSGRDPGLHLDFSGKSKDDGAIFDGGEKSKY

A0A1S3AVL7 WW domain-binding protein 11-like2.0e-4956.07Show/hide
Query:  NPPPVASLPSSPPRQLTPWISSPVKKATSPAASPKYYATVTHVA--------SSPVSPSRKPYDRRHPISPNSYQKTVKPTTPPLSPLVLPKSANVNTIQ
        NPPP       PP   TP   +P  + + PAASPKY ATV   A        S PVSP RK  + R+ ISPNSYQ+ +KPTTP LS LVLPKS++V TI 
Subjt:  NPPPVASLPSSPPRQLTPWISSPVKKATSPAASPKYYATVTHVA--------SSPVSPSRKPYDRRHPISPNSYQKTVKPTTPPLSPLVLPKSANVNTIQ

Query:  SIVQPEVERIADLYKKPAKPDRKTEYGSGKLPQ--KAEAINLSGHNVGAVMEVKQFSDKRSGGEVIKKIETETGVRHGNDDGKMGAKEKSHRATGFPMTQ
        S ++PEVE+     K   K D + EY SG  P   +A+AINL+G N+GAVM++ QFSDK SGGEV +KIETETGV+  ND+      EKS R T FPMT 
Subjt:  SIVQPEVERIADLYKKPAKPDRKTEYGSGKLPQ--KAEAINLSGHNVGAVMEVKQFSDKRSGGEVIKKIETETGVRHGNDDGKMGAKEKSHRATGFPMTQ

Query:  FMNSNFQDVNNSVMYNSSCSGRDPGLHLDFSGKSKDDGA
          NSNFQ+VNNSV+YNSSC+ RDPGLHLDFSGK KD+ A
Subjt:  FMNSNFQDVNNSVMYNSSCSGRDPGLHLDFSGKSKDDGA

A0A5D3DB96 WW domain-binding protein 11-like9.4e-6858.13Show/hide
Query:  MANLPRFGRAWQRFSAPPRPAPAA------PQPETLPLAPTTQSLQPFEPNPPPVASLPSSPPRQLTPWISSPVKKATSPAASPKYYATVTHVA------
        MANLPR+GR WQR S+ PR APAA      P+PE LPLAPT Q+LQ FEP PPPVA+ P S     TP   +P  + + PAASPKY ATV   A      
Subjt:  MANLPRFGRAWQRFSAPPRPAPAA------PQPETLPLAPTTQSLQPFEPNPPPVASLPSSPPRQLTPWISSPVKKATSPAASPKYYATVTHVA------

Query:  --SSPVSPSRKPYDRRHPISPNSYQKTVKPTTPPLSPLVLPKSANVNTIQSIVQPEVERIADLYKKPAKPDRKTEYGSGKLPQ--KAEAINLSGHNVGAV
          S PVSP RK  + R+ ISPNSYQ+ +KPTTP LS LVLPKS++V TI S ++PEVE+     K   K D + EY SG  P   +A+AINL+G N+GAV
Subjt:  --SSPVSPSRKPYDRRHPISPNSYQKTVKPTTPPLSPLVLPKSANVNTIQSIVQPEVERIADLYKKPAKPDRKTEYGSGKLPQ--KAEAINLSGHNVGAV

Query:  MEVKQFSDKRSGGEVIKKIETETGVRHGNDDGKMGAKEKSHRATGFPMTQFMNSNFQDVNNSVMYNSSCSGRDPGLHLDFSGKSKDDGA
        M++ QFSDK SGGEV +KIETETGV+  ND+      EKS R T FPMT   NSNFQ+VNNSV+YNSSC+ RDPGLHLDFSGK KD+ A
Subjt:  MEVKQFSDKRSGGEVIKKIETETGVRHGNDDGKMGAKEKSHRATGFPMTQFMNSNFQDVNNSVMYNSSCSGRDPGLHLDFSGKSKDDGA

A0A6J1D1D2 vegetative cell wall protein gp1-like2.4e-5552.65Show/hide
Query:  MANLPRFGRAWQRFSAPPRPAPAAPQPETLPLAPTTQSLQPFEPNPPPVASLPSSPPRQLTPWISSPVKKATSPAASPKYYATVTHVASS--------PV
        MANLPRFGR WQR +    PAPAA  P      PTTQ+LQPFEPN PP AS  SSP +      +SP ++   P+ASPKY A    V SS        P 
Subjt:  MANLPRFGRAWQRFSAPPRPAPAAPQPETLPLAPTTQSLQPFEPNPPPVASLPSSPPRQLTPWISSPVKKATSPAASPKYYATVTHVASS--------PV

Query:  SPSRKPYDRRH-----PISPNSYQKTVKPTTPPLSPLVLPKSANVNTIQSIVQPEVERIADLY----KKPAKPDRKTEYGSGKLPQKAEAINLSGHNVGA
        SP+ K  DR H     P+SP   ++T    TPPLSPL LP++AN   + S V PEVE+ + LY    +KPAK  R +E+GSGK PQ AE INL+GHNVGA
Subjt:  SPSRKPYDRRH-----PISPNSYQKTVKPTTPPLSPLVLPKSANVNTIQSIVQPEVERIADLY----KKPAKPDRKTEYGSGKLPQKAEAINLSGHNVGA

Query:  VMEVKQFSDKRSGGEVIKKIETET---GVRHGNDDGKMGAKEKSHRATGFPMTQFMNSNFQDVNNSVMYNSSCSGRDPGLHLDFSGKSK-DDGAIFDGGE
        VME+ Q S K S GE+IKK E+ET   G  HGND+ K GAK++       P T FMNSNFQ VNNSV+Y+SSCS RDPGLHL FS  +    GAI DG  
Subjt:  VMEVKQFSDKRSGGEVIKKIETET---GVRHGNDDGKMGAKEKSHRATGFPMTQFMNSNFQDVNNSVMYNSSCSGRDPGLHLDFSGKSK-DDGAIFDGGE

Query:  KS
        K+
Subjt:  KS

A0A6J1ICG8 wiskott-Aldrich syndrome protein family member 2-like1.4e-3943.91Show/hide
Query:  MANLPRFGRAWQR--FSAP---PRPAPAAP-QPETLPLAPTTQSLQPFEPNPPPVASLPSSPPRQLTPWISSPVKKATSPAASPKYYATVTHV-------
        M+N PRFGR  QR   +AP   P   PAA  +PETLP   T+Q+LQP E                          K TSP ASPKY  +VT V       
Subjt:  MANLPRFGRAWQR--FSAP---PRPAPAAP-QPETLPLAPTTQSLQPFEPNPPPVASLPSSPPRQLTPWISSPVKKATSPAASPKYYATVTHV-------

Query:  --ASSPVSPSRKPYDRRHPISP-NSYQKTVKPTTPPLSPLVLP----KSANVNTIQSIVQPEVERIADLY----KKPAKPDRKTEYGSGKLPQK---AEA
           S PVSP++K  DR    SP  S  ++++ + PP  PL LP     + + NT Q  +Q EVE+ + +Y    +KP K DR  EYGSGK  +K   AE+
Subjt:  --ASSPVSPSRKPYDRRHPISP-NSYQKTVKPTTPPLSPLVLP----KSANVNTIQSIVQPEVERIADLY----KKPAKPDRKTEYGSGKLPQK---AEA

Query:  INLSGHNVGAVMEVKQFS-DKRSGGEVIKKIETE--TGVRHGNDDGKMGAKEKSHRATGFPMTQFMNSNFQDVNNSVMYNSSCSGRDPGLHLDFSGKSKD
        INL+GHNVGAVME+ + S   R GGE ++K +TE   G R GN++ K   K K  +    PMT FMNSNFQ VNNSV+Y+SSC+ RDPGLHL F+  +  
Subjt:  INLSGHNVGAVMEVKQFS-DKRSGGEVIKKIETE--TGVRHGNDDGKMGAKEKSHRATGFPMTQFMNSNFQDVNNSVMYNSSCSGRDPGLHLDFSGKSKD

Query:  DGAIFDGGEKSK
        DGA  DG +  K
Subjt:  DGAIFDGGEKSK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G63310.1 unknown protein6.6e-0536.56Show/hide
Query:  INLSGHNVGAVMEVKQFSDKRSGGEVIKKIETETGVRHGNDDGKMGAKEKSHRATGFPMTQFMNSNFQDVNNSVMYNSSCSGRDPGLHLDFSG
        I LSG N+GA M                  +TE    HG+ D + G  E    +T      ++NSNFQ VNNS+M  +     DPG+HLD SG
Subjt:  INLSGHNVGAVMEVKQFSDKRSGGEVIKKIETETGVRHGNDDGKMGAKEKSHRATGFPMTQFMNSNFQDVNNSVMYNSSCSGRDPGLHLDFSG

AT1G75260.1 oxidoreductases, acting on NADH or NADPH4.3e-0428.7Show/hide
Query:  KTEYGSGKLPQKAEAI-NLSGHNVGAVMEVKQFSDKRSGGEVIKKIETETGVRHGNDDGKMGAKEKSHRA-TGFPMTQFMNSNFQDVNNSVMYNSSCSGR
        K  +G      K+ ++  L+G N GA M +    DK+ G   I++          N         K   A      T ++N N Q +NNS++  SS S  
Subjt:  KTEYGSGKLPQKAEAI-NLSGHNVGAVMEVKQFSDKRSGGEVIKKIETETGVRHGNDDGKMGAKEKSHRA-TGFPMTQFMNSNFQDVNNSVMYNSSCSGR

Query:  DPGLHLDF
        DPG+H+ F
Subjt:  DPGLHLDF

AT2G46630.1 unknown protein1.2e-0624.92Show/hide
Query:  RAWQRFSAPPR---PAPAAPQPETLPLAPTTQSLQPFEPNPPPVASLPSSPPRQLTPWISSPVKKATSPAASPKYYATVTHVASSPVSPSRKPYDRRHPI
        R  Q+   PPR   P  + PQ  +   +P ++ + P  P PP  A+ P  PPR  + + S P  K    A  P+   +    A S  S + +    R P 
Subjt:  RAWQRFSAPPR---PAPAAPQPETLPLAPTTQSLQPFEPNPPPVASLPSSPPRQLTPWISSPVKKATSPAASPKYYATVTHVASSPVSPSRKPYDRRHPI

Query:  SPNSYQKTVKPTTPPLSPLVLPKSANVNTIQSIVQPEV---ERIADLYK---------------------------------KPAKPDRKTEYGSGKLPQ
           +++K   P    LSP  LP S  +++ +   Q  +   E+ +  ++                                  P K  R+      +   
Subjt:  SPNSYQKTVKPTTPPLSPLVLPKSANVNTIQSIVQPEV---ERIADLYK---------------------------------KPAKPDRKTEYGSGKLPQ

Query:  KAEAINLSGHNVGAVMEVKQ------------FSDKRSGGEVIKKIETETGVRHGNDDGKMGAKEKSHRA-----TGFPMTQFMNSNFQDVNNSVMYNSS
            I ++G N GAVME+ +             S + S G   K    ++     +D+G+ G K+ +        +  PM  FMNSN Q +NNS++YNS+
Subjt:  KAEAINLSGHNVGAVMEVKQ------------FSDKRSGGEVIKKIETETGVRHGNDDGKMGAKEKSHRA-----TGFPMTQFMNSNFQDVNNSVMYNSS

Query:  CSGRDPGLHLDFSGKSKDDGA--IFDGGEKSKY
         S  DPG+HL  S K   D    + D G    Y
Subjt:  CSGRDPGLHLDFSGKSKDDGA--IFDGGEKSKY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCAATCTTCCTCGCTTTGGTCGGGCATGGCAGCGCTTTTCCGCACCCCCCCGCCCTGCTCCGGCCGCCCCACAGCCTGAGACTCTGCCATTAGCTCCGACCACCCA
ATCTCTGCAACCTTTCGAGCCCAACCCGCCCCCGGTGGCTTCTCTGCCGTCTTCCCCTCCAAGACAACTCACTCCTTGGATTTCCTCTCCGGTGAAGAAAGCGACCTCAC
CCGCGGCATCCCCAAAATATTATGCTACCGTCACACATGTGGCCAGCTCGCCGGTCTCCCCTTCGCGCAAACCTTACGACCGGAGACATCCAATTAGCCCTAATTCGTAT
CAGAAAACCGTCAAGCCAACTACTCCCCCACTTTCCCCTCTGGTTCTGCCGAAATCTGCTAATGTGAACACGATTCAATCCATAGTCCAACCGGAGGTGGAGCGGATTGC
CGATCTGTACAAGAAGCCGGCGAAGCCTGATCGGAAGACGGAGTACGGCTCCGGTAAGCTGCCGCAGAAGGCGGAGGCTATAAACCTCAGCGGACATAACGTAGGCGCGG
TCATGGAAGTAAAGCAATTCTCCGATAAACGTTCAGGCGGAGAAGTCATCAAGAAGATCGAAACAGAAACCGGCGTCCGCCATGGAAATGATGACGGGAAAATGGGTGCA
AAGGAGAAAAGTCACAGAGCAACCGGATTTCCGATGACGCAATTCATGAACAGCAATTTTCAAGACGTGAACAATTCCGTTATGTATAATTCGTCGTGCAGTGGCCGTGA
TCCGGGGCTGCACCTTGATTTCTCCGGCAAGTCGAAGGATGATGGAGCCATTTTCGACGGCGGCGAGAAATCTAAGTACTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCCAATCTTCCTCGCTTTGGTCGGGCATGGCAGCGCTTTTCCGCACCCCCCCGCCCTGCTCCGGCCGCCCCACAGCCTGAGACTCTGCCATTAGCTCCGACCACCCA
ATCTCTGCAACCTTTCGAGCCCAACCCGCCCCCGGTGGCTTCTCTGCCGTCTTCCCCTCCAAGACAACTCACTCCTTGGATTTCCTCTCCGGTGAAGAAAGCGACCTCAC
CCGCGGCATCCCCAAAATATTATGCTACCGTCACACATGTGGCCAGCTCGCCGGTCTCCCCTTCGCGCAAACCTTACGACCGGAGACATCCAATTAGCCCTAATTCGTAT
CAGAAAACCGTCAAGCCAACTACTCCCCCACTTTCCCCTCTGGTTCTGCCGAAATCTGCTAATGTGAACACGATTCAATCCATAGTCCAACCGGAGGTGGAGCGGATTGC
CGATCTGTACAAGAAGCCGGCGAAGCCTGATCGGAAGACGGAGTACGGCTCCGGTAAGCTGCCGCAGAAGGCGGAGGCTATAAACCTCAGCGGACATAACGTAGGCGCGG
TCATGGAAGTAAAGCAATTCTCCGATAAACGTTCAGGCGGAGAAGTCATCAAGAAGATCGAAACAGAAACCGGCGTCCGCCATGGAAATGATGACGGGAAAATGGGTGCA
AAGGAGAAAAGTCACAGAGCAACCGGATTTCCGATGACGCAATTCATGAACAGCAATTTTCAAGACGTGAACAATTCCGTTATGTATAATTCGTCGTGCAGTGGCCGTGA
TCCGGGGCTGCACCTTGATTTCTCCGGCAAGTCGAAGGATGATGGAGCCATTTTCGACGGCGGCGAGAAATCTAAGTACTGA
Protein sequenceShow/hide protein sequence
MANLPRFGRAWQRFSAPPRPAPAAPQPETLPLAPTTQSLQPFEPNPPPVASLPSSPPRQLTPWISSPVKKATSPAASPKYYATVTHVASSPVSPSRKPYDRRHPISPNSY
QKTVKPTTPPLSPLVLPKSANVNTIQSIVQPEVERIADLYKKPAKPDRKTEYGSGKLPQKAEAINLSGHNVGAVMEVKQFSDKRSGGEVIKKIETETGVRHGNDDGKMGA
KEKSHRATGFPMTQFMNSNFQDVNNSVMYNSSCSGRDPGLHLDFSGKSKDDGAIFDGGEKSKY