; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0020135 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0020135
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionMyosin heavy chain
Genome locationchr5:48364343..48365461
RNA-Seq ExpressionLag0020135
SyntenyLag0020135
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0050514.1 myosin heavy chain [Cucumis melo var. makuwa]3.1e-8981.19Show/hide
Query:  ENPSSTPPKLSLFSLPRPPSEPPGMVTPPLHASISVPFQWEEAPGKPRPFGIIESNSKPKSARSLDLPPRLFADAKVAHFASPTIAVDDPIAGQDLSSGL
        ENPSSTPPKLSLFSLPR P EPPGMVTPPLHASISVPFQWEEAPGKPRPFGIIE NSKPKSARSLDLPPRLFADAKVAHFASPT AVD+PI G+DLSS L
Subjt:  ENPSSTPPKLSLFSLPRPPSEPPGMVTPPLHASISVPFQWEEAPGKPRPFGIIESNSKPKSARSLDLPPRLFADAKVAHFASPTIAVDDPIAGQDLSSGL

Query:  SFRFPDSWAE------TATETKEGKDVKYVGTRRWMSFRKNKEIPKGGSEILVSGGGADGGGGGRGDGDTRVKITRFRSRRSFFGTSNSKSHLIANIYGS
        SFRFPD+WAE      TAT T+EGKD KYVG+RRWMSFRKNKEIPK GSEI V+GGG      G  DG+TRVKITRFRSRRSFF   NSKSH IANIYGS
Subjt:  SFRFPDSWAE------TATETKEGKDVKYVGTRRWMSFRKNKEIPKGGSEILVSGGGADGGGGGRGDGDTRVKITRFRSRRSFFGTSNSKSHLIANIYGS

Query:  LKQVIPWRRKHDEMRRVS
        LKQ I WRRK DEM  +S
Subjt:  LKQVIPWRRKHDEMRRVS

TYK29189.1 myosin heavy chain [Cucumis melo var. makuwa]5.3e-8980.45Show/hide
Query:  ENPSSTPPKLSLFSLPRPPSEPPGMVTPPLHASISVPFQWEEAPGKPRPFGIIESNSKPKSARSLDLPPRLFADAKVAHFASPTIAVDDPIAGQDLSSGL
        ENPSSTPPKLSLFSLPR P EPPGMVTPPLHASISVPFQWEEAPGKPRPFGIIE NSKPKSARSLDLPPRLFADAKVAHFASPT AVD+PI G+DLSS L
Subjt:  ENPSSTPPKLSLFSLPRPPSEPPGMVTPPLHASISVPFQWEEAPGKPRPFGIIESNSKPKSARSLDLPPRLFADAKVAHFASPTIAVDDPIAGQDLSSGL

Query:  SFRFPDSWAE--------TATETKEGKDVKYVGTRRWMSFRKNKEIPKGGSEILVSGGGADGGGGGRGDGDTRVKITRFRSRRSFFGTSNSKSHLIANIY
        SFRFPD+WAE        TAT T+EGKD KYVG+RRWMSFRKNKEIPK GSEI V+GGG      G  DG+TRVKITRFRSRRSFF   NSKSH IANIY
Subjt:  SFRFPDSWAE--------TATETKEGKDVKYVGTRRWMSFRKNKEIPKGGSEILVSGGGADGGGGGRGDGDTRVKITRFRSRRSFFGTSNSKSHLIANIY

Query:  GSLKQVIPWRRKHDEMRRVS
        GSLKQ I WRRK DEM  +S
Subjt:  GSLKQVIPWRRKHDEMRRVS

XP_008466837.1 PREDICTED: uncharacterized protein LOC103504144 [Cucumis melo]2.8e-9079.65Show/hide
Query:  MRTATEENPSSTPPKLSLFSLPRPPSEPPGMVTPPLHASISVPFQWEEAPGKPRPFGIIESNSKPKSARSLDLPPRLFADAKVAHFASPTIAVDDPIAGQ
        MR  T ENPSSTPPKLSLFSLPR P EPPGMVTPPLHASISVPFQWEEAPGKPRPFGIIE NSKPKSARSLDLPPRLFADAKVAHFASPT AVD+PI G+
Subjt:  MRTATEENPSSTPPKLSLFSLPRPPSEPPGMVTPPLHASISVPFQWEEAPGKPRPFGIIESNSKPKSARSLDLPPRLFADAKVAHFASPTIAVDDPIAGQ

Query:  DLSSGLSFRFPDSWAE--------TATETKEGKDVKYVGTRRWMSFRKNKEIPKGGSEILVSGGGADGGGGGRGDGDTRVKITRFRSRRSFFGTSNSKSH
        DLSS LSFRFPD+WAE        TAT T+EGKD KYVG+RRWMSFRKNKEIPK GSEI V+GGG      G  DG+TRVKITRFRSRRSFF   NSKSH
Subjt:  DLSSGLSFRFPDSWAE--------TATETKEGKDVKYVGTRRWMSFRKNKEIPKGGSEILVSGGGADGGGGGRGDGDTRVKITRFRSRRSFFGTSNSKSH

Query:  LIANIYGSLKQVIPWRRKHDEMRRVS
         IANIYGSLKQ I WRRK DEM  +S
Subjt:  LIANIYGSLKQVIPWRRKHDEMRRVS

XP_023552510.1 uncharacterized protein At4g00950-like [Cucurbita pepo subsp. pepo]4.1e-8979.19Show/hide
Query:  MRTATEENPSSTPPKLSLFSLPRPPSEPPGMVTPPLHASISVPFQWEEAPGKPRPFGIIESNSKPKSARSLDLPPRLFADAKVAHFASPTIAVDDPIAGQ
        MR AT ENPSSTPPKLSLFSLPR P EPPG+VTPPLHASISVPFQWEEAPGKPRPFGIIE NSKPKSARSLDLPPRLF D KVAHF+SPT AVDDP+ GQ
Subjt:  MRTATEENPSSTPPKLSLFSLPRPPSEPPGMVTPPLHASISVPFQWEEAPGKPRPFGIIESNSKPKSARSLDLPPRLFADAKVAHFASPTIAVDDPIAGQ

Query:  DLSSGLSFRFPDSWAETATETKEGKDVKYVGTRRWMSFRKNKEIPKGGSEILVSGGGADGGG--GGRGDGDTRVKITRFRSRRSFFGTSNSKSHLIANIY
        DLSS LSFRFPD+WAETA  T+EGK  KYVG+RRWMSFRKNKE+PKG SEI  S GG + GG   G G+G+TRVKITRFRSRR  F  S++KS LIA+IY
Subjt:  DLSSGLSFRFPDSWAETATETKEGKDVKYVGTRRWMSFRKNKEIPKGGSEILVSGGGADGGG--GGRGDGDTRVKITRFRSRRSFFGTSNSKSHLIANIY

Query:  GSLKQVIPWRRKHDEMRRVSR
        GSLKQVIPWRRK DE R VS+
Subjt:  GSLKQVIPWRRKHDEMRRVSR

XP_038882925.1 uncharacterized protein At4g00950-like [Benincasa hispida]7.4e-9183.26Show/hide
Query:  MRTATEENPSSTPPKLSLFSLPRPPSEPPGMVTPPLHASISVPFQWEEAPGKPRPFGIIESNSKPKSARSLDLPPRLFADAKVAHFASPTIAVDDPIAGQ
        MR AT ENPSSTPPKLSLFSLPR P E PGMVTPPLHASISVPFQWEEAPGKPRPFGIIE NSKPKSARSLDLPPRLFADAKVAHFASPT  VDDPI+G+
Subjt:  MRTATEENPSSTPPKLSLFSLPRPPSEPPGMVTPPLHASISVPFQWEEAPGKPRPFGIIESNSKPKSARSLDLPPRLFADAKVAHFASPTIAVDDPIAGQ

Query:  DLSSGLSFRFPDSWAE----TATETKEGKDVKYVGTRRWMSFRKNKEIPKGGSEILVSGGGADGGGG--GRGDGDTRVKITRFRSRRSFFGTSNSKSHLI
        DLSS LSFRFPD+WAE    TAT T+ GKD KYVG+RRWMSFRKNKEIPKGGSEI +S GGADGGG   G GDG+TRVKITRFRSRRSFF   NSK HLI
Subjt:  DLSSGLSFRFPDSWAE----TATETKEGKDVKYVGTRRWMSFRKNKEIPKGGSEILVSGGGADGGGG--GRGDGDTRVKITRFRSRRSFFGTSNSKSHLI

Query:  ANIYGSLKQVIPWRR
        A+IYGSLKQ IPWR+
Subjt:  ANIYGSLKQVIPWRR

TrEMBL top hitse value%identityAlignment
A0A1S3CS64 uncharacterized protein LOC1035041441.4e-9079.65Show/hide
Query:  MRTATEENPSSTPPKLSLFSLPRPPSEPPGMVTPPLHASISVPFQWEEAPGKPRPFGIIESNSKPKSARSLDLPPRLFADAKVAHFASPTIAVDDPIAGQ
        MR  T ENPSSTPPKLSLFSLPR P EPPGMVTPPLHASISVPFQWEEAPGKPRPFGIIE NSKPKSARSLDLPPRLFADAKVAHFASPT AVD+PI G+
Subjt:  MRTATEENPSSTPPKLSLFSLPRPPSEPPGMVTPPLHASISVPFQWEEAPGKPRPFGIIESNSKPKSARSLDLPPRLFADAKVAHFASPTIAVDDPIAGQ

Query:  DLSSGLSFRFPDSWAE--------TATETKEGKDVKYVGTRRWMSFRKNKEIPKGGSEILVSGGGADGGGGGRGDGDTRVKITRFRSRRSFFGTSNSKSH
        DLSS LSFRFPD+WAE        TAT T+EGKD KYVG+RRWMSFRKNKEIPK GSEI V+GGG      G  DG+TRVKITRFRSRRSFF   NSKSH
Subjt:  DLSSGLSFRFPDSWAE--------TATETKEGKDVKYVGTRRWMSFRKNKEIPKGGSEILVSGGGADGGGGGRGDGDTRVKITRFRSRRSFFGTSNSKSH

Query:  LIANIYGSLKQVIPWRRKHDEMRRVS
         IANIYGSLKQ I WRRK DEM  +S
Subjt:  LIANIYGSLKQVIPWRRKHDEMRRVS

A0A5A7U5K0 Myosin heavy chain1.5e-8981.19Show/hide
Query:  ENPSSTPPKLSLFSLPRPPSEPPGMVTPPLHASISVPFQWEEAPGKPRPFGIIESNSKPKSARSLDLPPRLFADAKVAHFASPTIAVDDPIAGQDLSSGL
        ENPSSTPPKLSLFSLPR P EPPGMVTPPLHASISVPFQWEEAPGKPRPFGIIE NSKPKSARSLDLPPRLFADAKVAHFASPT AVD+PI G+DLSS L
Subjt:  ENPSSTPPKLSLFSLPRPPSEPPGMVTPPLHASISVPFQWEEAPGKPRPFGIIESNSKPKSARSLDLPPRLFADAKVAHFASPTIAVDDPIAGQDLSSGL

Query:  SFRFPDSWAE------TATETKEGKDVKYVGTRRWMSFRKNKEIPKGGSEILVSGGGADGGGGGRGDGDTRVKITRFRSRRSFFGTSNSKSHLIANIYGS
        SFRFPD+WAE      TAT T+EGKD KYVG+RRWMSFRKNKEIPK GSEI V+GGG      G  DG+TRVKITRFRSRRSFF   NSKSH IANIYGS
Subjt:  SFRFPDSWAE------TATETKEGKDVKYVGTRRWMSFRKNKEIPKGGSEILVSGGGADGGGGGRGDGDTRVKITRFRSRRSFFGTSNSKSHLIANIYGS

Query:  LKQVIPWRRKHDEMRRVS
        LKQ I WRRK DEM  +S
Subjt:  LKQVIPWRRKHDEMRRVS

A0A5D3E0H1 Myosin heavy chain2.6e-8980.45Show/hide
Query:  ENPSSTPPKLSLFSLPRPPSEPPGMVTPPLHASISVPFQWEEAPGKPRPFGIIESNSKPKSARSLDLPPRLFADAKVAHFASPTIAVDDPIAGQDLSSGL
        ENPSSTPPKLSLFSLPR P EPPGMVTPPLHASISVPFQWEEAPGKPRPFGIIE NSKPKSARSLDLPPRLFADAKVAHFASPT AVD+PI G+DLSS L
Subjt:  ENPSSTPPKLSLFSLPRPPSEPPGMVTPPLHASISVPFQWEEAPGKPRPFGIIESNSKPKSARSLDLPPRLFADAKVAHFASPTIAVDDPIAGQDLSSGL

Query:  SFRFPDSWAE--------TATETKEGKDVKYVGTRRWMSFRKNKEIPKGGSEILVSGGGADGGGGGRGDGDTRVKITRFRSRRSFFGTSNSKSHLIANIY
        SFRFPD+WAE        TAT T+EGKD KYVG+RRWMSFRKNKEIPK GSEI V+GGG      G  DG+TRVKITRFRSRRSFF   NSKSH IANIY
Subjt:  SFRFPDSWAE--------TATETKEGKDVKYVGTRRWMSFRKNKEIPKGGSEILVSGGGADGGGGGRGDGDTRVKITRFRSRRSFFGTSNSKSHLIANIY

Query:  GSLKQVIPWRRKHDEMRRVS
        GSLKQ I WRRK DEM  +S
Subjt:  GSLKQVIPWRRKHDEMRRVS

A0A6J1E520 uncharacterized protein At4g00950-like4.4e-8979.19Show/hide
Query:  MRTATEENPSSTPPKLSLFSLPRPPSEPPGMVTPPLHASISVPFQWEEAPGKPRPFGIIESNSKPKSARSLDLPPRLFADAKVAHFASPTIAVDDPIAGQ
        MR A  ENPSSTPPKLSLFSLPR P EPPG+VTPPLHASISVPFQWEEAPGKPRPFG+IE NSKP+SARSLDLPPRLF D KVAHF+SPT AVDDP+ GQ
Subjt:  MRTATEENPSSTPPKLSLFSLPRPPSEPPGMVTPPLHASISVPFQWEEAPGKPRPFGIIESNSKPKSARSLDLPPRLFADAKVAHFASPTIAVDDPIAGQ

Query:  DLSSGLSFRFPDSWAETATETKEGKDVKYVGTRRWMSFRKNKEIPKGGSEILVSGGGADGGG--GGRGDGDTRVKITRFRSRRSFFGTSNSKSHLIANIY
        DLSS LSFRFPD+WAETA  T+EGK  KYVG+RRWMSFRKNKE+PKG SEI  S GG   GG   G GDG+TRVKITRFRSRRS F  S+SKS LIA+IY
Subjt:  DLSSGLSFRFPDSWAETATETKEGKDVKYVGTRRWMSFRKNKEIPKGGSEILVSGGGADGGG--GGRGDGDTRVKITRFRSRRSFFGTSNSKSHLIANIY

Query:  GSLKQVIPWRRKHDEMRRVSR
        GSLKQVIPWRRK DE R VS+
Subjt:  GSLKQVIPWRRKHDEMRRVSR

E5GBA5 Uncharacterized protein1.4e-9079.65Show/hide
Query:  MRTATEENPSSTPPKLSLFSLPRPPSEPPGMVTPPLHASISVPFQWEEAPGKPRPFGIIESNSKPKSARSLDLPPRLFADAKVAHFASPTIAVDDPIAGQ
        MR  T ENPSSTPPKLSLFSLPR P EPPGMVTPPLHASISVPFQWEEAPGKPRPFGIIE NSKPKSARSLDLPPRLFADAKVAHFASPT AVD+PI G+
Subjt:  MRTATEENPSSTPPKLSLFSLPRPPSEPPGMVTPPLHASISVPFQWEEAPGKPRPFGIIESNSKPKSARSLDLPPRLFADAKVAHFASPTIAVDDPIAGQ

Query:  DLSSGLSFRFPDSWAE--------TATETKEGKDVKYVGTRRWMSFRKNKEIPKGGSEILVSGGGADGGGGGRGDGDTRVKITRFRSRRSFFGTSNSKSH
        DLSS LSFRFPD+WAE        TAT T+EGKD KYVG+RRWMSFRKNKEIPK GSEI V+GGG      G  DG+TRVKITRFRSRRSFF   NSKSH
Subjt:  DLSSGLSFRFPDSWAE--------TATETKEGKDVKYVGTRRWMSFRKNKEIPKGGSEILVSGGGADGGGGGRGDGDTRVKITRFRSRRSFFGTSNSKSH

Query:  LIANIYGSLKQVIPWRRKHDEMRRVS
         IANIYGSLKQ I WRRK DEM  +S
Subjt:  LIANIYGSLKQVIPWRRKHDEMRRVS

SwissProt top hitse value%identityAlignment
Q9M160 Uncharacterized protein At4g009503.8e-0527.27Show/hide
Query:  TEENPSSTPPKLSLFSLPRPPSEPPGMVTPPLHASI--SVPFQWEEAPGKPRPFGIIESNSKPKSA------------RSLDLPPRLFADAK----VAHF
        TE+  + T  KL +  LP  P+     ++ P+H+SI  SVPF WEE PGKP+      S+S   S             +SL+LPPRL    K    V   
Subjt:  TEENPSSTPPKLSLFSLPRPPSEPPGMVTPPLHASI--SVPFQWEEAPGKPRPFGIIESNSKPKSA------------RSLDLPPRLFADAK----VAHF

Query:  ASPTIAVDDP-------------------------------IAG--QDLSSGLSFRFPDSWAETATETKEGKDVKYVGTRRWMSFRKNKEIPKGGSEILV
         SP    D P                               I G  +DL  G   +   S    A   K G+ + + G RR  + +   E  + GS +  
Subjt:  ASPTIAVDDP-------------------------------IAG--QDLSSGLSFRFPDSWAETATETKEGKDVKYVGTRRWMSFRKNKEIPKGGSEILV

Query:  SG-----------------------GGADGGGGGRGDGDTRVKITRFRSRRSFFGT------SNSKSHLIANIYGSLKQVIPWRRK
        S                           DG    +      VKI+   SR   F T      S+SKSH   N+Y  LKQV+PW+ K
Subjt:  SG-----------------------GGADGGGGGRGDGDTRVKITRFRSRRSFFGT------SNSKSHLIANIYGSLKQVIPWRRK

Arabidopsis top hitse value%identityAlignment
AT4G00950.1 Protein of unknown function (DUF688)2.7e-0627.27Show/hide
Query:  TEENPSSTPPKLSLFSLPRPPSEPPGMVTPPLHASI--SVPFQWEEAPGKPRPFGIIESNSKPKSA------------RSLDLPPRLFADAK----VAHF
        TE+  + T  KL +  LP  P+     ++ P+H+SI  SVPF WEE PGKP+      S+S   S             +SL+LPPRL    K    V   
Subjt:  TEENPSSTPPKLSLFSLPRPPSEPPGMVTPPLHASI--SVPFQWEEAPGKPRPFGIIESNSKPKSA------------RSLDLPPRLFADAK----VAHF

Query:  ASPTIAVDDP-------------------------------IAG--QDLSSGLSFRFPDSWAETATETKEGKDVKYVGTRRWMSFRKNKEIPKGGSEILV
         SP    D P                               I G  +DL  G   +   S    A   K G+ + + G RR  + +   E  + GS +  
Subjt:  ASPTIAVDDP-------------------------------IAG--QDLSSGLSFRFPDSWAETATETKEGKDVKYVGTRRWMSFRKNKEIPKGGSEILV

Query:  SG-----------------------GGADGGGGGRGDGDTRVKITRFRSRRSFFGT------SNSKSHLIANIYGSLKQVIPWRRK
        S                           DG    +      VKI+   SR   F T      S+SKSH   N+Y  LKQV+PW+ K
Subjt:  SG-----------------------GGADGGGGGRGDGDTRVKITRFRSRRSFFGT------SNSKSHLIANIYGSLKQVIPWRRK

AT4G27810.1 unknown protein1.6e-1934.68Show/hide
Query:  PKLSLFSLP-RPPSEPPGMVTPPLHASISVPFQWEEAPGKPRPFGIIESNSKPKSA-------------RSLDLPPRLFADAKVAHFASPTIAVDDPIAG
        PKL LFS+P     + PG+ TPP++ + SVPF WEEAPGKPR    +   +KP ++             R L+LPPRLF  A      SPT  +D P   
Subjt:  PKLSLFSLP-RPPSEPPGMVTPPLHASISVPFQWEEAPGKPRPFGIIESNSKPKSA-------------RSLDLPPRLFADAKVAHFASPTIAVDDPIAG

Query:  QDLSSGLSFRFPDSWAETATETKEGKDVKYVGTRRWMS-FRKNKEIPKGGSEILVSGGG--ADGGGGGRGDGDTRVKITRFRSRRSFFGTSNSKSHLIAN
                                     Y   RR +S  R+++   +G  E   S      DGGGG      T VKI+R R + S    S+SKS  +A 
Subjt:  QDLSSGLSFRFPDSWAETATETKEGKDVKYVGTRRWMS-FRKNKEIPKGGSEILVSGGG--ADGGGGGRGDGDTRVKITRFRSRRSFFGTSNSKSHLIAN

Query:  IYGSLKQVIPWRRKHDEMRRVS
        +Y   KQVIPWRR+ + + R+S
Subjt:  IYGSLKQVIPWRRKHDEMRRVS

AT5G53030.1 unknown protein3.6e-1934.06Show/hide
Query:  SSTPPKLSLFSLPR---PPSEPPGMVTPPLHASISVPFQWEEAPGKPRPFGIIESNSKPKSARSLDLPPRLF--ADAKVAHFASPTIAVDDP--IAGQDL
        SST  +L LFS P         PG+ TPP++ + SVPF WEEAPGKPR        ++    RSL+LPPRL    ++   +  SPT  +D P  +  + L
Subjt:  SSTPPKLSLFSLPR---PPSEPPGMVTPPLHASISVPFQWEEAPGKPRPFGIIESNSKPKSARSLDLPPRLF--ADAKVAHFASPTIAVDDP--IAGQDL

Query:  SSGLSFRFPDSWAETATETKEGKDVKYVGTRRWMSFRKNKEIPKG---GSEILVSG-----GGADGGGGGRGDGDTRVKITRFRSRRSFFGTSNSKS---
        S   S               E ++    G+ RW SF   KE+ +G    S     G       A GGG G   GD +VK+ R   + SFF  S++     
Subjt:  SSGLSFRFPDSWAETATETKEGKDVKYVGTRRWMSFRKNKEIPKG---GSEILVSG-----GGADGGGGGRGDGDTRVKITRFRSRRSFFGTSNSKS---

Query:  --HLIANIYGSLKQVIPWRRKHDEMRRVS
           + A +Y   KQVIPW+RK + + R +
Subjt:  --HLIANIYGSLKQVIPWRRKHDEMRRVS

AT5G53030.2 unknown protein2.3e-1334.15Show/hide
Query:  SSTPPKLSLFSLPR---PPSEPPGMVTPPLHASISVPFQWEEAPGKPRPFGIIESNSKPKSARSLDLPPRLF--ADAKVAHFASPTIAVDDP--IAGQDL
        SST  +L LFS P         PG+ TPP++ + SVPF WEEAPGKPR        ++    RSL+LPPRL    ++   +  SPT  +D P  +  + L
Subjt:  SSTPPKLSLFSLPR---PPSEPPGMVTPPLHASISVPFQWEEAPGKPRPFGIIESNSKPKSARSLDLPPRLF--ADAKVAHFASPTIAVDDP--IAGQDL

Query:  SSGLSFRFPDSWAETATETKEGKDVKYVGTRRWMSFRKNKEIPKG---GSEILVSG-----GGADGGGGGRGDGDTRVKITRFRSRRSFFGTSN-SKSHL
        S   S               E ++    G+ RW SF   KE+ +G    S     G       A GGG G   GD +VK+ R   + SFF  S+ +KS  
Subjt:  SSGLSFRFPDSWAETATETKEGKDVKYVGTRRWMSFRKNKEIPKG---GSEILVSG-----GGADGGGGGRGDGDTRVKITRFRSRRSFFGTSN-SKSHL

Query:  IANIY
          + Y
Subjt:  IANIY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGACTGCAACAGAGGAAAATCCGAGCTCCACGCCACCAAAGCTGTCTCTGTTTTCTCTTCCAAGGCCGCCGTCGGAGCCGCCGGGGATGGTGACGCCGCCGCTGCA
CGCGTCGATTTCTGTGCCGTTTCAGTGGGAGGAGGCGCCGGGGAAGCCGAGGCCATTCGGAATTATTGAATCGAATTCAAAGCCCAAAAGTGCAAGATCCTTGGATCTGC
CGCCGAGGCTGTTCGCCGACGCCAAAGTAGCCCATTTTGCCTCTCCGACGATCGCCGTCGACGACCCCATCGCCGGCCAAGACCTGTCTTCCGGTTTGTCGTTCAGGTTC
CCGGACAGTTGGGCGGAGACGGCGACGGAGACGAAGGAGGGCAAGGATGTTAAATACGTTGGGACTAGGCGGTGGATGAGCTTTAGGAAGAATAAGGAGATCCCGAAGGG
TGGGTCTGAAATCTTGGTCTCGGGCGGCGGTGCTGACGGCGGTGGCGGCGGCCGTGGCGACGGCGATACAAGGGTAAAGATCACAAGGTTTAGGAGCAGAAGAAGCTTTT
TTGGGACGTCGAATTCAAAGTCGCACTTGATCGCAAATATTTATGGGAGCTTGAAGCAAGTGATTCCATGGAGGCGTAAGCATGATGAAATGAGAAGAGTATCACGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGGACTGCAACAGAGGAAAATCCGAGCTCCACGCCACCAAAGCTGTCTCTGTTTTCTCTTCCAAGGCCGCCGTCGGAGCCGCCGGGGATGGTGACGCCGCCGCTGCA
CGCGTCGATTTCTGTGCCGTTTCAGTGGGAGGAGGCGCCGGGGAAGCCGAGGCCATTCGGAATTATTGAATCGAATTCAAAGCCCAAAAGTGCAAGATCCTTGGATCTGC
CGCCGAGGCTGTTCGCCGACGCCAAAGTAGCCCATTTTGCCTCTCCGACGATCGCCGTCGACGACCCCATCGCCGGCCAAGACCTGTCTTCCGGTTTGTCGTTCAGGTTC
CCGGACAGTTGGGCGGAGACGGCGACGGAGACGAAGGAGGGCAAGGATGTTAAATACGTTGGGACTAGGCGGTGGATGAGCTTTAGGAAGAATAAGGAGATCCCGAAGGG
TGGGTCTGAAATCTTGGTCTCGGGCGGCGGTGCTGACGGCGGTGGCGGCGGCCGTGGCGACGGCGATACAAGGGTAAAGATCACAAGGTTTAGGAGCAGAAGAAGCTTTT
TTGGGACGTCGAATTCAAAGTCGCACTTGATCGCAAATATTTATGGGAGCTTGAAGCAAGTGATTCCATGGAGGCGTAAGCATGATGAAATGAGAAGAGTATCACGGTGA
Protein sequenceShow/hide protein sequence
MRTATEENPSSTPPKLSLFSLPRPPSEPPGMVTPPLHASISVPFQWEEAPGKPRPFGIIESNSKPKSARSLDLPPRLFADAKVAHFASPTIAVDDPIAGQDLSSGLSFRF
PDSWAETATETKEGKDVKYVGTRRWMSFRKNKEIPKGGSEILVSGGGADGGGGGRGDGDTRVKITRFRSRRSFFGTSNSKSHLIANIYGSLKQVIPWRRKHDEMRRVSR