; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg22746 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg22746
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Descriptionvitellogenin-like
Genome locationCarg_Chr18:178497..179231
RNA-Seq ExpressionCarg22746
SyntenyCarg22746
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6572943.1 hypothetical protein SDJN03_26830, partial [Cucurbita argyrosperma subsp. sororia]1.6e-134100Show/hide
Query:  MCSETTSPRISFSHYLLGEGGLPIDHRLPDITLLDSNLDFKFSISIEHESSTADELFGNGIILPIKAESHKQSHPFEVPFTASLPPLPPKQNSNHTLKKI
        MCSETTSPRISFSHYLLGEGGLPIDHRLPDITLLDSNLDFKFSISIEHESSTADELFGNGIILPIKAESHKQSHPFEVPFTASLPPLPPKQNSNHTLKKI
Subjt:  MCSETTSPRISFSHYLLGEGGLPIDHRLPDITLLDSNLDFKFSISIEHESSTADELFGNGIILPIKAESHKQSHPFEVPFTASLPPLPPKQNSNHTLKKI

Query:  RVVNDSSSSHQLEQRDAESKSFWGSKRSSSINFEGKRTSLCPLPLLSRSISTGSDSNPKSKKHKDSQKQISQKQYSTSMRKSSSPPPPPPSLNQYPTNVR
        RVVNDSSSSHQLEQRDAESKSFWGSKRSSSINFEGKRTSLCPLPLLSRSISTGSDSNPKSKKHKDSQKQISQKQYSTSMRKSSSPPPPPPSLNQYPTNVR
Subjt:  RVVNDSSSSHQLEQRDAESKSFWGSKRSSSINFEGKRTSLCPLPLLSRSISTGSDSNPKSKKHKDSQKQISQKQYSTSMRKSSSPPPPPPSLNQYPTNVR

Query:  PQMCKNPGGVYGNYHYIGSVLNVPPKFFGFGSLLLCGIDRKSKK
        PQMCKNPGGVYGNYHYIGSVLNVPPKFFGFGSLLLCGIDRKSKK
Subjt:  PQMCKNPGGVYGNYHYIGSVLNVPPKFFGFGSLLLCGIDRKSKK

XP_022955104.1 uncharacterized protein LOC111457172 [Cucurbita moschata]3.9e-13399.59Show/hide
Query:  MCSETTSPRISFSHYLLGEGGLPIDHRLPDITLLDSNLDFKFSISIEHESSTADELFGNGIILPIKAESHKQSHPFEVPFTASLPPLPPKQNSNHTLKKI
        MCSETTSPRISFSHYLLGEGGLPIDHRLPDITLLDSNLDFKFSISIEHESSTADELFGNGIILPIKAESHKQSHPFEVPFTASLPPLPPKQNSNHTLKKI
Subjt:  MCSETTSPRISFSHYLLGEGGLPIDHRLPDITLLDSNLDFKFSISIEHESSTADELFGNGIILPIKAESHKQSHPFEVPFTASLPPLPPKQNSNHTLKKI

Query:  RVVNDSSSSHQLEQRDAESKSFWGSKRSSSINFEGKRTSLCPLPLLSRSISTGSDSNPKSKKHKDSQKQISQKQYSTSMRKSSS-PPPPPPSLNQYPTNV
        RVVNDSSSSHQLEQRDAESKSFWGSKRSSSINFEGKRTSLCPLPLLSRSISTGSDSNPKSKKHKDSQKQISQKQYSTSMRKSSS PPPPPPSLNQYPTNV
Subjt:  RVVNDSSSSHQLEQRDAESKSFWGSKRSSSINFEGKRTSLCPLPLLSRSISTGSDSNPKSKKHKDSQKQISQKQYSTSMRKSSS-PPPPPPSLNQYPTNV

Query:  RPQMCKNPGGVYGNYHYIGSVLNVPPKFFGFGSLLLCGIDRKSKK
        RPQMCKNPGGVYGNYHYIGSVLNVPPKFFGFGSLLLCGIDRKSKK
Subjt:  RPQMCKNPGGVYGNYHYIGSVLNVPPKFFGFGSLLLCGIDRKSKK

XP_022994181.1 uncharacterized protein LOC111489997 [Cucurbita maxima]4.1e-13097.57Show/hide
Query:  MCSETTSPRISFSHYLLGEGGLPIDHRLPDITLLDSNLDFKFSISIEHESSTADELFGNGIILPIKAESHKQSHPFEVPFTASLPPLPPKQNSNHTLKKI
        MCSETTSPRISFSHYLLGEGGLPI+HRLPDITLLD+NLDFKFSISIEHESSTADELF NGIILPIKAESHKQSHPFEVPFTASLPPLPPKQNSNHTLKKI
Subjt:  MCSETTSPRISFSHYLLGEGGLPIDHRLPDITLLDSNLDFKFSISIEHESSTADELFGNGIILPIKAESHKQSHPFEVPFTASLPPLPPKQNSNHTLKKI

Query:  RVVNDSSSSHQLEQRDAESKSFWGSKRSSSINFEGKRTSLCPLP-LLSRSISTGSDSNPKSKKHKDSQKQISQKQYSTSMRKSSS--PPPPPPSLNQYPT
        RVVNDSSSSHQLEQRDAESKSFWGSKRSSSINFEGKRTSLCPLP LLSRSISTGSDSNPKSKKHKDSQKQISQKQYSTSMRKSSS  PPPPPPSLNQYPT
Subjt:  RVVNDSSSSHQLEQRDAESKSFWGSKRSSSINFEGKRTSLCPLP-LLSRSISTGSDSNPKSKKHKDSQKQISQKQYSTSMRKSSS--PPPPPPSLNQYPT

Query:  NVRPQMCKNPGGVYGNYHYIGSVLNVPPKFFGFGSLLLCGIDRKSKK
        NVRPQMCKNPGGVYGNYHYIGSVLNVPPKFFGFGSLLLCGIDRKSKK
Subjt:  NVRPQMCKNPGGVYGNYHYIGSVLNVPPKFFGFGSLLLCGIDRKSKK

XP_023000894.1 uncharacterized protein LOC111495198 [Cucurbita maxima]2.5e-8773.47Show/hide
Query:  MCSETTSPRISFSHYLLGEGGLPID-HRLPDITLLDSNLDFKFSISIEHESSTADELFGNGIILPIKAESHKQSHPFEVPFTASLPPLPPKQNSNHTLKK
        MCSET+SPRISFSH L G+  L ID H   D+TLLDSNLDF+F+ISI+HESS+ADELF NG+I+P K ESHKQSHPFE P TASLPPLPP +NS  TL  
Subjt:  MCSETTSPRISFSHYLLGEGGLPID-HRLPDITLLDSNLDFKFSISIEHESSTADELFGNGIILPIKAESHKQSHPFEVPFTASLPPLPPKQNSNHTLKK

Query:  IRVVNDSSSSHQLEQRDAESKSFWGSKRSSSINFEGKRTSLCPLPLLSRSISTGSDSNPKSKKHKDSQKQISQKQYSTSMRKSSSPPPPPPSLNQYPTNV
          VVN S+SS QLEQR +ESKSFWG KRSSS+NFE KR SLCPLPLLSRS STGS  N KSKK K SQKQISQKQYSTS RK SS   P  SLNQYPTN+
Subjt:  IRVVNDSSSSHQLEQRDAESKSFWGSKRSSSINFEGKRTSLCPLPLLSRSISTGSDSNPKSKKHKDSQKQISQKQYSTSMRKSSSPPPPPPSLNQYPTNV

Query:  RPQMCKNPGGVYGNYHYIGSVLNVPPKFFGFGSLLLCGIDRKSKK
        +PQ+C+NPGG YG YH IG VLNVPPKFFG GS+L CG DRKSKK
Subjt:  RPQMCKNPGGVYGNYHYIGSVLNVPPKFFGFGSLLLCGIDRKSKK

XP_023542924.1 uncharacterized protein LOC111802696 [Cucurbita pepo subsp. pepo]2.8e-13197.95Show/hide
Query:  MCSETTSPRISFSHYLLGEGGLPIDHRLPDITLLDSNLDFKFSISIEHESSTADELFGNGIILPIKAESHKQSHPFEVPFTASLPPLPPKQNSNHTLKKI
        MCSETTSPRISFSHYLLGEGGLPI+HRLPDITLLDSNLDFKFSISIEHESSTADELF NGIILPIK ESHK SHPFEVPFTASLPPLPPKQNS HTLKKI
Subjt:  MCSETTSPRISFSHYLLGEGGLPIDHRLPDITLLDSNLDFKFSISIEHESSTADELFGNGIILPIKAESHKQSHPFEVPFTASLPPLPPKQNSNHTLKKI

Query:  RVVNDSSSSHQLEQRDAESKSFWGSKRSSSINFEGKRTSLCPLPLLSRSISTGSDSNPKSKKHKDSQKQISQKQYSTSMRKSSSPPPPPPSLNQYPTNVR
        RVVNDSSSSHQLEQRDAESKSFWGSKRSSSINFEGKRTSLCPLPLLSRSISTGSDSNPKSKKHKDSQKQISQKQYSTSMRKSSSPPPPPPSLNQYPTNVR
Subjt:  RVVNDSSSSHQLEQRDAESKSFWGSKRSSSINFEGKRTSLCPLPLLSRSISTGSDSNPKSKKHKDSQKQISQKQYSTSMRKSSSPPPPPPSLNQYPTNVR

Query:  PQMCKNPGGVYGNYHYIGSVLNVPPKFFGFGSLLLCGIDRKSKK
        PQMCKNPGGVYGNYHYIGSVLNVPPKFFGFGSLLLCGIDRKSKK
Subjt:  PQMCKNPGGVYGNYHYIGSVLNVPPKFFGFGSLLLCGIDRKSKK

TrEMBL top hitse value%identityAlignment
A0A6J1E7V7 uncharacterized protein LOC111431596 isoform X13.5e-8771.15Show/hide
Query:  MCSETTSPRISFSHYLLGEGGLPI----DHRLPDITLLDSNLDFKFSISIEHESSTADELFGNGIILPIKAESHKQSHPFEVPFTASLPPLPPKQNSNHT
        MCSET+SPRISFSH L  +  L I    +H   D+TLLDSNLDF+F+ISI+HESS+ADELF NG+I+P K ESHKQSHPFE P TASLPPLPP +NS  T
Subjt:  MCSETTSPRISFSHYLLGEGGLPI----DHRLPDITLLDSNLDFKFSISIEHESSTADELFGNGIILPIKAESHKQSHPFEVPFTASLPPLPPKQNSNHT

Query:  LKKIRVVNDSSSSHQLEQRDAESKSFWGSKRSSSINFEGKRTSLCPLPLLSRSISTGSDSNPKSKKHKDSQKQISQKQYSTSMRKSSSPPPPP-----PS
        L    VVN S+SS QLEQR +ESKSFWG KRSSS+NFE KR SLCPLPLLSRS STGS  N KSKK KDSQKQISQKQ+STS RK SSP P P      S
Subjt:  LKKIRVVNDSSSSHQLEQRDAESKSFWGSKRSSSINFEGKRTSLCPLPLLSRSISTGSDSNPKSKKHKDSQKQISQKQYSTSMRKSSSPPPPP-----PS

Query:  LNQYPTNVRPQMCKNPGGVYGNYHYIGSVLNVPPKFFGFGSLLLCGIDRKSKK
        LNQYPTN++PQMC+NPGG YG YH IG VLNVPP+FFG GS+L CG DRKSKK
Subjt:  LNQYPTNVRPQMCKNPGGVYGNYHYIGSVLNVPPKFFGFGSLLLCGIDRKSKK

A0A6J1E831 uncharacterized protein LOC111431596 isoform X21.5e-8570.97Show/hide
Query:  MCSETTSPRISFSHYLLGEGGLPI----DHRLPDITLLDSNLDFKFSISIEHESSTADELFGNGIILPIKAESHKQSHPFEVPFTASLPPLPPKQNSNHT
        MCSET+SPRISFSH L  +  L I    +H   D+TLLDSNLDF+F+ISI+HESS+ADELF NG+I+P K ESHKQSHPFE P TASLPPLPP +NS  T
Subjt:  MCSETTSPRISFSHYLLGEGGLPI----DHRLPDITLLDSNLDFKFSISIEHESSTADELFGNGIILPIKAESHKQSHPFEVPFTASLPPLPPKQNSNHT

Query:  LKKIRVVNDSSSSHQLEQRDAESKSFWGSKRSSSINFEGKRTSLCPLPLLSRSISTGSDSNPKSKKHKDSQKQISQKQYSTSMRKSSSPPPPPPSLNQYP
        L    VVN S+SS QLEQR +ESKSFWG KRSSS+NFE KR SLCPLPLLSRS STGS  N KSKK KDSQKQISQKQ+STS RK S       SLNQYP
Subjt:  LKKIRVVNDSSSSHQLEQRDAESKSFWGSKRSSSINFEGKRTSLCPLPLLSRSISTGSDSNPKSKKHKDSQKQISQKQYSTSMRKSSSPPPPPPSLNQYP

Query:  TNVRPQMCKNPGGVYGNYHYIGSVLNVPPKFFGFGSLLLCGIDRKSKK
        TN++PQMC+NPGG YG YH IG VLNVPP+FFG GS+L CG DRKSKK
Subjt:  TNVRPQMCKNPGGVYGNYHYIGSVLNVPPKFFGFGSLLLCGIDRKSKK

A0A6J1GV05 uncharacterized protein LOC1114571721.9e-13399.59Show/hide
Query:  MCSETTSPRISFSHYLLGEGGLPIDHRLPDITLLDSNLDFKFSISIEHESSTADELFGNGIILPIKAESHKQSHPFEVPFTASLPPLPPKQNSNHTLKKI
        MCSETTSPRISFSHYLLGEGGLPIDHRLPDITLLDSNLDFKFSISIEHESSTADELFGNGIILPIKAESHKQSHPFEVPFTASLPPLPPKQNSNHTLKKI
Subjt:  MCSETTSPRISFSHYLLGEGGLPIDHRLPDITLLDSNLDFKFSISIEHESSTADELFGNGIILPIKAESHKQSHPFEVPFTASLPPLPPKQNSNHTLKKI

Query:  RVVNDSSSSHQLEQRDAESKSFWGSKRSSSINFEGKRTSLCPLPLLSRSISTGSDSNPKSKKHKDSQKQISQKQYSTSMRKSSS-PPPPPPSLNQYPTNV
        RVVNDSSSSHQLEQRDAESKSFWGSKRSSSINFEGKRTSLCPLPLLSRSISTGSDSNPKSKKHKDSQKQISQKQYSTSMRKSSS PPPPPPSLNQYPTNV
Subjt:  RVVNDSSSSHQLEQRDAESKSFWGSKRSSSINFEGKRTSLCPLPLLSRSISTGSDSNPKSKKHKDSQKQISQKQYSTSMRKSSS-PPPPPPSLNQYPTNV

Query:  RPQMCKNPGGVYGNYHYIGSVLNVPPKFFGFGSLLLCGIDRKSKK
        RPQMCKNPGGVYGNYHYIGSVLNVPPKFFGFGSLLLCGIDRKSKK
Subjt:  RPQMCKNPGGVYGNYHYIGSVLNVPPKFFGFGSLLLCGIDRKSKK

A0A6J1JYE0 uncharacterized protein LOC1114899972.0e-13097.57Show/hide
Query:  MCSETTSPRISFSHYLLGEGGLPIDHRLPDITLLDSNLDFKFSISIEHESSTADELFGNGIILPIKAESHKQSHPFEVPFTASLPPLPPKQNSNHTLKKI
        MCSETTSPRISFSHYLLGEGGLPI+HRLPDITLLD+NLDFKFSISIEHESSTADELF NGIILPIKAESHKQSHPFEVPFTASLPPLPPKQNSNHTLKKI
Subjt:  MCSETTSPRISFSHYLLGEGGLPIDHRLPDITLLDSNLDFKFSISIEHESSTADELFGNGIILPIKAESHKQSHPFEVPFTASLPPLPPKQNSNHTLKKI

Query:  RVVNDSSSSHQLEQRDAESKSFWGSKRSSSINFEGKRTSLCPLP-LLSRSISTGSDSNPKSKKHKDSQKQISQKQYSTSMRKSSS--PPPPPPSLNQYPT
        RVVNDSSSSHQLEQRDAESKSFWGSKRSSSINFEGKRTSLCPLP LLSRSISTGSDSNPKSKKHKDSQKQISQKQYSTSMRKSSS  PPPPPPSLNQYPT
Subjt:  RVVNDSSSSHQLEQRDAESKSFWGSKRSSSINFEGKRTSLCPLP-LLSRSISTGSDSNPKSKKHKDSQKQISQKQYSTSMRKSSS--PPPPPPSLNQYPT

Query:  NVRPQMCKNPGGVYGNYHYIGSVLNVPPKFFGFGSLLLCGIDRKSKK
        NVRPQMCKNPGGVYGNYHYIGSVLNVPPKFFGFGSLLLCGIDRKSKK
Subjt:  NVRPQMCKNPGGVYGNYHYIGSVLNVPPKFFGFGSLLLCGIDRKSKK

A0A6J1KL87 uncharacterized protein LOC1114951981.2e-8773.47Show/hide
Query:  MCSETTSPRISFSHYLLGEGGLPID-HRLPDITLLDSNLDFKFSISIEHESSTADELFGNGIILPIKAESHKQSHPFEVPFTASLPPLPPKQNSNHTLKK
        MCSET+SPRISFSH L G+  L ID H   D+TLLDSNLDF+F+ISI+HESS+ADELF NG+I+P K ESHKQSHPFE P TASLPPLPP +NS  TL  
Subjt:  MCSETTSPRISFSHYLLGEGGLPID-HRLPDITLLDSNLDFKFSISIEHESSTADELFGNGIILPIKAESHKQSHPFEVPFTASLPPLPPKQNSNHTLKK

Query:  IRVVNDSSSSHQLEQRDAESKSFWGSKRSSSINFEGKRTSLCPLPLLSRSISTGSDSNPKSKKHKDSQKQISQKQYSTSMRKSSSPPPPPPSLNQYPTNV
          VVN S+SS QLEQR +ESKSFWG KRSSS+NFE KR SLCPLPLLSRS STGS  N KSKK K SQKQISQKQYSTS RK SS   P  SLNQYPTN+
Subjt:  IRVVNDSSSSHQLEQRDAESKSFWGSKRSSSINFEGKRTSLCPLPLLSRSISTGSDSNPKSKKHKDSQKQISQKQYSTSMRKSSSPPPPPPSLNQYPTNV

Query:  RPQMCKNPGGVYGNYHYIGSVLNVPPKFFGFGSLLLCGIDRKSKK
        +PQ+C+NPGG YG YH IG VLNVPPKFFG GS+L CG DRKSKK
Subjt:  RPQMCKNPGGVYGNYHYIGSVLNVPPKFFGFGSLLLCGIDRKSKK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G48780.1 unknown protein2.3e-1435.35Show/hide
Query:  DITLLD-SNLDFKFSISIEH---ESSTADELFGNGIILPIKAES----HKQSHPFEV-PFTASLPPLPPKQNSNHTLKKIRVVND-SSSSHQLEQRDAES
        D TLLD SN DF+F IS      +SS ADE+F +G+ILP    +     K+ + +E+ P T+SL P P       T    +  N  +S ++   + +  S
Subjt:  DITLLD-SNLDFKFSISIEH---ESSTADELFGNGIILPIKAES----HKQSHPFEV-PFTASLPPLPPKQNSNHTLKKIRVVND-SSSSHQLEQRDAES

Query:  KSFWGSKRSSSINFEGKRTSLCPLPLLSRSISTGSDSNPKSKKHKDSQKQISQKQYSTSMRKSSSPPPPPPSLNQYPTNVRPQMCKNPGGVYGNYHYIGS
        KSFW  KRSSS+N + K++ +C  P L+RS STGS +N K    +D         +  S R S          N Y    RPQ      G  G    +  
Subjt:  KSFWGSKRSSSINFEGKRTSLCPLPLLSRSISTGSDSNPKSKKHKDSQKQISQKQYSTSMRKSSSPPPPPPSLNQYPTNVRPQMCKNPGGVYGNYHYIGS

Query:  VLNVPPKFFGFGSLL
        VLN  P  FG GS+L
Subjt:  VLNVPPKFFGFGSLL

AT1G67050.1 unknown protein1.2e-1533.21Show/hide
Query:  SPRISFSHYLLGEGGLPIDHR-----LPDITLLDSNLDFKFSI--------SIEHESSTADELFGNGIILPIKAESHKQSHPFEVPFTASLPPLPPKQNS
        SPRISFS        +PI+ R         + L+S++DF F I        S +  S +ADELF NG ILP   E  K+  P +       P   P ++ 
Subjt:  SPRISFSHYLLGEGGLPIDHR-----LPDITLLDSNLDFKFSI--------SIEHESSTADELFGNGIILPIKAESHKQSHPFEVPFTASLPPLPPKQNS

Query:  NHTLKKIRVVNDSSSSHQL---EQRDAESKSFWGSKRSSSINFEGK-RTSLCPLPLLSRSISTGSDSNPKSKKHKDSQKQISQKQYSTSMRKSSSPPPPP
          + K+ +  N+      +    +    +KSFWG KRSSS+N       SLCPLPLL+RS STGS S+ + +       +  + Q S+S+  SSS     
Subjt:  NHTLKKIRVVNDSSSSHQL---EQRDAESKSFWGSKRSSSINFEGK-RTSLCPLPLLSRSISTGSDSNPKSKKHKDSQKQISQKQYSTSMRKSSSPPPPP

Query:  PSLNQYPTNVRPQMCKNPGGV-YGNYH----YIGSVLNVPP--KFFGFGSLLLCGIDRKSKK
         SL+    + +P + K+ GG  YG++      +  V+NV P    FGFGS+       K+KK
Subjt:  PSLNQYPTNVRPQMCKNPGGV-YGNYH----YIGSVLNVPP--KFFGFGSLLLCGIDRKSKK

AT3G18300.1 unknown protein4.0e-1932Show/hide
Query:  MCSETTSPRISFSHYL-LGEGGLPIDHR-----LPDITLLD-SNLDFKFSISIEH---ESSTADELFGNGIILPI-------KAESHKQSHPFEVP----
        +C+E+   R SF+  L   + G P++ +       D TLLD SN DF+F IS      +SS ADE+F +G+ILP+        +   K+ + +E+P    
Subjt:  MCSETTSPRISFSHYL-LGEGGLPIDHR-----LPDITLLD-SNLDFKFSISIEH---ESSTADELFGNGIILPI-------KAESHKQSHPFEVP----

Query:  ---FTASLPPLP---PKQNSNHTLKKIR--VVNDSSSSHQLEQRDAESKSFWGSKRSSSINFEGKRTSLCPLPLLSRSISTGSDSNPKSKKHKDSQKQIS
            ++ LPPLP   P+ +  +++K+ R  +    S ++   + +  SKSFW  KRSSS+N + K++ +C  P L+RS STGS +  K +  +D  K  S
Subjt:  ---FTASLPPLP---PKQNSNHTLKKIR--VVNDSSSSHQLEQRDAESKSFWGSKRSSSINFEGKRTSLCPLPLLSRSISTGSDSNPKSKKHKDSQKQIS

Query:  QKQYSTSMRKSSSPPPPPPS---LNQYPTNVRPQMCKNPGGVYGNYHYIGSVLNVPPKFFGFGSLLLCGIDRKSK
        Q+        + S    PPS    + Y    +    KN GG  G++ +I  V+  P   FG GS+L    ++K K
Subjt:  QKQYSTSMRKSSSPPPPPPS---LNQYPTNVRPQMCKNPGGVYGNYHYIGSVLNVPPKFFGFGSLLLCGIDRKSK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGCTCCGAAACAACCTCGCCAAGGATATCCTTTTCCCACTATCTCCTCGGCGAGGGCGGCCTGCCGATCGACCACAGGCTGCCTGACATAACATTGCTGGATTCAAA
TCTGGACTTCAAGTTCAGCATCAGCATAGAACATGAATCTTCCACTGCAGATGAGCTGTTCGGCAATGGAATCATCCTTCCCATCAAAGCTGAATCCCATAAACAATCCC
ATCCTTTTGAGGTTCCTTTCACAGCCTCACTCCCTCCTCTCCCTCCCAAACAGAATTCAAACCACACACTCAAGAAAATCAGAGTGGTAAATGATTCATCTTCTTCACAT
CAGTTAGAACAGAGAGATGCAGAGTCCAAATCCTTTTGGGGATCCAAAAGAAGTAGCAGTATCAACTTTGAAGGTAAAAGAACATCTCTTTGCCCCCTCCCGCTTCTATC
ACGAAGCATTTCAACTGGGTCAGACTCAAATCCAAAGTCCAAAAAGCATAAAGATTCACAGAAGCAAATTTCTCAGAAGCAGTACTCAACATCGATGAGAAAGTCGTCAT
CACCGCCGCCGCCGCCGCCGTCACTAAATCAATATCCAACAAATGTAAGGCCTCAGATGTGCAAGAATCCAGGAGGGGTTTATGGCAATTACCATTACATTGGGTCTGTG
TTGAATGTGCCACCAAAGTTCTTCGGTTTTGGTTCACTTCTGCTATGTGGGATAGACAGAAAGAGTAAGAAATGA
mRNA sequenceShow/hide mRNA sequence
ATGTGCTCCGAAACAACCTCGCCAAGGATATCCTTTTCCCACTATCTCCTCGGCGAGGGCGGCCTGCCGATCGACCACAGGCTGCCTGACATAACATTGCTGGATTCAAA
TCTGGACTTCAAGTTCAGCATCAGCATAGAACATGAATCTTCCACTGCAGATGAGCTGTTCGGCAATGGAATCATCCTTCCCATCAAAGCTGAATCCCATAAACAATCCC
ATCCTTTTGAGGTTCCTTTCACAGCCTCACTCCCTCCTCTCCCTCCCAAACAGAATTCAAACCACACACTCAAGAAAATCAGAGTGGTAAATGATTCATCTTCTTCACAT
CAGTTAGAACAGAGAGATGCAGAGTCCAAATCCTTTTGGGGATCCAAAAGAAGTAGCAGTATCAACTTTGAAGGTAAAAGAACATCTCTTTGCCCCCTCCCGCTTCTATC
ACGAAGCATTTCAACTGGGTCAGACTCAAATCCAAAGTCCAAAAAGCATAAAGATTCACAGAAGCAAATTTCTCAGAAGCAGTACTCAACATCGATGAGAAAGTCGTCAT
CACCGCCGCCGCCGCCGCCGTCACTAAATCAATATCCAACAAATGTAAGGCCTCAGATGTGCAAGAATCCAGGAGGGGTTTATGGCAATTACCATTACATTGGGTCTGTG
TTGAATGTGCCACCAAAGTTCTTCGGTTTTGGTTCACTTCTGCTATGTGGGATAGACAGAAAGAGTAAGAAATGA
Protein sequenceShow/hide protein sequence
MCSETTSPRISFSHYLLGEGGLPIDHRLPDITLLDSNLDFKFSISIEHESSTADELFGNGIILPIKAESHKQSHPFEVPFTASLPPLPPKQNSNHTLKKIRVVNDSSSSH
QLEQRDAESKSFWGSKRSSSINFEGKRTSLCPLPLLSRSISTGSDSNPKSKKHKDSQKQISQKQYSTSMRKSSSPPPPPPSLNQYPTNVRPQMCKNPGGVYGNYHYIGSV
LNVPPKFFGFGSLLLCGIDRKSKK