; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr021917 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr021917
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionTransmembrane protein
Genome locationtig00153841:1186923..1188817
RNA-Seq ExpressionSgr021917
SyntenySgr021917
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6576125.1 hypothetical protein SDJN03_26764, partial [Cucurbita argyrosperma subsp. sororia]2.2e-4750.66Show/hide
Query:  MSGVTVGVA---NPNEAAQAPQQKSVRRHQQQEQNTVVGGVMGSLRVIELQLVAFIMVFSASGLVPLLDLVFPAFTSAYLLALARFAFPSHGRVSTASQE
        MSGVT+ VA   +PN+A  APQ K +RR QQQEQN+VVGGVMGSLRVIELQLVAFIMVFSASGLVP+LDL+FPAFTSAYLL LARFAFPSHG  S ASQE
Subjt:  MSGVTVGVA---NPNEAAQAPQQKSVRRHQQQEQNTVVGGVMGSLRVIELQLVAFIMVFSASGLVPLLDLVFPAFTSAYLLALARFAFPSHGRVSTASQE

Query:  IFQAS-LCDILLILGMYV---KKTNTSLSG--EQDVQNVRGSWD----------DGGAVSAIGVCVGWICEERRACSPVCDATSILAVVSDSHRECDKWA
        IFQ S L  + +++G  +         L G    D   VR +                +S + +         RA  PV      + V+ D  +  D W 
Subjt:  IFQAS-LCDILLILGMYV---KKTNTSLSG--EQDVQNVRGSWD----------DGGAVSAIGVCVGWICEERRACSPVCDATSILAVVSDSHRECDKWA

Query:  VVVLAAGEGTGATALHGAEDLRHHGLDERCLAQQNSAGECTSQVANFAYFTINLLGFLIPRFLPRAFERYFRDR-DETRAKFQEDKLSSAAPKSQPSDKK
           L A       A              R LA           VANFAYF INL GFLIPRF+PRAF+RYF+DR DET AK QED LSSA PKS PSDKK
Subjt:  VVVLAAGEGTGATALHGAEDLRHHGLDERCLAQQNSAGECTSQVANFAYFTINLLGFLIPRFLPRAFERYFRDR-DETRAKFQEDKLSSAAPKSQPSDKK

Query:  CD
         D
Subjt:  CD

KAG7014643.1 hypothetical protein SDJN02_24822, partial [Cucurbita argyrosperma subsp. argyrosperma]1.3e-4751.97Show/hide
Query:  MSGVTVGVA---NPNEAAQAPQQKSVRRHQQQEQNTVVGGVMGSLRVIELQLVAFIMVFSASGLVPLLDLVFPAFTSAYLLALARFAFPSHGRVSTASQE
        MSGVT+ VA   +PN+A  APQ K +RR QQQEQN+VVGGVMGSLRVIELQLVAFIMVFSASGLVP+LDL+FPAFTSAYLL LARFAFPSHG  S ASQE
Subjt:  MSGVTVGVA---NPNEAAQAPQQKSVRRHQQQEQNTVVGGVMGSLRVIELQLVAFIMVFSASGLVPLLDLVFPAFTSAYLLALARFAFPSHGRVSTASQE

Query:  IFQASLCDILLILGMYVKKTNTSLSGEQDVQNVRGSW---DDGGAVSA---IGVCVGWICEER------------RACSPVCDATSILAVVSDSHRECDK
        IFQA L    L   MYV    T++     +  V G +   D+    SA   + +    I  E             RA  PV      + V+ D  +  D 
Subjt:  IFQASLCDILLILGMYVKKTNTSLSGEQDVQNVRGSW---DDGGAVSA---IGVCVGWICEER------------RACSPVCDATSILAVVSDSHRECDK

Query:  WAVVVLAAGEGTGATALHGAEDLRHHGLDERCLAQQNSAGECTSQVANFAYFTINLLGFLIPRFLPRAFERYFRDR-DETRAKFQEDKLSSAAPKSQPSD
        W    L A       A              R LA           VANFAYF INL GFLIPRF+PRAF+RYF+DR DET AK QED LSSA PKS PSD
Subjt:  WAVVVLAAGEGTGATALHGAEDLRHHGLDERCLAQQNSAGECTSQVANFAYFTINLLGFLIPRFLPRAFERYFRDR-DETRAKFQEDKLSSAAPKSQPSD

Query:  KKCD
        KK D
Subjt:  KKCD

XP_022954375.1 uncharacterized protein LOC111456631 [Cucurbita moschata]2.2e-4751.32Show/hide
Query:  MSGVTVGVA---NPNEAAQAPQQKSVRRHQQQEQNTVVGGVMGSLRVIELQLVAFIMVFSASGLVPLLDLVFPAFTSAYLLALARFAFPSHGRVSTASQE
        MSGVT+ VA   +PN+A  APQ K +RR QQQEQN+VVGGVMGSLRVIELQLVAFIMVFSASGLVP+LDL+FPAFTSAYLL LARFAFPSHG  S ASQE
Subjt:  MSGVTVGVA---NPNEAAQAPQQKSVRRHQQQEQNTVVGGVMGSLRVIELQLVAFIMVFSASGLVPLLDLVFPAFTSAYLLALARFAFPSHGRVSTASQE

Query:  IFQASLCDILLILGMYVKKTNTSLSGEQDVQNVRGSW---DDGGAVSA---IGVCVGWICEER------------RACSPVCDATSILAVVSDSHRECDK
        IFQ S      +  MYV    T++     +  V G +   D+    SA   + +    I  E             RA  PV      + V+ D  +  D 
Subjt:  IFQASLCDILLILGMYVKKTNTSLSGEQDVQNVRGSW---DDGGAVSA---IGVCVGWICEER------------RACSPVCDATSILAVVSDSHRECDK

Query:  WAVVVLAAGEGTGATALHGAEDLRHHGLDERCLAQQNSAGECTSQVANFAYFTINLLGFLIPRFLPRAFERYFRDR-DETRAKFQEDKLSSAAPKSQPSD
        W    L A       A              R LA           VANFAYF INL GFLIPRF+PRAF+RYF+DR DET AK QED LSSA PKS PSD
Subjt:  WAVVVLAAGEGTGATALHGAEDLRHHGLDERCLAQQNSAGECTSQVANFAYFTINLLGFLIPRFLPRAFERYFRDR-DETRAKFQEDKLSSAAPKSQPSD

Query:  KKCD
        KK D
Subjt:  KKCD

XP_022991374.1 uncharacterized protein LOC111488027 [Cucurbita maxima]6.5e-4751.48Show/hide
Query:  MSGVTVGVA---NPNEAAQAPQQKSVRRHQQQEQNTVVGGVMGSLRVIELQLVAFIMVFSASGLVPLLDLVFPAFTSAYLLALARFAFPSHGRVSTASQE
        MSGVT+ VA   +PN+A   PQQK +RR QQQEQNTVVGGVMGSLRVIELQLVAFIMVFSASGLVP+LDL FPAFTSAYLL LARFAFPS G  S ASQE
Subjt:  MSGVTVGVA---NPNEAAQAPQQKSVRRHQQQEQNTVVGGVMGSLRVIELQLVAFIMVFSASGLVPLLDLVFPAFTSAYLLALARFAFPSHGRVSTASQE

Query:  IFQASLCDILLILGMYVKKTNTSLSGEQDVQNVRGSW---DDGGAVSA---IGVCVGWICEER------------RACSPVCDATSILAVVSDSHRECDK
        IFQ S      +  MYV    T++     +  V G +   D+    SA   + +    I  E             RA  PV      + V+ D  +  D 
Subjt:  IFQASLCDILLILGMYVKKTNTSLSGEQDVQNVRGSW---DDGGAVSA---IGVCVGWICEER------------RACSPVCDATSILAVVSDSHRECDK

Query:  WAVVVLAAGEGTGATALHGAEDLRHHGLDERCLAQQNSAGECTSQVANFAYFTINLLGFLIPRFLPRAFERYFRDR--DETRAKFQEDKLSSAAPKSQPS
        W    LAA       A              R LA           VANFAYF INL GFLIPRF+PRAFERYF+DR  DET AK +ED LSSA PKS PS
Subjt:  WAVVVLAAGEGTGATALHGAEDLRHHGLDERCLAQQNSAGECTSQVANFAYFTINLLGFLIPRFLPRAFERYFRDR--DETRAKFQEDKLSSAAPKSQPS

Query:  DKKCD
        DKK D
Subjt:  DKKCD

XP_023550142.1 uncharacterized protein LOC111808420 [Cucurbita pepo subsp. pepo]2.2e-4751.64Show/hide
Query:  MSGVTVGVA---NPNEAAQAPQQKSVRRHQQQEQNTVVGGVMGSLRVIELQLVAFIMVFSASGLVPLLDLVFPAFTSAYLLALARFAFPSHGRVSTASQE
        MSGVT+ VA   +PN+A  APQQK + R QQQEQN+VVGGVMGSLRVIELQLVAFIMVFSASGLVP+LDL+FPAFTSAYLL LARFAFPSHG  S ASQE
Subjt:  MSGVTVGVA---NPNEAAQAPQQKSVRRHQQQEQNTVVGGVMGSLRVIELQLVAFIMVFSASGLVPLLDLVFPAFTSAYLLALARFAFPSHGRVSTASQE

Query:  IFQASLCDILLILGMYVKKTNTSLSGEQDVQNVRGSW---DDGGAVSA---IGVCVGWICEER------------RACSPVCDATSILAVVSDSHRECDK
        IFQ S      +  MYV    T++     +  V G +   D+    SA   + +    I  E             RA  PV      + V+ D  +  D 
Subjt:  IFQASLCDILLILGMYVKKTNTSLSGEQDVQNVRGSW---DDGGAVSA---IGVCVGWICEER------------RACSPVCDATSILAVVSDSHRECDK

Query:  WAVVVLAAGEGTGATALHGAEDLRHHGLDERCLAQQNSAGECTSQVANFAYFTINLLGFLIPRFLPRAFERYFRDR-DETRAKFQEDKLSSAAPKSQPSD
        W    L A       A              R LA           VANFAYF INL GFLIPRF+PRAFERYF+DR DET AK QED LSSA PKS PSD
Subjt:  WAVVVLAAGEGTGATALHGAEDLRHHGLDERCLAQQNSAGECTSQVANFAYFTINLLGFLIPRFLPRAFERYFRDR-DETRAKFQEDKLSSAAPKSQPSD

Query:  KKCD
        KK D
Subjt:  KKCD

TrEMBL top hitse value%identityAlignment
A0A0A0KAF1 Uncharacterized protein8.0e-4348.23Show/hide
Query:  MSGVTVGV---ANPNEAA-----QAPQQK-SVRRHQQQEQNTVVGGVMGSLRVIELQLVAFIMVFSASGLVPLLDLVFPAFTSAYLLALARFAFPSHGRV
        MSGVT+ +    + N+AA     +A Q K  +R+  QQEQNTVVGGVMGSLRVIELQLVAFIMVFSASGLVP LDLVFPAFTSAYLL LARFAFPSHG  
Subjt:  MSGVTVGV---ANPNEAA-----QAPQQK-SVRRHQQQEQNTVVGGVMGSLRVIELQLVAFIMVFSASGLVPLLDLVFPAFTSAYLLALARFAFPSHGRV

Query:  STASQEIFQASLCDILLILGMYVKKTNTSLSGEQDVQNVRGSWDDG------GAVSAIGVCVGWICEER------------RACSPVCDATSILAVVSDS
        ST S EIFQAS      +  MYV    T++     +  V G +  G       A   + +    I  E             RA  P+      + V+ D 
Subjt:  STASQEIFQASLCDILLILGMYVKKTNTSLSGEQDVQNVRGSWDDG------GAVSAIGVCVGWICEER------------RACSPVCDATSILAVVSDS

Query:  HRECDKWAVVVLAAGEGTGATALHGAEDLRHHGLDERCLAQQNSAGECTSQVANFAYFTINLLGFLIPRFLPRAFERYFRDR-DETRAKFQEDKLSS-AA
         +  D W    L A            +D+                   T  VANFAYF INL GFLIPRFLPRAFE+YF++R DE+ AKF EDKLSS AA
Subjt:  HRECDKWAVVVLAAGEGTGATALHGAEDLRHHGLDERCLAQQNSAGECTSQVANFAYFTINLLGFLIPRFLPRAFERYFRDR-DETRAKFQEDKLSS-AA

Query:  PKSQPSDKKCD
         KSQPSDKK D
Subjt:  PKSQPSDKKCD

A0A1S3CB68 uncharacterized protein LOC1034988617.2e-4447.74Show/hide
Query:  MSGVTVGVANPNEAAQAPQQK-------SVRRHQQQEQNTVVGGVMGSLRVIELQLVAFIMVFSASGLVPLLDLVFPAFTSAYLLALARFAFPSHGRVST
        MSGV++ V    E+  A + K        +RR  QQEQN+VVGGVMGSLRVIELQLVAFIMVFSASGLVP LDLVFPAFTSAYLL LARFAFPSHG  ST
Subjt:  MSGVTVGVANPNEAAQAPQQK-------SVRRHQQQEQNTVVGGVMGSLRVIELQLVAFIMVFSASGLVPLLDLVFPAFTSAYLLALARFAFPSHGRVST

Query:  ASQEIFQASLCDILLILGMYVKKTNTSLSGEQDVQNVRGSWDDG------GAVSAIGVCVGWICEER------------RACSPVCDATSILAVVSDSHR
         S EIFQ S      +  MYV    T++     +  V G +  G       A   + +    I  E             RA  P+      + V+ D  +
Subjt:  ASQEIFQASLCDILLILGMYVKKTNTSLSGEQDVQNVRGSWDDG------GAVSAIGVCVGWICEER------------RACSPVCDATSILAVVSDSHR

Query:  ECDKWAVVVLAAGEGTGATALHGAEDLRHHGLDERCLAQQNSAGECTSQVANFAYFTINLLGFLIPRFLPRAFERYFRDR-DETRAKFQEDKLSS--AAP
          D W    L A            +D+                   T  VANFAYF INL GFLIPRFLPRAFE+YF++R DET AKFQEDKLSS  AA 
Subjt:  ECDKWAVVVLAAGEGTGATALHGAEDLRHHGLDERCLAQQNSAGECTSQVANFAYFTINLLGFLIPRFLPRAFERYFRDR-DETRAKFQEDKLSS--AAP

Query:  KSQPSDKKCD
        KSQPSDKK D
Subjt:  KSQPSDKKCD

A0A5D3DMD1 Uncharacterized protein5.6e-4447.74Show/hide
Query:  MSGVTVGVANPNEAAQAPQQK-------SVRRHQQQEQNTVVGGVMGSLRVIELQLVAFIMVFSASGLVPLLDLVFPAFTSAYLLALARFAFPSHGRVST
        MSGV++ V    E+  A + K        +RR  QQEQN+VVGGVMGSLRVIELQLVAFIMVFSASGLVP LDLVFPAFTSAYLL LARFAFPSHG+ ST
Subjt:  MSGVTVGVANPNEAAQAPQQK-------SVRRHQQQEQNTVVGGVMGSLRVIELQLVAFIMVFSASGLVPLLDLVFPAFTSAYLLALARFAFPSHGRVST

Query:  ASQEIFQASLCDILLILGMYVKKTNTSLSGEQDVQNVRGSWDDG------GAVSAIGVCVGWICEER------------RACSPVCDATSILAVVSDSHR
         S EIFQ S      +  MYV    T++     +  V G +  G       A   + +    I  E             RA  P+      + V+ D  +
Subjt:  ASQEIFQASLCDILLILGMYVKKTNTSLSGEQDVQNVRGSWDDG------GAVSAIGVCVGWICEER------------RACSPVCDATSILAVVSDSHR

Query:  ECDKWAVVVLAAGEGTGATALHGAEDLRHHGLDERCLAQQNSAGECTSQVANFAYFTINLLGFLIPRFLPRAFERYFRDR-DETRAKFQEDKLSS--AAP
          D W    L A            +D+                   T  VANFAYF INL GFLIPRFLPRAFE+YF++R DET AKFQEDKLSS  AA 
Subjt:  ECDKWAVVVLAAGEGTGATALHGAEDLRHHGLDERCLAQQNSAGECTSQVANFAYFTINLLGFLIPRFLPRAFERYFRDR-DETRAKFQEDKLSS--AAP

Query:  KSQPSDKKCD
        KSQPSDKK D
Subjt:  KSQPSDKKCD

A0A6J1GS90 uncharacterized protein LOC1114566311.1e-4751.32Show/hide
Query:  MSGVTVGVA---NPNEAAQAPQQKSVRRHQQQEQNTVVGGVMGSLRVIELQLVAFIMVFSASGLVPLLDLVFPAFTSAYLLALARFAFPSHGRVSTASQE
        MSGVT+ VA   +PN+A  APQ K +RR QQQEQN+VVGGVMGSLRVIELQLVAFIMVFSASGLVP+LDL+FPAFTSAYLL LARFAFPSHG  S ASQE
Subjt:  MSGVTVGVA---NPNEAAQAPQQKSVRRHQQQEQNTVVGGVMGSLRVIELQLVAFIMVFSASGLVPLLDLVFPAFTSAYLLALARFAFPSHGRVSTASQE

Query:  IFQASLCDILLILGMYVKKTNTSLSGEQDVQNVRGSW---DDGGAVSA---IGVCVGWICEER------------RACSPVCDATSILAVVSDSHRECDK
        IFQ S      +  MYV    T++     +  V G +   D+    SA   + +    I  E             RA  PV      + V+ D  +  D 
Subjt:  IFQASLCDILLILGMYVKKTNTSLSGEQDVQNVRGSW---DDGGAVSA---IGVCVGWICEER------------RACSPVCDATSILAVVSDSHRECDK

Query:  WAVVVLAAGEGTGATALHGAEDLRHHGLDERCLAQQNSAGECTSQVANFAYFTINLLGFLIPRFLPRAFERYFRDR-DETRAKFQEDKLSSAAPKSQPSD
        W    L A       A              R LA           VANFAYF INL GFLIPRF+PRAF+RYF+DR DET AK QED LSSA PKS PSD
Subjt:  WAVVVLAAGEGTGATALHGAEDLRHHGLDERCLAQQNSAGECTSQVANFAYFTINLLGFLIPRFLPRAFERYFRDR-DETRAKFQEDKLSSAAPKSQPSD

Query:  KKCD
        KK D
Subjt:  KKCD

A0A6J1JUQ0 uncharacterized protein LOC1114880273.1e-4751.48Show/hide
Query:  MSGVTVGVA---NPNEAAQAPQQKSVRRHQQQEQNTVVGGVMGSLRVIELQLVAFIMVFSASGLVPLLDLVFPAFTSAYLLALARFAFPSHGRVSTASQE
        MSGVT+ VA   +PN+A   PQQK +RR QQQEQNTVVGGVMGSLRVIELQLVAFIMVFSASGLVP+LDL FPAFTSAYLL LARFAFPS G  S ASQE
Subjt:  MSGVTVGVA---NPNEAAQAPQQKSVRRHQQQEQNTVVGGVMGSLRVIELQLVAFIMVFSASGLVPLLDLVFPAFTSAYLLALARFAFPSHGRVSTASQE

Query:  IFQASLCDILLILGMYVKKTNTSLSGEQDVQNVRGSW---DDGGAVSA---IGVCVGWICEER------------RACSPVCDATSILAVVSDSHRECDK
        IFQ S      +  MYV    T++     +  V G +   D+    SA   + +    I  E             RA  PV      + V+ D  +  D 
Subjt:  IFQASLCDILLILGMYVKKTNTSLSGEQDVQNVRGSW---DDGGAVSA---IGVCVGWICEER------------RACSPVCDATSILAVVSDSHRECDK

Query:  WAVVVLAAGEGTGATALHGAEDLRHHGLDERCLAQQNSAGECTSQVANFAYFTINLLGFLIPRFLPRAFERYFRDR--DETRAKFQEDKLSSAAPKSQPS
        W    LAA       A              R LA           VANFAYF INL GFLIPRF+PRAFERYF+DR  DET AK +ED LSSA PKS PS
Subjt:  WAVVVLAAGEGTGATALHGAEDLRHHGLDERCLAQQNSAGECTSQVANFAYFTINLLGFLIPRFLPRAFERYFRDR--DETRAKFQEDKLSSAAPKSQPS

Query:  DKKCD
        DKK D
Subjt:  DKKCD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G27990.1 unknown protein2.6e-3040.93Show/hide
Query:  GGVMGSLRVIELQLVAFIMVFSASGLVPLLDLVFPAFTSAYLLALARFAFPSHGRVSTASQEIFQASLCDILLILGMYVKKTNTSLSGEQDVQNVRGSW-
        GG+MGSLRVIELQLVAFI+VFSASGLVP+LD++FPAF S Y++AL+R AFPSHG VSTAS E+F+ S    L ++      + T++     +  V G + 
Subjt:  GGVMGSLRVIELQLVAFIMVFSASGLVPLLDLVFPAFTSAYLLALARFAFPSHGRVSTASQEIFQASLCDILLILGMYVKKTNTSLSGEQDVQNVRGSW-

Query:  --DDGGAVSAIGVCVGWICEERRACSPVCDATSILAVVSDSHREC-----DKWAVVVLAAGEG----TGATALHGAEDLRHHGLDERCLAQQNSAGECTS
          DD    SA        C+         +  S L++ S   R         W + V+           +  ++   ++       R LA          
Subjt:  --DDGGAVSAIGVCVGWICEERRACSPVCDATSILAVVSDSHREC-----DKWAVVVLAAGEG----TGATALHGAEDLRHHGLDERCLAQQNSAGECTS

Query:  QVANFAYFTINLLGFLIPRFLPRAFERYFRDRDETRAKFQEDKLSSAAPKSQPSDKKCD
         +AN  YF +NLL FLIPRFLPRAFE+YFR+RDE  AK QEDK     P+S+PS+ K D
Subjt:  QVANFAYFTINLLGFLIPRFLPRAFERYFRDRDETRAKFQEDKLSSAAPKSQPSDKKCD

AT5G52420.1 unknown protein1.3e-0524.18Show/hide
Query:  HQQQEQNTV--VGGVMGSLRVIELQLVAFIMVFSASGLVPLLDLVFPAFTSAYLLALARFAFPSHGR------VSTASQEIFQ---------ASLCDILL
        HQ Q  +TV   G   G     +L ++A I+V SASGLV + D +F   T  Y   L++  FP H        +++++ +IF+           +  I  
Subjt:  HQQQEQNTV--VGGVMGSLRVIELQLVAFIMVFSASGLVPLLDLVFPAFTSAYLLALARFAFPSHGR------VSTASQEIFQ---------ASLCDILL

Query:  ILGMYVKKTNTSLSGEQDVQNVRGSWDDGGAVSAIGVCVGWICEERRACSPVCDATSILAVVSDSHRECDKWAVVVLAAGEGTGATALHGAEDLRHHGLD
        I    V+     +S       +  S      +  +    G+    R     V +A  +L +V        +W +   +  + TG  +             
Subjt:  ILGMYVKKTNTSLSGEQDVQNVRGSWDDGGAVSAIGVCVGWICEERRACSPVCDATSILAVVSDSHRECDKWAVVVLAAGEGTGATALHGAEDLRHHGLD

Query:  ERCLAQQNSAGECTSQVANFAYFTINLLGFLIPRFLPRAFERYF
            A++  AG+  +  AN   ++ NL G LIP +LPRAF+RY+
Subjt:  ERCLAQQNSAGECTSQVANFAYFTINLLGFLIPRFLPRAFERYF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGGGTGTAACTGTTGGGGTGGCGAATCCAAACGAAGCAGCACAGGCGCCGCAGCAGAAGTCCGTCCGCCGACACCAGCAGCAGGAGCAGAACACTGTTGTGGGAGG
CGTGATGGGATCTCTACGTGTCATCGAACTCCAGCTTGTTGCCTTCATCATGGTGTTCTCCGCCAGCGGCCTCGTCCCACTTCTCGACCTCGTCTTTCCGGCGTTCACGT
CCGCCTACCTTCTTGCTCTGGCGCGGTTCGCCTTCCCGTCCCACGGCCGTGTGTCGACGGCGTCGCAAGAGATCTTTCAGGCAAGTCTTTGTGATATTTTGTTGATTTTG
GGAATGTATGTCAAGAAAACTAATACATCCCTTTCAGGGGAGCAAGATGTTCAGAATGTACGTGGTAGTTGGGACGACGGTGGGGCTGTTTCTGCCATTGGCGTATGTGT
TGGGTGGATTTGCGAGGAGCGACGAGCATGCAGTCCGGTCTGCGACGCCACATCTATTCTTGCTGTCGTTTCAGATTCTCACCGAGAATGTGATAAGTGGGCTGTCGTTG
TTCTCGCCGCCGGTGAGGGCACTGGTGCCACTGCTTTACACGGTGCGGAGGATCTTCGTCATCATGGATTGGATGAAAGATGTTTGGCTCAACAAAACTCTGCCGGCGAA
TGCACCTCTCAAGTGGCTAATTTTGCGTATTTCACTATAAATTTATTGGGATTTTTGATCCCAAGGTTCCTTCCAAGGGCATTCGAGAGGTATTTTAGGGACAGAGATGA
GACTCGTGCAAAATTTCAAGAAGATAAGCTCTCTTCCGCTGCCCCCAAATCTCAGCCGTCCGATAAGAAGTGTGATTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCGGGTGTAACTGTTGGGGTGGCGAATCCAAACGAAGCAGCACAGGCGCCGCAGCAGAAGTCCGTCCGCCGACACCAGCAGCAGGAGCAGAACACTGTTGTGGGAGG
CGTGATGGGATCTCTACGTGTCATCGAACTCCAGCTTGTTGCCTTCATCATGGTGTTCTCCGCCAGCGGCCTCGTCCCACTTCTCGACCTCGTCTTTCCGGCGTTCACGT
CCGCCTACCTTCTTGCTCTGGCGCGGTTCGCCTTCCCGTCCCACGGCCGTGTGTCGACGGCGTCGCAAGAGATCTTTCAGGCAAGTCTTTGTGATATTTTGTTGATTTTG
GGAATGTATGTCAAGAAAACTAATACATCCCTTTCAGGGGAGCAAGATGTTCAGAATGTACGTGGTAGTTGGGACGACGGTGGGGCTGTTTCTGCCATTGGCGTATGTGT
TGGGTGGATTTGCGAGGAGCGACGAGCATGCAGTCCGGTCTGCGACGCCACATCTATTCTTGCTGTCGTTTCAGATTCTCACCGAGAATGTGATAAGTGGGCTGTCGTTG
TTCTCGCCGCCGGTGAGGGCACTGGTGCCACTGCTTTACACGGTGCGGAGGATCTTCGTCATCATGGATTGGATGAAAGATGTTTGGCTCAACAAAACTCTGCCGGCGAA
TGCACCTCTCAAGTGGCTAATTTTGCGTATTTCACTATAAATTTATTGGGATTTTTGATCCCAAGGTTCCTTCCAAGGGCATTCGAGAGGTATTTTAGGGACAGAGATGA
GACTCGTGCAAAATTTCAAGAAGATAAGCTCTCTTCCGCTGCCCCCAAATCTCAGCCGTCCGATAAGAAGTGTGATTGA
Protein sequenceShow/hide protein sequence
MSGVTVGVANPNEAAQAPQQKSVRRHQQQEQNTVVGGVMGSLRVIELQLVAFIMVFSASGLVPLLDLVFPAFTSAYLLALARFAFPSHGRVSTASQEIFQASLCDILLIL
GMYVKKTNTSLSGEQDVQNVRGSWDDGGAVSAIGVCVGWICEERRACSPVCDATSILAVVSDSHRECDKWAVVVLAAGEGTGATALHGAEDLRHHGLDERCLAQQNSAGE
CTSQVANFAYFTINLLGFLIPRFLPRAFERYFRDRDETRAKFQEDKLSSAAPKSQPSDKKCD