CuGenDBv2

Sgr021726 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr021726
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Unknown protein
Genome location: tig00153823:261613..273839
RNA-Seq Expression: Sgr021726
Synteny: Sgr021726
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
XP_008455085.1 PREDICTED: uncharacterized protein LOC103495346 isoform X1 [Cucumis melo] | e value: 6.1e-96 | identity: 61.61%
Query:  MVPLLHRLWGGAISAFSPFSRLRSFRSDAALEAIARAAEERVPNVVLYNYPSFSGAFSALFAHLFHARLRLPCLILPFSSVAPLRVEDLYVEGLERCYFL
        M PLL R WG A S F P+SRLR+FRSDAALEAIARAA++RVPN+VLYNYPSFSGAFSALFAHLFH RLRLP LILPFSSVAPLRVED YV+GLERCYFL
Subjt:  MVPLLHRLWGGAISAFSPFSRLRSFRSDAALEAIARAAEERVPNVVLYNYPSFSGAFSALFAHLFHARLRLPCLILPFSSVAPLRVEDLYVEGLERCYFL

Query:  DFLGPKGFAATLLRRD-----------------FPTEDRPKNLSIRVNLEKSSSSAVYEYVSTRLGDVETPCGPVADLLEPKDKSRIEMVLKYIEDGFEE
        DFLGPKGFAA   RR                   P ED PKNLSIRVNLEKSSS+AVYEY S+RL D+ETPCGPVADLLE KD+SRIEMVLKYIEDG   
Subjt:  DFLGPKGFAATLLRRD-----------------FPTEDRPKNLSIRVNLEKSSSSAVYEYVSTRLGDVETPCGPVADLLEPKDKSRIEMVLKYIEDGFEE

Query:  YWH---------------------------------------------------------------------YRLVSAVRADGNSNLSDEIGKQLSMRSA
         W+                                                                     Y    AVRADGNSNLSDEIGKQLSMRS 
Subjt:  YWH---------------------------------------------------------------------YRLVSAVRADGNSNLSDEIGKQLSMRSA

Query:  AAGLRPIGAVMYMQRNNLKMCLRTMDGATDTSEVSK
        AAGLRPIGAV+YMQRNNLKMCLRT DG TDTSEVSK
Subjt:  AAGLRPIGAVMYMQRNNLKMCLRTMDGATDTSEVSK

XP_022136383.1 uncharacterized protein LOC111008108 isoform X1 [Momordica charantia] | e value: 1.1e-97 | identity: 62.2%
Query:  MVPLLHRLWGGAISAFSPFSRLRSFRSDAALEAIARAAEERVPNVVLYNYPSFSGAFSALFAHLFHARLRLPCLILPFSSVAPLRVEDLYVEGLERCYFL
        MVPLL R  GGA S  SP+SRLRSFRSDAALEAIARAAE+RVPNVVLYNYPSFSGAFSALFAHLFH RL LPCL+LPFSSVAPLR++DLYVEGLERCYFL
Subjt:  MVPLLHRLWGGAISAFSPFSRLRSFRSDAALEAIARAAEERVPNVVLYNYPSFSGAFSALFAHLFHARLRLPCLILPFSSVAPLRVEDLYVEGLERCYFL

Query:  DFLGPKGFAATLLRRD-----------------FPTEDRPKNLSIRVNLEKSSSSAVYEYVSTRLGDVETPCGPVADLLEPKDKSRIEMVLKYIEDGFEE
        DFLGPKGFAAT+ RR                   PTE + KN SI VNLEKSSS+AVYEY STRL D+ETPCG VADLLEPKD+SR+EMVLKYIEDG   
Subjt:  DFLGPKGFAATLLRRD-----------------FPTEDRPKNLSIRVNLEKSSSSAVYEYVSTRLGDVETPCGPVADLLEPKDKSRIEMVLKYIEDGFEE

Query:  YWH---------------------------------------------------------------------YRLVSAVRADGNSNLSDEIGKQLSMRSA
         W                                                                      Y    AVRADG+SNLSDEIGKQLSMRSA
Subjt:  YWH---------------------------------------------------------------------YRLVSAVRADGNSNLSDEIGKQLSMRSA

Query:  AAGLRPIGAVMYMQRNNLKMCLRTMDGATDTSEVSK
        AAGLRPIGAV+YMQRNNLKMCLRT DGATDTSEVSK
Subjt:  AAGLRPIGAVMYMQRNNLKMCLRTMDGATDTSEVSK

XP_022952097.1 uncharacterized protein LOC111454859 [Cucurbita moschata] | e value: 2.6e-94 | identity: 60.12%
Query:  MVPLLHRLWGGAISAFSPFSRLRSFRSDAALEAIARAAEERVPNVVLYNYPSFSGAFSALFAHLFHARLRLPCLILPFSSVAPLRVEDLYVEGLERCYFL
        MVP+L RL G +I  F P S+LRSFRSDA+LEAIA+AAE+RVPNVV YNYPSFSGAFSALFAHLFH RL LPCLILPFSS  P R+EDLYVEGLERCYFL
Subjt:  MVPLLHRLWGGAISAFSPFSRLRSFRSDAALEAIARAAEERVPNVVLYNYPSFSGAFSALFAHLFHARLRLPCLILPFSSVAPLRVEDLYVEGLERCYFL

Query:  DFLGPKGFAATLLRR-----------------DFPTEDRPKNLSIRVNLEKSSSSAVYEYVSTRLGDVETPCGPVADLLEPKDKSRIEMVLKYIEDGFEE
        DFLGPKGFAA + RR                   PTEDRPKNLSIRVNLEKSSS+ VYEY S+RL D+ETPC PVADLLE KD+SRIEMVLKYIEDG   
Subjt:  DFLGPKGFAATLLRR-----------------DFPTEDRPKNLSIRVNLEKSSSSAVYEYVSTRLGDVETPCGPVADLLEPKDKSRIEMVLKYIEDGFEE

Query:  YW---------------------------------------------------------------------HYRLVSAVRADGNSNLSDEIGKQLSMRSA
         W                                                                      Y    AVRADGNSNLSDEIGKQLS+RSA
Subjt:  YW---------------------------------------------------------------------HYRLVSAVRADGNSNLSDEIGKQLSMRSA

Query:  AAGLRPIGAVMYMQRNNLKMCLRTMDGATDTSEVSK
        AAGLRP+GAV+YMQR NLKMCLRT DGATDTSEVSK
Subjt:  AAGLRPIGAVMYMQRNNLKMCLRTMDGATDTSEVSK

XP_023553768.1 uncharacterized protein LOC111811240 isoform X1 [Cucurbita pepo subsp. pepo] | e value: 3.0e-95 | identity: 60.42%
Query:  MVPLLHRLWGGAISAFSPFSRLRSFRSDAALEAIARAAEERVPNVVLYNYPSFSGAFSALFAHLFHARLRLPCLILPFSSVAPLRVEDLYVEGLERCYFL
        MVP+L RL G  I  F P S+LRSFRSDA+L+AIA+AAE+RVPNVV YNYPSFSGAFSALFAHLFH RLRLPCLILPFSS  P R+EDLYVEGLERCYFL
Subjt:  MVPLLHRLWGGAISAFSPFSRLRSFRSDAALEAIARAAEERVPNVVLYNYPSFSGAFSALFAHLFHARLRLPCLILPFSSVAPLRVEDLYVEGLERCYFL

Query:  DFLGPKGFAATLLRR-----------------DFPTEDRPKNLSIRVNLEKSSSSAVYEYVSTRLGDVETPCGPVADLLEPKDKSRIEMVLKYIEDGFEE
        DFLGPKGFAA + RR                   PTEDRPKNLSIRVNLEKSSS+AVYEY S+RL D+ETPC PVADLLE KD+SRIEMVLKYIEDG   
Subjt:  DFLGPKGFAATLLRR-----------------DFPTEDRPKNLSIRVNLEKSSSSAVYEYVSTRLGDVETPCGPVADLLEPKDKSRIEMVLKYIEDGFEE

Query:  YW---------------------------------------------------------------------HYRLVSAVRADGNSNLSDEIGKQLSMRSA
         W                                                                      Y    AVRADGNSNLSDEIGKQLS+RSA
Subjt:  YW---------------------------------------------------------------------HYRLVSAVRADGNSNLSDEIGKQLSMRSA

Query:  AAGLRPIGAVMYMQRNNLKMCLRTMDGATDTSEVSK
        AAGLRP+GAV+YMQR NLKMCLRT DGATDTSEVSK
Subjt:  AAGLRPIGAVMYMQRNNLKMCLRTMDGATDTSEVSK

XP_038886891.1 uncharacterized protein LOC120077095 isoform X2 [Benincasa hispida] | e value: 5.2e-95 | identity: 61.9%
Query:  MVPLLHRLWGGAISAFSPFSRLRSFRSDAALEAIARAAEERVPNVVLYNYPSFSGAFSALFAHLFHARLRLPCLILPFSSVAPLRVEDLYVEGLERCYFL
        M PLL RLWG   S F PFSR+RSFRSDAALEAIA+AAEERVPNVVLYNYPSFSGAFSALFAHLFH RLRLPCLILPFSSVAPLR+EDLYVEGLERCYFL
Subjt:  MVPLLHRLWGGAISAFSPFSRLRSFRSDAALEAIARAAEERVPNVVLYNYPSFSGAFSALFAHLFHARLRLPCLILPFSSVAPLRVEDLYVEGLERCYFL

Query:  DFLGPKGFAATLLRRD-----------------FPTEDRPKNLSIRVNLEKSSSSAVYEYVSTRLGDVETPCGPVADLLEPKDKSRIEMVLKYIEDGFEE
        DFLGPKGFAA + RR                   PTEDRPKNLS+RVNLEKSSS AVYEY S+RL D+E  C    DLLE KD+SRIEMVLKYIEDG   
Subjt:  DFLGPKGFAATLLRRD-----------------FPTEDRPKNLSIRVNLEKSSSSAVYEYVSTRLGDVETPCGPVADLLEPKDKSRIEMVLKYIEDGFEE

Query:  YWH---------------------------------------------------------------------YRLVSAVRADGNSNLSDEIGKQLSMRSA
         W                                                                      Y    AVRADGNSNLSDEIGKQLSMRS 
Subjt:  YWH---------------------------------------------------------------------YRLVSAVRADGNSNLSDEIGKQLSMRSA

Query:  AAGLRPIGAVMYMQRNNLKMCLRTMDGATDTSEVSK
        AAGLRPIGAV+YMQRNNLKMCLRT DGATDTSEVSK
Subjt:  AAGLRPIGAVMYMQRNNLKMCLRTMDGATDTSEVSK

TrEMBL top hits (e value, % identity, alignment)
A0A1S3C0T8 uncharacterized protein LOC103495346 isoform X1 | e value: 3.0e-96 | identity: 61.61%
Query:  MVPLLHRLWGGAISAFSPFSRLRSFRSDAALEAIARAAEERVPNVVLYNYPSFSGAFSALFAHLFHARLRLPCLILPFSSVAPLRVEDLYVEGLERCYFL
        M PLL R WG A S F P+SRLR+FRSDAALEAIARAA++RVPN+VLYNYPSFSGAFSALFAHLFH RLRLP LILPFSSVAPLRVED YV+GLERCYFL
Subjt:  MVPLLHRLWGGAISAFSPFSRLRSFRSDAALEAIARAAEERVPNVVLYNYPSFSGAFSALFAHLFHARLRLPCLILPFSSVAPLRVEDLYVEGLERCYFL

Query:  DFLGPKGFAATLLRRD-----------------FPTEDRPKNLSIRVNLEKSSSSAVYEYVSTRLGDVETPCGPVADLLEPKDKSRIEMVLKYIEDGFEE
        DFLGPKGFAA   RR                   P ED PKNLSIRVNLEKSSS+AVYEY S+RL D+ETPCGPVADLLE KD+SRIEMVLKYIEDG   
Subjt:  DFLGPKGFAATLLRRD-----------------FPTEDRPKNLSIRVNLEKSSSSAVYEYVSTRLGDVETPCGPVADLLEPKDKSRIEMVLKYIEDGFEE

Query:  YWH---------------------------------------------------------------------YRLVSAVRADGNSNLSDEIGKQLSMRSA
         W+                                                                     Y    AVRADGNSNLSDEIGKQLSMRS 
Subjt:  YWH---------------------------------------------------------------------YRLVSAVRADGNSNLSDEIGKQLSMRSA

Query:  AAGLRPIGAVMYMQRNNLKMCLRTMDGATDTSEVSK
        AAGLRPIGAV+YMQRNNLKMCLRT DG TDTSEVSK
Subjt:  AAGLRPIGAVMYMQRNNLKMCLRTMDGATDTSEVSK

A0A5A7SMH4 Uncharacterized protein | e value: 3.0e-96 | identity: 61.61%
Query:  MVPLLHRLWGGAISAFSPFSRLRSFRSDAALEAIARAAEERVPNVVLYNYPSFSGAFSALFAHLFHARLRLPCLILPFSSVAPLRVEDLYVEGLERCYFL
        M PLL R WG A S F P+SRLR+FRSDAALEAIARAA++RVPN+VLYNYPSFSGAFSALFAHLFH RLRLP LILPFSSVAPLRVED YV+GLERCYFL
Subjt:  MVPLLHRLWGGAISAFSPFSRLRSFRSDAALEAIARAAEERVPNVVLYNYPSFSGAFSALFAHLFHARLRLPCLILPFSSVAPLRVEDLYVEGLERCYFL

Query:  DFLGPKGFAATLLRRD-----------------FPTEDRPKNLSIRVNLEKSSSSAVYEYVSTRLGDVETPCGPVADLLEPKDKSRIEMVLKYIEDGFEE
        DFLGPKGFAA   RR                   P ED PKNLSIRVNLEKSSS+AVYEY S+RL D+ETPCGPVADLLE KD+SRIEMVLKYIEDG   
Subjt:  DFLGPKGFAATLLRRD-----------------FPTEDRPKNLSIRVNLEKSSSSAVYEYVSTRLGDVETPCGPVADLLEPKDKSRIEMVLKYIEDGFEE

Query:  YWH---------------------------------------------------------------------YRLVSAVRADGNSNLSDEIGKQLSMRSA
         W+                                                                     Y    AVRADGNSNLSDEIGKQLSMRS 
Subjt:  YWH---------------------------------------------------------------------YRLVSAVRADGNSNLSDEIGKQLSMRSA

Query:  AAGLRPIGAVMYMQRNNLKMCLRTMDGATDTSEVSK
        AAGLRPIGAV+YMQRNNLKMCLRT DG TDTSEVSK
Subjt:  AAGLRPIGAVMYMQRNNLKMCLRTMDGATDTSEVSK

A0A6J1C7F8 uncharacterized protein LOC111008108 isoform X1 | e value: 5.4e-98 | identity: 62.2%
Query:  MVPLLHRLWGGAISAFSPFSRLRSFRSDAALEAIARAAEERVPNVVLYNYPSFSGAFSALFAHLFHARLRLPCLILPFSSVAPLRVEDLYVEGLERCYFL
        MVPLL R  GGA S  SP+SRLRSFRSDAALEAIARAAE+RVPNVVLYNYPSFSGAFSALFAHLFH RL LPCL+LPFSSVAPLR++DLYVEGLERCYFL
Subjt:  MVPLLHRLWGGAISAFSPFSRLRSFRSDAALEAIARAAEERVPNVVLYNYPSFSGAFSALFAHLFHARLRLPCLILPFSSVAPLRVEDLYVEGLERCYFL

Query:  DFLGPKGFAATLLRRD-----------------FPTEDRPKNLSIRVNLEKSSSSAVYEYVSTRLGDVETPCGPVADLLEPKDKSRIEMVLKYIEDGFEE
        DFLGPKGFAAT+ RR                   PTE + KN SI VNLEKSSS+AVYEY STRL D+ETPCG VADLLEPKD+SR+EMVLKYIEDG   
Subjt:  DFLGPKGFAATLLRRD-----------------FPTEDRPKNLSIRVNLEKSSSSAVYEYVSTRLGDVETPCGPVADLLEPKDKSRIEMVLKYIEDGFEE

Query:  YWH---------------------------------------------------------------------YRLVSAVRADGNSNLSDEIGKQLSMRSA
         W                                                                      Y    AVRADG+SNLSDEIGKQLSMRSA
Subjt:  YWH---------------------------------------------------------------------YRLVSAVRADGNSNLSDEIGKQLSMRSA

Query:  AAGLRPIGAVMYMQRNNLKMCLRTMDGATDTSEVSK
        AAGLRPIGAV+YMQRNNLKMCLRT DGATDTSEVSK
Subjt:  AAGLRPIGAVMYMQRNNLKMCLRTMDGATDTSEVSK

A0A6J1GJH1 uncharacterized protein LOC111454859 | e value: 1.2e-94 | identity: 60.12%
Query:  MVPLLHRLWGGAISAFSPFSRLRSFRSDAALEAIARAAEERVPNVVLYNYPSFSGAFSALFAHLFHARLRLPCLILPFSSVAPLRVEDLYVEGLERCYFL
        MVP+L RL G +I  F P S+LRSFRSDA+LEAIA+AAE+RVPNVV YNYPSFSGAFSALFAHLFH RL LPCLILPFSS  P R+EDLYVEGLERCYFL
Subjt:  MVPLLHRLWGGAISAFSPFSRLRSFRSDAALEAIARAAEERVPNVVLYNYPSFSGAFSALFAHLFHARLRLPCLILPFSSVAPLRVEDLYVEGLERCYFL

Query:  DFLGPKGFAATLLRR-----------------DFPTEDRPKNLSIRVNLEKSSSSAVYEYVSTRLGDVETPCGPVADLLEPKDKSRIEMVLKYIEDGFEE
        DFLGPKGFAA + RR                   PTEDRPKNLSIRVNLEKSSS+ VYEY S+RL D+ETPC PVADLLE KD+SRIEMVLKYIEDG   
Subjt:  DFLGPKGFAATLLRR-----------------DFPTEDRPKNLSIRVNLEKSSSSAVYEYVSTRLGDVETPCGPVADLLEPKDKSRIEMVLKYIEDGFEE

Query:  YW---------------------------------------------------------------------HYRLVSAVRADGNSNLSDEIGKQLSMRSA
         W                                                                      Y    AVRADGNSNLSDEIGKQLS+RSA
Subjt:  YW---------------------------------------------------------------------HYRLVSAVRADGNSNLSDEIGKQLSMRSA

Query:  AAGLRPIGAVMYMQRNNLKMCLRTMDGATDTSEVSK
        AAGLRP+GAV+YMQR NLKMCLRT DGATDTSEVSK
Subjt:  AAGLRPIGAVMYMQRNNLKMCLRTMDGATDTSEVSK

A0A6J1I3B6 uncharacterized protein LOC111470618 | e value: 6.2e-94 | identity: 60.12%
Query:  MVPLLHRLWGGAISAFSPFSRLRSFRSDAALEAIARAAEERVPNVVLYNYPSFSGAFSALFAHLFHARLRLPCLILPFSSVAPLRVEDLYVEGLERCYFL
        MV +L RL G  I  F P SRLRSFRSDA+LEAIA+AAE+RVPNVV YNYPSFSGAFSALFA +FH RLRLPCLILPFSS  P R+EDLYVEGLERCYFL
Subjt:  MVPLLHRLWGGAISAFSPFSRLRSFRSDAALEAIARAAEERVPNVVLYNYPSFSGAFSALFAHLFHARLRLPCLILPFSSVAPLRVEDLYVEGLERCYFL

Query:  DFLGPKGFAATLLRR-----------------DFPTEDRPKNLSIRVNLEKSSSSAVYEYVSTRLGDVETPCGPVADLLEPKDKSRIEMVLKYIEDGFEE
        DFLGPKGFAA + RR                   PTEDRPKNLSIRVNLEKSSS+AVYEY S+RL D+ETPC PVADLLE KD+SRIEMVLKYIEDG   
Subjt:  DFLGPKGFAATLLRR-----------------DFPTEDRPKNLSIRVNLEKSSSSAVYEYVSTRLGDVETPCGPVADLLEPKDKSRIEMVLKYIEDGFEE

Query:  YW---------------------------------------------------------------------HYRLVSAVRADGNSNLSDEIGKQLSMRSA
         W                                                                      Y    AVRADGN+NLSDEIGKQLS+RSA
Subjt:  YW---------------------------------------------------------------------HYRLVSAVRADGNSNLSDEIGKQLSMRSA

Query:  AAGLRPIGAVMYMQRNNLKMCLRTMDGATDTSEVSK
        AAGLRPIGAV+YMQR NLKMCLRT DGATDTSEVSK
Subjt:  AAGLRPIGAVMYMQRNNLKMCLRTMDGATDTSEVSK

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT5G09580.1 unknown protein | e value: 9.5e-63 | identity: 46.54%
Query:  FSRLRSFRSDAALEAIARAAEERVPNVVLYNYPSFSGAFSALFAHLFHARLRLPCLILPFSSVAPLRVEDLYVEGLERCYFLDFLGPKGFA---------
        F   RSFRSDAALEAIA A EE+VPN+VLYNYPSFSGAFSALFAHL+H RLRLPCLILPFSS+ P RVEDL +EG ERCY LDF+ PK FA         
Subjt:  FSRLRSFRSDAALEAIARAAEERVPNVVLYNYPSFSGAFSALFAHLFHARLRLPCLILPFSSVAPLRVEDLYVEGLERCYFLDFLGPKGFA---------

Query:  ------ATLLRRDFPTEDRPKNLSIRVNLEKSSSSAVYEYVSTRLGDVETPCGPVADLLEPKDKSRIEMVLKYIED----------------GFEE----
              + L R     E+  K L I V+ E SSS AVY+Y S++L D  +       LL  +DK+R+E VL YIED                G ++    
Subjt:  ------ATLLRRDFPTEDRPKNLSIRVNLEKSSSSAVYEYVSTRLGDVETPCGPVADLLEPKDKSRIEMVLKYIED----------------GFEE----

Query:  ---------------------------YWHYRLVSA------------------------VRADGNSNLSDEIGKQLSMRSAAAGLRPIGAVMYMQRNNL
                                   Y+  RL+ A                        +RADGN  LSDE+GK LS++S+AAGLRPIGAV ++QRNNL
Subjt:  ---------------------------YWHYRLVSA------------------------VRADGNSNLSDEIGKQLSMRSAAAGLRPIGAVMYMQRNNL

Query:  KMCLRTMDGATDTSEVSK
        KMCLR+ D  T+TSEV+K
Subjt:  KMCLRTMDGATDTSEVSK


Sequences
CDS sequence
ATGGTTCCTCTGCTTCACCGATTATGGGGAGGGGCTATCTCCGCATTTTCGCCATTTTCGCGACTTCGGAGCTTCCGTTCTGATGCTGCGTTGGAAGCCATAGCCAGAGC
TGCGGAAGAGAGAGTTCCTAATGTGGTGCTTTACAACTATCCCTCTTTTTCTGGAGCATTCTCAGCCCTCTTCGCACACTTGTTTCATGCTCGTCTTCGTCTCCCTTGCC
TCATTTTGCCTTTCTCTTCCGTTGCGCCTCTCAGGGTTGAAGATCTGTATGTTGAAGGGCTTGAGCGATGCTATTTTCTTGACTTTTTGGGTCCAAAGGGATTTGCTGCG
ACCCTTTTACGTCGAGATTTCCCTACGGAAGATCGGCCTAAGAATCTCTCAATTCGTGTAAATCTTGAGAAGAGTAGCTCTAGCGCTGTGTATGAATATGTCTCCACTAG
ACTTGGAGATGTGGAAACTCCTTGTGGTCCTGTTGCAGACCTTTTAGAACCAAAAGATAAAAGTCGAATTGAAATGGTTCTCAAGTACATTGAGGATGGATTCGAGGAGT
ACTGGCATTACAGATTGGTCTCAGCAGTAAGAGCAGATGGGAATTCCAACTTGAGCGATGAAATTGGAAAGCAACTAAGTATGAGAAGTGCTGCAGCTGGTTTGAGGCCT
ATTGGGGCTGTTATGTACATGCAACGAAACAATCTTAAAATGTGCTTGAGGACTATGGATGGTGCTACCGACACATCTGAGGTCTCCAAACGTATAAGGTGGGAAAAGAC
GACCAGACTTATGACAAATGGGGCAAAGATCATGTACGCATATGCCTCTCCTTTTCAAATTATGAATCGGCAATGGAGCTTTGAAGTTAAGCATGTCGATCACCTGTTTG
ATAGAAGGCCGTACAGAGAAATCAGGCTGAGCGCAGGCAAGTCCAATGATCAACAGTTGCTTCATCTCTTCAGCGATTTCCAAGCAGACAATGCCGAAGCTGTAGATGTC
GGATTCTTTACTCGCTATGCTACTTTCCAGGAGGTTGTTCTGCTGAAAGAGATGGAAATCCAGGCTCTTGTTGGGCATGAACTTGTAGACTATGAGATATTCTTGGTCTT
TGCTGCAGCACCACCCAATGAGCTCTACCAATCTTTATATGAAATCTTCCTTGGTCCTGTTTCCTTCTCAAAATCTTCCTCTTGGTCCCTTCTCATACTTTCTCTTCTGC
CGGTTCTCTTCATCCATAGCCAAAACCAAGCAAATCCCAGAATCACAGAGAGTAAAAATGCCACTGCCAAGACGATCCCGAGCTGTTTCATGTGTAGTCCTTTCTTGGAC
TTACCAGGAGTGGTGATGATTCCAGGTGAAGAACTAGCTGGATCAGTGGTGACGTTGACTTCAACTTGTAATCTAGAGCTGAAATTCCACGAGCTAATCAAACTAGAAGA
GTTCCTCGGAGAAACTTCAGAATCTCCTTCATCAAACAAGAAAACAGTCAAGTTGTGGGAGCTTGAATTTAGAGCTCAGAGATGCGACAGAGTTGACGTCAATGCCGATG
TGATTCCCCGACTGGTCCCATCTATTCGTAAAGTAAGGCGGAAGCCTCGAGCCATTCGGTGCGAGGAAGAAGGCAAAGCCATCGCCATAGGAGCGACTTCCCTGCGAATC
AATGGAGAAGGAGAAGTGTGTGGTGAAGTCTGCAATATTCTTCTGTCCATTCTCCCAAAGATGGAAAGGATCTTTGTAGATGGCGCGGCCAACGCTTCCATTGAGAGGCA
TGTCTCTCTGGTTCATGGTGAGTTGAATGACATTGTTGGAAGATTCTCAGAAGAAATGGTGGCTGAGACCGCAACTGGGTTGGTTATGCTCCACAAGCAGAAGATGATGA
GGAGGAAAGGGTTTAGAGATCCAAAAGTGAAGACCTCCATGGGGAACGTGGATGAAGAGATCCAAGGCAGCCATGAAGGAGAAGATTTGAATTTATACAAAAATGATGGT
CTTTGA
mRNA sequence
ATGGTTCCTCTGCTTCACCGATTATGGGGAGGGGCTATCTCCGCATTTTCGCCATTTTCGCGACTTCGGAGCTTCCGTTCTGATGCTGCGTTGGAAGCCATAGCCAGAGC
TGCGGAAGAGAGAGTTCCTAATGTGGTGCTTTACAACTATCCCTCTTTTTCTGGAGCATTCTCAGCCCTCTTCGCACACTTGTTTCATGCTCGTCTTCGTCTCCCTTGCC
TCATTTTGCCTTTCTCTTCCGTTGCGCCTCTCAGGGTTGAAGATCTGTATGTTGAAGGGCTTGAGCGATGCTATTTTCTTGACTTTTTGGGTCCAAAGGGATTTGCTGCG
ACCCTTTTACGTCGAGATTTCCCTACGGAAGATCGGCCTAAGAATCTCTCAATTCGTGTAAATCTTGAGAAGAGTAGCTCTAGCGCTGTGTATGAATATGTCTCCACTAG
ACTTGGAGATGTGGAAACTCCTTGTGGTCCTGTTGCAGACCTTTTAGAACCAAAAGATAAAAGTCGAATTGAAATGGTTCTCAAGTACATTGAGGATGGATTCGAGGAGT
ACTGGCATTACAGATTGGTCTCAGCAGTAAGAGCAGATGGGAATTCCAACTTGAGCGATGAAATTGGAAAGCAACTAAGTATGAGAAGTGCTGCAGCTGGTTTGAGGCCT
ATTGGGGCTGTTATGTACATGCAACGAAACAATCTTAAAATGTGCTTGAGGACTATGGATGGTGCTACCGACACATCTGAGGTCTCCAAACGTATAAGGTGGGAAAAGAC
GACCAGACTTATGACAAATGGGGCAAAGATCATGTACGCATATGCCTCTCCTTTTCAAATTATGAATCGGCAATGGAGCTTTGAAGTTAAGCATGTCGATCACCTGTTTG
ATAGAAGGCCGTACAGAGAAATCAGGCTGAGCGCAGGCAAGTCCAATGATCAACAGTTGCTTCATCTCTTCAGCGATTTCCAAGCAGACAATGCCGAAGCTGTAGATGTC
GGATTCTTTACTCGCTATGCTACTTTCCAGGAGGTTGTTCTGCTGAAAGAGATGGAAATCCAGGCTCTTGTTGGGCATGAACTTGTAGACTATGAGATATTCTTGGTCTT
TGCTGCAGCACCACCCAATGAGCTCTACCAATCTTTATATGAAATCTTCCTTGGTCCTGTTTCCTTCTCAAAATCTTCCTCTTGGTCCCTTCTCATACTTTCTCTTCTGC
CGGTTCTCTTCATCCATAGCCAAAACCAAGCAAATCCCAGAATCACAGAGAGTAAAAATGCCACTGCCAAGACGATCCCGAGCTGTTTCATGTGTAGTCCTTTCTTGGAC
TTACCAGGAGTGGTGATGATTCCAGGTGAAGAACTAGCTGGATCAGTGGTGACGTTGACTTCAACTTGTAATCTAGAGCTGAAATTCCACGAGCTAATCAAACTAGAAGA
GTTCCTCGGAGAAACTTCAGAATCTCCTTCATCAAACAAGAAAACAGTCAAGTTGTGGGAGCTTGAATTTAGAGCTCAGAGATGCGACAGAGTTGACGTCAATGCCGATG
TGATTCCCCGACTGGTCCCATCTATTCGTAAAGTAAGGCGGAAGCCTCGAGCCATTCGGTGCGAGGAAGAAGGCAAAGCCATCGCCATAGGAGCGACTTCCCTGCGAATC
AATGGAGAAGGAGAAGTGTGTGGTGAAGTCTGCAATATTCTTCTGTCCATTCTCCCAAAGATGGAAAGGATCTTTGTAGATGGCGCGGCCAACGCTTCCATTGAGAGGCA
TGTCTCTCTGGTTCATGGTGAGTTGAATGACATTGTTGGAAGATTCTCAGAAGAAATGGTGGCTGAGACCGCAACTGGGTTGGTTATGCTCCACAAGCAGAAGATGATGA
GGAGGAAAGGGTTTAGAGATCCAAAAGTGAAGACCTCCATGGGGAACGTGGATGAAGAGATCCAAGGCAGCCATGAAGGAGAAGATTTGAATTTATACAAAAATGATGGT
CTTTGA
Protein sequence
MVPLLHRLWGGAISAFSPFSRLRSFRSDAALEAIARAAEERVPNVVLYNYPSFSGAFSALFAHLFHARLRLPCLILPFSSVAPLRVEDLYVEGLERCYFLDFLGPKGFAA
TLLRRDFPTEDRPKNLSIRVNLEKSSSSAVYEYVSTRLGDVETPCGPVADLLEPKDKSRIEMVLKYIEDGFEEYWHYRLVSAVRADGNSNLSDEIGKQLSMRSAAAGLRP
IGAVMYMQRNNLKMCLRTMDGATDTSEVSKRIRWEKTTRLMTNGAKIMYAYASPFQIMNRQWSFEVKHVDHLFDRRPYREIRLSAGKSNDQQLLHLFSDFQADNAEAVDV
GFFTRYATFQEVVLLKEMEIQALVGHELVDYEIFLVFAAAPPNELYQSLYEIFLGPVSFSKSSSWSLLILSLLPVLFIHSQNQANPRITESKNATAKTIPSCFMCSPFLD
LPGVVMIPGEELAGSVVTLTSTCNLELKFHELIKLEEFLGETSESPSSNKKTVKLWELEFRAQRCDRVDVNADVIPRLVPSIRKVRRKPRAIRCEEEGKAIAIGATSLRI
NGEGEVCGEVCNILLSILPKMERIFVDGAANASIERHVSLVHGELNDIVGRFSEEMVAETATGLVMLHKQKMMRRKGFRDPKVKTSMGNVDEEIQGSHEGEDLNLYKNDG
L