CuGenDBv2

Sgr028225 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr028225
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Unknown protein
Genome location: tig00153056:4807181..4812564
RNA-Seq Expression: Sgr028225
Synteny: Sgr028225
Gene Ontology terms: GO:0016020 - membrane (cellular component)
InterPro domains: IPR040411 - Uncharacterized protein At5g23160-like
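
The genome location string can be split into a contig name and a coordinate pair. The following is a minimal sketch, assuming the `contig:start..end` layout shown in this record and assuming the coordinates are 1-based and inclusive (the record itself does not state the convention). The function names `parse_location` and `span_length` are illustrative and not part of any database API.

```python
import re


def parse_location(location):
    """Split a 'contig:start..end' string into (contig, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    contig, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    return contig, start, end


def span_length(location):
    """Length of the genomic span, ASSUMING 1-based inclusive coordinates."""
    _, start, end = parse_location(location)
    return abs(end - start) + 1


if __name__ == "__main__":
    loc = "tig00153056:4807181..4812564"
    print(parse_location(loc))
    print(span_length(loc))
```

Under that assumption the locus spans 5384 bp on contig tig00153056.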


Homology
GenBank top hits: e value, % identity, alignment
XP_022139696.1 uncharacterized protein LOC111010543 [Momordica charantia] | e value: 1.2e-82 | % identity: 65.53
Query:  MDSIATKERKKSKFFPCFRAPATSSHFRTERRKDAPDEPVFPFMAVGERDGVMFHTVQPLASPSDGDDEDPGLRKKKGGGGALSRAIKAVLFGTSLFSNE
        MDS A K RKK+K FPCFR+PA+  + RTER KD PDE VFP MAV E DG MFH+V+ +AS  D DDED G RK+KG GGALSRAIKAVLFGT+L    
Subjt:  MDSIATKERKKSKFFPCFRAPATSSHFRTERRKDAPDEPVFPFMAVGERDGVMFHTVQPLASPSDGDDEDPGLRKKKGGGGALSRAIKAVLFGTSLFSNE

Query:  NSGCSASVAGEEDQKQESETKAKFTNRNSKKENQ-RHQAMSSISNR-RIASDLYHNSSFSSSSLTSVPFSSSSICSSSCSSSDINDRSFRFNPTASNRLF
                      K+  + KAK   +NSKKENQ RH ++SSIS+R RIASD YHN S + SS TS+PFSSSS CSSS SSSDI++RSF   PTA  RLF
Subjt:  NSGCSASVAGEEDQKQESETKAKFTNRNSKKENQ-RHQAMSSISNR-RIASDLYHNSSFSSSSLTSVPFSSSSICSSSCSSSDINDRSFRFNPTASNRLF

Query:  RQINLRKICSGWFLLLVCLLSLISWGKVGAIVCTSAWILCLPRRRIGFKSPDDKASEAAAAAIDSSEYKKRIVIEGLLARDRSAAQNSSLRID
         QINLR+ICSGW + LVC+LSLI WGK+ AIVCTS WILC   RR GFKSP+ KAS   AAAIDS E+ KRIVIEGLLARDRSAAQNSSLRID
Subjt:  RQINLRKICSGWFLLLVCLLSLISWGKVGAIVCTSAWILCLPRRRIGFKSPDDKASEAAAAAIDSSEYKKRIVIEGLLARDRSAAQNSSLRID

XP_022941268.1 uncharacterized protein LOC111446618 isoform X1 [Cucurbita moschata] | e value: 2.4e-62 | % identity: 55.97
Query:  MDSIATKERKKSKFFPCFRAPATSSHFRTERRKDAPDEPVFPFMAVGERDGVMFHTVQPLASPSDGDDEDPGLRKKKGGGGALSRAIKAVLFGTSLFSNE
        MDS A  + KK KFFPCFR+ A+S   RT R  +A DE VFPFMAV ER+G+M H VQP     DG     G +KK GGGGALSRA+KAVLFGTSL    
Subjt:  MDSIATKERKKSKFFPCFRAPATSSHFRTERRKDAPDEPVFPFMAVGERDGVMFHTVQPLASPSDGDDEDPGLRKKKGGGGALSRAIKAVLFGTSLFSNE

Query:  NSGCSASVAGEEDQKQESETKAKFTNRNSKKENQRHQAMSSISNRRIASDLYHNSSFSSSSLTSVPFSSSSICSSSCSSSDINDRSFRFNPTASNRLFRQ
             A    ++ +KQ+          NS +ENQR + +SSIS+R  +SD    +S + SS TS PFSS+S CSSS +S +I + SFRF+PTASNRLFRQ
Subjt:  NSGCSASVAGEEDQKQESETKAKFTNRNSKKENQRHQAMSSISNRRIASDLYHNSSFSSSSLTSVPFSSSSICSSSCSSSDINDRSFRFNPTASNRLFRQ

Query:  INLRKICSGWFLLLVCLLSLISWGKVGAIVCTSAWILCLP--RRRIGFKSPDDKASEAAAAAIDSSEYKKRIVIEGLLARDRSAAQNSSLRID
        INLRK    WF+LLV LLSL+ W K+GA VCTS WILC    RR IGF+SPDDKAS   AAA+ S EYKKR ++EG L RDRSA +NS   ID
Subjt:  INLRKICSGWFLLLVCLLSLISWGKVGAIVCTSAWILCLP--RRRIGFKSPDDKASEAAAAAIDSSEYKKRIVIEGLLARDRSAAQNSSLRID

XP_022981581.1 uncharacterized protein LOC111480657 isoform X1 [Cucurbita maxima] | e value: 1.1e-64 | % identity: 56.66
Query:  MDSIATKERKKSKFFPCFRAPATSSHFRTERRKDAPDEPVFPFMAVGERDGVMFHTVQPLASPSDGDDEDPGLRKKKGGGGALSRAIKAVLFGTSLFSNE
        MDS A  + KK KFFPCFR+ A+SS  RT R  DA DE VFPFMAV ER+G+M H VQP     DG  +  G +KK GGGGALSRA+KAVLFGTSL    
Subjt:  MDSIATKERKKSKFFPCFRAPATSSHFRTERRKDAPDEPVFPFMAVGERDGVMFHTVQPLASPSDGDDEDPGLRKKKGGGGALSRAIKAVLFGTSLFSNE

Query:  NSGCSASVAGEEDQKQESETKAKFTNRNSKKENQRHQAMSSISNRRIASDLYHNSSFSSSSLTSVPFSSSSICSSSCSSSDINDRSFRFNPTASNRLFRQ
                  ++ +K++ + K      NS +ENQR Q +SSIS+R  +SD    +S + SS  S PFSS+S CSSS +SS+IN+ SFRF+PTASNRLFRQ
Subjt:  NSGCSASVAGEEDQKQESETKAKFTNRNSKKENQRHQAMSSISNRRIASDLYHNSSFSSSSLTSVPFSSSSICSSSCSSSDINDRSFRFNPTASNRLFRQ

Query:  INLRKICSGWFLLLVCLLSLISWGKVGAIVCTSAWILCLP--RRRIGFKSPDDKASEAAAAAIDSSEYKKRIVIEGLLARDRSAAQNSSLRID
        INLR     WF+LLVCLLSL+ W K+GA VCTS WILC    RR IGF+SPDDKAS   AAA+ S EY+KR ++EG L RDRSA +NS   ID
Subjt:  INLRKICSGWFLLLVCLLSLISWGKVGAIVCTSAWILCLP--RRRIGFKSPDDKASEAAAAAIDSSEYKKRIVIEGLLARDRSAAQNSSLRID

XP_023525890.1 uncharacterized protein LOC111789371 [Cucurbita pepo subsp. pepo] | e value: 2.4e-62 | % identity: 56.31
Query:  MDSIATKERKKSKFFPCFRAPATSSHFRTERRKDAPDEPVFPFMAVGERDGVMFHTVQPLASPSDGDDEDPGLRKKKGGGGALSRAIKAVLFGTSLFSNE
        MDS A  + KK KFFPCFR+ A+S   RT R  DA DE VFPFMAV ER+G+M H VQP     DG     G +KK GGGGALSRA+KAVLFGTSL    
Subjt:  MDSIATKERKKSKFFPCFRAPATSSHFRTERRKDAPDEPVFPFMAVGERDGVMFHTVQPLASPSDGDDEDPGLRKKKGGGGALSRAIKAVLFGTSLFSNE

Query:  NSGCSASVAGEEDQKQESETKAKFTNRNSKKENQRHQAMSSISNRRIASDLYHNSSFSSSSLTSVPFSSSSICSSSCSSSDINDRSFRFNPTASNRLFRQ
                  ++ +K++ + K      NS +ENQR Q +SSIS+R  +SD    +  + SS  S PFSS+S CSSS +S +IN+ SFRF+PTASNRLFRQ
Subjt:  NSGCSASVAGEEDQKQESETKAKFTNRNSKKENQRHQAMSSISNRRIASDLYHNSSFSSSSLTSVPFSSSSICSSSCSSSDINDRSFRFNPTASNRLFRQ

Query:  INLRKICSGWFLLLVCLLSLISWGKVGAIVCTSAWILCLP--RRRIGFKSPDDKASEAAAAAIDSSEYKKRIVIEGLLARDRSAAQNSSLRID
        INLRK    WF+LLV LLSLI W K+GA VCTS WILC    RR IGF+SPDDKAS   AAA+ S EYKKR ++EG L RDRSA +NS   ID
Subjt:  INLRKICSGWFLLLVCLLSLISWGKVGAIVCTSAWILCLP--RRRIGFKSPDDKASEAAAAAIDSSEYKKRIVIEGLLARDRSAAQNSSLRID

XP_038900051.1 uncharacterized protein LOC120087211 [Benincasa hispida] | e value: 1.0e-65 | % identity: 60.47
Query:  MDSI-ATKERKKSKFFPCFRAPATSSHFRTER-RKDAPDEPVFPFMAVGERDGVMFHTVQPLASPSDGDDEDPGLRKKKGGGGALSRAIKAVLFGTSLFS
        MDSI A K +KK+K FPCFRA A+     T R ++DA DE +FPF+ V   DGV            DG D D G RKKK GGGALSRA+KAVLFGTSL  
Subjt:  MDSI-ATKERKKSKFFPCFRAPATSSHFRTER-RKDAPDEPVFPFMAVGERDGVMFHTVQPLASPSDGDDEDPGLRKKKGGGGALSRAIKAVLFGTSLFS

Query:  NENSGCSASVAGEEDQKQESETKAKFTNRNSKKENQRHQAMSSISNR--RIASDL-YHNSSFSSSSLTSVPFSSSSICSSSCSSSDINDRSFRFNPTASN
                        K+  + KAK   +NSK ENQRHQA  SISN   RIASDL YHNSS + SS TS PFSSSS CSSS SSS+++D SFRF PTASN
Subjt:  NENSGCSASVAGEEDQKQESETKAKFTNRNSKKENQRHQAMSSISNR--RIASDL-YHNSSFSSSSLTSVPFSSSSICSSSCSSSDINDRSFRFNPTASN

Query:  RLFRQINLRKICSGWFLLLVCLLSLISWGKVGAIVCTSAWILCLPRRRIGFKSPDDKASEAAAAAIDSSE-YKKRIVIEGLLARDRSAAQNSSLRI
        RLFRQIN  KI SGWFLLLVCLLSL+ WGK+GAI+CTS WILCL RRRIG KS DDK S   A A+ S E YK+R+V+EG L RD S AQNS LRI
Subjt:  RLFRQINLRKICSGWFLLLVCLLSLISWGKVGAIVCTSAWILCLPRRRIGFKSPDDKASEAAAAAIDSSE-YKKRIVIEGLLARDRSAAQNSSLRI

TrEMBL top hits: e value, % identity, alignment
A0A5A7SZR0 Uncharacterized protein | e value: 6.1e-56 | % identity: 55.03
Query:  MDSIAT-KERKKSKFFPCFRAPATSSHFRTERRKDAPDEPVFPFMAVGERDGVMFHTVQPLASPSDGDDEDPGLRKKKGGGGALSRAIKAVLFGTSLFSN
        MDSIAT K +KK+K FPCFRA A+ S     R KD   E VFPF+ V E        V+PL     G D D G RKKKG  GALSRA KAVLFGTSL   
Subjt:  MDSIAT-KERKKSKFFPCFRAPATSSHFRTERRKDAPDEPVFPFMAVGERDGVMFHTVQPLASPSDGDDEDPGLRKKKGGGGALSRAIKAVLFGTSLFSN

Query:  ENSGCSASVAGEEDQKQESETKAKFTNRNSKKENQRHQAMSSISNRR-IASD---LYHNSSFSSSSLTSVPFSSSSICSSSCSSSDINDRSFRFNPTASN
                       K+  + KAK    +  + NQ HQA+SSI NR   ASD   LYHNSS + SS TS PFSSSS CSSS +SS++++ SFRF P  SN
Subjt:  ENSGCSASVAGEEDQKQESETKAKFTNRNSKKENQRHQAMSSISNRR-IASD---LYHNSSFSSSSLTSVPFSSSSICSSSCSSSDINDRSFRFNPTASN

Query:  RLFRQINLRKICSGWFLLLVCLLSLISWGKVGAIVCTSAWILCLPRRRIGFKSPDDKASEAAAAAIDSSE-YKKRIVIEGLLARDR-SAAQNSSLRID
        RL RQINLRKI SGWF+LLVCLL+LI WGK+GAI+CTS WILCL RRR+G K       + +A A+ S E YK+RI +EG L R+R S+AQNS LRID
Subjt:  RLFRQINLRKICSGWFLLLVCLLSLISWGKVGAIVCTSAWILCLPRRRIGFKSPDDKASEAAAAAIDSSE-YKKRIVIEGLLARDR-SAAQNSSLRID

A0A6J1CDH5 uncharacterized protein LOC111010543 | e value: 5.8e-83 | % identity: 65.53
Query:  MDSIATKERKKSKFFPCFRAPATSSHFRTERRKDAPDEPVFPFMAVGERDGVMFHTVQPLASPSDGDDEDPGLRKKKGGGGALSRAIKAVLFGTSLFSNE
        MDS A K RKK+K FPCFR+PA+  + RTER KD PDE VFP MAV E DG MFH+V+ +AS  D DDED G RK+KG GGALSRAIKAVLFGT+L    
Subjt:  MDSIATKERKKSKFFPCFRAPATSSHFRTERRKDAPDEPVFPFMAVGERDGVMFHTVQPLASPSDGDDEDPGLRKKKGGGGALSRAIKAVLFGTSLFSNE

Query:  NSGCSASVAGEEDQKQESETKAKFTNRNSKKENQ-RHQAMSSISNR-RIASDLYHNSSFSSSSLTSVPFSSSSICSSSCSSSDINDRSFRFNPTASNRLF
                      K+  + KAK   +NSKKENQ RH ++SSIS+R RIASD YHN S + SS TS+PFSSSS CSSS SSSDI++RSF   PTA  RLF
Subjt:  NSGCSASVAGEEDQKQESETKAKFTNRNSKKENQ-RHQAMSSISNR-RIASDLYHNSSFSSSSLTSVPFSSSSICSSSCSSSDINDRSFRFNPTASNRLF

Query:  RQINLRKICSGWFLLLVCLLSLISWGKVGAIVCTSAWILCLPRRRIGFKSPDDKASEAAAAAIDSSEYKKRIVIEGLLARDRSAAQNSSLRID
         QINLR+ICSGW + LVC+LSLI WGK+ AIVCTS WILC   RR GFKSP+ KAS   AAAIDS E+ KRIVIEGLLARDRSAAQNSSLRID
Subjt:  RQINLRKICSGWFLLLVCLLSLISWGKVGAIVCTSAWILCLPRRRIGFKSPDDKASEAAAAAIDSSEYKKRIVIEGLLARDRSAAQNSSLRID

A0A6J1F8B9 uncharacterized protein LOC111443063 | e value: 3.2e-49 | % identity: 51.03
Query:  IATKERKKSKFFPCFRAPATSSHFRTERRKDAPDEPVFPFMAVGERDGVMFHTVQPLASPSDGDDEDPGLRKKKGGGGALSRAIKAVLFGTSLFSNENSG
        + TK +K++K FPCFRA A+ S         AP+E VFPFM V  RD V+         P D  DED    KKKGG GA SRAI+AV+FGTSL       
Subjt:  IATKERKKSKFFPCFRAPATSSHFRTERRKDAPDEPVFPFMAVGERDGVMFHTVQPLASPSDGDDEDPGLRKKKGGGGALSRAIKAVLFGTSLFSNENSG

Query:  CSASVAGEEDQKQESETKAKFTNRNSKKENQRHQAMSSISNR-RIASDL-YHNSSFSSSSLTSVPFSSSSICSSSCSSSDINDRSFRFNPTASNRLFRQI
                   K+ ++ KAK  +  + KE+QRH A S  S+R R  SDL Y N S  SS     PFSS S  SSS SS++ +D SFR  PTASNRL+ QI
Subjt:  CSASVAGEEDQKQESETKAKFTNRNSKKENQRHQAMSSISNR-RIASDL-YHNSSFSSSSLTSVPFSSSSICSSSCSSSDINDRSFRFNPTASNRLFRQI

Query:  NLRKICSGWFLLLVCLLSLISWGKVGAIVCTSAWILCLPRRRIGFKSPDDKASEAAAAAIDSSEYKKRIVIEGLLARDRSAAQNSSLRID
        N RKI SGWF+LLVCLLSL+ WGK GAI+CTS W+LCL R R  F+SPDDKAS     A+ S EY    ++E  L RDR AA+NS+LRID
Subjt:  NLRKICSGWFLLLVCLLSLISWGKVGAIVCTSAWILCLPRRRIGFKSPDDKASEAAAAAIDSSEYKKRIVIEGLLARDRSAAQNSSLRID

A0A6J1FRN0 uncharacterized protein LOC111446618 isoform X1 | e value: 1.1e-62 | % identity: 55.97
Query:  MDSIATKERKKSKFFPCFRAPATSSHFRTERRKDAPDEPVFPFMAVGERDGVMFHTVQPLASPSDGDDEDPGLRKKKGGGGALSRAIKAVLFGTSLFSNE
        MDS A  + KK KFFPCFR+ A+S   RT R  +A DE VFPFMAV ER+G+M H VQP     DG     G +KK GGGGALSRA+KAVLFGTSL    
Subjt:  MDSIATKERKKSKFFPCFRAPATSSHFRTERRKDAPDEPVFPFMAVGERDGVMFHTVQPLASPSDGDDEDPGLRKKKGGGGALSRAIKAVLFGTSLFSNE

Query:  NSGCSASVAGEEDQKQESETKAKFTNRNSKKENQRHQAMSSISNRRIASDLYHNSSFSSSSLTSVPFSSSSICSSSCSSSDINDRSFRFNPTASNRLFRQ
             A    ++ +KQ+          NS +ENQR + +SSIS+R  +SD    +S + SS TS PFSS+S CSSS +S +I + SFRF+PTASNRLFRQ
Subjt:  NSGCSASVAGEEDQKQESETKAKFTNRNSKKENQRHQAMSSISNRRIASDLYHNSSFSSSSLTSVPFSSSSICSSSCSSSDINDRSFRFNPTASNRLFRQ

Query:  INLRKICSGWFLLLVCLLSLISWGKVGAIVCTSAWILCLP--RRRIGFKSPDDKASEAAAAAIDSSEYKKRIVIEGLLARDRSAAQNSSLRID
        INLRK    WF+LLV LLSL+ W K+GA VCTS WILC    RR IGF+SPDDKAS   AAA+ S EYKKR ++EG L RDRSA +NS   ID
Subjt:  INLRKICSGWFLLLVCLLSLISWGKVGAIVCTSAWILCLP--RRRIGFKSPDDKASEAAAAAIDSSEYKKRIVIEGLLARDRSAAQNSSLRID

A0A6J1IWY5 uncharacterized protein LOC111480657 isoform X1 | e value: 5.5e-65 | % identity: 56.66
Query:  MDSIATKERKKSKFFPCFRAPATSSHFRTERRKDAPDEPVFPFMAVGERDGVMFHTVQPLASPSDGDDEDPGLRKKKGGGGALSRAIKAVLFGTSLFSNE
        MDS A  + KK KFFPCFR+ A+SS  RT R  DA DE VFPFMAV ER+G+M H VQP     DG  +  G +KK GGGGALSRA+KAVLFGTSL    
Subjt:  MDSIATKERKKSKFFPCFRAPATSSHFRTERRKDAPDEPVFPFMAVGERDGVMFHTVQPLASPSDGDDEDPGLRKKKGGGGALSRAIKAVLFGTSLFSNE

Query:  NSGCSASVAGEEDQKQESETKAKFTNRNSKKENQRHQAMSSISNRRIASDLYHNSSFSSSSLTSVPFSSSSICSSSCSSSDINDRSFRFNPTASNRLFRQ
                  ++ +K++ + K      NS +ENQR Q +SSIS+R  +SD    +S + SS  S PFSS+S CSSS +SS+IN+ SFRF+PTASNRLFRQ
Subjt:  NSGCSASVAGEEDQKQESETKAKFTNRNSKKENQRHQAMSSISNRRIASDLYHNSSFSSSSLTSVPFSSSSICSSSCSSSDINDRSFRFNPTASNRLFRQ

Query:  INLRKICSGWFLLLVCLLSLISWGKVGAIVCTSAWILCLP--RRRIGFKSPDDKASEAAAAAIDSSEYKKRIVIEGLLARDRSAAQNSSLRID
        INLR     WF+LLVCLLSL+ W K+GA VCTS WILC    RR IGF+SPDDKAS   AAA+ S EY+KR ++EG L RDRSA +NS   ID
Subjt:  INLRKICSGWFLLLVCLLSLISWGKVGAIVCTSAWILCLP--RRRIGFKSPDDKASEAAAAAIDSSEYKKRIVIEGLLARDRSAAQNSSLRID

SwissProt top hits
No hits found

Arabidopsis top hits
No hits found

Sequences
CDS sequence
GCGAGTAATCAGGGGAAATGATGAGGAACAATTTTTTTTTTTTTTTTTTTTTTTATATACAGCGCGATTTTTCCATTTTCCTCGTCTCCTCGTTTTCGCTTCTTATTCTA
CTCCCAACCCAGAGATTGCCATAGATGAAGTCATTGCGAACATGTCGAAGTTGAGCCTCCAAATTCGAGTTTTATGTCGAATGGACTCCATTGCAACCAAAGAGAGGAAG
AAAAGTAAATTTTTTCCGTGTTTCCGAGCGCCGGCCACCAGCAGCCATTTCAGGACGGAACGACGTAAGGATGCTCCAGACGAGCCGGTTTTTCCGTTCATGGCGGTGGG
AGAGAGGGACGGTGTGATGTTCCACACTGTGCAACCGTTGGCTTCGCCGTCGGATGGAGACGATGAAGATCCCGGTCTCCGGAAAAAGAAAGGCGGCGGTGGTGCTTTAT
CGCGGGCAATTAAGGCCGTCTTATTCGGAACGTCATTGTTTTCTAACGAAAATTCCGGATGTTCCGCGAGTGTTGCAGGGGAAGAAGATCAGAAACAGGAAAGCGAAACA
AAAGCAAAATTCACAAATCGGAATTCGAAAAAGGAGAATCAGAGGCATCAAGCTATGTCCTCAATCAGCAACAGAAGAATTGCTTCAGATCTCTACCACAACTCTTCATT
CTCTTCTTCTTCGCTCACTTCTGTGCCATTTTCATCCTCCTCGATCTGCAGTTCGTCCTGTTCCTCCTCAGACATCAACGACAGATCATTTCGATTTAATCCAACAGCCT
CGAATCGATTGTTCAGACAGATAAATCTCAGAAAAATCTGCAGCGGTTGGTTTCTGCTACTGGTATGTCTTCTGAGCTTGATTTCATGGGGAAAAGTCGGTGCTATTGTC
TGCACTTCTGCTTGGATCCTCTGTTTGCCTCGCCGGAGAATCGGATTCAAGTCGCCGGATGACAAGGCCAGTGAGGCGGCGGCGGCGGCGATTGATTCCAGTGAATACAA
GAAGAGAATCGTAATCGAAGGGCTGCTGGCGAGGGACCGTTCAGCTGCTCAAAATTCAAGCTTACGCATTGATTGA
mRNA sequence
GCGAGTAATCAGGGGAAATGATGAGGAACAATTTTTTTTTTTTTTTTTTTTTTTATATACAGCGCGATTTTTCCATTTTCCTCGTCTCCTCGTTTTCGCTTCTTATTCTA
CTCCCAACCCAGAGATTGCCATAGATGAAGTCATTGCGAACATGTCGAAGTTGAGCCTCCAAATTCGAGTTTTATGTCGAATGGACTCCATTGCAACCAAAGAGAGGAAG
AAAAGTAAATTTTTTCCGTGTTTCCGAGCGCCGGCCACCAGCAGCCATTTCAGGACGGAACGACGTAAGGATGCTCCAGACGAGCCGGTTTTTCCGTTCATGGCGGTGGG
AGAGAGGGACGGTGTGATGTTCCACACTGTGCAACCGTTGGCTTCGCCGTCGGATGGAGACGATGAAGATCCCGGTCTCCGGAAAAAGAAAGGCGGCGGTGGTGCTTTAT
CGCGGGCAATTAAGGCCGTCTTATTCGGAACGTCATTGTTTTCTAACGAAAATTCCGGATGTTCCGCGAGTGTTGCAGGGGAAGAAGATCAGAAACAGGAAAGCGAAACA
AAAGCAAAATTCACAAATCGGAATTCGAAAAAGGAGAATCAGAGGCATCAAGCTATGTCCTCAATCAGCAACAGAAGAATTGCTTCAGATCTCTACCACAACTCTTCATT
CTCTTCTTCTTCGCTCACTTCTGTGCCATTTTCATCCTCCTCGATCTGCAGTTCGTCCTGTTCCTCCTCAGACATCAACGACAGATCATTTCGATTTAATCCAACAGCCT
CGAATCGATTGTTCAGACAGATAAATCTCAGAAAAATCTGCAGCGGTTGGTTTCTGCTACTGGTATGTCTTCTGAGCTTGATTTCATGGGGAAAAGTCGGTGCTATTGTC
TGCACTTCTGCTTGGATCCTCTGTTTGCCTCGCCGGAGAATCGGATTCAAGTCGCCGGATGACAAGGCCAGTGAGGCGGCGGCGGCGGCGATTGATTCCAGTGAATACAA
GAAGAGAATCGTAATCGAAGGGCTGCTGGCGAGGGACCGTTCAGCTGCTCAAAATTCAAGCTTACGCATTGATTGA
Protein sequence
RVIRGNDEEQFFFFFFFLYTARFFHFPRLLVFASYSTPNPEIAIDEVIANMSKLSLQIRVLCRMDSIATKERKKSKFFPCFRAPATSSHFRTERRKDAPDEPVFPFMAVG
ERDGVMFHTVQPLASPSDGDDEDPGLRKKKGGGGALSRAIKAVLFGTSLFSNENSGCSASVAGEEDQKQESETKAKFTNRNSKKENQRHQAMSSISNRRIASDLYHNSSF
SSSSLTSVPFSSSSICSSSCSSSDINDRSFRFNPTASNRLFRQINLRKICSGWFLLLVCLLSLISWGKVGAIVCTSAWILCLPRRRIGFKSPDDKASEAAAAAIDSSEYK
KRIVIEGLLARDRSAAQNSSLRID
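
The relationship between the listed nucleotide and protein sequences can be checked with a short translation routine. The following is a minimal sketch, assuming the standard nuclear genetic code; the names `CODON_TABLE` and `translate` are illustrative. It translates only the first 34 bases of the CDS listed above. The opening residues of the listed protein (RVIRGNDEEQF...) are consistent with reading that sequence from its second base (offset 1) rather than its first. This observation covers only the checked prefix and is not a statement about how the database derived the protein.

```python
# Standard genetic code in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(nt, frame=0):
    """Translate a DNA string from the given 0-based offset.

    Stop codons are rendered as '*', unknown codons as 'X';
    a trailing partial codon is ignored.
    """
    nt = nt.upper()
    codons = (nt[i:i + 3] for i in range(frame, len(nt) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


# First 34 bases of the CDS sequence listed in this record.
CDS_PREFIX = "GCGAGTAATCAGGGGAAATGATGAGGAACAATTT"

if __name__ == "__main__":
    print(translate(CDS_PREFIX, frame=0))  # reading from the first base
    print(translate(CDS_PREFIX, frame=1))  # reading from the second base
```

Reading from the first base hits stop codons within the first eight codons, whereas the offset-1 reading matches the start of the listed protein.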