; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr023422 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr023422
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionUnknown protein
Genome locationtig00000892:3142939..3147654
RNA-Seq ExpressionSgr023422
SyntenySgr023422
Gene Ontology termsNA
InterPro domainsIPR040344 - Uncharacterized protein At3g17950-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6570710.1 hypothetical protein SDJN03_29625, partial [Cucurbita argyrosperma subsp. sororia]8.8e-11068.53Show/hide
Query:  EDCGEDKFEILLSTLELRLPSSSSSTGDLRKEEEEEDRPDGL-KTPTTTSSEPAAVLQCPPAPRKPKRLPSTKRKAARGWPTLVPHFFV--EMESLFPPP
        ED  EDKFEILLSTLELRLP SS       +E+ +E RPD L KTP   S EPAA+L+CPP PRKP+RLPS +++ A G  T +PH +V  EM+SL    
Subjt:  EDCGEDKFEILLSTLELRLPSSSSSTGDLRKEEEEEDRPDGL-KTPTTTSSEPAAVLQCPPAPRKPKRLPSTKRKAARGWPTLVPHFFV--EMESLFPPP

Query:  LLGGDFIGRGFPFKSNCQA---FHPVLYLPHMAQQGEGWPLGLQPLNVRVGLPGNRDYSGSVSFNTLMTASPISAFTDSSSDLDTESTGSFFHDKSITLG
        L   +    GFP  S  Q    F   L   HMAQQGEGWPLGLQPLNVRVG+PGNRD  GSVSFNTLMTASP+S FTDSS+DLDTESTGSFFHD +ITLG
Subjt:  LLGGDFIGRGFPFKSNCQA---FHPVLYLPHMAQQGEGWPLGLQPLNVRVGLPGNRDYSGSVSFNTLMTASPISAFTDSSSDLDTESTGSFFHDKSITLG

Query:  SLIGVSRILELSRRSIRGRKTESTKDNRSNARSRTWFFSLCSRESTDADIIVDNGPSLGHFLAEERRAADECRRNESVIIYGPDDLALAEPAPVPNSLFI
        SLIGVS ILELSRRSIRGR+TEST   RSNARSRTW F LCSRESTD D I DN PSLGHFLAEERRAADE RRN+ + +    +L LAE AP PNSLFI
Subjt:  SLIGVSRILELSRRSIRGRKTESTKDNRSNARSRTWFFSLCSRESTDADIIVDNGPSLGHFLAEERRAADECRRNESVIIYGPDDLALAEPAPVPNSLFI

Query:  NGCVAPPQSSLGSDVENGRNGGTEPSNDNGVAVLCACICG
        NGCVAPPQ SL SD E+GRNGGTEP+ND+ VA+LCACICG
Subjt:  NGCVAPPQSSLGSDVENGRNGGTEPSNDNGVAVLCACICG

KAG6605464.1 hypothetical protein SDJN03_02781, partial [Cucurbita argyrosperma subsp. sororia]2.9e-8979.9Show/hide
Query:  MAQQGEGWPLGLQPLNVRVGLPGNRDYSGSVSFNTLMTASPISAFTDSSSDLDTESTGSFFHDKSITLGSLIGVSRILELSRRSIRGRKTESTKDNRSNA
        MAQQGEGWPLGLQPLN+RVG+PGNRDY GS+SFNTL+TASPIS FTDSSSDLDTESTGSFFHDKSITLGSLIGVS ILELSRRSIRGR+TESTK+ RSN 
Subjt:  MAQQGEGWPLGLQPLNVRVGLPGNRDYSGSVSFNTLMTASPISAFTDSSSDLDTESTGSFFHDKSITLGSLIGVSRILELSRRSIRGRKTESTKDNRSNA

Query:  RSRTWFFSLCSRESTDADIIVDNGPSLGHFLAEERRAADECRRNESVIIYGPDDLALAEPAPVPNSLFINGCVAPPQSSLGSDVENGRNGGTEPSNDNGV
        RSRTW FSLCSR++TDAD + +NGPSLG FL EERRAADE RRN+SV++YG D++ LA+ AP PNSLFINGCVAPPQSSL SD +  RNGGTEP ND+G+
Subjt:  RSRTWFFSLCSRESTDADIIVDNGPSLGHFLAEERRAADECRRNESVIIYGPDDLALAEPAPVPNSLFINGCVAPPQSSLGSDVENGRNGGTEPSNDNGV

Query:  AVLCACICG
        A+LC+C+CG
Subjt:  AVLCACICG

XP_022148431.1 uncharacterized protein At3g17950-like [Momordica charantia]5.2e-9486.67Show/hide
Query:  MAQQGEGWPLGLQPLNVR-VGLPGNRDYSGSVSFNTLMTASPISAFTDSSSDLDTESTGSFFHDKSITLGSLIGVSRILELSRRSIRGRKTESTKDNRSN
        MAQQGEGWPLGLQPLNVR +G+PGNRDYSGSVSFNTLMTASPIS F+DSSSDLDTESTGSFF DKSITLGSLIGVS ILELSRRS+RGRKTE+T D R N
Subjt:  MAQQGEGWPLGLQPLNVR-VGLPGNRDYSGSVSFNTLMTASPISAFTDSSSDLDTESTGSFFHDKSITLGSLIGVSRILELSRRSIRGRKTESTKDNRSN

Query:  ARSRTWFFSLCSRESTDADIIVDNGPSLGHFLAEERRAADECRRNESVIIYGPDDLALAEPAPVPNSLFINGCVAPPQSSLGSDVENGRNGGTEPSNDNG
         RSR WFFSLCSRESTDA+ I DNGPSLGHFLAEERRAADE RRN+SV++YG DDLALAE A  PNSLFINGCVAPPQSSLGS+VE GRNGGTEP+NDNG
Subjt:  ARSRTWFFSLCSRESTDADIIVDNGPSLGHFLAEERRAADECRRNESVIIYGPDDLALAEPAPVPNSLFINGCVAPPQSSLGSDVENGRNGGTEPSNDNG

Query:  VAVLCACICG
        VAVLCACICG
Subjt:  VAVLCACICG

XP_038902040.1 uncharacterized protein At3g17950-like isoform X1 [Benincasa hispida]9.1e-9184.39Show/hide
Query:  GEGWPLGLQPLNVRVGLPGNRDYSGSVSFNTLMTASPISAFTDSSSDLDTESTGSFFHDKSITLGSLIGVSRILELSRRSIRGRKTESTKDNRSNARSRT
        GEGWPLGLQPLNVRVG+PGNRDY GSVSFNTLMTASPIS F+DSSSDLDTESTGSFFHDKSITLGSLIGVS ILELSRRSIRGR+TE+TK+ RSN RSR 
Subjt:  GEGWPLGLQPLNVRVGLPGNRDYSGSVSFNTLMTASPISAFTDSSSDLDTESTGSFFHDKSITLGSLIGVSRILELSRRSIRGRKTESTKDNRSNARSRT

Query:  WFFSLCSRESTDADIIVDNGPSLGHFLAEERRAADECRRNESVIIYGPDDLALAEPAPVPNSLFINGCVAPPQSSLGSDVENGRNGGTEPSNDNGVAVLC
        WFFSLCSREST+AD  VD+GPSLGHFLAEERRAADE RRN+SVI+YGPD+L LA+  P PNSLFINGCVAPPQSS+GS+ ENGRNGGTEP+NDN  A++C
Subjt:  WFFSLCSRESTDADIIVDNGPSLGHFLAEERRAADECRRNESVIIYGPDDLALAEPAPVPNSLFINGCVAPPQSSLGSDVENGRNGGTEPSNDNGVAVLC

Query:  ACICG
        ACICG
Subjt:  ACICG

XP_038902041.1 uncharacterized protein At3g17950-like isoform X2 [Benincasa hispida]5.7e-9384.69Show/hide
Query:  MAQQGEGWPLGLQPLNVRVGLPGNRDYSGSVSFNTLMTASPISAFTDSSSDLDTESTGSFFHDKSITLGSLIGVSRILELSRRSIRGRKTESTKDNRSNA
        MAQQGEGWPLGLQPLNVRVG+PGNRDY GSVSFNTLMTASPIS F+DSSSDLDTESTGSFFHDKSITLGSLIGVS ILELSRRSIRGR+TE+TK+ RSN 
Subjt:  MAQQGEGWPLGLQPLNVRVGLPGNRDYSGSVSFNTLMTASPISAFTDSSSDLDTESTGSFFHDKSITLGSLIGVSRILELSRRSIRGRKTESTKDNRSNA

Query:  RSRTWFFSLCSRESTDADIIVDNGPSLGHFLAEERRAADECRRNESVIIYGPDDLALAEPAPVPNSLFINGCVAPPQSSLGSDVENGRNGGTEPSNDNGV
        RSR WFFSLCSREST+AD  VD+GPSLGHFLAEERRAADE RRN+SVI+YGPD+L LA+  P PNSLFINGCVAPPQSS+GS+ ENGRNGGTEP+NDN  
Subjt:  RSRTWFFSLCSRESTDADIIVDNGPSLGHFLAEERRAADECRRNESVIIYGPDDLALAEPAPVPNSLFINGCVAPPQSSLGSDVENGRNGGTEPSNDNGV

Query:  AVLCACICG
        A++CACICG
Subjt:  AVLCACICG

TrEMBL top hitse value%identityAlignment
A0A0A0KCA6 Uncharacterized protein6.6e-8781.9Show/hide
Query:  MAQQGEGWPLGLQPLNVRVGLPGNRDYSGSVSFNTLMTASPISAFTDSSSDLDTESTGSFFHDKSITLGSLIGVSRILELSRRSIRGRKTESTKDNRSNA
        MAQQG+GWPLGLQPLNVRVG+PGNRDY GSVSFNTLMTASPIS F+DSSSDLDTESTGSFFHDKSITLGSLIGVS ILELSRRSIRGR+TESTKD RSN 
Subjt:  MAQQGEGWPLGLQPLNVRVGLPGNRDYSGSVSFNTLMTASPISAFTDSSSDLDTESTGSFFHDKSITLGSLIGVSRILELSRRSIRGRKTESTKDNRSNA

Query:  RSRTWFFSLCSRESTDADIIVDNGPSLGHFLAEERRAADECRR-NESVIIYGPDDLALAEPAPVPNSLFINGCVAPPQSSLGSDVENGRNGGTEPSNDNG
        +SRTWFFSLCSRESTDAD I ++GPSLGHFLAEERRAADE RR N+S I+YG D+L LA+  P PNSLFINGCVAPPQ S+GS+ E   NGGTEP+NDN 
Subjt:  RSRTWFFSLCSRESTDADIIVDNGPSLGHFLAEERRAADECRR-NESVIIYGPDDLALAEPAPVPNSLFINGCVAPPQSSLGSDVENGRNGGTEPSNDNG

Query:  VAVLCACICG
        VA++C+CICG
Subjt:  VAVLCACICG

A0A5A7V897 Uncharacterized protein1.1e-8682.38Show/hide
Query:  MAQQGEGWPLGLQPLNVRVGLPGNRDYSGSVSFNTLMTASPISAFTDSSSDLDTESTGSFFHDKSITLGSLIGVSRILELSRRSIRGRKTESTKDNRSNA
        MAQQG+GWPLGLQPLNVRVG+PGNRDY GSVSFNTLMTASPIS F+DSSSDLDTESTGSFFHDKSITLGSLIGVS ILELSRRSIRGR+TESTKD RSNA
Subjt:  MAQQGEGWPLGLQPLNVRVGLPGNRDYSGSVSFNTLMTASPISAFTDSSSDLDTESTGSFFHDKSITLGSLIGVSRILELSRRSIRGRKTESTKDNRSNA

Query:  RSRTWFFSLCSRESTDADIIVDNGPSLGHFLAEERRAADECRR-NESVIIYGPDDLALAEPAPVPNSLFINGCVAPPQSSLGSDVENGRNGGTEPSNDNG
        +SRTWFFSLCSRESTDAD I ++GPSLGHFLAEERRAADE RR N+S I+YG D+L LA+  P PNSLFINGCVAPPQSS+GS   N  NGGTE +NDN 
Subjt:  RSRTWFFSLCSRESTDADIIVDNGPSLGHFLAEERRAADECRR-NESVIIYGPDDLALAEPAPVPNSLFINGCVAPPQSSLGSDVENGRNGGTEPSNDNG

Query:  VAVLCACICG
        VA++C+CICG
Subjt:  VAVLCACICG

A0A6J1D5C9 uncharacterized protein At3g17950-like2.5e-9486.67Show/hide
Query:  MAQQGEGWPLGLQPLNVR-VGLPGNRDYSGSVSFNTLMTASPISAFTDSSSDLDTESTGSFFHDKSITLGSLIGVSRILELSRRSIRGRKTESTKDNRSN
        MAQQGEGWPLGLQPLNVR +G+PGNRDYSGSVSFNTLMTASPIS F+DSSSDLDTESTGSFF DKSITLGSLIGVS ILELSRRS+RGRKTE+T D R N
Subjt:  MAQQGEGWPLGLQPLNVR-VGLPGNRDYSGSVSFNTLMTASPISAFTDSSSDLDTESTGSFFHDKSITLGSLIGVSRILELSRRSIRGRKTESTKDNRSN

Query:  ARSRTWFFSLCSRESTDADIIVDNGPSLGHFLAEERRAADECRRNESVIIYGPDDLALAEPAPVPNSLFINGCVAPPQSSLGSDVENGRNGGTEPSNDNG
         RSR WFFSLCSRESTDA+ I DNGPSLGHFLAEERRAADE RRN+SV++YG DDLALAE A  PNSLFINGCVAPPQSSLGS+VE GRNGGTEP+NDNG
Subjt:  ARSRTWFFSLCSRESTDADIIVDNGPSLGHFLAEERRAADECRRNESVIIYGPDDLALAEPAPVPNSLFINGCVAPPQSSLGSDVENGRNGGTEPSNDNG

Query:  VAVLCACICG
        VAVLCACICG
Subjt:  VAVLCACICG

A0A6J1G630 uncharacterized protein At3g17950-like5.4e-8979.43Show/hide
Query:  MAQQGEGWPLGLQPLNVRVGLPGNRDYSGSVSFNTLMTASPISAFTDSSSDLDTESTGSFFHDKSITLGSLIGVSRILELSRRSIRGRKTESTKDNRSNA
        MAQQGEGWPLGLQPLN+RVG+PGNRDY GS+SFNTL+TASPIS FTDSSSDLDTESTGSFFHDKSITLGSLIGVS ILELSRRSIRGR+TESTK+ RSN 
Subjt:  MAQQGEGWPLGLQPLNVRVGLPGNRDYSGSVSFNTLMTASPISAFTDSSSDLDTESTGSFFHDKSITLGSLIGVSRILELSRRSIRGRKTESTKDNRSNA

Query:  RSRTWFFSLCSRESTDADIIVDNGPSLGHFLAEERRAADECRRNESVIIYGPDDLALAEPAPVPNSLFINGCVAPPQSSLGSDVENGRNGGTEPSNDNGV
        RSRTW FSLCSR++TDAD + +N PSLG FL EERRAADE RRN+SV++YG D++ LA+ AP PNSLFINGCVAPPQ SLGSD +  RNGGTEP ND+G+
Subjt:  RSRTWFFSLCSRESTDADIIVDNGPSLGHFLAEERRAADECRRNESVIIYGPDDLALAEPAPVPNSLFINGCVAPPQSSLGSDVENGRNGGTEPSNDNGV

Query:  AVLCACICG
        A+LC+C+CG
Subjt:  AVLCACICG

A0A6J1KX21 uncharacterized protein At3g17950-like1.3e-8779.9Show/hide
Query:  MAQQGEGWPLGLQPLNVRVGLPGNRDYSGSVSFNTLMTASPISAFTDSSSDLDTESTGSFFHDKSITLGSLIGVSRILELSRRSIRGRKTESTKDNRSNA
        MAQQGEGWPLGLQPLN+RVG+PGNRDY GS+SFNTL+TASPIS FTDSSSDLDTESTGSFFHDKSITLGSLIGVS ILELSRRSIRGR+TESTK+ RSN 
Subjt:  MAQQGEGWPLGLQPLNVRVGLPGNRDYSGSVSFNTLMTASPISAFTDSSSDLDTESTGSFFHDKSITLGSLIGVSRILELSRRSIRGRKTESTKDNRSNA

Query:  RSRTWFFSLCSRESTDADIIVDNGPSLGHFLAEERRAADECRRNESVIIYGPDDLALAEPAPVPNSLFINGCVAPPQSSLGSDVENGRNGGTEPSNDNGV
        RSRTW FSLCSR++TDAD +  NGPSLG FL EERRAADE RRN+SV++YG D++ LA+ AP PNSLFINGCVAPPQ SLGSD + GRNGGTEP ND+G+
Subjt:  RSRTWFFSLCSRESTDADIIVDNGPSLGHFLAEERRAADECRRNESVIIYGPDDLALAEPAPVPNSLFINGCVAPPQSSLGSDVENGRNGGTEPSNDNGV

Query:  AVLCACICG
        A+LC+ +CG
Subjt:  AVLCACICG

SwissProt top hitse value%identityAlignment
Q6DR24 Uncharacterized protein At3g179501.7e-0734.68Show/hide
Query:  SSSDLDTESTGSFFHDKSITLGSLIG------------------VSRILELSR-RSIRGRKTESTKDNRSNA------RSRTWFFSLCSRESTDADIIVD
        SSSDLDTESTGSFFHD+SITLG+L+G                  VS  + +SR  S   R+    K   SN+      R R W +  C  +  DA     
Subjt:  SSSDLDTESTGSFFHDKSITLGSLIG------------------VSRILELSR-RSIRGRKTESTKDNRSNA------RSRTWFFSLCSRESTDADIIVD

Query:  NG----------PSLGHFLAEERRAADECRRN------ESVIIYGPDDLALAEPAPVPNSLFINGCVAPPQSS
        NG           SLG +L  ERR  DE   N      E  ++    D    +P     +LF +G V PP S+
Subjt:  NG----------PSLGHFLAEERRAADECRRN------ESVIIYGPDDLALAEPAPVPNSLFINGCVAPPQSS

Arabidopsis top hitse value%identityAlignment
AT2G28870.1 unknown protein1.8e-0442.67Show/hide
Query:  LRKEEEEEDRPDGLKTPTTTSSEPAAVLQ-CPPAPRKPKRLPSTK---RKAARGWPTLVPHFFVEMESLFPPPLL
        ++KEEEEED  D  KTPT +    +A+ + CPPAPRKPKR+PS     R + R    ++ +   E++ LF P  L
Subjt:  LRKEEEEEDRPDGLKTPTTTSSEPAAVLQ-CPPAPRKPKRLPSTK---RKAARGWPTLVPHFFVEMESLFPPPLL

AT3G17950.1 unknown protein1.2e-0834.68Show/hide
Query:  SSSDLDTESTGSFFHDKSITLGSLIG------------------VSRILELSR-RSIRGRKTESTKDNRSNA------RSRTWFFSLCSRESTDADIIVD
        SSSDLDTESTGSFFHD+SITLG+L+G                  VS  + +SR  S   R+    K   SN+      R R W +  C  +  DA     
Subjt:  SSSDLDTESTGSFFHDKSITLGSLIG------------------VSRILELSR-RSIRGRKTESTKDNRSNA------RSRTWFFSLCSRESTDADIIVD

Query:  NG----------PSLGHFLAEERRAADECRRN------ESVIIYGPDDLALAEPAPVPNSLFINGCVAPPQSS
        NG           SLG +L  ERR  DE   N      E  ++    D    +P     +LF +G V PP S+
Subjt:  NG----------PSLGHFLAEERRAADECRRN------ESVIIYGPDDLALAEPAPVPNSLFINGCVAPPQSS

AT5G02440.1 unknown protein1.4e-2543.86Show/hide
Query:  MAQQGEGWPLGLQPLNVRVG--LPGNRDY-------SGSVSFNTLMTASPISAFTDSSSDLDTESTGSFFHDKSITLGSLIGVSRILELSRRSIRGR--K
        MA Q EGWPLGL+P+N R+G    G   +       +GS+SF++L++ SP S    SSSDLD++S GSFF D+S TLG+LIG+S  LELSRRS R R  +
Subjt:  MAQQGEGWPLGLQPLNVRVG--LPGNRDY-------SGSVSFNTLMTASPISAFTDSSSDLDTESTGSFFHDKSITLGSLIGVSRILELSRRSIRGR--K

Query:  TESTKDNRSNARSRT----WFFSLCSRESTDADIIV------------DNGPSLGHFLAEERRAADECRRN
        T + ++++ +   +T    W FS+CS+ ST+A +I             +N  SLGHFL  ERRA     R+
Subjt:  TESTKDNRSNARSRT----WFFSLCSRESTDADIIV------------DNGPSLGHFLAEERRAADECRRN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATCGAGCTGCAGCAGAAAGTGGAAAAAGAAGCCGACGACGACAAGGAAGATTGTGGAGAAGACAAGTTCGAGATTTTACTGTCGACTTTGGAGCTAAGGTTGCCGTC
TTCTTCTTCTTCTACTGGAGATTTACGTAAGGAAGAAGAAGAAGAAGACCGGCCCGATGGTTTGAAGACTCCGACGACGACGTCTTCGGAACCTGCGGCGGTTCTTCAGT
GTCCTCCGGCCCCGAGAAAACCGAAGAGGTTGCCGTCGACTAAACGAAAGGCTGCCCGTGGCTGGCCGACTCTGGTGCCCCATTTCTTCGTCGAGATGGAGTCGTTGTTT
CCTCCGCCGCTTCTCGGCGGTGACTTCATTGGACGTGGGTTCCCCTTCAAATCAAACTGTCAAGCATTTCATCCAGTACTCTACTTGCCACACATGGCTCAACAGGGAGA
AGGGTGGCCTCTCGGTCTACAGCCACTGAATGTTAGAGTTGGGCTACCTGGAAACCGAGACTATTCAGGATCAGTGTCTTTCAACACTTTGATGACGGCTTCTCCTATTT
CGGCCTTCACGGATTCTTCATCCGACTTGGACACAGAGTCGACGGGATCTTTCTTCCACGACAAGAGCATCACACTTGGGAGTCTGATAGGAGTTTCTAGGATCTTGGAA
CTCTCAAGGAGATCAATTAGGGGAAGAAAAACAGAGAGCACAAAGGACAACAGGAGCAATGCCAGGTCTAGAACTTGGTTTTTCTCTCTGTGTTCGAGGGAGAGTACCGA
TGCTGATATTATCGTCGACAATGGCCCGTCGTTAGGCCACTTCCTGGCAGAAGAAAGAAGAGCAGCCGACGAATGTAGGAGAAATGAGAGTGTAATCATTTATGGACCAG
ATGATTTAGCATTAGCTGAGCCTGCTCCAGTACCAAACTCCCTATTCATCAATGGCTGTGTGGCACCTCCTCAATCAAGCCTTGGTTCAGATGTTGAAAATGGGAGAAAT
GGAGGAACTGAGCCTTCAAATGACAATGGAGTTGCAGTGCTTTGTGCTTGCATTTGTGGACAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGATCGAGCTGCAGCAGAAAGTGGAAAAAGAAGCCGACGACGACAAGGAAGATTGTGGAGAAGACAAGTTCGAGATTTTACTGTCGACTTTGGAGCTAAGGTTGCCGTC
TTCTTCTTCTTCTACTGGAGATTTACGTAAGGAAGAAGAAGAAGAAGACCGGCCCGATGGTTTGAAGACTCCGACGACGACGTCTTCGGAACCTGCGGCGGTTCTTCAGT
GTCCTCCGGCCCCGAGAAAACCGAAGAGGTTGCCGTCGACTAAACGAAAGGCTGCCCGTGGCTGGCCGACTCTGGTGCCCCATTTCTTCGTCGAGATGGAGTCGTTGTTT
CCTCCGCCGCTTCTCGGCGGTGACTTCATTGGACGTGGGTTCCCCTTCAAATCAAACTGTCAAGCATTTCATCCAGTACTCTACTTGCCACACATGGCTCAACAGGGAGA
AGGGTGGCCTCTCGGTCTACAGCCACTGAATGTTAGAGTTGGGCTACCTGGAAACCGAGACTATTCAGGATCAGTGTCTTTCAACACTTTGATGACGGCTTCTCCTATTT
CGGCCTTCACGGATTCTTCATCCGACTTGGACACAGAGTCGACGGGATCTTTCTTCCACGACAAGAGCATCACACTTGGGAGTCTGATAGGAGTTTCTAGGATCTTGGAA
CTCTCAAGGAGATCAATTAGGGGAAGAAAAACAGAGAGCACAAAGGACAACAGGAGCAATGCCAGGTCTAGAACTTGGTTTTTCTCTCTGTGTTCGAGGGAGAGTACCGA
TGCTGATATTATCGTCGACAATGGCCCGTCGTTAGGCCACTTCCTGGCAGAAGAAAGAAGAGCAGCCGACGAATGTAGGAGAAATGAGAGTGTAATCATTTATGGACCAG
ATGATTTAGCATTAGCTGAGCCTGCTCCAGTACCAAACTCCCTATTCATCAATGGCTGTGTGGCACCTCCTCAATCAAGCCTTGGTTCAGATGTTGAAAATGGGAGAAAT
GGAGGAACTGAGCCTTCAAATGACAATGGAGTTGCAGTGCTTTGTGCTTGCATTTGTGGACAGTGA
Protein sequenceShow/hide protein sequence
MIELQQKVEKEADDDKEDCGEDKFEILLSTLELRLPSSSSSTGDLRKEEEEEDRPDGLKTPTTTSSEPAAVLQCPPAPRKPKRLPSTKRKAARGWPTLVPHFFVEMESLF
PPPLLGGDFIGRGFPFKSNCQAFHPVLYLPHMAQQGEGWPLGLQPLNVRVGLPGNRDYSGSVSFNTLMTASPISAFTDSSSDLDTESTGSFFHDKSITLGSLIGVSRILE
LSRRSIRGRKTESTKDNRSNARSRTWFFSLCSRESTDADIIVDNGPSLGHFLAEERRAADECRRNESVIIYGPDDLALAEPAPVPNSLFINGCVAPPQSSLGSDVENGRN
GGTEPSNDNGVAVLCACICGQ