; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr019058 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr019058
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionDUF309 domain protein
Genome locationtig00153260:1213675..1218316
RNA-Seq ExpressionSgr019058
SyntenySgr019058
Gene Ontology termsNA
InterPro domainsIPR005500 - Protein of unknown function DUF309
IPR023203 - TTHA0068-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7027740.1 hypothetical protein SDJN02_08917, partial [Cucurbita argyrosperma subsp. argyrosperma]6.7e-10685.22Show/hide
Query:  MAFLPSLYVSSSLSPLLRPRHHLNSFFCHESSLPNPPRNS-RRRRSTISLSFRTSYRFSVDH--EDEDEQVIRDFGFDEAVDLFNQGAYYDCHDILEILW
        MA LPSL VSS+ S LLRP  H NS F H SSLP PPRN+ RR ++T SLSFRTSYRF+VDH  EDEDEQ+ R+F FDEAVDLFNQGAYYDCHD+LEILW
Subjt:  MAFLPSLYVSSSLSPLLRPRHHLNSFFCHESSLPNPPRNS-RRRRSTISLSFRTSYRFSVDH--EDEDEQVIRDFGFDEAVDLFNQGAYYDCHDILEILW

Query:  NGAEDPTRTLIHGILQCAVGLHHLFNRNHRGAMMELGEGLCKLRKMEFESGPFHTFEREISAVLDFVYQTQIELAACDENVCVAMEGSERSYELLGRYGA
        NGAEDPTRTLIHGILQCAVGLHHLFNRNHRGAMMELGEGLCKLRKMEF++GPFHTFEREISAVLDF+Y TQIELAACDENVCV MEGSERSYELLGRYGA
Subjt:  NGAEDPTRTLIHGILQCAVGLHHLFNRNHRGAMMELGEGLCKLRKMEFESGPFHTFEREISAVLDFVYQTQIELAACDENVCVAMEGSERSYELLGRYGA

Query:  GQKLYDLEREVDGSMCIVFSPQTSQTHPLR
        GQKLYD E E DGSMCIVFSPQTSQTHPLR
Subjt:  GQKLYDLEREVDGSMCIVFSPQTSQTHPLR

XP_022945654.1 uncharacterized protein LOC111449825 [Cucurbita moschata]7.9e-10785.22Show/hide
Query:  MAFLPSLYVSSSLSPLLRPRHHLNSFFCHESSLPNPPRNS-RRRRSTISLSFRTSYRFSVDH--EDEDEQVIRDFGFDEAVDLFNQGAYYDCHDILEILW
        MA LPSL VSSS   LL P  H NS F H SSLP PPRN+ RR ++T SLSFRTSYRF+VDH  EDEDEQ+ R+F FDEAVDLFNQGAYYDCHD+LEILW
Subjt:  MAFLPSLYVSSSLSPLLRPRHHLNSFFCHESSLPNPPRNS-RRRRSTISLSFRTSYRFSVDH--EDEDEQVIRDFGFDEAVDLFNQGAYYDCHDILEILW

Query:  NGAEDPTRTLIHGILQCAVGLHHLFNRNHRGAMMELGEGLCKLRKMEFESGPFHTFEREISAVLDFVYQTQIELAACDENVCVAMEGSERSYELLGRYGA
        NGAEDPTRTLIHGILQCAVGLHHLFNRNHRGAMMELGEGLCKLRKMEF++GPFHTFEREISAVLDF+Y TQIELAACDENVCV MEGSERSYELLGRYGA
Subjt:  NGAEDPTRTLIHGILQCAVGLHHLFNRNHRGAMMELGEGLCKLRKMEFESGPFHTFEREISAVLDFVYQTQIELAACDENVCVAMEGSERSYELLGRYGA

Query:  GQKLYDLEREVDGSMCIVFSPQTSQTHPLR
        GQKLYD E EVDGSMCIVFSPQTSQTHPLR
Subjt:  GQKLYDLEREVDGSMCIVFSPQTSQTHPLR

XP_022971602.1 uncharacterized protein LOC111470277 [Cucurbita maxima]1.2e-10785.65Show/hide
Query:  MAFLPSLYVSSSLSPLLRPRHHLNSFFCHESSLPNPPRNS-RRRRSTISLSFRTSYRFSVDH--EDEDEQVIRDFGFDEAVDLFNQGAYYDCHDILEILW
        MA LPSL VSSS   LLRP  H NS F H SSLP PPRN+ RR ++T SLSFRTSYRF+VDH  EDEDEQ+ R+F FDEAVDLFNQGAYYDCHD+LEILW
Subjt:  MAFLPSLYVSSSLSPLLRPRHHLNSFFCHESSLPNPPRNS-RRRRSTISLSFRTSYRFSVDH--EDEDEQVIRDFGFDEAVDLFNQGAYYDCHDILEILW

Query:  NGAEDPTRTLIHGILQCAVGLHHLFNRNHRGAMMELGEGLCKLRKMEFESGPFHTFEREISAVLDFVYQTQIELAACDENVCVAMEGSERSYELLGRYGA
        NGAEDPTRTLIHGILQCAVGLHHLFNRNHRGAMMELGEGLCKLRKMEF++GPFHTFEREISAVLDF+Y TQIELAACDENVCV MEGSERSYELLGRYGA
Subjt:  NGAEDPTRTLIHGILQCAVGLHHLFNRNHRGAMMELGEGLCKLRKMEFESGPFHTFEREISAVLDFVYQTQIELAACDENVCVAMEGSERSYELLGRYGA

Query:  GQKLYDLEREVDGSMCIVFSPQTSQTHPLR
        GQKLYD E EVDGSMCIVFSPQTSQTHPLR
Subjt:  GQKLYDLEREVDGSMCIVFSPQTSQTHPLR

XP_023539066.1 uncharacterized protein LOC111799819 [Cucurbita pepo subsp. pepo]1.4e-10684.78Show/hide
Query:  MAFLPSLYVSSSLSPLLRPRHHLNSFFCHESSLPNPPRNS-RRRRSTISLSFRTSYRFSVDH--EDEDEQVIRDFGFDEAVDLFNQGAYYDCHDILEILW
        MA LPSL VSSS   LLRP  H N+ F H SSLP PPRN+  R ++T SLSFRTSYRF+VDH  EDEDEQ+ R+F FDEAVDLFNQGAYYDCHD+LEILW
Subjt:  MAFLPSLYVSSSLSPLLRPRHHLNSFFCHESSLPNPPRNS-RRRRSTISLSFRTSYRFSVDH--EDEDEQVIRDFGFDEAVDLFNQGAYYDCHDILEILW

Query:  NGAEDPTRTLIHGILQCAVGLHHLFNRNHRGAMMELGEGLCKLRKMEFESGPFHTFEREISAVLDFVYQTQIELAACDENVCVAMEGSERSYELLGRYGA
        NGAEDPTRTLIHGILQCAVGLHHLFNRNHRGAMMELGEGLCKLRKMEF++GPFHTFEREISAVLDF+Y TQIELAACDENVCV MEGSERSYELLGRYGA
Subjt:  NGAEDPTRTLIHGILQCAVGLHHLFNRNHRGAMMELGEGLCKLRKMEFESGPFHTFEREISAVLDFVYQTQIELAACDENVCVAMEGSERSYELLGRYGA

Query:  GQKLYDLEREVDGSMCIVFSPQTSQTHPLR
        GQKLYD E EVDGSMCIVFSPQTSQTHPLR
Subjt:  GQKLYDLEREVDGSMCIVFSPQTSQTHPLR

XP_038905159.1 uncharacterized protein LOC120091273 [Benincasa hispida]2.1e-10784.65Show/hide
Query:  MAFLPSLYVSSSLSPLLRPRHHLNSFFCHESSLPNPPR-NSRRRRSTISLSFRTSYRFSVDHEDEDEQVIRDFGFDEAVDLFNQGAYYDCHDILEILWNG
        MA LP+LYVSSSL   LRP   LNS F HE+SLP PPR  S +RR+TISLSFRTSY F+ DHEDEDEQ+ RDFGFDEAVDLFNQGAYYDCHD+LE LWNG
Subjt:  MAFLPSLYVSSSLSPLLRPRHHLNSFFCHESSLPNPPR-NSRRRRSTISLSFRTSYRFSVDHEDEDEQVIRDFGFDEAVDLFNQGAYYDCHDILEILWNG

Query:  AEDPTRTLIHGILQCAVGLHHLFNRNHRGAMMELGEGLCKLRKMEFESGPFHTFEREISAVLDFVYQTQIELAACDENVCVAMEGSERSYELLGRYGAGQ
        AEDPTRTL HGILQCAVGLHHLFNRNHRGAMMELGEGLCKLRKM+F+SGPF+TFEREI+AVLDFVY TQIELAACDENVCV MEGSERSYELLGRYGAGQ
Subjt:  AEDPTRTLIHGILQCAVGLHHLFNRNHRGAMMELGEGLCKLRKMEFESGPFHTFEREISAVLDFVYQTQIELAACDENVCVAMEGSERSYELLGRYGAGQ

Query:  KLYDLEREVDGSMCIVFSPQTSQTHPLR
        KLYD+E+EVDG MCIVFSPQTSQ HPLR
Subjt:  KLYDLEREVDGSMCIVFSPQTSQTHPLR

TrEMBL top hitse value%identityAlignment
A0A0A0LC38 Uncharacterized protein1.2e-9577.63Show/hide
Query:  MAFLPSLYVSSSLSPLLRPRHHLNSFFCHESSLPNPPRNSRRRR-STISLSFRTSYRFSVDHEDEDEQVIRDFGFDEAVDLFNQGAYYDCHDILEILWNG
        MA L SLY+SSS    L P    +S   H S+L + PR +  RR +TI LSFRTSYRF+ DHED DE++  DFGFDEAVDLFNQGAYYDCHD+LE LWN 
Subjt:  MAFLPSLYVSSSLSPLLRPRHHLNSFFCHESSLPNPPRNSRRRR-STISLSFRTSYRFSVDHEDEDEQVIRDFGFDEAVDLFNQGAYYDCHDILEILWNG

Query:  AEDPTRTLIHGILQCAVGLHHLFNRNHRGAMMELGEGLCKLRKMEFESGPFHTFEREISAVLDFVYQTQIELAACDENVCVAMEGSERSYELLGRYGAGQ
        AEDPTRTLIHGILQCAVGLHHLFNRNHRGAMMELGEG+CKLRKMEF SGPF TFEREI+AVLDFVY TQIELAACDE+VCV MEGSERSYELLGRYG GQ
Subjt:  AEDPTRTLIHGILQCAVGLHHLFNRNHRGAMMELGEGLCKLRKMEFESGPFHTFEREISAVLDFVYQTQIELAACDENVCVAMEGSERSYELLGRYGAGQ

Query:  KLYDLEREVDGSMCIVFSPQTSQTHPLR
        KLYD+E++VDGS CIVFS QTSQTHPLR
Subjt:  KLYDLEREVDGSMCIVFSPQTSQTHPLR

A0A1S3B7D9 uncharacterized protein LOC1034868361.4e-9678.26Show/hide
Query:  MAFLPSLYVSSSLSPLLRPRHHLNSF--FCHESSLPNPPRNSRRRR-STISLSFRTSYRFSVDHEDEDEQVIRDFGFDEAVDLFNQGAYYDCHDILEILW
        MA L SL++SSS    L P    NS     H S  P+ PR +  RR +TIS SFRTSYRF+ DHED DE++  DFGFDEAVDLFNQGAYYDCHD+LE LW
Subjt:  MAFLPSLYVSSSLSPLLRPRHHLNSF--FCHESSLPNPPRNSRRRR-STISLSFRTSYRFSVDHEDEDEQVIRDFGFDEAVDLFNQGAYYDCHDILEILW

Query:  NGAEDPTRTLIHGILQCAVGLHHLFNRNHRGAMMELGEGLCKLRKMEFESGPFHTFEREISAVLDFVYQTQIELAACDENVCVAMEGSERSYELLGRYGA
        N AEDPTRTLIHGILQCAVGLHHLFNRNHRGAMMELGEG+CKLRKMEF SGPFHTFEREI+AVLDFVY TQIELAACDE+VCV MEGSERSYELLGRYG 
Subjt:  NGAEDPTRTLIHGILQCAVGLHHLFNRNHRGAMMELGEGLCKLRKMEFESGPFHTFEREISAVLDFVYQTQIELAACDENVCVAMEGSERSYELLGRYGA

Query:  GQKLYDLEREVDGSMCIVFSPQTSQTHPLR
        GQKLYD+E++VDGSMCIVFSPQTSQTHPLR
Subjt:  GQKLYDLEREVDGSMCIVFSPQTSQTHPLR

A0A5A7UHC0 Uncharacterized ypuF1.4e-9678.26Show/hide
Query:  MAFLPSLYVSSSLSPLLRPRHHLNSF--FCHESSLPNPPRNSRRRR-STISLSFRTSYRFSVDHEDEDEQVIRDFGFDEAVDLFNQGAYYDCHDILEILW
        MA L SL++SSS    L P    NS     H S  P+ PR +  RR +TIS SFRTSYRF+ DHED DE++  DFGFDEAVDLFNQGAYYDCHD+LE LW
Subjt:  MAFLPSLYVSSSLSPLLRPRHHLNSF--FCHESSLPNPPRNSRRRR-STISLSFRTSYRFSVDHEDEDEQVIRDFGFDEAVDLFNQGAYYDCHDILEILW

Query:  NGAEDPTRTLIHGILQCAVGLHHLFNRNHRGAMMELGEGLCKLRKMEFESGPFHTFEREISAVLDFVYQTQIELAACDENVCVAMEGSERSYELLGRYGA
        N AEDPTRTLIHGILQCAVGLHHLFNRNHRGAMMELGEG+CKLRKMEF SGPFHTFEREI+AVLDFVY TQIELAACDE+VCV MEGSERSYELLGRYG 
Subjt:  NGAEDPTRTLIHGILQCAVGLHHLFNRNHRGAMMELGEGLCKLRKMEFESGPFHTFEREISAVLDFVYQTQIELAACDENVCVAMEGSERSYELLGRYGA

Query:  GQKLYDLEREVDGSMCIVFSPQTSQTHPLR
        GQKLYD+E++VDGSMCIVFSPQTSQTHPLR
Subjt:  GQKLYDLEREVDGSMCIVFSPQTSQTHPLR

A0A6J1G1I9 uncharacterized protein LOC1114498253.8e-10785.22Show/hide
Query:  MAFLPSLYVSSSLSPLLRPRHHLNSFFCHESSLPNPPRNS-RRRRSTISLSFRTSYRFSVDH--EDEDEQVIRDFGFDEAVDLFNQGAYYDCHDILEILW
        MA LPSL VSSS   LL P  H NS F H SSLP PPRN+ RR ++T SLSFRTSYRF+VDH  EDEDEQ+ R+F FDEAVDLFNQGAYYDCHD+LEILW
Subjt:  MAFLPSLYVSSSLSPLLRPRHHLNSFFCHESSLPNPPRNS-RRRRSTISLSFRTSYRFSVDH--EDEDEQVIRDFGFDEAVDLFNQGAYYDCHDILEILW

Query:  NGAEDPTRTLIHGILQCAVGLHHLFNRNHRGAMMELGEGLCKLRKMEFESGPFHTFEREISAVLDFVYQTQIELAACDENVCVAMEGSERSYELLGRYGA
        NGAEDPTRTLIHGILQCAVGLHHLFNRNHRGAMMELGEGLCKLRKMEF++GPFHTFEREISAVLDF+Y TQIELAACDENVCV MEGSERSYELLGRYGA
Subjt:  NGAEDPTRTLIHGILQCAVGLHHLFNRNHRGAMMELGEGLCKLRKMEFESGPFHTFEREISAVLDFVYQTQIELAACDENVCVAMEGSERSYELLGRYGA

Query:  GQKLYDLEREVDGSMCIVFSPQTSQTHPLR
        GQKLYD E EVDGSMCIVFSPQTSQTHPLR
Subjt:  GQKLYDLEREVDGSMCIVFSPQTSQTHPLR

A0A6J1I2E8 uncharacterized protein LOC1114702775.9e-10885.65Show/hide
Query:  MAFLPSLYVSSSLSPLLRPRHHLNSFFCHESSLPNPPRNS-RRRRSTISLSFRTSYRFSVDH--EDEDEQVIRDFGFDEAVDLFNQGAYYDCHDILEILW
        MA LPSL VSSS   LLRP  H NS F H SSLP PPRN+ RR ++T SLSFRTSYRF+VDH  EDEDEQ+ R+F FDEAVDLFNQGAYYDCHD+LEILW
Subjt:  MAFLPSLYVSSSLSPLLRPRHHLNSFFCHESSLPNPPRNS-RRRRSTISLSFRTSYRFSVDH--EDEDEQVIRDFGFDEAVDLFNQGAYYDCHDILEILW

Query:  NGAEDPTRTLIHGILQCAVGLHHLFNRNHRGAMMELGEGLCKLRKMEFESGPFHTFEREISAVLDFVYQTQIELAACDENVCVAMEGSERSYELLGRYGA
        NGAEDPTRTLIHGILQCAVGLHHLFNRNHRGAMMELGEGLCKLRKMEF++GPFHTFEREISAVLDF+Y TQIELAACDENVCV MEGSERSYELLGRYGA
Subjt:  NGAEDPTRTLIHGILQCAVGLHHLFNRNHRGAMMELGEGLCKLRKMEFESGPFHTFEREISAVLDFVYQTQIELAACDENVCVAMEGSERSYELLGRYGA

Query:  GQKLYDLEREVDGSMCIVFSPQTSQTHPLR
        GQKLYD E EVDGSMCIVFSPQTSQTHPLR
Subjt:  GQKLYDLEREVDGSMCIVFSPQTSQTHPLR

SwissProt top hitse value%identityAlignment
Q0WQW5 Pentatricopeptide repeat-containing protein At1g59720, chloroplastic/mitochondrial3.1e-0528.96Show/hide
Query:  SPQTSQTHPLRPVYEFSRTLSSRPQAQFLQIYIAPQTNPCLAIKTVSFSLQNQFLYPKLISLSSSSHDLFYIRSILLNQFNDAYFCFN-LCNAIIHNITA
        S  T+  H  R ++  + T S   Q + L  +    T P              FLY K++ LSSS  D+ Y   +  +  N + F +N L  A  H++  
Subjt:  SPQTSQTHPLRPVYEFSRTLSSRPQAQFLQIYIAPQTNPCLAIKTVSFSLQNQFLYPKLISLSSSSHDLFYIRSILLNQFNDAYFCFN-LCNAIIHNITA

Query:  NSNGKSSNSTYRRAMEHLREMLVIDVELDEFTLPYVLKELVQIQAMREDQQIHARSIKIGLLVFNVYVNNTLIRLYSIGGFID
         S  + +   YR+ +E        +   D+ T P+VLK    I    E +Q+H + +K G    +VYVNN LI LY   G +D
Subjt:  NSNGKSSNSTYRRAMEHLREMLVIDVELDEFTLPYVLKELVQIQAMREDQQIHARSIKIGLLVFNVYVNNTLIRLYSIGGFID

Q7Y211 Pentatricopeptide repeat-containing protein At3g57430, chloroplastic4.5e-0436.23Show/hide
Query:  NSTYRRAMEHLREMLVIDVELDEFTLPYVLKELVQIQAMREDQQIHARSIKIGLLVFNVYVNNTLIRLY
        N     A+E+LREM++  VE DEFT+  VL     ++ +R  +++HA ++K G L  N +V + L+ +Y
Subjt:  NSTYRRAMEHLREMLVIDVELDEFTLPYVLKELVQIQAMREDQQIHARSIKIGLLVFNVYVNNTLIRLY

Arabidopsis top hitse value%identityAlignment
AT1G59720.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.2e-0628.96Show/hide
Query:  SPQTSQTHPLRPVYEFSRTLSSRPQAQFLQIYIAPQTNPCLAIKTVSFSLQNQFLYPKLISLSSSSHDLFYIRSILLNQFNDAYFCFN-LCNAIIHNITA
        S  T+  H  R ++  + T S   Q + L  +    T P              FLY K++ LSSS  D+ Y   +  +  N + F +N L  A  H++  
Subjt:  SPQTSQTHPLRPVYEFSRTLSSRPQAQFLQIYIAPQTNPCLAIKTVSFSLQNQFLYPKLISLSSSSHDLFYIRSILLNQFNDAYFCFN-LCNAIIHNITA

Query:  NSNGKSSNSTYRRAMEHLREMLVIDVELDEFTLPYVLKELVQIQAMREDQQIHARSIKIGLLVFNVYVNNTLIRLYSIGGFID
         S  + +   YR+ +E        +   D+ T P+VLK    I    E +Q+H + +K G    +VYVNN LI LY   G +D
Subjt:  NSNGKSSNSTYRRAMEHLREMLVIDVELDEFTLPYVLKELVQIQAMREDQQIHARSIKIGLLVFNVYVNNTLIRLYSIGGFID

AT2G41120.1 unknown protein1.1e-5861.49Show/hide
Query:  DHEDEDEQVIRDFGFDEAVDLFNQGAYYDCHDILEILWNGAEDPTRTLIHGILQCAVGLHHLFNRNHRGAMMELGEGLCKLRKMEFESGPFHTFEREISA
        D ED DE     + F+EAV LFN+  YY  HD LE LW  AE+PTRTLIHGILQCAVG HHLFN NH+GAMMELGEG+CKLRKM FE GPFH FER++SA
Subjt:  DHEDEDEQVIRDFGFDEAVDLFNQGAYYDCHDILEILWNGAEDPTRTLIHGILQCAVGLHHLFNRNHRGAMMELGEGLCKLRKMEFESGPFHTFEREISA

Query:  VLDFVYQTQIELAACDENVCVAMEGSERSYELLGRYGAGQKLYDLEREVD------GSMCIVFSPQTSQTHPLR
        VL+FVYQTQ+ELAAC E++C+ M+ S+RSY+LLG Y AG+ +Y LE  +D       +  I+FSP  S + P R
Subjt:  VLDFVYQTQIELAACDENVCVAMEGSERSYELLGRYGAGQKLYDLEREVD------GSMCIVFSPQTSQTHPLR

AT3G57430.1 Tetratricopeptide repeat (TPR)-like superfamily protein3.2e-0536.23Show/hide
Query:  NSTYRRAMEHLREMLVIDVELDEFTLPYVLKELVQIQAMREDQQIHARSIKIGLLVFNVYVNNTLIRLY
        N     A+E+LREM++  VE DEFT+  VL     ++ +R  +++HA ++K G L  N +V + L+ +Y
Subjt:  NSTYRRAMEHLREMLVIDVELDEFTLPYVLKELVQIQAMREDQQIHARSIKIGLLVFNVYVNNTLIRLY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTTCCTTCCATCCCTGTATGTTTCCTCCTCCTTATCACCCCTTCTTCGACCTCGCCACCATTTGAACTCCTTTTTCTGCCATGAAAGCAGTCTCCCAAATCCTCC
AAGAAACAGCAGACGAAGAAGAAGCACGATATCGCTCTCCTTCCGCACCTCCTACCGATTTTCCGTCGACCATGAAGACGAAGACGAGCAGGTTATCAGAGATTTCGGCT
TTGACGAAGCAGTGGATCTCTTCAATCAAGGAGCGTATTACGATTGCCACGACATCCTTGAAATTCTATGGAACGGAGCCGAAGACCCTACCAGAACCCTAATTCATGGC
ATTCTTCAGTGCGCCGTGGGGCTTCATCATCTCTTCAATCGGAATCATAGAGGGGCGATGATGGAGCTGGGAGAGGGGCTGTGTAAGCTACGGAAGATGGAGTTTGAGAG
TGGCCCTTTCCATACATTCGAGAGGGAGATTTCTGCAGTTCTGGACTTTGTTTACCAGACCCAGATTGAATTAGCTGCCTGTGATGAGAATGTGTGTGTTGCAATGGAGG
GTTCAGAGAGATCATATGAATTGCTTGGAAGGTACGGTGCAGGACAGAAGCTGTATGATTTAGAGAGAGAAGTTGATGGGAGCATGTGCATTGTCTTCTCTCCTCAAACT
TCTCAAACTCATCCACTCAGGCCAGTGTATGAATTCTCAAGAACTCTATCTTCTCGCCCACAAGCTCAATTCCTGCAAATCTATATCGCACCTCAAACAAATCCATGCCT
CGCCATTAAAACAGTTTCCTTCTCTCTCCAAAATCAATTCCTGTATCCCAAACTCATTTCTCTCTCTTCCTCCTCCCACGACCTTTTCTACATCCGCTCCATCCTTCTCA
ACCAGTTCAACGATGCATACTTCTGCTTCAATCTCTGCAACGCCATCATCCACAACATTACCGCAAACTCCAATGGTAAGAGTAGCAATTCTACCTATCGCAGGGCCATG
GAACACTTGAGGGAAATGCTTGTGATCGACGTTGAACTGGATGAGTTCACATTGCCGTATGTTCTCAAAGAGTTGGTTCAGATTCAGGCGATGAGAGAAGACCAACAGAT
TCACGCTCGTTCTATCAAGATTGGACTACTGGTATTCAATGTTTATGTGAATAACACGTTGATCAGATTGTATTCCATCGGTGGCTTTATCGATGCAGTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCTTTCCTTCCATCCCTGTATGTTTCCTCCTCCTTATCACCCCTTCTTCGACCTCGCCACCATTTGAACTCCTTTTTCTGCCATGAAAGCAGTCTCCCAAATCCTCC
AAGAAACAGCAGACGAAGAAGAAGCACGATATCGCTCTCCTTCCGCACCTCCTACCGATTTTCCGTCGACCATGAAGACGAAGACGAGCAGGTTATCAGAGATTTCGGCT
TTGACGAAGCAGTGGATCTCTTCAATCAAGGAGCGTATTACGATTGCCACGACATCCTTGAAATTCTATGGAACGGAGCCGAAGACCCTACCAGAACCCTAATTCATGGC
ATTCTTCAGTGCGCCGTGGGGCTTCATCATCTCTTCAATCGGAATCATAGAGGGGCGATGATGGAGCTGGGAGAGGGGCTGTGTAAGCTACGGAAGATGGAGTTTGAGAG
TGGCCCTTTCCATACATTCGAGAGGGAGATTTCTGCAGTTCTGGACTTTGTTTACCAGACCCAGATTGAATTAGCTGCCTGTGATGAGAATGTGTGTGTTGCAATGGAGG
GTTCAGAGAGATCATATGAATTGCTTGGAAGGTACGGTGCAGGACAGAAGCTGTATGATTTAGAGAGAGAAGTTGATGGGAGCATGTGCATTGTCTTCTCTCCTCAAACT
TCTCAAACTCATCCACTCAGGCCAGTGTATGAATTCTCAAGAACTCTATCTTCTCGCCCACAAGCTCAATTCCTGCAAATCTATATCGCACCTCAAACAAATCCATGCCT
CGCCATTAAAACAGTTTCCTTCTCTCTCCAAAATCAATTCCTGTATCCCAAACTCATTTCTCTCTCTTCCTCCTCCCACGACCTTTTCTACATCCGCTCCATCCTTCTCA
ACCAGTTCAACGATGCATACTTCTGCTTCAATCTCTGCAACGCCATCATCCACAACATTACCGCAAACTCCAATGGTAAGAGTAGCAATTCTACCTATCGCAGGGCCATG
GAACACTTGAGGGAAATGCTTGTGATCGACGTTGAACTGGATGAGTTCACATTGCCGTATGTTCTCAAAGAGTTGGTTCAGATTCAGGCGATGAGAGAAGACCAACAGAT
TCACGCTCGTTCTATCAAGATTGGACTACTGGTATTCAATGTTTATGTGAATAACACGTTGATCAGATTGTATTCCATCGGTGGCTTTATCGATGCAGTCTAG
Protein sequenceShow/hide protein sequence
MAFLPSLYVSSSLSPLLRPRHHLNSFFCHESSLPNPPRNSRRRRSTISLSFRTSYRFSVDHEDEDEQVIRDFGFDEAVDLFNQGAYYDCHDILEILWNGAEDPTRTLIHG
ILQCAVGLHHLFNRNHRGAMMELGEGLCKLRKMEFESGPFHTFEREISAVLDFVYQTQIELAACDENVCVAMEGSERSYELLGRYGAGQKLYDLEREVDGSMCIVFSPQT
SQTHPLRPVYEFSRTLSSRPQAQFLQIYIAPQTNPCLAIKTVSFSLQNQFLYPKLISLSSSSHDLFYIRSILLNQFNDAYFCFNLCNAIIHNITANSNGKSSNSTYRRAM
EHLREMLVIDVELDEFTLPYVLKELVQIQAMREDQQIHARSIKIGLLVFNVYVNNTLIRLYSIGGFIDAV