CuGenDBv2

Sgr030318 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr030318
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: BEST Arabidopsis thaliana protein match is: 3'-5' exonuclease domain-containing protein / K homology domain-containing protein / KH domain-containing protein.
Genome location: tig00153640:2200843..2203792
RNA-Seq Expression: Sgr030318
Synteny: Sgr030318
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
XP_004152357.1 uncharacterized protein LOC101212915 isoform X1 [Cucumis sativus] | e value: 2.1e-124 | % identity: 91.19
Query:  MGVESNSAAPPPSSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLPENLMDSPARSESMLCLRDEMSWQYSPMSEDSDDCRFC
        MGVESNSA PPP+SSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPL ENLMDSPARSESML  RDEMSWQYSPMSEDSDDCRFC
Subjt:  MGVESNSAAPPPSSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLPENLMDSPARSESMLCLRDEMSWQYSPMSEDSDDCRFC

Query:  ETSTTLFPSQSD-SVPTSPVSPYRYQRPFSGSTPSTSTNTSLG-SSTNPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSM
        ETST LFPSQSD SVPTSPVSPYRYQRPFSG  PST TNTSLG S+T+PVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSM
Subjt:  ETSTTLFPSQSD-SVPTSPVSPYRYQRPFSGSTPSTSTNTSLG-SSTNPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSM

Query:  ELPYCSMPEPGPNIEAEERPCSCIKSLVDERVYQLEECSS--VGVSEPEYSQQKLCKDLKK
        ELPYCSMPEPGPNIEAE+RPCSCIKSLVDERVYQLEECSS  +GVSE EY++QK CKDL +
Subjt:  ELPYCSMPEPGPNIEAEERPCSCIKSLVDERVYQLEECSS--VGVSEPEYSQQKLCKDLKK

XP_022155254.1 uncharacterized protein LOC111022394 isoform X1 [Momordica charantia] | e value: 4.3e-122 | % identity: 89.58
Query:  MGVESNSAAPPPSSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLPENLMDSPARSESMLCLRDEMSWQYSPMSEDSDDCRFC
        MGVESNSAAPPP SSSSTPSPS KRARDP+DEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLPENLMDSPARSESML LRDEMSWQYSPMSEDSDDCRFC
Subjt:  MGVESNSAAPPPSSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLPENLMDSPARSESMLCLRDEMSWQYSPMSEDSDDCRFC

Query:  ETSTTLFPSQSDSVPTSPVSPYRYQRPFSGSTPSTST-NTSLGSSTNPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSME
        ETST LFP QSDSVPTSPVSPYRYQRPFS  TPSTST N SLG ST+PV  LQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSME
Subjt:  ETSTTLFPSQSDSVPTSPVSPYRYQRPFSGSTPSTST-NTSLGSSTNPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSME

Query:  LPYCSMPEPGPNIEAEERPCSCIKSLVDERVYQLEECSSVGVSEPEYSQQKLCKDLKKS
        LPYCSM EPGPNIEAEERPC+  KSLV+ERVYQLEECS++ VSEPEY+QQK CKDL ++
Subjt:  LPYCSMPEPGPNIEAEERPCSCIKSLVDERVYQLEECSSVGVSEPEYSQQKLCKDLKKS

XP_022934215.1 uncharacterized protein LOC111441454 isoform X1 [Cucurbita moschata] | e value: 5.7e-122 | % identity: 90.42
Query:  MGVESNSAAPPP---SSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLPENLMDSPARSESMLCLRDEMSWQYSPMSEDSDDC
        MGVESNS  PPP   SSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPL ENLMDSPARSESML LRDEMS QYSPMSEDSDDC
Subjt:  MGVESNSAAPPP---SSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLPENLMDSPARSESMLCLRDEMSWQYSPMSEDSDDC

Query:  RFCETSTTLFPSQSD-SVPTSPVSPYRYQRPFSGSTPSTSTNTSLGSSTNPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPS
        RFCETST LFPSQSD SVPTSPVSPYRYQRPFSG TPST TNTSLG +T PVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQP GPS
Subjt:  RFCETSTTLFPSQSD-SVPTSPVSPYRYQRPFSGSTPSTSTNTSLGSSTNPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPS

Query:  SMELPYCSMPEPGPNIEAEERPCSCIKSLVDERVYQLEECSSVGVSEPEYSQQKLCKDLKK
        SMELPYCSMPEPGPNIEAEER CS IKSLVDERVYQL ECSS+GVSEPEY++QK CKDL +
Subjt:  SMELPYCSMPEPGPNIEAEERPCSCIKSLVDERVYQLEECSSVGVSEPEYSQQKLCKDLKK

XP_022984066.1 uncharacterized protein LOC111482488 isoform X1 [Cucurbita maxima] | e value: 1.9e-122 | % identity: 90.7
Query:  MGVESNSAAPPPSSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLPENLMDSPARSESMLCLRDEMSWQYSPMSEDSDDCRFC
        MGVESNSA PPP SSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGD L ENLMDSPARSESML LRDEMS QYSPMSEDSDDCRFC
Subjt:  MGVESNSAAPPPSSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLPENLMDSPARSESMLCLRDEMSWQYSPMSEDSDDCRFC

Query:  ETSTTLFPSQSD-SVPTSPVSPYRYQRPFSGSTPSTSTNTSLGSSTNPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSME
        ETST LFPSQSD SVPTSPVSPYRYQRPFSG TPST+TNTSLG +T+PVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQP GPSSME
Subjt:  ETSTTLFPSQSD-SVPTSPVSPYRYQRPFSGSTPSTSTNTSLGSSTNPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSME

Query:  LPYCSMPEPGPNIEAEERPCSCIKSLVDERVYQLEECSSVGVSEPEYSQQKLCKDLKK
        LPYCSMPEPGPNIEAEER CS IKSLVDERVYQL ECS++GVSEPEY++QK CKDL +
Subjt:  LPYCSMPEPGPNIEAEERPCSCIKSLVDERVYQLEECSSVGVSEPEYSQQKLCKDLKK

XP_038904570.1 uncharacterized protein LOC120090942 [Benincasa hispida] | e value: 3.9e-123 | % identity: 90.7
Query:  MGVESNSAAPPPSSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLPENLMDSPARSESMLCLRDEMSWQYSPMSEDSDDCRFC
        MGVESNSA  PP SSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPL ENLMDSPARSESML  R+EMSWQYSPMSEDSDDCRFC
Subjt:  MGVESNSAAPPPSSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLPENLMDSPARSESMLCLRDEMSWQYSPMSEDSDDCRFC

Query:  ETSTTLFPSQSD-SVPTSPVSPYRYQRPFSGSTPSTSTNTSLGSSTNPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSME
        ETST LFP+QSD SVPTSPVSPYRYQRPFSG TPST TNTSLG ST+PVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQP GPSSME
Subjt:  ETSTTLFPSQSD-SVPTSPVSPYRYQRPFSGSTPSTSTNTSLGSSTNPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSME

Query:  LPYCSMPEPGPNIEAEERPCSCIKSLVDERVYQLEECSSVGVSEPEYSQQKLCKDLKK
        LPYCSMPEPGPNIEAEERPCSCIKSLVDER +QLEECSS+GVSEPEY+++K CKDL +
Subjt:  LPYCSMPEPGPNIEAEERPCSCIKSLVDERVYQLEECSSVGVSEPEYSQQKLCKDLKK

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KWN0 Uncharacterized protein | e value: 1.0e-124 | % identity: 91.19
Query:  MGVESNSAAPPPSSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLPENLMDSPARSESMLCLRDEMSWQYSPMSEDSDDCRFC
        MGVESNSA PPP+SSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPL ENLMDSPARSESML  RDEMSWQYSPMSEDSDDCRFC
Subjt:  MGVESNSAAPPPSSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLPENLMDSPARSESMLCLRDEMSWQYSPMSEDSDDCRFC

Query:  ETSTTLFPSQSD-SVPTSPVSPYRYQRPFSGSTPSTSTNTSLG-SSTNPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSM
        ETST LFPSQSD SVPTSPVSPYRYQRPFSG  PST TNTSLG S+T+PVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSM
Subjt:  ETSTTLFPSQSD-SVPTSPVSPYRYQRPFSGSTPSTSTNTSLG-SSTNPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSM

Query:  ELPYCSMPEPGPNIEAEERPCSCIKSLVDERVYQLEECSS--VGVSEPEYSQQKLCKDLKK
        ELPYCSMPEPGPNIEAE+RPCSCIKSLVDERVYQLEECSS  +GVSE EY++QK CKDL +
Subjt:  ELPYCSMPEPGPNIEAEERPCSCIKSLVDERVYQLEECSS--VGVSEPEYSQQKLCKDLKK

A0A5A7TRC2 Uncharacterized protein | e value: 3.6e-122 | % identity: 89.66
Query:  MGVESNSA-APPPSSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLPENLMDSPARSESMLCLRDEMSWQYSPMSEDSDDCRF
        MGVESNSA  PPP+SSSSTPSPSGKRARDP+DEVYLDNFHSHKRYLSEIMASSLNGLTVG+PL ENLMDSPARSESML  RDEMSWQYSPMSEDSDDCRF
Subjt:  MGVESNSA-APPPSSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLPENLMDSPARSESMLCLRDEMSWQYSPMSEDSDDCRF

Query:  CETSTTLFPSQSD-SVPTSPVSPYRYQRPFSGSTPSTSTNTSLGSSTNPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSM
        CETST LFPSQSD SVPTSPVSPYRYQRPFSG  PS  TNTSLG ST+PVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQP GPSSM
Subjt:  CETSTTLFPSQSD-SVPTSPVSPYRYQRPFSGSTPSTSTNTSLGSSTNPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSM

Query:  ELPYCSMPEPGPNIEAEERPCSCIKSLVDERVYQLEECSS--VGVSEPEYSQQKLCKDLKK
        ELPYCSMPEPGPNIEAE+RPCSCIKSLVDERVYQLEECSS  +GVSE EY++QK CKDL +
Subjt:  ELPYCSMPEPGPNIEAEERPCSCIKSLVDERVYQLEECSS--VGVSEPEYSQQKLCKDLKK

A0A6J1DPQ3 uncharacterized protein LOC111022394 isoform X1 | e value: 2.1e-122 | % identity: 89.58
Query:  MGVESNSAAPPPSSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLPENLMDSPARSESMLCLRDEMSWQYSPMSEDSDDCRFC
        MGVESNSAAPPP SSSSTPSPS KRARDP+DEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLPENLMDSPARSESML LRDEMSWQYSPMSEDSDDCRFC
Subjt:  MGVESNSAAPPPSSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLPENLMDSPARSESMLCLRDEMSWQYSPMSEDSDDCRFC

Query:  ETSTTLFPSQSDSVPTSPVSPYRYQRPFSGSTPSTST-NTSLGSSTNPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSME
        ETST LFP QSDSVPTSPVSPYRYQRPFS  TPSTST N SLG ST+PV  LQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSME
Subjt:  ETSTTLFPSQSDSVPTSPVSPYRYQRPFSGSTPSTST-NTSLGSSTNPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSME

Query:  LPYCSMPEPGPNIEAEERPCSCIKSLVDERVYQLEECSSVGVSEPEYSQQKLCKDLKKS
        LPYCSM EPGPNIEAEERPC+  KSLV+ERVYQLEECS++ VSEPEY+QQK CKDL ++
Subjt:  LPYCSMPEPGPNIEAEERPCSCIKSLVDERVYQLEECSSVGVSEPEYSQQKLCKDLKKS

A0A6J1F722 uncharacterized protein LOC111441454 isoform X1 | e value: 2.7e-122 | % identity: 90.42
Query:  MGVESNSAAPPP---SSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLPENLMDSPARSESMLCLRDEMSWQYSPMSEDSDDC
        MGVESNS  PPP   SSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPL ENLMDSPARSESML LRDEMS QYSPMSEDSDDC
Subjt:  MGVESNSAAPPP---SSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLPENLMDSPARSESMLCLRDEMSWQYSPMSEDSDDC

Query:  RFCETSTTLFPSQSD-SVPTSPVSPYRYQRPFSGSTPSTSTNTSLGSSTNPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPS
        RFCETST LFPSQSD SVPTSPVSPYRYQRPFSG TPST TNTSLG +T PVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQP GPS
Subjt:  RFCETSTTLFPSQSD-SVPTSPVSPYRYQRPFSGSTPSTSTNTSLGSSTNPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPS

Query:  SMELPYCSMPEPGPNIEAEERPCSCIKSLVDERVYQLEECSSVGVSEPEYSQQKLCKDLKK
        SMELPYCSMPEPGPNIEAEER CS IKSLVDERVYQL ECSS+GVSEPEY++QK CKDL +
Subjt:  SMELPYCSMPEPGPNIEAEERPCSCIKSLVDERVYQLEECSSVGVSEPEYSQQKLCKDLKK

A0A6J1J464 uncharacterized protein LOC111482488 isoform X1 | e value: 9.4e-123 | % identity: 90.7
Query:  MGVESNSAAPPPSSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLPENLMDSPARSESMLCLRDEMSWQYSPMSEDSDDCRFC
        MGVESNSA PPP SSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGD L ENLMDSPARSESML LRDEMS QYSPMSEDSDDCRFC
Subjt:  MGVESNSAAPPPSSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLPENLMDSPARSESMLCLRDEMSWQYSPMSEDSDDCRFC

Query:  ETSTTLFPSQSD-SVPTSPVSPYRYQRPFSGSTPSTSTNTSLGSSTNPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSME
        ETST LFPSQSD SVPTSPVSPYRYQRPFSG TPST+TNTSLG +T+PVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQP GPSSME
Subjt:  ETSTTLFPSQSD-SVPTSPVSPYRYQRPFSGSTPSTSTNTSLGSSTNPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSME

Query:  LPYCSMPEPGPNIEAEERPCSCIKSLVDERVYQLEECSSVGVSEPEYSQQKLCKDLKK
        LPYCSMPEPGPNIEAEER CS IKSLVDERVYQL ECS++GVSEPEY++QK CKDL +
Subjt:  LPYCSMPEPGPNIEAEERPCSCIKSLVDERVYQLEECSSVGVSEPEYSQQKLCKDLKK

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT2G25920.1 BEST Arabidopsis thaliana protein match is: 3'-5' exonuclease domain-containing protein / K homology domain-containing protein / KH domain-containing protein (TAIR:AT2G25910.2) | e value: 1.8e-57 | % identity: 54.68
Query:  GVESNSAAPPPSSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLPENLMDSPARSESMLCLRDEMSWQYSPMSEDSDDCRFCE
        G      A  P    S  SP GKR RDP+DEVYLDN  S KRYLSEIMA SLNGLTVGD LP N+++SPARSES L  RD++S QYSPMSEDSD+ RFCE
Subjt:  GVESNSAAPPPSSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLPENLMDSPARSESMLCLRDEMSWQYSPMSEDSDDCRFCE

Query:  TST---TLFPSQSDSVPTSPVSPYRYQRPF-SGSTPSTSTN---------TSLGSSTNPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQM
          T   +   SQ +S PTSPVSPYRYQRP  S ++P  S            S+ S+    T+ Q  QRGSD+EGRFPSSPSDICHS DLRR ALLRSVQM
Subjt:  TST---TLFPSQSDSVPTSPVSPYRYQRPF-SGSTPSTSTN---------TSLGSSTNPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQM

Query:  RAQPPGPSSMELPYCSMPEPGPNIEAEERPCSCIKSLVDERVYQLEECSSVGVSEPEYSQQKLCKDL
        R QP G SS   P         NI+ EER CS  KS+ ++R Y   E   +  +E   S+ K CK L
Subjt:  RAQPPGPSSMELPYCSMPEPGPNIEAEERPCSCIKSLVDERVYQLEECSSVGVSEPEYSQQKLCKDL
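
The % identity values listed for each hit can be approximated from the middle line of an alignment block, where a letter marks an identical residue, `+` marks a conservative substitution, and a blank marks a mismatch or gap. The sketch below is only illustrative: the function name `midline_identity` is hypothetical, and it assumes the denominator is the number of alignment columns, which may not be the exact definition this database uses. The leading indentation of the middle line has to be removed first so that only alignment columns remain; internal spaces must be kept.

```python
def midline_identity(midline: str) -> float:
    """Percent of alignment columns whose midline character is a letter.

    Assumption: `midline` holds only the alignment columns (the left-hand
    indentation already removed, internal spaces preserved).
    """
    if not midline:
        raise ValueError("empty midline")
    # Letters mark identical residues; '+' and ' ' are not identities.
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(midline)


# Synthetic 7-column example: 5 identities, 1 '+', 1 blank.
print(round(midline_identity("MGV+S A"), 2))
```

For a multi-block alignment, the middle lines of all blocks would be concatenated before calling the function.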


Sequences
CDS sequence
ATGGGCGTGGAATCGAACTCAGCGGCACCGCCACCGTCATCGTCGTCTTCTACACCTTCTCCGAGCGGGAAACGAGCCAGAGATCCCGACGATGAAGTTTACCTCGATAA
TTTCCACTCTCACAAGCGCTACCTCAGTGAGATAATGGCTTCAAGTTTGAATGGCTTGACGGTTGGAGACCCCCTCCCAGAGAATCTCATGGATTCTCCTGCTAGGTCGG
AGTCCATGCTTTGTCTAAGGGATGAAATGTCCTGGCAATATTCTCCTATGTCGGAAGATTCAGATGACTGTCGGTTTTGTGAGACATCCACAACCTTATTTCCCTCGCAG
TCTGATAGTGTACCTACCAGTCCAGTCTCTCCATACCGATATCAAAGACCGTTCAGCGGGTCGACTCCTTCAACAAGTACTAATACTTCACTTGGATCTTCTACCAATCC
CGTCACTAGCTTGCAACCTCATCAACGTGGATCAGATTCCGAGGGCCGTTTCCCATCATCTCCTAGTGATATATGCCACTCAGCAGACTTGAGAAGGGCGGCGCTCTTAC
GTTCTGTACAAATGAGAGCACAACCTCCTGGTCCATCATCTATGGAGTTGCCATATTGCTCAATGCCTGAGCCTGGGCCTAATATAGAAGCCGAAGAGCGGCCGTGTTCT
TGCATAAAATCGTTAGTCGATGAAAGGGTTTATCAACTTGAGGAATGCTCCTCTGTAGGAGTGTCCGAGCCTGAATATAGTCAACAGAAATTATGCAAGGACTTGAAAAA
GTCTGGAGAGTAG
mRNA sequence
ATGGGCGTGGAATCGAACTCAGCGGCACCGCCACCGTCATCGTCGTCTTCTACACCTTCTCCGAGCGGGAAACGAGCCAGAGATCCCGACGATGAAGTTTACCTCGATAA
TTTCCACTCTCACAAGCGCTACCTCAGTGAGATAATGGCTTCAAGTTTGAATGGCTTGACGGTTGGAGACCCCCTCCCAGAGAATCTCATGGATTCTCCTGCTAGGTCGG
AGTCCATGCTTTGTCTAAGGGATGAAATGTCCTGGCAATATTCTCCTATGTCGGAAGATTCAGATGACTGTCGGTTTTGTGAGACATCCACAACCTTATTTCCCTCGCAG
TCTGATAGTGTACCTACCAGTCCAGTCTCTCCATACCGATATCAAAGACCGTTCAGCGGGTCGACTCCTTCAACAAGTACTAATACTTCACTTGGATCTTCTACCAATCC
CGTCACTAGCTTGCAACCTCATCAACGTGGATCAGATTCCGAGGGCCGTTTCCCATCATCTCCTAGTGATATATGCCACTCAGCAGACTTGAGAAGGGCGGCGCTCTTAC
GTTCTGTACAAATGAGAGCACAACCTCCTGGTCCATCATCTATGGAGTTGCCATATTGCTCAATGCCTGAGCCTGGGCCTAATATAGAAGCCGAAGAGCGGCCGTGTTCT
TGCATAAAATCGTTAGTCGATGAAAGGGTTTATCAACTTGAGGAATGCTCCTCTGTAGGAGTGTCCGAGCCTGAATATAGTCAACAGAAATTATGCAAGGACTTGAAAAA
GTCTGGAGAGTAG
Protein sequence
MGVESNSAAPPPSSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLPENLMDSPARSESMLCLRDEMSWQYSPMSEDSDDCRFCETSTTLFPSQ
SDSVPTSPVSPYRYQRPFSGSTPSTSTNTSLGSSTNPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSMELPYCSMPEPGPNIEAEERPCS
CIKSLVDERVYQLEECSSVGVSEPEYSQQKLCKDLKKSGE
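
The relationship between the CDS and protein sequences above can be checked by translating the CDS codon by codon. The following is a minimal sketch, not the database's own pipeline: the function name `translate` is hypothetical, and it assumes the standard genetic code (NCBI translation table 1), which the record itself does not state.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons and last six codons of the CDS listed above.
print(translate("ATGGGCGTGGAATCGAACTCAGCG"))  # MGVESNSA
print(translate("AAAAAGTCTGGAGAGTAG"))        # KKSGE
```

The two fragments reproduce the start (`MGVESNSA`) and end (`KKSGE`) of the protein sequence; passing the full CDS, with its line breaks, would translate the whole record in the same way.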