; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI04G01140 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI04G01140
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionBEST Arabidopsis thaliana protein match is: 3'-5' exonuclease domain-containing protein / K homology domain-containing protein / KH domain-containing protein .
Genome locationChr4:621009..624402
RNA-Seq ExpressionCSPI04G01140
SyntenyCSPI04G01140
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004152357.1 uncharacterized protein LOC101212915 isoform X1 [Cucumis sativus]1.6e-148100Show/hide
Query:  MGVESNSAPPPPTSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRFC
        MGVESNSAPPPPTSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRFC
Subjt:  MGVESNSAPPPPTSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRFC

Query:  ETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGVAPSTGTNTSLGCSTTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSM
        ETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGVAPSTGTNTSLGCSTTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSM
Subjt:  ETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGVAPSTGTNTSLGCSTTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSM

Query:  ELPYCSMPEPGPNIEAEDRPCSCIKSLVDERVYQLEECSSMGLGVSESEYNEQKSCKDLNRDMKDSRSGG
        ELPYCSMPEPGPNIEAEDRPCSCIKSLVDERVYQLEECSSMGLGVSESEYNEQKSCKDLNRDMKDSRSGG
Subjt:  ELPYCSMPEPGPNIEAEDRPCSCIKSLVDERVYQLEECSSMGLGVSESEYNEQKSCKDLNRDMKDSRSGG

XP_008454305.1 PREDICTED: uncharacterized protein LOC103494744 isoform X1 [Cucumis melo]6.7e-14297.05Show/hide
Query:  MGVESNSA-PPPPTSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRF
        MGVESNSA PPPPTSSSSTPSPSGKRARDP+DEVYLDNFHSHKRYLSEIMASSLNGLTVG+PLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRF
Subjt:  MGVESNSA-PPPPTSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRF

Query:  CETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGVAPSTGTNTSLGCSTTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSS
        CETSTNLFPSQSDSSVPTSPVSPYRYQRPFSG+APS GTNTSLGCS TSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQP GPSS
Subjt:  CETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGVAPSTGTNTSLGCSTTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSS

Query:  MELPYCSMPEPGPNIEAEDRPCSCIKSLVDERVYQLEECSSMGLGVSESEYNEQKSCKDLNRDMKDSRSGG
        MELPYCSMPEPGPNIEAEDRPCSCIKSLVDERVYQLEECSSMGLGVSESEYNEQKSCKDLNRDMKDS+SGG
Subjt:  MELPYCSMPEPGPNIEAEDRPCSCIKSLVDERVYQLEECSSMGLGVSESEYNEQKSCKDLNRDMKDSRSGG

XP_008454306.1 PREDICTED: uncharacterized protein LOC103494744 isoform X2 [Cucumis melo]3.6e-13594.1Show/hide
Query:  MGVESNSA-PPPPTSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRF
        MGVESNSA PPPPTSSSSTPSPSGKRARDP+DEVYLDNFHSHKRYLSEIMASSLNGLTVG+PLSENLMDSPA        RDEMSWQYSPMSEDSDDCRF
Subjt:  MGVESNSA-PPPPTSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRF

Query:  CETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGVAPSTGTNTSLGCSTTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSS
        CETSTNLFPSQSDSSVPTSPVSPYRYQRPFSG+APS GTNTSLGCS TSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQP GPSS
Subjt:  CETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGVAPSTGTNTSLGCSTTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSS

Query:  MELPYCSMPEPGPNIEAEDRPCSCIKSLVDERVYQLEECSSMGLGVSESEYNEQKSCKDLNRDMKDSRSGG
        MELPYCSMPEPGPNIEAEDRPCSCIKSLVDERVYQLEECSSMGLGVSESEYNEQKSCKDLNRDMKDS+SGG
Subjt:  MELPYCSMPEPGPNIEAEDRPCSCIKSLVDERVYQLEECSSMGLGVSESEYNEQKSCKDLNRDMKDSRSGG

XP_011652951.1 uncharacterized protein LOC101212915 isoform X2 [Cucumis sativus]8.8e-14297.04Show/hide
Query:  MGVESNSAPPPPTSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRFC
        MGVESNSAPPPPTSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPA        RDEMSWQYSPMSEDSDDCRFC
Subjt:  MGVESNSAPPPPTSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRFC

Query:  ETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGVAPSTGTNTSLGCSTTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSM
        ETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGVAPSTGTNTSLGCSTTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSM
Subjt:  ETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGVAPSTGTNTSLGCSTTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSM

Query:  ELPYCSMPEPGPNIEAEDRPCSCIKSLVDERVYQLEECSSMGLGVSESEYNEQKSCKDLNRDMKDSRSGG
        ELPYCSMPEPGPNIEAEDRPCSCIKSLVDERVYQLEECSSMGLGVSESEYNEQKSCKDLNRDMKDSRSGG
Subjt:  ELPYCSMPEPGPNIEAEDRPCSCIKSLVDERVYQLEECSSMGLGVSESEYNEQKSCKDLNRDMKDSRSGG

XP_038904570.1 uncharacterized protein LOC120090942 [Benincasa hispida]1.6e-13594.44Show/hide
Query:  MGVESNSAPPPPTSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRFC
        MGVESNSAPPP  SSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYQR+EMSWQYSPMSEDSDDCRFC
Subjt:  MGVESNSAPPPPTSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRFC

Query:  ETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGVAPSTGTNTSLGCSTTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSM
        ETSTNLFP+QSDSSVPTSPVSPYRYQRPFSGV PSTGTNTSLGCS TSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQP GPSSM
Subjt:  ETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGVAPSTGTNTSLGCSTTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSM

Query:  ELPYCSMPEPGPNIEAEDRPCSCIKSLVDERVYQLEECSSMGLGVSESEYNEQKSCKDLNRDMKDSRSGG
        ELPYCSMPEPGPNIEAE+RPCSCIKSLVDER +QLEECSSM  GVSE EYNE+KSCKDLNRDMKDS SGG
Subjt:  ELPYCSMPEPGPNIEAEDRPCSCIKSLVDERVYQLEECSSMGLGVSESEYNEQKSCKDLNRDMKDSRSGG

TrEMBL top hitse value%identityAlignment
A0A0A0KWN0 Uncharacterized protein8.0e-149100Show/hide
Query:  MGVESNSAPPPPTSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRFC
        MGVESNSAPPPPTSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRFC
Subjt:  MGVESNSAPPPPTSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRFC

Query:  ETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGVAPSTGTNTSLGCSTTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSM
        ETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGVAPSTGTNTSLGCSTTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSM
Subjt:  ETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGVAPSTGTNTSLGCSTTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSM

Query:  ELPYCSMPEPGPNIEAEDRPCSCIKSLVDERVYQLEECSSMGLGVSESEYNEQKSCKDLNRDMKDSRSGG
        ELPYCSMPEPGPNIEAEDRPCSCIKSLVDERVYQLEECSSMGLGVSESEYNEQKSCKDLNRDMKDSRSGG
Subjt:  ELPYCSMPEPGPNIEAEDRPCSCIKSLVDERVYQLEECSSMGLGVSESEYNEQKSCKDLNRDMKDSRSGG

A0A1S3BXT2 uncharacterized protein LOC103494744 isoform X13.2e-14297.05Show/hide
Query:  MGVESNSA-PPPPTSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRF
        MGVESNSA PPPPTSSSSTPSPSGKRARDP+DEVYLDNFHSHKRYLSEIMASSLNGLTVG+PLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRF
Subjt:  MGVESNSA-PPPPTSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRF

Query:  CETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGVAPSTGTNTSLGCSTTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSS
        CETSTNLFPSQSDSSVPTSPVSPYRYQRPFSG+APS GTNTSLGCS TSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQP GPSS
Subjt:  CETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGVAPSTGTNTSLGCSTTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSS

Query:  MELPYCSMPEPGPNIEAEDRPCSCIKSLVDERVYQLEECSSMGLGVSESEYNEQKSCKDLNRDMKDSRSGG
        MELPYCSMPEPGPNIEAEDRPCSCIKSLVDERVYQLEECSSMGLGVSESEYNEQKSCKDLNRDMKDS+SGG
Subjt:  MELPYCSMPEPGPNIEAEDRPCSCIKSLVDERVYQLEECSSMGLGVSESEYNEQKSCKDLNRDMKDSRSGG

A0A1S3BYE6 uncharacterized protein LOC103494744 isoform X21.7e-13594.1Show/hide
Query:  MGVESNSA-PPPPTSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRF
        MGVESNSA PPPPTSSSSTPSPSGKRARDP+DEVYLDNFHSHKRYLSEIMASSLNGLTVG+PLSENLMDSPA        RDEMSWQYSPMSEDSDDCRF
Subjt:  MGVESNSA-PPPPTSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRF

Query:  CETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGVAPSTGTNTSLGCSTTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSS
        CETSTNLFPSQSDSSVPTSPVSPYRYQRPFSG+APS GTNTSLGCS TSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQP GPSS
Subjt:  CETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGVAPSTGTNTSLGCSTTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSS

Query:  MELPYCSMPEPGPNIEAEDRPCSCIKSLVDERVYQLEECSSMGLGVSESEYNEQKSCKDLNRDMKDSRSGG
        MELPYCSMPEPGPNIEAEDRPCSCIKSLVDERVYQLEECSSMGLGVSESEYNEQKSCKDLNRDMKDS+SGG
Subjt:  MELPYCSMPEPGPNIEAEDRPCSCIKSLVDERVYQLEECSSMGLGVSESEYNEQKSCKDLNRDMKDSRSGG

A0A5A7TRC2 Uncharacterized protein3.2e-14297.05Show/hide
Query:  MGVESNSA-PPPPTSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRF
        MGVESNSA PPPPTSSSSTPSPSGKRARDP+DEVYLDNFHSHKRYLSEIMASSLNGLTVG+PLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRF
Subjt:  MGVESNSA-PPPPTSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRF

Query:  CETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGVAPSTGTNTSLGCSTTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSS
        CETSTNLFPSQSDSSVPTSPVSPYRYQRPFSG+APS GTNTSLGCS TSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQP GPSS
Subjt:  CETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGVAPSTGTNTSLGCSTTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSS

Query:  MELPYCSMPEPGPNIEAEDRPCSCIKSLVDERVYQLEECSSMGLGVSESEYNEQKSCKDLNRDMKDSRSGG
        MELPYCSMPEPGPNIEAEDRPCSCIKSLVDERVYQLEECSSMGLGVSESEYNEQKSCKDLNRDMKDS+SGG
Subjt:  MELPYCSMPEPGPNIEAEDRPCSCIKSLVDERVYQLEECSSMGLGVSESEYNEQKSCKDLNRDMKDSRSGG

A0A6J1F722 uncharacterized protein LOC111441454 isoform X11.5e-13191.94Show/hide
Query:  MGVESNSAPPPP---TSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDC
        MGVESNS PPPP   +SSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLY RDEMS QYSPMSEDSDDC
Subjt:  MGVESNSAPPPP---TSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDC

Query:  RFCETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGVAPSTGTNTSLGCSTTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGP
        RFCETSTNLFPSQSDSSVPTSPVSPYRYQRPFSG+ PSTGTNTSLGC+ T PVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQP GP
Subjt:  RFCETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGVAPSTGTNTSLGCSTTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGP

Query:  SSMELPYCSMPEPGPNIEAEDRPCSCIKSLVDERVYQLEECSSMGLGVSESEYNEQKSCKDLNRDMKDSRSGG
        SSMELPYCSMPEPGPNIEAE+R CS IKSLVDERVYQL ECSSM  GVSE EYNEQKSCKDLNR+MKDS SGG
Subjt:  SSMELPYCSMPEPGPNIEAEDRPCSCIKSLVDERVYQLEECSSMGLGVSESEYNEQKSCKDLNRDMKDSRSGG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G25920.1 BEST Arabidopsis thaliana protein match is: 3'-5' exonuclease domain-containing protein / K homology domain-containing protein / KH domain-containing protein (TAIR:AT2G25910.2)1.5e-5955.3Show/hide
Query:  PPPPTSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRFCE--TSTNL
        PP P S S   SP GKR RDP+DEVYLDN  S KRYLSEIMA SLNGLTVGD L  N+++SPARSES LY RD++S QYSPMSEDSD+ RFCE  T+T  
Subjt:  PPPPTSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRFCE--TSTNL

Query:  FPSQSDSSVPTSPVSPYRYQRPFSGVAPSTGTNTSLGCSTTSP---------VTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGP
          S    S PTSPVSPYRYQRP +       + T L  S T P          T+ Q  QRGSD+EGRFPSSPSDICHS DLRR ALLRSVQMR QP G 
Subjt:  FPSQSDSSVPTSPVSPYRYQRPFSGVAPSTGTNTSLGCSTTSP---------VTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGP

Query:  SSMELPYCSMPEPGPNIEAEDRPCSCIKSLVDERVYQLEECSSMGLGVSESEY-NEQKSCKDLN
        SS   P         NI+ E+R CS  KS+ ++R Y      + G  +  +E  ++ KSCK L+
Subjt:  SSMELPYCSMPEPGPNIEAEDRPCSCIKSLVDERVYQLEECSSMGLGVSESEY-NEQKSCKDLN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCGTCGAATCAAACTCCGCGCCGCCCCCGCCAACGTCCTCCTCTTCTACGCCATCTCCGAGCGGGAAGAGGGCCAGAGATCCCGACGATGAAGTTTACCTCGACAA
TTTCCACTCTCACAAACGCTACCTCAGTGAGATAATGGCTTCTAGTTTGAACGGATTGACGGTGGGGGACCCCCTTTCAGAGAATCTTATGGATTCCCCTGCGAGGTCAG
AGTCTATGCTTTATCAAAGGGATGAAATGTCCTGGCAATATTCCCCTATGTCAGAAGACTCAGATGACTGCCGGTTTTGTGAGACATCCACCAATTTGTTTCCCTCGCAG
TCTGATAGCAGTGTACCTACCAGCCCGGTCTCTCCATACCGATATCAGAGGCCATTCAGTGGGGTGGCTCCTTCAACAGGTACCAATACTTCGCTTGGATGTTCTACTAC
CAGTCCCGTCACTAGCTTGCAGCCCCATCAACGTGGATCAGATTCTGAGGGTCGTTTCCCATCATCTCCAAGTGATATATGTCACTCAGCAGATCTGAGAAGGGCTGCGC
TCCTGCGTTCGGTACAGATGAGAGCACAACCTCCTGGTCCATCATCTATGGAGTTGCCATATTGCTCTATGCCTGAGCCTGGACCAAATATAGAAGCTGAAGACCGGCCA
TGCTCTTGTATAAAATCGTTGGTTGATGAAAGAGTTTATCAACTTGAGGAATGCTCATCAATGGGGTTGGGAGTGTCTGAGTCCGAATATAATGAACAGAAATCATGCAA
GGACTTGAACAGGGATATGAAAGACAGCCGGTCTGGAGGGTAG
mRNA sequenceShow/hide mRNA sequence
GTTTGAAAGAAATTTGTAGAAATGTTTAGAGCTTCAAAATTCAAACTTTTATAAAAATCCCAAACAAAAGTTTGAAATAATAAAATTAAAGCTGTTTTTGGGAAAATTCG
AAGGAGAGTCCCAGAAAGGGGAAAACCATAGAAAAAGCTTTGTTGTTGTTCATCTTCTACAATTCCTTCCGGTGATTCCATTTCCAGTTTCAAAAATCGCCAACATTCTC
CTCTTCTCCACCCGAAGGATCATCGAATTTCTATTTTTTGCTTCCATTTCTTGCTCCGGAAACTGTTCAAGTTGCCCTAATGGGCGTCGAATCAAACTCCGCGCCGCCCC
CGCCAACGTCCTCCTCTTCTACGCCATCTCCGAGCGGGAAGAGGGCCAGAGATCCCGACGATGAAGTTTACCTCGACAATTTCCACTCTCACAAACGCTACCTCAGTGAG
ATAATGGCTTCTAGTTTGAACGGATTGACGGTGGGGGACCCCCTTTCAGAGAATCTTATGGATTCCCCTGCGAGGTCAGAGTCTATGCTTTATCAAAGGGATGAAATGTC
CTGGCAATATTCCCCTATGTCAGAAGACTCAGATGACTGCCGGTTTTGTGAGACATCCACCAATTTGTTTCCCTCGCAGTCTGATAGCAGTGTACCTACCAGCCCGGTCT
CTCCATACCGATATCAGAGGCCATTCAGTGGGGTGGCTCCTTCAACAGGTACCAATACTTCGCTTGGATGTTCTACTACCAGTCCCGTCACTAGCTTGCAGCCCCATCAA
CGTGGATCAGATTCTGAGGGTCGTTTCCCATCATCTCCAAGTGATATATGTCACTCAGCAGATCTGAGAAGGGCTGCGCTCCTGCGTTCGGTACAGATGAGAGCACAACC
TCCTGGTCCATCATCTATGGAGTTGCCATATTGCTCTATGCCTGAGCCTGGACCAAATATAGAAGCTGAAGACCGGCCATGCTCTTGTATAAAATCGTTGGTTGATGAAA
GAGTTTATCAACTTGAGGAATGCTCATCAATGGGGTTGGGAGTGTCTGAGTCCGAATATAATGAACAGAAATCATGCAAGGACTTGAACAGGGATATGAAAGACAGCCGG
TCTGGAGGGTAGTAAATGCTAAAAATGTACAGAGGAAAATCTCTTCTAACATAACAACACATGTGACACATTGCTGCTGTGAGACTGGAAAATGTTTGCCTAATTTACTT
GCTTTCTCTTGATTGAGGATGGTTTAGTCTTCTGCTAGCTCCATTTCCAAGTGCGTGGTGCCTTGTGCTATTTTCATTGTCAAGTGTGTGTGGAAGTTGCGTTTAACCAT
TCAACGCCTGTGAAGCCTGGGTTGGGAAGATGTTCATGGTTGATCTGGGTTAATCATGTATGAAAGCTCTGTCTATCTATTTGAGTCTATTGGAGCTACCGGAGTCAAGA
GAAGAAGTTTCTCAGAGTTAAGTTTATTGATAAATGATGCGGCTTTATGCGTCACAAGGTTGAAGTTGCAAGTGTTCCATTCGTTTCATATCTGTGAGAACACTGTTATT
ATTGAATTATTTTGTTCTAATTGTGTTATGAGCATGCAGGCATAATTGATGAGAAAACTAAAAGATTTAAAGAGGTGAGGAAGGGAGCAAAGGGAATTTGAAGCAAATAG
CTAAAATAACATACATTATATTGTAGAGTATAGTAAAATTACCGATTAGATAGCTGATGTATCTATCATTGGTATGTATATTTTTTTTCCATTTGGATTTGTATAGTTGG
AGACTGCAAGTTACCATTCTGAAGAGCTACTGATCGTCCATAAAAAAATTGCATGTATCTATGTTGGGGGGT
Protein sequenceShow/hide protein sequence
MGVESNSAPPPPTSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRFCETSTNLFPSQ
SDSSVPTSPVSPYRYQRPFSGVAPSTGTNTSLGCSTTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSMELPYCSMPEPGPNIEAEDRP
CSCIKSLVDERVYQLEECSSMGLGVSESEYNEQKSCKDLNRDMKDSRSGG