; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0016867 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0016867
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionBEST Arabidopsis thaliana protein match is: 3'-5' exonuclease domain-containing protein / K homology domain-containing protein / KH domain-containing protein .
Genome locationchr07:23862484..23866536
RNA-Seq ExpressionIVF0016867
SyntenyIVF0016867
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004152357.1 uncharacterized protein LOC101212915 isoform X1 [Cucumis sativus]6.25e-16188.56Show/hide
Query:  MGVESNSAPPPPPTSSSSTPSPSGKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGEPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRF
        MGVESNSAPPPP TSSSSTPSPSGKRARDP+DEVYLDNFHSHKRYLSEIMASSLNGLTVG+PLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRF
Subjt:  MGVESNSAPPPPPTSSSSTPSPSGKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGEPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRF

Query:  CETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGMAPSNGTNTSLGWS--------------------FPIISSDICHSADLRRAALLRSVQMRAQPAGPSS
        CETSTNLFPSQSDSSVPTSPVSPYRYQRPFSG+APS GTNTSLG S                    FP   SDICHSADLRRAALLRSVQMRAQP GPSS
Subjt:  CETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGMAPSNGTNTSLGWS--------------------FPIISSDICHSADLRRAALLRSVQMRAQPAGPSS

Query:  MELPYCSMPEPGPNIEAEDRPCSCIKSLVDERVYQLEECSSMGLGVSESEYNEQKSCKDLNRDMKDSQSGG
        MELPYCSMPEPGPNIEAEDRPCSCIKSLVDERVYQLEECSSMGLGVSESEYNEQKSCKDLNRDMKDS+SGG
Subjt:  MELPYCSMPEPGPNIEAEDRPCSCIKSLVDERVYQLEECSSMGLGVSESEYNEQKSCKDLNRDMKDSQSGG

XP_008454305.1 PREDICTED: uncharacterized protein LOC103494744 isoform X1 [Cucumis melo]1.76e-16791.48Show/hide
Query:  MGVESNSAPPPPPTSSSSTPSPSGKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGEPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRF
        MGVESNSAPPPPPTSSSSTPSPSGKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGEPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRF
Subjt:  MGVESNSAPPPPPTSSSSTPSPSGKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGEPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRF

Query:  CETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGMAPSNGTNTSLGWS-------------------FPIISSDICHSADLRRAALLRSVQMRAQPAGPSSM
        CETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGMAPSNGTNTSLG S                   FP   SDICHSADLRRAALLRSVQMRAQPAGPSSM
Subjt:  CETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGMAPSNGTNTSLGWS-------------------FPIISSDICHSADLRRAALLRSVQMRAQPAGPSSM

Query:  ELPYCSMPEPGPNIEAEDRPCSCIKSLVDERVYQLEECSSMGLGVSESEYNEQKSCKDLNRDMKDSQSGG
        ELPYCSMPEPGPNIEAEDRPCSCIKSLVDERVYQLEECSSMGLGVSESEYNEQKSCKDLNRDMKDSQSGG
Subjt:  ELPYCSMPEPGPNIEAEDRPCSCIKSLVDERVYQLEECSSMGLGVSESEYNEQKSCKDLNRDMKDSQSGG

XP_008454306.1 PREDICTED: uncharacterized protein LOC103494744 isoform X2 [Cucumis melo]1.28e-15888.52Show/hide
Query:  MGVESNSAPPPPPTSSSSTPSPSGKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGEPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRF
        MGVESNSAPPPPPTSSSSTPSPSGKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGEPLSENLMDSPAR        DEMSWQYSPMSEDSDDCRF
Subjt:  MGVESNSAPPPPPTSSSSTPSPSGKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGEPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRF

Query:  CETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGMAPSNGTNTSLGWS-------------------FPIISSDICHSADLRRAALLRSVQMRAQPAGPSSM
        CETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGMAPSNGTNTSLG S                   FP   SDICHSADLRRAALLRSVQMRAQPAGPSSM
Subjt:  CETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGMAPSNGTNTSLGWS-------------------FPIISSDICHSADLRRAALLRSVQMRAQPAGPSSM

Query:  ELPYCSMPEPGPNIEAEDRPCSCIKSLVDERVYQLEECSSMGLGVSESEYNEQKSCKDLNRDMKDSQSGG
        ELPYCSMPEPGPNIEAEDRPCSCIKSLVDERVYQLEECSSMGLGVSESEYNEQKSCKDLNRDMKDSQSGG
Subjt:  ELPYCSMPEPGPNIEAEDRPCSCIKSLVDERVYQLEECSSMGLGVSESEYNEQKSCKDLNRDMKDSQSGG

XP_011652951.1 uncharacterized protein LOC101212915 isoform X2 [Cucumis sativus]4.53e-15285.61Show/hide
Query:  MGVESNSAPPPPPTSSSSTPSPSGKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGEPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRF
        MGVESNSAPPPP TSSSSTPSPSGKRARDP+DEVYLDNFHSHKRYLSEIMASSLNGLTVG+PLSENLMDSPAR        DEMSWQYSPMSEDSDDCRF
Subjt:  MGVESNSAPPPPPTSSSSTPSPSGKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGEPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRF

Query:  CETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGMAPSNGTNTSLGWS--------------------FPIISSDICHSADLRRAALLRSVQMRAQPAGPSS
        CETSTNLFPSQSDSSVPTSPVSPYRYQRPFSG+APS GTNTSLG S                    FP   SDICHSADLRRAALLRSVQMRAQP GPSS
Subjt:  CETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGMAPSNGTNTSLGWS--------------------FPIISSDICHSADLRRAALLRSVQMRAQPAGPSS

Query:  MELPYCSMPEPGPNIEAEDRPCSCIKSLVDERVYQLEECSSMGLGVSESEYNEQKSCKDLNRDMKDSQSGG
        MELPYCSMPEPGPNIEAEDRPCSCIKSLVDERVYQLEECSSMGLGVSESEYNEQKSCKDLNRDMKDS+SGG
Subjt:  MELPYCSMPEPGPNIEAEDRPCSCIKSLVDERVYQLEECSSMGLGVSESEYNEQKSCKDLNRDMKDSQSGG

XP_038904570.1 uncharacterized protein LOC120090942 [Benincasa hispida]8.35e-15184.44Show/hide
Query:  MGVESNSAPPPPPTSSSSTPSPSGKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGEPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRF
        MGVESNSAPPP   SSSSTPSPSGKRARDP+DEVYLDNFHSHKRYLSEIMASSLNGLTVG+PLSENLMDSPARSESMLYQR+EMSWQYSPMSEDSDDCRF
Subjt:  MGVESNSAPPPPPTSSSSTPSPSGKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGEPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRF

Query:  CETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGMAPSNGTNTSLGWS-------------------FPIISSDICHSADLRRAALLRSVQMRAQPAGPSSM
        CETSTNLFP+QSDSSVPTSPVSPYRYQRPFSG+ PS GTNTSLG S                   FP   SDICHSADLRRAALLRSVQMRAQP GPSSM
Subjt:  CETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGMAPSNGTNTSLGWS-------------------FPIISSDICHSADLRRAALLRSVQMRAQPAGPSSM

Query:  ELPYCSMPEPGPNIEAEDRPCSCIKSLVDERVYQLEECSSMGLGVSESEYNEQKSCKDLNRDMKDSQSGG
        ELPYCSMPEPGPNIEAE+RPCSCIKSLVDER +QLEECSSMG  VSE EYNE+KSCKDLNRDMKDS+SGG
Subjt:  ELPYCSMPEPGPNIEAEDRPCSCIKSLVDERVYQLEECSSMGLGVSESEYNEQKSCKDLNRDMKDSQSGG

TrEMBL top hitse value%identityAlignment
A0A0A0KWN0 Uncharacterized protein5.8e-12688.56Show/hide
Query:  MGVESNSAPPPPPTSSSSTPSPSGKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGEPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRF
        MGVESNSA PPPPTSSSSTPSPSGKRARDP+DEVYLDNFHSHKRYLSEIMASSLNGLTVG+PLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRF
Subjt:  MGVESNSAPPPPPTSSSSTPSPSGKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGEPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRF

Query:  CETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGMAPSNGTNTSLGWS--------------------FPIISSDICHSADLRRAALLRSVQMRAQPAGPSS
        CETSTNLFPSQSDSSVPTSPVSPYRYQRPFSG+APS GTNTSLG S                    FP   SDICHSADLRRAALLRSVQMRAQP GPSS
Subjt:  CETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGMAPSNGTNTSLGWS--------------------FPIISSDICHSADLRRAALLRSVQMRAQPAGPSS

Query:  MELPYCSMPEPGPNIEAEDRPCSCIKSLVDERVYQLEECSSMGLGVSESEYNEQKSCKDLNRDMKDSQSGG
        MELPYCSMPEPGPNIEAEDRPCSCIKSLVDERVYQLEECSSMGLGVSESEYNEQKSCKDLNRDMKDS+SGG
Subjt:  MELPYCSMPEPGPNIEAEDRPCSCIKSLVDERVYQLEECSSMGLGVSESEYNEQKSCKDLNRDMKDSQSGG

A0A1S3BXT2 uncharacterized protein LOC103494744 isoform X16.0e-13191.48Show/hide
Query:  MGVESNSAPPPPPTSSSSTPSPSGKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGEPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRF
        MGVESNSAPPPPPTSSSSTPSPSGKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGEPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRF
Subjt:  MGVESNSAPPPPPTSSSSTPSPSGKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGEPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRF

Query:  CETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGMAPSNGTNTSLGWS-------------------FPIISSDICHSADLRRAALLRSVQMRAQPAGPSSM
        CETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGMAPSNGTNTSLG S                   FP   SDICHSADLRRAALLRSVQMRAQPAGPSSM
Subjt:  CETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGMAPSNGTNTSLGWS-------------------FPIISSDICHSADLRRAALLRSVQMRAQPAGPSSM

Query:  ELPYCSMPEPGPNIEAEDRPCSCIKSLVDERVYQLEECSSMGLGVSESEYNEQKSCKDLNRDMKDSQSGG
        ELPYCSMPEPGPNIEAEDRPCSCIKSLVDERVYQLEECSSMGLGVSESEYNEQKSCKDLNRDMKDSQSGG
Subjt:  ELPYCSMPEPGPNIEAEDRPCSCIKSLVDERVYQLEECSSMGLGVSESEYNEQKSCKDLNRDMKDSQSGG

A0A1S3BYE6 uncharacterized protein LOC103494744 isoform X22.4e-12488.52Show/hide
Query:  MGVESNSAPPPPPTSSSSTPSPSGKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGEPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRF
        MGVESNSAPPPPPTSSSSTPSPSGKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGEPLSENLMDSPA        RDEMSWQYSPMSEDSDDCRF
Subjt:  MGVESNSAPPPPPTSSSSTPSPSGKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGEPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRF

Query:  CETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGMAPSNGTNTSLGWS-------------------FPIISSDICHSADLRRAALLRSVQMRAQPAGPSSM
        CETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGMAPSNGTNTSLG S                   FP   SDICHSADLRRAALLRSVQMRAQPAGPSSM
Subjt:  CETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGMAPSNGTNTSLGWS-------------------FPIISSDICHSADLRRAALLRSVQMRAQPAGPSSM

Query:  ELPYCSMPEPGPNIEAEDRPCSCIKSLVDERVYQLEECSSMGLGVSESEYNEQKSCKDLNRDMKDSQSGG
        ELPYCSMPEPGPNIEAEDRPCSCIKSLVDERVYQLEECSSMGLGVSESEYNEQKSCKDLNRDMKDSQSGG
Subjt:  ELPYCSMPEPGPNIEAEDRPCSCIKSLVDERVYQLEECSSMGLGVSESEYNEQKSCKDLNRDMKDSQSGG

A0A5A7TRC2 Uncharacterized protein6.0e-13191.48Show/hide
Query:  MGVESNSAPPPPPTSSSSTPSPSGKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGEPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRF
        MGVESNSAPPPPPTSSSSTPSPSGKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGEPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRF
Subjt:  MGVESNSAPPPPPTSSSSTPSPSGKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGEPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRF

Query:  CETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGMAPSNGTNTSLGWS-------------------FPIISSDICHSADLRRAALLRSVQMRAQPAGPSSM
        CETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGMAPSNGTNTSLG S                   FP   SDICHSADLRRAALLRSVQMRAQPAGPSSM
Subjt:  CETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGMAPSNGTNTSLGWS-------------------FPIISSDICHSADLRRAALLRSVQMRAQPAGPSSM

Query:  ELPYCSMPEPGPNIEAEDRPCSCIKSLVDERVYQLEECSSMGLGVSESEYNEQKSCKDLNRDMKDSQSGG
        ELPYCSMPEPGPNIEAEDRPCSCIKSLVDERVYQLEECSSMGLGVSESEYNEQKSCKDLNRDMKDSQSGG
Subjt:  ELPYCSMPEPGPNIEAEDRPCSCIKSLVDERVYQLEECSSMGLGVSESEYNEQKSCKDLNRDMKDSQSGG

A0A6J1F722 uncharacterized protein LOC111441454 isoform X19.3e-11683.82Show/hide
Query:  MGVESNSAPPPPP--TSSSSTPSPSGKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGEPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDC
        MGVESNS PPPPP  +SSSSTPSPSGKRARDP+DEVYLDNFHSHKRYLSEIMASSLNGLTVG+PLSENLMDSPARSESMLY RDEMS QYSPMSEDSDDC
Subjt:  MGVESNSAPPPPP--TSSSSTPSPSGKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGEPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDC

Query:  RFCETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGMAPSNGTNTSLGWS-------------------FPIISSDICHSADLRRAALLRSVQMRAQPAGPS
        RFCETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGM PS GTNTSLG +                   FP   SDICHSADLRRAALLRSVQMRAQP GPS
Subjt:  RFCETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGMAPSNGTNTSLGWS-------------------FPIISSDICHSADLRRAALLRSVQMRAQPAGPS

Query:  SMELPYCSMPEPGPNIEAEDRPCSCIKSLVDERVYQLEECSSMGLGVSESEYNEQKSCKDLNRDMKDSQSGG
        SMELPYCSMPEPGPNIEAE+R CS IKSLVDERVYQL ECSSM  GVSE EYNEQKSCKDLNR+MKDS+SGG
Subjt:  SMELPYCSMPEPGPNIEAEDRPCSCIKSLVDERVYQLEECSSMGLGVSESEYNEQKSCKDLNRDMKDSQSGG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G25920.1 BEST Arabidopsis thaliana protein match is: 3'-5' exonuclease domain-containing protein / K homology domain-containing protein / KH domain-containing protein (TAIR:AT2G25910.2)1.5e-4950.76Show/hide
Query:  PPPPTSSSSTPSPSGKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGEPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRFCE--TSTNL
        PP P S S   SP GKR RDPEDEVYLDN  S KRYLSEIMA SLNGLTVG+ L  N+++SPARSES LY RD++S QYSPMSEDSD+ RFCE  T+T  
Subjt:  PPPPTSSSSTPSPSGKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGEPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRFCE--TSTNL

Query:  FPSQSDSSVPTSPVSPYRYQRPF---------------SGMAP----SNGTNTSLGWS----------FPIISSDICHSADLRRAALLRSVQMRAQPAGP
          S    S PTSPVSPYRYQRP                S   P    SN   T+   S          FP   SDICHS DLRR ALLRSVQMR QP G 
Subjt:  FPSQSDSSVPTSPVSPYRYQRPF---------------SGMAP----SNGTNTSLGWS----------FPIISSDICHSADLRRAALLRSVQMRAQPAGP

Query:  SSMELPYCSMPEPGPNIEAEDRPCSCIKSLVDERVYQLEECSSMGLGVSESEY-NEQKSCKDLN
        SS   P         NI+ E+R CS  KS+ ++R Y      + G  +  +E  ++ KSCK L+
Subjt:  SSMELPYCSMPEPGPNIEAEDRPCSCIKSLVDERVYQLEECSSMGLGVSESEY-NEQKSCKDLN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCCATCAGTACTAAATTCTTTCATGTCCGCAAACCTTGGAAAATCCTACGCCGCACCTTTCCGGTGGATTTCCGCCCCCTGCGCCTTGCCCTGATGGGCGTCGAATC
AAACTCCGCGCCGCCGCCGCCGCCAACGTCCTCGTCTTCTACGCCTTCTCCGAGCGGGAAGAGGGCCAGAGATCCCGAGGATGAAGTTTATCTCGACAATTTCCACTCTC
ACAAACGCTACCTCAGTGAGATAATGGCTTCTAGTTTGAATGGATTGACGGTGGGGGAGCCCCTTTCAGAGAATCTTATGGATTCCCCTGCGAGGTCAGAGTCTATGCTT
TATCAAAGGGATGAAATGTCCTGGCAATATTCCCCTATGTCAGAAGATTCAGATGACTGCCGGTTTTGTGAGACATCCACCAACTTGTTTCCCTCGCAGTCAGATAGCAG
TGTACCTACCAGCCCGGTCTCTCCATACAGATATCAGAGGCCATTCAGTGGGATGGCTCCTTCAAATGGTACCAATACTTCGCTTGGATGGTCGTTTCCCATCATCTCAA
GTGATATATGTCACTCAGCAGATCTGAGAAGGGCTGCGCTCCTGCGTTCGGTACAGATGAGAGCACAACCTGCTGGTCCATCATCTATGGAGTTGCCATATTGCTCTATG
CCTGAGCCTGGACCAAATATAGAAGCTGAAGACCGGCCATGTTCTTGTATAAAATCGTTGGTTGATGAAAGAGTTTATCAACTCGAGGAATGCTCATCTATGGGATTGGG
AGTGTCTGAGTCTGAATATAATGAACAAAAATCATGCAAGGACTTGAACAGGGATATGAAAGACAGCCAGTCTGGAGGGTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCCATCAGTACTAAATTCTTTCATGTCCGCAAACCTTGGAAAATCCTACGCCGCACCTTTCCGGTGGATTTCCGCCCCCTGCGCCTTGCCCTGATGGGCGTCGAATC
AAACTCCGCGCCGCCGCCGCCGCCAACGTCCTCGTCTTCTACGCCTTCTCCGAGCGGGAAGAGGGCCAGAGATCCCGAGGATGAAGTTTATCTCGACAATTTCCACTCTC
ACAAACGCTACCTCAGTGAGATAATGGCTTCTAGTTTGAATGGATTGACGGTGGGGGAGCCCCTTTCAGAGAATCTTATGGATTCCCCTGCGAGGTCAGAGTCTATGCTT
TATCAAAGGGATGAAATGTCCTGGCAATATTCCCCTATGTCAGAAGATTCAGATGACTGCCGGTTTTGTGAGACATCCACCAACTTGTTTCCCTCGCAGTCAGATAGCAG
TGTACCTACCAGCCCGGTCTCTCCATACAGATATCAGAGGCCATTCAGTGGGATGGCTCCTTCAAATGGTACCAATACTTCGCTTGGATGGTCGTTTCCCATCATCTCAA
GTGATATATGTCACTCAGCAGATCTGAGAAGGGCTGCGCTCCTGCGTTCGGTACAGATGAGAGCACAACCTGCTGGTCCATCATCTATGGAGTTGCCATATTGCTCTATG
CCTGAGCCTGGACCAAATATAGAAGCTGAAGACCGGCCATGTTCTTGTATAAAATCGTTGGTTGATGAAAGAGTTTATCAACTCGAGGAATGCTCATCTATGGGATTGGG
AGTGTCTGAGTCTGAATATAATGAACAAAAATCATGCAAGGACTTGAACAGGGATATGAAAGACAGCCAGTCTGGAGGGTAG
Protein sequenceShow/hide protein sequence
MSISTKFFHVRKPWKILRRTFPVDFRPLRLALMGVESNSAPPPPPTSSSSTPSPSGKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGEPLSENLMDSPARSESML
YQRDEMSWQYSPMSEDSDDCRFCETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGMAPSNGTNTSLGWSFPIISSDICHSADLRRAALLRSVQMRAQPAGPSSMELPYCSM
PEPGPNIEAEDRPCSCIKSLVDERVYQLEECSSMGLGVSESEYNEQKSCKDLNRDMKDSQSGG