; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc04G09150 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc04G09150
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionFUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: plasma membrane; BEST Arabidopsis thaliana protein match is: protamine P1 family protein .
Genome locationClcChr04:22788647..22790019
RNA-Seq ExpressionClc04G09150
SyntenyClc04G09150
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0047280.1 uncharacterized protein E6C27_scaffold908G00730 [Cucumis melo var. makuwa]3.0e-9474.47Show/hide
Query:  MKQLSTSISSPSRTDLFPPPLMSFLRADAGNRSKSGRSRSSPIFVRKKNIAIETQEPSSPKVTCMGQVRTNKRSSNRTPAARFRWIRSVLSFNRRHCRTF
        MKQLS SISSPSRTDLFPPPLMSFLRADAGNRSKS RSRSSPIF+RKKN+AIET+EPSSPKVTCMGQVRTNKRSSN+TPA R RWIRSVLSFNRRHCRTF
Subjt:  MKQLSTSISSPSRTDLFPPPLMSFLRADAGNRSKSGRSRSSPIFVRKKNIAIETQEPSSPKVTCMGQVRTNKRSSNRTPAARFRWIRSVLSFNRRHCRTF

Query:  WNKSAMLFRRKCAIRRKSSISESRVGNEAED-----EENDGGARDPVVA-SSVPSPPINALILTRCRSAPNRSSLYCNRYRSSPITSD-KTGEEEDKTER
        WN+SAMLFR K  IRR   ISESRVGNEAED     E++DG  RD V A SSVPSPP NALILTRCRS PNRSS   NRYRSS ITSD  T EEE+KTER
Subjt:  WNKSAMLFRRKCAIRRKSSISESRVGNEAED-----EENDGGARDPVVA-SSVPSPPINALILTRCRSAPNRSSLYCNRYRSSPITSD-KTGEEEDKTER

Query:  DNG----------NSERLFNKLENSHGDGDSKSVHNKERKMEEKSMLNRNLILTRCKSEPARIAEKLYGELNVREEERSVMA
          G          NSERL  KLE+S GDGD KSV+            NRNLILTRCKSEPARIAEKLYGELN++EEER  +A
Subjt:  DNG----------NSERLFNKLENSHGDGDSKSVHNKERKMEEKSMLNRNLILTRCKSEPARIAEKLYGELNVREEERSVMA

XP_008449922.1 PREDICTED: uncharacterized protein LOC103491651 [Cucumis melo]6.6e-9774.91Show/hide
Query:  MKQLSTSISSPSRTDLFPPPLMSFLRADAGNRSKSGRSRSSPIFVRKKNIAIETQEPSSPKVTCMGQVRTNKRSSNRTPAARFRWIRSVLSFNRRHCRTF
        MKQLS SISSPSRTDLFPPPLMSFLRADAGNRSKS RSRSSPIF+RKKN+AIET+EPSSPKVTCMGQVRTNKRSSN+TPA R RWIRSVLSFNRRHCRTF
Subjt:  MKQLSTSISSPSRTDLFPPPLMSFLRADAGNRSKSGRSRSSPIFVRKKNIAIETQEPSSPKVTCMGQVRTNKRSSNRTPAARFRWIRSVLSFNRRHCRTF

Query:  WNKSAMLFRRKCAIRRKSSISESRVGNEAED-----EENDGGARDPVVA-SSVPSPPINALILTRCRSAPNRSSLYCNRYRSSPITSD-KTGEEEDKTER
        WN+SAMLFR K  IRR   ISESRVGNEAED     E++DG  RD V A SSVPSPP NALILTRCRS PNRSS   NRYRSS ITSD  T EEE+KTER
Subjt:  WNKSAMLFRRKCAIRRKSSISESRVGNEAED-----EENDGGARDPVVA-SSVPSPPINALILTRCRSAPNRSSLYCNRYRSSPITSD-KTGEEEDKTER

Query:  DNG----------NSERLFNKLENSHGDGDSKSVHNKERKMEEKSMLNRNLILTRCKSEPARIAEKLYGELNVREEERSVMAKNNSY
          G          NSERL  KLE+S GDGD KSV+            NRNLILTRCKSEPARIAEKLYGELN++EEER VM K NSY
Subjt:  DNG----------NSERLFNKLENSHGDGDSKSVHNKERKMEEKSMLNRNLILTRCKSEPARIAEKLYGELNVREEERSVMAKNNSY

XP_022925671.1 uncharacterized protein LOC111433021 [Cucurbita moschata]5.9e-9072.12Show/hide
Query:  MKQLSTSISSPSRTDLFPPPLMSFLRADAGNRSKSGRSRSSPIFVRKKNIAIETQEPSSPKVTCMGQVRTNKRSSNRTPAARFRWIRSVLSFNRRHCRTF
        MKQ    ISSPSR DLFPPPLMSFLRADAGNRSKSGRSRSSPIF+RKKN+AIETQEPSSPKVTCMGQVRTNKRSS R PA R RWIRSVLSFNRR CRTF
Subjt:  MKQLSTSISSPSRTDLFPPPLMSFLRADAGNRSKSGRSRSSPIFVRKKNIAIETQEPSSPKVTCMGQVRTNKRSSNRTPAARFRWIRSVLSFNRRHCRTF

Query:  WNKSAMLFRRKCAIRRKSSISESRVGNEAED----EENDGGARDPVVASSVPSPPINALILTRCRSAPNRSSLYCNRYRSSPITSDKTGEEE---DKTER
        WN+S M F+    IRRKSSI+ESRV +EAED    EE++G ARD V ASS PSPP NALILTRCRSAP+RSS YCNRY  S I SD+T EEE   +KTE 
Subjt:  WNKSAMLFRRKCAIRRKSSISESRVGNEAED----EENDGGARDPVVASSVPSPPINALILTRCRSAPNRSSLYCNRYRSSPITSDKTGEEE---DKTER

Query:  DN----------GNSERLFNKLENSHGDGDSKSVHNKERKMEEKSMLNRNLILTRCKSEPARIAEKLYG
        +N           NSER+F KLENS+G+ D  SV+NKE K+EE SM NR+LILTRCKSEP RI E+LYG
Subjt:  DN----------GNSERLFNKLENSHGDGDSKSVHNKERKMEEKSMLNRNLILTRCKSEPARIAEKLYG

XP_023543942.1 uncharacterized protein LOC111803666 [Cucurbita pepo subsp. pepo]2.0e-9072.49Show/hide
Query:  MKQLSTSISSPSRTDLFPPPLMSFLRADAGNRSKSGRSRSSPIFVRKKNIAIETQEPSSPKVTCMGQVRTNKRSSNRTPAARFRWIRSVLSFNRRHCRTF
        MKQ    ISSPSR DLFPPPLMSFLRADAGNRSKSGRSRSSPIF+RKKN+AIETQEPSSPKVTCMGQVRTNKRSS R PA R RWIRSVLSFNRRHCRTF
Subjt:  MKQLSTSISSPSRTDLFPPPLMSFLRADAGNRSKSGRSRSSPIFVRKKNIAIETQEPSSPKVTCMGQVRTNKRSSNRTPAARFRWIRSVLSFNRRHCRTF

Query:  WNKSAMLFRRKCAIRRKSSISESRVGNEAED----EENDGGARDPVVASSVPSPPINALILTRCRSAPNRSSLYCNRYRSSPITSDKTGEEE---DKTER
        WN+S M F+ K  IRRKSSI+ESRV +EAED    EE++G ARD V ASS PSPP NALILTRCRSAP+RSS Y NRY  S I SD+  EEE   +KTE 
Subjt:  WNKSAMLFRRKCAIRRKSSISESRVGNEAED----EENDGGARDPVVASSVPSPPINALILTRCRSAPNRSSLYCNRYRSSPITSDKTGEEE---DKTER

Query:  DNG----------NSERLFNKLENSHGDGDSKSVHNKERKMEEKSMLNRNLILTRCKSEPARIAEKLYG
        +NG          NSER+F KLENS+G+ D  SV+NKE K+EE SM NR+LILTRCKSEP RI E+LYG
Subjt:  DNG----------NSERLFNKLENSHGDGDSKSVHNKERKMEEKSMLNRNLILTRCKSEPARIAEKLYG

XP_038882779.1 uncharacterized protein LOC120073931 [Benincasa hispida]3.6e-10378.34Show/hide
Query:  MKQLSTSISSPSRTDLFPPPLMSFLRADAGNRSKSGRSRSSPIFVRKKN-IAIETQEPSSPKVTCMGQVRTNKRSSNRTPAARFRWIRSVLSFNRRHCRT
        MK+LS SISSPSRTDLFPPPLMSFLRADAGNRSKSGRSRSSPIFV KKN +AIETQEPSSPKVTCMGQVR    SSN+TPAAR RWIRSVLSFNRR+CRT
Subjt:  MKQLSTSISSPSRTDLFPPPLMSFLRADAGNRSKSGRSRSSPIFVRKKN-IAIETQEPSSPKVTCMGQVRTNKRSSNRTPAARFRWIRSVLSFNRRHCRT

Query:  FWNKSAMLFRRKCAIRRKSSISESRVGNEAE----DEENDGGARDPVVASSVPSPPINALILTRCRSAPNRSSLYCNRYRSSPITSDKTGEEEDKTERDN
        FWN SAM FRRK  IRRKSSI ESRVGNEAE    DEENDGGARD V +SSVPSPP NALILTRCRSAPNR+S Y NRYRS PITSD +GEEE+K E D 
Subjt:  FWNKSAMLFRRKCAIRRKSSISESRVGNEAE----DEENDGGARDPVVASSVPSPPINALILTRCRSAPNRSSLYCNRYRSSPITSDKTGEEEDKTERDN

Query:  GNS----------ERLFNKLENSHGDGDSKSVHNKERKMEEKSMLNRNLILTRCKSEPARIAEKLYGELNVREEERS
        GNS          E L+NK+EN+ GDGD + V  KER MEEKSMLNR LILTRCKSEPARIAEK+YGELN+REEER+
Subjt:  GNS----------ERLFNKLENSHGDGDSKSVHNKERKMEEKSMLNRNLILTRCKSEPARIAEKLYGELNVREEERS

TrEMBL top hitse value%identityAlignment
A0A0A0KK43 Uncharacterized protein1.3e-8771.28Show/hide
Query:  MKQLSTSISSPSRTDLFPPPLMSFLRADAGNRSKSGRSRSSPIFVRKKNIAIETQEPSSPKVTCMGQVRTNKRSSNRTPAARFRWIRSVLSFNRRHCRTF
        MKQLS SISSPSRTDLFPPPLMSFLRADAGNRSKS RSRSSPIFV KKN+AIETQEPSSPKVTCMGQVRTNK SSN+TPA R RWIRSVLSFNRRHCRTF
Subjt:  MKQLSTSISSPSRTDLFPPPLMSFLRADAGNRSKSGRSRSSPIFVRKKNIAIETQEPSSPKVTCMGQVRTNKRSSNRTPAARFRWIRSVLSFNRRHCRTF

Query:  WNKSAMLFRRKCAIRRKSSISESRVGNEAED-----EENDGGARDPVVAS-SVPSPPINALILTRCRSAPNRSSLYCNRYRSSPITSDKT--GEEEDKTE
        WN+SAML R K  IRR   ISESRVGNEAED     EE+DG   D V +S SVPSPP NALIL+RCRSAPNRSS    RYRSS ITSD T   EEE+KTE
Subjt:  WNKSAMLFRRKCAIRRKSSISESRVGNEAED-----EENDGGARDPVVAS-SVPSPPINALILTRCRSAPNRSSLYCNRYRSSPITSDKT--GEEEDKTE

Query:  ---RDN-------GNSERLFNKLENSHGDGDSKSVHNKERKMEEKSMLNRNLILTRCKSEPARIAEKLYGEL-NVREEERSVMAKNNSY
           R+N       G SERL  K+E+S GDGDSKSV+            N NLILTR KSEP RIAEKLYGEL N++EE+R VM K   Y
Subjt:  ---RDN-------GNSERLFNKLENSHGDGDSKSVHNKERKMEEKSMLNRNLILTRCKSEPARIAEKLYGEL-NVREEERSVMAKNNSY

A0A1S3BN59 uncharacterized protein LOC1034916513.2e-9774.91Show/hide
Query:  MKQLSTSISSPSRTDLFPPPLMSFLRADAGNRSKSGRSRSSPIFVRKKNIAIETQEPSSPKVTCMGQVRTNKRSSNRTPAARFRWIRSVLSFNRRHCRTF
        MKQLS SISSPSRTDLFPPPLMSFLRADAGNRSKS RSRSSPIF+RKKN+AIET+EPSSPKVTCMGQVRTNKRSSN+TPA R RWIRSVLSFNRRHCRTF
Subjt:  MKQLSTSISSPSRTDLFPPPLMSFLRADAGNRSKSGRSRSSPIFVRKKNIAIETQEPSSPKVTCMGQVRTNKRSSNRTPAARFRWIRSVLSFNRRHCRTF

Query:  WNKSAMLFRRKCAIRRKSSISESRVGNEAED-----EENDGGARDPVVA-SSVPSPPINALILTRCRSAPNRSSLYCNRYRSSPITSD-KTGEEEDKTER
        WN+SAMLFR K  IRR   ISESRVGNEAED     E++DG  RD V A SSVPSPP NALILTRCRS PNRSS   NRYRSS ITSD  T EEE+KTER
Subjt:  WNKSAMLFRRKCAIRRKSSISESRVGNEAED-----EENDGGARDPVVA-SSVPSPPINALILTRCRSAPNRSSLYCNRYRSSPITSD-KTGEEEDKTER

Query:  DNG----------NSERLFNKLENSHGDGDSKSVHNKERKMEEKSMLNRNLILTRCKSEPARIAEKLYGELNVREEERSVMAKNNSY
          G          NSERL  KLE+S GDGD KSV+            NRNLILTRCKSEPARIAEKLYGELN++EEER VM K NSY
Subjt:  DNG----------NSERLFNKLENSHGDGDSKSVHNKERKMEEKSMLNRNLILTRCKSEPARIAEKLYGELNVREEERSVMAKNNSY

A0A5A7TVU1 Uncharacterized protein1.5e-9474.47Show/hide
Query:  MKQLSTSISSPSRTDLFPPPLMSFLRADAGNRSKSGRSRSSPIFVRKKNIAIETQEPSSPKVTCMGQVRTNKRSSNRTPAARFRWIRSVLSFNRRHCRTF
        MKQLS SISSPSRTDLFPPPLMSFLRADAGNRSKS RSRSSPIF+RKKN+AIET+EPSSPKVTCMGQVRTNKRSSN+TPA R RWIRSVLSFNRRHCRTF
Subjt:  MKQLSTSISSPSRTDLFPPPLMSFLRADAGNRSKSGRSRSSPIFVRKKNIAIETQEPSSPKVTCMGQVRTNKRSSNRTPAARFRWIRSVLSFNRRHCRTF

Query:  WNKSAMLFRRKCAIRRKSSISESRVGNEAED-----EENDGGARDPVVA-SSVPSPPINALILTRCRSAPNRSSLYCNRYRSSPITSD-KTGEEEDKTER
        WN+SAMLFR K  IRR   ISESRVGNEAED     E++DG  RD V A SSVPSPP NALILTRCRS PNRSS   NRYRSS ITSD  T EEE+KTER
Subjt:  WNKSAMLFRRKCAIRRKSSISESRVGNEAED-----EENDGGARDPVVA-SSVPSPPINALILTRCRSAPNRSSLYCNRYRSSPITSD-KTGEEEDKTER

Query:  DNG----------NSERLFNKLENSHGDGDSKSVHNKERKMEEKSMLNRNLILTRCKSEPARIAEKLYGELNVREEERSVMA
          G          NSERL  KLE+S GDGD KSV+            NRNLILTRCKSEPARIAEKLYGELN++EEER  +A
Subjt:  DNG----------NSERLFNKLENSHGDGDSKSVHNKERKMEEKSMLNRNLILTRCKSEPARIAEKLYGELNVREEERSVMA

A0A6J1ECV0 uncharacterized protein LOC1114330212.9e-9072.12Show/hide
Query:  MKQLSTSISSPSRTDLFPPPLMSFLRADAGNRSKSGRSRSSPIFVRKKNIAIETQEPSSPKVTCMGQVRTNKRSSNRTPAARFRWIRSVLSFNRRHCRTF
        MKQ    ISSPSR DLFPPPLMSFLRADAGNRSKSGRSRSSPIF+RKKN+AIETQEPSSPKVTCMGQVRTNKRSS R PA R RWIRSVLSFNRR CRTF
Subjt:  MKQLSTSISSPSRTDLFPPPLMSFLRADAGNRSKSGRSRSSPIFVRKKNIAIETQEPSSPKVTCMGQVRTNKRSSNRTPAARFRWIRSVLSFNRRHCRTF

Query:  WNKSAMLFRRKCAIRRKSSISESRVGNEAED----EENDGGARDPVVASSVPSPPINALILTRCRSAPNRSSLYCNRYRSSPITSDKTGEEE---DKTER
        WN+S M F+    IRRKSSI+ESRV +EAED    EE++G ARD V ASS PSPP NALILTRCRSAP+RSS YCNRY  S I SD+T EEE   +KTE 
Subjt:  WNKSAMLFRRKCAIRRKSSISESRVGNEAED----EENDGGARDPVVASSVPSPPINALILTRCRSAPNRSSLYCNRYRSSPITSDKTGEEE---DKTER

Query:  DN----------GNSERLFNKLENSHGDGDSKSVHNKERKMEEKSMLNRNLILTRCKSEPARIAEKLYG
        +N           NSER+F KLENS+G+ D  SV+NKE K+EE SM NR+LILTRCKSEP RI E+LYG
Subjt:  DN----------GNSERLFNKLENSHGDGDSKSVHNKERKMEEKSMLNRNLILTRCKSEPARIAEKLYG

A0A6J1IHY6 uncharacterized protein LOC1114776269.6e-8671.32Show/hide
Query:  MKQLSTSISSPSRTDLFPPPLMSFLRADAGNRSKSGRSRSSPIFVRKKNIAIETQEPSSPKVTCMGQVRTNKRSSNRTPAARFRWIRSVLSFNRRHCRTF
        MKQ    ISSPSR DLFPPPLMSFLRADAGNRSKSGRSRSSPIF+RKKN+ IETQEPSSPKVTCMGQVRTNKRSS R PA R RWIRSVLSFNRRHCRTF
Subjt:  MKQLSTSISSPSRTDLFPPPLMSFLRADAGNRSKSGRSRSSPIFVRKKNIAIETQEPSSPKVTCMGQVRTNKRSSNRTPAARFRWIRSVLSFNRRHCRTF

Query:  WNKSAMLFRRKCAIRRKSSISESRVGNEAED----EENDGGARDPVVASSVPSPPINALILTRCRSAPNRSSLYCNRYRSSPITSDKTGEEEDK--TERD
        WN+S M F+ K  IRRKSSI+ESRV +EAED    EE++G ARD V ASS PSPP NALILTRCRSAP+RSS Y N  RS     +  G   ++  ++ +
Subjt:  WNKSAMLFRRKCAIRRKSSISESRVGNEAED----EENDGGARDPVVASSVPSPPINALILTRCRSAPNRSSLYCNRYRSSPITSDKTGEEEDK--TERD

Query:  NGNSERLFNKLENSHGDGDSKSVHNKERKMEEKSMLNRNLILTRCKSEPARIAEKLYG
        + NSER+F KLENS+G+ D  SV+NKE K+EE SM NR+LILTRCKSEP RI EKLYG
Subjt:  NGNSERLFNKLENSHGDGDSKSVHNKERKMEEKSMLNRNLILTRCKSEPARIAEKLYG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G37100.1 protamine P1 family protein1.3e-1834.67Show/hide
Query:  STSISSPSRTDLFPPPLMSFLRADAGNRSKSGRSRSSPIFVRKKN--IAIETQEPSSPKVTCMGQVRTNK--------------RSSNRTPAARFRWIRS
        S  +SSP RT+  PP LM FLR  + +RS+S RSR  PIF R+KN   A ETQEP+SPKVTCMGQVR N+               +  R  + R  W++ 
Subjt:  STSISSPSRTDLFPPPLMSFLRADAGNRSKSGRSRSSPIFVRKKN--IAIETQEPSSPKVTCMGQVRTNK--------------RSSNRTPAARFRWIRS

Query:  VLSFNRRHCRTF------------WNK----SAMLFRRKCAIRRKSSISESRVGNEA----EDEENDGGARDPVVASSVPS----PPINALILTRCRSAP
            N   C +F            W K    S   F +K   R  SS SE   G       E EE          ASS  S    PP NA +LTRCRSAP
Subjt:  VLSFNRRHCRTF------------WNK----SAMLFRRKCAIRRKSSISESRVGNEA----EDEENDGGARDPVVASSVPS----PPINALILTRCRSAP

Query:  NRSSLYCN-------RYRSSPITSDKTGEEEDKTERDNGNSERLFNKLENSHGDGDSKSVHNKERKMEEKSMLNRNLILTRCKSEPARIAEKLYGELNVR
         RS    N           +P     + E    +E    +      +LE+S      +S  ++E K        + LILTRC SEPAR+  ++    N R
Subjt:  NRSSLYCN-------RYRSSPITSDKTGEEEDKTERDNGNSERLFNKLENSHGDGDSKSVHNKERKMEEKSMLNRNLILTRCKSEPARIAEKLYGELNVR

AT5G03110.1 FUNCTIONS IN: molecular_function unknown1.8e-2335.92Show/hide
Query:  ISSPSRTDLFPPPLMSFLR--ADAGNRSKS-----GRSRSSPIFVRK-KNIAIETQEPSSPKVTCMGQVRTNKRSSNRTPAA-------RFRWIRSVLSF
        +SSP R + +PPP M FLR  ++ G+ S+S     GRSR+SP+FVR+ K+ A   QEPSSPKVTCMGQVR N+      P +       R  W+R+   +
Subjt:  ISSPSRTDLFPPPLMSFLR--ADAGNRSKS-----GRSRSSPIFVRK-KNIAIETQEPSSPKVTCMGQVRTNKRSSNRTPAA-------RFRWIRSVLSF

Query:  N----RRHCRTFWNKSAMLFRRKCAIR-------------RKSSISESRVGNEAEDEENDGGARDPVVASSVPSPPINALILTRCRSAPNRSSLYCNRYR
        N    +    TFW K   L    CA R               ++ S   +  E E EEN    +  +  S   +PPINAL+LTR RSAP RSS    R+ 
Subjt:  N----RRHCRTFWNKSAMLFRRKCAIR-------------RKSSISESRVGNEAEDEENDGGARDPVVASSVPSPPINALILTRCRSAPNRSSLYCNRYR

Query:  SSPITSDKTGEEEDKTERDNGNSERLFNKLENSHGDGDSKSVHNKERKMEEKSMLNRNLILTRCKSEPARIAEKLYGELNVREE
             + +  E +     +  +SE    K+   H   + + V   E +   K    R  +LTR KSEPARI EK+   L   EE
Subjt:  SSPITSDKTGEEEDKTERDNGNSERLFNKLENSHGDGDSKSVHNKERKMEEKSMLNRNLILTRCKSEPARIAEKLYGELNVREE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGCAATTATCGACATCGATTTCCAGTCCCAGTCGGACCGACCTGTTTCCACCGCCATTGATGAGCTTTCTCAGAGCCGATGCTGGAAATCGGAGTAAAAGCGGCCG
GTCTCGCTCCAGTCCGATCTTCGTCAGGAAGAAGAACATCGCCATTGAAACTCAAGAGCCGTCCTCTCCGAAGGTCACTTGTATGGGACAAGTCCGCACCAATAAACGCT
CCTCTAATAGAACTCCCGCCGCTCGATTCCGGTGGATTAGAAGCGTCCTCTCTTTCAATCGACGCCATTGTCGAACCTTCTGGAACAAGTCGGCGATGCTCTTCCGGAGA
AAGTGTGCAATTAGACGAAAATCATCGATCTCTGAATCTCGCGTCGGAAACGAAGCCGAAGATGAAGAGAACGACGGAGGAGCTAGGGATCCGGTTGTTGCGTCTTCGGT
GCCATCGCCGCCGATAAATGCTCTCATTCTTACGAGATGTAGATCTGCACCAAATCGGTCGTCCTTATACTGCAATCGGTATCGAAGTTCGCCGATTACGAGCGACAAAA
CAGGAGAAGAAGAAGACAAAACAGAGCGCGATAATGGAAACTCGGAGCGATTGTTCAATAAACTCGAAAATTCACACGGAGATGGAGATTCTAAGTCTGTACATAACAAA
GAGAGGAAAATGGAGGAAAAATCGATGTTGAATCGGAACCTGATCCTGACGAGATGTAAATCGGAACCTGCGAGAATTGCAGAGAAACTGTACGGAGAATTGAATGTTCG
GGAAGAAGAAAGGTCGGTTATGGCTAAGAACAATTCTTACTAA
mRNA sequenceShow/hide mRNA sequence
TGTGATTGTTAAAAATTACAAATAATTTCCAAAAATGAGAATAGAGAAAATTTGAAAGGCAAAAGAACCAAATGGCAATTTTCCAAAATAGAAAGAAAACAAAGTCATAG
AGGATAAACAGATAGCTCTGTGACATGAAACCAAAGGGACCAATCACCAGCACAGATTCTTAGTGGAGGAAGAAACGAGACGAAACGAAACGATTCAATCCATGCTTTGA
TTTTCATTTCATCACAACAATGAAGCAATTATCGACATCGATTTCCAGTCCCAGTCGGACCGACCTGTTTCCACCGCCATTGATGAGCTTTCTCAGAGCCGATGCTGGAA
ATCGGAGTAAAAGCGGCCGGTCTCGCTCCAGTCCGATCTTCGTCAGGAAGAAGAACATCGCCATTGAAACTCAAGAGCCGTCCTCTCCGAAGGTCACTTGTATGGGACAA
GTCCGCACCAATAAACGCTCCTCTAATAGAACTCCCGCCGCTCGATTCCGGTGGATTAGAAGCGTCCTCTCTTTCAATCGACGCCATTGTCGAACCTTCTGGAACAAGTC
GGCGATGCTCTTCCGGAGAAAGTGTGCAATTAGACGAAAATCATCGATCTCTGAATCTCGCGTCGGAAACGAAGCCGAAGATGAAGAGAACGACGGAGGAGCTAGGGATC
CGGTTGTTGCGTCTTCGGTGCCATCGCCGCCGATAAATGCTCTCATTCTTACGAGATGTAGATCTGCACCAAATCGGTCGTCCTTATACTGCAATCGGTATCGAAGTTCG
CCGATTACGAGCGACAAAACAGGAGAAGAAGAAGACAAAACAGAGCGCGATAATGGAAACTCGGAGCGATTGTTCAATAAACTCGAAAATTCACACGGAGATGGAGATTC
TAAGTCTGTACATAACAAAGAGAGGAAAATGGAGGAAAAATCGATGTTGAATCGGAACCTGATCCTGACGAGATGTAAATCGGAACCTGCGAGAATTGCAGAGAAACTGT
ACGGAGAATTGAATGTTCGGGAAGAAGAAAGGTCGGTTATGGCTAAGAACAATTCTTACTAATTGGACAACTTGCGATTAGATGATTGAAATAGCTAGAATCTCTTCCAG
TTCTTAGGTTTTCTCCATGACTGAAAACGCAAGATTTTATTTTTTTTTATTTTTTTTTTATTTTTTTATTTTTGCTTTGAATTTTGTTTCTGTTATGTTGAGTTTGGCGG
GTGAAGAAATGGAGAATTTCGGTGAGGCCAAATGGAAGAACAAGAAATGGTAGTTTAGTAGTAAGATAGATTTAAGGAAGTGTTTAGTGCACTATATTGTAAATTATTAG
GCATTATAAATTAAGCAATAATGGATATATACTCTCCTTCAAGCTTCATTAAA
Protein sequenceShow/hide protein sequence
MKQLSTSISSPSRTDLFPPPLMSFLRADAGNRSKSGRSRSSPIFVRKKNIAIETQEPSSPKVTCMGQVRTNKRSSNRTPAARFRWIRSVLSFNRRHCRTFWNKSAMLFRR
KCAIRRKSSISESRVGNEAEDEENDGGARDPVVASSVPSPPINALILTRCRSAPNRSSLYCNRYRSSPITSDKTGEEEDKTERDNGNSERLFNKLENSHGDGDSKSVHNK
ERKMEEKSMLNRNLILTRCKSEPARIAEKLYGELNVREEERSVMAKNNSY