CuGenDBv2

Cmc03g0074061 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc03g0074061
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Protein ALP1-like
Genome location: CMiso1.1chr03:21616419..21617316
RNA-Seq Expression: Cmc03g0074061
Synteny: Cmc03g0074061
Gene Ontology terms:
GO:0071555 - cell wall organization (biological process)
GO:0005634 - nucleus (cellular component)
GO:0016787 - hydrolase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domains: IPR027806 - Harbinger transposase-derived nuclease domain
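
The genome location in the record above follows a `sequence:start..end` pattern. As a minimal sketch, the string can be split into its parts as shown here. `parse_location` is a hypothetical helper name, and the span calculation assumes 1-based inclusive coordinates, which the page does not state.

```python
import re

def parse_location(loc):
    """Split 'seqid:start..end' into (seqid, start, end) with integer coordinates."""
    m = re.fullmatch(r"(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)", loc.strip())
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return m.group("seqid"), int(m.group("start")), int(m.group("end"))

seqid, start, end = parse_location("CMiso1.1chr03:21616419..21617316")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(seqid, start, end, span)  # CMiso1.1chr03 21616419 21617316 898
```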


Homology
GenBank top hits (e value, % identity, alignment)
KAF7123090.1 hypothetical protein RHSIM_Rhsim12G0067000 [Rhododendron simsii]; e value 3.8e-83; identity 60.68%
Query:  QPCRTSALRGHDYVIELLNGNDSRCFDCFRMKRITSINFCEDLKSKTNLKPSRYLTVQEKVVIFLLIISHNESNRIAAERFQHSGHTISLAFNLVLRKVC
        +PCRTS LRGHDYV+E+LNG++ RC   FRMK    I FCE LK   NLK SRYLT+QE+V IFLL I HNE NR+  ERFQHSGHTIS+ F+ VL+ VC
Subjt:  QPCRTSALRGHDYVIELLNGNDSRCFDCFRMKRITSINFCEDLKSKTNLKPSRYLTVQEKVVIFLLIISHNESNRIAAERFQHSGHTISLAFNLVLRKVC

Query:  KIGREIIRPPNMDNVPMEIVSNSKYYPFFKDCIGAIGGTHVAASIPQNEQIMFRGRKTNTTWNIMYVCSFYMLFTYVMSGWEGSANDSRILLECIKNIGN
        K+G  II+PP+ D++P +I  N+KY+PFFKDC+GAI GTH++AS+P ++QI +RG+ T TT N+M  CSF M FTYV+SGWEG+ANDSR+ LEC+ N   
Subjt:  KIGREIIRPPNMDNVPMEIVSNSKYYPFFKDCIGAIGGTHVAASIPQNEQIMFRGRKTNTTWNIMYVCSFYMLFTYVMSGWEGSANDSRILLECIKNIGN

Query:  KFPMPKRDQYYLIDLGYSNMSGFLAPFRGQRCHL
         FP P   +YY++D GY+NM GFL+P+RG+R HL
Subjt:  KFPMPKRDQYYLIDLGYSNMSGFLAPFRGQRCHL

XP_028067161.1 uncharacterized protein LOC114269968 [Camellia sinensis]; e value 3.9e-80; identity 58.97%
Query:  QPCRTSALRGHDYVIELLNGNDSRCFDCFRMKRITSINFCEDLKSKTNLKPSRYLTVQEKVVIFLLIISHNESNRIAAERFQHSGHTISLAFNLVLRKVC
        +PCRTS L+GHDYV+E+LNG++ R  + FRM+    IN CE LK    L+ SRYLTVQE+V IFLL I HNE NR+  ERFQHSG TIS  FN VL+ VC
Subjt:  QPCRTSALRGHDYVIELLNGNDSRCFDCFRMKRITSINFCEDLKSKTNLKPSRYLTVQEKVVIFLLIISHNESNRIAAERFQHSGHTISLAFNLVLRKVC

Query:  KIGREIIRPPNMDNVPMEIVSNSKYYPFFKDCIGAIGGTHVAASIPQNEQIMFRGRKTNTTWNIMYVCSFYMLFTYVMSGWEGSANDSRILLECIKNIGN
        ++G+++IRPP+ D VP EI  N ++YPFFKDC+GAI GTH++A +P +EQI +RG+ T TT N+M VCSF M FTYV++GWEG+ANDSR+ +E + +  N
Subjt:  KIGREIIRPPNMDNVPMEIVSNSKYYPFFKDCIGAIGGTHVAASIPQNEQIMFRGRKTNTTWNIMYVCSFYMLFTYVMSGWEGSANDSRILLECIKNIGN

Query:  KFPMPKRDQYYLIDLGYSNMSGFLAPFRGQRCHL
         FPMP  D+YY++D GY+NM GFL+P+RG+R HL
Subjt:  KFPMPKRDQYYLIDLGYSNMSGFLAPFRGQRCHL

XP_028094390.1 uncharacterized protein LOC114294454 [Camellia sinensis]; e value 2.3e-80; identity 58.97%
Query:  QPCRTSALRGHDYVIELLNGNDSRCFDCFRMKRITSINFCEDLKSKTNLKPSRYLTVQEKVVIFLLIISHNESNRIAAERFQHSGHTISLAFNLVLRKVC
        +PCRTS L+GHDYV+E+LNG++ R  + FRM+    IN CE LK    L+ SRYLTVQE+V IFLL I HNE NR+  ERFQHSG TIS  FN VL+ VC
Subjt:  QPCRTSALRGHDYVIELLNGNDSRCFDCFRMKRITSINFCEDLKSKTNLKPSRYLTVQEKVVIFLLIISHNESNRIAAERFQHSGHTISLAFNLVLRKVC

Query:  KIGREIIRPPNMDNVPMEIVSNSKYYPFFKDCIGAIGGTHVAASIPQNEQIMFRGRKTNTTWNIMYVCSFYMLFTYVMSGWEGSANDSRILLECIKNIGN
        ++G+++IRPP+ D VP EI  N ++YPFFKDC+GAI GTH++A +P +EQI +RG+ T TT N+M VCSF M FTYV++GWEG+ANDSR+ +E + +  N
Subjt:  KIGREIIRPPNMDNVPMEIVSNSKYYPFFKDCIGAIGGTHVAASIPQNEQIMFRGRKTNTTWNIMYVCSFYMLFTYVMSGWEGSANDSRILLECIKNIGN

Query:  KFPMPKRDQYYLIDLGYSNMSGFLAPFRGQRCHL
         FPMP  D+YY++D GY+NM GFL+P+RG+R HL
Subjt:  KFPMPKRDQYYLIDLGYSNMSGFLAPFRGQRCHL

XP_028100667.1 uncharacterized protein LOC114300013 [Camellia sinensis]; e value 3.9e-80; identity 58.97%
Query:  QPCRTSALRGHDYVIELLNGNDSRCFDCFRMKRITSINFCEDLKSKTNLKPSRYLTVQEKVVIFLLIISHNESNRIAAERFQHSGHTISLAFNLVLRKVC
        +PCRTS L+GHDYV+E+LNG++ R  + FRM+    IN CE LK    L+ SRYLTVQE+V IFLL I HNE NR+  ERFQHSG TIS  FN VL+ VC
Subjt:  QPCRTSALRGHDYVIELLNGNDSRCFDCFRMKRITSINFCEDLKSKTNLKPSRYLTVQEKVVIFLLIISHNESNRIAAERFQHSGHTISLAFNLVLRKVC

Query:  KIGREIIRPPNMDNVPMEIVSNSKYYPFFKDCIGAIGGTHVAASIPQNEQIMFRGRKTNTTWNIMYVCSFYMLFTYVMSGWEGSANDSRILLECIKNIGN
        ++G+++IRPP+ D VP EI  N ++YPFFKDC+GAI GTH++A +P +EQI +RG+ T TT N+M VCSF M FTYV++GWEG+ANDSR+ +E + +  N
Subjt:  KIGREIIRPPNMDNVPMEIVSNSKYYPFFKDCIGAIGGTHVAASIPQNEQIMFRGRKTNTTWNIMYVCSFYMLFTYVMSGWEGSANDSRILLECIKNIGN

Query:  KFPMPKRDQYYLIDLGYSNMSGFLAPFRGQRCHL
         FPMP  D+YY++D GY+NM GFL+P+RG+R HL
Subjt:  KFPMPKRDQYYLIDLGYSNMSGFLAPFRGQRCHL

XP_030481634.1 protein ALP1-like [Cannabis sativa]; e value 1.6e-78; identity 56.43%
Query:  HRIQASKQPCRTSALRGHDYVIELLNGNDSRCFDCFRMKRITSINFCEDLKSKTNLKPSRYLTVQEKVVIFLLIISHNESNRIAAERFQHSGHTISLAFN
        +R+  +K+PCR SAL GH+YV+E+L+G++SRC+D FRM +   I FC  LK K  L+ SRYL+V+E+V +FL ++ HNE +R+ AERFQHS  TIS  F 
Subjt:  HRIQASKQPCRTSALRGHDYVIELLNGNDSRCFDCFRMKRITSINFCEDLKSKTNLKPSRYLTVQEKVVIFLLIISHNESNRIAAERFQHSGHTISLAFN

Query:  LVLRKVCKIGREIIRPPNMDNVPMEIVSNSKYYPFFKDCIGAIGGTHVAASIPQNEQIMFRGRKTNTTWNIMYVCSFYMLFTYVMSGWEGSANDSRILLE
         VLR VC++ +E+I PP+ D VP EI  N KYYPF K+C+GAI GTH++A +P +EQI +RGRK +TT N+M +CSF M FTYV++GWEGSAND+RIL E
Subjt:  LVLRKVCKIGREIIRPPNMDNVPMEIVSNSKYYPFFKDCIGAIGGTHVAASIPQNEQIMFRGRKTNTTWNIMYVCSFYMLFTYVMSGWEGSANDSRILLE

Query:  CIKNIGNKFPMPKRDQYYLIDLGYSNMSGFLAPFRGQRCHL
        C  N   +FP P   +YYL+D GY+NM GFL+P+RG+R HL
Subjt:  CIKNIGNKFPMPKRDQYYLIDLGYSNMSGFLAPFRGQRCHL

TrEMBL top hits (e value, % identity, alignment)
A0A0B2SJL0 Putative nuclease HARBI1 (Fragment); e value 5.0e-65; identity 50.64%
Query:  KQPCRTSALRGHDYVIELLNGNDSRCFDCFRMKRITSINFCEDLKSKTNLKPSRYLTVQEKVVIFLLIISHNESNRIAAERFQHSGHTISLAFNLVLRKV
        K PCRTS L G  Y I+ L G+++RC++ F MK+   +NFCE LK   NL   + ++++E + +FL+II HN  +R+ AERFQHS HT+S  F ++L+ V
Subjt:  KQPCRTSALRGHDYVIELLNGNDSRCFDCFRMKRITSINFCEDLKSKTNLKPSRYLTVQEKVVIFLLIISHNESNRIAAERFQHSGHTISLAFNLVLRKV

Query:  CKIGREIIRPPNMDNVPMEIVSNSKYYPFFKDCIGAIGGTHVAASIPQNEQIMFRGRKTNTTWNIMYVCSFYMLFTYVMSGWEGSANDSRILLECIKNIG
        CK+G  II   N  +    I  N KYYP+FKDCIGAI G HV+A    ++Q  FRGRK   T N++ VC F MLFT+V SGWEG+ANDSR+ L+ + ++ 
Subjt:  CKIGREIIRPPNMDNVPMEIVSNSKYYPFFKDCIGAIGGTHVAASIPQNEQIMFRGRKTNTTWNIMYVCSFYMLFTYVMSGWEGSANDSRILLECIKNIG

Query:  NKFPMPKRDQYYLIDLGYSNMSGFLAPFRGQRCHL
        N FP P  DQ+YLID G+SNM G+LAPFR  + HL
Subjt:  NKFPMPKRDQYYLIDLGYSNMSGFLAPFRGQRCHL

A0A1S3E695 protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1-like; e value 8.5e-65; identity 51.53%
Query:  TSALRGHDYVIELLNGNDSRCFDCFRMKRITSINFCEDLKSKTNLKPSRYLTVQEKVVIFLLIISHNESNRIAAERFQHSGHTISLAFNLVLRKVCKIGR
        TS+L G ++V E+LNG+++ CFD FRMK+   +NFC +L+ K  L  SR + V+EKV  FL II HN  +R+A+ RFQHS  TIS  F  VLR VC++G+
Subjt:  TSALRGHDYVIELLNGNDSRCFDCFRMKRITSINFCEDLKSKTNLKPSRYLTVQEKVVIFLLIISHNESNRIAAERFQHSGHTISLAFNLVLRKVCKIGR

Query:  EIIRPPNMDNVPMEIVSNSKYYPFFKDCIGAIGGTHVAASIPQNEQIMFRGRKTNTTWNIMYVCSFYMLFTYVMSGWEGSANDSRILLECIKNIGNKFPM
        E+I+  +++ +P  I +NSKYYP+FK+CIGAI GTH++A +P  +QI  RGRKT  T N+M  C F M+FTYV SGWEGSA+DS++ L+ I N    FP 
Subjt:  EIIRPPNMDNVPMEIVSNSKYYPFFKDCIGAIGGTHVAASIPQNEQIMFRGRKTNTTWNIMYVCSFYMLFTYVMSGWEGSANDSRILLECIKNIGNKFPM

Query:  PKRDQYYLIDLGYSNMSGFLAPFRGQRCH
        P R  +YL+D GY    G L P+RG+R H
Subjt:  PKRDQYYLIDLGYSNMSGFLAPFRGQRCH

A0A3Q7Y331 protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1-like; e value 8.5e-65; identity 51.53%
Query:  TSALRGHDYVIELLNGNDSRCFDCFRMKRITSINFCEDLKSKTNLKPSRYLTVQEKVVIFLLIISHNESNRIAAERFQHSGHTISLAFNLVLRKVCKIGR
        TS+L G ++V E+LNG+++ CFD FRMK+   +NFC +L+ K  L  SR + V+EKV  FL II HN  +R+A+ RFQHS  TIS  F  VLR VC++G+
Subjt:  TSALRGHDYVIELLNGNDSRCFDCFRMKRITSINFCEDLKSKTNLKPSRYLTVQEKVVIFLLIISHNESNRIAAERFQHSGHTISLAFNLVLRKVCKIGR

Query:  EIIRPPNMDNVPMEIVSNSKYYPFFKDCIGAIGGTHVAASIPQNEQIMFRGRKTNTTWNIMYVCSFYMLFTYVMSGWEGSANDSRILLECIKNIGNKFPM
        E+I+  +++ +P  I +NSKYYP+FK+CIGAI GTH++A +P  +QI  RGRKT  T N+M  C F M+FTYV SGWEGSA+DS++ L+ I N    FP 
Subjt:  EIIRPPNMDNVPMEIVSNSKYYPFFKDCIGAIGGTHVAASIPQNEQIMFRGRKTNTTWNIMYVCSFYMLFTYVMSGWEGSANDSRILLECIKNIGNKFPM

Query:  PKRDQYYLIDLGYSNMSGFLAPFRGQRCH
        P R  +YL+D GY    G L P+RG+R H
Subjt:  PKRDQYYLIDLGYSNMSGFLAPFRGQRCH

A0A4Y7J673 Uncharacterized protein; e value 1.5e-66; identity 50%
Query:  KQPCRTSALRGHDYVIELLNGNDSRCFDCFRMKRITSINFCEDLKSKTNLKPSRYLTVQEKVVIFLLIISHNESNRIAAERFQHSGHTISLAFNLVLRKV
        K P  TS L G +++ ELLNG+  R ++  RM   T +  C  L++   L+  R ++V+E V IFL  +S +  NR+ AE FQHS  T+   F  VL+ +
Subjt:  KQPCRTSALRGHDYVIELLNGNDSRCFDCFRMKRITSINFCEDLKSKTNLKPSRYLTVQEKVVIFLLIISHNESNRIAAERFQHSGHTISLAFNLVLRKV

Query:  CKIGREIIRPPNMDNVPMEIVSNSKYYPFFKDCIGAIGGTHVAASIPQNEQIMFRGRKTNTTWNIMYVCSFYMLFTYVMSGWEGSANDSRILLECIKNIG
        C++G  II+PPNMD VP EI++N K+YP+F DC+GAI GTH++A +P ++QI FRGRK   T NIM  CSF MLFT+V +GWEG+AND+R+L++ I N  
Subjt:  CKIGREIIRPPNMDNVPMEIVSNSKYYPFFKDCIGAIGGTHVAASIPQNEQIMFRGRKTNTTWNIMYVCSFYMLFTYVMSGWEGSANDSRILLECIKNIG

Query:  NKFPMPKRDQYYLIDLGYSNMSGFLAPFRGQRCHLR
        NKFPMP+  +YY++D  Y+NM GFL P+RG+R HLR
Subjt:  NKFPMPKRDQYYLIDLGYSNMSGFLAPFRGQRCHLR

A0A5D3C7F6 Protein ALP1-like; e value 3.7e-76; identity 87.97%
Query:  AERFQHSGHTISLAFNLVLRKVCKIGREIIRPPNMDNVPMEIVSNSKYYPFFKDCIGAIGGTHVAASIPQNEQIMFRGRKTNTTWNIMYVCSFYMLFTYV
        AERFQHSGHTISLAFN VLRKVCK+G EIIRPPNMD V  +IVSNSKYYPFFKDCIGAI GTHVAASIPQNEQI FRGRKTNTTWNIM VCSF MLFTYV
Subjt:  AERFQHSGHTISLAFNLVLRKVCKIGREIIRPPNMDNVPMEIVSNSKYYPFFKDCIGAIGGTHVAASIPQNEQIMFRGRKTNTTWNIMYVCSFYMLFTYV

Query:  MSGWEGSANDSRILLECIKNIGNKFPMPKRDQYYLIDLGYSNMSGFLAPFRGQRCHLR
        MSGWEGSANDSRIL ECIKN  NKFPMPKRDQYYL++ GYSNM GFLAPFRGQR HLR
Subjt:  MSGWEGSANDSRILLECIKNIGNKFPMPKRDQYYLIDLGYSNMSGFLAPFRGQRCHLR

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G43722.1 unknown protein; e value 3.8e-33; identity 32.26%
Query:  LNGNDSRCFDCFRMKRITSINFCEDLKSKTNLKPSRYLTVQEKVVIFLLIISHNESNRIAAERFQHSGHTISLAFNLVLRKVCKIGREIIRPP---NMDN
        L  + + C    RM        C  L++  +L+P+  ++++E V +FL I  HNE  R    RF  +  T+   F  VL     +  + IR P    +  
Subjt:  LNGNDSRCFDCFRMKRITSINFCEDLKSKTNLKPSRYLTVQEKVVIFLLIISHNESNRIAAERFQHSGHTISLAFNLVLRKVCKIGREIIRPP---NMDN

Query:  VPMEIVSNSKYYPFFKDCIGAIGGTHVAASIPQNEQIMFRGRKTNTTWNIMYVCSFYMLFTYVMSGWEGSANDSRILLECIKNIGNKFPMPKRDQYYLID
        +P  +  + +Y+P+F   +GA+ GTHV   +  + Q M+  R  N + NIM +C   MLFTY+ +G  GS  D+ + L+  +   ++FP+P  ++YYL+D
Subjt:  VPMEIVSNSKYYPFFKDCIGAIGGTHVAASIPQNEQIMFRGRKTNTTWNIMYVCSFYMLFTYVMSGWEGSANDSRILLECIKNIGNKFPMPKRDQYYLID

Query:  LGYSNMSGFLAPFRGQR
         GY N  G LAP+R  R
Subjt:  LGYSNMSGFLAPFRGQR

AT5G28730.1 unknown protein; e value 2.5e-24; identity 30.33%
Query:  NDSRCFDCFRMKRITSINFCEDLKSKTNLKPSRYLTVQEKVVIFLLIISHNESNRIAAERFQHSGHTISLAFNLVLRKVCKIGREIIRPPNMDN---VPM
        N+  C    RM        CE L  K  L+ S  +++ E V IFL+I + N++ R  A RF H+  TI   F+ VL+ + ++  E IRP  ++    +  
Subjt:  NDSRCFDCFRMKRITSINFCEDLKSKTNLKPSRYLTVQEKVVIFLLIISHNESNRIAAERFQHSGHTISLAFNLVLRKVCKIGREIIRPPNMDN---VPM

Query:  EIVSNSKYYPFFKDCIGAIGGTHVAASIPQNEQIMFRGRKTNTTWNIMYVCSFYMLFTYVMSGWEGSANDSRILLECIKNIGNKFPMPKRDQYYLIDLGY
         +  +++Y+PF  D +G                          ++N++ +C   MLFTY   G  GS +D+R+L   I +    F +P   +YYL+D GY
Subjt:  EIVSNSKYYPFFKDCIGAIGGTHVAASIPQNEQIMFRGRKTNTTWNIMYVCSFYMLFTYVMSGWEGSANDSRILLECIKNIGNKFPMPKRDQYYLIDLGY

Query:  SNMSGFLAPFR
        +N  G+LAP+R
Subjt:  SNMSGFLAPFR

AT5G28950.1 unknown protein; e value 2.6e-21; identity 42.11%
Query:  VPMEIVSNSKYYPFFKDCIGAIGGTHVAASIPQNEQIMFRGRKTNTTWNIMYVCSFYMLFTYVMSGWEGSANDSRILLECIKNIGNKFPMPKRDQ
        VP +I  +++ YP+FKDC+GAI  TH+ A + Q +   FR RK + + N++  C+F + F YV+SGWEGSA+DS++L + +    N+ P+P+ D+
Subjt:  VPMEIVSNSKYYPFFKDCIGAIGGTHVAASIPQNEQIMFRGRKTNTTWNIMYVCSFYMLFTYVMSGWEGSANDSRILLECIKNIGNKFPMPKRDQ

AT5G35695.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912); e value 4.1e-11; identity 46.03%
Query:  LFTYVMSGWEGSANDSRILLECIKNIGNKFPMPKRDQYYLIDLGYSNMSGFLAPFRGQRCHLR
        +F YV+SGWEGSA+DSR+L + ++            ++YL+D G++N   FLAPFRG R HL+
Subjt:  LFTYVMSGWEGSANDSRILLECIKNIGNKFPMPKRDQYYLIDLGYSNMSGFLAPFRGQRCHLR

AT5G41980.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912); e value 5.7e-37; identity 33.91%
Query:  KQPCRTSALRGHDYVIELLNGNDSRCFDCFRMKRITSINFCEDLKSKTNLKPSRYLTVQEKVVIFLLIISHNESNRIAAERFQHSGHTISLAFNLVLRKV
        K+  + S   G+ +V ++LNG + +CF+ FRM +      C+ L+++  L+ +  + ++ ++ IFL II HN   R   E F +SG TIS  FN VL  V
Subjt:  KQPCRTSALRGHDYVIELLNGNDSRCFDCFRMKRITSINFCEDLKSKTNLKPSRYLTVQEKVVIFLLIISHNESNRIAAERFQHSGHTISLAFNLVLRKV

Query:  CKIGREIIRPPNMDNVPMEIVSNSKYYPFFKDCIGAIGGTHVAASIPQNEQIMFRGRKTNTTWNIMYVCSFYMLFTYVMSGWEGSANDSRILLECIKNIG
          I ++  +P    N   + + N    P+FKDC+G +   H+   +  +EQ  FR      T N++   SF + F YV++GWEGSA+D ++L   +    
Subjt:  CKIGREIIRPPNMDNVPMEIVSNSKYYPFFKDCIGAIGGTHVAASIPQNEQIMFRGRKTNTTWNIMYVCSFYMLFTYVMSGWEGSANDSRILLECIKNIG

Query:  NKFPMPKRDQYYLIDLGYSNMSGFLAPFRG
        NK  +P + +YY++D  Y N+ GF+AP+ G
Subjt:  NKFPMPKRDQYYLIDLGYSNMSGFLAPFRG


Sequences
CDS sequence
ATGAGCTTGCACAGGATACAAGCTTCAAAACAACCATGTAGAACTTCTGCACTGAGAGGACATGATTATGTGATCGAGTTGTTAAATGGAAACGATAGTAGATGTTTTGA
TTGTTTTAGGATGAAAAGAATTACATCCATAAATTTTTGTGAAGATTTAAAATCAAAGACGAATCTGAAACCATCTAGATATCTTACCGTTCAAGAAAAAGTTGTTATAT
TCTTATTAATCATATCACATAATGAAAGCAATCGTATAGCAGCAGAAAGATTTCAACATTCGGGTCATACTATTTCTCTAGCTTTTAACCTTGTTTTGAGGAAGGTTTGC
AAGATTGGTAGAGAAATTATTCGCCCACCCAATATGGACAATGTACCAATGGAGATCGTATCAAATTCAAAATATTACCCTTTCTTTAAGGATTGTATTGGTGCTATTGG
TGGTACTCATGTTGCTGCAAGTATTCCCCAAAATGAACAAATAATGTTTCGTGGAAGAAAAACTAACACGACATGGAATATAATGTACGTTTGTTCATTTTATATGTTAT
TCACGTATGTCATGTCTGGTTGGGAAGGATCAGCCAATGATTCTCGCATACTTCTAGAATGTATCAAGAATATCGGGAATAAATTTCCTATGCCTAAGAGAGATCAATAC
TATCTTATCGATTTAGGATATTCAAATATGTCCGGATTTTTAGCACCATTTCGAGGTCAAAGATGTCATTTACGATTTTAG
mRNA sequence
ATGAGCTTGCACAGGATACAAGCTTCAAAACAACCATGTAGAACTTCTGCACTGAGAGGACATGATTATGTGATCGAGTTGTTAAATGGAAACGATAGTAGATGTTTTGA
TTGTTTTAGGATGAAAAGAATTACATCCATAAATTTTTGTGAAGATTTAAAATCAAAGACGAATCTGAAACCATCTAGATATCTTACCGTTCAAGAAAAAGTTGTTATAT
TCTTATTAATCATATCACATAATGAAAGCAATCGTATAGCAGCAGAAAGATTTCAACATTCGGGTCATACTATTTCTCTAGCTTTTAACCTTGTTTTGAGGAAGGTTTGC
AAGATTGGTAGAGAAATTATTCGCCCACCCAATATGGACAATGTACCAATGGAGATCGTATCAAATTCAAAATATTACCCTTTCTTTAAGGATTGTATTGGTGCTATTGG
TGGTACTCATGTTGCTGCAAGTATTCCCCAAAATGAACAAATAATGTTTCGTGGAAGAAAAACTAACACGACATGGAATATAATGTACGTTTGTTCATTTTATATGTTAT
TCACGTATGTCATGTCTGGTTGGGAAGGATCAGCCAATGATTCTCGCATACTTCTAGAATGTATCAAGAATATCGGGAATAAATTTCCTATGCCTAAGAGAGATCAATAC
TATCTTATCGATTTAGGATATTCAAATATGTCCGGATTTTTAGCACCATTTCGAGGTCAAAGATGTCATTTACGATTTTAG
Protein sequence
MSLHRIQASKQPCRTSALRGHDYVIELLNGNDSRCFDCFRMKRITSINFCEDLKSKTNLKPSRYLTVQEKVVIFLLIISHNESNRIAAERFQHSGHTISLAFNLVLRKVC
KIGREIIRPPNMDNVPMEIVSNSKYYPFFKDCIGAIGGTHVAASIPQNEQIMFRGRKTNTTWNIMYVCSFYMLFTYVMSGWEGSANDSRILLECIKNIGNKFPMPKRDQY
YLIDLGYSNMSGFLAPFRGQRCHLRF
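
The protein sequence above can be cross-checked against the CDS by translating it codon by codon. The following is a minimal sketch, assuming the standard nuclear genetic code (the page does not say which table was used). `translate` and `CODON_TABLE` are names introduced here for illustration.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from the wrapped sequence
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for codons with ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 and last 9 nucleotides of the CDS shown on this page.
print(translate("ATGAGCTTGCACAGGATACAAGCTTCAAAA"))  # MSLHRIQASK
print(translate("CGATTTTAG"))                       # RF
```

Passing the full wrapped CDS block to `translate` and comparing the result with the listed protein sequence gives a quick consistency check between the two.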