CuGenDBv2

Tan0001506 (gene) of Snake gourd v1 genome

Gene ID: Tan0001506
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Protamine P1 family protein
Genome location: LG03:66721563..66722581
RNA-Seq Expression: Tan0001506
Synteny: Tan0001506
Gene Ontology terms: NA
InterPro domains: NA
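
The genome location above follows a `sequence:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state the convention), the span can be parsed and measured as follows. `Location` and `parse_location` are hypothetical helper names used for illustration only; they are not part of CuGenDB.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval; coordinates are assumed 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive coordinates: both end points count toward the span.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string into a Location."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(match.group(1), int(match.group(2)), int(match.group(3)))


loc = parse_location("LG03:66721563..66722581")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the record spans 1019 bases on LG03.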


Homology
GenBank top hits (e-value, % identity, alignment)
KAA0047280.1 uncharacterized protein E6C27_scaffold908G00730 [Cucumis melo var. makuwa] (e-value: 2.8e-85; % identity: 69.44)
Query:  MKQSSKPISSPSRTDLFPPPLMSFIRADAGNRSKSSRSRSSPIFVRKKNVAIETQDPSSPKVTCMGQVRANKRSSKTSITSAAQCRWIRSVLSFNRRHCR
        MKQ SK ISSPSRTDLFPPPLMSF+RADAGNRSKSSRSRSSPIF+RKKNVAIET++PSSPKVTCMGQVR NKRSS    T A +CRWIRSVLSFNRRHCR
Subjt:  MKQSSKPISSPSRTDLFPPPLMSFIRADAGNRSKSSRSRSSPIFVRKKNVAIETQDPSSPKVTCMGQVRANKRSSKTSITSAAQCRWIRSVLSFNRRHCR

Query:  AFWNRSAMLFRRKRETRQKSSISESRVGNEMEDSEKDEENDG--VRDAVFA-PSVPPPPKNALILTRCRSAPHTSLIYGNRYRSSSMRIDRTGEEEKEDE
         FWNRSAMLFR KRE R+   ISESRVGNE EDSEKDEE+D    RDAV+A  SVP PPKNALILTRCRS P+ S    NRYRSSS+    T +    +E
Subjt:  AFWNRSAMLFRRKRETRQKSSISESRVGNEMEDSEKDEENDG--VRDAVFA-PSVPPPPKNALILTRCRSAPHTSLIYGNRYRSSSMRIDRTGEEEKEDE

Query:  EQKTERGDG----------NSGRLFKNLEESIGDVDSNSVNSKEIKIEDNSRLNRNLILTRCKSEPARISEKLYRELNLLEEERSVMA
        E+KTERG G          NS RL K LE S GD D  SVN            NRNLILTRCKSEPARI+EKLY ELNL EEER  +A
Subjt:  EQKTERGDG----------NSGRLFKNLEESIGDVDSNSVNSKEIKIEDNSRLNRNLILTRCKSEPARISEKLYRELNLLEEERSVMA

XP_008449922.1 PREDICTED: uncharacterized protein LOC103491651 [Cucumis melo] (e-value: 2.3e-87; % identity: 69.02)
Query:  MKQSSKPISSPSRTDLFPPPLMSFIRADAGNRSKSSRSRSSPIFVRKKNVAIETQDPSSPKVTCMGQVRANKRSSKTSITSAAQCRWIRSVLSFNRRHCR
        MKQ SK ISSPSRTDLFPPPLMSF+RADAGNRSKSSRSRSSPIF+RKKNVAIET++PSSPKVTCMGQVR NKRSS    T A +CRWIRSVLSFNRRHCR
Subjt:  MKQSSKPISSPSRTDLFPPPLMSFIRADAGNRSKSSRSRSSPIFVRKKNVAIETQDPSSPKVTCMGQVRANKRSSKTSITSAAQCRWIRSVLSFNRRHCR

Query:  AFWNRSAMLFRRKRETRQKSSISESRVGNEMEDSEKDEENDG--VRDAVFA-PSVPPPPKNALILTRCRSAPHTSLIYGNRYRSSSMRIDRTGEEEKEDE
         FWNRSAMLFR KRE R+   ISESRVGNE EDSEKDEE+D    RDAV+A  SVP PPKNALILTRCRS P+ S    NRYRSSS+    T +    +E
Subjt:  AFWNRSAMLFRRKRETRQKSSISESRVGNEMEDSEKDEENDG--VRDAVFA-PSVPPPPKNALILTRCRSAPHTSLIYGNRYRSSSMRIDRTGEEEKEDE

Query:  EQKTERGDG----------NSGRLFKNLEESIGDVDSNSVNSKEIKIEDNSRLNRNLILTRCKSEPARISEKLYRELNLLEEERSVMATNDSCSLNN
        E+KTERG G          NS RL K LE S GD D  SVN            NRNLILTRCKSEPARI+EKLY ELNL EEER VM   +S  LNN
Subjt:  EQKTERGDG----------NSGRLFKNLEESIGDVDSNSVNSKEIKIEDNSRLNRNLILTRCKSEPARISEKLYRELNLLEEERSVMATNDSCSLNN

XP_022925671.1 uncharacterized protein LOC111433021 [Cucurbita moschata] (e-value: 4.3e-86; % identity: 68.89)
Query:  MKQSSKPISSPSRTDLFPPPLMSFIRADAGNRSKSSRSRSSPIFVRKKNVAIETQDPSSPKVTCMGQVRANKRSSKTSITSAAQCRWIRSVLSFNRRHCR
        MKQS KPISSPSR DLFPPPLMSF+RADAGNRSKS RSRSSPIF+RKKNVAIETQ+PSSPKVTCMGQVR NKRSS+     A +CRWIRSVLSFNRR CR
Subjt:  MKQSSKPISSPSRTDLFPPPLMSFIRADAGNRSKSSRSRSSPIFVRKKNVAIETQDPSSPKVTCMGQVRANKRSSKTSITSAAQCRWIRSVLSFNRRHCR

Query:  AFWNRSAMLFRRKRETRQKSSISESRVGNEMEDSEKDEENDG-VRDAVFAPSVPPPPKNALILTRCRSAPHTSLIYGNRYRSSSMRIDRTGEEEKEDEEQ
         FWNRS M F+  RE R+KSSI+ESRV +E EDSE++EE++G  RD VFA S P PPKNALILTRCRSAPH S  Y NRY  SS+R DRT EEE E E+ 
Subjt:  AFWNRSAMLFRRKRETRQKSSISESRVGNEMEDSEKDEENDG-VRDAVFAPSVPPPPKNALILTRCRSAPHTSLIYGNRYRSSSMRIDRTGEEEKEDEEQ

Query:  KTERGDG--------NSGRLFKNLEESIGDVDSNSVNSKEIKIEDNSRLNRNLILTRCKSEPARISEKLY
        +    +G        NS R+F+ LE S G+ D +SVN+KE KIE+NS  NR+LILTRCKSEP RI E+LY
Subjt:  KTERGDG--------NSGRLFKNLEESIGDVDSNSVNSKEIKIEDNSRLNRNLILTRCKSEPARISEKLY

XP_023543942.1 uncharacterized protein LOC111803666 [Cucurbita pepo subsp. pepo] (e-value: 5.4e-89; % identity: 70)
Query:  MKQSSKPISSPSRTDLFPPPLMSFIRADAGNRSKSSRSRSSPIFVRKKNVAIETQDPSSPKVTCMGQVRANKRSSKTSITSAAQCRWIRSVLSFNRRHCR
        MKQS KPISSPSR DLFPPPLMSF+RADAGNRSKS RSRSSPIF+RKKNVAIETQ+PSSPKVTCMGQVR NKRSS+     A +CRWIRSVLSFNRRHCR
Subjt:  MKQSSKPISSPSRTDLFPPPLMSFIRADAGNRSKSSRSRSSPIFVRKKNVAIETQDPSSPKVTCMGQVRANKRSSKTSITSAAQCRWIRSVLSFNRRHCR

Query:  AFWNRSAMLFRRKRETRQKSSISESRVGNEMEDSEKDEENDG-VRDAVFAPSVPPPPKNALILTRCRSAPHTSLIYGNRYRSSSMRIDRTGEEEKEDEEQ
         FWNRS M F+ KRE R+KSSI+ESRV +E EDSE++EE++G  RD VFA S P PPKNALILTRCRSAPH S  YGNRY  SS+R DR  EEE E E+ 
Subjt:  AFWNRSAMLFRRKRETRQKSSISESRVGNEMEDSEKDEENDG-VRDAVFAPSVPPPPKNALILTRCRSAPHTSLIYGNRYRSSSMRIDRTGEEEKEDEEQ

Query:  KTERGDG--------NSGRLFKNLEESIGDVDSNSVNSKEIKIEDNSRLNRNLILTRCKSEPARISEKLY
        +   G+G        NS R+F+ LE S G+ D +SVN+KE KIE+NS  NR+LILTRCKSEP RI E+LY
Subjt:  KTERGDG--------NSGRLFKNLEESIGDVDSNSVNSKEIKIEDNSRLNRNLILTRCKSEPARISEKLY

XP_038882779.1 uncharacterized protein LOC120073931 [Benincasa hispida] (e-value: 4.7e-85; % identity: 67.61)
Query:  MKQSSKPISSPSRTDLFPPPLMSFIRADAGNRSKSSRSRSSPIFVRKKN-VAIETQDPSSPKVTCMGQVRANKRSSKTSITSAAQCRWIRSVLSFNRRHC
        MK+ SK ISSPSRTDLFPPPLMSF+RADAGNRSKS RSRSSPIFV KKN VAIETQ+PSSPKVTCMGQVRA+        T AA+CRWIRSVLSFNRR+C
Subjt:  MKQSSKPISSPSRTDLFPPPLMSFIRADAGNRSKSSRSRSSPIFVRKKN-VAIETQDPSSPKVTCMGQVRANKRSSKTSITSAAQCRWIRSVLSFNRRHC

Query:  RAFWNRSAMLFRRKRETRQKSSISESRVGNEMEDSEKDEEND-GVRDAVFAPSVPPPPKNALILTRCRSAPHTSLIYGNRYRSSSMRIDRTGEEEKEDEE
        R FWN SAM FRRK E R+KSSI ESRVGNE EDSEKDEEND G RDAVF+ SVP PPKNALILTRCRSAP+ +  YGNRYRS  +  D +GEEE++ EE
Subjt:  RAFWNRSAMLFRRKRETRQKSSISESRVGNEMEDSEKDEEND-GVRDAVFAPSVPPPPKNALILTRCRSAPHTSLIYGNRYRSSSMRIDRTGEEEKEDEE

Query:  QKTERGDGNSGR----------LFKNLEESIGDVDSNSVNSKEIKIEDNSRLNRNLILTRCKSEPARISEKLYRELNLLEEERS
               GNS            L+  +E + GD D   V+ KE  +E+ S LNR LILTRCKSEPARI+EK+Y ELNL EEER+
Subjt:  QKTERGDGNSGR----------LFKNLEESIGDVDSNSVNSKEIKIEDNSRLNRNLILTRCKSEPARISEKLYRELNLLEEERS

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0KK43 Uncharacterized protein (e-value: 7.6e-81; % identity: 65.33)
Query:  MKQSSKPISSPSRTDLFPPPLMSFIRADAGNRSKSSRSRSSPIFVRKKNVAIETQDPSSPKVTCMGQVRANKRSSKTSITSAAQCRWIRSVLSFNRRHCR
        MKQ SK ISSPSRTDLFPPPLMSF+RADAGNRSKSSRSRSSPIFV KKNVAIETQ+PSSPKVTCMGQVR NK SS    T A +CRWIRSVLSFNRRHCR
Subjt:  MKQSSKPISSPSRTDLFPPPLMSFIRADAGNRSKSSRSRSSPIFVRKKNVAIETQDPSSPKVTCMGQVRANKRSSKTSITSAAQCRWIRSVLSFNRRHCR

Query:  AFWNRSAMLFRRKRETRQKSSISESRVGNEMEDSEKDEENDGVR--DAVFAP-SVPPPPKNALILTRCRSAPHTSLIYGNRYRSSSMRIDRTGEEEKEDE
         FWNRSAML R KRE R+   ISESRVGNE EDSEKDEE D  R  DAV++  SVP PPKNALIL+RCRSAP+ S   G RYRSSS+  D T E E   E
Subjt:  AFWNRSAMLFRRKRETRQKSSISESRVGNEMEDSEKDEENDGVR--DAVFAP-SVPPPPKNALILTRCRSAPHTSLIYGNRYRSSSMRIDRTGEEEKEDE

Query:  EQKTERGD----------GNSGRLFKNLEESIGDVDSNSVNSKEIKIEDNSRLNRNLILTRCKSEPARISEKLYRELNLLEEERSVMATNDSCSLNNFRL
        E+KTE G           G S RL K +E S GD DS SVN            N NLILTR KSEP RI+EKLY ELN L+EE+  +     C + N  L
Subjt:  EQKTERGD----------GNSGRLFKNLEESIGDVDSNSVNSKEIKIEDNSRLNRNLILTRCKSEPARISEKLYRELNLLEEERSVMATNDSCSLNNFRL

A0A1S3BN59 uncharacterized protein LOC103491651 (e-value: 1.1e-87; % identity: 69.02)
Query:  MKQSSKPISSPSRTDLFPPPLMSFIRADAGNRSKSSRSRSSPIFVRKKNVAIETQDPSSPKVTCMGQVRANKRSSKTSITSAAQCRWIRSVLSFNRRHCR
        MKQ SK ISSPSRTDLFPPPLMSF+RADAGNRSKSSRSRSSPIF+RKKNVAIET++PSSPKVTCMGQVR NKRSS    T A +CRWIRSVLSFNRRHCR
Subjt:  MKQSSKPISSPSRTDLFPPPLMSFIRADAGNRSKSSRSRSSPIFVRKKNVAIETQDPSSPKVTCMGQVRANKRSSKTSITSAAQCRWIRSVLSFNRRHCR

Query:  AFWNRSAMLFRRKRETRQKSSISESRVGNEMEDSEKDEENDG--VRDAVFA-PSVPPPPKNALILTRCRSAPHTSLIYGNRYRSSSMRIDRTGEEEKEDE
         FWNRSAMLFR KRE R+   ISESRVGNE EDSEKDEE+D    RDAV+A  SVP PPKNALILTRCRS P+ S    NRYRSSS+    T +    +E
Subjt:  AFWNRSAMLFRRKRETRQKSSISESRVGNEMEDSEKDEENDG--VRDAVFA-PSVPPPPKNALILTRCRSAPHTSLIYGNRYRSSSMRIDRTGEEEKEDE

Query:  EQKTERGDG----------NSGRLFKNLEESIGDVDSNSVNSKEIKIEDNSRLNRNLILTRCKSEPARISEKLYRELNLLEEERSVMATNDSCSLNN
        E+KTERG G          NS RL K LE S GD D  SVN            NRNLILTRCKSEPARI+EKLY ELNL EEER VM   +S  LNN
Subjt:  EQKTERGDG----------NSGRLFKNLEESIGDVDSNSVNSKEIKIEDNSRLNRNLILTRCKSEPARISEKLYRELNLLEEERSVMATNDSCSLNN

A0A5A7TVU1 Uncharacterized protein (e-value: 1.3e-85; % identity: 69.44)
Query:  MKQSSKPISSPSRTDLFPPPLMSFIRADAGNRSKSSRSRSSPIFVRKKNVAIETQDPSSPKVTCMGQVRANKRSSKTSITSAAQCRWIRSVLSFNRRHCR
        MKQ SK ISSPSRTDLFPPPLMSF+RADAGNRSKSSRSRSSPIF+RKKNVAIET++PSSPKVTCMGQVR NKRSS    T A +CRWIRSVLSFNRRHCR
Subjt:  MKQSSKPISSPSRTDLFPPPLMSFIRADAGNRSKSSRSRSSPIFVRKKNVAIETQDPSSPKVTCMGQVRANKRSSKTSITSAAQCRWIRSVLSFNRRHCR

Query:  AFWNRSAMLFRRKRETRQKSSISESRVGNEMEDSEKDEENDG--VRDAVFA-PSVPPPPKNALILTRCRSAPHTSLIYGNRYRSSSMRIDRTGEEEKEDE
         FWNRSAMLFR KRE R+   ISESRVGNE EDSEKDEE+D    RDAV+A  SVP PPKNALILTRCRS P+ S    NRYRSSS+    T +    +E
Subjt:  AFWNRSAMLFRRKRETRQKSSISESRVGNEMEDSEKDEENDG--VRDAVFA-PSVPPPPKNALILTRCRSAPHTSLIYGNRYRSSSMRIDRTGEEEKEDE

Query:  EQKTERGDG----------NSGRLFKNLEESIGDVDSNSVNSKEIKIEDNSRLNRNLILTRCKSEPARISEKLYRELNLLEEERSVMA
        E+KTERG G          NS RL K LE S GD D  SVN            NRNLILTRCKSEPARI+EKLY ELNL EEER  +A
Subjt:  EQKTERGDG----------NSGRLFKNLEESIGDVDSNSVNSKEIKIEDNSRLNRNLILTRCKSEPARISEKLYRELNLLEEERSVMA

A0A6J1ECV0 uncharacterized protein LOC111433021 (e-value: 2.1e-86; % identity: 68.89)
Query:  MKQSSKPISSPSRTDLFPPPLMSFIRADAGNRSKSSRSRSSPIFVRKKNVAIETQDPSSPKVTCMGQVRANKRSSKTSITSAAQCRWIRSVLSFNRRHCR
        MKQS KPISSPSR DLFPPPLMSF+RADAGNRSKS RSRSSPIF+RKKNVAIETQ+PSSPKVTCMGQVR NKRSS+     A +CRWIRSVLSFNRR CR
Subjt:  MKQSSKPISSPSRTDLFPPPLMSFIRADAGNRSKSSRSRSSPIFVRKKNVAIETQDPSSPKVTCMGQVRANKRSSKTSITSAAQCRWIRSVLSFNRRHCR

Query:  AFWNRSAMLFRRKRETRQKSSISESRVGNEMEDSEKDEENDG-VRDAVFAPSVPPPPKNALILTRCRSAPHTSLIYGNRYRSSSMRIDRTGEEEKEDEEQ
         FWNRS M F+  RE R+KSSI+ESRV +E EDSE++EE++G  RD VFA S P PPKNALILTRCRSAPH S  Y NRY  SS+R DRT EEE E E+ 
Subjt:  AFWNRSAMLFRRKRETRQKSSISESRVGNEMEDSEKDEENDG-VRDAVFAPSVPPPPKNALILTRCRSAPHTSLIYGNRYRSSSMRIDRTGEEEKEDEEQ

Query:  KTERGDG--------NSGRLFKNLEESIGDVDSNSVNSKEIKIEDNSRLNRNLILTRCKSEPARISEKLY
        +    +G        NS R+F+ LE S G+ D +SVN+KE KIE+NS  NR+LILTRCKSEP RI E+LY
Subjt:  KTERGDG--------NSGRLFKNLEESIGDVDSNSVNSKEIKIEDNSRLNRNLILTRCKSEPARISEKLY

A0A6J1IHY6 uncharacterized protein LOC111477626 (e-value: 1.3e-83; % identity: 69.7)
Query:  MKQSSKPISSPSRTDLFPPPLMSFIRADAGNRSKSSRSRSSPIFVRKKNVAIETQDPSSPKVTCMGQVRANKRSSKTSITSAAQCRWIRSVLSFNRRHCR
        MKQS KPISSPSR DLFPPPLMSF+RADAGNRSKS RSRSSPIF+RKKNV IETQ+PSSPKVTCMGQVR NKRSS+     A +CRWIRSVLSFNRRHCR
Subjt:  MKQSSKPISSPSRTDLFPPPLMSFIRADAGNRSKSSRSRSSPIFVRKKNVAIETQDPSSPKVTCMGQVRANKRSSKTSITSAAQCRWIRSVLSFNRRHCR

Query:  AFWNRSAMLFRRKRETRQKSSISESRVGNEMEDSEKDEENDG-VRDAVFAPSVPPPPKNALILTRCRSAPHTSLIYGNRYRSSSMRIDRTGEEEK--EDE
         FWNRS M F+ K E R+KSSI+ESRV +E EDSE++EE++G  RDAVFA S P PPKNALILTRCRSAPH S  YGN  RS     DRT EE K     
Subjt:  AFWNRSAMLFRRKRETRQKSSISESRVGNEMEDSEKDEENDG-VRDAVFAPSVPPPPKNALILTRCRSAPHTSLIYGNRYRSSSMRIDRTGEEEK--EDE

Query:  EQKTERGDGNSGRLFKNLEESIGDVDSNSVNSKEIKIEDNSRLNRNLILTRCKSEPARISEKLY
           ++    NS R+FK LE S G+ D +SVN+KE KIE+NS  NR+LILTRCKSEP RI EKLY
Subjt:  EQKTERGDGNSGRLFKNLEESIGDVDSNSVNSKEIKIEDNSRLNRNLILTRCKSEPARISEKLY

SwissProt top hits (e-value, % identity, alignment)
No hits found
Arabidopsis top hits (e-value, % identity, alignment)
AT2G37100.1 protamine P1 family protein (e-value: 9.7e-20; % identity: 34.32)
Query:  SSKPISSPSRTDLFPPPLMSFIRADAGNRSKSSRSRSSPIFVRKKN--VAIETQDPSSPKVTCMGQVRANKRSSKTSITS------------AAQCRWIR
        SS+P+SSP RT+  PP LM F+R  + +RS+ SRSR  PIF R+KN   A ETQ+P+SPKVTCMGQVR N+       T+            + +C W++
Subjt:  SSKPISSPSRTDLFPPPLMSFIRADAGNRSKSSRSRSSPIFVRKKN--VAIETQDPSSPKVTCMGQVRANKRSSKTSITS------------AAQCRWIR

Query:  SVLSFNRRHCRAF------------WNR----SAMLFRRKRETRQKSSISE-----SRVGNEMEDSEKDEENDGVRDAVFAPSVPPPPKNALILTRCRSA
             N   C +F            W +    S   F +K E R  SS SE     S V  E  +  + EEN     +        PP+NA +LTRCRSA
Subjt:  SVLSFNRRHCRAF------------WNR----SAMLFRRKRETRQKSSISE-----SRVGNEMEDSEKDEENDGVRDAVFAPSVPPPPKNALILTRCRSA

Query:  PHTS-----LIYGNRYRSSSMRIDR--TGEEEKEDEEQKTERGDGNSGRLFKNLEESIGDVDSNSVNSKEIKIEDNSRLNRNLILTRCKSEPAR-ISEKL
        P+ S      ++ ++  ++     R  + E     EE KT               E + D    S  S+E K        + LILTRC SEPAR + E  
Subjt:  PHTS-----LIYGNRYRSSSMRIDR--TGEEEKEDEEQKTERGDGNSGRLFKNLEESIGDVDSNSVNSKEIKIEDNSRLNRNLILTRCKSEPAR-ISEKL

Query:  YRE
        YR+
Subjt:  YRE

AT5G03110.1 FUNCTIONS IN: molecular_function unknown (e-value: 3.0e-21; % identity: 34.15)
Query:  MKQSSKPISSPSRTDLFPPPLMSFIR--ADAGNRSKS-----SRSRSSPIFVRK-KNVAIETQDPSSPKVTCMGQVRANKRSSKTSITS-----AAQCRW
        M  S +P+SSP R + +PPP M F+R  ++ G+ S+S      RSR+SP+FVR+ K+ A   Q+PSSPKVTCMGQVR N+   K    S       +C W
Subjt:  MKQSSKPISSPSRTDLFPPPLMSFIR--ADAGNRSKS-----SRSRSSPIFVRK-KNVAIETQDPSSPKVTCMGQVRANKRSSKTSITS-----AAQCRW

Query:  IRSVLSFN----RRHCRAFWNR--------SAMLFRRKRETRQKSSISESRVGNEMEDSEKDEENDGVRDAVFAPSVPPPPKNALILTRCRSAPHTSLIY
        +R+   +N    +     FW +        +    + K   R +     +    E+++  + EEN  +     +P+   PP NAL+LTR RSAP      
Subjt:  IRSVLSFN----RRHCRAFWNR--------SAMLFRRKRETRQKSSISESRVGNEMEDSEKDEENDGVRDAVFAPSVPPPPKNALILTRCRSAPHTSLIY

Query:  GNRYRSSSMRIDRTGEE--EKEDEEQKTERGDGNSGRLFKNLEESIGDVDSNSVNSKEIKIEDNSRLNRNLILTRCKSEPARISEKL
           YRSSS+   R  EE  ++E E Q+  R +     +   +E+  G  +   V+  E +     R  R  +LTR KSEPARI EK+
Subjt:  GNRYRSSSMRIDRTGEE--EKEDEEQKTERGDGNSGRLFKNLEESIGDVDSNSVNSKEIKIEDNSRLNRNLILTRCKSEPARISEKL
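
In the alignment blocks above, the unlabeled middle line marks each column: a letter for an identical residue, `+` for a conservative substitution, and a space for a mismatch or gap. As a rough sketch of how a % identity figure relates to that line, assuming identity is counted as identical columns divided by alignment length, one could compute it as below. `percent_identity` is a hypothetical helper, and the input is only the first ten columns of the first GenBank alignment, so the result is not expected to match the whole-alignment figure reported for that hit.

```python
def percent_identity(midline: str) -> float:
    """Percent of alignment columns whose midline character is a letter.

    Letters mark identical residues; '+' (conservative substitution) and
    spaces (mismatch or gap) do not count as identities.
    """
    if not midline:
        raise ValueError("empty midline")
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(midline)


# First ten columns of the KAA0047280.1 alignment:
#   Query:    MKQSSKPISS
#   midline:  MKQ SK ISS
fragment_midline = "MKQ SK ISS"
print(percent_identity(fragment_midline))
```

The midline must be passed with its column alignment intact, since leading and internal spaces are themselves mismatch columns.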


Sequences
CDS sequence
ATGAAGCAATCATCGAAACCGATTTCCAGTCCCAGTCGGACCGACCTGTTTCCGCCGCCATTGATGAGCTTTATCAGAGCCGATGCCGGAAATCGTAGTAAAAGCAGCCG
GTCTCGCTCCAGTCCGATCTTCGTCAGGAAGAAGAACGTCGCCATTGAAACTCAAGACCCGTCCTCTCCTAAGGTCACTTGTATGGGACAAGTCCGCGCCAATAAACGCT
CCTCTAAAACCTCTATCACCAGCGCCGCCCAGTGCCGGTGGATTAGAAGCGTCCTATCGTTCAATCGACGCCATTGTCGAGCCTTCTGGAACAGGTCGGCGATGCTATTC
CGAAGAAAGCGTGAAACTAGACAGAAATCATCAATCTCTGAATCTCGCGTCGGAAATGAAATGGAGGATTCGGAGAAAGATGAAGAGAACGACGGAGTCAGGGATGCGGT
TTTTGCGCCCTCGGTTCCACCGCCGCCGAAGAACGCTCTCATTCTGACGAGATGTAGATCTGCGCCGCATACGTCGTTGATTTACGGCAATCGGTATCGGAGCTCGTCGA
TGAGGATCGACAGAACTGGAGAAGAAGAGAAAGAAGATGAAGAACAGAAAACAGAGCGCGGTGATGGAAACTCGGGGCGATTGTTCAAAAACCTCGAAGAGTCAATCGGA
GATGTCGATTCAAACTCTGTAAATAGCAAAGAGATCAAAATCGAGGACAACTCGAGATTGAATCGGAACCTGATTCTGACGAGATGTAAATCGGAACCTGCAAGAATTTC
GGAGAAACTTTACAGAGAGTTGAATCTTTTGGAAGAAGAAAGGTCGGTTATGGCTACGAACGATTCTTGCTCATTGAACAACTTTCGATTAGATGATTAA
mRNA sequence
GATAGCTCTGTGTCATGAAACCAGAGCGACCAATCACCAGCACAGATTCTTAGTGGAGGACAAAACGAAACGAAACGAAACGACTCAATTCATGTTTTGATTTTCATTGC
AACAATGAAGCAATCATCGAAACCGATTTCCAGTCCCAGTCGGACCGACCTGTTTCCGCCGCCATTGATGAGCTTTATCAGAGCCGATGCCGGAAATCGTAGTAAAAGCA
GCCGGTCTCGCTCCAGTCCGATCTTCGTCAGGAAGAAGAACGTCGCCATTGAAACTCAAGACCCGTCCTCTCCTAAGGTCACTTGTATGGGACAAGTCCGCGCCAATAAA
CGCTCCTCTAAAACCTCTATCACCAGCGCCGCCCAGTGCCGGTGGATTAGAAGCGTCCTATCGTTCAATCGACGCCATTGTCGAGCCTTCTGGAACAGGTCGGCGATGCT
ATTCCGAAGAAAGCGTGAAACTAGACAGAAATCATCAATCTCTGAATCTCGCGTCGGAAATGAAATGGAGGATTCGGAGAAAGATGAAGAGAACGACGGAGTCAGGGATG
CGGTTTTTGCGCCCTCGGTTCCACCGCCGCCGAAGAACGCTCTCATTCTGACGAGATGTAGATCTGCGCCGCATACGTCGTTGATTTACGGCAATCGGTATCGGAGCTCG
TCGATGAGGATCGACAGAACTGGAGAAGAAGAGAAAGAAGATGAAGAACAGAAAACAGAGCGCGGTGATGGAAACTCGGGGCGATTGTTCAAAAACCTCGAAGAGTCAAT
CGGAGATGTCGATTCAAACTCTGTAAATAGCAAAGAGATCAAAATCGAGGACAACTCGAGATTGAATCGGAACCTGATTCTGACGAGATGTAAATCGGAACCTGCAAGAA
TTTCGGAGAAACTTTACAGAGAGTTGAATCTTTTGGAAGAAGAAAGGTCGGTTATGGCTACGAACGATTCTTGCTCATTGAACAACTTTCGATTAGATGATTAAATTTCT
TCCAGTGTTTTTCTTTCCTCCATGAATGA
Protein sequence
MKQSSKPISSPSRTDLFPPPLMSFIRADAGNRSKSSRSRSSPIFVRKKNVAIETQDPSSPKVTCMGQVRANKRSSKTSITSAAQCRWIRSVLSFNRRHCRAFWNRSAMLF
RRKRETRQKSSISESRVGNEMEDSEKDEENDGVRDAVFAPSVPPPPKNALILTRCRSAPHTSLIYGNRYRSSSMRIDRTGEEEKEDEEQKTERGDGNSGRLFKNLEESIG
DVDSNSVNSKEIKIEDNSRLNRNLILTRCKSEPARISEKLYRELNLLEEERSVMATNDSCSLNNFRLDD
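
The CDS and protein sequences above can be cross-checked by translation. The following is a minimal sketch, assuming the standard genetic code and a CDS given in frame from its start codon. `translate` is a hypothetical helper written for illustration, and only the first 30 bases of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing partial codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the CDS sequence listed above.
cds_start = "ATGAAGCAATCATCGAAACCGATTTCCAGT"
print(translate(cds_start))
```

The fragment translates to `MKQSSKPISS`, which agrees with the first ten residues of the listed protein sequence. Passing the full CDS, with its lines joined, would allow the same comparison over the whole protein.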