; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CsGy5G019523 (gene) of Cucumber (Gy14) v2.1 genome

Gene IDCsGy5G019523
OrganismCucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
DescriptionFUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: 50S rib s in 7 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 36; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).
Genome locationGy14Chr5:25857836..25862419
RNA-Seq ExpressionCsGy5G019523
SyntenyCsGy5G019523
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7032525.1 hypothetical protein SDJN02_06574, partial [Cucurbita argyrosperma subsp. argyrosperma]3.93e-8079.74Show/hide
Query:  MASLLKFKLLPTHCGVAQSPTLSPRTSPLVHLRRRKTTLRMLLTRNSGRRSPRRRSLPENKNRDDRKGLSRSNTLKDLFVSSPPYLGTDCDVHQTAVTAP
        MASLLKFKLLPTHCGVAQSPTLSPRTSPLVHLRRRKTTLRMLLTRNS RRSPRRR+LP+ K RDD K LSRSNTLKDLFVSSPPY  +D ++++T   AP
Subjt:  MASLLKFKLLPTHCGVAQSPTLSPRTSPLVHLRRRKTTLRMLLTRNSGRRSPRRRSLPENKNRDDRKGLSRSNTLKDLFVSSPPYLGTDCDVHQTAVTAP

Query:  TRNVTPVCEKEKEEGHVGSPGWNPGSPRPGWTGFRYKYLLRKTWRPVLVGIPE
         RNVTPVC +E  +G +G  GW+PGSPRPGWTGFRYKYLLRKTWRP+L GIPE
Subjt:  TRNVTPVCEKEKEEGHVGSPGWNPGSPRPGWTGFRYKYLLRKTWRPVLVGIPE

XP_008467191.1 PREDICTED: uncharacterized protein LOC103504599 [Cucumis melo]1.17e-9893.55Show/hide
Query:  MASLLKFKLLPTH-CGVAQSPTLSPRTSPLVHLRRRKTTLRMLLTRNSGRRSPRR-RSLPENKNRDDRKGLSRSNTLKDLFVSSPPYLGTDCDVHQTAVT
        MASLLKFKLLPTH CGVAQSPTLSPRTSPLVHLRRRKTTLRMLLTRNSGRRSPRR R+LPENK RDD KGLSRSNTLKDLFVSSPPY+GTDCDVH+TAVT
Subjt:  MASLLKFKLLPTH-CGVAQSPTLSPRTSPLVHLRRRKTTLRMLLTRNSGRRSPRR-RSLPENKNRDDRKGLSRSNTLKDLFVSSPPYLGTDCDVHQTAVT

Query:  APTRNVTPVCEKEKEEGHVGSPGWNPGSPRPGWTGFRYKYLLRKTWRPVLVGIPE
        APT+N TPVCEKEKEEG VGSPGWNPGSPRPGWTGFRYKYLLRKTWRPVLVGIPE
Subjt:  APTRNVTPVCEKEKEEGHVGSPGWNPGSPRPGWTGFRYKYLLRKTWRPVLVGIPE

XP_011655483.1 uncharacterized protein LOC105435544 [Cucumis sativus]7.74e-108100Show/hide
Query:  MASLLKFKLLPTHCGVAQSPTLSPRTSPLVHLRRRKTTLRMLLTRNSGRRSPRRRSLPENKNRDDRKGLSRSNTLKDLFVSSPPYLGTDCDVHQTAVTAP
        MASLLKFKLLPTHCGVAQSPTLSPRTSPLVHLRRRKTTLRMLLTRNSGRRSPRRRSLPENKNRDDRKGLSRSNTLKDLFVSSPPYLGTDCDVHQTAVTAP
Subjt:  MASLLKFKLLPTHCGVAQSPTLSPRTSPLVHLRRRKTTLRMLLTRNSGRRSPRRRSLPENKNRDDRKGLSRSNTLKDLFVSSPPYLGTDCDVHQTAVTAP

Query:  TRNVTPVCEKEKEEGHVGSPGWNPGSPRPGWTGFRYKYLLRKTWRPVLVGIPE
        TRNVTPVCEKEKEEGHVGSPGWNPGSPRPGWTGFRYKYLLRKTWRPVLVGIPE
Subjt:  TRNVTPVCEKEKEEGHVGSPGWNPGSPRPGWTGFRYKYLLRKTWRPVLVGIPE

XP_023513243.1 uncharacterized protein LOC111777755 [Cucurbita pepo subsp. pepo]2.37e-8179.74Show/hide
Query:  MASLLKFKLLPTHCGVAQSPTLSPRTSPLVHLRRRKTTLRMLLTRNSGRRSPRRRSLPENKNRDDRKGLSRSNTLKDLFVSSPPYLGTDCDVHQTAVTAP
        MASLLKFKLLPTHCGVAQSPTLSPRTSPLVHLRRRKTTLRMLLTRNS RRSPRRR+LP+ K RDD K LSRSNTLKDLFVSSPPY  ++C++++T   AP
Subjt:  MASLLKFKLLPTHCGVAQSPTLSPRTSPLVHLRRRKTTLRMLLTRNSGRRSPRRRSLPENKNRDDRKGLSRSNTLKDLFVSSPPYLGTDCDVHQTAVTAP

Query:  TRNVTPVCEKEKEEGHVGSPGWNPGSPRPGWTGFRYKYLLRKTWRPVLVGIPE
         RNVTPVC +E  +G +G  GW+PGSPRPGWTGFRYKYLLRKTWRP+L GIPE
Subjt:  TRNVTPVCEKEKEEGHVGSPGWNPGSPRPGWTGFRYKYLLRKTWRPVLVGIPE

XP_038875200.1 uncharacterized protein LOC120067717 [Benincasa hispida]6.84e-8886.93Show/hide
Query:  MASLLKFKLLPTHCGVAQSPTLSPRTSPLVHLRRRKTTLRMLLTRNSGRRSPRRRSLPENKNRDDRKGLSRSNTLKDLFVSSPPYLGTDCDVHQTAVTAP
        MASLLKFKLLPTHCGVAQSPTLSPRTSPLVHLRRRKTTLRMLLTRNSGRRSP RR+LPE K RDD K LSRSNTLKDLFVSSPPY+ TD D+H+TAV AP
Subjt:  MASLLKFKLLPTHCGVAQSPTLSPRTSPLVHLRRRKTTLRMLLTRNSGRRSPRRRSLPENKNRDDRKGLSRSNTLKDLFVSSPPYLGTDCDVHQTAVTAP

Query:  TRNVTPVCEKEKEEGHVGSPGWNPGSPRPGWTGFRYKYLLRKTWRPVLVGIPE
        TR+ T VC KEKE+G VGS GWNPGSPRPGWTGFRYKYLLRKTWRPVL GIPE
Subjt:  TRNVTPVCEKEKEEGHVGSPGWNPGSPRPGWTGFRYKYLLRKTWRPVLVGIPE

TrEMBL top hitse value%identityAlignment
A0A0A0KPK6 Uncharacterized protein3.75e-108100Show/hide
Query:  MASLLKFKLLPTHCGVAQSPTLSPRTSPLVHLRRRKTTLRMLLTRNSGRRSPRRRSLPENKNRDDRKGLSRSNTLKDLFVSSPPYLGTDCDVHQTAVTAP
        MASLLKFKLLPTHCGVAQSPTLSPRTSPLVHLRRRKTTLRMLLTRNSGRRSPRRRSLPENKNRDDRKGLSRSNTLKDLFVSSPPYLGTDCDVHQTAVTAP
Subjt:  MASLLKFKLLPTHCGVAQSPTLSPRTSPLVHLRRRKTTLRMLLTRNSGRRSPRRRSLPENKNRDDRKGLSRSNTLKDLFVSSPPYLGTDCDVHQTAVTAP

Query:  TRNVTPVCEKEKEEGHVGSPGWNPGSPRPGWTGFRYKYLLRKTWRPVLVGIPE
        TRNVTPVCEKEKEEGHVGSPGWNPGSPRPGWTGFRYKYLLRKTWRPVLVGIPE
Subjt:  TRNVTPVCEKEKEEGHVGSPGWNPGSPRPGWTGFRYKYLLRKTWRPVLVGIPE

A0A1S3CSY3 uncharacterized protein LOC1035045995.68e-9993.55Show/hide
Query:  MASLLKFKLLPTH-CGVAQSPTLSPRTSPLVHLRRRKTTLRMLLTRNSGRRSPRR-RSLPENKNRDDRKGLSRSNTLKDLFVSSPPYLGTDCDVHQTAVT
        MASLLKFKLLPTH CGVAQSPTLSPRTSPLVHLRRRKTTLRMLLTRNSGRRSPRR R+LPENK RDD KGLSRSNTLKDLFVSSPPY+GTDCDVH+TAVT
Subjt:  MASLLKFKLLPTH-CGVAQSPTLSPRTSPLVHLRRRKTTLRMLLTRNSGRRSPRR-RSLPENKNRDDRKGLSRSNTLKDLFVSSPPYLGTDCDVHQTAVT

Query:  APTRNVTPVCEKEKEEGHVGSPGWNPGSPRPGWTGFRYKYLLRKTWRPVLVGIPE
        APT+N TPVCEKEKEEG VGSPGWNPGSPRPGWTGFRYKYLLRKTWRPVLVGIPE
Subjt:  APTRNVTPVCEKEKEEGHVGSPGWNPGSPRPGWTGFRYKYLLRKTWRPVLVGIPE

A0A5D3BMD8 Uncharacterized protein5.68e-9993.55Show/hide
Query:  MASLLKFKLLPTH-CGVAQSPTLSPRTSPLVHLRRRKTTLRMLLTRNSGRRSPRR-RSLPENKNRDDRKGLSRSNTLKDLFVSSPPYLGTDCDVHQTAVT
        MASLLKFKLLPTH CGVAQSPTLSPRTSPLVHLRRRKTTLRMLLTRNSGRRSPRR R+LPENK RDD KGLSRSNTLKDLFVSSPPY+GTDCDVH+TAVT
Subjt:  MASLLKFKLLPTH-CGVAQSPTLSPRTSPLVHLRRRKTTLRMLLTRNSGRRSPRR-RSLPENKNRDDRKGLSRSNTLKDLFVSSPPYLGTDCDVHQTAVT

Query:  APTRNVTPVCEKEKEEGHVGSPGWNPGSPRPGWTGFRYKYLLRKTWRPVLVGIPE
        APT+N TPVCEKEKEEG VGSPGWNPGSPRPGWTGFRYKYLLRKTWRPVLVGIPE
Subjt:  APTRNVTPVCEKEKEEGHVGSPGWNPGSPRPGWTGFRYKYLLRKTWRPVLVGIPE

A0A6J1FLF8 uncharacterized protein LOC1114453021.09e-7778.43Show/hide
Query:  MASLLKFKLLPTHCGVAQSPTLSPRTSPLVHLRRRKTTLRMLLTRNSGRRSPRRRSLPENKNRDDRKGLSRSNTLKDLFVSSPPYLGTDCDVHQTAVTAP
        MASLLKFKLL THCGVAQSPTLSPRTSPL+HLRRRKTTLRMLLTRN  RRSPRR +LPE KN DDRK L+R N LKDLFVSSPPY GTD  +++TA+ AP
Subjt:  MASLLKFKLLPTHCGVAQSPTLSPRTSPLVHLRRRKTTLRMLLTRNSGRRSPRRRSLPENKNRDDRKGLSRSNTLKDLFVSSPPYLGTDCDVHQTAVTAP

Query:  TRNVTPVCEKEKEEGHVGSPGWNPGSPRPGWTGFRYKYLLRKTWRPVLVGIPE
         RNVTPVC+K+ ++ H+GS G  PGSPRPGWTGFRYKYLL+KTWRPVL GIPE
Subjt:  TRNVTPVCEKEKEEGHVGSPGWNPGSPRPGWTGFRYKYLLRKTWRPVLVGIPE

A0A6J1HGM7 uncharacterized protein LOC1114633737.40e-7877.78Show/hide
Query:  MASLLKFKLLPTHCGVAQSPTLSPRTSPLVHLRRRKTTLRMLLTRNSGRRSPRRRSLPENKNRDDRKGLSRSNTLKDLFVSSPPYLGTDCDVHQTAVTAP
        MASLLKFKLLPTHCGVAQSPTLSPRTSPLVHLRRRKTTLRMLLTRNS RRSPRRR+LP+ K RDD K LSRSNTLKDLFVSSPPY  +D ++++T   AP
Subjt:  MASLLKFKLLPTHCGVAQSPTLSPRTSPLVHLRRRKTTLRMLLTRNSGRRSPRRRSLPENKNRDDRKGLSRSNTLKDLFVSSPPYLGTDCDVHQTAVTAP

Query:  TRNVTPVCEKEKEEGHVGSPGWNPGSPRPGWTGFRYKYLLRKTWRPVLVGIPE
         RNVTPVC +E  +G +G   W+PGSPRPGWTGFR+KYLLRKTWRP+L  IPE
Subjt:  TRNVTPVCEKEKEEGHVGSPGWNPGSPRPGWTGFRYKYLLRKTWRPVLVGIPE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G66890.1 FUNCTIONS IN: molecular_function unknown1.7e-1544.51Show/hide
Query:  SLLKFKLLPTHCG-VAQSPTLSPRTSPLVHLRRRKTTLRMLLTRNSGR-RSPR-RRSLPENKNRDDRKGLSRSNTLKDLFVSSPPY----------LGTD
        SL KFKLL THC  VA+SPT     SP++HLRRRK TLR+LLTR+S R R P  + ++ E+K  D R  +  S  L DLFVSSPP+           GT 
Subjt:  SLLKFKLLPTHCG-VAQSPTLSPRTSPLVHLRRRKTTLRMLLTRNSGR-RSPR-RRSLPENKNRDDRKGLSRSNTLKDLFVSSPPY----------LGTD

Query:  CDVHQTAVTAPTRNVTPVCEKEKEEGHVGSPGWNPGSPRPGWTGFRYKYLLRKTWRPVLVGIPE
         +V       P   V+       EE      G+N GS RP  +      LLR++WRPVLV IPE
Subjt:  CDVHQTAVTAPTRNVTPVCEKEKEEGHVGSPGWNPGSPRPGWTGFRYKYLLRKTWRPVLVGIPE

AT1G68430.1 unknown protein3.7e-1038.41Show/hide
Query:  MASLLKFKLLPTHCGV-AQSPTL--SPRTSPLVHLRRRKTTLRMLLTRNSGRRSPRRRSLPENKNRDDRKGLSRSNTLKDLFVSSPPYLGTDCDVHQTAV
        MA+L +FK L T CGV AQSPT   SPRTSPLV LRR+KTTL+MLL+  S  R   ++ L  + ++D          LKDLFVSS            +A 
Subjt:  MASLLKFKLLPTHCGV-AQSPTL--SPRTSPLVHLRRRKTTLRMLLTRNSGRRSPRRRSLPENKNRDDRKGLSRSNTLKDLFVSSPPYLGTDCDVHQTAV

Query:  TAPTRNVTPVCEKEKEEGHVGSPGWNPGS--------PRPGWTGFRYKYLLRKTWRPVLVGIPE
             +  P  + ++E     +   N  S          P W GF  K LL++ WRP L  I E
Subjt:  TAPTRNVTPVCEKEKEEGHVGSPGWNPGS--------PRPGWTGFRYKYLLRKTWRPVLVGIPE

AT5G16200.1 50S ribosomal protein-related1.8e-0935.09Show/hide
Query:  MASLLKFKLLPTH----CGVAQSPTLSPRTSPLVHLRRRKTTLRMLLTRNS-GRRSPRRRSLPENKNRDDRKGLSRSNT-------LKDLFVS--SPPYL
        ++ L KFKL   H       A SPT+SP  SP+ +LRRRK TLRML  ++S  RR   R+ L E+   D   G  +          L++L V+  SPPY 
Subjt:  MASLLKFKLLPTH----CGVAQSPTLSPRTSPLVHLRRRKTTLRMLLTRNS-GRRSPRRRSLPENKNRDDRKGLSRSNT-------LKDLFVS--SPPYL

Query:  GTDCDV----HQTAVTAPTRNVTPVCEKEKEEGHVGSPGWNPGSPRPGWTGFRYKYLLRKTWRPVLVGIPE
        G + D+     + +  +P    +    +E      GS G            FR + +LRK WRPVLV IPE
Subjt:  GTDCDV----HQTAVTAPTRNVTPVCEKEKEEGHVGSPGWNPGSPRPGWTGFRYKYLLRKTWRPVLVGIPE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTCGCTTCTCAAATTCAAGCTCCTTCCGACGCACTGCGGCGTCGCTCAGAGTCCTACTTTAAGCCCTAGGACCAGTCCTCTCGTTCACCTCCGCCGTCGGAAAAC
CACTCTTCGGATGCTTCTCACTCGGAATTCCGGCCGTCGGTCTCCTCGTCGCCGCTCCTTACCGGAGAATAAGAACAGAGACGACCGTAAGGGTTTGTCGCGGAGTAACA
CGTTGAAGGATTTGTTCGTTTCCTCGCCACCGTATCTTGGAACTGACTGTGATGTTCACCAGACGGCGGTCACGGCTCCAACCCGGAACGTGACTCCAGTTTGTGAGAAG
GAGAAGGAGGAGGGTCACGTTGGATCACCCGGTTGGAATCCCGGTTCCCCTAGACCGGGTTGGACCGGTTTCCGATATAAATACTTGCTTCGGAAAACTTGGCGGCCGGT
GCTCGTCGGAATCCCCGAGTAA
mRNA sequenceShow/hide mRNA sequence
TTAGTAAATGACTTTTGTAGGGGTAAAGAGGTAATTTAAAGCTCTAGTCAAAGAATATGAACAAAAAAATGGTATTAATATTAAATACTATTTATTAATTAATAGTAGAA
TTTTACTGGGAAAATGAACCAACTTTTTTTCCTCTCTTCTCTTTCTTCAACATTCAATAAATTTCTCTTTTCAATTCAAAGTTTTAGACTAAAAAAATATAATGAAACCC
CAAAAAGAAGAAAATAAGAAAAATAAAACCCCCAATTCCCAACGCTCTCAAAAATCCTCTCCGAATTCCCTTCTTCAATCATTACAAGATTTTCAAATTCCATTCCAAAA
CTCCATCCATCGATTCAATAATTCTTCTTCCTCTGACCTTTCTCCATGGCGTCGCTTCTCAAATTCAAGCTCCTTCCGACGCACTGCGGCGTCGCTCAGAGTCCTACTTT
AAGCCCTAGGACCAGTCCTCTCGTTCACCTCCGCCGTCGGAAAACCACTCTTCGGATGCTTCTCACTCGGAATTCCGGCCGTCGGTCTCCTCGTCGCCGCTCCTTACCGG
AGAATAAGAACAGAGACGACCGTAAGGGTTTGTCGCGGAGTAACACGTTGAAGGATTTGTTCGTTTCCTCGCCACCGTATCTTGGAACTGACTGTGATGTTCACCAGACG
GCGGTCACGGCTCCAACCCGGAACGTGACTCCAGTTTGTGAGAAGGAGAAGGAGGAGGGTCACGTTGGATCACCCGGTTGGAATCCCGGTTCCCCTAGACCGGGTTGGAC
CGGTTTCCGATATAAATACTTGCTTCGGAAAACTTGGCGGCCGGTGCTCGTCGGAATCCCCGAGTAATGGATTCTCTTTTTTTTTTTTTTTTTTTTTTTAGTTTTAAAAG
AAAACATCTTTTAATTAATTTAAAAATATATCTAATCTAATATTTCTATGTTTTTTTAGTATAGATTTCAATCTTTACTTAAAATAGGGATGGGGGAAAGGATTAAGATC
TTAATCAAGTGGGTTTGGGGAAATAATAATTTGTTGCTATGGAAAATTATGAATAGATTTCCCATGAGTGGGAATTTGCTTATTTTTCTGTTCAATTTTACGCTTGTTTA
GTTTTTCTAATCATATGTTAATTAGTCAAATTAACTCTATACAATTCGTAGTACCAAATTTAAGGCTTAAACCTTGATTGATTTTCTTGATTGTTTTATTATATAGTTTA
TGAAGTTGTATTTAAAATCTTGTTCTTAACTTTGTATGGTTTAGTTGGAGTGAGCAAATCCTAAACCAAATCTTTGATTGACCCCACTTTGACCATAGTCAAGAATAGTT
AGTGAAATTTGAGTTAACAAATCTACTAGTGGCTTAACTCGATTTTAAAAAAAAAAAATTGATGAAGATTGTCATCCATCGGTTCTTTTTATATCCTAAGTATCGTTCTT
CCACAAATTTGTGGACTTCCAAATCTAGAAATTTTAACACCTTCCATCCCATAGTTGGGGGAACACGTCGTCCTCTTGTATATACTATCATGCGAGTGTTTTCAGTGCAC
ATGCTCAACCCACCTAATGCCTTCATTTTATGTGTGTACATCCGTTGAACATTCATGTGCACACGTACAATGGGTTGTTGGTTGAACATGCATGGAGCATCGGGTGGACA
TCATGTGGACTATTTGGATTTTGTACTAGGTGGTAGGTTACTCAAAAGTTGAAGTTCTTATCAAATAACAAATCGTTCGTATAAATATGGAAGTGTAATCATGTTTGGCA
TAAATTCTATTTAGAAACTATTAACTTTTGAACTCATTATTATTATTATTATCATCATCATCATCATCAATGATATGTGTTCTAAGGCCACACTTTAACGAGTTTGACCG
TTAAGTTTCCAAAAGAATATTCAAATTTCTATTAAAGTGACCCATTAAAGAATTTAATCAAATAAAGGTTAAAATAGTACTGAGAGGAAGTTTAAGATTCAAAATATGAG
ATCTAAAACTAAAAAAGCCAAGGATCTAAACAAAAAATTAGTACAAAAGCTATTGTGTAATAATCAAAATATATTATTTCCTTTTAGTTTATTTATTTGGGACTTATTTT
TTGAGGGATGTGTAGAGAGAGAGAGAGAGAGAGAGAGAGAAAAAAAGAATTGGGACCAAGAATTTGAATCAGATATAGAAAAGGCATATGAAATGTTTCTGTCACAAAGA
AAAAGCAAAATAAATTCCACTCCTAAAAAGCAACGCATCTCAACTTTAGATAAATATAACTCTGATTTCATTTGTAAAGACATAAAAATTGACGTGCCCTTTTGGTGGGA
TAAGTTCCTATTGCTGCCTTCAAGGATCTTCTCCATTTCTTTCTTATCGACCCCTTTACACAAATTTTCAACTTCATCCTTATGATCTTATTAGTTTATAGACTTAATTC
ATATTAAGCCGATGTGACTTCAAAAGTTTCAATTTAGTCTTATTTGACCAATTTGGTCCCTAAACATTTTATTTGTATGTTACAAATTTTCATTTGAGTTATTATGCTCG
TGGACTATTTTTGGATTTAATACCGAAAAACTATTGTTCTCTCAAACTTCTACCAAACTAGTAAGAAAAAGAGGGAGAGAATAAATACTTAATCTCCAAAATTTAAGTTT
GATACCATCTTAGTCCTTGTATTTTTAAGTTTAATATTGTAACTCTCTGGGTACCGTCAAATAATACATTTTTTTCCATATCTCAAAAACTTGAAAATCAAATTTGAAAA
CTCGAGTATCTAAATGCATGCTAGGGAGATCAAAAGTGTTCTTTTCATTATTATGCAACTCTTCCAAAATAAGAAAAAAACAGAGAAAAGAAGACACTAGATCTTAGTGA
AAACCTAAGTACTTACAATTATTAGTTCAAAGTTAGTGCTAGATTTGATGTTTTCCCCCATAATTTATAAGCTTAGTTGTTTATACTTTTTCCCATATATCATGTACTCA
AATTCATCTTCTGTTCTCACCAGGGCCCCACAATTTCACATTACCCATGTAATCTTGAGCACATCCATCTTCTTCTCAAGCATACACAAGTAATATATATCAATGACTAA
CCTCCACTGAGTATGATATGATATGGTATGAAAAACAGAATGAACAGTACGTACACGAAATGTGTATCAGGACAAAACAATTCAAATTTCAAGCTATAAATAAAATATAA
CCTTCATATAAAATGTATTGAAACAATCTATTATCCAATCAATGAATATATTAATGTAACTTGCTCAGCATTTCATTACCAAAATTCCTTGACTCGATAAGATACAAGTT
TTAATATAGCTTTCATTCACTCACAAACACATGTGGATGCTACAGGTTTTGTTCTCTTTTTCCCACCTATAGCATCTCTTAACAAGAAACATGTTACAAAATTCACAGCA
CATGGCGAGCATGCGCTTATTCACCGCCAGAAAATTTTTATTGTTCGTTTATAAGATTTAGAAAAAAGTAAAAAATTAAAAAGCAACCAGATTTAATCAATGTCTAGATG
TTAAGCGTTCTTTATTCCCCAGAGGAAAACTAGTTTGCTTACTTGGTTGGGATTGTGATTGTTGTATGTCGGAATCTCTAACTTCGTTTGCATCGTCTATGTTCGCCTTT
TGGGCCTTCACATGACTATTGCTATTGGAGTTCGGATGGGTTTCTGAATATTTATCCTCACCAGAAGGTATAACTCGCAGTAGGAACCTATGTATCATTTCATAGCTGGT
GAAAGTTATTACAGCAGATGGAGTTGTTCGCAACAGATTCGTCGCACAACCTCTATAAAAACCTGGGACACCTTCCTTTCGAAAAACCTTTTTGATACAGTCCATTACTC
CTGAGTATTGAGGAGCAATGTTTCTGGCTTGGCCTTGCTCTTGTAGCCGGGAACGAACTACCTGGAGATGAAGTTTAAACAGGATAAAGAATTATTAAAGACACACATCA
CAAACACTTGCAGAGAGAGGGTTTAAAAACCTACCTCATGTGGGTAAGTCATTACCGAGGCCGTAACTTTCGACAATGAGGAGGCAATAGCTAAATGTCCTGGACTTAGT
TTGTCAACGGTTGTGTTTTCTGGAAAGAAAACAAAACCAAAACCAGTTTAAAAAAGCAATCACATTTGAATGAGTACATACACGCTTACTGATATTCTATATTATAACCA
AAAGATTATCATAGTTACGTACCTCTTTTTGCAATATACGATTTGAGCCTTTCGTATGCAGGAAATTGAATTGCAACATGACTTATCCCAACCAATGAAGGGATGATACC
ACTGTTCAAAGAACAAATCAAAGACATGTTCTCATAATTATTTTAAAATATGAGAATAGTACAACTTATGAGTCATTATTTGTCAGAACAAAAGTTTATCACAACAACCA
TTATCTCACTCATTACAGTCAAGCCACAAATTTGGGAAAACATATATCCATTAAGAACTTCAATTTTCACAAATATGGTGAGAACAAACCTGTAAAGCCCACGAATTCCT
TCTTCACGTACAATCCTGGTAAATGCTGAAACCATGCCTGTGTAAGGAACCACACCAGGCCTCATTCCCTGTGT
Protein sequenceShow/hide protein sequence
MASLLKFKLLPTHCGVAQSPTLSPRTSPLVHLRRRKTTLRMLLTRNSGRRSPRRRSLPENKNRDDRKGLSRSNTLKDLFVSSPPYLGTDCDVHQTAVTAPTRNVTPVCEK
EKEEGHVGSPGWNPGSPRPGWTGFRYKYLLRKTWRPVLVGIPE