; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cucsat.G21505 (gene) of Cucumber (B10) v3 genome

Gene IDCucsat.G21505
OrganismCucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
DescriptionIntegrator complex subunit 7
Genome locationctg936:13376..16925
RNA-Seq ExpressionCucsat.G21505
SyntenyCucsat.G21505
Gene Ontology termsGO:0016180 - snRNA processing (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0032039 - integrator complex (cellular component)
InterPro domainsIPR016024 - Armadillo-type fold
IPR033060 - Integrator complex subunit 7


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0040971.1 integrator complex subunit 7 [Cucumis melo var. makuwa]7.47e-15393.33Show/hide
Query:  MERSSAACAMEWSIELEKALRLKKPGRAVEAIRQIGCRLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDKHIRLAVVRVFLSELYSR
        MER+SAACAMEWSIELEKALR KKPGRAVEAIRQIGCRLQQWSREPEPN+AVYNMFDLVTWED+LFSNTILLRLADAFK DDKHIRLAVVRVFLSELYSR
Subjt:  MERSSAACAMEWSIELEKALRLKKPGRAVEAIRQIGCRLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDKHIRLAVVRVFLSELYSR

Query:  NSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLVFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAIL
        +SSRSKQYQGILSKARVQN HELLTRVKVVL+GGDPEA+ LALI+LGCWAHFAKDSAQIRYL+F SLFSSHLSEVKASIFAAACI QLADDFAQVFL IL
Subjt:  NSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLVFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAIL

Query:  VNIMTSTTSLTIRMAGARVFAKLGCSHSMAKTAYKVMLVF
        VNIMTSTTSL IRMAGARVFAKLGCSHSMAKTAYKVMLVF
Subjt:  VNIMTSTTSLTIRMAGARVFAKLGCSHSMAKTAYKVMLVF

XP_008464722.1 PREDICTED: uncharacterized protein LOC103502541 isoform X1 [Cucumis melo]7.34e-13992.44Show/hide
Query:  MERSSAACAMEWSIELEKALRLKKPGRAVEAIRQIGCRLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDKHIRLAVVRVFLSELYSR
        MER+SAACAMEWSIELEKALR KKPGRAVEAIRQIGCRLQQWSREPEPN+AVYNMFDLVTWED+LFSNTILLRLADAFK DDKHIRLAVVRVFLSELYSR
Subjt:  MERSSAACAMEWSIELEKALRLKKPGRAVEAIRQIGCRLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDKHIRLAVVRVFLSELYSR

Query:  NSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLVFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAIL
        +SSRSKQYQGILSKARVQN HELLTRVKVVL+GGDPEA+ LALI+LGCWAHFAKDSAQIRYL+F SLFSSHLSEVKASIFAAACI QLADDFAQVFL IL
Subjt:  NSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLVFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAIL

Query:  VNIMTSTTSLTIRMAGARVFAKLGCSHSMAKTAYKVML
        VNIMTSTTSL IRMAGARVFAKLGCSHSMAKTAYK  L
Subjt:  VNIMTSTTSLTIRMAGARVFAKLGCSHSMAKTAYKVML

XP_011654518.1 uncharacterized protein LOC101204851 isoform X1 [Cucumis sativus]4.36e-14998.32Show/hide
Query:  MERSSAACAMEWSIELEKALRLKKPGRAVEAIRQIGCRLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDKHIRLAVVRVFLSELYSR
        MERSSAACAMEWSIELEKALRLKKPGRAVEAIRQIGCRLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDKHIRLAVVRVFLSELYSR
Subjt:  MERSSAACAMEWSIELEKALRLKKPGRAVEAIRQIGCRLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDKHIRLAVVRVFLSELYSR

Query:  NSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLVFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAIL
        +SSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYL+FSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAIL
Subjt:  NSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLVFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAIL

Query:  VNIMTSTTSLTIRMAGARVFAKLGCSHSMAKTAYKVML
        VNIMTSTTSLTIRMAGARVFAKLGCSHSMAKTAYK  L
Subjt:  VNIMTSTTSLTIRMAGARVFAKLGCSHSMAKTAYKVML

XP_011654519.1 uncharacterized protein LOC101204851 isoform X2 [Cucumis sativus]5.09e-15097.91Show/hide
Query:  MERSSAACAMEWSIELEKALRLKKPGRAVEAIRQIGCRLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDKHIRLAVVRVFLSELYSR
        MERSSAACAMEWSIELEKALRLKKPGRAVEAIRQIGCRLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDKHIRLAVVRVFLSELYSR
Subjt:  MERSSAACAMEWSIELEKALRLKKPGRAVEAIRQIGCRLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDKHIRLAVVRVFLSELYSR

Query:  NSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLVFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAIL
        +SSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYL+FSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAIL
Subjt:  NSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLVFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAIL

Query:  VNIMTSTTSLTIRMAGARVFAKLGCSHSMAKTAYKVMLV
        VNIMTSTTSLTIRMAGARVFAKLGCSHSMAKTAYKV  +
Subjt:  VNIMTSTTSLTIRMAGARVFAKLGCSHSMAKTAYKVMLV

XP_038892420.1 uncharacterized protein LOC120081531 isoform X2 [Benincasa hispida]1.01e-13690.76Show/hide
Query:  MERSSAACAMEWSIELEKALRLKKPGRAVEAIRQIGCRLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDKHIRLAVVRVFLSELYSR
        MER+SAACAMEWSIELEKALR KKPG+AVEAI QIG RLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDK+IRLAVVRVFLSELYSR
Subjt:  MERSSAACAMEWSIELEKALRLKKPGRAVEAIRQIGCRLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDKHIRLAVVRVFLSELYSR

Query:  NSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLVFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAIL
        +S+RSKQYQGILSKAR+QNHHELL+RVKVVLNGGDPEAR LALILLGCWAHFAKDSAQIRYL+FSS++SSHLSEVKASIFAAACI QLADDFAQVFL IL
Subjt:  NSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLVFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAIL

Query:  VNIMTSTTSLTIRMAGARVFAKLGCSHSMAKTAYKVML
        VNIMTSTTS+ IRMAGARVFAKLGCSHS+AK AYK  L
Subjt:  VNIMTSTTSLTIRMAGARVFAKLGCSHSMAKTAYKVML

TrEMBL top hitse value%identityAlignment
A0A0A0KJG8 Uncharacterized protein2.69e-16499.17Show/hide
Query:  MERSSAACAMEWSIELEKALRLKKPGRAVEAIRQIGCRLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDKHIRLAVVRVFLSELYSR
        MERSSAACAMEWSIELEKALRLKKPGRAVEAIRQIGCRLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDKHIRLAVVRVFLSELYSR
Subjt:  MERSSAACAMEWSIELEKALRLKKPGRAVEAIRQIGCRLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDKHIRLAVVRVFLSELYSR

Query:  NSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLVFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAIL
        +SSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYL+FSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAIL
Subjt:  NSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLVFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAIL

Query:  VNIMTSTTSLTIRMAGARVFAKLGCSHSMAKTAYKVMLVF
        VNIMTSTTSLTIRMAGARVFAKLGCSHSMAKTAYKVMLVF
Subjt:  VNIMTSTTSLTIRMAGARVFAKLGCSHSMAKTAYKVMLVF

A0A1S3CMM3 uncharacterized protein LOC103502541 isoform X13.55e-13992.44Show/hide
Query:  MERSSAACAMEWSIELEKALRLKKPGRAVEAIRQIGCRLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDKHIRLAVVRVFLSELYSR
        MER+SAACAMEWSIELEKALR KKPGRAVEAIRQIGCRLQQWSREPEPN+AVYNMFDLVTWED+LFSNTILLRLADAFK DDKHIRLAVVRVFLSELYSR
Subjt:  MERSSAACAMEWSIELEKALRLKKPGRAVEAIRQIGCRLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDKHIRLAVVRVFLSELYSR

Query:  NSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLVFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAIL
        +SSRSKQYQGILSKARVQN HELLTRVKVVL+GGDPEA+ LALI+LGCWAHFAKDSAQIRYL+F SLFSSHLSEVKASIFAAACI QLADDFAQVFL IL
Subjt:  NSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLVFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAIL

Query:  VNIMTSTTSLTIRMAGARVFAKLGCSHSMAKTAYKVML
        VNIMTSTTSL IRMAGARVFAKLGCSHSMAKTAYK  L
Subjt:  VNIMTSTTSLTIRMAGARVFAKLGCSHSMAKTAYKVML

A0A5A7TH30 Integrator complex subunit 73.62e-15393.33Show/hide
Query:  MERSSAACAMEWSIELEKALRLKKPGRAVEAIRQIGCRLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDKHIRLAVVRVFLSELYSR
        MER+SAACAMEWSIELEKALR KKPGRAVEAIRQIGCRLQQWSREPEPN+AVYNMFDLVTWED+LFSNTILLRLADAFK DDKHIRLAVVRVFLSELYSR
Subjt:  MERSSAACAMEWSIELEKALRLKKPGRAVEAIRQIGCRLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDKHIRLAVVRVFLSELYSR

Query:  NSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLVFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAIL
        +SSRSKQYQGILSKARVQN HELLTRVKVVL+GGDPEA+ LALI+LGCWAHFAKDSAQIRYL+F SLFSSHLSEVKASIFAAACI QLADDFAQVFL IL
Subjt:  NSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLVFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAIL

Query:  VNIMTSTTSLTIRMAGARVFAKLGCSHSMAKTAYKVMLVF
        VNIMTSTTSL IRMAGARVFAKLGCSHSMAKTAYKVMLVF
Subjt:  VNIMTSTTSLTIRMAGARVFAKLGCSHSMAKTAYKVMLVF

A0A6J1FP76 uncharacterized protein LOC1114472553.61e-13187.39Show/hide
Query:  MERSSAACAMEWSIELEKALRLKKPGRAVEAIRQIGCRLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDKHIRLAVVRVFLSELYSR
        MER++AACAMEWSIELEKALR KKPGRAVEAI QIG RLQQWSREPEPN+AVYNMFDLVTWEDRLFSNTILLRLADAFK DDKHIR+AVV+VFLSEL SR
Subjt:  MERSSAACAMEWSIELEKALRLKKPGRAVEAIRQIGCRLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDKHIRLAVVRVFLSELYSR

Query:  NSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLVFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAIL
        + ++SKQYQG+LSKARVQNHHELLTRVKVVL GGDPEAR LALILLGCWAHFA+ SAQIRY++ SS+ SSH+SEVKASIFAAACI QLADDFAQVFLAIL
Subjt:  NSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLVFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAIL

Query:  VNIMTSTTSLTIRMAGARVFAKLGCSHSMAKTAYKVML
        VNIMTSTTSL I+MAGARVFAKLGCSHSMAKTAYK  L
Subjt:  VNIMTSTTSLTIRMAGARVFAKLGCSHSMAKTAYKVML

A0A6J1JBG3 uncharacterized protein LOC1114852602.10e-12886.13Show/hide
Query:  MERSSAACAMEWSIELEKALRLKKPGRAVEAIRQIGCRLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDKHIRLAVVRVFLSELYSR
        MER++AA AMEWSIELEKALR KKPGRAVEAI QIG RLQQWSREPEPN+AVYNMFDLVTWEDRLFSNTILLRLADAFK DDKHIR+AVVRVFLSEL SR
Subjt:  MERSSAACAMEWSIELEKALRLKKPGRAVEAIRQIGCRLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDKHIRLAVVRVFLSELYSR

Query:  NSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLVFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAIL
        + ++S+QYQG+LSKARVQNHHELLTRVKVVL GGDPEAR LALILLGCWAHFA+ SAQIRY++  SL SSH+SEVKASIFAAACI QLADDFA+VFLAIL
Subjt:  NSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLVFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAIL

Query:  VNIMTSTTSLTIRMAGARVFAKLGCSHSMAKTAYKVML
        VNIMTSTTSL ++MAGARVFAKLGCSHSMAKTAYK  L
Subjt:  VNIMTSTTSLTIRMAGARVFAKLGCSHSMAKTAYKVML

SwissProt top hitse value%identityAlignment
Q54PL2 Integrator complex subunit 7 homolog1.9e-0525.7Show/hide
Query:  LFSNTILLRLADAFKFDDKHIRLAVVRVF---LSELYSRNSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRY
        L  N+++ RL+D F+     ++  +++VF    SE++                 +V N  E+L R+  V+   DP AR L+L +LG   H   D   I +
Subjt:  LFSNTILLRLADAFKFDDKHIRLAVVRVF---LSELYSRNSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRY

Query:  LVFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAILVNIMTSTTSLTI-RMAGARVFAKLGCSHSMAKTAYKVMLV
         + + + S    E++A+IF    +C+++  F+   +  +  ++ +  +  I ++   R+F  +  SHS+A T  K MLV
Subjt:  LVFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAILVNIMTSTTSLTI-RMAGARVFAKLGCSHSMAKTAYKVMLV

Arabidopsis top hitse value%identityAlignment
AT4G20060.1 ARM repeat superfamily protein2.4e-6456.43Show/hide
Query:  MERSSAACAMEWSIELEKALRLKKPGRAVEAIRQIGCRLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDKHIRLAVVRVFLSEL-YS
        ME+ SAACAMEWSI+LEK+LR K   +AVEAI + G +L+QWS+EPE  +AVYN+F LV  ED+LFSNTILLRL DAF   DK I+LAVVRVF+S    S
Subjt:  MERSSAACAMEWSIELEKALRLKKPGRAVEAIRQIGCRLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDKHIRLAVVRVFLSEL-YS

Query:  RNSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLVFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAI
        R  + ++     LSK RV NH ELLTRVK V + GD E++ LALIL GCW  FA + A +RYLVFSS+ S H  E ++++FAAAC C++ADDFA V L +
Subjt:  RNSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLVFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAI

Query:  LVNIMTSTTSLT--IRMAGARVFAKLGCSHSMAKTAYKVML
        L N M     +T   R+A  RVFAK+GCSH++A  A+K+ +
Subjt:  LVNIMTSTTSLT--IRMAGARVFAKLGCSHSMAKTAYKVML


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGAGAAGTTCTGCAGCTTGCGCTATGGAATGGAGTATTGAGCTGGAGAAGGCTCTCCGTTTGAAGAAACCAGGTCGGGCTGTTGAAGCTATACGTCAGATTGGCTG
TCGACTTCAGCAATGGAGTAGAGAGCCAGAACCCAATGTAGCTGTATATAATATGTTTGACCTTGTTACTTGGGAGGATAGGCTATTTTCCAACACTATTCTCCTACGGC
TTGCTGATGCCTTTAAGTTTGATGATAAGCATATTAGACTTGCAGTTGTTAGAGTTTTCTTATCCGAGCTATATAGCCGCAACAGCTCACGAAGTAAACAATACCAAGGG
ATTCTTTCAAAGGCAAGGGTGCAAAATCACCATGAATTACTTACTCGAGTCAAGGTTGTTCTTAATGGAGGGGATCCTGAGGCTAGAGGTCTAGCTTTGATTCTATTGGG
ATGTTGGGCACATTTTGCAAAAGACAGTGCCCAGATACGTTATTTGGTATTTTCTAGTCTGTTTTCTTCTCATCTTTCGGAGGTAAAAGCGTCAATATTTGCTGCAGCAT
GCATTTGTCAGTTAGCAGATGACTTTGCGCAAGTTTTCTTAGCGATTTTGGTTAATATAATGACTTCTACTACATCCTTGACCATCAGAATGGCTGGAGCTCGAGTTTTT
GCAAAACTGGGATGCTCACATTCAATGGCCAAAACTGCTTATAAGGTTATGCTCGTCTTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGAGAAGTTCTGCAGCTTGCGCTATGGAATGGAGTATTGAGCTGGAGAAGGCTCTCCGTTTGAAGAAACCAGGTCGGGCTGTTGAAGCTATACGTCAGATTGGCTG
TCGACTTCAGCAATGGAGTAGAGAGCCAGAACCCAATGTAGCTGTATATAATATGTTTGACCTTGTTACTTGGGAGGATAGGCTATTTTCCAACACTATTCTCCTACGGC
TTGCTGATGCCTTTAAGTTTGATGATAAGCATATTAGACTTGCAGTTGTTAGAGTTTTCTTATCCGAGCTATATAGCCGCAACAGCTCACGAAGTAAACAATACCAAGGG
ATTCTTTCAAAGGCAAGGGTGCAAAATCACCATGAATTACTTACTCGAGTCAAGGTTGTTCTTAATGGAGGGGATCCTGAGGCTAGAGGTCTAGCTTTGATTCTATTGGG
ATGTTGGGCACATTTTGCAAAAGACAGTGCCCAGATACGTTATTTGGTATTTTCTAGTCTGTTTTCTTCTCATCTTTCGGAGGTAAAAGCGTCAATATTTGCTGCAGCAT
GCATTTGTCAGTTAGCAGATGACTTTGCGCAAGTTTTCTTAGCGATTTTGGTTAATATAATGACTTCTACTACATCCTTGACCATCAGAATGGCTGGAGCTCGAGTTTTT
GCAAAACTGGGATGCTCACATTCAATGGCCAAAACTGCTTATAAGGTTATGCTCGTCTTTTGA
Protein sequenceShow/hide protein sequence
MERSSAACAMEWSIELEKALRLKKPGRAVEAIRQIGCRLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDKHIRLAVVRVFLSELYSRNSSRSKQYQG
ILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLVFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAILVNIMTSTTSLTIRMAGARVF
AKLGCSHSMAKTAYKVMLVF