; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10003444 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10003444
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionRNA polymerase II C-terminal domain phosphatase-like
Genome locationChr08:1416131..1420921
RNA-Seq ExpressionHG10003444
SyntenyHG10003444
Gene Ontology termsGO:0070940 - dephosphorylation of RNA polymerase II C-terminal domain (biological process)
GO:0008420 - RNA polymerase II CTD heptapeptide repeat phosphatase activity (molecular function)
InterPro domainsIPR023214 - HAD superfamily
IPR039189 - CTD phosphatase Fcp1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7037160.1 RNA polymerase II C-terminal domain phosphatase-like 4 [Cucurbita argyrosperma subsp. argyrosperma]1.5e-7168.86Show/hide
Query:  MSLATNSPAHSSSSDDFAAFLDVALDSHSSDSSPCENTEGDNNAESERIKRRKVEKLENSEEDILYGVEEQNLEVLSKQQLCSHPGSFGNMCIICGQRLD
        MSL TNSPAHSSSSDDFAAFLDVALDSHSSDSSP E  EG NN E+ERIKR KVEKLENS EDILYGVEE + EVLSKQQLCSHPGSFGNMCIICGQRLD
Subjt:  MSLATNSPAHSSSSDDFAAFLDVALDSHSSDSSPCENTEGDNNAESERIKRRKVEKLENSEEDILYGVEEQNLEVLSKQQLCSHPGSFGNMCIICGQRLD

Query:  EESGVTFGYIHRRIEGIGKIFVNRKSFTVGRSGNGRRSVLEERRGLKVRKVELEIGIVGLRLNNDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQL
        EESGVTFGYIH+                                              GLRLNNDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQL
Subjt:  EESGVTFGYIHRRIEGIGKIFVNRKSFTVGRSGNGRRSVLEERRGLKVRKVELEIGIVGLRLNNDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQL

Query:  GHLTPEEEYLRSQTDSLEGTLSSHLYIV
        GHL PEEEYLR+Q DSLE      L+++
Subjt:  GHLTPEEEYLRSQTDSLEGTLSSHLYIV

XP_022133134.1 RNA polymerase II C-terminal domain phosphatase-like 4 isoform X1 [Momordica charantia]8.9e-7268.83Show/hide
Query:  MSLATNSPAHSSSSDDFAAFLDVALDSHSSDSSPCENTEGDNNAESERIKRRKVEKLENSE---EDILYGVEEQNLEVLSKQQLCSHPGSFGNMCIICGQ
        MSL TNSPAHSSSSDDFAAFLDVALDSHSSDSSP E  EGDNN ESER+KRRKVE+LE SE   EDI YGVEEQ+ EVLSKQQLCSHPGSFGNMCI+CGQ
Subjt:  MSLATNSPAHSSSSDDFAAFLDVALDSHSSDSSPCENTEGDNNAESERIKRRKVEKLENSE---EDILYGVEEQNLEVLSKQQLCSHPGSFGNMCIICGQ

Query:  RLDEESGVTFGYIHRRIEGIGKIFVNRKSFTVGRSGNGRRSVLEERRGLKVRKVELEIGIVGLRLNNDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNS
        RLDEESGVTFGYIH+                                              GLRLNNDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNS
Subjt:  RLDEESGVTFGYIHRRIEGIGKIFVNRKSFTVGRSGNGRRSVLEERRGLKVRKVELEIGIVGLRLNNDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNS

Query:  TQLGHLTPEEEYLRSQTDSLEGTLSSHLYIV
        TQLGH+TPEEEYLRSQTDSLE      L+++
Subjt:  TQLGHLTPEEEYLRSQTDSLEGTLSSHLYIV

XP_022133135.1 RNA polymerase II C-terminal domain phosphatase-like 4 isoform X2 [Momordica charantia]8.9e-7268.83Show/hide
Query:  MSLATNSPAHSSSSDDFAAFLDVALDSHSSDSSPCENTEGDNNAESERIKRRKVEKLENSE---EDILYGVEEQNLEVLSKQQLCSHPGSFGNMCIICGQ
        MSL TNSPAHSSSSDDFAAFLDVALDSHSSDSSP E  EGDNN ESER+KRRKVE+LE SE   EDI YGVEEQ+ EVLSKQQLCSHPGSFGNMCI+CGQ
Subjt:  MSLATNSPAHSSSSDDFAAFLDVALDSHSSDSSPCENTEGDNNAESERIKRRKVEKLENSE---EDILYGVEEQNLEVLSKQQLCSHPGSFGNMCIICGQ

Query:  RLDEESGVTFGYIHRRIEGIGKIFVNRKSFTVGRSGNGRRSVLEERRGLKVRKVELEIGIVGLRLNNDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNS
        RLDEESGVTFGYIH+                                              GLRLNNDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNS
Subjt:  RLDEESGVTFGYIHRRIEGIGKIFVNRKSFTVGRSGNGRRSVLEERRGLKVRKVELEIGIVGLRLNNDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNS

Query:  TQLGHLTPEEEYLRSQTDSLEGTLSSHLYIV
        TQLGH+TPEEEYLRSQTDSLE      L+++
Subjt:  TQLGHLTPEEEYLRSQTDSLEGTLSSHLYIV

XP_023525838.1 RNA polymerase II C-terminal domain phosphatase-like 4 [Cucurbita pepo subsp. pepo]1.8e-7269.3Show/hide
Query:  MSLATNSPAHSSSSDDFAAFLDVALDSHSSDSSPCENTEGDNNAESERIKRRKVEKLENSEEDILYGVEEQNLEVLSKQQLCSHPGSFGNMCIICGQRLD
        MSL TNSPAHSSSSDDFAAFLDVALDSHSSDSSP E  EG NN E+ERIKR KVEKLENS EDILYGVEE + EVLSKQQLCSHPGSFGNMCIICGQRLD
Subjt:  MSLATNSPAHSSSSDDFAAFLDVALDSHSSDSSPCENTEGDNNAESERIKRRKVEKLENSEEDILYGVEEQNLEVLSKQQLCSHPGSFGNMCIICGQRLD

Query:  EESGVTFGYIHRRIEGIGKIFVNRKSFTVGRSGNGRRSVLEERRGLKVRKVELEIGIVGLRLNNDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQL
        EESGVTFGYIH+                                              GLRLNNDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQL
Subjt:  EESGVTFGYIHRRIEGIGKIFVNRKSFTVGRSGNGRRSVLEERRGLKVRKVELEIGIVGLRLNNDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQL

Query:  GHLTPEEEYLRSQTDSLEGTLSSHLYIV
        GHLTPEE+YLR+QTDSLE      L+++
Subjt:  GHLTPEEEYLRSQTDSLEGTLSSHLYIV

XP_038890381.1 RNA polymerase II C-terminal domain phosphatase-like 4 isoform X1 [Benincasa hispida]6.6e-7571.05Show/hide
Query:  MSLATNSPAHSSSSDDFAAFLDVALDSHSSDSSPCENTEGDNNAESERIKRRKVEKLENSEEDILYGVEEQNLEVLSKQQLCSHPGSFGNMCIICGQRLD
        MSLATNSPAHSSSSDDFAAFLDVALDSHSSDSSP E  EGDNNAESERIKRRKVEKLENSEEDILYGVEEQ+ E +SKQQLCSHPGSFGNMCIICGQRLD
Subjt:  MSLATNSPAHSSSSDDFAAFLDVALDSHSSDSSPCENTEGDNNAESERIKRRKVEKLENSEEDILYGVEEQNLEVLSKQQLCSHPGSFGNMCIICGQRLD

Query:  EESGVTFGYIHRRIEGIGKIFVNRKSFTVGRSGNGRRSVLEERRGLKVRKVELEIGIVGLRLNNDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQL
        EESGVTFGYIH+                                              GLRLNNDEINRLRNIDMK+LL HKKLILVLDLDHTLLNSTQL
Subjt:  EESGVTFGYIHRRIEGIGKIFVNRKSFTVGRSGNGRRSVLEERRGLKVRKVELEIGIVGLRLNNDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQL

Query:  GHLTPEEEYLRSQTDSLEGTLSSHLYIV
        GHLTPEEEYLRSQTDSL+      L+++
Subjt:  GHLTPEEEYLRSQTDSLEGTLSSHLYIV

TrEMBL top hitse value%identityAlignment
A0A6J1BUF9 RNA polymerase II C-terminal domain phosphatase-like4.3e-7268.83Show/hide
Query:  MSLATNSPAHSSSSDDFAAFLDVALDSHSSDSSPCENTEGDNNAESERIKRRKVEKLENSE---EDILYGVEEQNLEVLSKQQLCSHPGSFGNMCIICGQ
        MSL TNSPAHSSSSDDFAAFLDVALDSHSSDSSP E  EGDNN ESER+KRRKVE+LE SE   EDI YGVEEQ+ EVLSKQQLCSHPGSFGNMCI+CGQ
Subjt:  MSLATNSPAHSSSSDDFAAFLDVALDSHSSDSSPCENTEGDNNAESERIKRRKVEKLENSE---EDILYGVEEQNLEVLSKQQLCSHPGSFGNMCIICGQ

Query:  RLDEESGVTFGYIHRRIEGIGKIFVNRKSFTVGRSGNGRRSVLEERRGLKVRKVELEIGIVGLRLNNDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNS
        RLDEESGVTFGYIH+                                              GLRLNNDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNS
Subjt:  RLDEESGVTFGYIHRRIEGIGKIFVNRKSFTVGRSGNGRRSVLEERRGLKVRKVELEIGIVGLRLNNDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNS

Query:  TQLGHLTPEEEYLRSQTDSLEGTLSSHLYIV
        TQLGH+TPEEEYLRSQTDSLE      L+++
Subjt:  TQLGHLTPEEEYLRSQTDSLEGTLSSHLYIV

A0A6J1BV42 RNA polymerase II C-terminal domain phosphatase-like4.3e-7268.83Show/hide
Query:  MSLATNSPAHSSSSDDFAAFLDVALDSHSSDSSPCENTEGDNNAESERIKRRKVEKLENSE---EDILYGVEEQNLEVLSKQQLCSHPGSFGNMCIICGQ
        MSL TNSPAHSSSSDDFAAFLDVALDSHSSDSSP E  EGDNN ESER+KRRKVE+LE SE   EDI YGVEEQ+ EVLSKQQLCSHPGSFGNMCI+CGQ
Subjt:  MSLATNSPAHSSSSDDFAAFLDVALDSHSSDSSPCENTEGDNNAESERIKRRKVEKLENSE---EDILYGVEEQNLEVLSKQQLCSHPGSFGNMCIICGQ

Query:  RLDEESGVTFGYIHRRIEGIGKIFVNRKSFTVGRSGNGRRSVLEERRGLKVRKVELEIGIVGLRLNNDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNS
        RLDEESGVTFGYIH+                                              GLRLNNDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNS
Subjt:  RLDEESGVTFGYIHRRIEGIGKIFVNRKSFTVGRSGNGRRSVLEERRGLKVRKVELEIGIVGLRLNNDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNS

Query:  TQLGHLTPEEEYLRSQTDSLEGTLSSHLYIV
        TQLGH+TPEEEYLRSQTDSLE      L+++
Subjt:  TQLGHLTPEEEYLRSQTDSLEGTLSSHLYIV

A0A6J1EFC1 RNA polymerase II C-terminal domain phosphatase-like6.2e-7169.3Show/hide
Query:  MSLATNSPAHSSSSDDFAAFLDVALDSHSSDSSPCENTEGDNNAESERIKRRKVEKLENSEEDILYGVEEQNLEVLSKQQLCSHPGSFGNMCIICGQRLD
        MSLATNSPAHSSSSDDFAAFLDVAL+SHSSDSSP +N E  NN ESERIKRRKVEKL  SEED L GVEEQ+LEVLSKQQLCSHPGSFGNMCIICGQRLD
Subjt:  MSLATNSPAHSSSSDDFAAFLDVALDSHSSDSSPCENTEGDNNAESERIKRRKVEKLENSEEDILYGVEEQNLEVLSKQQLCSHPGSFGNMCIICGQRLD

Query:  EESGVTFGYIHRRIEGIGKIFVNRKSFTVGRSGNGRRSVLEERRGLKVRKVELEIGIVGLRLNNDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQL
        EESGVTFGYIH+                                              GLRLNNDEINRLRNIDMK+LLQHKKLILVLDLDHTLLNSTQL
Subjt:  EESGVTFGYIHRRIEGIGKIFVNRKSFTVGRSGNGRRSVLEERRGLKVRKVELEIGIVGLRLNNDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQL

Query:  GHLTPEEEYLRSQTDSLEGTLSSHLYIV
        GHLTPEEEYLRSQ DSLE      L+++
Subjt:  GHLTPEEEYLRSQTDSLEGTLSSHLYIV

A0A6J1GC38 RNA polymerase II C-terminal domain phosphatase-like1.3e-7168.86Show/hide
Query:  MSLATNSPAHSSSSDDFAAFLDVALDSHSSDSSPCENTEGDNNAESERIKRRKVEKLENSEEDILYGVEEQNLEVLSKQQLCSHPGSFGNMCIICGQRLD
        MSL TNS AHSSSSDDFAAFLDVALDSHSSDSSP E  EG NN E+ERIKR KVEKLENS EDILYGVEE + EVLSKQQLCSHPGSFGNMCIICGQRLD
Subjt:  MSLATNSPAHSSSSDDFAAFLDVALDSHSSDSSPCENTEGDNNAESERIKRRKVEKLENSEEDILYGVEEQNLEVLSKQQLCSHPGSFGNMCIICGQRLD

Query:  EESGVTFGYIHRRIEGIGKIFVNRKSFTVGRSGNGRRSVLEERRGLKVRKVELEIGIVGLRLNNDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQL
        EESGVTFGYIH+                                              GLRLNNDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQL
Subjt:  EESGVTFGYIHRRIEGIGKIFVNRKSFTVGRSGNGRRSVLEERRGLKVRKVELEIGIVGLRLNNDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQL

Query:  GHLTPEEEYLRSQTDSLEGTLSSHLYIV
        GHLTPEE+YLR+QTDSLE      L+++
Subjt:  GHLTPEEEYLRSQTDSLEGTLSSHLYIV

A0A6J1ID30 RNA polymerase II C-terminal domain phosphatase-like4.8e-7168.42Show/hide
Query:  MSLATNSPAHSSSSDDFAAFLDVALDSHSSDSSPCENTEGDNNAESERIKRRKVEKLENSEEDILYGVEEQNLEVLSKQQLCSHPGSFGNMCIICGQRLD
        MSL TNSPAHSSSSDDFAAFLDVALDSHSSDS P E  EG NN E+ERIKR KVEKLENS EDILYGVEE + EVLSKQQLCSHPGSFGNMCIICGQRLD
Subjt:  MSLATNSPAHSSSSDDFAAFLDVALDSHSSDSSPCENTEGDNNAESERIKRRKVEKLENSEEDILYGVEEQNLEVLSKQQLCSHPGSFGNMCIICGQRLD

Query:  EESGVTFGYIHRRIEGIGKIFVNRKSFTVGRSGNGRRSVLEERRGLKVRKVELEIGIVGLRLNNDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQL
        EESGVTFGYIH+                                              GLRLNNDEINRLRNIDMK LLQHKKLILVLDLDHTLLNSTQL
Subjt:  EESGVTFGYIHRRIEGIGKIFVNRKSFTVGRSGNGRRSVLEERRGLKVRKVELEIGIVGLRLNNDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQL

Query:  GHLTPEEEYLRSQTDSLEGTLSSHLYIV
        GHLTPEEEYLR+Q DSLE      L+++
Subjt:  GHLTPEEEYLRSQTDSLEGTLSSHLYIV

SwissProt top hitse value%identityAlignment
Q00IB6 RNA polymerase II C-terminal domain phosphatase-like 42.0e-2943.18Show/hide
Query:  MSLATNSPAH-SSSSDDFAAFLDVALDSHSSDSS-PCENTEGDNNAESERIKRRKVEKLENSEEDILYGVEEQNLEVLSKQQLCSHPGSFGNMCIICGQR
        MS+A++SP H SSSSDD AAFLD  LDS S  SS P E  E +++ ES  +KR+K+E LE               E  S +  C HPGSFGNMC +CGQ+
Subjt:  MSLATNSPAH-SSSSDDFAAFLDVALDSHSSDSS-PCENTEGDNNAESERIKRRKVEKLENSEEDILYGVEEQNLEVLSKQQLCSHPGSFGNMCIICGQR

Query:  LDEESGVTFGYIHRRIEGIGKIFVNRKSFTVGRSGNGRRSVLEERRGLKVRKVELEIGIVGLRLNNDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNST
        L EE+GV+F YIH+                                               +RLN DEI+RLR+ D + L + +KL LVLDLDHTLLN+T
Subjt:  LDEESGVTFGYIHRRIEGIGKIFVNRKSFTVGRSGNGRRSVLEERRGLKVRKVELEIGIVGLRLNNDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNST

Query:  QLGHLTPEEEYLRSQTDSLE
         L  L PEEEYL+S T SL+
Subjt:  QLGHLTPEEEYLRSQTDSLE

Arabidopsis top hitse value%identityAlignment
AT2G04930.1 Haloacid dehalogenase-like hydrolase (HAD) superfamily protein6.5e-0442.37Show/hide
Query:  GLRLNNDEINRLRNIDMK-NLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDS
        GL+L+N+ +   +++  K + L  KKL LVLDLDHTLL+S  + +L+  E YL  +  S
Subjt:  GLRLNNDEINRLRNIDMK-NLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDS

AT5G54210.1 Haloacid dehalogenase-like hydrolase (HAD) superfamily protein8.5e-0436.14Show/hide
Query:  RSVLEERRGLKVRKVELEIGIVGLRLNNDEINRLRNIDMK-NLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLE
        RS +E  RG        +  + GL+L++  +   + +  +      KKL LVLDLDHTLL++  + +LT EE YL  + DS E
Subjt:  RSVLEERRGLKVRKVELEIGIVGLRLNNDEINRLRNIDMK-NLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLE

AT5G58003.1 C-terminal domain phosphatase-like 41.4e-3043.18Show/hide
Query:  MSLATNSPAH-SSSSDDFAAFLDVALDSHSSDSS-PCENTEGDNNAESERIKRRKVEKLENSEEDILYGVEEQNLEVLSKQQLCSHPGSFGNMCIICGQR
        MS+A++SP H SSSSDD AAFLD  LDS S  SS P E  E +++ ES  +KR+K+E LE               E  S +  C HPGSFGNMC +CGQ+
Subjt:  MSLATNSPAH-SSSSDDFAAFLDVALDSHSSDSS-PCENTEGDNNAESERIKRRKVEKLENSEEDILYGVEEQNLEVLSKQQLCSHPGSFGNMCIICGQR

Query:  LDEESGVTFGYIHRRIEGIGKIFVNRKSFTVGRSGNGRRSVLEERRGLKVRKVELEIGIVGLRLNNDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNST
        L EE+GV+F YIH+                                               +RLN DEI+RLR+ D + L + +KL LVLDLDHTLLN+T
Subjt:  LDEESGVTFGYIHRRIEGIGKIFVNRKSFTVGRSGNGRRSVLEERRGLKVRKVELEIGIVGLRLNNDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNST

Query:  QLGHLTPEEEYLRSQTDSLE
         L  L PEEEYL+S T SL+
Subjt:  QLGHLTPEEEYLRSQTDSLE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCCTTGCAACTAATTCTCCAGCTCACTCATCAAGCAGTGACGATTTTGCTGCGTTTCTTGATGTAGCTCTAGATTCCCATTCCTCTGACTCATCACCCTGTGAAAA
TACCGAGGGTGACAATAATGCTGAAAGTGAGAGGATAAAGCGTCGTAAGGTGGAGAAACTGGAAAACTCAGAGGAGGATATTCTGTATGGAGTTGAAGAGCAAAATTTAG
AAGTATTATCAAAGCAACAACTATGCAGTCATCCTGGTTCATTTGGAAATATGTGTATCATATGTGGGCAGAGGTTGGATGAGGAATCTGGCGTGACATTTGGGTATATA
CATCGGAGGATAGAGGGAATCGGGAAGATCTTTGTCAATAGGAAGTCTTTTACAGTTGGAAGAAGTGGAAATGGAAGAAGATCAGTTTTAGAGGAGCGTAGGGGACTGAA
AGTCAGGAAAGTCGAGTTGGAGATAGGTATTGTGGGACTCAGACTTAATAATGATGAAATTAACCGGCTACGTAACATAGACATGAAGAACTTGTTGCAGCATAAAAAGC
TTATCCTGGTTCTTGATCTGGATCACACACTGTTAAACTCAACTCAGCTGGGGCATTTGACACCTGAAGAGGAGTATTTAAGGAGTCAAACAGATTCTCTAGAAGGTACA
TTGTCCTCTCATCTGTACATAGTTCTACTTTCTAAAGTAACTTTGATGTGGTGTTTACTGTTCTTTGATAAGGTGTTTACTGTTATTTCATTCCCATCGTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGCCTTGCAACTAATTCTCCAGCTCACTCATCAAGCAGTGACGATTTTGCTGCGTTTCTTGATGTAGCTCTAGATTCCCATTCCTCTGACTCATCACCCTGTGAAAA
TACCGAGGGTGACAATAATGCTGAAAGTGAGAGGATAAAGCGTCGTAAGGTGGAGAAACTGGAAAACTCAGAGGAGGATATTCTGTATGGAGTTGAAGAGCAAAATTTAG
AAGTATTATCAAAGCAACAACTATGCAGTCATCCTGGTTCATTTGGAAATATGTGTATCATATGTGGGCAGAGGTTGGATGAGGAATCTGGCGTGACATTTGGGTATATA
CATCGGAGGATAGAGGGAATCGGGAAGATCTTTGTCAATAGGAAGTCTTTTACAGTTGGAAGAAGTGGAAATGGAAGAAGATCAGTTTTAGAGGAGCGTAGGGGACTGAA
AGTCAGGAAAGTCGAGTTGGAGATAGGTATTGTGGGACTCAGACTTAATAATGATGAAATTAACCGGCTACGTAACATAGACATGAAGAACTTGTTGCAGCATAAAAAGC
TTATCCTGGTTCTTGATCTGGATCACACACTGTTAAACTCAACTCAGCTGGGGCATTTGACACCTGAAGAGGAGTATTTAAGGAGTCAAACAGATTCTCTAGAAGGTACA
TTGTCCTCTCATCTGTACATAGTTCTACTTTCTAAAGTAACTTTGATGTGGTGTTTACTGTTCTTTGATAAGGTGTTTACTGTTATTTCATTCCCATCGTGA
Protein sequenceShow/hide protein sequence
MSLATNSPAHSSSSDDFAAFLDVALDSHSSDSSPCENTEGDNNAESERIKRRKVEKLENSEEDILYGVEEQNLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYI
HRRIEGIGKIFVNRKSFTVGRSGNGRRSVLEERRGLKVRKVELEIGIVGLRLNNDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEGT
LSSHLYIVLLSKVTLMWCLLFFDKVFTVISFPS