; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10013960 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10013960
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionNucleic acid-binding, OB-fold containing protein
Genome locationChr02:6439498..6441387
RNA-Seq ExpressionHG10013960
SyntenyHG10013960
Gene Ontology termsGO:0006352 - DNA-templated transcription, initiation (biological process)
GO:0005736 - RNA polymerase I complex (cellular component)
GO:0003899 - DNA-directed 5'-3' RNA polymerase activity (molecular function)
InterPro domainsIPR036898 - RNA polymerase Rpb7-like, N-terminal domain superfamily
IPR045113 - RNA polymerase Rpb7-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008466499.1 PREDICTED: uncharacterized protein LOC103503892 isoform X1 [Cucumis melo]7.8e-10791.81Show/hide
Query:  MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKNAKILSGVHPYFSVTINAKLLLFSPKPNMLLEGKVVKLRQEAIH
        MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKNAKILSGVHPYF VTI AKLLLFSPKPNMLLEGKVVKLRQE+IH
Subjt:  MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKNAKILSGVHPYFSVTINAKLLLFSPKPNMLLEGKVVKLRQEAIH

Query:  VIVLGFASAVITDEDIRDEFKHKTKHGEEMFVNRAHKHHVIKVGTMIQFLVKSFYEEILHISGSLVPSHTGSIHWLEKNSVEGSVSNNSKKKMRENE--V
        VIVLGFASAVIT EDIRDEFKH+TKHGEEMFV+RAHKHHVIKVGTMI+FLVKSF EEILHISGSLVPSHTGSIHWLEKNSVEGSVS +SKKKMRENE  V
Subjt:  VIVLGFASAVITDEDIRDEFKHKTKHGEEMFVNRAHKHHVIKVGTMIQFLVKSFYEEILHISGSLVPSHTGSIHWLEKNSVEGSVSNNSKKKMRENE--V

Query:  MGQDSIATDANAVILNNDHQLKTKKQKTTRIS
        M QD++ TD NAVIL NDHQ KTKKQKTTRIS
Subjt:  MGQDSIATDANAVILNNDHQLKTKKQKTTRIS

XP_008466500.1 PREDICTED: uncharacterized protein LOC103503892 isoform X2 [Cucumis melo]1.9e-10591.81Show/hide
Query:  MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKNAKILSGVHPYFSVTINAKLLLFSPKPNMLLEGKVVKLRQEAIH
        MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKNAKILSGVHPYF VTI AKLLLFSPKPNMLLEGKVVKLRQE+IH
Subjt:  MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKNAKILSGVHPYFSVTINAKLLLFSPKPNMLLEGKVVKLRQEAIH

Query:  VIVLGFASAVITDEDIRDEFKHKTKHGEEMFVNRAHKHHVIKVGTMIQFLVKSFYEEILHISGSLVPSHTGSIHWLEKNSVEGSVSNNSKKKMRENE--V
        VIVLGFASAVIT EDIRDEFKH+TKHGEEMFV+RAHKHHVIKVGTMI+FLVKSF EEILHISGSLVPSHTGSIHWLEKNSVEGSVS  SKKKMRENE  V
Subjt:  VIVLGFASAVITDEDIRDEFKHKTKHGEEMFVNRAHKHHVIKVGTMIQFLVKSFYEEILHISGSLVPSHTGSIHWLEKNSVEGSVSNNSKKKMRENE--V

Query:  MGQDSIATDANAVILNNDHQLKTKKQKTTRIS
        M QD++ TD NAVIL NDHQ KTKKQKTTRIS
Subjt:  MGQDSIATDANAVILNNDHQLKTKKQKTTRIS

XP_011652444.1 uncharacterized protein LOC101216589 [Cucumis sativus]7.3e-10589.66Show/hide
Query:  MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKNAKILSGVHPYFSVTINAKLLLFSPKPNMLLEGKVVKLRQEAIH
        MEGLKVSDANMVVYVHPSKSKK+SQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKNAKILSGVHPYF VTI AKLLLFSPKPNMLLEGKVVKLRQE+IH
Subjt:  MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKNAKILSGVHPYFSVTINAKLLLFSPKPNMLLEGKVVKLRQEAIH

Query:  VIVLGFASAVITDEDIRDEFKHKTKHGEEMFVNRAHKHHVIKVGTMIQFLVKSFYEEILHISGSLVPSHTGSIHWLEKNSVEGSVSNNSKKKMRENE--V
        VIVLGFASAVIT EDIRDEFKH+TKHGEEMFV+RAHKHH+IKVGTMI+ LVKSF EEILHISGSLVPSHTGSIHWLEKNSVEGSV+  SK+KMRENE  V
Subjt:  VIVLGFASAVITDEDIRDEFKHKTKHGEEMFVNRAHKHHVIKVGTMIQFLVKSFYEEILHISGSLVPSHTGSIHWLEKNSVEGSVSNNSKKKMRENE--V

Query:  MGQDSIATDANAVILNNDHQLKTKKQKTTRIS
          QDS+ TD NAVILNNDHQ KTKKQK TRIS
Subjt:  MGQDSIATDANAVILNNDHQLKTKKQKTTRIS

XP_038889707.1 uncharacterized protein LOC120079556 isoform X1 [Benincasa hispida]3.9e-10689.13Show/hide
Query:  MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKNAKILSGVHPYFSVTINAKLLLFSPKPNMLLEGKVVKLRQEAIH
        MEGLKVSDAN+VVYVHPSKSKKVSQAVLRELG MLLKFDEKFEGVLLAYEA IIDKNAKILSGVHPYF VTI AKLLLFSPKPNMLLEGKVVKLRQE+IH
Subjt:  MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKNAKILSGVHPYFSVTINAKLLLFSPKPNMLLEGKVVKLRQEAIH

Query:  VIVLGFASAVITDEDIRDEFKHKTKHGEEMFVNRAHKHHVIKVGTMIQFLVKSFYEEILHISGSLVPSHTGSIHWLEKNSVEGSVSNNSKKKMRENEVMG
        VIVLGF+S VITDEDIRDEFKH+TKHGEEMFV+R HKHHVIKVGTMI+FLVKSF EEILHISGSLVPSHTGSIHWLEKNSVEGSV+N+SKKK RENE  G
Subjt:  VIVLGFASAVITDEDIRDEFKHKTKHGEEMFVNRAHKHHVIKVGTMIQFLVKSFYEEILHISGSLVPSHTGSIHWLEKNSVEGSVSNNSKKKMRENEVMG

Query:  QDSIATDANAVILNNDHQLKTKKQKTTRIS
        + S++TDANA+ILNNDHQ KTK+QKTTRIS
Subjt:  QDSIATDANAVILNNDHQLKTKKQKTTRIS

XP_038898978.1 uncharacterized protein LOC120086413 isoform X3 [Benincasa hispida]3.0e-10689.22Show/hide
Query:  MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKNAKILSGVHPYFSVTINAKLLLFSPKPNMLLEGKVVKLRQEAIH
        M+GLKVSDANMVVYVHPSKSKKVSQAVLR LGAMLLKFDEKFEGVLLAYEAKIIDK AKILSGVHPYF VTI AKLLLFSPKPNML+EGKVVKLRQE+IH
Subjt:  MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKNAKILSGVHPYFSVTINAKLLLFSPKPNMLLEGKVVKLRQEAIH

Query:  VIVLGFASAVITDEDIRDEFKHKTKHGEEMFVNRAHKHHVIKVGTMIQFLVKSFYEEILHISGSLVPSHTGSIHWLEKNSVEGSVSNNSKKKMREN--EV
        VIVLGFASAVIT+EDIRDEFKH+TKHGEEMFV+RAHKHHVIKVGTMI+FLVKSF EEILHI+GSLVPSHTGSIHWLEKNSVEG+V+++SKKKMREN  EV
Subjt:  VIVLGFASAVITDEDIRDEFKHKTKHGEEMFVNRAHKHHVIKVGTMIQFLVKSFYEEILHISGSLVPSHTGSIHWLEKNSVEGSVSNNSKKKMREN--EV

Query:  MGQDSIATDANAVILNNDHQLKTKKQKTTRIS
        M QDS+AT+ NA+ILNNDHQ KTKKQKTTRIS
Subjt:  MGQDSIATDANAVILNNDHQLKTKKQKTTRIS

TrEMBL top hitse value%identityAlignment
A0A0A0LGG8 Uncharacterized protein3.5e-10589.66Show/hide
Query:  MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKNAKILSGVHPYFSVTINAKLLLFSPKPNMLLEGKVVKLRQEAIH
        MEGLKVSDANMVVYVHPSKSKK+SQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKNAKILSGVHPYF VTI AKLLLFSPKPNMLLEGKVVKLRQE+IH
Subjt:  MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKNAKILSGVHPYFSVTINAKLLLFSPKPNMLLEGKVVKLRQEAIH

Query:  VIVLGFASAVITDEDIRDEFKHKTKHGEEMFVNRAHKHHVIKVGTMIQFLVKSFYEEILHISGSLVPSHTGSIHWLEKNSVEGSVSNNSKKKMRENE--V
        VIVLGFASAVIT EDIRDEFKH+TKHGEEMFV+RAHKHH+IKVGTMI+ LVKSF EEILHISGSLVPSHTGSIHWLEKNSVEGSV+  SK+KMRENE  V
Subjt:  VIVLGFASAVITDEDIRDEFKHKTKHGEEMFVNRAHKHHVIKVGTMIQFLVKSFYEEILHISGSLVPSHTGSIHWLEKNSVEGSVSNNSKKKMRENE--V

Query:  MGQDSIATDANAVILNNDHQLKTKKQKTTRIS
          QDS+ TD NAVILNNDHQ KTKKQK TRIS
Subjt:  MGQDSIATDANAVILNNDHQLKTKKQKTTRIS

A0A1S3CRF8 uncharacterized protein LOC103503892 isoform X29.3e-10691.81Show/hide
Query:  MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKNAKILSGVHPYFSVTINAKLLLFSPKPNMLLEGKVVKLRQEAIH
        MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKNAKILSGVHPYF VTI AKLLLFSPKPNMLLEGKVVKLRQE+IH
Subjt:  MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKNAKILSGVHPYFSVTINAKLLLFSPKPNMLLEGKVVKLRQEAIH

Query:  VIVLGFASAVITDEDIRDEFKHKTKHGEEMFVNRAHKHHVIKVGTMIQFLVKSFYEEILHISGSLVPSHTGSIHWLEKNSVEGSVSNNSKKKMRENE--V
        VIVLGFASAVIT EDIRDEFKH+TKHGEEMFV+RAHKHHVIKVGTMI+FLVKSF EEILHISGSLVPSHTGSIHWLEKNSVEGSVS  SKKKMRENE  V
Subjt:  VIVLGFASAVITDEDIRDEFKHKTKHGEEMFVNRAHKHHVIKVGTMIQFLVKSFYEEILHISGSLVPSHTGSIHWLEKNSVEGSVSNNSKKKMRENE--V

Query:  MGQDSIATDANAVILNNDHQLKTKKQKTTRIS
        M QD++ TD NAVIL NDHQ KTKKQKTTRIS
Subjt:  MGQDSIATDANAVILNNDHQLKTKKQKTTRIS

A0A1S4E5I5 uncharacterized protein LOC103503892 isoform X13.8e-10791.81Show/hide
Query:  MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKNAKILSGVHPYFSVTINAKLLLFSPKPNMLLEGKVVKLRQEAIH
        MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKNAKILSGVHPYF VTI AKLLLFSPKPNMLLEGKVVKLRQE+IH
Subjt:  MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKNAKILSGVHPYFSVTINAKLLLFSPKPNMLLEGKVVKLRQEAIH

Query:  VIVLGFASAVITDEDIRDEFKHKTKHGEEMFVNRAHKHHVIKVGTMIQFLVKSFYEEILHISGSLVPSHTGSIHWLEKNSVEGSVSNNSKKKMRENE--V
        VIVLGFASAVIT EDIRDEFKH+TKHGEEMFV+RAHKHHVIKVGTMI+FLVKSF EEILHISGSLVPSHTGSIHWLEKNSVEGSVS +SKKKMRENE  V
Subjt:  VIVLGFASAVITDEDIRDEFKHKTKHGEEMFVNRAHKHHVIKVGTMIQFLVKSFYEEILHISGSLVPSHTGSIHWLEKNSVEGSVSNNSKKKMRENE--V

Query:  MGQDSIATDANAVILNNDHQLKTKKQKTTRIS
        M QD++ TD NAVIL NDHQ KTKKQKTTRIS
Subjt:  MGQDSIATDANAVILNNDHQLKTKKQKTTRIS

A0A5D3E6A0 Putative DNA-directed RNA polymerase I subunit RPA433.8e-10791.81Show/hide
Query:  MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKNAKILSGVHPYFSVTINAKLLLFSPKPNMLLEGKVVKLRQEAIH
        MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKNAKILSGVHPYF VTI AKLLLFSPKPNMLLEGKVVKLRQE+IH
Subjt:  MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKNAKILSGVHPYFSVTINAKLLLFSPKPNMLLEGKVVKLRQEAIH

Query:  VIVLGFASAVITDEDIRDEFKHKTKHGEEMFVNRAHKHHVIKVGTMIQFLVKSFYEEILHISGSLVPSHTGSIHWLEKNSVEGSVSNNSKKKMRENE--V
        VIVLGFASAVIT EDIRDEFKH+TKHGEEMFV+RAHKHHVIKVGTMI+FLVKSF EEILHISGSLVPSHTGSIHWLEKNSVEGSVS +SKKKMRENE  V
Subjt:  VIVLGFASAVITDEDIRDEFKHKTKHGEEMFVNRAHKHHVIKVGTMIQFLVKSFYEEILHISGSLVPSHTGSIHWLEKNSVEGSVSNNSKKKMRENE--V

Query:  MGQDSIATDANAVILNNDHQLKTKKQKTTRIS
        M QD++ TD NAVIL NDHQ KTKKQKTTRIS
Subjt:  MGQDSIATDANAVILNNDHQLKTKKQKTTRIS

A0A6J1D5Y9 uncharacterized protein LOC1110167573.3e-10385.34Show/hide
Query:  MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKNAKILSGVHPYFSVTINAKLLLFSPKPNMLLEGKVVKLRQEAIH
        MEGLKVSDAN+VVYVHPSKSKKVSQAVLRELGAMLLKFDE+FEGVLLAY+AKI DK+AKILSGVHPYF VT+ AKLLLFSPKPNMLLEGKVVKLRQE++H
Subjt:  MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKNAKILSGVHPYFSVTINAKLLLFSPKPNMLLEGKVVKLRQEAIH

Query:  VIVLGFASAVITDEDIRDEFKHKTKHGEEMFVNRAHKHHVIKVGTMIQFLVKSFYEEILHISGSLVPSHTGSIHWLEKNSVEGSVSNNSKKKMREN--EV
        VIVLGFASA ITDEDIRDEFKH+TKH EEMFV+RAHKHHVIKVGTMI+FLVKSF EEILHISGSLVPSHTGSIHWLEKNS+EGSV++  +KK R+N  E 
Subjt:  VIVLGFASAVITDEDIRDEFKHKTKHGEEMFVNRAHKHHVIKVGTMIQFLVKSFYEEILHISGSLVPSHTGSIHWLEKNSVEGSVSNNSKKKMREN--EV

Query:  MGQDSIATDANAVILNNDHQLKTKKQKTTRIS
        + QDS+ATD NA+ILNNDHQ KTKKQKT+RIS
Subjt:  MGQDSIATDANAVILNNDHQLKTKKQKTTRIS

SwissProt top hitse value%identityAlignment
O43036 DNA-directed RNA polymerase I subunit rpa435.2e-0523.31Show/hide
Query:  NMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYE-AKIIDKNAKILSGVHPYFSVTINAKLLLFSPKPNMLLEGKVVKLRQEAIHVIVLGFAS
        ++ + + P  S+    A+   + +M+L    +  G++LAY+  + ++K+AK++    P+  + +   +L+FSPK    LEGK+  +    I +++LG  +
Subjt:  NMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYE-AKIIDKNAKILSGVHPYFSVTINAKLLLFSPKPNMLLEGKVVKLRQEAIHVIVLGFAS

Query:  AVITDEDI-RDEFKHKTKHGEEMFVNRAHKHHVIKVGTMIQFLVKSFYEE--ILHISGSLVPS
        A I  + I +D    +    EE    + +  ++++ G  ++F+V     E  +  + G+L  S
Subjt:  AVITDEDI-RDEFKHKTKHGEEMFVNRAHKHHVIKVGTMIQFLVKSFYEE--ILHISGSLVPS

Arabidopsis top hitse value%identityAlignment
AT1G75670.1 DNA-directed RNA polymerases2.4e-5852.58Show/hide
Query:  MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKNAKILSGVHPYFSVTINAKLLLFSPKPNMLLEGKVVKLRQEAIH
        MEGLK+S+A +++++HPS+S+ V Q + REL ++L +++E F+GVLLAY+A +  K AKIL+G+HPYF V +N +LLLF PKP   +EGK+VK+  E+IH
Subjt:  MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKNAKILSGVHPYFSVTINAKLLLFSPKPNMLLEGKVVKLRQEAIH

Query:  VIVLGFASAVITDEDIRDEFKHKTKHGEEMFVNRAHKHHVIKVGTMIQFLVKSFYEEILHISGSLVPSHTGSIHWLEKNSVEGSVSNNSKKKMR
        VIVLGF++AVITD DIR+EFK++ + GE  FV+R+HK H +K+GTM++  V+SF EE++HI+GSL+P +TG + WLEK S E   ++   K+ +
Subjt:  VIVLGFASAVITDEDIRDEFKHKTKHGEEMFVNRAHKHHVIKVGTMIQFLVKSFYEEILHISGSLVPSHTGSIHWLEKNSVEGSVSNNSKKKMR

AT1G75670.2 DNA-directed RNA polymerases2.4e-5852.58Show/hide
Query:  MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKNAKILSGVHPYFSVTINAKLLLFSPKPNMLLEGKVVKLRQEAIH
        MEGLK+S+A +++++HPS+S+ V Q + REL ++L +++E F+GVLLAY+A +  K AKIL+G+HPYF V +N +LLLF PKP   +EGK+VK+  E+IH
Subjt:  MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKNAKILSGVHPYFSVTINAKLLLFSPKPNMLLEGKVVKLRQEAIH

Query:  VIVLGFASAVITDEDIRDEFKHKTKHGEEMFVNRAHKHHVIKVGTMIQFLVKSFYEEILHISGSLVPSHTGSIHWLEKNSVEGSVSNNSKKKMR
        VIVLGF++AVITD DIR+EFK++ + GE  FV+R+HK H +K+GTM++  V+SF EE++HI+GSL+P +TG + WLEK S E   ++   K+ +
Subjt:  VIVLGFASAVITDEDIRDEFKHKTKHGEEMFVNRAHKHHVIKVGTMIQFLVKSFYEEILHISGSLVPSHTGSIHWLEKNSVEGSVSNNSKKKMR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGGGCTCAAGGTTTCGGATGCTAATATGGTTGTTTACGTTCACCCATCCAAAAGTAAGAAGGTGTCGCAAGCAGTGCTTCGAGAGCTCGGCGCTATGCTTTTAAA
ATTTGATGAAAAATTTGAAGGTGTGCTACTGGCTTATGAGGCCAAAATTATTGATAAAAATGCGAAGATTCTATCTGGAGTGCATCCCTATTTTAGCGTGACAATAAATG
CAAAGCTATTACTTTTCTCTCCAAAACCAAACATGCTTTTAGAAGGAAAGGTGGTGAAGCTTAGGCAAGAAGCAATCCATGTTATTGTCTTAGGTTTCGCCTCTGCTGTA
ATAACTGACGAAGACATTCGCGACGAATTTAAGCATAAAACAAAACATGGAGAAGAAATGTTTGTCAACAGAGCTCACAAGCACCATGTAATAAAGGTTGGGACAATGAT
ACAATTTTTGGTGAAGAGTTTTTATGAGGAAATATTGCATATCTCTGGATCTCTAGTTCCATCTCACACAGGGAGCATCCATTGGTTGGAGAAGAATTCGGTTGAAGGTT
CAGTAAGTAATAACAGCAAAAAGAAGATGAGAGAAAACGAGGTGATGGGTCAGGATAGCATTGCCACGGATGCAAATGCAGTTATCTTGAACAATGACCATCAGTTAAAA
ACCAAAAAGCAAAAAACTACCAGAATATCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGGGGCTCAAGGTTTCGGATGCTAATATGGTTGTTTACGTTCACCCATCCAAAAGTAAGAAGGTGTCGCAAGCAGTGCTTCGAGAGCTCGGCGCTATGCTTTTAAA
ATTTGATGAAAAATTTGAAGGTGTGCTACTGGCTTATGAGGCCAAAATTATTGATAAAAATGCGAAGATTCTATCTGGAGTGCATCCCTATTTTAGCGTGACAATAAATG
CAAAGCTATTACTTTTCTCTCCAAAACCAAACATGCTTTTAGAAGGAAAGGTGGTGAAGCTTAGGCAAGAAGCAATCCATGTTATTGTCTTAGGTTTCGCCTCTGCTGTA
ATAACTGACGAAGACATTCGCGACGAATTTAAGCATAAAACAAAACATGGAGAAGAAATGTTTGTCAACAGAGCTCACAAGCACCATGTAATAAAGGTTGGGACAATGAT
ACAATTTTTGGTGAAGAGTTTTTATGAGGAAATATTGCATATCTCTGGATCTCTAGTTCCATCTCACACAGGGAGCATCCATTGGTTGGAGAAGAATTCGGTTGAAGGTT
CAGTAAGTAATAACAGCAAAAAGAAGATGAGAGAAAACGAGGTGATGGGTCAGGATAGCATTGCCACGGATGCAAATGCAGTTATCTTGAACAATGACCATCAGTTAAAA
ACCAAAAAGCAAAAAACTACCAGAATATCTTGA
Protein sequenceShow/hide protein sequence
MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKNAKILSGVHPYFSVTINAKLLLFSPKPNMLLEGKVVKLRQEAIHVIVLGFASAV
ITDEDIRDEFKHKTKHGEEMFVNRAHKHHVIKVGTMIQFLVKSFYEEILHISGSLVPSHTGSIHWLEKNSVEGSVSNNSKKKMRENEVMGQDSIATDANAVILNNDHQLK
TKKQKTTRIS