; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10006705 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10006705
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionNucleic acid-binding, OB-fold containing protein
Genome locationChr07:21226232..21228214
RNA-Seq ExpressionHG10006705
SyntenyHG10006705
Gene Ontology termsGO:0006352 - DNA-templated transcription, initiation (biological process)
GO:0005736 - RNA polymerase I complex (cellular component)
GO:0003899 - DNA-directed 5'-3' RNA polymerase activity (molecular function)
InterPro domainsIPR036898 - RNA polymerase Rpb7-like, N-terminal domain superfamily
IPR045113 - RNA polymerase Rpb7-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008466500.1 PREDICTED: uncharacterized protein LOC103503892 isoform X2 [Cucumis melo]1.6e-10490.48Show/hide
Query:  MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIVDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
        MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKI+DKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
Subjt:  MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIVDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH

Query:  VIVLGFASAVITDEDIRDEIKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSVGGSVTNTSKKKRENEG---
        VIVLGFASAVIT EDIRDE KHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSV GSV+ + KK RENEG   
Subjt:  VIVLGFASAVITDEDIRDEIKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSVGGSVTNTSKKKRENEG---

Query:  -DGSVATDANALILNDDVQSKTKKQKTTRIS
           +V TD NA+ILND  Q KTKKQKTTRIS
Subjt:  -DGSVATDANALILNDDVQSKTKKQKTTRIS

XP_011652444.1 uncharacterized protein LOC101216589 [Cucumis sativus]9.4e-10589.18Show/hide
Query:  MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIVDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
        MEGLKVSDANMVVYVHPSKSKK+SQAVLRELGAMLLKFDEKFEGVLLAYEAKI+DKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
Subjt:  MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIVDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH

Query:  VIVLGFASAVITDEDIRDEIKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSVGGSVTNTSKKKRENEG---
        VIVLGFASAVIT EDIRDE KHRTKHGEEMFVSRAHKHH+IKVGTMIR LVKSFDEEILHISGSLVPSHTGSIHWLEKNSV GSVT + +K RENEG   
Subjt:  VIVLGFASAVITDEDIRDEIKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSVGGSVTNTSKKKRENEG---

Query:  -DGSVATDANALILNDDVQSKTKKQKTTRIS
           SV TD NA+ILN+D Q KTKKQK TRIS
Subjt:  -DGSVATDANALILNDDVQSKTKKQKTTRIS

XP_038889707.1 uncharacterized protein LOC120079556 isoform X1 [Benincasa hispida]1.1e-10892.54Show/hide
Query:  MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIVDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
        MEGLKVSDAN+VVYVHPSKSKKVSQAVLRELG MLLKFDEKFEGVLLAYEA I+DKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
Subjt:  MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIVDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH

Query:  VIVLGFASAVITDEDIRDEIKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSVGGSVTNTSKKK-RENEGDG
        VIVLGF+S VITDEDIRDE KHRTKHGEEMFVSR HKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSV GSVTN+SKKK RENEG  
Subjt:  VIVLGFASAVITDEDIRDEIKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSVGGSVTNTSKKK-RENEGDG

Query:  SVATDANALILNDDVQSKTKKQKTTRIS
        SV+TDANALILN+D QSKTK+QKTTRIS
Subjt:  SVATDANALILNDDVQSKTKKQKTTRIS

XP_038889709.1 uncharacterized protein LOC120079556 isoform X2 [Benincasa hispida]6.3e-10992.07Show/hide
Query:  MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIVDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
        MEGLKVSDAN+VVYVHPSKSKKVSQAVLRELG MLLKFDEKFEGVLLAYEA I+DKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
Subjt:  MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIVDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH

Query:  VIVLGFASAVITDEDIRDEIKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSVGGSVTNTSKKKRENEGDGS
        VIVLGF+S VITDEDIRDE KHRTKHGEEMFVSR HKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSV GSVTN+ KK RENEG  S
Subjt:  VIVLGFASAVITDEDIRDEIKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSVGGSVTNTSKKKRENEGDGS

Query:  VATDANALILNDDVQSKTKKQKTTRIS
        V+TDANALILN+D QSKTK+QKTTRIS
Subjt:  VATDANALILNDDVQSKTKKQKTTRIS

XP_038898978.1 uncharacterized protein LOC120086413 isoform X3 [Benincasa hispida]3.2e-10590.09Show/hide
Query:  MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIVDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
        M+GLKVSDANMVVYVHPSKSKKVSQAVLR LGAMLLKFDEKFEGVLLAYEAKI+DK AKILSGVHPYFGVTIKAKLLLFSPKPNML+EGKVVKLRQESIH
Subjt:  MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIVDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH

Query:  VIVLGFASAVITDEDIRDEIKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSVGGSVTNTSKKK-RENEGD-
        VIVLGFASAVIT+EDIRDE KHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHI+GSLVPSHTGSIHWLEKNSV G+VT++SKKK RENEG+ 
Subjt:  VIVLGFASAVITDEDIRDEIKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSVGGSVTNTSKKK-RENEGD-

Query:  ---GSVATDANALILNDDVQSKTKKQKTTRIS
            SVAT+ NALILN+D QSKTKKQKTTRIS
Subjt:  ---GSVATDANALILNDDVQSKTKKQKTTRIS

TrEMBL top hitse value%identityAlignment
A0A0A0LGG8 Uncharacterized protein4.5e-10589.18Show/hide
Query:  MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIVDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
        MEGLKVSDANMVVYVHPSKSKK+SQAVLRELGAMLLKFDEKFEGVLLAYEAKI+DKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
Subjt:  MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIVDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH

Query:  VIVLGFASAVITDEDIRDEIKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSVGGSVTNTSKKKRENEG---
        VIVLGFASAVIT EDIRDE KHRTKHGEEMFVSRAHKHH+IKVGTMIR LVKSFDEEILHISGSLVPSHTGSIHWLEKNSV GSVT + +K RENEG   
Subjt:  VIVLGFASAVITDEDIRDEIKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSVGGSVTNTSKKKRENEG---

Query:  -DGSVATDANALILNDDVQSKTKKQKTTRIS
           SV TD NA+ILN+D Q KTKKQK TRIS
Subjt:  -DGSVATDANALILNDDVQSKTKKQKTTRIS

A0A1S3CRF8 uncharacterized protein LOC103503892 isoform X27.7e-10590.48Show/hide
Query:  MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIVDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
        MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKI+DKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
Subjt:  MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIVDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH

Query:  VIVLGFASAVITDEDIRDEIKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSVGGSVTNTSKKKRENEG---
        VIVLGFASAVIT EDIRDE KHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSV GSV+ + KK RENEG   
Subjt:  VIVLGFASAVITDEDIRDEIKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSVGGSVTNTSKKKRENEG---

Query:  -DGSVATDANALILNDDVQSKTKKQKTTRIS
           +V TD NA+ILND  Q KTKKQKTTRIS
Subjt:  -DGSVATDANALILNDDVQSKTKKQKTTRIS

A0A1S4E5I5 uncharacterized protein LOC103503892 isoform X11.3e-10490.95Show/hide
Query:  MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIVDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
        MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKI+DKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
Subjt:  MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIVDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH

Query:  VIVLGFASAVITDEDIRDEIKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSVGGSVTNTSKKK-RENEG--
        VIVLGFASAVIT EDIRDE KHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSV GSV+ +SKKK RENEG  
Subjt:  VIVLGFASAVITDEDIRDEIKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSVGGSVTNTSKKK-RENEG--

Query:  --DGSVATDANALILNDDVQSKTKKQKTTRIS
            +V TD NA+ILND  Q KTKKQKTTRIS
Subjt:  --DGSVATDANALILNDDVQSKTKKQKTTRIS

A0A5D3E6A0 Putative DNA-directed RNA polymerase I subunit RPA431.3e-10490.95Show/hide
Query:  MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIVDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
        MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKI+DKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
Subjt:  MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIVDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH

Query:  VIVLGFASAVITDEDIRDEIKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSVGGSVTNTSKKK-RENEG--
        VIVLGFASAVIT EDIRDE KHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSV GSV+ +SKKK RENEG  
Subjt:  VIVLGFASAVITDEDIRDEIKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSVGGSVTNTSKKK-RENEG--

Query:  --DGSVATDANALILNDDVQSKTKKQKTTRIS
            +V TD NA+ILND  Q KTKKQKTTRIS
Subjt:  --DGSVATDANALILNDDVQSKTKKQKTTRIS

A0A6J1D5Y9 uncharacterized protein LOC1110167573.8e-10487.93Show/hide
Query:  MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIVDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
        MEGLKVSDAN+VVYVHPSKSKKVSQAVLRELGAMLLKFDE+FEGVLLAY+AKI DK+AKILSGVHPYFGVT+KAKLLLFSPKPNMLLEGKVVKLRQES+H
Subjt:  MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIVDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH

Query:  VIVLGFASAVITDEDIRDEIKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSVGGSVTNTSKKK-RENEGDG
        VIVLGFASA ITDEDIRDE KHRTKH EEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNS+ GSVT+  +KK R+NEG+ 
Subjt:  VIVLGFASAVITDEDIRDEIKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSVGGSVTNTSKKK-RENEGDG

Query:  ----SVATDANALILNDDVQSKTKKQKTTRIS
            SVATD NALILN+D QSKTKKQKT+RIS
Subjt:  ----SVATDANALILNDDVQSKTKKQKTTRIS

SwissProt top hitse value%identityAlignment
O43036 DNA-directed RNA polymerase I subunit rpa437.9e-0623.93Show/hide
Query:  NMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYE-AKIVDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIHVIVLGFAS
        ++ + + P  S+    A+   + +M+L    +  G++LAY+  + ++K+AK++    P+  + ++  +L+FSPK    LEGK+  +    I +++LG  +
Subjt:  NMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYE-AKIVDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIHVIVLGFAS

Query:  AVITDEDI-RDEIKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEE--ILHISGSLVPS
        A I  + I +D I       EE    + +  ++++ G  + F+V     E  +  + G+L  S
Subjt:  AVITDEDI-RDEIKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEE--ILHISGSLVPS

Arabidopsis top hitse value%identityAlignment
AT1G75670.1 DNA-directed RNA polymerases6.8e-6155.15Show/hide
Query:  MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIVDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
        MEGLK+S+A +++++HPS+S+ V Q + REL ++L +++E F+GVLLAY+A +  K AKIL+G+HPYFGV +  +LLLF PKP   +EGK+VK+  ESIH
Subjt:  MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIVDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH

Query:  VIVLGFASAVITDEDIRDEIKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSVGGSVTNTSKKKRE
        VIVLGF++AVITD DIR+E K+R + GE  FVSR+HK H +K+GTM+R  V+SFDEE++HI+GSL+P +TG + WLEK S     T+   K+R+
Subjt:  VIVLGFASAVITDEDIRDEIKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSVGGSVTNTSKKKRE

AT1G75670.2 DNA-directed RNA polymerases6.8e-6155.15Show/hide
Query:  MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIVDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
        MEGLK+S+A +++++HPS+S+ V Q + REL ++L +++E F+GVLLAY+A +  K AKIL+G+HPYFGV +  +LLLF PKP   +EGK+VK+  ESIH
Subjt:  MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIVDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH

Query:  VIVLGFASAVITDEDIRDEIKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSVGGSVTNTSKKKRE
        VIVLGF++AVITD DIR+E K+R + GE  FVSR+HK H +K+GTM+R  V+SFDEE++HI+GSL+P +TG + WLEK S     T+   K+R+
Subjt:  VIVLGFASAVITDEDIRDEIKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSVGGSVTNTSKKKRE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGGGCTCAAGGTTTCGGATGCTAATATGGTTGTTTATGTTCACCCATCTAAAAGTAAGAAGGTTTCGCAAGCGGTGCTTCGAGAGCTCGGCGCTATGCTTCTAAA
ATTTGATGAAAAATTTGAAGGTGTGCTACTGGCTTATGAGGCCAAAATTGTTGATAAAAATGCGAAGATTCTATCTGGAGTGCATCCCTATTTCGGCGTGACAATAAAGG
CGAAGCTATTACTTTTCTCTCCAAAACCAAACATGCTTTTAGAAGGAAAGGTGGTGAAGCTTAGGCAAGAATCAATCCATGTTATTGTCTTAGGTTTTGCTTCTGCTGTA
ATAACCGATGAAGACATTCGCGACGAAATCAAGCATAGAACAAAACATGGAGAAGAAATGTTTGTTAGCAGAGCTCACAAACACCATGTAATAAAGGTTGGGACAATGAT
ACGATTTTTGGTGAAGAGTTTTGATGAGGAAATATTGCATATCTCTGGTTCTCTAGTTCCATCTCACACAGGGAGCATCCATTGGTTGGAAAAGAATTCAGTTGGAGGTT
CAGTAACTAATACCAGCAAAAAGAAGAGAGAAAACGAGGGAGACGGGAGCGTTGCCACGGATGCAAATGCACTTATCTTGAACGACGACGTTCAATCAAAAACCAAAAAG
CAAAAAACTACCAGAATATCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAGGGGCTCAAGGTTTCGGATGCTAATATGGTTGTTTATGTTCACCCATCTAAAAGTAAGAAGGTTTCGCAAGCGGTGCTTCGAGAGCTCGGCGCTATGCTTCTAAA
ATTTGATGAAAAATTTGAAGGTGTGCTACTGGCTTATGAGGCCAAAATTGTTGATAAAAATGCGAAGATTCTATCTGGAGTGCATCCCTATTTCGGCGTGACAATAAAGG
CGAAGCTATTACTTTTCTCTCCAAAACCAAACATGCTTTTAGAAGGAAAGGTGGTGAAGCTTAGGCAAGAATCAATCCATGTTATTGTCTTAGGTTTTGCTTCTGCTGTA
ATAACCGATGAAGACATTCGCGACGAAATCAAGCATAGAACAAAACATGGAGAAGAAATGTTTGTTAGCAGAGCTCACAAACACCATGTAATAAAGGTTGGGACAATGAT
ACGATTTTTGGTGAAGAGTTTTGATGAGGAAATATTGCATATCTCTGGTTCTCTAGTTCCATCTCACACAGGGAGCATCCATTGGTTGGAAAAGAATTCAGTTGGAGGTT
CAGTAACTAATACCAGCAAAAAGAAGAGAGAAAACGAGGGAGACGGGAGCGTTGCCACGGATGCAAATGCACTTATCTTGAACGACGACGTTCAATCAAAAACCAAAAAG
CAAAAAACTACCAGAATATCTTAA
Protein sequenceShow/hide protein sequence
MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIVDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIHVIVLGFASAV
ITDEDIRDEIKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSVGGSVTNTSKKKRENEGDGSVATDANALILNDDVQSKTKK
QKTTRIS