CuGenDBv2

Lsi11G002710 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi11G002710
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: Nucleic acid-binding, OB-fold containing protein
Genome location: chr11:2753621..2757641
RNA-Seq Expression: Lsi11G002710
Synteny: Lsi11G002710
Gene Ontology terms:
GO:0006352 - DNA-templated transcription, initiation (biological process)
GO:0005736 - RNA polymerase I complex (cellular component)
GO:0003899 - DNA-directed 5'-3' RNA polymerase activity (molecular function)
InterPro domains:
IPR036898 - RNA polymerase Rpb7-like, N-terminal domain superfamily
IPR045113 - RNA polymerase Rpb7-like


Homology
GenBank top hits | e value | % identity
KAG7037901.1 rpa43, partial [Cucurbita argyrosperma subsp. argyrosperma] | 7.2e-88 | 77.73
Query:  FVSVAVSMEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIVDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVK
        FVS+AVSMEGLKVSDAN+V+YVHPSKSKKVSQAVLRELGAMLLKFDE+FEGVLLAYEA I+DK+AKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVK
Subjt:  FVSVAVSMEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIVDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVK

Query:  LRQESIHVIVLGFASAVITDEDIRDEIKHRTKHGEEI-----------------------FDEEILHISGSLVPSHTGSIHWLEKNSVGGSVTNTSKKKR
        LRQESIHVIVLGFASAVITDEDIRDE KHRTKHGEE+                       FDEEILHISGSLVPSHTGSIH LEKNSV GS T + KK R
Subjt:  LRQESIHVIVLGFASAVITDEDIRDEIKHRTKHGEEI-----------------------FDEEILHISGSLVPSHTGSIHWLEKNSVGGSVTNTSKKKR

Query:  ENEGDG----SVATDANALILNDDVQSKTKKQKTTRIS
        +N+ +     SVATD NAL+LN+D QSKTKKQKT+RIS
Subjt:  ENEGDG----SVATDANALILNDDVQSKTKKQKTTRIS

XP_011652444.1 uncharacterized protein LOC101216589 [Cucumis sativus] | 2.1e-87 | 79.65
Query:  MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIVDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
        MEGLKVSDANMVVYVHPSKSKK+SQAVLRELGAMLLKFDEKFEGVLLAYEAKI+DKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
Subjt:  MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIVDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH

Query:  VIVLGFASAVITDEDIRDEIKHRTKHGEEI-----------------------FDEEILHISGSLVPSHTGSIHWLEKNSVGGSVTNTSKKKRENEG---
        VIVLGFASAVIT EDIRDE KHRTKHGEE+                       FDEEILHISGSLVPSHTGSIHWLEKNSV GSVT + +K RENEG   
Subjt:  VIVLGFASAVITDEDIRDEIKHRTKHGEEI-----------------------FDEEILHISGSLVPSHTGSIHWLEKNSVGGSVTNTSKKKRENEG---

Query:  -DGSVATDANALILNDDVQSKTKKQKTTRIS
           SV TD NA+ILN+D Q KTKKQK TRIS
Subjt:  -DGSVATDANALILNDDVQSKTKKQKTTRIS

XP_038889707.1 uncharacterized protein LOC120079556 isoform X1 [Benincasa hispida] | 5.4e-91 | 82.46
Query:  MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIVDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
        MEGLKVSDAN+VVYVHPSKSKKVSQAVLRELG MLLKFDEKFEGVLLAYEA I+DKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
Subjt:  MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIVDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH

Query:  VIVLGFASAVITDEDIRDEIKHRTKHGEEI-----------------------FDEEILHISGSLVPSHTGSIHWLEKNSVGGSVTNTSKKK-RENEGDG
        VIVLGF+S VITDEDIRDE KHRTKHGEE+                       FDEEILHISGSLVPSHTGSIHWLEKNSV GSVTN+SKKK RENEG  
Subjt:  VIVLGFASAVITDEDIRDEIKHRTKHGEEI-----------------------FDEEILHISGSLVPSHTGSIHWLEKNSVGGSVTNTSKKK-RENEGDG

Query:  SVATDANALILNDDVQSKTKKQKTTRIS
        SV+TDANALILN+D QSKTK+QKTTRIS
Subjt:  SVATDANALILNDDVQSKTKKQKTTRIS

XP_038889709.1 uncharacterized protein LOC120079556 isoform X2 [Benincasa hispida] | 3.1e-91 | 81.94
Query:  MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIVDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
        MEGLKVSDAN+VVYVHPSKSKKVSQAVLRELG MLLKFDEKFEGVLLAYEA I+DKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
Subjt:  MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIVDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH

Query:  VIVLGFASAVITDEDIRDEIKHRTKHGEEI-----------------------FDEEILHISGSLVPSHTGSIHWLEKNSVGGSVTNTSKKKRENEGDGS
        VIVLGF+S VITDEDIRDE KHRTKHGEE+                       FDEEILHISGSLVPSHTGSIHWLEKNSV GSVTN+ KK RENEG  S
Subjt:  VIVLGFASAVITDEDIRDEIKHRTKHGEEI-----------------------FDEEILHISGSLVPSHTGSIHWLEKNSVGGSVTNTSKKKRENEGDGS

Query:  VATDANALILNDDVQSKTKKQKTTRIS
        V+TDANALILN+D QSKTK+QKTTRIS
Subjt:  VATDANALILNDDVQSKTKKQKTTRIS

XP_038898978.1 uncharacterized protein LOC120086413 isoform X3 [Benincasa hispida] | 4.7e-87 | 79.74
Query:  MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIVDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
        M+GLKVSDANMVVYVHPSKSKKVSQAVLR LGAMLLKFDEKFEGVLLAYEAKI+DK AKILSGVHPYFGVTIKAKLLLFSPKPNML+EGKVVKLRQESIH
Subjt:  MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIVDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH

Query:  VIVLGFASAVITDEDIRDEIKHRTKHGEEI-----------------------FDEEILHISGSLVPSHTGSIHWLEKNSVGGSVTNTSKKK-RENEGD-
        VIVLGFASAVIT+EDIRDE KHRTKHGEE+                       FDEEILHI+GSLVPSHTGSIHWLEKNSV G+VT++SKKK RENEG+ 
Subjt:  VIVLGFASAVITDEDIRDEIKHRTKHGEEI-----------------------FDEEILHISGSLVPSHTGSIHWLEKNSVGGSVTNTSKKK-RENEGD-

Query:  ---GSVATDANALILNDDVQSKTKKQKTTRIS
            SVAT+ NALILN+D QSKTKKQKTTRIS
Subjt:  ---GSVATDANALILNDDVQSKTKKQKTTRIS

TrEMBL top hits | e value | % identity
A0A0A0LGG8 Uncharacterized protein | 1.0e-87 | 79.65
Query:  MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIVDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
        MEGLKVSDANMVVYVHPSKSKK+SQAVLRELGAMLLKFDEKFEGVLLAYEAKI+DKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
Subjt:  MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIVDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH

Query:  VIVLGFASAVITDEDIRDEIKHRTKHGEEI-----------------------FDEEILHISGSLVPSHTGSIHWLEKNSVGGSVTNTSKKKRENEG---
        VIVLGFASAVIT EDIRDE KHRTKHGEE+                       FDEEILHISGSLVPSHTGSIHWLEKNSV GSVT + +K RENEG   
Subjt:  VIVLGFASAVITDEDIRDEIKHRTKHGEEI-----------------------FDEEILHISGSLVPSHTGSIHWLEKNSVGGSVTNTSKKKRENEG---

Query:  -DGSVATDANALILNDDVQSKTKKQKTTRIS
           SV TD NA+ILN+D Q KTKKQK TRIS
Subjt:  -DGSVATDANALILNDDVQSKTKKQKTTRIS

A0A1S3CRF8 uncharacterized protein LOC103503892 isoform X2 | 8.6e-87 | 80.09
Query:  MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIVDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
        MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKI+DKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
Subjt:  MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIVDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH

Query:  VIVLGFASAVITDEDIRDEIKHRTKHGEEI-----------------------FDEEILHISGSLVPSHTGSIHWLEKNSVGGSVTNTSKKKRENEG---
        VIVLGFASAVIT EDIRDE KHRTKHGEE+                       FDEEILHISGSLVPSHTGSIHWLEKNSV GSV+ + KK RENEG   
Subjt:  VIVLGFASAVITDEDIRDEIKHRTKHGEEI-----------------------FDEEILHISGSLVPSHTGSIHWLEKNSVGGSVTNTSKKKRENEG---

Query:  -DGSVATDANALILNDDVQSKTKKQKTTRIS
           +V TD NA+ILND  Q KTKKQKTTRIS
Subjt:  -DGSVATDANALILNDDVQSKTKKQKTTRIS

A0A1S4E5I5 uncharacterized protein LOC103503892 isoform X1 | 1.5e-86 | 80.6
Query:  MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIVDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
        MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKI+DKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
Subjt:  MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIVDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH

Query:  VIVLGFASAVITDEDIRDEIKHRTKHGEEI-----------------------FDEEILHISGSLVPSHTGSIHWLEKNSVGGSVTNTSKKK-RENEG--
        VIVLGFASAVIT EDIRDE KHRTKHGEE+                       FDEEILHISGSLVPSHTGSIHWLEKNSV GSV+ +SKKK RENEG  
Subjt:  VIVLGFASAVITDEDIRDEIKHRTKHGEEI-----------------------FDEEILHISGSLVPSHTGSIHWLEKNSVGGSVTNTSKKK-RENEG--

Query:  --DGSVATDANALILNDDVQSKTKKQKTTRIS
            +V TD NA+ILND  Q KTKKQKTTRIS
Subjt:  --DGSVATDANALILNDDVQSKTKKQKTTRIS

A0A5D3E6A0 Putative DNA-directed RNA polymerase I subunit RPA43 | 1.5e-86 | 80.6
Query:  MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIVDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
        MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKI+DKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
Subjt:  MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIVDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH

Query:  VIVLGFASAVITDEDIRDEIKHRTKHGEEI-----------------------FDEEILHISGSLVPSHTGSIHWLEKNSVGGSVTNTSKKK-RENEG--
        VIVLGFASAVIT EDIRDE KHRTKHGEE+                       FDEEILHISGSLVPSHTGSIHWLEKNSV GSV+ +SKKK RENEG  
Subjt:  VIVLGFASAVITDEDIRDEIKHRTKHGEEI-----------------------FDEEILHISGSLVPSHTGSIHWLEKNSVGGSVTNTSKKK-RENEG--

Query:  --DGSVATDANALILNDDVQSKTKKQKTTRIS
            +V TD NA+ILND  Q KTKKQKTTRIS
Subjt:  --DGSVATDANALILNDDVQSKTKKQKTTRIS

A0A6J1D5Y9 uncharacterized protein LOC111016757 | 5.6e-86 | 77.59
Query:  MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIVDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
        MEGLKVSDAN+VVYVHPSKSKKVSQAVLRELGAMLLKFDE+FEGVLLAY+AKI DK+AKILSGVHPYFGVT+KAKLLLFSPKPNMLLEGKVVKLRQES+H
Subjt:  MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIVDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH

Query:  VIVLGFASAVITDEDIRDEIKHRTKHGEEI-----------------------FDEEILHISGSLVPSHTGSIHWLEKNSVGGSVTNTSKKK-RENEGDG
        VIVLGFASA ITDEDIRDE KHRTKH EE+                       FDEEILHISGSLVPSHTGSIHWLEKNS+ GSVT+  +KK R+NEG+ 
Subjt:  VIVLGFASAVITDEDIRDEIKHRTKHGEEI-----------------------FDEEILHISGSLVPSHTGSIHWLEKNSVGGSVTNTSKKK-RENEGDG

Query:  ----SVATDANALILNDDVQSKTKKQKTTRIS
            SVATD NALILN+D QSKTKKQKT+RIS
Subjt:  ----SVATDANALILNDDVQSKTKKQKTTRIS

SwissProt top hits | e value | % identity
O43036 DNA-directed RNA polymerase I subunit rpa43 | 1.8e-04 | 25.93
Query:  NMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYE-AKIVDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIHVIVLGFAS
        ++ + + P  S+    A+   + +M+L    +  G++LAY+  + ++K+AK++    P+  + ++  +L+FSPK    LEGK+  +    I +++LG  +
Subjt:  NMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYE-AKIVDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIHVIVLGFAS

Query:  AVITDEDI
        A I  + I
Subjt:  AVITDEDI

Arabidopsis top hits | e value | % identity
AT1G75670.1 DNA-directed RNA polymerases | 6.0e-48 | 47.94
Query:  MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIVDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
        MEGLK+S+A +++++HPS+S+ V Q + REL ++L +++E F+GVLLAY+A +  K AKIL+G+HPYFGV +  +LLLF PKP   +EGK+VK+  ESIH
Subjt:  MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIVDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH

Query:  VIVLGFASAVITDEDIRDEIKHRTKHGE-----------------------EIFDEEILHISGSLVPSHTGSIHWLEKNSVGGSVTNTSKKKRE
        VIVLGF++AVITD DIR+E K+R + GE                       + FDEE++HI+GSL+P +TG + WLEK S     T+   K+R+
Subjt:  VIVLGFASAVITDEDIRDEIKHRTKHGE-----------------------EIFDEEILHISGSLVPSHTGSIHWLEKNSVGGSVTNTSKKKRE

AT1G75670.2 DNA-directed RNA polymerases | 6.0e-48 | 47.94
Query:  MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIVDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
        MEGLK+S+A +++++HPS+S+ V Q + REL ++L +++E F+GVLLAY+A +  K AKIL+G+HPYFGV +  +LLLF PKP   +EGK+VK+  ESIH
Subjt:  MEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIVDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH

Query:  VIVLGFASAVITDEDIRDEIKHRTKHGE-----------------------EIFDEEILHISGSLVPSHTGSIHWLEKNSVGGSVTNTSKKKRE
        VIVLGF++AVITD DIR+E K+R + GE                       + FDEE++HI+GSL+P +TG + WLEK S     T+   K+R+
Subjt:  VIVLGFASAVITDEDIRDEIKHRTKHGE-----------------------EIFDEEILHISGSLVPSHTGSIHWLEKNSVGGSVTNTSKKKRE


Sequences
CDS sequence
ATGGTCTCTCTGCTCGATCCGTCTGTCAGTACTGGGTGCTTACGCCGACTGCTCACGCCGTTTCCGATTCTTCCTCTGACCGCCACCGTCCCTAGTCGGAGTTCATTGGG
TTCTGGTGAGTGTGCAGTGGGTAAATGGGAGAGTCTTCTCCGGCCGCTGCAGATTCTCCTCCTCTTTGTTTCGGTCGCCGTTTCAATGGAGGGGCTCAAGGTTTCGGATG
CTAATATGGTTGTTTATGTTCACCCATCTAAAAGTAAGAAGGTTTCGCAAGCGGTGCTTCGAGAGCTCGGCGCTATGCTTCTAAAATTTGATGAAAAATTTGAAGGTGTG
CTACTGGCTTATGAGGCCAAAATTGTTGATAAAAATGCGAAGATTCTATCTGGAGTGCATCCCTATTTCGGCGTGACAATAAAGGCGAAGCTATTACTTTTCTCTCCAAA
ACCAAACATGCTTTTAGAAGGAAAGGTGGTGAAGCTTAGGCAAGAATCAATCCATGTTATTGTCTTAGGTTTTGCTTCTGCTGTAATAACCGATGAAGACATTCGCGACG
AAATCAAGCATAGAACAAAACATGGAGAAGAAATTTTTGATGAGGAAATATTGCATATCTCTGGTTCTCTAGTTCCATCTCACACAGGGAGCATCCATTGGTTGGAAAAG
AATTCAGTTGGAGGTTCAGTAACTAATACCAGCAAAAAGAAGAGAGAAAACGAGGGAGACGGGAGCGTTGCCACGGATGCAAATGCACTTATCTTGAACGACGACGTTCA
ATCAAAAACCAAAAAGCAAAAAACTACCAGAATATCTTAA
mRNA sequence
ATGGTCTCTCTGCTCGATCCGTCTGTCAGTACTGGGTGCTTACGCCGACTGCTCACGCCGTTTCCGATTCTTCCTCTGACCGCCACCGTCCCTAGTCGGAGTTCATTGGG
TTCTGGTGAGTGTGCAGTGGGTAAATGGGAGAGTCTTCTCCGGCCGCTGCAGATTCTCCTCCTCTTTGTTTCGGTCGCCGTTTCAATGGAGGGGCTCAAGGTTTCGGATG
CTAATATGGTTGTTTATGTTCACCCATCTAAAAGTAAGAAGGTTTCGCAAGCGGTGCTTCGAGAGCTCGGCGCTATGCTTCTAAAATTTGATGAAAAATTTGAAGGTGTG
CTACTGGCTTATGAGGCCAAAATTGTTGATAAAAATGCGAAGATTCTATCTGGAGTGCATCCCTATTTCGGCGTGACAATAAAGGCGAAGCTATTACTTTTCTCTCCAAA
ACCAAACATGCTTTTAGAAGGAAAGGTGGTGAAGCTTAGGCAAGAATCAATCCATGTTATTGTCTTAGGTTTTGCTTCTGCTGTAATAACCGATGAAGACATTCGCGACG
AAATCAAGCATAGAACAAAACATGGAGAAGAAATTTTTGATGAGGAAATATTGCATATCTCTGGTTCTCTAGTTCCATCTCACACAGGGAGCATCCATTGGTTGGAAAAG
AATTCAGTTGGAGGTTCAGTAACTAATACCAGCAAAAAGAAGAGAGAAAACGAGGGAGACGGGAGCGTTGCCACGGATGCAAATGCACTTATCTTGAACGACGACGTTCA
ATCAAAAACCAAAAAGCAAAAAACTACCAGAATATCTTAAGTACTTCTAATTGTGATATAACATAGATTGTTTTTGTATCAGGGTGATGATGATTCAAACTAATTCAAAG
CAGAGGTCCTCATTGTTTTCCTATAGTAAGAAGGGCTTGGGAGATGCACCTTGCACTTGGGAAGATCAATGACAACATTAGATATCAGGGTGTGAACAAATGCAGCTCCC
CATTTTGTAACTGTTATGAAGAGATAGCCCAAATGCATGTATTATGGCTCCTTTAGTTTCCTTTAATAGGCTTGTCTAATAACAATTTATCAAATTTTGTTCTACCTTTA
AATTTTAAATTTTTGTTGGAAGCTAGAAGTTTATTATTAGGTCAAAATACACTTTTGGTCGTTAAGATTTGAGCTTGGTGTCTATTTGATTTATGAGGTTTCAAAAACAG
TTCTAAATAGTCTCTGATGGTAACTTGACCATTAGTTAACTAACGGAATTGTGATGTGGCATAGTTGAATAGATAAATTTTAATGTATGTGGCAATATCTATTATTTTTA
TCATGGTCATATTTATATGGGTA
Protein sequence
MVSLLDPSVSTGCLRRLLTPFPILPLTATVPSRSSLGSGECAVGKWESLLRPLQILLLFVSVAVSMEGLKVSDANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGV
LLAYEAKIVDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIHVIVLGFASAVITDEDIRDEIKHRTKHGEEIFDEEILHISGSLVPSHTGSIHWLEK
NSVGGSVTNTSKKKRENEGDGSVATDANALILNDDVQSKTKKQKTTRIS