; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr027733 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr027733
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Descriptionprotein DCL homolog, chloroplastic-like
Genome locationtig00153055:2135828..2148344
RNA-Seq ExpressionSgr027733
SyntenySgr027733
Gene Ontology termsGO:0009658 - chloroplast organization (biological process)
GO:1901259 - chloroplast rRNA processing (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0003729 - mRNA binding (molecular function)
InterPro domainsIPR035892 - C2 domain superfamily
IPR044673 - Protein DCL-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6608449.1 Protein DCL-like, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]3.4e-5782.96Show/hide
Query:  VQGHPLLRLGLRHRGLCIRIVQVPRRSCCTATAASTPPDGNVTSADNTTAVLSASDPPKYPRWDEPDYRKWKDQEEEILSDIEPTVSLTKEILHSNRYVD
        ++GHPL+RLGLR+RGLC  IVQV RRSCCTA  ASTPP G ++SA+NTT+VLSA+DPPKY RWDEP YRKWK+QEEEILSDI+P +SLTKEILHSNRYVD
Subjt:  VQGHPLLRLGLRHRGLCIRIVQVPRRSCCTATAASTPPDGNVTSADNTTAVLSASDPPKYPRWDEPDYRKWKDQEEEILSDIEPTVSLTKEILHSNRYVD

Query:  GERLTSEDEKIVVDRLLAHHPHAEDKIGCGLESIM
        GERLT EDEKIVVDRLLAHHPHAEDKIGCGLESIM
Subjt:  GERLTSEDEKIVVDRLLAHHPHAEDKIGCGLESIM

KAG7037785.1 Protein DCL-like, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma]3.4e-5782.96Show/hide
Query:  VQGHPLLRLGLRHRGLCIRIVQVPRRSCCTATAASTPPDGNVTSADNTTAVLSASDPPKYPRWDEPDYRKWKDQEEEILSDIEPTVSLTKEILHSNRYVD
        ++GHPL+RLGLR+RGLC  IVQV RRSCCTA  ASTPP G ++SA+NTT+VLSA+DPPKY RWDEP YRKWK+QEEEILSDI+P +SLTKEILHSNRYVD
Subjt:  VQGHPLLRLGLRHRGLCIRIVQVPRRSCCTATAASTPPDGNVTSADNTTAVLSASDPPKYPRWDEPDYRKWKDQEEEILSDIEPTVSLTKEILHSNRYVD

Query:  GERLTSEDEKIVVDRLLAHHPHAEDKIGCGLESIM
        GERLT EDEKIVVDRLLAHHPHAEDKIGCGLESIM
Subjt:  GERLTSEDEKIVVDRLLAHHPHAEDKIGCGLESIM

XP_022135338.1 protein DCL, chloroplastic isoform X1 [Momordica charantia]4.3e-6085.93Show/hide
Query:  VQGHPLLRLGLRHRGLCIRIVQVPRRSCCTATAASTPPDGNVTSADNTTAVLSASDPPKYPRWDEPDYRKWKDQEEEILSDIEPTVSLTKEILHSNRYVD
        ++GHPLLRLGLRHRGLC  IVQV RRSCCTATAA TPPDGN+TSA+N T+VLS+SDPPKY RWDEPDYRKWKDQEEE+L+DIEP +SLTKEILHSNRYVD
Subjt:  VQGHPLLRLGLRHRGLCIRIVQVPRRSCCTATAASTPPDGNVTSADNTTAVLSASDPPKYPRWDEPDYRKWKDQEEEILSDIEPTVSLTKEILHSNRYVD

Query:  GERLTSEDEKIVVDRLLAHHPHAEDKIGCGLESIM
        GERLTS DE+IVV+RLLAHHPHAEDKIGCGLESIM
Subjt:  GERLTSEDEKIVVDRLLAHHPHAEDKIGCGLESIM

XP_022135348.1 uncharacterized protein LOC111007322 isoform X2 [Momordica charantia]3.3e-6085.29Show/hide
Query:  VQGHPLLRLGLRHRGLCIRIVQVPRRSCCTATAASTPPDGNVTSADNTTAVLSASDPPKYPRWDEPDYRKWKDQEEEILSDIEPTVSLTKEILHSNRYVD
        ++GHPLLRLGLRHRGLC  IVQV RRSCCTATAA TPPDGN+TSA+N T+VLS+SDPPKY RWDEPDYRKWKDQEEE+L+DIEP +SLTKEILHSNRYVD
Subjt:  VQGHPLLRLGLRHRGLCIRIVQVPRRSCCTATAASTPPDGNVTSADNTTAVLSASDPPKYPRWDEPDYRKWKDQEEEILSDIEPTVSLTKEILHSNRYVD

Query:  GERLTSEDEKIVVDRLLAHHPHAEDKIGCGLESIMH
        GERLTS DE+IVV+RLLAHHPHAEDKIGCGLESIM+
Subjt:  GERLTSEDEKIVVDRLLAHHPHAEDKIGCGLESIMH

XP_022940238.1 protein DCL homolog, chloroplastic-like [Cucurbita moschata]1.3e-5682.22Show/hide
Query:  VQGHPLLRLGLRHRGLCIRIVQVPRRSCCTATAASTPPDGNVTSADNTTAVLSASDPPKYPRWDEPDYRKWKDQEEEILSDIEPTVSLTKEILHSNRYVD
        ++GHPL+RLGLR+RGLC  IVQV RRSCCTA  ASTPP G ++SA+NTT+VLSA+DPPKY RW+EP YRKWK+QEEEILSDI+P +SLTKEILHSNRYVD
Subjt:  VQGHPLLRLGLRHRGLCIRIVQVPRRSCCTATAASTPPDGNVTSADNTTAVLSASDPPKYPRWDEPDYRKWKDQEEEILSDIEPTVSLTKEILHSNRYVD

Query:  GERLTSEDEKIVVDRLLAHHPHAEDKIGCGLESIM
        GERLT EDEKIVVDRLLAHHPHAEDKIGCGLESIM
Subjt:  GERLTSEDEKIVVDRLLAHHPHAEDKIGCGLESIM

TrEMBL top hitse value%identityAlignment
A0A6J1C4J4 protein DCL, chloroplastic isoform X12.1e-6085.93Show/hide
Query:  VQGHPLLRLGLRHRGLCIRIVQVPRRSCCTATAASTPPDGNVTSADNTTAVLSASDPPKYPRWDEPDYRKWKDQEEEILSDIEPTVSLTKEILHSNRYVD
        ++GHPLLRLGLRHRGLC  IVQV RRSCCTATAA TPPDGN+TSA+N T+VLS+SDPPKY RWDEPDYRKWKDQEEE+L+DIEP +SLTKEILHSNRYVD
Subjt:  VQGHPLLRLGLRHRGLCIRIVQVPRRSCCTATAASTPPDGNVTSADNTTAVLSASDPPKYPRWDEPDYRKWKDQEEEILSDIEPTVSLTKEILHSNRYVD

Query:  GERLTSEDEKIVVDRLLAHHPHAEDKIGCGLESIM
        GERLTS DE+IVV+RLLAHHPHAEDKIGCGLESIM
Subjt:  GERLTSEDEKIVVDRLLAHHPHAEDKIGCGLESIM

A0A6J1C4K2 uncharacterized protein LOC111007322 isoform X21.6e-6085.29Show/hide
Query:  VQGHPLLRLGLRHRGLCIRIVQVPRRSCCTATAASTPPDGNVTSADNTTAVLSASDPPKYPRWDEPDYRKWKDQEEEILSDIEPTVSLTKEILHSNRYVD
        ++GHPLLRLGLRHRGLC  IVQV RRSCCTATAA TPPDGN+TSA+N T+VLS+SDPPKY RWDEPDYRKWKDQEEE+L+DIEP +SLTKEILHSNRYVD
Subjt:  VQGHPLLRLGLRHRGLCIRIVQVPRRSCCTATAASTPPDGNVTSADNTTAVLSASDPPKYPRWDEPDYRKWKDQEEEILSDIEPTVSLTKEILHSNRYVD

Query:  GERLTSEDEKIVVDRLLAHHPHAEDKIGCGLESIMH
        GERLTS DE+IVV+RLLAHHPHAEDKIGCGLESIM+
Subjt:  GERLTSEDEKIVVDRLLAHHPHAEDKIGCGLESIMH

A0A6J1FNQ4 protein DCL homolog, chloroplastic-like6.3e-5782.22Show/hide
Query:  VQGHPLLRLGLRHRGLCIRIVQVPRRSCCTATAASTPPDGNVTSADNTTAVLSASDPPKYPRWDEPDYRKWKDQEEEILSDIEPTVSLTKEILHSNRYVD
        ++GHPL+RLGLR+RGLC  IVQV RRSCCTA  ASTPP G ++SA+NTT+VLSA+DPPKY RW+EP YRKWK+QEEEILSDI+P +SLTKEILHSNRYVD
Subjt:  VQGHPLLRLGLRHRGLCIRIVQVPRRSCCTATAASTPPDGNVTSADNTTAVLSASDPPKYPRWDEPDYRKWKDQEEEILSDIEPTVSLTKEILHSNRYVD

Query:  GERLTSEDEKIVVDRLLAHHPHAEDKIGCGLESIM
        GERLT EDEKIVVDRLLAHHPHAEDKIGCGLESIM
Subjt:  GERLTSEDEKIVVDRLLAHHPHAEDKIGCGLESIM

A0A6J1J443 protein DCL, chloroplastic-like isoform X21.4e-5681.48Show/hide
Query:  VQGHPLLRLGLRHRGLCIRIVQVPRRSCCTATAASTPPDGNVTSADNTTAVLSASDPPKYPRWDEPDYRKWKDQEEEILSDIEPTVSLTKEILHSNRYVD
        ++GHPL+RLGLR+RGLC  I+QV RRSCCTA  ASTPP G ++SA+NTT+VLS +DPPKY RWDEP YRKWK+QEEEILSDI+P +SLTKEILHSNRYVD
Subjt:  VQGHPLLRLGLRHRGLCIRIVQVPRRSCCTATAASTPPDGNVTSADNTTAVLSASDPPKYPRWDEPDYRKWKDQEEEILSDIEPTVSLTKEILHSNRYVD

Query:  GERLTSEDEKIVVDRLLAHHPHAEDKIGCGLESIM
        GERLT EDEKIVVDRLLAHHPHAEDKIGCGLESIM
Subjt:  GERLTSEDEKIVVDRLLAHHPHAEDKIGCGLESIM

A0A6J1J4G4 protein DCL homolog, chloroplastic-like isoform X11.4e-5681.48Show/hide
Query:  VQGHPLLRLGLRHRGLCIRIVQVPRRSCCTATAASTPPDGNVTSADNTTAVLSASDPPKYPRWDEPDYRKWKDQEEEILSDIEPTVSLTKEILHSNRYVD
        ++GHPL+RLGLR+RGLC  I+QV RRSCCTA  ASTPP G ++SA+NTT+VLS +DPPKY RWDEP YRKWK+QEEEILSDI+P +SLTKEILHSNRYVD
Subjt:  VQGHPLLRLGLRHRGLCIRIVQVPRRSCCTATAASTPPDGNVTSADNTTAVLSASDPPKYPRWDEPDYRKWKDQEEEILSDIEPTVSLTKEILHSNRYVD

Query:  GERLTSEDEKIVVDRLLAHHPHAEDKIGCGLESIM
        GERLT EDEKIVVDRLLAHHPHAEDKIGCGLESIM
Subjt:  GERLTSEDEKIVVDRLLAHHPHAEDKIGCGLESIM

SwissProt top hitse value%identityAlignment
Q42463 Protein DCL, chloroplastic5.2e-0839.71Show/hide
Query:  DYRKWKDQEEEILSDIEPTVSLTKEILHSNRYVDGERLTSEDEKIVVDRLLAHHPHAEDKIGCGLESI
        D   W D E++IL D  P V   + ILHS +Y  G+RL+ + ++ ++ RLL +HP  + KIG G++ I
Subjt:  DYRKWKDQEEEILSDIEPTVSLTKEILHSNRYVDGERLTSEDEKIVVDRLLAHHPHAEDKIGCGLESI

Q5D869 DNA-directed RNA polymerase V subunit 14.4e-0735.14Show/hide
Query:  QEEEILSDIEPTVSLTKEILHSNRYVDGERLTSEDEKIVVDRLLAHHPHAEDKIGCGLESI---MHTSSSCCPC
        +E+E+LSD+EP +   ++I+H + Y DG+ ++ +D+  V++++L  HP  E K+G G++ I    HT  S   C
Subjt:  QEEEILSDIEPTVSLTKEILHSNRYVDGERLTSEDEKIVVDRLLAHHPHAEDKIGCGLESI---MHTSSSCCPC

Q9C642 Protein DCL homolog, chloroplastic3.0e-0842.86Show/hide
Query:  DQEEEILSDIEPTVSLTKEILHSNRYVDGERLTSEDEKIVVDRLLAHHPHAEDKIGCGLESIM
        D E++IL    P V   + ILHS +Y + +RL+ E E+ +++ LL +HP  E KIGCG++ IM
Subjt:  DQEEEILSDIEPTVSLTKEILHSNRYVDGERLTSEDEKIVVDRLLAHHPHAEDKIGCGLESIM

Arabidopsis top hitse value%identityAlignment
AT1G45230.1 Protein of unknown function (DUF3223)2.2e-0942.86Show/hide
Query:  DQEEEILSDIEPTVSLTKEILHSNRYVDGERLTSEDEKIVVDRLLAHHPHAEDKIGCGLESIM
        D E++IL    P V   + ILHS +Y + +RL+ E E+ +++ LL +HP  E KIGCG++ IM
Subjt:  DQEEEILSDIEPTVSLTKEILHSNRYVDGERLTSEDEKIVVDRLLAHHPHAEDKIGCGLESIM

AT1G45230.2 Protein of unknown function (DUF3223)2.2e-0942.86Show/hide
Query:  DQEEEILSDIEPTVSLTKEILHSNRYVDGERLTSEDEKIVVDRLLAHHPHAEDKIGCGLESIM
        D E++IL    P V   + ILHS +Y + +RL+ E E+ +++ LL +HP  E KIGCG++ IM
Subjt:  DQEEEILSDIEPTVSLTKEILHSNRYVDGERLTSEDEKIVVDRLLAHHPHAEDKIGCGLESIM

AT2G40030.1 nuclear RNA polymerase D1B3.1e-0835.14Show/hide
Query:  QEEEILSDIEPTVSLTKEILHSNRYVDGERLTSEDEKIVVDRLLAHHPHAEDKIGCGLESI---MHTSSSCCPC
        +E+E+LSD+EP +   ++I+H + Y DG+ ++ +D+  V++++L  HP  E K+G G++ I    HT  S   C
Subjt:  QEEEILSDIEPTVSLTKEILHSNRYVDGERLTSEDEKIVVDRLLAHHPHAEDKIGCGLESI---MHTSSSCCPC

AT3G46630.1 Protein of unknown function (DUF3223)5.9e-2354Show/hide
Query:  TPPDGNVTSADNTTAVLSASDPPKYPRWDEPDYRKWKDQEEEILSDIEPTVSLTKEILHSNRYVDGERLTSEDEKIVVDRLLAHHPHAEDKIGCGLESIM
        +P +G+  +A N T+ +  +      R+++PDYRKWK+ E EIL DIEP   L KEILHS+RY+DGERL  EDEKIV+++LL +HP+++DKIGCGL+ IM
Subjt:  TPPDGNVTSADNTTAVLSASDPPKYPRWDEPDYRKWKDQEEEILSDIEPTVSLTKEILHSNRYVDGERLTSEDEKIVVDRLLAHHPHAEDKIGCGLESIM

AT4G34150.1 Calcium-dependent lipid-binding (CaLB domain) family protein1.1e-2642.31Show/hide
Query:  LVGCYNLEDRWPVATEYSCVLLEYGGSTKRTKPCQGGGKHLVFEEKVVFEFTEGVRELKVAVWTSQPPGNDGVIGFLSVQLQQVLSDGYVDSTWTLQRKD
        +VGC  L+D    + +   V+LEYGG + RT+ C  GGK+ VF+EK +F   EG+R+LKVAVW S     D  IG  ++QLQ+VLS  Y D TWTLQ K 
Subjt:  LVGCYNLEDRWPVATEYSCVLLEYGGSTKRTKPCQGGGKHLVFEEKVVFEFTEGVRELKVAVWTSQPPGNDGVIGFLSVQLQQVLSDGYVDSTWTLQRKD

Query:  GRPAGHIRLILQFPSSSSTFQRQNSSYTAPSLNSAAPLTPNTTQPPILSQLLPYDS
        GR AG ++L+L +  +     ++++  +APS    AP  P  + PP  S   PY +
Subjt:  GRPAGHIRLILQFPSSSSTFQRQNSSYTAPSLNSAAPLTPNTTQPPILSQLLPYDS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATCGAAGACCAACACAAGCTACCGAAGCGACATCATGGGATTCTGGGGAGGAATGCAAAGGCCAACACAGCTGCTTGAGGTCACAGGCACTTGTTGGATGCTACAA
CTTGGAGGACAGATGGCCGGTTGCGACCGAATACTCCTGCGTCCTTCTTGAATATGGTGGTTCCACAAAGAGGACCAAGCCATGCCAAGGTGGAGGCAAACACCTTGTGT
TCGAGGAGAAAGTTGTTTTCGAATTCACTGAAGGAGTTCGAGAACTGAAAGTTGCAGTCTGGACCAGTCAACCCCCAGGAAACGATGGAGTCATTGGCTTCCTGAGTGTA
CAGCTCCAACAAGTCCTTTCCGATGGTTATGTTGACTCTACCTGGACTTTGCAGAGGAAAGATGGCAGGCCTGCAGGCCATATACGACTCATATTGCAATTTCCCAGTTC
CAGTTCTACATTTCAACGACAAAACTCAAGTTACACTGCGCCATCGTTGAATTCTGCTGCTCCTCTGACACCCAATACAACCCAACCGCCAATATTGTCCCAACTCCTAC
CGTATGATTCCTTTCGCCGACGTCGCCGGCAGTTCCTCGGCCAAATGCTACATATCCTTGCACCACGTATCCTCCCAACTCAGCTCCGCCACCACAACCTCTGTACCCTC
CCACTCAGCCAACGCAACCGAATGCCACCGGACCGGCCGCTCTATGGTCCCAACTCCTACCGTATGGTTCGTCTACGGCGGCAGTTCCTCCACCAGCTGCCCCTGCCTAC
AATGTTCCATATCCTTCAGACCCATACCCTAAGACTCAACCAACGAGCAGCTACTGTGCAAGCACCAGCCCGCCTGCAGGATACCCTCGGAATCCTCATCCTCATCCTCA
TCACCGTCAGTACCCAACTTATCCTCCGACCCCACCAGGCTTACGTTTGTGGGATTCACTCCGATGAGAACCAATTCCTGAAGCTTCCTCAAGAGCTTGCACCGTTCTGC
TATCGCTACAATGCCCAGGTTGGTACACTCGGGGGTCTTAACGAGGGCAGAACAATTATCAAGAACTGCGTTCATTCCTTTGGCTCCAAATGTGCAAGATCCGCACGAGA
GTTTTTTCAAACCCTTGCAGTTCTTGGCAAAAGCAGCCATGCCGACATCGGTCAATTCACGGCACGCACGAAGTTTGAGACGAGTCAAATTACGGCACCGAAGGGAAATG
AGAATAAGCGCGTCGTCTCCGATGCTCGCAGATCTGCGGTCACACTTCAAAGCAAGTTTGGTGACGGCATCGAATCGAGTAAAGAGGGAAGGTATCATGGAGGAAAGATC
TGCTTCTGCTTTAAGGGAGAGACGGTGACGGCTTTGTCCCTCAACTTTGAGCCAGCGCCGGCATACGAGAGAGCAGCCCTTCCGGTCGACGGAGCTAAGGGATTGGAAAA
TGCAAGCCAAGCACTCATCTGGAAGATCGGAAATGAAGTCGGAGGCTCCATGAACGATTTCCTGGAAGTCATCGGTTTCGTCCAGGGGCATCCTCTCCTTCGGTTAGGAC
TCAGGCACCGCGGGCTATGTATTCGGATCGTACAGGTGCCTCGTCGGTCTTGTTGCACTGCGACGGCGGCGTCTACTCCACCAGACGGCAACGTAACATCTGCTGACAAT
ACCACCGCAGTCTTGAGTGCCAGTGACCCACCCAAGTACCCAAGGTGGGATGAGCCTGATTATCGAAAGTGGAAGGACCAGGAAGAGGAAATTCTCAGCGACATCGAGCC
TACCGTATCCCTCACAAAAGAGATCCTCCACTCCAATAGGTATGTGGATGGGGAGCGATTGACATCTGAGGACGAGAAAATTGTGGTTGACAGGCTTCTTGCTCATCATC
CACATGCTGAAGATAAAATTGGATGTGGGCTCGAATCCATTATGCACACCTCCTCAAGTTGCTGCCCGTGTGGTTCAAGTAATGGCCTCCAATTGACACCCAACTCCTTC
CCAATAAGGGGCACATTTGAAGTGTGCTTGAAGTCAGCAAGCAAGACCATATTAAGAGCAGCTCATACCATTCTAACACTGAAGCTTCTGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGATCGAAGACCAACACAAGCTACCGAAGCGACATCATGGGATTCTGGGGAGGAATGCAAAGGCCAACACAGCTGCTTGAGGTCACAGGCACTTGTTGGATGCTACAA
CTTGGAGGACAGATGGCCGGTTGCGACCGAATACTCCTGCGTCCTTCTTGAATATGGTGGTTCCACAAAGAGGACCAAGCCATGCCAAGGTGGAGGCAAACACCTTGTGT
TCGAGGAGAAAGTTGTTTTCGAATTCACTGAAGGAGTTCGAGAACTGAAAGTTGCAGTCTGGACCAGTCAACCCCCAGGAAACGATGGAGTCATTGGCTTCCTGAGTGTA
CAGCTCCAACAAGTCCTTTCCGATGGTTATGTTGACTCTACCTGGACTTTGCAGAGGAAAGATGGCAGGCCTGCAGGCCATATACGACTCATATTGCAATTTCCCAGTTC
CAGTTCTACATTTCAACGACAAAACTCAAGTTACACTGCGCCATCGTTGAATTCTGCTGCTCCTCTGACACCCAATACAACCCAACCGCCAATATTGTCCCAACTCCTAC
CGTATGATTCCTTTCGCCGACGTCGCCGGCAGTTCCTCGGCCAAATGCTACATATCCTTGCACCACGTATCCTCCCAACTCAGCTCCGCCACCACAACCTCTGTACCCTC
CCACTCAGCCAACGCAACCGAATGCCACCGGACCGGCCGCTCTATGGTCCCAACTCCTACCGTATGGTTCGTCTACGGCGGCAGTTCCTCCACCAGCTGCCCCTGCCTAC
AATGTTCCATATCCTTCAGACCCATACCCTAAGACTCAACCAACGAGCAGCTACTGTGCAAGCACCAGCCCGCCTGCAGGATACCCTCGGAATCCTCATCCTCATCCTCA
TCACCGTCAGTACCCAACTTATCCTCCGACCCCACCAGGCTTACGTTTGTGGGATTCACTCCGATGAGAACCAATTCCTGAAGCTTCCTCAAGAGCTTGCACCGTTCTGC
TATCGCTACAATGCCCAGGTTGGTACACTCGGGGGTCTTAACGAGGGCAGAACAATTATCAAGAACTGCGTTCATTCCTTTGGCTCCAAATGTGCAAGATCCGCACGAGA
GTTTTTTCAAACCCTTGCAGTTCTTGGCAAAAGCAGCCATGCCGACATCGGTCAATTCACGGCACGCACGAAGTTTGAGACGAGTCAAATTACGGCACCGAAGGGAAATG
AGAATAAGCGCGTCGTCTCCGATGCTCGCAGATCTGCGGTCACACTTCAAAGCAAGTTTGGTGACGGCATCGAATCGAGTAAAGAGGGAAGGTATCATGGAGGAAAGATC
TGCTTCTGCTTTAAGGGAGAGACGGTGACGGCTTTGTCCCTCAACTTTGAGCCAGCGCCGGCATACGAGAGAGCAGCCCTTCCGGTCGACGGAGCTAAGGGATTGGAAAA
TGCAAGCCAAGCACTCATCTGGAAGATCGGAAATGAAGTCGGAGGCTCCATGAACGATTTCCTGGAAGTCATCGGTTTCGTCCAGGGGCATCCTCTCCTTCGGTTAGGAC
TCAGGCACCGCGGGCTATGTATTCGGATCGTACAGGTGCCTCGTCGGTCTTGTTGCACTGCGACGGCGGCGTCTACTCCACCAGACGGCAACGTAACATCTGCTGACAAT
ACCACCGCAGTCTTGAGTGCCAGTGACCCACCCAAGTACCCAAGGTGGGATGAGCCTGATTATCGAAAGTGGAAGGACCAGGAAGAGGAAATTCTCAGCGACATCGAGCC
TACCGTATCCCTCACAAAAGAGATCCTCCACTCCAATAGGTATGTGGATGGGGAGCGATTGACATCTGAGGACGAGAAAATTGTGGTTGACAGGCTTCTTGCTCATCATC
CACATGCTGAAGATAAAATTGGATGTGGGCTCGAATCCATTATGCACACCTCCTCAAGTTGCTGCCCGTGTGGTTCAAGTAATGGCCTCCAATTGACACCCAACTCCTTC
CCAATAAGGGGCACATTTGAAGTGTGCTTGAAGTCAGCAAGCAAGACCATATTAAGAGCAGCTCATACCATTCTAACACTGAAGCTTCTGTGA
Protein sequenceShow/hide protein sequence
MDRRPTQATEATSWDSGEECKGQHSCLRSQALVGCYNLEDRWPVATEYSCVLLEYGGSTKRTKPCQGGGKHLVFEEKVVFEFTEGVRELKVAVWTSQPPGNDGVIGFLSV
QLQQVLSDGYVDSTWTLQRKDGRPAGHIRLILQFPSSSSTFQRQNSSYTAPSLNSAAPLTPNTTQPPILSQLLPYDSFRRRRRQFLGQMLHILAPRILPTQLRHHNLCTL
PLSQRNRMPPDRPLYGPNSYRMVRLRRQFLHQLPLPTMFHILQTHTLRLNQRAATVQAPARLQDTLGILILILITVSTQLILRPHQAYVCGIHSDENQFLKLPQELAPFC
YRYNAQVGTLGGLNEGRTIIKNCVHSFGSKCARSAREFFQTLAVLGKSSHADIGQFTARTKFETSQITAPKGNENKRVVSDARRSAVTLQSKFGDGIESSKEGRYHGGKI
CFCFKGETVTALSLNFEPAPAYERAALPVDGAKGLENASQALIWKIGNEVGGSMNDFLEVIGFVQGHPLLRLGLRHRGLCIRIVQVPRRSCCTATAASTPPDGNVTSADN
TTAVLSASDPPKYPRWDEPDYRKWKDQEEEILSDIEPTVSLTKEILHSNRYVDGERLTSEDEKIVVDRLLAHHPHAEDKIGCGLESIMHTSSSCCPCGSSNGLQLTPNSF
PIRGTFEVCLKSASKTILRAAHTILTLKLL