CuGenDBv2

Sgr013162 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr013162
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: gag_pre-integrs domain-containing protein
Genome location: tig00153705:100557..103372
RNA-Seq Expression: Sgr013162
Synteny: Sgr013162
Gene Ontology terms:
GO:0009987 - cellular process (biological process)
GO:0110165 - cellular anatomical structure (cellular component)
GO:0097159 - organic cyclic compound binding (molecular function)
GO:1901363 - heterocyclic compound binding (molecular function)
InterPro domains: NA
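
The Genome location field follows a contig:start..end pattern. As a minimal sketch, the span of the locus can be computed from that string. The helper name `parse_location` is hypothetical and not part of CuGenDB, and the coordinates are assumed to be 1-based and inclusive, which this page does not state.

```python
import re

def parse_location(location):
    """Split a 'contig:start..end' string into (contig, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    contig, start, end = match.groups()
    return contig, int(start), int(end)

contig, start, end = parse_location("tig00153705:100557..103372")
# Assumption: 1-based inclusive coordinates, so length = end - start + 1.
span_length = end - start + 1
print(contig, start, end, span_length)
```

Under that assumption the locus spans 2816 bp on the contig, which is longer than the CDS listed under Sequences.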


Homology
GenBank top hits (e value, % identity, alignment)
KAA8532467.1 hypothetical protein F0562_032500 [Nyssa sinensis] (e value: 5.7e-60; % identity: 46.71)
Query:  SLVNQLSSIEMPLGHEMQSLLLLSSLPNSWETLVVTLSNSASDGKLTMSVVKDALFNEGARR--------------------------------------
        S+VNQL++ +M +  E+Q+ LLLSSL NSWETLVVT+SNSA DGKL+M  V  +LFN+  RR                                      
Subjt:  SLVNQLSSIEMPLGHEMQSLLLLSSLPNSWETLVVTLSNSASDGKLTMSVVKDALFNEGARR--------------------------------------

Query:  ------------KDGHMKKNCYKWLEERGQSNSQLK-NKGRETLITVLGD-VAYCSTHDKTCLYVSREDMEWVVDTTASYH-----DYFTTYKAGDFGTV
                    K+GHMKKNCY W  E+ + N+ LK NK + T+ T   D VA  S   + CL+V  E +EWVVDT ASY+     ++FT+YKAGDFGTV
Subjt:  ------------KDGHMKKNCYKWLEERGQSNSQLK-NKGRETLITVLGD-VAYCSTHDKTCLYVSREDMEWVVDTTASYH-----DYFTTYKAGDFGTV

Query:  KMENSSSSGIVGIGDVQIKTSAGSTIILKDVRHMPDLQLNLLSMISLDRVGYDNHFSTGTWKLSKGILTINQGHICGTLYKTHVKICTDSLN-VTDEAFQ
        KM N+S S IVGIGDV I+TS G  + L+D+RH+PDL+LNL+S I+LDR GY N+F  G WKL+KG L + +G  C  LYKT++K+C D LN + D    
Subjt:  KMENSSSSGIVGIGDVQIKTSAGSTIILKDVRHMPDLQLNLLSMISLDRVGYDNHFSTGTWKLSKGILTINQGHICGTLYKTHVKICTDSLN-VTDEAFQ

Query:  NLWH
        NLWH
Subjt:  NLWH

KAA8532543.1 hypothetical protein F0562_032641 [Nyssa sinensis] (e value: 3.0e-61; % identity: 47.3)
Query:  SLVNQLSSIEMPLGHEMQSLLLLSSLPNSWETLVVTLSNSASDGKLTMSVVKDALFNEGARR--------------------------------------
        S+VNQL++++M +  E+Q+LLLLSSLP+SWETLVVT+SNSA DGKL++  V  +LFN+  RR                                      
Subjt:  SLVNQLSSIEMPLGHEMQSLLLLSSLPNSWETLVVTLSNSASDGKLTMSVVKDALFNEGARR--------------------------------------

Query:  ------------KDGHMKKNCYKW-LEERGQSNSQLKNKGRETLITVLGD-VAYCSTHDKTCLYVSREDMEWVVDTTASYH-----DYFTTYKAGDFGTV
                    K+GH+K+NCY W  E++ + N+Q KNK + T  T  GD V   S  ++ CL+V  E +EWVVDT ASYH     ++FT+YKAGDFGTV
Subjt:  ------------KDGHMKKNCYKW-LEERGQSNSQLKNKGRETLITVLGD-VAYCSTHDKTCLYVSREDMEWVVDTTASYH-----DYFTTYKAGDFGTV

Query:  KMENSSSSGIVGIGDVQIKTSAGSTIILKDVRHMPDLQLNLLSMISLDRVGYDNHFSTGTWKLSKGILTINQGHICGTLYKTHVKICTDSLNVTDE
        KM N+S S IVGIGDV I+TS G T+ L+DVRH+PDL+LNL+S I+LDR GY N+F  GTWKL+KG L + +G  C TLYKT++K+C D LN  ++
Subjt:  KMENSSSSGIVGIGDVQIKTSAGSTIILKDVRHMPDLQLNLLSMISLDRVGYDNHFSTGTWKLSKGILTINQGHICGTLYKTHVKICTDSLNVTDE

KAA8532769.1 hypothetical protein F0562_032802 [Nyssa sinensis] (e value: 8.8e-61; % identity: 46.96)
Query:  SLVNQLSSIEMPLGHEMQSLLLLSSLPNSWETLVVTLSNSASDGKLTMSVVKDALFNEGARR--------------------------------------
        S+VNQL++++M +  E+Q+LLLLSSLP+SWETLVVT+SNSA DGKL+M  V  +LFNE  RR                                      
Subjt:  SLVNQLSSIEMPLGHEMQSLLLLSSLPNSWETLVVTLSNSASDGKLTMSVVKDALFNEGARR--------------------------------------

Query:  ------------KDGHMKKNCYKW-LEERGQSNSQLKNKGRETLITVLGD-VAYCSTHDKTCLYVSREDMEWVVDTTASYH-----DYFTTYKAGDFGTV
                    K+GH+K+NCY W  E++ ++N+Q KN+ + T  T  GD V   S  ++ CL+V  E +EWVVDT ASYH     ++FT+YKAGDFGTV
Subjt:  ------------KDGHMKKNCYKW-LEERGQSNSQLKNKGRETLITVLGD-VAYCSTHDKTCLYVSREDMEWVVDTTASYH-----DYFTTYKAGDFGTV

Query:  KMENSSSSGIVGIGDVQIKTSAGSTIILKDVRHMPDLQLNLLSMISLDRVGYDNHFSTGTWKLSKGILTINQGHICGTLYKTHVKICTDSLNVTDE
        KM N+S S IVGIGDV I+TS G  + L+DVRH+PDL+LNL+S I+LDR GY N+F  G WKL+KG L + +G  C TLYKT++K+C D LN  ++
Subjt:  KMENSSSSGIVGIGDVQIKTSAGSTIILKDVRHMPDLQLNLLSMISLDRVGYDNHFSTGTWKLSKGILTINQGHICGTLYKTHVKICTDSLNVTDE

KAA8550233.1 hypothetical protein F0562_001917 [Nyssa sinensis] (e value: 1.5e-68; % identity: 56.25)
Query:  SLVNQLSSIEMPLGHEMQSLLLLSSLPNSWETLVVTLSNSASDGKLTMSVVKDALFNEGARRK--DGHMKKNCYKW-LEERGQSNSQLKNKGRETLITVL
        S+VNQL++++M +  E+Q+LLLLSSLP+SWETLVVT+SNSA DGKL+M  V  +LFNE  RRK  +GH+K+NCY W  E++ ++N+Q KN+ + T  T  
Subjt:  SLVNQLSSIEMPLGHEMQSLLLLSSLPNSWETLVVTLSNSASDGKLTMSVVKDALFNEGARRK--DGHMKKNCYKW-LEERGQSNSQLKNKGRETLITVL

Query:  GD-VAYCSTHDKTCLYVSREDMEWVVDTTASYH-----DYFTTYKAGDFGTVKMENSSSSGIVGIGDVQIKTSAGSTIILKDVRHMPDLQLNLLSMISLD
        GD V   S  ++ CL+V  E +EWVVDT ASYH     ++FT+YKAGDFGTVKM N+S S IVGIGDV I+TS G  + L+DVRH+PDL+LNL+S I+LD
Subjt:  GD-VAYCSTHDKTCLYVSREDMEWVVDTTASYH-----DYFTTYKAGDFGTVKMENSSSSGIVGIGDVQIKTSAGSTIILKDVRHMPDLQLNLLSMISLD

Query:  RVGYDNHFSTGTWKLSKGILTINQGHICGTLYKTHVKICTDSLN-VTDEAFQNLWH
        R GY N+F  G WKL+KG L + +G  C TLYKT++K+C D LN + D    NLWH
Subjt:  RVGYDNHFSTGTWKLSKGILTINQGHICGTLYKTHVKICTDSLN-VTDEAFQNLWH

KAF8387595.1 hypothetical protein HHK36_026248 [Tetracentron sinense] (e value: 8.0e-62; % identity: 50.94)
Query:  GPVDE-----ASLVNQLSSIEMPLGHEMQSLLLLSSLPNSWETLVVTLSNSASDGKLTMSVVKDALFNEGARRKDGHMKKNCYKWLEERGQSNS-QLKNK
        GP+ E       +VNQL+++++ L  E+Q+LLLLSSLP+SWETLVVT+SNSA +G+L+MS+V ++LFNE  RRK           +E RG+  S Q K+ 
Subjt:  GPVDE-----ASLVNQLSSIEMPLGHEMQSLLLLSSLPNSWETLVVTLSNSASDGKLTMSVVKDALFNEGARRKDGHMKKNCYKWLEERGQSNS-QLKNK

Query:  GRETLITVLG-DVAYCSTHDKTCLYVSREDMEWVVDTTASYH-----DYFTTYKAGDFGTVKMENSSSSGIVGIGDVQIKTSAGSTIILKDVRHMPDLQL
         + T  T+   DV   S  ++ CL+V+ +D+EWVVDT ASYH     ++FT+YKAGDFGTVKM N+S S I+GIGDV I+T+ G T+ +KDVRH+PDL+L
Subjt:  GRETLITVLG-DVAYCSTHDKTCLYVSREDMEWVVDTTASYH-----DYFTTYKAGDFGTVKMENSSSSGIVGIGDVQIKTSAGSTIILKDVRHMPDLQL

Query:  NLLSMISLDRVGYDNHFSTGTWKLSKGILTINQGHICGTLYKTHVKICTDSLNVT-DEAFQNLWH
        NL+S I+LDR GY+N+F  G WKL+ G L + +G  C TLYKT  K+C + LN   D +  NLWH
Subjt:  NLLSMISLDRVGYDNHFSTGTWKLSKGILTINQGHICGTLYKTHVKICTDSLNVT-DEAFQNLWH

TrEMBL top hits (e value, % identity, alignment)
A0A5J5ANG2 CCHC-type domain-containing protein (e value: 1.5e-61; % identity: 47.3)
Query:  SLVNQLSSIEMPLGHEMQSLLLLSSLPNSWETLVVTLSNSASDGKLTMSVVKDALFNEGARR--------------------------------------
        S+VNQL++++M +  E+Q+LLLLSSLP+SWETLVVT+SNSA DGKL++  V  +LFN+  RR                                      
Subjt:  SLVNQLSSIEMPLGHEMQSLLLLSSLPNSWETLVVTLSNSASDGKLTMSVVKDALFNEGARR--------------------------------------

Query:  ------------KDGHMKKNCYKW-LEERGQSNSQLKNKGRETLITVLGD-VAYCSTHDKTCLYVSREDMEWVVDTTASYH-----DYFTTYKAGDFGTV
                    K+GH+K+NCY W  E++ + N+Q KNK + T  T  GD V   S  ++ CL+V  E +EWVVDT ASYH     ++FT+YKAGDFGTV
Subjt:  ------------KDGHMKKNCYKW-LEERGQSNSQLKNKGRETLITVLGD-VAYCSTHDKTCLYVSREDMEWVVDTTASYH-----DYFTTYKAGDFGTV

Query:  KMENSSSSGIVGIGDVQIKTSAGSTIILKDVRHMPDLQLNLLSMISLDRVGYDNHFSTGTWKLSKGILTINQGHICGTLYKTHVKICTDSLNVTDE
        KM N+S S IVGIGDV I+TS G T+ L+DVRH+PDL+LNL+S I+LDR GY N+F  GTWKL+KG L + +G  C TLYKT++K+C D LN  ++
Subjt:  KMENSSSSGIVGIGDVQIKTSAGSTIILKDVRHMPDLQLNLLSMISLDRVGYDNHFSTGTWKLSKGILTINQGHICGTLYKTHVKICTDSLNVTDE

A0A5J5AR66 CCHC-type domain-containing protein (e value: 4.3e-61; % identity: 46.96)
Query:  SLVNQLSSIEMPLGHEMQSLLLLSSLPNSWETLVVTLSNSASDGKLTMSVVKDALFNEGARR--------------------------------------
        S+VNQL++++M +  E+Q+LLLLSSLP+SWETLVVT+SNSA DGKL+M  V  +LFNE  RR                                      
Subjt:  SLVNQLSSIEMPLGHEMQSLLLLSSLPNSWETLVVTLSNSASDGKLTMSVVKDALFNEGARR--------------------------------------

Query:  ------------KDGHMKKNCYKW-LEERGQSNSQLKNKGRETLITVLGD-VAYCSTHDKTCLYVSREDMEWVVDTTASYH-----DYFTTYKAGDFGTV
                    K+GH+K+NCY W  E++ ++N+Q KN+ + T  T  GD V   S  ++ CL+V  E +EWVVDT ASYH     ++FT+YKAGDFGTV
Subjt:  ------------KDGHMKKNCYKW-LEERGQSNSQLKNKGRETLITVLGD-VAYCSTHDKTCLYVSREDMEWVVDTTASYH-----DYFTTYKAGDFGTV

Query:  KMENSSSSGIVGIGDVQIKTSAGSTIILKDVRHMPDLQLNLLSMISLDRVGYDNHFSTGTWKLSKGILTINQGHICGTLYKTHVKICTDSLNVTDE
        KM N+S S IVGIGDV I+TS G  + L+DVRH+PDL+LNL+S I+LDR GY N+F  G WKL+KG L + +G  C TLYKT++K+C D LN  ++
Subjt:  KMENSSSSGIVGIGDVQIKTSAGSTIILKDVRHMPDLQLNLLSMISLDRVGYDNHFSTGTWKLSKGILTINQGHICGTLYKTHVKICTDSLNVTDE

A0A5J5AST7 Uncharacterized protein (e value: 2.8e-60; % identity: 46.71)
Query:  SLVNQLSSIEMPLGHEMQSLLLLSSLPNSWETLVVTLSNSASDGKLTMSVVKDALFNEGARR--------------------------------------
        S+VNQL++ +M +  E+Q+ LLLSSL NSWETLVVT+SNSA DGKL+M  V  +LFN+  RR                                      
Subjt:  SLVNQLSSIEMPLGHEMQSLLLLSSLPNSWETLVVTLSNSASDGKLTMSVVKDALFNEGARR--------------------------------------

Query:  ------------KDGHMKKNCYKWLEERGQSNSQLK-NKGRETLITVLGD-VAYCSTHDKTCLYVSREDMEWVVDTTASYH-----DYFTTYKAGDFGTV
                    K+GHMKKNCY W  E+ + N+ LK NK + T+ T   D VA  S   + CL+V  E +EWVVDT ASY+     ++FT+YKAGDFGTV
Subjt:  ------------KDGHMKKNCYKWLEERGQSNSQLK-NKGRETLITVLGD-VAYCSTHDKTCLYVSREDMEWVVDTTASYH-----DYFTTYKAGDFGTV

Query:  KMENSSSSGIVGIGDVQIKTSAGSTIILKDVRHMPDLQLNLLSMISLDRVGYDNHFSTGTWKLSKGILTINQGHICGTLYKTHVKICTDSLN-VTDEAFQ
        KM N+S S IVGIGDV I+TS G  + L+D+RH+PDL+LNL+S I+LDR GY N+F  G WKL+KG L + +G  C  LYKT++K+C D LN + D    
Subjt:  KMENSSSSGIVGIGDVQIKTSAGSTIILKDVRHMPDLQLNLLSMISLDRVGYDNHFSTGTWKLSKGILTINQGHICGTLYKTHVKICTDSLN-VTDEAFQ

Query:  NLWH
        NLWH
Subjt:  NLWH

A0A5J5BW15 CCHC-type domain-containing protein (e value: 3.6e-60; % identity: 46.28)
Query:  SLVNQLSSIEMPLGHEMQSLLLLSSLPNSWETLVVTLSNSASDGKLTMSVVKDALFNEGARR--------------------------------------
        S+VNQL++++M +  E+Q+LLLLSSLP+SWETLVVT+SNSA DGKL+M  V  +LFNE  RR                                      
Subjt:  SLVNQLSSIEMPLGHEMQSLLLLSSLPNSWETLVVTLSNSASDGKLTMSVVKDALFNEGARR--------------------------------------

Query:  ------------KDGHMKKNCYKW-LEERGQSNSQLKNKGRETLITVLGD-VAYCSTHDKTCLYVSREDMEWVVDTTASYH-----DYFTTYKAGDFGTV
                    K+GH+K+NCY W  E++ ++N+Q KN+ + T  T  GD V   S  ++ CL+V  E ++WVVDT ASYH     ++FT+YKAGDFGTV
Subjt:  ------------KDGHMKKNCYKW-LEERGQSNSQLKNKGRETLITVLGD-VAYCSTHDKTCLYVSREDMEWVVDTTASYH-----DYFTTYKAGDFGTV

Query:  KMENSSSSGIVGIGDVQIKTSAGSTIILKDVRHMPDLQLNLLSMISLDRVGYDNHFSTGTWKLSKGILTINQGHICGTLYKTHVKICTDSLNVTDE
        KM N+S S IVGIGDV I+TS G  + L+DVRH+P+L+LNL+S I+LDR GY N+F  G WKL+KG L + +G  C TLYKT++K+C D LN  ++
Subjt:  KMENSSSSGIVGIGDVQIKTSAGSTIILKDVRHMPDLQLNLLSMISLDRVGYDNHFSTGTWKLSKGILTINQGHICGTLYKTHVKICTDSLNVTDE

A0A5J5C4A8 gag_pre-integrs domain-containing protein (e value: 7.3e-69; % identity: 56.25)
Query:  SLVNQLSSIEMPLGHEMQSLLLLSSLPNSWETLVVTLSNSASDGKLTMSVVKDALFNEGARRK--DGHMKKNCYKW-LEERGQSNSQLKNKGRETLITVL
        S+VNQL++++M +  E+Q+LLLLSSLP+SWETLVVT+SNSA DGKL+M  V  +LFNE  RRK  +GH+K+NCY W  E++ ++N+Q KN+ + T  T  
Subjt:  SLVNQLSSIEMPLGHEMQSLLLLSSLPNSWETLVVTLSNSASDGKLTMSVVKDALFNEGARRK--DGHMKKNCYKW-LEERGQSNSQLKNKGRETLITVL

Query:  GD-VAYCSTHDKTCLYVSREDMEWVVDTTASYH-----DYFTTYKAGDFGTVKMENSSSSGIVGIGDVQIKTSAGSTIILKDVRHMPDLQLNLLSMISLD
        GD V   S  ++ CL+V  E +EWVVDT ASYH     ++FT+YKAGDFGTVKM N+S S IVGIGDV I+TS G  + L+DVRH+PDL+LNL+S I+LD
Subjt:  GD-VAYCSTHDKTCLYVSREDMEWVVDTTASYH-----DYFTTYKAGDFGTVKMENSSSSGIVGIGDVQIKTSAGSTIILKDVRHMPDLQLNLLSMISLD

Query:  RVGYDNHFSTGTWKLSKGILTINQGHICGTLYKTHVKICTDSLN-VTDEAFQNLWH
        R GY N+F  G WKL+KG L + +G  C TLYKT++K+C D LN + D    NLWH
Subjt:  RVGYDNHFSTGTWKLSKGILTINQGHICGTLYKTHVKICTDSLN-VTDEAFQNLWH

SwissProt top hits (e value, % identity, alignment)
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 2.8e-41; % identity: 34.33)
Query:  LVNQLSSIEMPLGHEMQSLLLLSSLPNSWETLVVTLSNSASDGKLTMSVVKDALFNEGARRK--------------------------------------
        L+ QL+++ + +  E +++LLL+SLP+S++ L  T+ +  +  +L   V    L NE  R+K                                      
Subjt:  LVNQLSSIEMPLGHEMQSLLLLSSLPNSWETLVVTLSNSASDGKLTMSVVKDALFNEGARRK--------------------------------------

Query:  -----------DGHMKKNCYKWLEERGQSNSQLKNKGRETLITVLGDVAYCSTHDKTCLYVSREDMEWVVDTTASYH-----DYFTTYKAGDFGTVKMEN
                    GH K++C    + +G+++ Q  +     ++    +V      ++ C+++S  + EWVVDT AS+H     D F  Y AGDFGTVKM N
Subjt:  -----------DGHMKKNCYKWLEERGQSNSQLKNKGRETLITVLGDVAYCSTHDKTCLYVSREDMEWVVDTTASYH-----DYFTTYKAGDFGTVKMEN

Query:  SSSSGIVGIGDVQIKTSAGSTIILKDVRHMPDLQLNLLSMISLDRVGYDNHFSTGTWKLSKGILTINQGHICGTLYKTHVKICTDSLNVT-DEAFQNLWH
        +S S I GIGD+ IKT+ G T++LKDVRH+PDL++NL+S I+LDR GY+++F+   W+L+KG L I +G   GTLY+T+ +IC   LN   DE   +LWH
Subjt:  SSSSGIVGIGDVQIKTSAGSTIILKDVRHMPDLQLNLLSMISLDRVGYDNHFSTGTWKLSKGILTINQGHICGTLYKTHVKICTDSLNVT-DEAFQNLWH

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGACCACAGAAGAGTCAATATCATCTTCAGGAGCTATGATTATGCTCACAGCTACTAACTACATGCTGTGGAAACCTCGGATGGAAGATTTCCTCACTTGTCTTCCACC
ATGTCGCACAGGAGACAAACACATATGCCCTCTGGAAGAAGTTGGAGGCCATGTACCAGGCCAAGACCGCTCGAAACAAGGCCCTGTTGATGAGGCGAGTCTGGTGAACC
AATTATCGTCTATTGAGATGCCGCTTGGACATGAAATGCAGTCTCTACTACTTCTAAGTTCACTTCCTAATAGCTGGGAAACACTTGTTGTTACTCTCAGCAACTCAGCC
TCTGATGGCAAACTTACCATGTCTGTGGTTAAGGATGCTCTGTTCAATGAAGGGGCCAGAAGGAAAGACGGTCATATGAAGAAGAACTGCTACAAATGGCTAGAGGAGCG
AGGTCAGAGCAATTCTCAATTGAAGAACAAGGGTAGAGAAACACTAATCACTGTTTTAGGAGATGTAGCGTATTGTTCAACCCATGATAAGACATGCCTTTATGTCTCAA
GAGAAGACATGGAATGGGTGGTAGATACTACAGCATCCTACCACGACTACTTCACAACATACAAAGCAGGAGACTTTGGAACAGTGAAGATGGAAAATTCCAGTTCCTCT
GGAATAGTAGGAATTGGTGATGTCCAGATAAAGACAAGTGCGGGGAGCACAATTATTCTGAAGGATGTCAGACATATGCCAGATCTTCAGCTCAATCTGTTGTCAATGAT
ATCCCTTGACAGAGTAGGGTATGATAATCATTTCAGCACAGGCACATGGAAGCTGTCAAAGGGCATTTTGACAATCAATCAAGGACACATTTGTGGGACGTTGTACAAAA
CTCATGTGAAAATTTGTACAGACAGCCTCAATGTTACAGATGAGGCTTTTCAAAATTTATGGCATTAG
mRNA sequence
ATGACCACAGAAGAGTCAATATCATCTTCAGGAGCTATGATTATGCTCACAGCTACTAACTACATGCTGTGGAAACCTCGGATGGAAGATTTCCTCACTTGTCTTCCACC
ATGTCGCACAGGAGACAAACACATATGCCCTCTGGAAGAAGTTGGAGGCCATGTACCAGGCCAAGACCGCTCGAAACAAGGCCCTGTTGATGAGGCGAGTCTGGTGAACC
AATTATCGTCTATTGAGATGCCGCTTGGACATGAAATGCAGTCTCTACTACTTCTAAGTTCACTTCCTAATAGCTGGGAAACACTTGTTGTTACTCTCAGCAACTCAGCC
TCTGATGGCAAACTTACCATGTCTGTGGTTAAGGATGCTCTGTTCAATGAAGGGGCCAGAAGGAAAGACGGTCATATGAAGAAGAACTGCTACAAATGGCTAGAGGAGCG
AGGTCAGAGCAATTCTCAATTGAAGAACAAGGGTAGAGAAACACTAATCACTGTTTTAGGAGATGTAGCGTATTGTTCAACCCATGATAAGACATGCCTTTATGTCTCAA
GAGAAGACATGGAATGGGTGGTAGATACTACAGCATCCTACCACGACTACTTCACAACATACAAAGCAGGAGACTTTGGAACAGTGAAGATGGAAAATTCCAGTTCCTCT
GGAATAGTAGGAATTGGTGATGTCCAGATAAAGACAAGTGCGGGGAGCACAATTATTCTGAAGGATGTCAGACATATGCCAGATCTTCAGCTCAATCTGTTGTCAATGAT
ATCCCTTGACAGAGTAGGGTATGATAATCATTTCAGCACAGGCACATGGAAGCTGTCAAAGGGCATTTTGACAATCAATCAAGGACACATTTGTGGGACGTTGTACAAAA
CTCATGTGAAAATTTGTACAGACAGCCTCAATGTTACAGATGAGGCTTTTCAAAATTTATGGCATTAG
Protein sequence
MTTEESISSSGAMIMLTATNYMLWKPRMEDFLTCLPPCRTGDKHICPLEEVGGHVPGQDRSKQGPVDEASLVNQLSSIEMPLGHEMQSLLLLSSLPNSWETLVVTLSNSA
SDGKLTMSVVKDALFNEGARRKDGHMKKNCYKWLEERGQSNSQLKNKGRETLITVLGDVAYCSTHDKTCLYVSREDMEWVVDTTASYHDYFTTYKAGDFGTVKMENSSSS
GIVGIGDVQIKTSAGSTIILKDVRHMPDLQLNLLSMISLDRVGYDNHFSTGTWKLSKGILTINQGHICGTLYKTHVKICTDSLNVTDEAFQNLWH
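
The protein sequence above should correspond to a translation of the CDS. A minimal sketch for checking this is given below. It assumes the standard genetic code applies to this gene, and `translate` is a hypothetical helper written for illustration. Only the first 30 nucleotides of the CDS are used here for brevity; the full CDS can be passed in the same way.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS (DNA, 5'->3') into protein, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped sequences
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

cds_start = "ATGACCACAGAAGAGTCAATATCATCTTCA"  # first 30 nt of the CDS above
print(translate(cds_start))  # MTTEESISSS
```

The result matches the first ten residues of the protein sequence listed on this page.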