; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmaCh04G019090 (gene) of Cucurbita maxima (Rimu) v1.1 genome

Gene IDCmaCh04G019090
OrganismCucurbita maxima Rimu (Cucurbita maxima (Rimu) v1.1)
DescriptionGag/pol protein
Genome locationCma_Chr04:10073878..10074744
RNA-Seq ExpressionCmaCh04G019090
SyntenyCmaCh04G019090
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADJ18449.1 gag/pol protein, partial [Bryonia dioica]3.5e-9669.57Show/hide
Query:  SIVQLLASEKLNGDNYATWKSNLNTILVIDDLRFVLTGECSPNPSSNANRTVRDAYDRWIKENDKARVYILASISDVLAKKHDVMGTAKEIMESLKGMFG
        SIVQLLASEKLNGDNY+ WKSNLNTILV+DDLRFVLT EC   P+ NANRTVR+AYDRW+K NDKARVYILAS++DVLAKKHD + TAK IM+SL+ MFG
Subjt:  SIVQLLASEKLNGDNYATWKSNLNTILVIDDLRFVLTGECSPNPSSNANRTVRDAYDRWIKENDKARVYILASISDVLAKKHDVMGTAKEIMESLKGMFG

Query:  QSSFSLRHEAIKYIYNCRMKEGSLVREHVLDMMVHFNVTEDNEVVIDEKSQVSFLMMSLSKSFFHFRTNVVMNKIEYNLTALLNELQTYQSLLMNKGQTG
        Q S+SLRHEAIK+IY  RMKEG+ VREHVLDMM+HFN+ E N   IDE +QVSF++ SL KSF  F+TN  +NKIE+NLT LLNELQ +Q+L ++KG+  
Subjt:  QSSFSLRHEAIKYIYNCRMKEGSLVREHVLDMMVHFNVTEDNEVVIDEKSQVSFLMMSLSKSFFHFRTNVVMNKIEYNLTALLNELQTYQSLLMNKGQTG

Query:  EANVAISK-KLLRGSSSKNKSRPSTSKSVLMKKNGKGKNKIPTNRKHKVQKADKGKCFHCNENGHWKRNCPKYLAE
        EANVA++K K +RGSSSKNK  PS ++   MKK GKG  K P   K K + ADKGKCFHCN++GHWKRNCPKYLAE
Subjt:  EANVAISK-KLLRGSSSKNKSRPSTSKSVLMKKNGKGKNKIPTNRKHKVQKADKGKCFHCNENGHWKRNCPKYLAE

KAA0048103.1 gag/pol protein [Cucumis melo var. makuwa]2.0e-8865.82Show/hide
Query:  IVQLLASEKLNGDNYATWKSNLNTILVIDDLRFVLTGECSPNPSSNANRTVRDAYDRWIKENDKARVYILASISDVLAKKHDVMGTAKEIMESLKGMFGQ
        IVQLLASEKLN DNY TWKSNLNTILV+DDLRFVLT EC   P+SNANRT R+AYDRWIK N+KARVYILAS+SDVLAKKH+ + TAKEIM+SLKGMFGQ
Subjt:  IVQLLASEKLNGDNYATWKSNLNTILVIDDLRFVLTGECSPNPSSNANRTVRDAYDRWIKENDKARVYILASISDVLAKKHDVMGTAKEIMESLKGMFGQ

Query:  SSFSLRHEAIKYIYNCRMKEGSLVREHVLDMMVHFNVTEDNEVVIDEKSQVSFLMMSLSKSFFHFRTNVVMNKIEYNLTALLNELQTYQSLLMNKGQTGE
          +SLRH+AIKYIY  RMKEG+ +REHVL MM+HFN+ E N   IDE +QVSF++ SL KSF  F+TNV +NKIE+NLT LLNELQ +Q+L M K +  E
Subjt:  SSFSLRHEAIKYIYNCRMKEGSLVREHVLDMMVHFNVTEDNEVVIDEKSQVSFLMMSLSKSFFHFRTNVVMNKIEYNLTALLNELQTYQSLLMNKGQTGE

Query:  ANVAISK-KLLRGSSSKNKSRPSTSKSVLMKKNGKGKNKIPTNRKHKVQKADKGKCFHCNENGHWKRNCPKYLAE
        AN+A +K K  RGSSSK+K  PS     + K   KGK K P   K K +   KGKC+HC ENGH   NCPKYL +
Subjt:  ANVAISK-KLLRGSSSKNKSRPSTSKSVLMKKNGKGKNKIPTNRKHKVQKADKGKCFHCNENGHWKRNCPKYLAE

TYK01653.1 gag/pol protein [Cucumis melo var. makuwa]2.4e-8966.19Show/hide
Query:  NSIVQLLASEKLNGDNYATWKSNLNTILVIDDLRFVLTGECSPNPSSNANRTVRDAYDRWIKENDKARVYILASISDVLAKKHDVMGTAKEIMESLKGMF
        NSIVQLLASEKLN +NYA WKSNLNTILV+DDLRFVLT EC   P+SNANR  + AYDRWIK N K  VYILAS+SDVLAKKH+ + T KEI++SLKGMF
Subjt:  NSIVQLLASEKLNGDNYATWKSNLNTILVIDDLRFVLTGECSPNPSSNANRTVRDAYDRWIKENDKARVYILASISDVLAKKHDVMGTAKEIMESLKGMF

Query:  GQSSFSLRHEAIKYIYNCRMKEGSLVREHVLDMMVHFNVTEDNEVVIDEKSQVSFLMMSLSKSFFHFRTNVVMNKIEYNLTALLNELQTYQSLLMNKGQT
        GQ  +SLRHEAIKYIY  RMKEG+ VREHVLDMM+HFN+ E N  VIDE  QVSF++ SL KSF  F+TN  +NKIE+NLT LLNELQ +Q+L   KG+ 
Subjt:  GQSSFSLRHEAIKYIYNCRMKEGSLVREHVLDMMVHFNVTEDNEVVIDEKSQVSFLMMSLSKSFFHFRTNVVMNKIEYNLTALLNELQTYQSLLMNKGQT

Query:  GEANVAISK-KLLRGSSSKNKSRPSTSKSVLMKKNGKGKNKIPTNRKHKVQK-ADKGKCFHCNENGHWKRNCPKYLAE
         EANVA +K K  RGSSS++K  PS   S  MKK G GK    T +++KV+K  +KGKC+HC ENGHW +NCPKYLA+
Subjt:  GEANVAISK-KLLRGSSSKNKSRPSTSKSVLMKKNGKGKNKIPTNRKHKVQK-ADKGKCFHCNENGHWKRNCPKYLAE

TYK11933.1 gag/pol protein [Cucumis melo var. makuwa]1.7e-9066.06Show/hide
Query:  NSIVQLLASEKLNGDNYATWKSNLNTILVIDDLRFVLTGECSPNPSSNANRTVRDAYDRWIKENDKARVYILASISDVLAKKHDVMGTAKEIMESLKGMF
        +SIVQLLASEKLNGDNYA WKSNLNTILV++DLRFVLT EC P P+SNANRT R+AYDRWIK N+KA VYIL S+ DVLAKKH+ +  AKEIM+SLKGMF
Subjt:  NSIVQLLASEKLNGDNYATWKSNLNTILVIDDLRFVLTGECSPNPSSNANRTVRDAYDRWIKENDKARVYILASISDVLAKKHDVMGTAKEIMESLKGMF

Query:  GQSSFSLRHEAIKYIYNCRMKEGSLVREHVLDMMVHFNVTEDNEVVIDEKSQVSFLMMSLSKSFFHFRTNVVMNKIEYNLTALLNELQTYQSLLMNKGQT
        GQ  +SL+HEAIKYIY  RMKEG+ VREHVL+MM+HFN+ E N   IDE +QVSF++ SL KSF  F+TN  ++KIE+NLT LLNELQ++Q+L M KG+ 
Subjt:  GQSSFSLRHEAIKYIYNCRMKEGSLVREHVLDMMVHFNVTEDNEVVIDEKSQVSFLMMSLSKSFFHFRTNVVMNKIEYNLTALLNELQTYQSLLMNKGQT

Query:  GEANVAISK-KLLRGSSSKNKSRPSTSKSVLMKKNGKGKNKIPTNRKHKVQKADKGKCFHCNENGHWKRNCPKYLAE
         EANVA +K K  RG SSK+K  PS     + KK GKG N  P   K K +  +KGKC+HC ENGHW RNCPKYLA+
Subjt:  GEANVAISK-KLLRGSSSKNKSRPSTSKSVLMKKNGKGKNKIPTNRKHKVQKADKGKCFHCNENGHWKRNCPKYLAE

TYK31700.1 gag/pol protein [Cucumis melo var. makuwa]8.3e-9066.06Show/hide
Query:  NSIVQLLASEKLNGDNYATWKSNLNTILVIDDLRFVLTGECSPNPSSNANRTVRDAYDRWIKENDKARVYILASISDVLAKKHDVMGTAKEIMESLKGMF
        +SIVQLLASEKLNGDNYA WKSNLNTILV+DDLRFVLT E    P+SNANRT R AYDRW+K N+KARVYILA ++DVLAKKH+ + TAKEIM++LK MF
Subjt:  NSIVQLLASEKLNGDNYATWKSNLNTILVIDDLRFVLTGECSPNPSSNANRTVRDAYDRWIKENDKARVYILASISDVLAKKHDVMGTAKEIMESLKGMF

Query:  GQSSFSLRHEAIKYIYNCRMKEGSLVREHVLDMMVHFNVTEDNEVVIDEKSQVSFLMMSLSKSFFHFRTNVVMNKIEYNLTALLNELQTYQSLLMNKGQT
        GQ  +SLRHEAIKYIY  RMKEG+ VREHVLDMM+ FN+ E N+  IDE +QVSF++ SL KSF  F+TN  +NKIE+NLT LLNELQ +Q+L M KG+ 
Subjt:  GQSSFSLRHEAIKYIYNCRMKEGSLVREHVLDMMVHFNVTEDNEVVIDEKSQVSFLMMSLSKSFFHFRTNVVMNKIEYNLTALLNELQTYQSLLMNKGQT

Query:  GEANVAISK-KLLRGSSSKNKSRPSTSKSVLMKKNGKGKNKIPTNRKHKVQKADKGKCFHCNENGHWKRNCPKYLAE
         EANVAI+K K  RGSSSK K+ P   K  + K   KGK K P   K K + ++KGKC+H  +NGHW RNCPKYLAE
Subjt:  GEANVAISK-KLLRGSSSKNKSRPSTSKSVLMKKNGKGKNKIPTNRKHKVQKADKGKCFHCNENGHWKRNCPKYLAE

TrEMBL top hitse value%identityAlignment
A0A5A7TWX1 Gag/pol protein9.9e-8965.82Show/hide
Query:  IVQLLASEKLNGDNYATWKSNLNTILVIDDLRFVLTGECSPNPSSNANRTVRDAYDRWIKENDKARVYILASISDVLAKKHDVMGTAKEIMESLKGMFGQ
        IVQLLASEKLN DNY TWKSNLNTILV+DDLRFVLT EC   P+SNANRT R+AYDRWIK N+KARVYILAS+SDVLAKKH+ + TAKEIM+SLKGMFGQ
Subjt:  IVQLLASEKLNGDNYATWKSNLNTILVIDDLRFVLTGECSPNPSSNANRTVRDAYDRWIKENDKARVYILASISDVLAKKHDVMGTAKEIMESLKGMFGQ

Query:  SSFSLRHEAIKYIYNCRMKEGSLVREHVLDMMVHFNVTEDNEVVIDEKSQVSFLMMSLSKSFFHFRTNVVMNKIEYNLTALLNELQTYQSLLMNKGQTGE
          +SLRH+AIKYIY  RMKEG+ +REHVL MM+HFN+ E N   IDE +QVSF++ SL KSF  F+TNV +NKIE+NLT LLNELQ +Q+L M K +  E
Subjt:  SSFSLRHEAIKYIYNCRMKEGSLVREHVLDMMVHFNVTEDNEVVIDEKSQVSFLMMSLSKSFFHFRTNVVMNKIEYNLTALLNELQTYQSLLMNKGQTGE

Query:  ANVAISK-KLLRGSSSKNKSRPSTSKSVLMKKNGKGKNKIPTNRKHKVQKADKGKCFHCNENGHWKRNCPKYLAE
        AN+A +K K  RGSSSK+K  PS     + K   KGK K P   K K +   KGKC+HC ENGH   NCPKYL +
Subjt:  ANVAISK-KLLRGSSSKNKSRPSTSKSVLMKKNGKGKNKIPTNRKHKVQKADKGKCFHCNENGHWKRNCPKYLAE

A0A5D3BPM3 Gag/pol protein1.2e-8966.19Show/hide
Query:  NSIVQLLASEKLNGDNYATWKSNLNTILVIDDLRFVLTGECSPNPSSNANRTVRDAYDRWIKENDKARVYILASISDVLAKKHDVMGTAKEIMESLKGMF
        NSIVQLLASEKLN +NYA WKSNLNTILV+DDLRFVLT EC   P+SNANR  + AYDRWIK N K  VYILAS+SDVLAKKH+ + T KEI++SLKGMF
Subjt:  NSIVQLLASEKLNGDNYATWKSNLNTILVIDDLRFVLTGECSPNPSSNANRTVRDAYDRWIKENDKARVYILASISDVLAKKHDVMGTAKEIMESLKGMF

Query:  GQSSFSLRHEAIKYIYNCRMKEGSLVREHVLDMMVHFNVTEDNEVVIDEKSQVSFLMMSLSKSFFHFRTNVVMNKIEYNLTALLNELQTYQSLLMNKGQT
        GQ  +SLRHEAIKYIY  RMKEG+ VREHVLDMM+HFN+ E N  VIDE  QVSF++ SL KSF  F+TN  +NKIE+NLT LLNELQ +Q+L   KG+ 
Subjt:  GQSSFSLRHEAIKYIYNCRMKEGSLVREHVLDMMVHFNVTEDNEVVIDEKSQVSFLMMSLSKSFFHFRTNVVMNKIEYNLTALLNELQTYQSLLMNKGQT

Query:  GEANVAISK-KLLRGSSSKNKSRPSTSKSVLMKKNGKGKNKIPTNRKHKVQK-ADKGKCFHCNENGHWKRNCPKYLAE
         EANVA +K K  RGSSS++K  PS   S  MKK G GK    T +++KV+K  +KGKC+HC ENGHW +NCPKYLA+
Subjt:  GEANVAISK-KLLRGSSSKNKSRPSTSKSVLMKKNGKGKNKIPTNRKHKVQK-ADKGKCFHCNENGHWKRNCPKYLAE

A0A5D3CJ27 Gag/pol protein8.1e-9166.06Show/hide
Query:  NSIVQLLASEKLNGDNYATWKSNLNTILVIDDLRFVLTGECSPNPSSNANRTVRDAYDRWIKENDKARVYILASISDVLAKKHDVMGTAKEIMESLKGMF
        +SIVQLLASEKLNGDNYA WKSNLNTILV++DLRFVLT EC P P+SNANRT R+AYDRWIK N+KA VYIL S+ DVLAKKH+ +  AKEIM+SLKGMF
Subjt:  NSIVQLLASEKLNGDNYATWKSNLNTILVIDDLRFVLTGECSPNPSSNANRTVRDAYDRWIKENDKARVYILASISDVLAKKHDVMGTAKEIMESLKGMF

Query:  GQSSFSLRHEAIKYIYNCRMKEGSLVREHVLDMMVHFNVTEDNEVVIDEKSQVSFLMMSLSKSFFHFRTNVVMNKIEYNLTALLNELQTYQSLLMNKGQT
        GQ  +SL+HEAIKYIY  RMKEG+ VREHVL+MM+HFN+ E N   IDE +QVSF++ SL KSF  F+TN  ++KIE+NLT LLNELQ++Q+L M KG+ 
Subjt:  GQSSFSLRHEAIKYIYNCRMKEGSLVREHVLDMMVHFNVTEDNEVVIDEKSQVSFLMMSLSKSFFHFRTNVVMNKIEYNLTALLNELQTYQSLLMNKGQT

Query:  GEANVAISK-KLLRGSSSKNKSRPSTSKSVLMKKNGKGKNKIPTNRKHKVQKADKGKCFHCNENGHWKRNCPKYLAE
         EANVA +K K  RG SSK+K  PS     + KK GKG N  P   K K +  +KGKC+HC ENGHW RNCPKYLA+
Subjt:  GEANVAISK-KLLRGSSSKNKSRPSTSKSVLMKKNGKGKNKIPTNRKHKVQKADKGKCFHCNENGHWKRNCPKYLAE

A0A5D3E7W3 Gag/pol protein4.0e-9066.06Show/hide
Query:  NSIVQLLASEKLNGDNYATWKSNLNTILVIDDLRFVLTGECSPNPSSNANRTVRDAYDRWIKENDKARVYILASISDVLAKKHDVMGTAKEIMESLKGMF
        +SIVQLLASEKLNGDNYA WKSNLNTILV+DDLRFVLT E    P+SNANRT R AYDRW+K N+KARVYILA ++DVLAKKH+ + TAKEIM++LK MF
Subjt:  NSIVQLLASEKLNGDNYATWKSNLNTILVIDDLRFVLTGECSPNPSSNANRTVRDAYDRWIKENDKARVYILASISDVLAKKHDVMGTAKEIMESLKGMF

Query:  GQSSFSLRHEAIKYIYNCRMKEGSLVREHVLDMMVHFNVTEDNEVVIDEKSQVSFLMMSLSKSFFHFRTNVVMNKIEYNLTALLNELQTYQSLLMNKGQT
        GQ  +SLRHEAIKYIY  RMKEG+ VREHVLDMM+ FN+ E N+  IDE +QVSF++ SL KSF  F+TN  +NKIE+NLT LLNELQ +Q+L M KG+ 
Subjt:  GQSSFSLRHEAIKYIYNCRMKEGSLVREHVLDMMVHFNVTEDNEVVIDEKSQVSFLMMSLSKSFFHFRTNVVMNKIEYNLTALLNELQTYQSLLMNKGQT

Query:  GEANVAISK-KLLRGSSSKNKSRPSTSKSVLMKKNGKGKNKIPTNRKHKVQKADKGKCFHCNENGHWKRNCPKYLAE
         EANVAI+K K  RGSSSK K+ P   K  + K   KGK K P   K K + ++KGKC+H  +NGHW RNCPKYLAE
Subjt:  GEANVAISK-KLLRGSSSKNKSRPSTSKSVLMKKNGKGKNKIPTNRKHKVQKADKGKCFHCNENGHWKRNCPKYLAE

E2GK51 Gag/pol protein (Fragment)1.7e-9669.57Show/hide
Query:  SIVQLLASEKLNGDNYATWKSNLNTILVIDDLRFVLTGECSPNPSSNANRTVRDAYDRWIKENDKARVYILASISDVLAKKHDVMGTAKEIMESLKGMFG
        SIVQLLASEKLNGDNY+ WKSNLNTILV+DDLRFVLT EC   P+ NANRTVR+AYDRW+K NDKARVYILAS++DVLAKKHD + TAK IM+SL+ MFG
Subjt:  SIVQLLASEKLNGDNYATWKSNLNTILVIDDLRFVLTGECSPNPSSNANRTVRDAYDRWIKENDKARVYILASISDVLAKKHDVMGTAKEIMESLKGMFG

Query:  QSSFSLRHEAIKYIYNCRMKEGSLVREHVLDMMVHFNVTEDNEVVIDEKSQVSFLMMSLSKSFFHFRTNVVMNKIEYNLTALLNELQTYQSLLMNKGQTG
        Q S+SLRHEAIK+IY  RMKEG+ VREHVLDMM+HFN+ E N   IDE +QVSF++ SL KSF  F+TN  +NKIE+NLT LLNELQ +Q+L ++KG+  
Subjt:  QSSFSLRHEAIKYIYNCRMKEGSLVREHVLDMMVHFNVTEDNEVVIDEKSQVSFLMMSLSKSFFHFRTNVVMNKIEYNLTALLNELQTYQSLLMNKGQTG

Query:  EANVAISK-KLLRGSSSKNKSRPSTSKSVLMKKNGKGKNKIPTNRKHKVQKADKGKCFHCNENGHWKRNCPKYLAE
        EANVA++K K +RGSSSKNK  PS ++   MKK GKG  K P   K K + ADKGKCFHCN++GHWKRNCPKYLAE
Subjt:  EANVAISK-KLLRGSSSKNKSRPSTSKSVLMKKNGKGKNKIPTNRKHKVQKADKGKCFHCNENGHWKRNCPKYLAE

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.0e-0622.14Show/hide
Query:  KLNGDN-YATWKSNLNTILVIDDLRFVLTGECSPNPSSNANRTVRDAYDRWIKENDKARVYILASISDVLAKKHDVMGTAKEIMESLKGMFGQSSFSLRH
        K NGDN ++TW+  +  +L+   L  VL  +     +  A        + W   +++A   I   +SD +        TA+ I   L+ ++   + + + 
Subjt:  KLNGDN-YATWKSNLNTILVIDDLRFVLTGECSPNPSSNANRTVRDAYDRWIKENDKARVYILASISDVLAKKHDVMGTAKEIMESLKGMFGQSSFSLRH

Query:  EAIKYIYNCRMKEGSLVREHVLDMMVHFNVTEDNEVVIDEKSQVSFLMMSLSKSFFHFRTNVVMNKIEYNLTALLNELQTYQSLLMNKGQTGEANVAISK
           K +Y   M EG+    H+           +  V I+E+ +   L+ SL  S+ +  T ++  K    L  + + L   + +       G+A +   +
Subjt:  EAIKYIYNCRMKEGSLVREHVLDMMVHFNVTEDNEVVIDEKSQVSFLMMSLSKSFFHFRTNVVMNKIEYNLTALLNELQTYQSLLMNKGQTGEANVAISK

Query:  KLLRGSSSKNKSRPSTSKSVLMKKNGKGKNKIPTNRKHKVQKADKGKCFHCNENGHWKRNCP
              SS N  R            GK KN+          K+    C++CN+ GH+KR+CP
Subjt:  KLLRGSSSKNKSRPSTSKSVLMKKNGKGKNKIPTNRKHKVQKADKGKCFHCNENGHWKRNCP

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTAATAGAATTTTCTTTTATCACAGCTGGACAAACTCAATAGTACAATTACTCGCTTCTGAGAAATTAAACGGCGACAATTACGCAACTTGGAAATCAAAC
CTAAACACAATACTGGTAATTGATGATTTAAGGTTTGTTTTAACTGGGGAATGTTCTCCAAACCCCAGCTCAAATGCAAATCGAACAGTTCGGGATGCGTATGAC
AGATGGATAAAGGAAAATGACAAAGCTCGAGTGTACATTCTAGCTAGCATATCTGATGTTTTAGCTAAGAAACACGATGTTATGGGTACTGCTAAAGAGATTATG
GAATCTCTAAAAGGGATGTTTGGACAATCGTCCTTCTCCCTTAGACATGAAGCCATAAAATACATTTACAACTGCCGTATGAAAGAAGGGAGCTTAGTTAGAGAA
CATGTCCTGGACATGATGGTCCATTTCAATGTGACAGAAGATAATGAAGTGGTCATTGATGAGAAGAGTCAAGTCAGTTTTCTTATGATGTCTCTTTCGAAGAGC
TTCTTCCATTTCCGCACAAATGTGGTAATGAACAAAATAGAATATAACTTGACTGCTCTTCTCAATGAGCTACAGACTTATCAATCCCTCTTAATGAACAAGGGA
CAAACAGGAGAAGCAAATGTTGCTATCTCCAAGAAATTACTACGAGGATCGTCCTCCAAAAATAAGTCTAGACCTTCAACTTCTAAAAGTGTTTTGATGAAGAAG
AACGGTAAAGGGAAAAATAAGATTCCTACTAACCGCAAACACAAGGTTCAAAAAGCAGATAAAGGAAAATGTTTCCATTGCAACGAAAACGGGCACTGGAAGAGA
AATTGCCCGAAATACCTTGCAGAATAG
mRNA sequenceShow/hide mRNA sequence
ATGACTAATAGAATTTTCTTTTATCACAGCTGGACAAACTCAATAGTACAATTACTCGCTTCTGAGAAATTAAACGGCGACAATTACGCAACTTGGAAATCAAAC
CTAAACACAATACTGGTAATTGATGATTTAAGGTTTGTTTTAACTGGGGAATGTTCTCCAAACCCCAGCTCAAATGCAAATCGAACAGTTCGGGATGCGTATGAC
AGATGGATAAAGGAAAATGACAAAGCTCGAGTGTACATTCTAGCTAGCATATCTGATGTTTTAGCTAAGAAACACGATGTTATGGGTACTGCTAAAGAGATTATG
GAATCTCTAAAAGGGATGTTTGGACAATCGTCCTTCTCCCTTAGACATGAAGCCATAAAATACATTTACAACTGCCGTATGAAAGAAGGGAGCTTAGTTAGAGAA
CATGTCCTGGACATGATGGTCCATTTCAATGTGACAGAAGATAATGAAGTGGTCATTGATGAGAAGAGTCAAGTCAGTTTTCTTATGATGTCTCTTTCGAAGAGC
TTCTTCCATTTCCGCACAAATGTGGTAATGAACAAAATAGAATATAACTTGACTGCTCTTCTCAATGAGCTACAGACTTATCAATCCCTCTTAATGAACAAGGGA
CAAACAGGAGAAGCAAATGTTGCTATCTCCAAGAAATTACTACGAGGATCGTCCTCCAAAAATAAGTCTAGACCTTCAACTTCTAAAAGTGTTTTGATGAAGAAG
AACGGTAAAGGGAAAAATAAGATTCCTACTAACCGCAAACACAAGGTTCAAAAAGCAGATAAAGGAAAATGTTTCCATTGCAACGAAAACGGGCACTGGAAGAGA
AATTGCCCGAAATACCTTGCAGAATAG
Protein sequenceShow/hide protein sequence
MTNRIFFYHSWTNSIVQLLASEKLNGDNYATWKSNLNTILVIDDLRFVLTGECSPNPSSNANRTVRDAYDRWIKENDKARVYILASISDVLAKKHDVMGTAKEIM
ESLKGMFGQSSFSLRHEAIKYIYNCRMKEGSLVREHVLDMMVHFNVTEDNEVVIDEKSQVSFLMMSLSKSFFHFRTNVVMNKIEYNLTALLNELQTYQSLLMNKG
QTGEANVAISKKLLRGSSSKNKSRPSTSKSVLMKKNGKGKNKIPTNRKHKVQKADKGKCFHCNENGHWKRNCPKYLAE