; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc01g14080 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc01g14080
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionTransposon TX1 uncharacterized 149 kDa protein
Genome locationchr1:8755964..8758414
RNA-Seq ExpressionMoc01g14080
SyntenyMoc01g14080
Gene Ontology termsGO:0006281 - DNA repair (biological process)
GO:0004518 - nuclease activity (molecular function)
InterPro domainsIPR004808 - AP endonuclease 1
IPR005135 - Endonuclease/exonuclease/phosphatase
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
RVW27595.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera]1.6e-5445.17Show/hide
Query:  MKLLSWNVRGLGSWNKRALIKQSISRHNPNLVILQETKLAYVDPLIIKSLWSSHGISWSALNAAGSSGGILFLWNESDFAVAEIIEGDFSLSINFCLADG
        MK++SWNVRGLGS NKR +IK  +    P++V++QETK    D  ++ S+W+     W  L A G+SGGILF+W+    +  E++ G FS+S+ F L   
Subjt:  MKLLSWNVRGLGSWNKRALIKQSISRHNPNLVILQETKLAYVDPLIIKSLWSSHGISWSALNAAGSSGGILFLWNESDFAVAEIIEGDFSLSINFCLADG

Query:  FSFWITGIYGPSSSGTRHLFWKELQDLSYLCEAQWIFAGDFNVSRWSWEKSHGRPLTKSTRLFNSFIEATGLSDIPLSNEKYTWSNNRSFSL---IDRFL
           WI+ +YGP+S   R  FW EL D+  L    W   GDFNV R S EK  G  LT S R F+SFI  + L D PL N  +TWSN +   +   +DRFL
Subjt:  FSFWITGIYGPSSSGTRHLFWKELQDLSYLCEAQWIFAGDFNVSRWSWEKSHGRPLTKSTRLFNSFIEATGLSDIPLSNEKYTWSNNRSFSL---IDRFL

Query:  LSNGCINRFGLPLAKRMDRTTSDHFPILLDFGQNNWGPTPFRFENMWLAHSPSHPSLKD
         SN     F     + + R TSDH+PI LD     WGPTPFRFENMWL     HPS K+
Subjt:  LSNGCINRFGLPLAKRMDRTTSDHFPILLDFGQNNWGPTPFRFENMWLAHSPSHPSLKD

RVW39500.1 hypothetical protein CK203_085975 [Vitis vinifera]1.6e-5444.4Show/hide
Query:  MKLLSWNVRGLGSWNKRALIKQSISRHNPNLVILQETKLAYVDPLIIKSLWSSHGISWSALNAAGSSGGILFLWNESDFAVAEIIEGDFSLSINFCLADG
        MK++SWNVRGLGS NKR ++K  +   NP++V++QETK    D   + S+W++    W AL A+G+SGGIL +W+  +    E++ G FS+S+ F L   
Subjt:  MKLLSWNVRGLGSWNKRALIKQSISRHNPNLVILQETKLAYVDPLIIKSLWSSHGISWSALNAAGSSGGILFLWNESDFAVAEIIEGDFSLSINFCLADG

Query:  FSFWITGIYGPSSSGTRHLFWKELQDLSYLCEAQWIFAGDFNVSRWSWEKSHGRPLTKSTRLFNSFIEATGLSDIPLSNEKYTWSNNRSFSL---IDRFL
           WI+ +YGP+S   R  FW EL D+  L    W   GDFNV R S EK  G  LT S R F+SFI    L D PL N  +TWSN +   +   +DRFL
Subjt:  FSFWITGIYGPSSSGTRHLFWKELQDLSYLCEAQWIFAGDFNVSRWSWEKSHGRPLTKSTRLFNSFIEATGLSDIPLSNEKYTWSNNRSFSL---IDRFL

Query:  LSNGCINRFGLPLAKRMDRTTSDHFPILLDFGQNNWGPTPFRFENMWLAHSPSHPSLKD
         SN     F   L + + R TSDH+PI++D     WGPTPFRFENMWL H+    + +D
Subjt:  LSNGCINRFGLPLAKRMDRTTSDHFPILLDFGQNNWGPTPFRFENMWLAHSPSHPSLKD

RVW94236.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]2.1e-5444.4Show/hide
Query:  MKLLSWNVRGLGSWNKRALIKQSISRHNPNLVILQETKLAYVDPLIIKSLWSSHGISWSALNAAGSSGGILFLWNESDFAVAEIIEGDFSLSINFCLADG
        MK++SWNVRGLGS NKR ++K  +   NP++V++QETK    D   + S+W+     W AL A+G+SGGIL +W+  + +  E++ G FS+S+ F L   
Subjt:  MKLLSWNVRGLGSWNKRALIKQSISRHNPNLVILQETKLAYVDPLIIKSLWSSHGISWSALNAAGSSGGILFLWNESDFAVAEIIEGDFSLSINFCLADG

Query:  FSFWITGIYGPSSSGTRHLFWKELQDLSYLCEAQWIFAGDFNVSRWSWEKSHGRPLTKSTRLFNSFIEATGLSDIPLSNEKYTWSNNRSFSL---IDRFL
           WI+ +YGP+S   R  FW EL D+  L    W   GDFNV R S EK  G  LT S R F+SFI    L D PL N  +TWSN +   +   +DRFL
Subjt:  FSFWITGIYGPSSSGTRHLFWKELQDLSYLCEAQWIFAGDFNVSRWSWEKSHGRPLTKSTRLFNSFIEATGLSDIPLSNEKYTWSNNRSFSL---IDRFL

Query:  LSNGCINRFGLPLAKRMDRTTSDHFPILLDFGQNNWGPTPFRFENMWLAHSPSHPSLKD
         SN     F   L + + R TSDH+PI++D     WGPTPFRFENMWL H+    + +D
Subjt:  LSNGCINRFGLPLAKRMDRTTSDHFPILLDFGQNNWGPTPFRFENMWLAHSPSHPSLKD

XP_022145142.1 uncharacterized protein LOC111014657 [Momordica charantia]1.3e-5947.58Show/hide
Query:  MKLLSWNVRGLGSWNKRALIKQSISRHNPNLVILQETKLAYVDPLIIKSLWSSHGISWSALNAAGSSGGILFLWNESDFAVAEIIEGDFSLSINFCLADG
        M +L+WNVRGLGS +KRA IK +I+   P++VIL ETK + ++   IKSLWSS  I+W++L+A+G+SGGI+ LW++   +  E+I G FS+S++F LAD 
Subjt:  MKLLSWNVRGLGSWNKRALIKQSISRHNPNLVILQETKLAYVDPLIIKSLWSSHGISWSALNAAGSSGGILFLWNESDFAVAEIIEGDFSLSINFCLADG

Query:  FSFWITGIYGPSSSGTRHLFWKELQDLSYLCEAQWIFAGDFNVSRWSWEKSHGRPLTKSTRLFNSFIEATGLSDIPLSNEKYTWSNNRS---FSLIDRFL
        F++W+TG+Y P     R LFW+EL DL+ LC   W+   DFN+ RWS E S   P       FN FI+  GL D  ++N +YTWSN R     S I+RFL
Subjt:  FSFWITGIYGPSSSGTRHLFWKELQDLSYLCEAQWIFAGDFNVSRWSWEKSHGRPLTKSTRLFNSFIEATGLSDIPLSNEKYTWSNNRS---FSLIDRFL

Query:  LSNGCINRFGLPLAKRMDRTTSDHFPILLDFGQNNWGPTPFRFENMWL
         S G  ++F     KR+ R  SDH+PILL+     WG  PFR EN+WL
Subjt:  LSNGCINRFGLPLAKRMDRTTSDHFPILLDFGQNNWGPTPFRFENMWL

XP_022158956.1 uncharacterized protein LOC111025405 [Momordica charantia]5.8e-10570.59Show/hide
Query:  MKLLSWNVRGLGSWNKRALIKQSISRHNPNLVILQETKLAYVDPLIIKSLWSSHGISWSALNAAGSSGGILFLWNESDFAVAEIIEGDFSLSINFCLADG
        MK L+WNVRGL SW K ALIKQ ISR NPN+VILQETKL+Y+D LI+KSLWS+HGI+WSAL+A+G + GIL LWN+ D   AE+IEG FSL+INFCL+DG
Subjt:  MKLLSWNVRGLGSWNKRALIKQSISRHNPNLVILQETKLAYVDPLIIKSLWSSHGISWSALNAAGSSGGILFLWNESDFAVAEIIEGDFSLSINFCLADG

Query:  FSFWITGIYGPSSSGTRHLFWKELQDLSYLCEAQWIFAGDFNVSRWSWEKSHGRPLTKSTRLFNSFIEATGLSDIPLSNEKYTWSNNRSFSLIDRFLLSN
        F FW++GIYGPS++   +LFW+EL DLS LCE  WI AGDFNV+RWSWEKS+GRPLTKS  LFNSFIE + L D+PL+N ++TWS N SFSLID FLL+N
Subjt:  FSFWITGIYGPSSSGTRHLFWKELQDLSYLCEAQWIFAGDFNVSRWSWEKSHGRPLTKSTRLFNSFIEATGLSDIPLSNEKYTWSNNRSFSLIDRFLLSN

Query:  GCINRFGLPLAKRMDRTTSDHFPILLDFGQNNWGPTPFRFENMWLAHSPSHPSLK
        GCI++ G+P+AKRM RTTSDHFPILLDFGQNNWG TPFRFENMWL+H    P L+
Subjt:  GCINRFGLPLAKRMDRTTSDHFPILLDFGQNNWGPTPFRFENMWLAHSPSHPSLK

TrEMBL top hitse value%identityAlignment
A0A438CWL6 Transposon TX1 uncharacterized 149 kDa protein7.8e-5545.17Show/hide
Query:  MKLLSWNVRGLGSWNKRALIKQSISRHNPNLVILQETKLAYVDPLIIKSLWSSHGISWSALNAAGSSGGILFLWNESDFAVAEIIEGDFSLSINFCLADG
        MK++SWNVRGLGS NKR +IK  +    P++V++QETK    D  ++ S+W+     W  L A G+SGGILF+W+    +  E++ G FS+S+ F L   
Subjt:  MKLLSWNVRGLGSWNKRALIKQSISRHNPNLVILQETKLAYVDPLIIKSLWSSHGISWSALNAAGSSGGILFLWNESDFAVAEIIEGDFSLSINFCLADG

Query:  FSFWITGIYGPSSSGTRHLFWKELQDLSYLCEAQWIFAGDFNVSRWSWEKSHGRPLTKSTRLFNSFIEATGLSDIPLSNEKYTWSNNRSFSL---IDRFL
           WI+ +YGP+S   R  FW EL D+  L    W   GDFNV R S EK  G  LT S R F+SFI  + L D PL N  +TWSN +   +   +DRFL
Subjt:  FSFWITGIYGPSSSGTRHLFWKELQDLSYLCEAQWIFAGDFNVSRWSWEKSHGRPLTKSTRLFNSFIEATGLSDIPLSNEKYTWSNNRSFSL---IDRFL

Query:  LSNGCINRFGLPLAKRMDRTTSDHFPILLDFGQNNWGPTPFRFENMWLAHSPSHPSLKD
         SN     F     + + R TSDH+PI LD     WGPTPFRFENMWL     HPS K+
Subjt:  LSNGCINRFGLPLAKRMDRTTSDHFPILLDFGQNNWGPTPFRFENMWLAHSPSHPSLKD

A0A438DVR3 Endo/exonuclease/phosphatase domain-containing protein7.8e-5544.4Show/hide
Query:  MKLLSWNVRGLGSWNKRALIKQSISRHNPNLVILQETKLAYVDPLIIKSLWSSHGISWSALNAAGSSGGILFLWNESDFAVAEIIEGDFSLSINFCLADG
        MK++SWNVRGLGS NKR ++K  +   NP++V++QETK    D   + S+W++    W AL A+G+SGGIL +W+  +    E++ G FS+S+ F L   
Subjt:  MKLLSWNVRGLGSWNKRALIKQSISRHNPNLVILQETKLAYVDPLIIKSLWSSHGISWSALNAAGSSGGILFLWNESDFAVAEIIEGDFSLSINFCLADG

Query:  FSFWITGIYGPSSSGTRHLFWKELQDLSYLCEAQWIFAGDFNVSRWSWEKSHGRPLTKSTRLFNSFIEATGLSDIPLSNEKYTWSNNRSFSL---IDRFL
           WI+ +YGP+S   R  FW EL D+  L    W   GDFNV R S EK  G  LT S R F+SFI    L D PL N  +TWSN +   +   +DRFL
Subjt:  FSFWITGIYGPSSSGTRHLFWKELQDLSYLCEAQWIFAGDFNVSRWSWEKSHGRPLTKSTRLFNSFIEATGLSDIPLSNEKYTWSNNRSFSL---IDRFL

Query:  LSNGCINRFGLPLAKRMDRTTSDHFPILLDFGQNNWGPTPFRFENMWLAHSPSHPSLKD
         SN     F   L + + R TSDH+PI++D     WGPTPFRFENMWL H+    + +D
Subjt:  LSNGCINRFGLPLAKRMDRTTSDHFPILLDFGQNNWGPTPFRFENMWLAHSPSHPSLKD

A0A438IBZ1 LINE-1 retrotransposable element ORF2 protein1.0e-5444.4Show/hide
Query:  MKLLSWNVRGLGSWNKRALIKQSISRHNPNLVILQETKLAYVDPLIIKSLWSSHGISWSALNAAGSSGGILFLWNESDFAVAEIIEGDFSLSINFCLADG
        MK++SWNVRGLGS NKR ++K  +   NP++V++QETK    D   + S+W+     W AL A+G+SGGIL +W+  + +  E++ G FS+S+ F L   
Subjt:  MKLLSWNVRGLGSWNKRALIKQSISRHNPNLVILQETKLAYVDPLIIKSLWSSHGISWSALNAAGSSGGILFLWNESDFAVAEIIEGDFSLSINFCLADG

Query:  FSFWITGIYGPSSSGTRHLFWKELQDLSYLCEAQWIFAGDFNVSRWSWEKSHGRPLTKSTRLFNSFIEATGLSDIPLSNEKYTWSNNRSFSL---IDRFL
           WI+ +YGP+S   R  FW EL D+  L    W   GDFNV R S EK  G  LT S R F+SFI    L D PL N  +TWSN +   +   +DRFL
Subjt:  FSFWITGIYGPSSSGTRHLFWKELQDLSYLCEAQWIFAGDFNVSRWSWEKSHGRPLTKSTRLFNSFIEATGLSDIPLSNEKYTWSNNRSFSL---IDRFL

Query:  LSNGCINRFGLPLAKRMDRTTSDHFPILLDFGQNNWGPTPFRFENMWLAHSPSHPSLKD
         SN     F   L + + R TSDH+PI++D     WGPTPFRFENMWL H+    + +D
Subjt:  LSNGCINRFGLPLAKRMDRTTSDHFPILLDFGQNNWGPTPFRFENMWLAHSPSHPSLKD

A0A6J1CVN2 uncharacterized protein LOC1110146576.2e-6047.58Show/hide
Query:  MKLLSWNVRGLGSWNKRALIKQSISRHNPNLVILQETKLAYVDPLIIKSLWSSHGISWSALNAAGSSGGILFLWNESDFAVAEIIEGDFSLSINFCLADG
        M +L+WNVRGLGS +KRA IK +I+   P++VIL ETK + ++   IKSLWSS  I+W++L+A+G+SGGI+ LW++   +  E+I G FS+S++F LAD 
Subjt:  MKLLSWNVRGLGSWNKRALIKQSISRHNPNLVILQETKLAYVDPLIIKSLWSSHGISWSALNAAGSSGGILFLWNESDFAVAEIIEGDFSLSINFCLADG

Query:  FSFWITGIYGPSSSGTRHLFWKELQDLSYLCEAQWIFAGDFNVSRWSWEKSHGRPLTKSTRLFNSFIEATGLSDIPLSNEKYTWSNNRS---FSLIDRFL
        F++W+TG+Y P     R LFW+EL DL+ LC   W+   DFN+ RWS E S   P       FN FI+  GL D  ++N +YTWSN R     S I+RFL
Subjt:  FSFWITGIYGPSSSGTRHLFWKELQDLSYLCEAQWIFAGDFNVSRWSWEKSHGRPLTKSTRLFNSFIEATGLSDIPLSNEKYTWSNNRS---FSLIDRFL

Query:  LSNGCINRFGLPLAKRMDRTTSDHFPILLDFGQNNWGPTPFRFENMWL
         S G  ++F     KR+ R  SDH+PILL+     WG  PFR EN+WL
Subjt:  LSNGCINRFGLPLAKRMDRTTSDHFPILLDFGQNNWGPTPFRFENMWL

A0A6J1E2G6 uncharacterized protein LOC1110254052.8e-10570.59Show/hide
Query:  MKLLSWNVRGLGSWNKRALIKQSISRHNPNLVILQETKLAYVDPLIIKSLWSSHGISWSALNAAGSSGGILFLWNESDFAVAEIIEGDFSLSINFCLADG
        MK L+WNVRGL SW K ALIKQ ISR NPN+VILQETKL+Y+D LI+KSLWS+HGI+WSAL+A+G + GIL LWN+ D   AE+IEG FSL+INFCL+DG
Subjt:  MKLLSWNVRGLGSWNKRALIKQSISRHNPNLVILQETKLAYVDPLIIKSLWSSHGISWSALNAAGSSGGILFLWNESDFAVAEIIEGDFSLSINFCLADG

Query:  FSFWITGIYGPSSSGTRHLFWKELQDLSYLCEAQWIFAGDFNVSRWSWEKSHGRPLTKSTRLFNSFIEATGLSDIPLSNEKYTWSNNRSFSLIDRFLLSN
        F FW++GIYGPS++   +LFW+EL DLS LCE  WI AGDFNV+RWSWEKS+GRPLTKS  LFNSFIE + L D+PL+N ++TWS N SFSLID FLL+N
Subjt:  FSFWITGIYGPSSSGTRHLFWKELQDLSYLCEAQWIFAGDFNVSRWSWEKSHGRPLTKSTRLFNSFIEATGLSDIPLSNEKYTWSNNRSFSLIDRFLLSN

Query:  GCINRFGLPLAKRMDRTTSDHFPILLDFGQNNWGPTPFRFENMWLAHSPSHPSLK
        GCI++ G+P+AKRM RTTSDHFPILLDFGQNNWG TPFRFENMWL+H    P L+
Subjt:  GCINRFGLPLAKRMDRTTSDHFPILLDFGQNNWGPTPFRFENMWLAHSPSHPSLK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAGGAAAGCGAGTGGTGAGGATGGTTTGGACATGCGGCCATGGGGAACACACCCTGGACAAGCATGCCCACAACACTGATCATCCCCCCGCCACTGAAACTAGGGA
ACAAGCCACCCTCCCTCTTCCCCCGACACCCATGTTACAAAGACCACACACTTTGACTTTGGACCAACCCACCAAGTCTTCCCTTTTTCCCGAATTATTTGCCTGGAGTC
GATGGGTTATTGTTCAGCGCCAATCGTTTCAGGATGATTGGCACCCAATCCTCATGGCCTTACAAGAGTCCATCAGCGACCACCGCTCCCTGAGTCCTATTCATGCCGAC
AAATCCCTCCTCAGATGTGGCGGCTTCTTGGCAGTCTCTGGTACCTCATCCAGCCTCGAGCCTCCATTCATGGAACTCAAGATTAAGGTCAAGAGCAACAACACGAGCTT
TGTCCCGGCAACCATCGAGCTCCCTCCATCCCTTACTCATGAAGACATTATCACCGTCCATATTGACCCATTTTTCATTGTCGAAAACCTGGTGGGTCGCCGACACTATG
CTCGTGGTGGACAAAGAAATCCCCCGAAATCAACAGCCGGAAAAGCACCCACCGGAAAACCTCCCTCTCGGAGCCAGACCTTCGCCGTCGCGGTTCCCAACCTTGCCGCT
ACAGCTGAGATTGACACATGGGCCCACAAATCTAGAGTCCCGACACTGTCCTCTACGTCTCGACAGACATCCCTTACAAAGTCCAAAGGGAAGGAAAAAGTCATGGACTT
TTCTGCCCCTCTTCCCTCGAGCTCCCCACCTCGGATGAAGCTTCTATCGTGGAATGTTAGAGGGTTGGGCTCATGGAATAAACGAGCCCTTATAAAACAGTCAATCTCCC
GCCATAATCCAAATCTTGTTATTCTACAAGAAACTAAGCTCGCATATGTTGACCCCCTCATCATCAAGTCCCTTTGGAGCTCACATGGGATCAGTTGGTCCGCCCTCAAT
GCTGCGGGTTCTAGCGGAGGGATCCTTTTTCTTTGGAACGAATCTGACTTCGCTGTGGCTGAGATCATTGAAGGTGATTTCTCCCTCTCCATTAATTTTTGTCTCGCTGA
TGGCTTCTCTTTCTGGATCACAGGCATTTATGGTCCCTCTTCCTCTGGTACCCGCCACCTTTTCTGGAAAGAACTGCAAGATCTCTCCTATCTTTGTGAGGCACAGTGGA
TTTTTGCAGGGGACTTCAATGTTTCCAGATGGTCCTGGGAAAAATCTCACGGTAGACCCCTCACAAAAAGCACGAGGCTTTTTAATTCCTTCATTGAGGCCACTGGACTT
TCGGATATCCCATTGAGCAACGAGAAGTACACTTGGTCTAATAACAGATCTTTCTCCCTTATTGATAGATTTCTGTTATCTAACGGCTGCATCAACAGATTTGGGTTGCC
CTTGGCCAAGAGAATGGACAGAACTACATCAGACCACTTCCCCATCCTCTTAGACTTTGGCCAAAACAACTGGGGCCCCACCCCCTTCAGATTTGAGAATATGTGGCTTG
CCCATAGTCCTTCTCACCCTTCCTTGAAAGATGGTGGCATTGCAACCCAACCAACGGATGGCCAGGTCATGGATTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAAGGAAAGCGAGTGGTGAGGATGGTTTGGACATGCGGCCATGGGGAACACACCCTGGACAAGCATGCCCACAACACTGATCATCCCCCCGCCACTGAAACTAGGGA
ACAAGCCACCCTCCCTCTTCCCCCGACACCCATGTTACAAAGACCACACACTTTGACTTTGGACCAACCCACCAAGTCTTCCCTTTTTCCCGAATTATTTGCCTGGAGTC
GATGGGTTATTGTTCAGCGCCAATCGTTTCAGGATGATTGGCACCCAATCCTCATGGCCTTACAAGAGTCCATCAGCGACCACCGCTCCCTGAGTCCTATTCATGCCGAC
AAATCCCTCCTCAGATGTGGCGGCTTCTTGGCAGTCTCTGGTACCTCATCCAGCCTCGAGCCTCCATTCATGGAACTCAAGATTAAGGTCAAGAGCAACAACACGAGCTT
TGTCCCGGCAACCATCGAGCTCCCTCCATCCCTTACTCATGAAGACATTATCACCGTCCATATTGACCCATTTTTCATTGTCGAAAACCTGGTGGGTCGCCGACACTATG
CTCGTGGTGGACAAAGAAATCCCCCGAAATCAACAGCCGGAAAAGCACCCACCGGAAAACCTCCCTCTCGGAGCCAGACCTTCGCCGTCGCGGTTCCCAACCTTGCCGCT
ACAGCTGAGATTGACACATGGGCCCACAAATCTAGAGTCCCGACACTGTCCTCTACGTCTCGACAGACATCCCTTACAAAGTCCAAAGGGAAGGAAAAAGTCATGGACTT
TTCTGCCCCTCTTCCCTCGAGCTCCCCACCTCGGATGAAGCTTCTATCGTGGAATGTTAGAGGGTTGGGCTCATGGAATAAACGAGCCCTTATAAAACAGTCAATCTCCC
GCCATAATCCAAATCTTGTTATTCTACAAGAAACTAAGCTCGCATATGTTGACCCCCTCATCATCAAGTCCCTTTGGAGCTCACATGGGATCAGTTGGTCCGCCCTCAAT
GCTGCGGGTTCTAGCGGAGGGATCCTTTTTCTTTGGAACGAATCTGACTTCGCTGTGGCTGAGATCATTGAAGGTGATTTCTCCCTCTCCATTAATTTTTGTCTCGCTGA
TGGCTTCTCTTTCTGGATCACAGGCATTTATGGTCCCTCTTCCTCTGGTACCCGCCACCTTTTCTGGAAAGAACTGCAAGATCTCTCCTATCTTTGTGAGGCACAGTGGA
TTTTTGCAGGGGACTTCAATGTTTCCAGATGGTCCTGGGAAAAATCTCACGGTAGACCCCTCACAAAAAGCACGAGGCTTTTTAATTCCTTCATTGAGGCCACTGGACTT
TCGGATATCCCATTGAGCAACGAGAAGTACACTTGGTCTAATAACAGATCTTTCTCCCTTATTGATAGATTTCTGTTATCTAACGGCTGCATCAACAGATTTGGGTTGCC
CTTGGCCAAGAGAATGGACAGAACTACATCAGACCACTTCCCCATCCTCTTAGACTTTGGCCAAAACAACTGGGGCCCCACCCCCTTCAGATTTGAGAATATGTGGCTTG
CCCATAGTCCTTCTCACCCTTCCTTGAAAGATGGTGGCATTGCAACCCAACCAACGGATGGCCAGGTCATGGATTGA
Protein sequenceShow/hide protein sequence
MKGKRVVRMVWTCGHGEHTLDKHAHNTDHPPATETREQATLPLPPTPMLQRPHTLTLDQPTKSSLFPELFAWSRWVIVQRQSFQDDWHPILMALQESISDHRSLSPIHAD
KSLLRCGGFLAVSGTSSSLEPPFMELKIKVKSNNTSFVPATIELPPSLTHEDIITVHIDPFFIVENLVGRRHYARGGQRNPPKSTAGKAPTGKPPSRSQTFAVAVPNLAA
TAEIDTWAHKSRVPTLSSTSRQTSLTKSKGKEKVMDFSAPLPSSSPPRMKLLSWNVRGLGSWNKRALIKQSISRHNPNLVILQETKLAYVDPLIIKSLWSSHGISWSALN
AAGSSGGILFLWNESDFAVAEIIEGDFSLSINFCLADGFSFWITGIYGPSSSGTRHLFWKELQDLSYLCEAQWIFAGDFNVSRWSWEKSHGRPLTKSTRLFNSFIEATGL
SDIPLSNEKYTWSNNRSFSLIDRFLLSNGCINRFGLPLAKRMDRTTSDHFPILLDFGQNNWGPTPFRFENMWLAHSPSHPSLKDGGIATQPTDGQVMD