; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg028310 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg028310
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionDUF4283 domain-containing protein
Genome locationscaffold8:26732554..26734598
RNA-Seq ExpressionSpg028310
SyntenySpg028310
Gene Ontology termsGO:0006281 - DNA repair (biological process)
GO:0004518 - nuclease activity (molecular function)
InterPro domainsIPR004808 - AP endonuclease 1
IPR025558 - Domain of unknown function DUF4283
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0040039.1 hypothetical protein E6C27_scaffold366G00060 [Cucumis melo var. makuwa]9.4e-2432.51Show/hide
Query:  LQQHLSAYASVSPLQPDKALLACENEEQAQVLANIRDWYKVGKFQVRFFPWNPDIMNGDQKVPSYGGWIRIRNLPIDKWSVETFKKIGDECGGHLETANK
        L + L       P   DKAL+  +NEEQA++L   + W  VG+F V+F  W+       + +PSYGGWI++R +P+  W++E+F +IGD CGG +E A +
Subjt:  LQQHLSAYASVSPLQPDKALLACENEEQAQVLANIRDWYKVGKFQVRFFPWNPDIMNGDQKVPSYGGWIRIRNLPIDKWSVETFKKIGDECGGHLETANK

Query:  TLTRLDMMEVLIKVKTNHTGFIPAEVHIPSSSTSPIKVNIDPFFAEDYYIGYIAGIHGKIPTAPTTSVDSRVGISGKFI-EEQRDSKPRAAQAKEIEKGD
        T    D+ E  IK+K N+TGFIPA + +         V +       ++      IHG          D     S +++ E+     P  A +  + K D
Subjt:  TLTRLDMMEVLIKVKTNHTGFIPAEVHIPSSSTSPIKVNIDPFFAEDYYIGYIAGIHGKIPTAPTTSVDSRVGISGKFI-EEQRDSKPRAAQAKEIEKGD

Query:  NPQ
        N Q
Subjt:  NPQ

KAA0050054.1 hypothetical protein E6C27_scaffold675G00340 [Cucumis melo var. makuwa]7.2e-2439.84Show/hide
Query:  LQQHLSAYASVSPLQPDKALLACENEEQAQVLANIRDWYKVGKFQVRFFPWNPDIMNGDQKVPSYGGWIRIRNLPIDKWSVETFKKIGDECGGHLETANK
        L + L       P   DKAL+  +NEEQA ++   + W  VG+F V+F  WN       + +PSYGGWI++R +P+  W++E+F +IGD CGG +E A +
Subjt:  LQQHLSAYASVSPLQPDKALLACENEEQAQVLANIRDWYKVGKFQVRFFPWNPDIMNGDQKVPSYGGWIRIRNLPIDKWSVETFKKIGDECGGHLETANK

Query:  TLTRLDMMEVLIKVKTNHTGFIPAEVHI
        T    D++E  I++K N++GFIPA + +
Subjt:  TLTRLDMMEVLIKVKTNHTGFIPAEVHI

KAA0056838.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]2.9e-2523.62Show/hide
Query:  MRALQQHLSAYASVSPLQPDKALLACENEEQAQVLAN--IRDWYKVGKFQVRFFPWNPDIMNGDQKVPSYGGWIRIRNLPIDKWSVETFKKIGDECGGHL
        M +L++      S  P Q DKA+L   ++    + +N     W  VG +QV+F  W+ ++ +    +PSYGGW+R R +P+  W+  TF+ IG  CGG L
Subjt:  MRALQQHLSAYASVSPLQPDKALLACENEEQAQVLAN--IRDWYKVGKFQVRFFPWNPDIMNGDQKVPSYGGWIRIRNLPIDKWSVETFKKIGDECGGHL

Query:  ETANKTLTRLDMMEVLIKVKTNHTGFIPAEVHI-PSSSTSPIKVNIDPFFAEDYYIGYIAGIHGKIPTAPTTSVDSRVGISGKFIEEQRDSKP----RAA
        + A +T+    +++  IKV+ N+TGF+PA + I  +   + I   + P  A  + +     +HG   T      D    ++  +      + P    R  
Subjt:  ETANKTLTRLDMMEVLIKVKTNHTGFIPAEVHI-PSSSTSPIKVNIDPFFAEDYYIGYIAGIHGKIPTAPTTSVDSRVGISGKFIEEQRDSKP----RAA

Query:  QAKEIEKGDNPQISYSCALQGTQKDDCHTEDYQTQPVCVPHLPEDHVNPQSYLPSPSVQKNPYPHKTPSHIPQFEAHPKYQPPLHPAPKSPLHPDGPPLN
            I   D   ISY    +     +   + +  Q      L +        +   + Q + +  K    I   +              S L P G   N
Subjt:  QAKEIEKGDNPQISYSCALQGTQKDDCHTEDYQTQPVCVPHLPEDHVNPQSYLPSPSVQKNPYPHKTPSHIPQFEAHPKYQPPLHPAPKSPLHPDGPPLN

Query:  CSNQPSCPNPKAPIELIHSTDSPPISPTPPTINQQVRKK---PITINNKETYLLMGTVHSTGATYHLS---DSEGAFSSPCSPNMAESPPTRKQKANNLV
         SN           E+     S  IS    TIN Q  K+         K TY +      +   ++LS     EG+     S +M    P      +   
Subjt:  CSNQPSCPNPKAPIELIHSTDSPPISPTPPTINQQVRKK---PITINNKETYLLMGTVHSTGATYHLS---DSEGAFSSPCSPNMAESPPTRKQKANNLV

Query:  DSPPTISHLFESSEDHVTDIENPIPLMIEEPTGAVYQQNFEMEPTTLVDIDVEEVTEDEYESNSHHQQRDPAVYLPILFPWLAEHGMGM---------GS
            T+++         TD      L +    GA   QN     +T      +  T  E E +   +++        L  WL E+ + +          S
Subjt:  DSPPTISHLFESSEDHVTDIENPIPLMIEEPTGAVYQQNFEMEPTTLVDIDVEEVTEDEYESNSHHQQRDPAVYLPILFPWLAEHGMGM---------GS

Query:  WKKRVLIKDF---------ITAKNPAIVILQETKLHSFDRKMAKSIWSSRDIAWTALHTYGA--SGGVYGPNSSHERKFFWQDLQDLQALCLPNWILAGD
            V++ D          +  K   +V+  +TK    D K+      +  I+   L+T G      VYGP   ++R   W +L+ LQ+LCLPNW++AGD
Subjt:  WKKRVLIKDF---------ITAKNPAIVILQETKLHSFDRKMAKSIWSSRDIAWTALHTYGA--SGGVYGPNSSHERKFFWQDLQDLQALCLPNWILAGD

Query:  FNITRWSWEKSTSTAPTR
        FNI RW  E +  +   R
Subjt:  FNITRWSWEKSTSTAPTR

QWT43305.1 kinesin-related protein KIN7C [Citrullus lanatus subsp. vulgaris]6.5e-2556Show/hide
Query:  DQKVPSYGGWIRIRNLPIDKWSVETFKKIGDECGGHLETANKTLTRLDMMEVLIKVKTNHTGFIPAEVHIPSSSTSPIKVNIDPFFAEDYYIGYIAGIHG
        D +VPSYG WI+IRNL ID+WS +TFK IG+ CGG++ET+ KTL R+DMME  +KVK N  GF+PA + + S+S   + V +DPFF  D +IGYIA + G
Subjt:  DQKVPSYGGWIRIRNLPIDKWSVETFKKIGDECGGHLETANKTLTRLDMMEVLIKVKTNHTGFIPAEVHIPSSSTSPIKVNIDPFFAEDYYIGYIAGIHG

TYK10355.1 hypothetical protein E5676_scaffold367G00330 [Cucumis melo var. makuwa]5.5e-2431Show/hide
Query:  LQQHLSAYASVSPLQPDKALLACENEEQAQVLANIRDWYKVGKFQVRFFPWNPDIMNGDQKVPSYGGWIRIRNLPIDKWSVETFKKIGDECGGHLETANK
        L + L       P   DKAL+  +NEEQA ++   + W  VG+F V+F  WN       + +PSYGGWI++R +P+  W++E+F +IGD CGG +E A +
Subjt:  LQQHLSAYASVSPLQPDKALLACENEEQAQVLANIRDWYKVGKFQVRFFPWNPDIMNGDQKVPSYGGWIRIRNLPIDKWSVETFKKIGDECGGHLETANK

Query:  TLTRLDMMEVLIKVKTNHTGFIPAEVHIPSSSTSPIKVNIDPFFAEDYYIGYIAGIHGKIPTAPTTSVDS-RVGISGKFIEEQRDSKPRAAQAKEIEKGD
        T    D++E  I++K N++GFIPA + +         + +       +++     IHG          D   +     F E+     P  A +  I K D
Subjt:  TLTRLDMMEVLIKVKTNHTGFIPAEVHIPSSSTSPIKVNIDPFFAEDYYIGYIAGIHGKIPTAPTTSVDS-RVGISGKFIEEQRDSKPRAAQAKEIEKGD

TrEMBL top hitse value%identityAlignment
A0A5A7TEK8 DUF4283 domain-containing protein7.8e-2440.62Show/hide
Query:  LQQHLSAYASVSPLQPDKALLACENEEQAQVLANIRDWYKVGKFQVRFFPWNPDIMNGDQKVPSYGGWIRIRNLPIDKWSVETFKKIGDECGGHLETANK
        L + L       P + DKAL+  +NEEQA++L   + W  VG+F V+F  W+       + +PSYGGWI++R +P+  W++E+F +IGD CGG +E A +
Subjt:  LQQHLSAYASVSPLQPDKALLACENEEQAQVLANIRDWYKVGKFQVRFFPWNPDIMNGDQKVPSYGGWIRIRNLPIDKWSVETFKKIGDECGGHLETANK

Query:  TLTRLDMMEVLIKVKTNHTGFIPAEVHI
        T    D+ E  IK+K N++GFIPA + +
Subjt:  TLTRLDMMEVLIKVKTNHTGFIPAEVHI

A0A5A7TFK7 DUF4283 domain-containing protein4.5e-2432.51Show/hide
Query:  LQQHLSAYASVSPLQPDKALLACENEEQAQVLANIRDWYKVGKFQVRFFPWNPDIMNGDQKVPSYGGWIRIRNLPIDKWSVETFKKIGDECGGHLETANK
        L + L       P   DKAL+  +NEEQA++L   + W  VG+F V+F  W+       + +PSYGGWI++R +P+  W++E+F +IGD CGG +E A +
Subjt:  LQQHLSAYASVSPLQPDKALLACENEEQAQVLANIRDWYKVGKFQVRFFPWNPDIMNGDQKVPSYGGWIRIRNLPIDKWSVETFKKIGDECGGHLETANK

Query:  TLTRLDMMEVLIKVKTNHTGFIPAEVHIPSSSTSPIKVNIDPFFAEDYYIGYIAGIHGKIPTAPTTSVDSRVGISGKFI-EEQRDSKPRAAQAKEIEKGD
        T    D+ E  IK+K N+TGFIPA + +         V +       ++      IHG          D     S +++ E+     P  A +  + K D
Subjt:  TLTRLDMMEVLIKVKTNHTGFIPAEVHIPSSSTSPIKVNIDPFFAEDYYIGYIAGIHGKIPTAPTTSVDSRVGISGKFI-EEQRDSKPRAAQAKEIEKGD

Query:  NPQ
        N Q
Subjt:  NPQ

A0A5A7U495 DUF4283 domain-containing protein3.5e-2439.84Show/hide
Query:  LQQHLSAYASVSPLQPDKALLACENEEQAQVLANIRDWYKVGKFQVRFFPWNPDIMNGDQKVPSYGGWIRIRNLPIDKWSVETFKKIGDECGGHLETANK
        L + L       P   DKAL+  +NEEQA ++   + W  VG+F V+F  WN       + +PSYGGWI++R +P+  W++E+F +IGD CGG +E A +
Subjt:  LQQHLSAYASVSPLQPDKALLACENEEQAQVLANIRDWYKVGKFQVRFFPWNPDIMNGDQKVPSYGGWIRIRNLPIDKWSVETFKKIGDECGGHLETANK

Query:  TLTRLDMMEVLIKVKTNHTGFIPAEVHI
        T    D++E  I++K N++GFIPA + +
Subjt:  TLTRLDMMEVLIKVKTNHTGFIPAEVHI

A0A5D3BKT8 LINE-1 retrotransposable element ORF2 protein1.4e-2523.62Show/hide
Query:  MRALQQHLSAYASVSPLQPDKALLACENEEQAQVLAN--IRDWYKVGKFQVRFFPWNPDIMNGDQKVPSYGGWIRIRNLPIDKWSVETFKKIGDECGGHL
        M +L++      S  P Q DKA+L   ++    + +N     W  VG +QV+F  W+ ++ +    +PSYGGW+R R +P+  W+  TF+ IG  CGG L
Subjt:  MRALQQHLSAYASVSPLQPDKALLACENEEQAQVLAN--IRDWYKVGKFQVRFFPWNPDIMNGDQKVPSYGGWIRIRNLPIDKWSVETFKKIGDECGGHL

Query:  ETANKTLTRLDMMEVLIKVKTNHTGFIPAEVHI-PSSSTSPIKVNIDPFFAEDYYIGYIAGIHGKIPTAPTTSVDSRVGISGKFIEEQRDSKP----RAA
        + A +T+    +++  IKV+ N+TGF+PA + I  +   + I   + P  A  + +     +HG   T      D    ++  +      + P    R  
Subjt:  ETANKTLTRLDMMEVLIKVKTNHTGFIPAEVHI-PSSSTSPIKVNIDPFFAEDYYIGYIAGIHGKIPTAPTTSVDSRVGISGKFIEEQRDSKP----RAA

Query:  QAKEIEKGDNPQISYSCALQGTQKDDCHTEDYQTQPVCVPHLPEDHVNPQSYLPSPSVQKNPYPHKTPSHIPQFEAHPKYQPPLHPAPKSPLHPDGPPLN
            I   D   ISY    +     +   + +  Q      L +        +   + Q + +  K    I   +              S L P G   N
Subjt:  QAKEIEKGDNPQISYSCALQGTQKDDCHTEDYQTQPVCVPHLPEDHVNPQSYLPSPSVQKNPYPHKTPSHIPQFEAHPKYQPPLHPAPKSPLHPDGPPLN

Query:  CSNQPSCPNPKAPIELIHSTDSPPISPTPPTINQQVRKK---PITINNKETYLLMGTVHSTGATYHLS---DSEGAFSSPCSPNMAESPPTRKQKANNLV
         SN           E+     S  IS    TIN Q  K+         K TY +      +   ++LS     EG+     S +M    P      +   
Subjt:  CSNQPSCPNPKAPIELIHSTDSPPISPTPPTINQQVRKK---PITINNKETYLLMGTVHSTGATYHLS---DSEGAFSSPCSPNMAESPPTRKQKANNLV

Query:  DSPPTISHLFESSEDHVTDIENPIPLMIEEPTGAVYQQNFEMEPTTLVDIDVEEVTEDEYESNSHHQQRDPAVYLPILFPWLAEHGMGM---------GS
            T+++         TD      L +    GA   QN     +T      +  T  E E +   +++        L  WL E+ + +          S
Subjt:  DSPPTISHLFESSEDHVTDIENPIPLMIEEPTGAVYQQNFEMEPTTLVDIDVEEVTEDEYESNSHHQQRDPAVYLPILFPWLAEHGMGM---------GS

Query:  WKKRVLIKDF---------ITAKNPAIVILQETKLHSFDRKMAKSIWSSRDIAWTALHTYGA--SGGVYGPNSSHERKFFWQDLQDLQALCLPNWILAGD
            V++ D          +  K   +V+  +TK    D K+      +  I+   L+T G      VYGP   ++R   W +L+ LQ+LCLPNW++AGD
Subjt:  WKKRVLIKDF---------ITAKNPAIVILQETKLHSFDRKMAKSIWSSRDIAWTALHTYGA--SGGVYGPNSSHERKFFWQDLQDLQALCLPNWILAGD

Query:  FNITRWSWEKSTSTAPTR
        FNI RW  E +  +   R
Subjt:  FNITRWSWEKSTSTAPTR

A0A5D3CFS8 DUF4283 domain-containing protein2.7e-2431Show/hide
Query:  LQQHLSAYASVSPLQPDKALLACENEEQAQVLANIRDWYKVGKFQVRFFPWNPDIMNGDQKVPSYGGWIRIRNLPIDKWSVETFKKIGDECGGHLETANK
        L + L       P   DKAL+  +NEEQA ++   + W  VG+F V+F  WN       + +PSYGGWI++R +P+  W++E+F +IGD CGG +E A +
Subjt:  LQQHLSAYASVSPLQPDKALLACENEEQAQVLANIRDWYKVGKFQVRFFPWNPDIMNGDQKVPSYGGWIRIRNLPIDKWSVETFKKIGDECGGHLETANK

Query:  TLTRLDMMEVLIKVKTNHTGFIPAEVHIPSSSTSPIKVNIDPFFAEDYYIGYIAGIHGKIPTAPTTSVDS-RVGISGKFIEEQRDSKPRAAQAKEIEKGD
        T    D++E  I++K N++GFIPA + +         + +       +++     IHG          D   +     F E+     P  A +  I K D
Subjt:  TLTRLDMMEVLIKVKTNHTGFIPAEVHIPSSSTSPIKVNIDPFFAEDYYIGYIAGIHGKIPTAPTTSVDS-RVGISGKFIEEQRDSKPRAAQAKEIEKGD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGGCTCTTCAACAACACTTATCAGCATATGCTTCAGTTAGCCCCCTGCAACCGGATAAAGCCCTTCTGGCTTGTGAGAATGAGGAACAAGCCCAGGTCCTAGCAAA
TATCAGAGATTGGTATAAGGTTGGAAAATTTCAGGTTAGATTCTTCCCATGGAACCCTGATATCATGAATGGTGATCAAAAGGTCCCTTCATATGGGGGATGGATAAGAA
TCCGCAATCTTCCAATAGATAAATGGTCTGTGGAAACCTTCAAAAAGATTGGAGACGAGTGTGGGGGCCATTTAGAAACAGCGAACAAAACTCTGACCAGATTGGATATG
ATGGAGGTTTTGATAAAGGTAAAAACAAACCACACTGGTTTCATCCCAGCAGAGGTACACATTCCATCATCGTCAACTAGCCCCATCAAGGTTAATATAGACCCATTCTT
TGCGGAAGATTACTATATTGGATATATAGCCGGAATCCATGGGAAAATACCAACAGCTCCGACGACATCCGTTGATTCTCGCGTCGGAATATCTGGGAAATTCATTGAGG
AACAAAGGGATTCAAAGCCACGCGCCGCCCAAGCAAAGGAAATTGAAAAAGGGGATAATCCCCAAATTTCGTACAGTTGCGCCCTCCAAGGAACCCAAAAGGATGACTGT
CACACAGAGGATTATCAGACTCAACCAGTCTGTGTCCCACACCTTCCAGAAGATCACGTGAATCCCCAATCATATTTGCCATCTCCATCGGTTCAAAAAAATCCCTATCC
ACACAAAACCCCAAGCCATATTCCCCAATTCGAGGCCCACCCCAAATATCAGCCCCCACTACATCCTGCCCCAAAAAGCCCACTCCATCCCGATGGCCCACCCCTGAACT
GCTCTAACCAACCGAGCTGCCCCAATCCCAAAGCCCCTATAGAATTAATCCACTCTACAGACTCCCCACCTATCAGCCCAACCCCACCCACCATTAACCAACAGGTTCGA
AAAAAGCCCATCACCATTAACAATAAGGAAACCTACCTTCTCATGGGCACCGTACACTCTACTGGAGCGACTTATCACCTATCAGATTCGGAAGGAGCCTTTTCCTCCCC
ATGCTCTCCGAATATGGCTGAATCTCCCCCTACTCGAAAACAAAAGGCAAACAACCTTGTGGATTCTCCTCCGACTATATCCCACTTATTTGAATCTTCCGAAGACCATG
TCACTGATATTGAAAACCCCATCCCCTTGATGATTGAAGAGCCCACTGGGGCAGTCTACCAGCAAAACTTTGAGATGGAACCCACAACTTTGGTCGACATAGACGTCGAA
GAAGTGACAGAAGATGAATACGAATCCAATTCCCACCATCAGCAAAGAGATCCTGCTGTTTATCTCCCCATTCTCTTTCCTTGGTTGGCAGAACATGGCATGGGTATGGG
CTCATGGAAGAAAAGAGTCCTCATCAAAGATTTTATCACAGCTAAGAATCCAGCCATCGTTATCCTTCAAGAAACAAAGTTGCACTCCTTTGATAGAAAGATGGCCAAAT
CGATTTGGAGCTCGAGGGACATTGCCTGGACAGCCCTCCACACCTATGGTGCTTCAGGGGGAGTGTATGGTCCAAACTCTTCTCATGAGAGAAAATTCTTTTGGCAAGAT
CTTCAAGACCTACAAGCCCTCTGTCTTCCAAATTGGATCTTAGCAGGGGATTTCAACATCACTAGATGGTCTTGGGAGAAATCGACCTCCACAGCTCCCACCCGT
mRNA sequenceShow/hide mRNA sequence
ATGAGGGCTCTTCAACAACACTTATCAGCATATGCTTCAGTTAGCCCCCTGCAACCGGATAAAGCCCTTCTGGCTTGTGAGAATGAGGAACAAGCCCAGGTCCTAGCAAA
TATCAGAGATTGGTATAAGGTTGGAAAATTTCAGGTTAGATTCTTCCCATGGAACCCTGATATCATGAATGGTGATCAAAAGGTCCCTTCATATGGGGGATGGATAAGAA
TCCGCAATCTTCCAATAGATAAATGGTCTGTGGAAACCTTCAAAAAGATTGGAGACGAGTGTGGGGGCCATTTAGAAACAGCGAACAAAACTCTGACCAGATTGGATATG
ATGGAGGTTTTGATAAAGGTAAAAACAAACCACACTGGTTTCATCCCAGCAGAGGTACACATTCCATCATCGTCAACTAGCCCCATCAAGGTTAATATAGACCCATTCTT
TGCGGAAGATTACTATATTGGATATATAGCCGGAATCCATGGGAAAATACCAACAGCTCCGACGACATCCGTTGATTCTCGCGTCGGAATATCTGGGAAATTCATTGAGG
AACAAAGGGATTCAAAGCCACGCGCCGCCCAAGCAAAGGAAATTGAAAAAGGGGATAATCCCCAAATTTCGTACAGTTGCGCCCTCCAAGGAACCCAAAAGGATGACTGT
CACACAGAGGATTATCAGACTCAACCAGTCTGTGTCCCACACCTTCCAGAAGATCACGTGAATCCCCAATCATATTTGCCATCTCCATCGGTTCAAAAAAATCCCTATCC
ACACAAAACCCCAAGCCATATTCCCCAATTCGAGGCCCACCCCAAATATCAGCCCCCACTACATCCTGCCCCAAAAAGCCCACTCCATCCCGATGGCCCACCCCTGAACT
GCTCTAACCAACCGAGCTGCCCCAATCCCAAAGCCCCTATAGAATTAATCCACTCTACAGACTCCCCACCTATCAGCCCAACCCCACCCACCATTAACCAACAGGTTCGA
AAAAAGCCCATCACCATTAACAATAAGGAAACCTACCTTCTCATGGGCACCGTACACTCTACTGGAGCGACTTATCACCTATCAGATTCGGAAGGAGCCTTTTCCTCCCC
ATGCTCTCCGAATATGGCTGAATCTCCCCCTACTCGAAAACAAAAGGCAAACAACCTTGTGGATTCTCCTCCGACTATATCCCACTTATTTGAATCTTCCGAAGACCATG
TCACTGATATTGAAAACCCCATCCCCTTGATGATTGAAGAGCCCACTGGGGCAGTCTACCAGCAAAACTTTGAGATGGAACCCACAACTTTGGTCGACATAGACGTCGAA
GAAGTGACAGAAGATGAATACGAATCCAATTCCCACCATCAGCAAAGAGATCCTGCTGTTTATCTCCCCATTCTCTTTCCTTGGTTGGCAGAACATGGCATGGGTATGGG
CTCATGGAAGAAAAGAGTCCTCATCAAAGATTTTATCACAGCTAAGAATCCAGCCATCGTTATCCTTCAAGAAACAAAGTTGCACTCCTTTGATAGAAAGATGGCCAAAT
CGATTTGGAGCTCGAGGGACATTGCCTGGACAGCCCTCCACACCTATGGTGCTTCAGGGGGAGTGTATGGTCCAAACTCTTCTCATGAGAGAAAATTCTTTTGGCAAGAT
CTTCAAGACCTACAAGCCCTCTGTCTTCCAAATTGGATCTTAGCAGGGGATTTCAACATCACTAGATGGTCTTGGGAGAAATCGACCTCCACAGCTCCCACCCGT
Protein sequenceShow/hide protein sequence
MRALQQHLSAYASVSPLQPDKALLACENEEQAQVLANIRDWYKVGKFQVRFFPWNPDIMNGDQKVPSYGGWIRIRNLPIDKWSVETFKKIGDECGGHLETANKTLTRLDM
MEVLIKVKTNHTGFIPAEVHIPSSSTSPIKVNIDPFFAEDYYIGYIAGIHGKIPTAPTTSVDSRVGISGKFIEEQRDSKPRAAQAKEIEKGDNPQISYSCALQGTQKDDC
HTEDYQTQPVCVPHLPEDHVNPQSYLPSPSVQKNPYPHKTPSHIPQFEAHPKYQPPLHPAPKSPLHPDGPPLNCSNQPSCPNPKAPIELIHSTDSPPISPTPPTINQQVR
KKPITINNKETYLLMGTVHSTGATYHLSDSEGAFSSPCSPNMAESPPTRKQKANNLVDSPPTISHLFESSEDHVTDIENPIPLMIEEPTGAVYQQNFEMEPTTLVDIDVE
EVTEDEYESNSHHQQRDPAVYLPILFPWLAEHGMGMGSWKKRVLIKDFITAKNPAIVILQETKLHSFDRKMAKSIWSSRDIAWTALHTYGASGGVYGPNSSHERKFFWQD
LQDLQALCLPNWILAGDFNITRWSWEKSTSTAPTR