; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg030177 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg030177
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionCACTA en-spm transposon protein
Genome locationscaffold6:15026524..15034246
RNA-Seq ExpressionSpg030177
SyntenySpg030177
Gene Ontology termsNA
InterPro domainsIPR004252 - Probable transposase, Ptta/En/Spm, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0038349.1 CACTA en-spm transposon protein [Cucumis melo var. makuwa]6.6e-3838.64Show/hide
Query:  AANLPSSTGDTSDGTSMTGRNKRGYGRNILLDNYVTIHGKIPITIDPEVKKPVGKWATQFSISIGTGVRESFSVRFDTWDDVTEEAKKLVKSRLLESSES
        ++++  +TG  S   + T R +    R + L+ +V I+G+IP+TI P  +KP+   A +FS +IG  VR++FS+ +  W DV  E  ++VK  L E S +
Subjt:  AANLPSSTGDTSDGTSMTGRNKRGYGRNILLDNYVTIHGKIPITIDPEVKKPVGKWATQFSISIGTGVRESFSVRFDTWDDVTEEAKKLVKSRLLESSES

Query:  NKRSRSMLAFNHKAGSKSFINIQHEMKEKEGRDIDPIELFELTHSKNGKWVNQAAEDAHGKMVDLRDAPIPEGSQPLSKPLICETVLGKRPGHVKGMGWG
        NK +R    +NH +GSKSF+  Q+E+ E++G  +D +ELF  TH + G +V+Q  EDAH +M++L+  P  EGSQPLS+  IC+ VLG+RPG+ KG+ WG
Subjt:  NKRSRSMLAFNHKAGSKSFINIQHEMKEKEGRDIDPIELFELTHSKNGKWVNQAAEDAHGKMVDLRDAPIPEGSQPLSKPLICETVLGKRPGHVKGMGWG

Query:  PKPSRSNT---SGNTSLSSGPTQRELEQQ----EEINTLKAQYENIQKELQTSKQSAQKTIEEV
         K     T   S +++  S  T++E+E Q    E +  ++ Q  N Q  L    +S +K IEE+
Subjt:  PKPSRSNT---SGNTSLSSGPTQRELEQQ----EEINTLKAQYENIQKELQTSKQSAQKTIEEV

KAA0041316.1 CACTA en-spm transposon protein [Cucumis melo var. makuwa]2.0e-3935.8Show/hide
Query:  APPAANLPSSTGDTSDGTSMTGRNKRGYGRNILLDNYVTIHGKIPITIDPEVKKPVGKWATQFSISIGTGVRESFSVRFDTWDDVTEEAKKLVKSRL---
        +PP+A L    G +S  T+ T R +R   R + L+ +V I+G+IP+TI P  +KP+   A +FS +IG  VR++FSVR   W DV  E  ++VK  L   
Subjt:  APPAANLPSSTGDTSDGTSMTGRNKRGYGRNILLDNYVTIHGKIPITIDPEVKKPVGKWATQFSISIGTGVRESFSVRFDTWDDVTEEAKKLVKSRL---

Query:  -------------------------------------------------LESSESNKRSRSMLAFNHKAGSKSFINIQHEMKEKEGRDIDPIELFELTHS
                                                         LE S +NK +R    +NH +GSKSF+  Q+E+ E+ G+ +D +ELF+ TH 
Subjt:  -------------------------------------------------LESSESNKRSRSMLAFNHKAGSKSFINIQHEMKEKEGRDIDPIELFELTHS

Query:  KNGKWVNQAAEDAHGKMVDLRDAPIPEGSQPLSKPLICETVLGKRPGHVKGMGWGPKPSRSNTSGNTSLSSGPTQRELEQQEEINTLKAQYENIQKELQT
        + G +V+QAAEDAH +M++L+  P PEGSQPLS+  IC+ VLG+RPG+ KG GWGPKP    T+   S SS  T      Q+EI  L+A+     + ++ 
Subjt:  KNGKWVNQAAEDAHGKMVDLRDAPIPEGSQPLSKPLICETVLGKRPGHVKGMGWGPKPSRSNTSGNTSLSSGPTQRELEQQEEINTLKAQYENIQKELQT

Query:  SKQSAQKTIEEVNVLKDMIVVLPR
          ++ Q    +V  +K MI  L R
Subjt:  SKQSAQKTIEEVNVLKDMIVVLPR

KAA0043188.1 CACTA en-spm transposon protein [Cucumis melo var. makuwa]3.5e-3936.49Show/hide
Query:  STGDTSDGTSMTGRNKRGYGRNILLDNYVTIHGKIPITIDPEVKKPVGKWATQFSISIGTGVRESFSVRFDTWDDVTEEAKKLVKSRL------------
        +TG +S  T+ T R +R   R + L+ +V I+G+IP+TI P  +KP+   A +FS +IG  VR++F VR   W DV  E  ++VK  L            
Subjt:  STGDTSDGTSMTGRNKRGYGRNILLDNYVTIHGKIPITIDPEVKKPVGKWATQFSISIGTGVRESFSVRFDTWDDVTEEAKKLVKSRL------------

Query:  ---------------------LESSESNKRSRSMLAFNHKAGSKSFINIQHEMKEKEGRDIDPIELFELTHSKNGKWVNQAAEDAHGKMVDLRDAPIPEG
                             LE S +NK  R    +NH +GSKSF+  Q+E+ E+ G+ +D +ELF  TH + G +V+QAAEDAH +M++L+  P PEG
Subjt:  ---------------------LESSESNKRSRSMLAFNHKAGSKSFINIQHEMKEKEGRDIDPIELFELTHSKNGKWVNQAAEDAHGKMVDLRDAPIPEG

Query:  SQPLSKPLICETVLGKRPGHVKGMGWGPKPSRSNTSGNTSLSSGPTQRELEQQEEINTLKAQYENIQKELQTSKQSAQKTIEEVNVLKDMIVVLPR
        SQPLS+  IC+ VLG+RPG+ KG+GWGPKP ++    + S SS    + +E++ E   L+A+     ++++   ++ Q    +V  ++ MI  L R
Subjt:  SQPLSKPLICETVLGKRPGHVKGMGWGPKPSRSNTSGNTSLSSGPTQRELEQQEEINTLKAQYENIQKELQTSKQSAQKTIEEVNVLKDMIVVLPR

KAA0063396.1 CACTA en-spm transposon protein [Cucumis melo var. makuwa]1.6e-3936.42Show/hide
Query:  AANLPSSTGDTSDGTSMTGRNKRGYGRNILLDNYVTIHGKIPITIDPEVKKPVGKWATQFSISIGTGVRESFSVRFDTWDDVTEEAKKLVKSRL------
        ++++  +TG +S  T+ T R +R   R + L+ +V I+G+IP+ I P  +KP+     +FS +IG  VR++F VR   W DV  E  ++VK  L      
Subjt:  AANLPSSTGDTSDGTSMTGRNKRGYGRNILLDNYVTIHGKIPITIDPEVKKPVGKWATQFSISIGTGVRESFSVRFDTWDDVTEEAKKLVKSRL------

Query:  ---------------------------LESSESNKRSRSMLAFNHKAGSKSFINIQHEMKEKEGRDIDPIELFELTHSKNGKWVNQAAEDAHGKMVDLRD
                                   LE S +NK +R   ++NH +GSKSF+  Q+E+ E+ G+ +D +ELF  TH + G +V QA EDAH +M++L+ 
Subjt:  ---------------------------LESSESNKRSRSMLAFNHKAGSKSFINIQHEMKEKEGRDIDPIELFELTHSKNGKWVNQAAEDAHGKMVDLRD

Query:  APIPEGSQPLSKPLICETVLGKRPGHVKGMGWGPKPSRSNTSGNTSLSSGPTQRELEQQEEINTLKAQYENIQKELQTSKQSAQKTIEEVNVLKDMIVVL
         PIPEGSQPLS+  IC+ VLGKRPG+ KG+ WGPKP    T   TS SS  T      ++EI  L+A+     + ++   ++ Q    +V  +K MI  L
Subjt:  APIPEGSQPLSKPLICETVLGKRPGHVKGMGWGPKPSRSNTSGNTSLSSGPTQRELEQQEEINTLKAQYENIQKELQTSKQSAQKTIEEVNVLKDMIVVL

Query:  PR
         R
Subjt:  PR

TYK00061.1 CACTA en-spm transposon protein [Cucumis melo var. makuwa]1.7e-3839.44Show/hide
Query:  ETDGEYV--EVPAPPAANLPSSTGD-----TSDGTSMTGRNKRGYGRNILLDNYVTIHGKIPITIDPEVKKPVGKWATQFSISIGTGVRESFSVRFDTWD
        ETD  ++  EV     A   SS GD     +S  T+ T R +R + R + L+ +V I+ +I +TI P  +K +   A +F+ +IG  +R++F +R     
Subjt:  ETDGEYV--EVPAPPAANLPSSTGD-----TSDGTSMTGRNKRGYGRNILLDNYVTIHGKIPITIDPEVKKPVGKWATQFSISIGTGVRESFSVRFDTWD

Query:  DVTEEAKKLVKSRLLESSESNKRSRSMLAFNHKAGSKSFINIQHEMKEKEGRDIDPIELFELTHSKNGKWVNQAAEDAHGKMVDLRDAPIPEGSQPLSKP
        DV  E  ++VK  L E S +NK +R    +NH +GSKSF+  Q+E+ E++G  +D +ELF  TH + G +V+QA EDAH +M++L+  P PEGSQPLS+ 
Subjt:  DVTEEAKKLVKSRLLESSESNKRSRSMLAFNHKAGSKSFINIQHEMKEKEGRDIDPIELFELTHSKNGKWVNQAAEDAHGKMVDLRDAPIPEGSQPLSKP

Query:  LICETVLGKRPGHVKGMGWGPKPSRSNTSGNTSLS---SGPTQRELEQQ----EEINTLKAQYENIQKELQTSKQSAQKTIEEV
         IC+ VLG+RPG+ KG+GWGPKP    T+  +S S   S  T++E+E Q    E +  ++ Q  N Q  L +  +S +K IEE+
Subjt:  LICETVLGKRPGHVKGMGWGPKPSRSNTSGNTSLS---SGPTQRELEQQ----EEINTLKAQYENIQKELQTSKQSAQKTIEEV

TrEMBL top hitse value%identityAlignment
A0A5A7TAG5 CACTA en-spm transposon protein3.2e-3838.64Show/hide
Query:  AANLPSSTGDTSDGTSMTGRNKRGYGRNILLDNYVTIHGKIPITIDPEVKKPVGKWATQFSISIGTGVRESFSVRFDTWDDVTEEAKKLVKSRLLESSES
        ++++  +TG  S   + T R +    R + L+ +V I+G+IP+TI P  +KP+   A +FS +IG  VR++FS+ +  W DV  E  ++VK  L E S +
Subjt:  AANLPSSTGDTSDGTSMTGRNKRGYGRNILLDNYVTIHGKIPITIDPEVKKPVGKWATQFSISIGTGVRESFSVRFDTWDDVTEEAKKLVKSRLLESSES

Query:  NKRSRSMLAFNHKAGSKSFINIQHEMKEKEGRDIDPIELFELTHSKNGKWVNQAAEDAHGKMVDLRDAPIPEGSQPLSKPLICETVLGKRPGHVKGMGWG
        NK +R    +NH +GSKSF+  Q+E+ E++G  +D +ELF  TH + G +V+Q  EDAH +M++L+  P  EGSQPLS+  IC+ VLG+RPG+ KG+ WG
Subjt:  NKRSRSMLAFNHKAGSKSFINIQHEMKEKEGRDIDPIELFELTHSKNGKWVNQAAEDAHGKMVDLRDAPIPEGSQPLSKPLICETVLGKRPGHVKGMGWG

Query:  PKPSRSNT---SGNTSLSSGPTQRELEQQ----EEINTLKAQYENIQKELQTSKQSAQKTIEEV
         K     T   S +++  S  T++E+E Q    E +  ++ Q  N Q  L    +S +K IEE+
Subjt:  PKPSRSNT---SGNTSLSSGPTQRELEQQ----EEINTLKAQYENIQKELQTSKQSAQKTIEEV

A0A5A7TK56 CACTA en-spm transposon protein1.7e-3936.49Show/hide
Query:  STGDTSDGTSMTGRNKRGYGRNILLDNYVTIHGKIPITIDPEVKKPVGKWATQFSISIGTGVRESFSVRFDTWDDVTEEAKKLVKSRL------------
        +TG +S  T+ T R +R   R + L+ +V I+G+IP+TI P  +KP+   A +FS +IG  VR++F VR   W DV  E  ++VK  L            
Subjt:  STGDTSDGTSMTGRNKRGYGRNILLDNYVTIHGKIPITIDPEVKKPVGKWATQFSISIGTGVRESFSVRFDTWDDVTEEAKKLVKSRL------------

Query:  ---------------------LESSESNKRSRSMLAFNHKAGSKSFINIQHEMKEKEGRDIDPIELFELTHSKNGKWVNQAAEDAHGKMVDLRDAPIPEG
                             LE S +NK  R    +NH +GSKSF+  Q+E+ E+ G+ +D +ELF  TH + G +V+QAAEDAH +M++L+  P PEG
Subjt:  ---------------------LESSESNKRSRSMLAFNHKAGSKSFINIQHEMKEKEGRDIDPIELFELTHSKNGKWVNQAAEDAHGKMVDLRDAPIPEG

Query:  SQPLSKPLICETVLGKRPGHVKGMGWGPKPSRSNTSGNTSLSSGPTQRELEQQEEINTLKAQYENIQKELQTSKQSAQKTIEEVNVLKDMIVVLPR
        SQPLS+  IC+ VLG+RPG+ KG+GWGPKP ++    + S SS    + +E++ E   L+A+     ++++   ++ Q    +V  ++ MI  L R
Subjt:  SQPLSKPLICETVLGKRPGHVKGMGWGPKPSRSNTSGNTSLSSGPTQRELEQQEEINTLKAQYENIQKELQTSKQSAQKTIEEVNVLKDMIVVLPR

A0A5A7TUT1 CACTA en-spm transposon protein2.7e-3733.96Show/hide
Query:  AANLPSSTGDTSDGTSMTGRNKRGYGRNILLDNYVTIHGKIPITIDPEVKKPVGKWATQFSISIGTGVRESFSVRFDTWDDVTEEAKKLVK---------
        ++++  +T  +S  T+ T R +R   R + L+ +V I+G+IP+TI P  +KP+   A +FS +IG  VR++F VR   W +V  E  ++VK         
Subjt:  AANLPSSTGDTSDGTSMTGRNKRGYGRNILLDNYVTIHGKIPITIDPEVKKPVGKWATQFSISIGTGVRESFSVRFDTWDDVTEEAKKLVK---------

Query:  -------------------------------------------SRLLESSESNKRSRSMLAFNHKAGSKSFINIQHEMKEKEGRDIDPIELFELTHSKNG
                                                   SR  + S +NK +R    +NH +GSKSF+  Q+E+ E+ G+ +D +ELF  TH + G
Subjt:  -------------------------------------------SRLLESSESNKRSRSMLAFNHKAGSKSFINIQHEMKEKEGRDIDPIELFELTHSKNG

Query:  KWVNQAAEDAHGKMVDLRDAPIPEGSQPLSKPLICETVLGKRPGHVKGMGWGPKPSRSNTSGNTSLSSGPTQRELEQQEEINTLKAQYENIQKELQTSKQ
         +V+QAAEDAH +M++L+  P PEGSQPLS+  IC+ VLG+RPG+ KG+GWGPKP    T+   S SS  T      Q+EI  L+A+     + ++   +
Subjt:  KWVNQAAEDAHGKMVDLRDAPIPEGSQPLSKPLICETVLGKRPGHVKGMGWGPKPSRSNTSGNTSLSSGPTQRELEQQEEINTLKAQYENIQKELQTSKQ

Query:  SAQKTIEEVNVLKDMIVVLPR
        + Q    +V  +K MI  L R
Subjt:  SAQKTIEEVNVLKDMIVVLPR

A0A5A7VAE5 CACTA en-spm transposon protein7.6e-4036.42Show/hide
Query:  AANLPSSTGDTSDGTSMTGRNKRGYGRNILLDNYVTIHGKIPITIDPEVKKPVGKWATQFSISIGTGVRESFSVRFDTWDDVTEEAKKLVKSRL------
        ++++  +TG +S  T+ T R +R   R + L+ +V I+G+IP+ I P  +KP+     +FS +IG  VR++F VR   W DV  E  ++VK  L      
Subjt:  AANLPSSTGDTSDGTSMTGRNKRGYGRNILLDNYVTIHGKIPITIDPEVKKPVGKWATQFSISIGTGVRESFSVRFDTWDDVTEEAKKLVKSRL------

Query:  ---------------------------LESSESNKRSRSMLAFNHKAGSKSFINIQHEMKEKEGRDIDPIELFELTHSKNGKWVNQAAEDAHGKMVDLRD
                                   LE S +NK +R   ++NH +GSKSF+  Q+E+ E+ G+ +D +ELF  TH + G +V QA EDAH +M++L+ 
Subjt:  ---------------------------LESSESNKRSRSMLAFNHKAGSKSFINIQHEMKEKEGRDIDPIELFELTHSKNGKWVNQAAEDAHGKMVDLRD

Query:  APIPEGSQPLSKPLICETVLGKRPGHVKGMGWGPKPSRSNTSGNTSLSSGPTQRELEQQEEINTLKAQYENIQKELQTSKQSAQKTIEEVNVLKDMIVVL
         PIPEGSQPLS+  IC+ VLGKRPG+ KG+ WGPKP    T   TS SS  T      ++EI  L+A+     + ++   ++ Q    +V  +K MI  L
Subjt:  APIPEGSQPLSKPLICETVLGKRPGHVKGMGWGPKPSRSNTSGNTSLSSGPTQRELEQQEEINTLKAQYENIQKELQTSKQSAQKTIEEVNVLKDMIVVL

Query:  PR
         R
Subjt:  PR

A0A5D3BJR1 CACTA en-spm transposon protein8.3e-3939.44Show/hide
Query:  ETDGEYV--EVPAPPAANLPSSTGD-----TSDGTSMTGRNKRGYGRNILLDNYVTIHGKIPITIDPEVKKPVGKWATQFSISIGTGVRESFSVRFDTWD
        ETD  ++  EV     A   SS GD     +S  T+ T R +R + R + L+ +V I+ +I +TI P  +K +   A +F+ +IG  +R++F +R     
Subjt:  ETDGEYV--EVPAPPAANLPSSTGD-----TSDGTSMTGRNKRGYGRNILLDNYVTIHGKIPITIDPEVKKPVGKWATQFSISIGTGVRESFSVRFDTWD

Query:  DVTEEAKKLVKSRLLESSESNKRSRSMLAFNHKAGSKSFINIQHEMKEKEGRDIDPIELFELTHSKNGKWVNQAAEDAHGKMVDLRDAPIPEGSQPLSKP
        DV  E  ++VK  L E S +NK +R    +NH +GSKSF+  Q+E+ E++G  +D +ELF  TH + G +V+QA EDAH +M++L+  P PEGSQPLS+ 
Subjt:  DVTEEAKKLVKSRLLESSESNKRSRSMLAFNHKAGSKSFINIQHEMKEKEGRDIDPIELFELTHSKNGKWVNQAAEDAHGKMVDLRDAPIPEGSQPLSKP

Query:  LICETVLGKRPGHVKGMGWGPKPSRSNTSGNTSLS---SGPTQRELEQQ----EEINTLKAQYENIQKELQTSKQSAQKTIEEV
         IC+ VLG+RPG+ KG+GWGPKP    T+  +S S   S  T++E+E Q    E +  ++ Q  N Q  L +  +S +K IEE+
Subjt:  LICETVLGKRPGHVKGMGWGPKPSRSNTSGNTSLS---SGPTQRELEQQ----EEINTLKAQYENIQKELQTSKQSAQKTIEEV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTTTGCCTATTAGCATGACTGCCCCAGACAGATGGATTTGGCACTATGATTGTAGAGTTAATATGACTTGTCCGGTTTGTCATGAAGAGTTGGAAACCACTGATCA
TGCTCTTTTCAGATGCCAAAGGGCTAGGGAGGTCAACAACAAATTTGTAATGGATGGGGACAATCGTGTGTCTGCAAAAAAGGGAACTGAGCATCAGTGTAATGCTAAGT
ATACGATGGCATCAACATCATCACGCATACACGACGAGACTGATGGAGAGTATGTCGAGGTGCCGGCACCCCCTGCCGCGAACTTACCATCTTCCACGGGGGATACGTCA
GATGGTACATCCATGACTGGGAGGAACAAACGTGGATATGGACGGAATATTCTCTTGGACAACTACGTCACGATACATGGCAAAATTCCAATTACAATCGATCCAGAGGT
GAAAAAACCAGTTGGGAAATGGGCGACGCAATTTAGCATATCTATTGGTACGGGCGTTCGGGAGTCTTTTTCTGTTCGATTTGACACTTGGGATGACGTAACTGAAGAGG
CTAAGAAGCTGGTAAAGTCTCGACTACTGGAATCTTCAGAATCGAATAAAAGGAGTCGATCCATGTTGGCGTTCAACCATAAGGCAGGGTCAAAATCATTTATAAACATT
CAACACGAAATGAAAGAGAAGGAGGGTCGAGACATAGATCCGATTGAACTGTTTGAACTGACCCATTCGAAAAATGGAAAGTGGGTGAACCAGGCAGCGGAAGATGCACA
TGGGAAGATGGTAGATCTACGGGATGCTCCTATTCCAGAAGGGTCTCAACCACTCAGTAAACCTCTCATATGTGAGACGGTTTTGGGTAAACGGCCCGGCCACGTTAAAG
GCATGGGTTGGGGACCTAAGCCATCTCGTTCAAACACGAGTGGCAACACATCACTTAGTTCAGGACCCACGCAGAGAGAACTTGAACAACAAGAAGAGATTAACACCTTG
AAGGCCCAGTACGAAAATATACAGAAGGAGCTCCAGACCAGCAAGCAGTCTGCGCAAAAGACGATAGAGGAAGTAAATGTCCTGAAAGACATGATTGTGGTGTTGCCCAG
GCGCGCGAGTCAACGCGTCGGCAGAAATTGCTCCGCAAAACTTTCTGCCGACGCGTTGACTTCCGCGTCGGGAGAGACCATTTCTCCCGACGCACGGTCAACGCGTAGGC
AGAAAATCGAGTCCATCACTTTCTGCCGACGGGCTGACCACCGCCGCGTCGGCAAAAATGATCTCTACCGACGCGATAGTCAACGTGTCGGTAGAAAGCGCCCATAA
mRNA sequenceShow/hide mRNA sequence
ATGGGTTTGCCTATTAGCATGACTGCCCCAGACAGATGGATTTGGCACTATGATTGTAGAGTTAATATGACTTGTCCGGTTTGTCATGAAGAGTTGGAAACCACTGATCA
TGCTCTTTTCAGATGCCAAAGGGCTAGGGAGGTCAACAACAAATTTGTAATGGATGGGGACAATCGTGTGTCTGCAAAAAAGGGAACTGAGCATCAGTGTAATGCTAAGT
ATACGATGGCATCAACATCATCACGCATACACGACGAGACTGATGGAGAGTATGTCGAGGTGCCGGCACCCCCTGCCGCGAACTTACCATCTTCCACGGGGGATACGTCA
GATGGTACATCCATGACTGGGAGGAACAAACGTGGATATGGACGGAATATTCTCTTGGACAACTACGTCACGATACATGGCAAAATTCCAATTACAATCGATCCAGAGGT
GAAAAAACCAGTTGGGAAATGGGCGACGCAATTTAGCATATCTATTGGTACGGGCGTTCGGGAGTCTTTTTCTGTTCGATTTGACACTTGGGATGACGTAACTGAAGAGG
CTAAGAAGCTGGTAAAGTCTCGACTACTGGAATCTTCAGAATCGAATAAAAGGAGTCGATCCATGTTGGCGTTCAACCATAAGGCAGGGTCAAAATCATTTATAAACATT
CAACACGAAATGAAAGAGAAGGAGGGTCGAGACATAGATCCGATTGAACTGTTTGAACTGACCCATTCGAAAAATGGAAAGTGGGTGAACCAGGCAGCGGAAGATGCACA
TGGGAAGATGGTAGATCTACGGGATGCTCCTATTCCAGAAGGGTCTCAACCACTCAGTAAACCTCTCATATGTGAGACGGTTTTGGGTAAACGGCCCGGCCACGTTAAAG
GCATGGGTTGGGGACCTAAGCCATCTCGTTCAAACACGAGTGGCAACACATCACTTAGTTCAGGACCCACGCAGAGAGAACTTGAACAACAAGAAGAGATTAACACCTTG
AAGGCCCAGTACGAAAATATACAGAAGGAGCTCCAGACCAGCAAGCAGTCTGCGCAAAAGACGATAGAGGAAGTAAATGTCCTGAAAGACATGATTGTGGTGTTGCCCAG
GCGCGCGAGTCAACGCGTCGGCAGAAATTGCTCCGCAAAACTTTCTGCCGACGCGTTGACTTCCGCGTCGGGAGAGACCATTTCTCCCGACGCACGGTCAACGCGTAGGC
AGAAAATCGAGTCCATCACTTTCTGCCGACGGGCTGACCACCGCCGCGTCGGCAAAAATGATCTCTACCGACGCGATAGTCAACGTGTCGGTAGAAAGCGCCCATAA
Protein sequenceShow/hide protein sequence
MGLPISMTAPDRWIWHYDCRVNMTCPVCHEELETTDHALFRCQRAREVNNKFVMDGDNRVSAKKGTEHQCNAKYTMASTSSRIHDETDGEYVEVPAPPAANLPSSTGDTS
DGTSMTGRNKRGYGRNILLDNYVTIHGKIPITIDPEVKKPVGKWATQFSISIGTGVRESFSVRFDTWDDVTEEAKKLVKSRLLESSESNKRSRSMLAFNHKAGSKSFINI
QHEMKEKEGRDIDPIELFELTHSKNGKWVNQAAEDAHGKMVDLRDAPIPEGSQPLSKPLICETVLGKRPGHVKGMGWGPKPSRSNTSGNTSLSSGPTQRELEQQEEINTL
KAQYENIQKELQTSKQSAQKTIEEVNVLKDMIVVLPRRASQRVGRNCSAKLSADALTSASGETISPDARSTRRQKIESITFCRRADHRRVGKNDLYRRDSQRVGRKRP