; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg007488 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg007488
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionUBP1-associated proteins 1C-like isoform X3
Genome locationscaffold2:1473747..1476673
RNA-Seq ExpressionSpg007488
SyntenySpg007488
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR003604 - Matrin/U1-C-like, C2H2-type zinc finger
IPR013087 - Zinc finger C2H2-type
IPR036236 - Zinc finger C2H2 superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008443847.1 PREDICTED: uncharacterized protein LOC103487343 [Cucumis melo]7.2e-5848.72Show/hide
Query:  FRAIHNKPPPAASSSGSSSDHLLRDDSPNAELVKQRIKEEIMIREIASRRMLEAEIRRELIIEQELAMHRAAGRTE-RLALDEHFAMRLLNQRLNHVVD-
        FRAI NK P  A+   S+SD  ++DDS N EL KQRIKEEI +REI  RRMLEAEIRREL++E+ELA+ RA G+TE  L+ D  F +R +N+ +N ++D 
Subjt:  FRAIHNKPPPAASSSGSSSDHLLRDDSPNAELVKQRIKEEIMIREIASRRMLEAEIRRELIIEQELAMHRAAGRTE-RLALDEHFAMRLLNQRLNHVVD-

Query:  QPSTGLLAVPGFSSSLDSQTIRSFSEPRKEEPKALEDEKDKLILLPKPDPEKFKAKRKAEGPPTEADT--------NTEPLIPS--ISSKKLAKKEFICT
          ST LLAVPG +SSL+           KEEPK  EDE +KLI L +PDP KF  KRKA G    A            + +IP+  I SKKLAK+EF+C+
Subjt:  QPSTGLLAVPGFSSSLDSQTIRSFSEPRKEEPKALEDEKDKLILLPKPDPEKFKAKRKAEGPPTEADT--------NTEPLIPS--ISSKKLAKKEFICT

Query:  MCKITTTSEITLNTHLKGKKHKAKEGRGLETGEVHKQPSPPEQ---------------APPLENKGSFKFWCETCLVGTRSLVVMETHNKGKKHRARLLK
        MC +  TSEI+ N H+ GKKHKAKEGR       HK+P+  E+                P LE     KF CE C VG   + VM +HN G+KH+ARLLK
Subjt:  MCKITTTSEITLNTHLKGKKHKAKEGRGLETGEVHKQPSPPEQ---------------APPLENKGSFKFWCETCLVGTRSLVVMETHNKGKKHRARLLK

Query:  LG-QCKLDEQKE
        L  QCK ++QK+
Subjt:  LG-QCKLDEQKE

XP_010047067.2 uncharacterized protein LOC104436029 [Eucalyptus grandis]1.6e-2837.41Show/hide
Query:  DHLLRDDSPNAELVKQRIKEEIMIREIASRRMLEAEIRRELIIEQELAMHRAA-GRTERLALDEHFAMRLLNQRLNHVVDQPSTGLLAVPGFSSSLDSQT
        D  L  ++   E+ KQ+I+EEI+  E+A R+MLE E+RREL++E+++ +  A  G          F     +  L+ +  +     LA  G ++   S  
Subjt:  DHLLRDDSPNAELVKQRIKEEIMIREIASRRMLEAEIRRELIIEQELAMHRAA-GRTERLALDEHFAMRLLNQRLNHVVDQPSTGLLAVPGFSSSLDSQT

Query:  IRS-FSEPRKEEPKALEDEKDKLILLPKPDPEKFKAKRKAEGPPTEADTNTEPLIPSISSKKLAKKEFICTMCKITTTSEITLNTHLKGKKHKAKEG---
        IRS  + P + +P   E  KD+LI+LP+PDP    AKRKA   P +AD      + + S KK  K+E+ C +C+++ TSE  L  HL+GKKHK KE    
Subjt:  IRS-FSEPRKEEPKALEDEKDKLILLPKPDPEKFKAKRKAEGPPTEADTNTEPLIPSISSKKLAKKEFICTMCKITTTSEITLNTHLKGKKHKAKEG---

Query:  -----RGLETGEVH------KQPSPPEQAPPLEN-----KGSFKFWCETCLVGTRSLVVMETHNKGKKHRARLLKLGQ
              G     V+      K  S P+ A  LEN     K +F+FWC  C +G  S+ VME+H KGKKH ARL +LGQ
Subjt:  -----RGLETGEVH------KQPSPPEQAPPLEN-----KGSFKFWCETCLVGTRSLVVMETHNKGKKHRARLLKLGQ

XP_022926958.1 uncharacterized protein LOC111433915 [Cucurbita moschata]1.1e-3475.41Show/hide
Query:  LIDIWFRAIHNKPPPAASSSGSSSDHLLRDDSPNAELVKQRIKEEIMIREIASRRMLEAEIRRELIIEQELAMHRAAGRTERLALDEHFAMRLLNQRLNH
        L+D   RA+ NKP  AASSS  SSDH LRDDSPNAELVKQRIKEEI  RE ASRRMLEAEIRRELIIEQEL++ RA GRTE LA DEHFAMR+L+ RLNH
Subjt:  LIDIWFRAIHNKPPPAASSSGSSSDHLLRDDSPNAELVKQRIKEEIMIREIASRRMLEAEIRRELIIEQELAMHRAAGRTERLALDEHFAMRLLNQRLNH

Query:  VVDQPST-GLLAVPGFSSSLDS
        +VDQ S+ GLLAVPG  SSL+S
Subjt:  VVDQPST-GLLAVPGFSSSLDS

XP_030456348.1 uncharacterized protein LOC115677338 [Syzygium oleosum]4.6e-2836.05Show/hide
Query:  GIEPRNLIDIWFRAIHNKPPPAASSSGSSSDHLLRDDSPNAELVKQRIKEEIMIREIASRRMLEAEIRRELIIEQELAMHRAAGRTERLALDEHFAMRLL
        G  P     + F    N  P   S   +  D  L  +S   E+ KQ+I+EEI+  E+A R+MLE E+RREL++E+++ + R AG  + L  D   AM   
Subjt:  GIEPRNLIDIWFRAIHNKPPPAASSSGSSSDHLLRDDSPNAELVKQRIKEEIMIREIASRRMLEAEIRRELIIEQELAMHRAAGRTERLALDEHFAMRLL

Query:  NQR----LNHVVDQPSTGLLAVPGFSSSLDSQTIRSFSEPRKEEPKALEDE------KDKLILLPKPDPEKFKAKRKAEGPPTEADTNTEPLIPSISSKK
          R    L+ + ++ + G LA  G +++  S        P    P+A+  E      KD+LI+LP+PDP    AKRKA   P +A       + + S KK
Subjt:  NQR----LNHVVDQPSTGLLAVPGFSSSLDSQTIRSFSEPRKEEPKALEDE------KDKLILLPKPDPEKFKAKRKAEGPPTEADTNTEPLIPSISSKK

Query:  LAKKEFICTMCKITTTSEITLNTHLKGKKHKAKEG--------RGLETGEVH------KQPSPPEQAPPLEN-KGS-----FKFWCETCLVGTRSLVVME
          K+E+ C +C+++ TSE  L  HL+GKKHK KE          G    +VH      K+ S  + A  LEN +GS     F+FWCE C +G  S  VME
Subjt:  LAKKEFICTMCKITTTSEITLNTHLKGKKHKAKEG--------RGLETGEVH------KQPSPPEQAPPLEN-KGS-----FKFWCETCLVGTRSLVVME

Query:  THNKGKKHRARLLKLGQCK
        +H KGKKH A L +LGQ +
Subjt:  THNKGKKHRARLLKLGQCK

XP_038880353.1 zinc finger protein 385B [Benincasa hispida]2.7e-7355.41Show/hide
Query:  IDIWFRAIHNKPPPAASSSGSSSDHLLRDDSPNAELVKQRIKEEIMIREIASRRMLEAEIRRELIIEQELAMHRAAGRTERLALDEHFAMRLLNQ-RLNH
        +D  FRA  NK P  A+S+   S   L  DS NAEL+KQR+K+EIMIREIASRRMLEAEIRRELIIEQELA  R  GRTE L  D+ F++RLL+Q R+NH
Subjt:  IDIWFRAIHNKPPPAASSSGSSSDHLLRDDSPNAELVKQRIKEEIMIREIASRRMLEAEIRRELIIEQELAMHRAAGRTERLALDEHFAMRLLNQ-RLNH

Query:  VVDQPSTGLLAVPGFSSSLDSQTIRSFSEPRKEEPKALEDEKDKLILLPKPDPEKFKAKRKAEGPPTEADTNTEPLIPSISSKKLAKKEFICTMCKITTT
         +  P  GLL VPG SSS     +     P+ EEPK  +D+K+KLI+LPKPDP KF+ KRKAEG   E DT+ +     ISSKKLAK+EF+C+MC +  T
Subjt:  VVDQPSTGLLAVPGFSSSLDSQTIRSFSEPRKEEPKALEDEKDKLILLPKPDPEKFKAKRKAEGPPTEADTNTEPLIPSISSKKLAKKEFICTMCKITTT

Query:  SEITLNTHLKGKKHKAKEGRGLETGE----------VHKQPSPPEQAPPLENKGSFKFWCETCLVGTRSLVVMETHNKGKKHRARLLKLG-QCKLDEQK-
        SEI+ N HLKGKKH AKEGR L+T E          +  Q    ++   L+NK  FKFWC+ C +GT  + +M +HN GKKH+ARLLKL  Q KLD+QK 
Subjt:  SEITLNTHLKGKKHKAKEGRGLETGE----------VHKQPSPPEQAPPLENKGSFKFWCETCLVGTRSLVVMETHNKGKKHRARLLKLG-QCKLDEQK-

Query:  EPNGL
        EPN L
Subjt:  EPNGL

TrEMBL top hitse value%identityAlignment
A0A059CKW4 Uncharacterized protein4.9e-2837.05Show/hide
Query:  DHLLRDDSPNAELVKQRIKEEIMIREIASRRMLEAEIRRELIIEQELAMHRAA-GRTERLALDEHFAMRLLNQRLNHVVDQPSTGLLAVPGFSSSLDSQT
        D  L  ++   E+ KQ+I+EEI+  E+A R+MLE E+RREL++E+++ + RA  G          F     +  L+ +  +     LA  G ++   S  
Subjt:  DHLLRDDSPNAELVKQRIKEEIMIREIASRRMLEAEIRRELIIEQELAMHRAA-GRTERLALDEHFAMRLLNQRLNHVVDQPSTGLLAVPGFSSSLDSQT

Query:  IRS-FSEPRKEEPKALEDEKDKLILLPKPDPEKFKAKRKAEGPPTEADTNTEPLIPSISSKKLAKKEFICTMCKITTTSEITLNTHLKGKKHKAKEG---
        IRS  + P + +P   E  KD+LI+LP+PDP    AKRKA     +AD      + + S KK  K+E+ C +C+++ TSE  L  HL+GKKHK KE    
Subjt:  IRS-FSEPRKEEPKALEDEKDKLILLPKPDPEKFKAKRKAEGPPTEADTNTEPLIPSISSKKLAKKEFICTMCKITTTSEITLNTHLKGKKHKAKEG---

Query:  -----RGLETGEVH------KQPSPPEQAPPLEN-----KGSFKFWCETCLVGTRSLVVMETHNKGKKHRARLLKLGQ
              G     V+      K  S P+    LEN     K +F+FWC  C +G  S+ VME+H KGKKH ARL +LGQ
Subjt:  -----RGLETGEVH------KQPSPPEQAPPLEN-----KGSFKFWCETCLVGTRSLVVMETHNKGKKHRARLLKLGQ

A0A1S3B9S7 uncharacterized protein LOC1034873433.5e-5848.72Show/hide
Query:  FRAIHNKPPPAASSSGSSSDHLLRDDSPNAELVKQRIKEEIMIREIASRRMLEAEIRRELIIEQELAMHRAAGRTE-RLALDEHFAMRLLNQRLNHVVD-
        FRAI NK P  A+   S+SD  ++DDS N EL KQRIKEEI +REI  RRMLEAEIRREL++E+ELA+ RA G+TE  L+ D  F +R +N+ +N ++D 
Subjt:  FRAIHNKPPPAASSSGSSSDHLLRDDSPNAELVKQRIKEEIMIREIASRRMLEAEIRRELIIEQELAMHRAAGRTE-RLALDEHFAMRLLNQRLNHVVD-

Query:  QPSTGLLAVPGFSSSLDSQTIRSFSEPRKEEPKALEDEKDKLILLPKPDPEKFKAKRKAEGPPTEADT--------NTEPLIPS--ISSKKLAKKEFICT
          ST LLAVPG +SSL+           KEEPK  EDE +KLI L +PDP KF  KRKA G    A            + +IP+  I SKKLAK+EF+C+
Subjt:  QPSTGLLAVPGFSSSLDSQTIRSFSEPRKEEPKALEDEKDKLILLPKPDPEKFKAKRKAEGPPTEADT--------NTEPLIPS--ISSKKLAKKEFICT

Query:  MCKITTTSEITLNTHLKGKKHKAKEGRGLETGEVHKQPSPPEQ---------------APPLENKGSFKFWCETCLVGTRSLVVMETHNKGKKHRARLLK
        MC +  TSEI+ N H+ GKKHKAKEGR       HK+P+  E+                P LE     KF CE C VG   + VM +HN G+KH+ARLLK
Subjt:  MCKITTTSEITLNTHLKGKKHKAKEGRGLETGEVHKQPSPPEQ---------------APPLENKGSFKFWCETCLVGTRSLVVMETHNKGKKHRARLLK

Query:  LG-QCKLDEQKE
        L  QCK ++QK+
Subjt:  LG-QCKLDEQKE

A0A5D3B800 UBP1-associated proteins 1C-like isoform X33.5e-5848.72Show/hide
Query:  FRAIHNKPPPAASSSGSSSDHLLRDDSPNAELVKQRIKEEIMIREIASRRMLEAEIRRELIIEQELAMHRAAGRTE-RLALDEHFAMRLLNQRLNHVVD-
        FRAI NK P  A+   S+SD  ++DDS N EL KQRIKEEI +REI  RRMLEAEIRREL++E+ELA+ RA G+TE  L+ D  F +R +N+ +N ++D 
Subjt:  FRAIHNKPPPAASSSGSSSDHLLRDDSPNAELVKQRIKEEIMIREIASRRMLEAEIRRELIIEQELAMHRAAGRTE-RLALDEHFAMRLLNQRLNHVVD-

Query:  QPSTGLLAVPGFSSSLDSQTIRSFSEPRKEEPKALEDEKDKLILLPKPDPEKFKAKRKAEGPPTEADT--------NTEPLIPS--ISSKKLAKKEFICT
          ST LLAVPG +SSL+           KEEPK  EDE +KLI L +PDP KF  KRKA G    A            + +IP+  I SKKLAK+EF+C+
Subjt:  QPSTGLLAVPGFSSSLDSQTIRSFSEPRKEEPKALEDEKDKLILLPKPDPEKFKAKRKAEGPPTEADT--------NTEPLIPS--ISSKKLAKKEFICT

Query:  MCKITTTSEITLNTHLKGKKHKAKEGRGLETGEVHKQPSPPEQ---------------APPLENKGSFKFWCETCLVGTRSLVVMETHNKGKKHRARLLK
        MC +  TSEI+ N H+ GKKHKAKEGR       HK+P+  E+                P LE     KF CE C VG   + VM +HN G+KH+ARLLK
Subjt:  MCKITTTSEITLNTHLKGKKHKAKEGRGLETGEVHKQPSPPEQ---------------APPLENKGSFKFWCETCLVGTRSLVVMETHNKGKKHRARLLK

Query:  LG-QCKLDEQKE
        L  QCK ++QK+
Subjt:  LG-QCKLDEQKE

A0A6J1EJN4 uncharacterized protein LOC1114339155.4e-3575.41Show/hide
Query:  LIDIWFRAIHNKPPPAASSSGSSSDHLLRDDSPNAELVKQRIKEEIMIREIASRRMLEAEIRRELIIEQELAMHRAAGRTERLALDEHFAMRLLNQRLNH
        L+D   RA+ NKP  AASSS  SSDH LRDDSPNAELVKQRIKEEI  RE ASRRMLEAEIRRELIIEQEL++ RA GRTE LA DEHFAMR+L+ RLNH
Subjt:  LIDIWFRAIHNKPPPAASSSGSSSDHLLRDDSPNAELVKQRIKEEIMIREIASRRMLEAEIRRELIIEQELAMHRAAGRTERLALDEHFAMRLLNQRLNH

Query:  VVDQPST-GLLAVPGFSSSLDS
        +VDQ S+ GLLAVPG  SSL+S
Subjt:  VVDQPST-GLLAVPGFSSSLDS

A0A6J1HN47 uncharacterized protein LOC111465139 isoform X12.9e-2867.77Show/hide
Query:  IDIWFRAIHNKPPPAASSSGSSSDHLLRDDSPNAELVKQRIKEEIMIREIASRRMLEAEIRRELIIEQELAMHRAAGRTERLALDEHFAMRLLNQRLNHV
        +D   RA+ N+P  AASSS  SSDH LRDDSPNAELVKQRIK +I  REIASRRMLEAE R ELIIEQEL++ RA G TE LA DEHF MR+L+ RLN +
Subjt:  IDIWFRAIHNKPPPAASSSGSSSDHLLRDDSPNAELVKQRIKEEIMIREIASRRMLEAEIRRELIIEQELAMHRAAGRTERLALDEHFAMRLLNQRLNHV

Query:  VDQPSTG-LLAVPGFSSSLDS
        VDQ S+  LLA PG  SSL+S
Subjt:  VDQPSTG-LLAVPGFSSSLDS

SwissProt top hitse value%identityAlignment
Q8VD12 Zinc finger protein 385A1.6e-0427.45Show/hide
Query:  PKPDPEKFKAKRKAEGPPTEADTNTEPLIP--SISSKKLAKKEFICTMCKITTTSEITLNTHLKGKKHKA--KEGRGLETGEVHKQPSPPEQAPPLENKG
        P   PE  +   K EG      T+    +P  S   ++ AK+   C +CK+   S   L  H KG KHK   +   GL   + + +  PP    P     
Subjt:  PKPDPEKFKAKRKAEGPPTEADTNTEPLIP--SISSKKLAKKEFICTMCKITTTSEITLNTHLKGKKHKA--KEGRGLETGEVHKQPSPPEQAPPLENKG

Query:  SFKFWCETCLVGTRSLVVMETHNKGKKHRARLLKLGQCKLDEQKEPNGLMSCA
           F CE C V   S V ++ H   ++HR  +       L   K+P G    A
Subjt:  SFKFWCETCLVGTRSLVVMETHNKGKKHRARLLKLGQCKLDEQKEPNGLMSCA

Arabidopsis top hitse value%identityAlignment
AT2G24030.1 zinc ion binding;nucleic acid binding2.2e-1228.75Show/hide
Query:  FRAI-HNKPPPAA------------------SSSGSSSDHLLRDDSPNAELVKQRIKEEIMIREIASRRMLEAEIRRELIIEQELAMHRAAGRTERLALD
        +RAI +N+PPPA                   S  G  S+  +R ++   E+ K++I++EI+I E A +R L AE+ +E+ IE+E+A+ R +     ++L+
Subjt:  FRAI-HNKPPPAA------------------SSSGSSSDHLLRDDSPNAELVKQRIKEEIMIREIASRRMLEAEIRRELIIEQELAMHRAAGRTERLALD

Query:  EHFAM-----RLLNQ---RLNHVVDQPST---GLLAVPGFSSSLDSQTIRSFSEPRKEE---PKALEDEKDKLILLPKPDPEKFKAKRKAEGPPTEADTN
        E   M     +L NQ   + N+   Q  +    L+    ++S L S  ++     +  E      LE  K+ LI+L + D    K K  + G        
Subjt:  EHFAM-----RLLNQ---RLNHVVDQPST---GLLAVPGFSSSLDSQTIRSFSEPRKEE---PKALEDEKDKLILLPKPDPEKFKAKRKAEGPPTEADTN

Query:  TEPLIPSISSKKLAKKEFICTMCKITTTSE--------ITLNTHLKGKKHKAKEGR----GLETGEV--HKQPSPPEQAPPLENKGSF----KFWCETCL
          P     S  +  K++FI  + +     E          LN  L+ K+ KAKE       LETGE+   K P   +     + +G      KFWCE C 
Subjt:  TEPLIPSISSKKLAKKEFICTMCKITTTSE--------ITLNTHLKGKKHKAKEGR----GLETGEV--HKQPSPPEQAPPLENKGSF----KFWCETCL

Query:  VGTRSLVVMETHNKGKKHRA
        VGT   +VM  H  GKKH+A
Subjt:  VGTRSLVVMETHNKGKKHRA

AT2G24030.2 zinc ion binding;nucleic acid binding2.6e-0527.69Show/hide
Query:  LIIEQELAMHRAAGRTERLALDEHFAM-----RLLNQ---RLNHVVDQPST---GLLAVPGFSSSLDSQTIRSFSEPRKEE---PKALEDEKDKLILLPK
        + IE+E+A+ R +     ++L+E   M     +L NQ   + N+   Q  +    L+    ++S L S  ++     +  E      LE  K+ LI+L +
Subjt:  LIIEQELAMHRAAGRTERLALDEHFAM-----RLLNQ---RLNHVVDQPST---GLLAVPGFSSSLDSQTIRSFSEPRKEE---PKALEDEKDKLILLPK

Query:  PDPEKFKAKRKAEGPPTEADTNTEPLIPSISSKKLAKKEFICTMCKITTTSE--------ITLNTHLKGKKHKAKEGR----GLETGEV--HKQPSPPEQ
         D    K K  + G          P     S  +  K++FI  + +     E          LN  L+ K+ KAKE       LETGE+   K P   + 
Subjt:  PDPEKFKAKRKAEGPPTEADTNTEPLIPSISSKKLAKKEFICTMCKITTTSE--------ITLNTHLKGKKHKAKEGR----GLETGEV--HKQPSPPEQ

Query:  APPLENKGSF----KFWCETCLVGTRSLVVMETHNKGKKHRA
            + +G      KFWCE C VGT   +VM  H  GKKH+A
Subjt:  APPLENKGSF----KFWCETCLVGTRSLVVMETHNKGKKHRA

AT5G61190.1 putative endonuclease or glycosyl hydrolase with C2H2-type zinc finger domain2.1e-0730.07Show/hide
Query:  ISSKKLAKKEFICTMCKITTTSEITLNTHLKGKKH---------------------------------------KAKEGRGL-------ETGEVHKQPSP
        I S+  A  EF+C MC +   S+I  N+HL+GKKH                                       KA+E + L       E G+  K    
Subjt:  ISSKKLAKKEFICTMCKITTTSEITLNTHLKGKKH---------------------------------------KAKEGRGL-------ETGEVHKQPSP

Query:  PEQAPPLENKGSFKFWCETCLVGTRSLVVMETHNKGKKHRARL
        P++   L N  S K+ C  C VG  S +V ETH +G+KH A L
Subjt:  PEQAPPLENKGSFKFWCETCLVGTRSLVVMETHNKGKKHRARL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGGGGTTCCGAATTCTCGAAGGGGGAATCGAGCCACGTAATTTGATCGATATTTGGTTCCGAGCCATCCACAACAAACCGCCGCCCGCCGCTTCCAGTTCCGGAAG
CTCCTCCGATCACCTGCTGCGAGATGATTCTCCAAACGCGGAGCTCGTGAAACAGAGGATTAAAGAAGAGATAATGATCAGAGAGATTGCGAGCCGACGAATGCTCGAGG
CGGAGATCAGGAGGGAGCTCATCATCGAGCAAGAACTAGCGATGCATAGGGCTGCGGGCCGGACGGAGAGGTTAGCATTGGACGAACATTTTGCAATGCGATTGTTGAAC
CAGAGGCTGAATCACGTTGTGGATCAGCCTTCCACAGGTCTATTAGCGGTTCCAGGTTTCAGTTCTTCGCTCGACTCCCAAACGATTCGTTCGTTTTCGGAGCCTCGAAA
AGAAGAACCGAAGGCTTTGGAAGATGAAAAGGACAAGTTAATTTTGCTGCCAAAGCCAGACCCAGAAAAATTCAAAGCGAAGAGGAAAGCCGAGGGTCCACCGACGGAGG
CTGATACCAATACAGAGCCATTAATTCCTTCGATTAGTTCGAAGAAATTAGCAAAGAAAGAGTTCATTTGTACAATGTGCAAAATCACAACAACAAGCGAAATTACACTG
AATACACACTTAAAAGGCAAGAAGCACAAGGCCAAAGAGGGACGTGGCCTAGAAACTGGGGAAGTCCACAAGCAACCAAGCCCACCGGAACAGGCACCACCCCTTGAAAA
CAAGGGCAGCTTCAAATTCTGGTGCGAAACGTGCCTAGTTGGAACTCGAAGCTTGGTTGTTATGGAGACACATAACAAGGGGAAGAAGCATAGGGCTCGCCTTTTGAAAC
TTGGTCAGTGCAAATTGGACGAGCAAAAGGAACCGAATGGGCTTATGTCTTGTGCATCCCCAGAAGGAAGAGGAGATGACTCAAATTCCATCACCTTATAA
mRNA sequenceShow/hide mRNA sequence
ATGTTGGGGTTCCGAATTCTCGAAGGGGGAATCGAGCCACGTAATTTGATCGATATTTGGTTCCGAGCCATCCACAACAAACCGCCGCCCGCCGCTTCCAGTTCCGGAAG
CTCCTCCGATCACCTGCTGCGAGATGATTCTCCAAACGCGGAGCTCGTGAAACAGAGGATTAAAGAAGAGATAATGATCAGAGAGATTGCGAGCCGACGAATGCTCGAGG
CGGAGATCAGGAGGGAGCTCATCATCGAGCAAGAACTAGCGATGCATAGGGCTGCGGGCCGGACGGAGAGGTTAGCATTGGACGAACATTTTGCAATGCGATTGTTGAAC
CAGAGGCTGAATCACGTTGTGGATCAGCCTTCCACAGGTCTATTAGCGGTTCCAGGTTTCAGTTCTTCGCTCGACTCCCAAACGATTCGTTCGTTTTCGGAGCCTCGAAA
AGAAGAACCGAAGGCTTTGGAAGATGAAAAGGACAAGTTAATTTTGCTGCCAAAGCCAGACCCAGAAAAATTCAAAGCGAAGAGGAAAGCCGAGGGTCCACCGACGGAGG
CTGATACCAATACAGAGCCATTAATTCCTTCGATTAGTTCGAAGAAATTAGCAAAGAAAGAGTTCATTTGTACAATGTGCAAAATCACAACAACAAGCGAAATTACACTG
AATACACACTTAAAAGGCAAGAAGCACAAGGCCAAAGAGGGACGTGGCCTAGAAACTGGGGAAGTCCACAAGCAACCAAGCCCACCGGAACAGGCACCACCCCTTGAAAA
CAAGGGCAGCTTCAAATTCTGGTGCGAAACGTGCCTAGTTGGAACTCGAAGCTTGGTTGTTATGGAGACACATAACAAGGGGAAGAAGCATAGGGCTCGCCTTTTGAAAC
TTGGTCAGTGCAAATTGGACGAGCAAAAGGAACCGAATGGGCTTATGTCTTGTGCATCCCCAGAAGGAAGAGGAGATGACTCAAATTCCATCACCTTATAA
Protein sequenceShow/hide protein sequence
MLGFRILEGGIEPRNLIDIWFRAIHNKPPPAASSSGSSSDHLLRDDSPNAELVKQRIKEEIMIREIASRRMLEAEIRRELIIEQELAMHRAAGRTERLALDEHFAMRLLN
QRLNHVVDQPSTGLLAVPGFSSSLDSQTIRSFSEPRKEEPKALEDEKDKLILLPKPDPEKFKAKRKAEGPPTEADTNTEPLIPSISSKKLAKKEFICTMCKITTTSEITL
NTHLKGKKHKAKEGRGLETGEVHKQPSPPEQAPPLENKGSFKFWCETCLVGTRSLVVMETHNKGKKHRARLLKLGQCKLDEQKEPNGLMSCASPEGRGDDSNSITL