; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0033750 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0033750
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionUBP1-associated proteins 1C-like isoform X3
Genome locationchr3:1578284..1579936
RNA-Seq ExpressionLag0033750
SyntenyLag0033750
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR003604 - Matrin/U1-C-like, C2H2-type zinc finger
IPR013087 - Zinc finger C2H2-type
IPR036236 - Zinc finger C2H2 superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG2668195.1 hypothetical protein I3760_15G148700 [Carya illinoinensis]3.9e-2932.61Show/hide
Query:  MDFRFRAIHNKP---PAAASSSGSSSDHLLRDDSPN---------------AELVKQRIKEEIMIREIASRRMLEAEIRRELIIEQELAMHRAAGRTERL
        M+F+FRA+  +P   P++++  G  ++  LRD + N                EL K RI+EEI++ EIA R+MLEAE+RREL++E+E+ M R     E L
Subjt:  MDFRFRAIHNKP---PAAASSSGSSSDHLLRDDSPN---------------AELVKQRIKEEIMIREIASRRMLEAEIRRELIIEQELAMHRAAGRTERL

Query:  ALDEHFAMRLLNQRLNHVVDQPSTGLLAVPGFSSSLDCETIRSFSEPRKEEPK-ALEDEKDKLILLPKPDPEKFKAKRKAEGPPAEADTNTEPLIPSISS
        +++      L           P  G LA PG S   D  T+++  E    + K +LE  KDKLILL KP+     AK+KA  PPA + +  +P       
Subjt:  ALDEHFAMRLLNQRLNHVVDQPSTGLLAVPGFSSSLDCETIRSFSEPRKEEPK-ALEDEKDKLILLPKPDPEKFKAKRKAEGPPAEADTNTEPLIPSISS

Query:  KKLAKKEFICTVCKITTTSEITLNTHLKGKKHKAIER--RGLETGEVHKQPSPP---------------------------------------------E
        KK  K+E+ C +CK++ T+E   N HL+GKKHKA E   R    G++     PP                                             E
Subjt:  KKLAKKEFICTVCKITTTSEITLNTHLKGKKHKAIER--RGLETGEVHKQPSPP---------------------------------------------E

Query:  QAPV------------------------PLENKGSFKFWCETCQVGTRSLVVMETHNKGKKHRARLLKLGQ
        Q PV                          + +  FKFWC  CQ+G  S VVM+ H KGKKHRARL  LG+
Subjt:  QAPV------------------------PLENKGSFKFWCETCQVGTRSLVVMETHNKGKKHRARLLKLGQ

XP_008443847.1 PREDICTED: uncharacterized protein LOC103487343 [Cucumis melo]4.0e-5848.73Show/hide
Query:  MDFRFRAIHNKPPAAASSSGSSSDHLLRDDSPNAELVKQRIKEEIMIREIASRRMLEAEIRRELIIEQELAMHRAAGRTE-RLALDEHFAMRLLNQRLNH
        M+FRFRAI NK PA A+   S+SD  ++DDS N EL KQRIKEEI +REI  RRMLEAEIRREL++E+ELA+ RA G+TE  L+ D  F +R +N+ +N 
Subjt:  MDFRFRAIHNKPPAAASSSGSSSDHLLRDDSPNAELVKQRIKEEIMIREIASRRMLEAEIRRELIIEQELAMHRAAGRTE-RLALDEHFAMRLLNQRLNH

Query:  VVD-QPSTGLLAVPGFSSSLDCETIRSFSEPRKEEPKALEDEKDKLILLPKPDPEKFKAKRKAEGPPAEADT--------NTEPLIPS--ISSKKLAKKE
        ++D   ST LLAVPG +SSL+           KEEPK  EDE +KLI L +PDP KF  KRKA G    A            + +IP+  I SKKLAK+E
Subjt:  VVD-QPSTGLLAVPGFSSSLDCETIRSFSEPRKEEPKALEDEKDKLILLPKPDPEKFKAKRKAEGPPAEADT--------NTEPLIPS--ISSKKLAKKE

Query:  FICTVCKITTTSEITLNTHLKGKKHKAIERRGLETGEVHKQPSPPEQ-------APV-------PLENKGSFKFWCETCQVGTRSLVVMETHNKGKKHRA
        F+C++C +  TSEI+ N H+ GKKHKA E R       HK+P+  E+        PV        LE     KF CE C VG   + VM +HN G+KH+A
Subjt:  FICTVCKITTTSEITLNTHLKGKKHKAIERRGLETGEVHKQPSPPEQ-------APV-------PLENKGSFKFWCETCQVGTRSLVVMETHNKGKKHRA

Query:  RLLKLG-QCKLDEQKE
        RLLKL  QCK ++QK+
Subjt:  RLLKLG-QCKLDEQKE

XP_021660819.1 uncharacterized protein LOC110650243 [Hevea brasiliensis]1.8e-2931.87Show/hide
Query:  MDFRFRAIHNKPPAAASSSGS-------------------SSDHLLRDDSPNAELVKQRIKEEIMIREIASRRMLEAEIRRELIIEQELAMHRAAGRTER
        M+F+FRA+  KPP   SSS +                    +  L+R+ +   E+ KQRI+EEI+ +EIA RR+LEAE+RREL++E+E+AM R   R   
Subjt:  MDFRFRAIHNKPPAAASSSGS-------------------SSDHLLRDDSPNAELVKQRIKEEIMIREIASRRMLEAEIRRELIIEQELAMHRAAGRTER

Query:  LALDEHFAMRL--------LNQRLNHVVDQPST----GLL-AVPGFSSSLDCETIRSFSEPRKEEPKALEDEKDKLILLPKPDPEKFKAKRKAEGPPAEA
        L+ +E   MRL        +NQ  N  ++  S     G+    P  S +L    ++  S          ED KDKLI+L KP+     AKRKA  PP + 
Subjt:  LALDEHFAMRL--------LNQRLNHVVDQPST----GLL-AVPGFSSSLDCETIRSFSEPRKEEPKALEDEKDKLILLPKPDPEKFKAKRKAEGPPAEA

Query:  DTNTEPLIPSISSKKLAKKEFICTVCKITTTSEITLNTHLKGKKHKAIERRGLETGEVHKQPSP------------------------------------
               +P    KK+ K+E+ CT+C ++ TSE  LN HL+GK+HKA E R L    V K  SP                                    
Subjt:  DTNTEPLIPSISSKKLAKKEFICTVCKITTTSEITLNTHLKGKKHKAIERRGLETGEVHKQPSP------------------------------------

Query:  -------------------------------------PEQAPVPLENKGSFKFWCETCQVGTRSLVVMETHNKGKKHRARLLKLGQ
                                              E+       K  FKFWCE CQ+G  S VVME H KGKKH+ +L +L Q
Subjt:  -------------------------------------PEQAPVPLENKGSFKFWCETCQVGTRSLVVMETHNKGKKHRARLLKLGQ

XP_022926958.1 uncharacterized protein LOC111433915 [Cucurbita moschata]1.7e-3576.67Show/hide
Query:  MDFRFRAIHNKPPAAASSSGSSSDHLLRDDSPNAELVKQRIKEEIMIREIASRRMLEAEIRRELIIEQELAMHRAAGRTERLALDEHFAMRLLNQRLNHV
        +DFR RA+ NKP  AASSS  SSDH LRDDSPNAELVKQRIKEEI  RE ASRRMLEAEIRRELIIEQEL++ RA GRTE LA DEHFAMR+L+ RLNH+
Subjt:  MDFRFRAIHNKPPAAASSSGSSSDHLLRDDSPNAELVKQRIKEEIMIREIASRRMLEAEIRRELIIEQELAMHRAAGRTERLALDEHFAMRLLNQRLNHV

Query:  VDQPST-GLLAVPGFSSSLD
        VDQ S+ GLLAVPG  SSL+
Subjt:  VDQPST-GLLAVPGFSSSLD

XP_038880353.1 zinc finger protein 385B [Benincasa hispida]3.1e-7455.99Show/hide
Query:  MDFRFRAIHNKPPAAASSSGSSSDHLLRDDSPNAELVKQRIKEEIMIREIASRRMLEAEIRRELIIEQELAMHRAAGRTERLALDEHFAMRLLNQ-RLNH
        MDFRFRA  NK PA A+S+   S   L  DS NAEL+KQR+K+EIMIREIASRRMLEAEIRRELIIEQELA  R  GRTE L  D+ F++RLL+Q R+NH
Subjt:  MDFRFRAIHNKPPAAASSSGSSSDHLLRDDSPNAELVKQRIKEEIMIREIASRRMLEAEIRRELIIEQELAMHRAAGRTERLALDEHFAMRLLNQ-RLNH

Query:  VVDQPSTGLLAVPGFSSSLDCETIRSFSEPRKEEPKALEDEKDKLILLPKPDPEKFKAKRKAEGPPAEADTNTEPLIPSISSKKLAKKEFICTVCKITTT
         +  P  GLL VPG SSS     +     P+ EEPK  +D+K+KLI+LPKPDP KF+ KRKAEG  AE DT+ +     ISSKKLAK+EF+C++C +  T
Subjt:  VVDQPSTGLLAVPGFSSSLDCETIRSFSEPRKEEPKALEDEKDKLILLPKPDPEKFKAKRKAEGPPAEADTNTEPLIPSISSKKLAKKEFICTVCKITTT

Query:  SEITLNTHLKGKKHKAIERRGLETGEVHKQPSPPEQAPVPLE-------------NKGSFKFWCETCQVGTRSLVVMETHNKGKKHRARLLKLG-QCKLD
        SEI+ N HLKGKKH A E R L+T    ++PSP E     L+             NK  FKFWC+ C++GT  + +M +HN GKKH+ARLLKL  Q KLD
Subjt:  SEITLNTHLKGKKHKAIERRGLETGEVHKQPSPPEQAPVPLE-------------NKGSFKFWCETCQVGTRSLVVMETHNKGKKHRARLLKLG-QCKLD

Query:  EQK-EPNGL
        +QK EPN L
Subjt:  EQK-EPNGL

TrEMBL top hitse value%identityAlignment
A0A1S3B9S7 uncharacterized protein LOC1034873432.0e-5848.73Show/hide
Query:  MDFRFRAIHNKPPAAASSSGSSSDHLLRDDSPNAELVKQRIKEEIMIREIASRRMLEAEIRRELIIEQELAMHRAAGRTE-RLALDEHFAMRLLNQRLNH
        M+FRFRAI NK PA A+   S+SD  ++DDS N EL KQRIKEEI +REI  RRMLEAEIRREL++E+ELA+ RA G+TE  L+ D  F +R +N+ +N 
Subjt:  MDFRFRAIHNKPPAAASSSGSSSDHLLRDDSPNAELVKQRIKEEIMIREIASRRMLEAEIRRELIIEQELAMHRAAGRTE-RLALDEHFAMRLLNQRLNH

Query:  VVD-QPSTGLLAVPGFSSSLDCETIRSFSEPRKEEPKALEDEKDKLILLPKPDPEKFKAKRKAEGPPAEADT--------NTEPLIPS--ISSKKLAKKE
        ++D   ST LLAVPG +SSL+           KEEPK  EDE +KLI L +PDP KF  KRKA G    A            + +IP+  I SKKLAK+E
Subjt:  VVD-QPSTGLLAVPGFSSSLDCETIRSFSEPRKEEPKALEDEKDKLILLPKPDPEKFKAKRKAEGPPAEADT--------NTEPLIPS--ISSKKLAKKE

Query:  FICTVCKITTTSEITLNTHLKGKKHKAIERRGLETGEVHKQPSPPEQ-------APV-------PLENKGSFKFWCETCQVGTRSLVVMETHNKGKKHRA
        F+C++C +  TSEI+ N H+ GKKHKA E R       HK+P+  E+        PV        LE     KF CE C VG   + VM +HN G+KH+A
Subjt:  FICTVCKITTTSEITLNTHLKGKKHKAIERRGLETGEVHKQPSPPEQ-------APV-------PLENKGSFKFWCETCQVGTRSLVVMETHNKGKKHRA

Query:  RLLKLG-QCKLDEQKE
        RLLKL  QCK ++QK+
Subjt:  RLLKLG-QCKLDEQKE

A0A2I4F9T4 UBP1-associated proteins 1C-like isoform X27.3e-2933.16Show/hide
Query:  MDFRFRAIHNKP---PAAASSSGSSSDHLLRDDSPNA-----------------ELVKQRIKEEIMIREIASRRMLEAEIRRELIIEQELAMHRAAGRTE
        M+F+FRA+  +P   P+++S  G  ++  LR   PNA                 EL K+RI+EEI++ EIA R+MLEAE+RREL++E+E+ M R     E
Subjt:  MDFRFRAIHNKP---PAAASSSGSSSDHLLRDDSPNA-----------------ELVKQRIKEEIMIREIASRRMLEAEIRRELIIEQELAMHRAAGRTE

Query:  RLALDEHFAMRLLNQRLNHVVDQPSTGLLAVPGFSSSLDCETIRSFSEPRKEEPK-ALEDEKDKLILLPKPDPEKFKAKRKAEGPPAEADTNTEPLIPSI
         L+++          RL  +   P  G LA PG  S  D  T+ +  E    + K +LE  KDKLILL KP+      K+KA  PPA + +  +P     
Subjt:  RLALDEHFAMRLLNQRLNHVVDQPSTGLLAVPGFSSSLDCETIRSFSEPRKEEPK-ALEDEKDKLILLPKPDPEKFKAKRKAEGPPAEADTNTEPLIPSI

Query:  SSKKLAKKEFICTVCKITTTSEITLNTHLKGKKHKAIER--RGLETGEVHKQPSPP--------------------------------------------
          KK  K+E+ C +CK++ T+E   N HL+GKKHKA E   R    G++     PP                                            
Subjt:  SSKKLAKKEFICTVCKITTTSEITLNTHLKGKKHKAIER--RGLETGEVHKQPSPP--------------------------------------------

Query:  -------EQAPV--------PLENKGS-----------------FKFWCETCQVGTRSLVVMETHNKGKKHRARLLKLGQ
               EQ PV          + KG+                 FKFWC  CQ+G  S VVME H KGKKHRARL  LG+
Subjt:  -------EQAPV--------PLENKGS-----------------FKFWCETCQVGTRSLVVMETHNKGKKHRARLLKLGQ

A0A5D3B800 UBP1-associated proteins 1C-like isoform X32.0e-5848.73Show/hide
Query:  MDFRFRAIHNKPPAAASSSGSSSDHLLRDDSPNAELVKQRIKEEIMIREIASRRMLEAEIRRELIIEQELAMHRAAGRTE-RLALDEHFAMRLLNQRLNH
        M+FRFRAI NK PA A+   S+SD  ++DDS N EL KQRIKEEI +REI  RRMLEAEIRREL++E+ELA+ RA G+TE  L+ D  F +R +N+ +N 
Subjt:  MDFRFRAIHNKPPAAASSSGSSSDHLLRDDSPNAELVKQRIKEEIMIREIASRRMLEAEIRRELIIEQELAMHRAAGRTE-RLALDEHFAMRLLNQRLNH

Query:  VVD-QPSTGLLAVPGFSSSLDCETIRSFSEPRKEEPKALEDEKDKLILLPKPDPEKFKAKRKAEGPPAEADT--------NTEPLIPS--ISSKKLAKKE
        ++D   ST LLAVPG +SSL+           KEEPK  EDE +KLI L +PDP KF  KRKA G    A            + +IP+  I SKKLAK+E
Subjt:  VVD-QPSTGLLAVPGFSSSLDCETIRSFSEPRKEEPKALEDEKDKLILLPKPDPEKFKAKRKAEGPPAEADT--------NTEPLIPS--ISSKKLAKKE

Query:  FICTVCKITTTSEITLNTHLKGKKHKAIERRGLETGEVHKQPSPPEQ-------APV-------PLENKGSFKFWCETCQVGTRSLVVMETHNKGKKHRA
        F+C++C +  TSEI+ N H+ GKKHKA E R       HK+P+  E+        PV        LE     KF CE C VG   + VM +HN G+KH+A
Subjt:  FICTVCKITTTSEITLNTHLKGKKHKAIERRGLETGEVHKQPSPPEQ-------APV-------PLENKGSFKFWCETCQVGTRSLVVMETHNKGKKHRA

Query:  RLLKLG-QCKLDEQKE
        RLLKL  QCK ++QK+
Subjt:  RLLKLG-QCKLDEQKE

A0A6J1EJN4 uncharacterized protein LOC1114339158.0e-3676.67Show/hide
Query:  MDFRFRAIHNKPPAAASSSGSSSDHLLRDDSPNAELVKQRIKEEIMIREIASRRMLEAEIRRELIIEQELAMHRAAGRTERLALDEHFAMRLLNQRLNHV
        +DFR RA+ NKP  AASSS  SSDH LRDDSPNAELVKQRIKEEI  RE ASRRMLEAEIRRELIIEQEL++ RA GRTE LA DEHFAMR+L+ RLNH+
Subjt:  MDFRFRAIHNKPPAAASSSGSSSDHLLRDDSPNAELVKQRIKEEIMIREIASRRMLEAEIRRELIIEQELAMHRAAGRTERLALDEHFAMRLLNQRLNHV

Query:  VDQPST-GLLAVPGFSSSLD
        VDQ S+ GLLAVPG  SSL+
Subjt:  VDQPST-GLLAVPGFSSSLD

A0A6J1HN47 uncharacterized protein LOC111465139 isoform X14.3e-2969.17Show/hide
Query:  MDFRFRAIHNKPPAAASSSGSSSDHLLRDDSPNAELVKQRIKEEIMIREIASRRMLEAEIRRELIIEQELAMHRAAGRTERLALDEHFAMRLLNQRLNHV
        MDF  RA+ N+P  AASSS  SSDH LRDDSPNAELVKQRIK +I  REIASRRMLEAE R ELIIEQEL++ RA G TE LA DEHF MR+L+ RLN +
Subjt:  MDFRFRAIHNKPPAAASSSGSSSDHLLRDDSPNAELVKQRIKEEIMIREIASRRMLEAEIRRELIIEQELAMHRAAGRTERLALDEHFAMRLLNQRLNHV

Query:  VDQPSTG-LLAVPGFSSSLD
        VDQ S+  LLA PG  SSL+
Subjt:  VDQPSTG-LLAVPGFSSSLD

SwissProt top hitse value%identityAlignment
Q8VD12 Zinc finger protein 385A9.2e-0528.21Show/hide
Query:  PKPDPEKFKAKRKAEGPPAEADTNTEPLIP--SISSKKLAKKEFICTVCKITTTSEITLNTHLKGKKHKAI--ERRGLETGEVHKQ--PSPPEQAPVPLE
        P   PE  +   K EG      T+    +P  S   ++ AK+   C +CK+   S   L  H KG KHK I   R GL   + + +  P  P +   P +
Subjt:  PKPDPEKFKAKRKAEGPPAEADTNTEPLIP--SISSKKLAKKEFICTVCKITTTSEITLNTHLKGKKHKAI--ERRGLETGEVHKQ--PSPPEQAPVPLE

Query:  NKGSFKFWCETCQVGTRSLVVMETHNKGKKHRARLLKLGQCKLDEQKEPNGLMSCA
        ++    F CE C V   S V ++ H   ++HR  +       L   K+P G    A
Subjt:  NKGSFKFWCETCQVGTRSLVVMETHNKGKKHRARLLKLGQCKLDEQKEPNGLMSCA

Arabidopsis top hitse value%identityAlignment
AT2G24030.1 zinc ion binding;nucleic acid binding8.0e-1226.91Show/hide
Query:  MDFRFRAI-HNKPPAAA------------------SSSGSSSDHLLRDDSPNAELVKQRIKEEIMIREIASRRMLEAEIRRELIIEQELAMHRAAGRTER
        M+FR+RAI +N+PP A                   S  G  S+  +R ++   E+ K++I++EI+I E A +R L AE+ +E+ IE+E+A+ R +     
Subjt:  MDFRFRAI-HNKPPAAA------------------SSSGSSSDHLLRDDSPNAELVKQRIKEEIMIREIASRRMLEAEIRRELIIEQELAMHRAAGRTER

Query:  LALDEHFAM----------------RLLNQRLNHVVDQPSTGLLAVPGFSSSLDCETIRSFSEPRKEEPKALEDEKDKLILLPKPDPEKFKAKRKAEGPP
        ++L+E   M                    Q+ +++    +TG       S  +    ++  SE        LE  K+ LI+L + D    K K  + G  
Subjt:  LALDEHFAM----------------RLLNQRLNHVVDQPSTGLLAVPGFSSSLDCETIRSFSEPRKEEPKALEDEKDKLILLPKPDPEKFKAKRKAEGPP

Query:  AEADTNTEPLIPSISSKKLAKKEFICTV-------CKITTTSEITLNTHLKGKKHKAIERR----GLETGEVHKQPSP-------PEQAPVPLENKGSFK
                P     S  +  K++FI           +     +  LN  L+ K+ KA E       LETGE+     P        ++A   L  + + K
Subjt:  AEADTNTEPLIPSISSKKLAKKEFICTV-------CKITTTSEITLNTHLKGKKHKAIERR----GLETGEVHKQPSP-------PEQAPVPLENKGSFK

Query:  FWCETCQVGTRSLVVMETHNKGKKHRA
        FWCE C+VGT   +VM  H  GKKH+A
Subjt:  FWCETCQVGTRSLVVMETHNKGKKHRA

AT2G24030.2 zinc ion binding;nucleic acid binding2.1e-0430.13Show/hide
Query:  LEDEKDKLILLPKPDPEKFKAKRKAEGPPAEADTNTEPLIPSISSKKLAKKEFICTV-------CKITTTSEITLNTHLKGKKHKAIERR----GLETGE
        LE  K+ LI+L + D    K K  + G          P     S  +  K++FI           +     +  LN  L+ K+ KA E       LETGE
Subjt:  LEDEKDKLILLPKPDPEKFKAKRKAEGPPAEADTNTEPLIPSISSKKLAKKEFICTV-------CKITTTSEITLNTHLKGKKHKAIERR----GLETGE

Query:  VHKQPSP-------PEQAPVPLENKGSFKFWCETCQVGTRSLVVMETHNKGKKHRA
        +     P        ++A   L  + + KFWCE C+VGT   +VM  H  GKKH+A
Subjt:  VHKQPSP-------PEQAPVPLENKGSFKFWCETCQVGTRSLVVMETHNKGKKHRA

AT5G61190.1 putative endonuclease or glycosyl hydrolase with C2H2-type zinc finger domain1.2e-0726.43Show/hide
Query:  ISSKKLAKKEFICTVCKITTTSEITLNTHLKGKKH-------------KAIERRGL--------ETGEVHKQPSPPEQAPVPLENK--------------
        I S+  A  EF+C +C +   S+I  N+HL+GKKH               ++ +G+           E+  Q    ++  V +++K              
Subjt:  ISSKKLAKKEFICTVCKITTTSEITLNTHLKGKKH-------------KAIERRGL--------ETGEVHKQPSPPEQAPVPLENK--------------

Query:  -------GSFKFWCETCQVGTRSLVVMETHNKGKKHRARL
                S K+ C  C VG  S +V ETH +G+KH A L
Subjt:  -------GSFKFWCETCQVGTRSLVVMETHNKGKKHRARL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTTCAGGTTCCGAGCCATCCACAACAAACCGCCGGCCGCCGCTTCCAGTTCCGGAAGCTCCTCCGATCACCTGCTGCGAGATGATTCTCCAAACGCGGAGCTCGT
GAAACAGAGGATTAAAGAAGAGATAATGATCAGAGAGATTGCGAGCCGACGAATGCTCGAGGCGGAGATCAGGAGGGAGCTCATCATTGAGCAAGAACTAGCGATGCATA
GGGCTGCGGGCCGGACGGAGAGGTTAGCATTGGACGAGCATTTTGCCATGCGATTGTTGAACCAGAGGCTGAATCACGTTGTGGATCAGCCTTCCACAGGTCTATTAGCG
GTTCCAGGTTTCAGTTCTTCGCTCGACTGCGAAACGATTCGTTCGTTTTCGGAGCCTCGGAAAGAAGAACCGAAGGCTTTGGAAGATGAAAAGGACAAGTTAATTTTGCT
GCCAAAGCCAGACCCAGAAAAATTCAAAGCGAAGAGGAAAGCCGAGGGTCCACCGGCGGAGGCTGATACCAATACAGAGCCATTAATTCCTTCGATCAGTTCGAAGAAAT
TAGCAAAGAAAGAGTTCATTTGTACAGTGTGCAAGATCACAACAACAAGCGAAATTACACTGAATACACACTTAAAAGGCAAGAAGCACAAGGCCATAGAGAGACGTGGC
CTAGAAACTGGGGAAGTCCACAAGCAACCAAGCCCACCGGAACAGGCACCAGTACCCCTTGAAAACAAGGGCAGCTTCAAATTCTGGTGCGAAACGTGCCAAGTTGGAAC
TCGAAGCTTGGTTGTTATGGAGACACATAACAAGGGGAAGAAGCATAGGGCTCGCCTTTTGAAACTTGGTCAGTGCAAATTGGACGAGCAAAAAGAACCCAATGGGCTTA
TGTCTTGTGCATCCCCAGAAGGAAGAGGAGATGACTCAAATTCGATCACCTTATAA
mRNA sequenceShow/hide mRNA sequence
ATGGATTTCAGGTTCCGAGCCATCCACAACAAACCGCCGGCCGCCGCTTCCAGTTCCGGAAGCTCCTCCGATCACCTGCTGCGAGATGATTCTCCAAACGCGGAGCTCGT
GAAACAGAGGATTAAAGAAGAGATAATGATCAGAGAGATTGCGAGCCGACGAATGCTCGAGGCGGAGATCAGGAGGGAGCTCATCATTGAGCAAGAACTAGCGATGCATA
GGGCTGCGGGCCGGACGGAGAGGTTAGCATTGGACGAGCATTTTGCCATGCGATTGTTGAACCAGAGGCTGAATCACGTTGTGGATCAGCCTTCCACAGGTCTATTAGCG
GTTCCAGGTTTCAGTTCTTCGCTCGACTGCGAAACGATTCGTTCGTTTTCGGAGCCTCGGAAAGAAGAACCGAAGGCTTTGGAAGATGAAAAGGACAAGTTAATTTTGCT
GCCAAAGCCAGACCCAGAAAAATTCAAAGCGAAGAGGAAAGCCGAGGGTCCACCGGCGGAGGCTGATACCAATACAGAGCCATTAATTCCTTCGATCAGTTCGAAGAAAT
TAGCAAAGAAAGAGTTCATTTGTACAGTGTGCAAGATCACAACAACAAGCGAAATTACACTGAATACACACTTAAAAGGCAAGAAGCACAAGGCCATAGAGAGACGTGGC
CTAGAAACTGGGGAAGTCCACAAGCAACCAAGCCCACCGGAACAGGCACCAGTACCCCTTGAAAACAAGGGCAGCTTCAAATTCTGGTGCGAAACGTGCCAAGTTGGAAC
TCGAAGCTTGGTTGTTATGGAGACACATAACAAGGGGAAGAAGCATAGGGCTCGCCTTTTGAAACTTGGTCAGTGCAAATTGGACGAGCAAAAAGAACCCAATGGGCTTA
TGTCTTGTGCATCCCCAGAAGGAAGAGGAGATGACTCAAATTCGATCACCTTATAA
Protein sequenceShow/hide protein sequence
MDFRFRAIHNKPPAAASSSGSSSDHLLRDDSPNAELVKQRIKEEIMIREIASRRMLEAEIRRELIIEQELAMHRAAGRTERLALDEHFAMRLLNQRLNHVVDQPSTGLLA
VPGFSSSLDCETIRSFSEPRKEEPKALEDEKDKLILLPKPDPEKFKAKRKAEGPPAEADTNTEPLIPSISSKKLAKKEFICTVCKITTTSEITLNTHLKGKKHKAIERRG
LETGEVHKQPSPPEQAPVPLENKGSFKFWCETCQVGTRSLVVMETHNKGKKHRARLLKLGQCKLDEQKEPNGLMSCASPEGRGDDSNSITL