CuGenDBv2

Tan0001591 (gene) of Snake gourd v1 genome

Gene ID: Tan0001591
Organism: Trichosanthes anguina (Snake gourd v1)
Description: RING/U-box superfamily protein
Genome location: LG03:76926522..76928053
RNA-Seq Expression: Tan0001591
Synteny: Tan0001591
Gene Ontology terms: NA
InterPro domains: IPR001841 - Zinc finger, RING-type
                  IPR013083 - Zinc finger, RING/FYVE/PHD-type
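The genome location field above follows a `sequence:start..end` pattern. A minimal Python sketch for splitting such a string and computing the span is given here. It assumes 1-based inclusive coordinates, which this page does not state, and the names `Location` and `parse_location` are hypothetical helpers, not part of any CuGenDB tool.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed genome location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumes 1-based inclusive coordinates; the page does not confirm this.
        return self.end - self.start + 1


# Matches strings such as "LG03:76926522..76928053".
_LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Location:
    """Split a 'seqid:start..end' string into its three parts."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return Location(match["seqid"], start, end)


if __name__ == "__main__":
    loc = parse_location("LG03:76926522..76928053")
    print(loc.seqid, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption, the span is the length of the genomic region, which can be longer than the mRNA when the gene contains introns.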


Homology
GenBank top hits (e value, % identity, alignment)
KAG7026653.1 putative E3 ubiquitin-protein ligase LUL4, partial [Cucurbita argyrosperma subsp. argyrosperma] | e value: 4.4e-96 | % identity: 83.94
Query:  MDGVDGGRR--RRSLKQRLGFKVMGCCGATWGFRPATVSVRGGDDD---DDRRVPDGEVMSTRRFPEERELDRGCLSPFSVSSPAPSGMNLATALAAERR
        MDG DGGRR  RRSLKQRLGFKVM CCGATWGFRPATVSVRGGDDD   DDR  PD EVM+T +FPEERELD  CLSPF++SSPA SGMNLATALAAERR
Subjt:  MDGVDGGRR--RRSLKQRLGFKVMGCCGATWGFRPATVSVRGGDDD---DDRRVPDGEVMSTRRFPEERELDRGCLSPFSVSSPAPSGMNLATALAAERR

Query:  LRASPRGAEGDFVESNNHDYDSLAGMMGNGTPLRVSLMRLLEETDGNGGIGGNLGVAERKREEAGNDSVCCVCMGRKKGAAFIPCGHTFCRVCSRELWLN
        LR SPRG E  FVES+  ++DS AGMMG+GTPLRVSLMRLLEETDG GG GGNLGV E+K EEA NDSVCCVCMGRKKGAAFIPCGHTFCRVCSRELWLN
Subjt:  LRASPRGAEGDFVESNNHDYDSLAGMMGNGTPLRVSLMRLLEETDGNGGIGGNLGVAERKREEAGNDSVCCVCMGRKKGAAFIPCGHTFCRVCSRELWLN

Query:  RGSCPLCNRPILEILDIF
        RGSCPLCNRPILEILDIF
Subjt:  RGSCPLCNRPILEILDIF

XP_008440052.1 PREDICTED: uncharacterized protein LOC103484644 [Cucumis melo] | e value: 2.6e-96 | % identity: 84.19
Query:  MDGVD-GGRRRRSLKQRLGFKVMGCCGATWGFRPATVSVR-GGDDDDDRRVPDGEVMSTRRFPEERELDRGCLSPFSVSSPAPSGMNLATALAAERRLRA
        MDGVD GGRRR SLKQRLGFKVMGCCGATWGFRP +VSVR GG +DDDRRVPD EVM+TRR  EERELDR CLSP SV SP PSGMNLATALAAERRLRA
Subjt:  MDGVD-GGRRRRSLKQRLGFKVMGCCGATWGFRPATVSVR-GGDDDDDRRVPDGEVMSTRRFPEERELDRGCLSPFSVSSPAPSGMNLATALAAERRLRA

Query:  SPRGAEGDFVESNNHDYDSLAGMMGNGTPLRVSLMRLLEETDGNGGIGGNLGVAERKREEAGNDSVCCVCMGRKKGAAFIPCGHTFCRVCSRELWLNRGS
        SPRGAEG  VE NN+D+DS  GMM  GTPLRVSL+RLLEET+G    GGNLGVAE+KREE GNDS+CCVCMGRKKGAAFIPCGHTFCRVCSRELWLNRGS
Subjt:  SPRGAEGDFVESNNHDYDSLAGMMGNGTPLRVSLMRLLEETDGNGGIGGNLGVAERKREEAGNDSVCCVCMGRKKGAAFIPCGHTFCRVCSRELWLNRGS

Query:  CPLCNRPILEILDIF
        CPLCNRPI+EILDIF
Subjt:  CPLCNRPILEILDIF

XP_022926821.1 uncharacterized protein LOC111433822 isoform X1 [Cucurbita moschata] | e value: 2.8e-95 | % identity: 83.49
Query:  MDGVDGGRR--RRSLKQRLGFKVMGCCGATWGFRPATVSVRGG---DDDDDRRVPDGEVMSTRRFPEERELDRGCLSPFSVSSPAPSGMNLATALAAERR
        MDG DGGRR  RRSLKQRLGFKVM CCGATWGFRPATVSVRGG   DDDDDR  PD EVM+T +FPE RELD  CLSPF++SSPA SGMNLATALAAERR
Subjt:  MDGVDGGRR--RRSLKQRLGFKVMGCCGATWGFRPATVSVRGG---DDDDDRRVPDGEVMSTRRFPEERELDRGCLSPFSVSSPAPSGMNLATALAAERR

Query:  LRASPRGAEGDFVESNNHDYDSLAGMMGNGTPLRVSLMRLLEETDGNGGIGGNLGVAERKREEAGNDSVCCVCMGRKKGAAFIPCGHTFCRVCSRELWLN
        LR SPRG E  FVES+  ++DS AGMMG+GTPLRVSLMRLLEETDG GG GGNLGV E+K EEA NDSVCCVCMGRKKGAAFIPCGHTFCRVCSRELWLN
Subjt:  LRASPRGAEGDFVESNNHDYDSLAGMMGNGTPLRVSLMRLLEETDGNGGIGGNLGVAERKREEAGNDSVCCVCMGRKKGAAFIPCGHTFCRVCSRELWLN

Query:  RGSCPLCNRPILEILDIF
        RGSCPLCNRPILEILDIF
Subjt:  RGSCPLCNRPILEILDIF

XP_038883240.1 uncharacterized protein LOC120074251 isoform X1 [Benincasa hispida] | e value: 2.0e-96 | % identity: 85.12
Query:  MDGVD-GGRRRRSLKQRLGFKVMGCCGATWGFRPATVSVR-GGDDDDDRRVPDGEVMSTRRFPEERELDRGCLSPFSVSSPAPSGMNLATALAAERRLRA
        MDGVD GGR+R SLKQRLGFKVMGCCGATWGFR  TVSVR GG DDDDRRVPD EV +TRR  EERELDR CLSP SV SP PSGMNLATALAAERRLRA
Subjt:  MDGVD-GGRRRRSLKQRLGFKVMGCCGATWGFRPATVSVR-GGDDDDDRRVPDGEVMSTRRFPEERELDRGCLSPFSVSSPAPSGMNLATALAAERRLRA

Query:  SPRGAEGDFVESNNHDYDSLAGMMGNGTPLRVSLMRLLEETDGNGGIGGNLGVAERKREEAGNDSVCCVCMGRKKGAAFIPCGHTFCRVCSRELWLNRGS
        SPRG EG  VE NN+D+DSL GMMG GTPLRVSLMRLLEETDG  G  GNLGVAE KREE GNDS+CCVCMGRKKGAAFIPCGHTFCRVCSRELWLNRGS
Subjt:  SPRGAEGDFVESNNHDYDSLAGMMGNGTPLRVSLMRLLEETDGNGGIGGNLGVAERKREEAGNDSVCCVCMGRKKGAAFIPCGHTFCRVCSRELWLNRGS

Query:  CPLCNRPILEILDIF
        CPLCNRPI+EILDIF
Subjt:  CPLCNRPILEILDIF

XP_038883241.1 uncharacterized protein LOC120074251 isoform X2 [Benincasa hispida] | e value: 2.0e-96 | % identity: 85.12
Query:  MDGVD-GGRRRRSLKQRLGFKVMGCCGATWGFRPATVSVR-GGDDDDDRRVPDGEVMSTRRFPEERELDRGCLSPFSVSSPAPSGMNLATALAAERRLRA
        MDGVD GGR+R SLKQRLGFKVMGCCGATWGFR  TVSVR GG DDDDRRVPD EV +TRR  EERELDR CLSP SV SP PSGMNLATALAAERRLRA
Subjt:  MDGVD-GGRRRRSLKQRLGFKVMGCCGATWGFRPATVSVR-GGDDDDDRRVPDGEVMSTRRFPEERELDRGCLSPFSVSSPAPSGMNLATALAAERRLRA

Query:  SPRGAEGDFVESNNHDYDSLAGMMGNGTPLRVSLMRLLEETDGNGGIGGNLGVAERKREEAGNDSVCCVCMGRKKGAAFIPCGHTFCRVCSRELWLNRGS
        SPRG EG  VE NN+D+DSL GMMG GTPLRVSLMRLLEETDG  G  GNLGVAE KREE GNDS+CCVCMGRKKGAAFIPCGHTFCRVCSRELWLNRGS
Subjt:  SPRGAEGDFVESNNHDYDSLAGMMGNGTPLRVSLMRLLEETDGNGGIGGNLGVAERKREEAGNDSVCCVCMGRKKGAAFIPCGHTFCRVCSRELWLNRGS

Query:  CPLCNRPILEILDIF
        CPLCNRPI+EILDIF
Subjt:  CPLCNRPILEILDIF

TrEMBL top hits (e value, % identity, alignment)
A0A1S3AZT0 uncharacterized protein LOC103484644 | e value: 1.2e-96 | % identity: 84.19
Query:  MDGVD-GGRRRRSLKQRLGFKVMGCCGATWGFRPATVSVR-GGDDDDDRRVPDGEVMSTRRFPEERELDRGCLSPFSVSSPAPSGMNLATALAAERRLRA
        MDGVD GGRRR SLKQRLGFKVMGCCGATWGFRP +VSVR GG +DDDRRVPD EVM+TRR  EERELDR CLSP SV SP PSGMNLATALAAERRLRA
Subjt:  MDGVD-GGRRRRSLKQRLGFKVMGCCGATWGFRPATVSVR-GGDDDDDRRVPDGEVMSTRRFPEERELDRGCLSPFSVSSPAPSGMNLATALAAERRLRA

Query:  SPRGAEGDFVESNNHDYDSLAGMMGNGTPLRVSLMRLLEETDGNGGIGGNLGVAERKREEAGNDSVCCVCMGRKKGAAFIPCGHTFCRVCSRELWLNRGS
        SPRGAEG  VE NN+D+DS  GMM  GTPLRVSL+RLLEET+G    GGNLGVAE+KREE GNDS+CCVCMGRKKGAAFIPCGHTFCRVCSRELWLNRGS
Subjt:  SPRGAEGDFVESNNHDYDSLAGMMGNGTPLRVSLMRLLEETDGNGGIGGNLGVAERKREEAGNDSVCCVCMGRKKGAAFIPCGHTFCRVCSRELWLNRGS

Query:  CPLCNRPILEILDIF
        CPLCNRPI+EILDIF
Subjt:  CPLCNRPILEILDIF

A0A6J1EFY3 uncharacterized protein LOC111433822 isoform X1 | e value: 1.4e-95 | % identity: 83.49
Query:  MDGVDGGRR--RRSLKQRLGFKVMGCCGATWGFRPATVSVRGG---DDDDDRRVPDGEVMSTRRFPEERELDRGCLSPFSVSSPAPSGMNLATALAAERR
        MDG DGGRR  RRSLKQRLGFKVM CCGATWGFRPATVSVRGG   DDDDDR  PD EVM+T +FPE RELD  CLSPF++SSPA SGMNLATALAAERR
Subjt:  MDGVDGGRR--RRSLKQRLGFKVMGCCGATWGFRPATVSVRGG---DDDDDRRVPDGEVMSTRRFPEERELDRGCLSPFSVSSPAPSGMNLATALAAERR

Query:  LRASPRGAEGDFVESNNHDYDSLAGMMGNGTPLRVSLMRLLEETDGNGGIGGNLGVAERKREEAGNDSVCCVCMGRKKGAAFIPCGHTFCRVCSRELWLN
        LR SPRG E  FVES+  ++DS AGMMG+GTPLRVSLMRLLEETDG GG GGNLGV E+K EEA NDSVCCVCMGRKKGAAFIPCGHTFCRVCSRELWLN
Subjt:  LRASPRGAEGDFVESNNHDYDSLAGMMGNGTPLRVSLMRLLEETDGNGGIGGNLGVAERKREEAGNDSVCCVCMGRKKGAAFIPCGHTFCRVCSRELWLN

Query:  RGSCPLCNRPILEILDIF
        RGSCPLCNRPILEILDIF
Subjt:  RGSCPLCNRPILEILDIF

A0A6J1EJA9 uncharacterized protein LOC111433822 isoform X2 | e value: 1.4e-95 | % identity: 83.49
Query:  MDGVDGGRR--RRSLKQRLGFKVMGCCGATWGFRPATVSVRGG---DDDDDRRVPDGEVMSTRRFPEERELDRGCLSPFSVSSPAPSGMNLATALAAERR
        MDG DGGRR  RRSLKQRLGFKVM CCGATWGFRPATVSVRGG   DDDDDR  PD EVM+T +FPE RELD  CLSPF++SSPA SGMNLATALAAERR
Subjt:  MDGVDGGRR--RRSLKQRLGFKVMGCCGATWGFRPATVSVRGG---DDDDDRRVPDGEVMSTRRFPEERELDRGCLSPFSVSSPAPSGMNLATALAAERR

Query:  LRASPRGAEGDFVESNNHDYDSLAGMMGNGTPLRVSLMRLLEETDGNGGIGGNLGVAERKREEAGNDSVCCVCMGRKKGAAFIPCGHTFCRVCSRELWLN
        LR SPRG E  FVES+  ++DS AGMMG+GTPLRVSLMRLLEETDG GG GGNLGV E+K EEA NDSVCCVCMGRKKGAAFIPCGHTFCRVCSRELWLN
Subjt:  LRASPRGAEGDFVESNNHDYDSLAGMMGNGTPLRVSLMRLLEETDGNGGIGGNLGVAERKREEAGNDSVCCVCMGRKKGAAFIPCGHTFCRVCSRELWLN

Query:  RGSCPLCNRPILEILDIF
        RGSCPLCNRPILEILDIF
Subjt:  RGSCPLCNRPILEILDIF

A0A6J1IJK3 uncharacterized protein LOC111478017 isoform X2 | e value: 4.0e-95 | % identity: 81.94
Query:  MDGVD-GGRRRRSLKQRLGFKVMGCCGATWGFRPATVSVRGG--DDDDDRRVPDGEVMSTRRFPEERELDRGCLSPFSVSSPAPSGMNLATALAAERRLR
        MDGVD GGRRR+SLKQRLGFKVMGCCGATWG RPAT SVRGG  DDDDD+RVP+ EV + RRF EER+LDR C+SP SV SPA SGMNLATALAAERRLR
Subjt:  MDGVD-GGRRRRSLKQRLGFKVMGCCGATWGFRPATVSVRGG--DDDDDRRVPDGEVMSTRRFPEERELDRGCLSPFSVSSPAPSGMNLATALAAERRLR

Query:  ASPRGAEGDFVESNNHDYDSLAGMMGNGTPLRVSLMRLLEETDGNGGIGGNLGVAERKREEAGNDSVCCVCMGRKKGAAFIPCGHTFCRVCSRELWLNRG
        ASPRGAEG FVES+N+D+DSL GMM   TPL+VSL+RLL+ETDG+ G  G LGVAE+KREEAGNDS+CCVCMGRKKGAAFIPCGHTFCRVCSRELWLNRG
Subjt:  ASPRGAEGDFVESNNHDYDSLAGMMGNGTPLRVSLMRLLEETDGNGGIGGNLGVAERKREEAGNDSVCCVCMGRKKGAAFIPCGHTFCRVCSRELWLNRG

Query:  SCPLCNRPILEILDIF
         CPLCNRPI+EILDIF
Subjt:  SCPLCNRPILEILDIF

A0A6J1ISH8 uncharacterized protein LOC111478017 isoform X1 | e value: 4.0e-95 | % identity: 81.94
Query:  MDGVD-GGRRRRSLKQRLGFKVMGCCGATWGFRPATVSVRGG--DDDDDRRVPDGEVMSTRRFPEERELDRGCLSPFSVSSPAPSGMNLATALAAERRLR
        MDGVD GGRRR+SLKQRLGFKVMGCCGATWG RPAT SVRGG  DDDDD+RVP+ EV + RRF EER+LDR C+SP SV SPA SGMNLATALAAERRLR
Subjt:  MDGVD-GGRRRRSLKQRLGFKVMGCCGATWGFRPATVSVRGG--DDDDDRRVPDGEVMSTRRFPEERELDRGCLSPFSVSSPAPSGMNLATALAAERRLR

Query:  ASPRGAEGDFVESNNHDYDSLAGMMGNGTPLRVSLMRLLEETDGNGGIGGNLGVAERKREEAGNDSVCCVCMGRKKGAAFIPCGHTFCRVCSRELWLNRG
        ASPRGAEG FVES+N+D+DSL GMM   TPL+VSL+RLL+ETDG+ G  G LGVAE+KREEAGNDS+CCVCMGRKKGAAFIPCGHTFCRVCSRELWLNRG
Subjt:  ASPRGAEGDFVESNNHDYDSLAGMMGNGTPLRVSLMRLLEETDGNGGIGGNLGVAERKREEAGNDSVCCVCMGRKKGAAFIPCGHTFCRVCSRELWLNRG

Query:  SCPLCNRPILEILDIF
         CPLCNRPI+EILDIF
Subjt:  SCPLCNRPILEILDIF

SwissProt top hits (e value, % identity, alignment)
Q8BFW4 Tripartite motif-containing protein 65 | e value: 1.1e-04 | % identity: 40
Query:  CCVCMGRKKGAAFIPCGHTFCRVCSRELWLN-RGSCPLCNRPILE
        C +C+GR +    +PCGH+FC  C ++ W +   SCP C +P  E
Subjt:  CCVCMGRKKGAAFIPCGHTFCRVCSRELWLN-RGSCPLCNRPILE

Q8IUD6 E3 ubiquitin-protein ligase RNF135 | e value: 5.3e-04 | % identity: 40
Query:  AGNDSVCCVCMGRKKGAAFIPCGHTFCRVCSRELW----LNRGSCPLCNR
        A +D  C +C G     A +PCGH+FCR C   LW      R +CP C +
Subjt:  AGNDSVCCVCMGRKKGAAFIPCGHTFCRVCSRELW----LNRGSCPLCNR

Q8LA32 Probable E3 ubiquitin-protein ligase LUL4 | e value: 6.7e-07 | % identity: 40.35
Query:  EEAGNDSVCCVCMGRKKGAAFIPCGH-TFCRVCSRELWLNRGSCPLCNRPILEILDI
        +E+G+ + C +CM   K  A +PC H   C  C++EL L    CP+C +PI E+L+I
Subjt:  EEAGNDSVCCVCMGRKKGAAFIPCGH-TFCRVCSRELWLNRGSCPLCNRPILEILDI

Arabidopsis top hits (e value, % identity, alignment)
AT1G62370.1 RING/U-box superfamily protein | e value: 1.6e-32 | % identity: 40.81
Query:  DGVDGGRRRRS----LKQRLGFKVMGCCGATWGFRPATVSVRGG------DDDDDRRVPDGEVMSTRRFPEERELDRGCLSPFSVSSPAPSGMNLATALA
        D ++GG R  S    LKQRL F  +GCCG     +P T+ +R        DDDDD      +V+         ELD  CL+         S  NLA ALA
Subjt:  DGVDGGRRRRS----LKQRLGFKVMGCCGATWGFRPATVSVRGG------DDDDDRRVPDGEVMSTRRFPEERELDRGCLSPFSVSSPAPSGMNLATALA

Query:  AERRLRASPRGAEGDFVESNNHDYDSLAGMMGNGTPL-RVSLMRLLEETDGNGGIGGNLGVAERKREEAGNDSVCCVCMGRKKGAAFIPCGHTFCRVCSR
         ER                       LA +    T + +V LMRLL E+DG           +      GND +CCVCMGR+KGAAFIPCGHT+CRVCSR
Subjt:  AERRLRASPRGAEGDFVESNNHDYDSLAGMMGNGTPL-RVSLMRLLEETDGNGGIGGNLGVAERKREEAGNDSVCCVCMGRKKGAAFIPCGHTFCRVCSR

Query:  ELWLNRGSCPLCNRPILEILDIF
        E+W+NRG+CPLCNR I ++LD++
Subjt:  ELWLNRGSCPLCNRPILEILDIF

AT3G25030.1 RING/U-box superfamily protein | e value: 2.1e-24 | % identity: 59.09
Query:  PLRVSLMRLLEETDGNGGIGGNLGVAERK-REEAGNDSVCCVCMGRKKGAAFIPCGHTFCRVCSRELWLNRGSCPLCNRPILEILDIF
        P R+SLM LLEE +G        G AE +       +  CCVCM R KGAAFIPCGHTFCR+CSRELW+ RG+CPLCN  ILE+LD+F
Subjt:  PLRVSLMRLLEETDGNGGIGGNLGVAERK-REEAGNDSVCCVCMGRKKGAAFIPCGHTFCRVCSRELWLNRGSCPLCNRPILEILDIF

AT3G25030.2 RING/U-box superfamily protein | e value: 2.1e-24 | % identity: 59.09
Query:  PLRVSLMRLLEETDGNGGIGGNLGVAERK-REEAGNDSVCCVCMGRKKGAAFIPCGHTFCRVCSRELWLNRGSCPLCNRPILEILDIF
        P R+SLM LLEE +G        G AE +       +  CCVCM R KGAAFIPCGHTFCR+CSRELW+ RG+CPLCN  ILE+LD+F
Subjt:  PLRVSLMRLLEETDGNGGIGGNLGVAERK-REEAGNDSVCCVCMGRKKGAAFIPCGHTFCRVCSRELWLNRGSCPLCNRPILEILDIF

AT4G03965.1 RING/U-box superfamily protein | e value: 2.8e-48 | % identity: 51.6
Query:  MDGVDGGRRR-RSLKQRLGFKVMGCCGATWGFRPATVSVRGGDDDDDRRVPDGEVMSTRRFPEERELDRGCLSPFSVSSPAPSGMNLATALAAERRLRAS
        MDG+D  RRR R+LK+RLGFK +GCCG TWG R   ++   G+DDD+              P E  L  G      V+ P   GMNLATALAAER  R  
Subjt:  MDGVDGGRRR-RSLKQRLGFKVMGCCGATWGFRPATVSVRGGDDDDDRRVPDGEVMSTRRFPEERELDRGCLSPFSVSSPAPSGMNLATALAAERRLRAS

Query:  PRGAEGDFVESNNHDYDSLAGMMGNGTPLRVSLMRLLEETDGNGGIGGN-----LGVAERKREEAGNDSVCCVCMGRKKGAAFIPCGHTFCRVCSRELWL
           A G                    TPL+VSLMRLLEET     +  N        A        NDSVCCVCMGRKKGAAFIPCGHTFCRVCSRE+WL
Subjt:  PRGAEGDFVESNNHDYDSLAGMMGNGTPLRVSLMRLLEETDGNGGIGGN-----LGVAERKREEAGNDSVCCVCMGRKKGAAFIPCGHTFCRVCSRELWL

Query:  NRGSCPLCNRPILEILDIF
        NRGSCPLCNRPI+EILDI+
Subjt:  NRGSCPLCNRPILEILDIF

AT4G22250.1 RING/U-box superfamily protein | e value: 1.4e-44 | % identity: 48.2
Query:  MDGVDGGRRRRSLKQRLGFKVMGCCGATWGFRPATVSVRGGDDDDDRRVPDGEVMSTRRFPEERELDRGCLSPFSVSSPAPSGMNLATALAAERRLRASP
        M G D  RR+R+LK+RL FK +GCCG TW  R    +        D  +            EE +++ G +        + +GMNLATAL AER  R   
Subjt:  MDGVDGGRRRRSLKQRLGFKVMGCCGATWGFRPATVSVRGGDDDDDRRVPDGEVMSTRRFPEERELDRGCLSPFSVSSPAPSGMNLATALAAERRLRASP

Query:  RGAEGDFVESNNHDYDSLAGMMGNGTPLRVSLMRLLEET-----DGNGG----IGGNLGVAERKREEAGNDSVCCVCMGRKKGAAFIPCGHTFCRVCSRE
          AE D                   TP RVSLMRLLEET     D +G     +  ++G         GNDSVCCVCMGRKKGAAFIPCGHTFCRVCSRE
Subjt:  RGAEGDFVESNNHDYDSLAGMMGNGTPLRVSLMRLLEET-----DGNGG----IGGNLGVAERKREEAGNDSVCCVCMGRKKGAAFIPCGHTFCRVCSRE

Query:  LWLNRGSCPLCNRPILEILDIF
        LWLNRGSCPLCNRPI+EILDIF
Subjt:  LWLNRGSCPLCNRPILEILDIF
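The alignments above use the usual BLAST-style middle line: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. A rough Python sketch for recomputing percent identity from a query line and its middle line follows. The function name `midline_stats` is hypothetical, and it assumes the `Query:` prefix and the matching leading indentation have already been removed and that wrapped blocks have been concatenated. The result may differ from the reported values if the page shows only part of an alignment.

```python
from typing import Tuple


def midline_stats(query: str, midline: str) -> Tuple[float, float]:
    """Return (% identity, % positives) over the aligned columns.

    `query` is the aligned query string (gaps as '-') and `midline` is the
    line printed underneath it, both without their line prefixes.
    """
    columns = len(query)
    if columns == 0:
        raise ValueError("empty alignment")
    # Trailing spaces of the middle line are often stripped; restore them.
    midline = midline.ljust(columns)[:columns]
    identical = sum(1 for ch in midline if ch.isalpha())
    conservative = sum(1 for ch in midline if ch == "+")
    identity = 100.0 * identical / columns
    positives = 100.0 * (identical + conservative) / columns
    return identity, positives


if __name__ == "__main__":
    # Toy example, not taken from the record above.
    print(midline_stats("ACDE", "A+ E"))
```

Percent identity here is taken over all alignment columns, gaps included, which is an assumption about how the reported values were computed.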


Sequences
CDS sequence
ATGGACGGTGTCGATGGGGGGAGGCGGCGGCGGAGTTTGAAGCAACGGTTAGGATTTAAGGTCATGGGCTGTTGTGGTGCTACTTGGGGTTTTCGCCCGGCGACGGTCAG
CGTTAGAGGCGGTGACGATGACGACGATAGGAGAGTACCGGACGGGGAGGTGATGAGCACACGCCGGTTTCCGGAGGAGAGGGAGTTGGATCGGGGTTGCTTAAGTCCGT
TTTCCGTTTCGTCGCCGGCGCCGTCCGGTATGAACTTAGCCACCGCACTGGCGGCGGAGCGTCGTTTGAGAGCGTCGCCACGTGGAGCGGAGGGGGATTTTGTGGAATCG
AATAACCACGATTACGATTCACTGGCTGGAATGATGGGAAACGGAACGCCGTTGAGGGTGTCGCTGATGCGGCTGCTGGAAGAAACAGACGGCAACGGTGGCATTGGTGG
GAATTTGGGGGTGGCGGAAAGGAAGAGAGAAGAGGCGGGAAATGATTCGGTGTGCTGCGTGTGCATGGGGAGGAAAAAGGGAGCGGCATTTATCCCATGTGGGCACACGT
TTTGTAGGGTGTGTTCGAGGGAGCTGTGGTTGAATCGAGGATCTTGCCCGCTTTGCAATCGTCCGATCCTCGAGATTCTTGATATATTTTGA
mRNA sequence
CTTGAACTCGATTTTGAACATTAATTAATTAACAATTTCATCAAAGAAAAGTGTATAAAAATCATTTAGCAGTCCAATGCGCTGCCGCATTTTACAAAGAAAATAAGAGA
GTCTCTGTAAAGCGTAAAGTACCAACCAGCACCTCCACTTCCTCATCCCTTTGTCTTTTTCTTCTTCTTCTTCTCTCTTAATTGCCATCTTAATTAAATTTCGAGTCGCC
TCTTTAAAATTAAGCTTGCATTTAAGACCCTCACGACTCTTTTATTTTTCTTCTTGGGGTCTGGTTTCACTTTCCGAGAATAATTAAAAACAGAGAGGTGGATTTCATCA
GAATCTTTGAAGAAACTCACGAGATCTCCATGGCTAAAGGAAGCAGCGAGGCGAGAAAGGGAGGCTAGGACTATTTGGGGGTTTTTGACCGATCAAATGGACGGTGTCGA
TGGGGGGAGGCGGCGGCGGAGTTTGAAGCAACGGTTAGGATTTAAGGTCATGGGCTGTTGTGGTGCTACTTGGGGTTTTCGCCCGGCGACGGTCAGCGTTAGAGGCGGTG
ACGATGACGACGATAGGAGAGTACCGGACGGGGAGGTGATGAGCACACGCCGGTTTCCGGAGGAGAGGGAGTTGGATCGGGGTTGCTTAAGTCCGTTTTCCGTTTCGTCG
CCGGCGCCGTCCGGTATGAACTTAGCCACCGCACTGGCGGCGGAGCGTCGTTTGAGAGCGTCGCCACGTGGAGCGGAGGGGGATTTTGTGGAATCGAATAACCACGATTA
CGATTCACTGGCTGGAATGATGGGAAACGGAACGCCGTTGAGGGTGTCGCTGATGCGGCTGCTGGAAGAAACAGACGGCAACGGTGGCATTGGTGGGAATTTGGGGGTGG
CGGAAAGGAAGAGAGAAGAGGCGGGAAATGATTCGGTGTGCTGCGTGTGCATGGGGAGGAAAAAGGGAGCGGCATTTATCCCATGTGGGCACACGTTTTGTAGGGTGTGT
TCGAGGGAGCTGTGGTTGAATCGAGGATCTTGCCCGCTTTGCAATCGTCCGATCCTCGAGATTCTTGATATATTTTGAAGGAGCATGATAAGTATGATGGGGATGAGAAC
AGGGGATTGTTGGTGACGGGAAGGGGAAGTGCTTTTGCCTCAAGATTCGAGACAGACTGTTGTCATTGGACGGTACTGAACGATGCCGTATCTGGGCGCCATGTGGATGC
GGGTGATAAAACTTTTTCACGTCCCAGAAAAAAAGTTAAATTAATAGTTTCGATGTAATTGATAAATTTAAACTTTTATTTTATATACATGGTTGAGTCTTTTCTTTCTT
TTTTCTTTTTTAAAGATTTGTATATTGGATTTTTATTTTATTTTATTTTTGTGTGCTCGT
Protein sequence
MDGVDGGRRRRSLKQRLGFKVMGCCGATWGFRPATVSVRGGDDDDDRRVPDGEVMSTRRFPEERELDRGCLSPFSVSSPAPSGMNLATALAAERRLRASPRGAEGDFVES
NNHDYDSLAGMMGNGTPLRVSLMRLLEETDGNGGIGGNLGVAERKREEAGNDSVCCVCMGRKKGAAFIPCGHTFCRVCSRELWLNRGSCPLCNRPILEILDIF
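
The CDS and protein sequences listed above can be cross-checked by translating the CDS. A minimal Python sketch using the standard genetic code is shown here. The helper name `translate` and the `to_stop` parameter are illustrative choices, not a CuGenDB API, and the sketch assumes the CDS is in frame from its first base.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate an in-frame DNA coding sequence into a protein string.

    Whitespace and line breaks are ignored. Codons containing characters
    other than A, C, G, T are reported as 'X'.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and to_stop:
            break  # stop codon reached
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 39 bases and last 12 bases of the CDS listed above.
    print(translate("ATGGACGGTGTCGATGGGGGGAGGCGGCGGCGGAGTTTG"))
    print(translate("GATATATTTTGA"))
```

Passing the full CDS with its line breaks joined, and comparing the result with the protein sequence, gives a quick consistency check for the record.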