; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi05G013750 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi05G013750
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description1,4-dihydroxy-2-naphthoyl-CoA thioesterase 1
Genome locationchr05:21639263..21655645
RNA-Seq ExpressionLsi05G013750
SyntenyLsi05G013750
Gene Ontology termsGO:0042372 - phylloquinone biosynthetic process (biological process)
GO:0005777 - peroxisome (cellular component)
GO:0061522 - 1,4-dihydroxy-2-naphthoyl-CoA thioesterase activity (molecular function)
InterPro domainsIPR003736 - Phenylacetic acid degradation-related domain
IPR006683 - Thioesterase domain
IPR029069 - HotDog domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7018535.1 1,4-dihydroxy-2-naphthoyl-CoA thioesterase 1 [Cucurbita argyrosperma subsp. argyrosperma]3.4e-7365.02Show/hide
Query:  MDHQNPNNDPAPLPSPSSTTAILDAPLNAVGFEIETLSPHRVTGRIVVSPKCCQAFKVLHGGVSAMIAEALASAGAQMASGFKRVAGFHLSIDHLQSAKI
        MDHQNPN+ P P P PSSTTAILD+PLNAVGFEI+ LSPHRVTGRIVVSPKCCQ FKVLHGGVSAMIAEALAS GAQMA+GFKRVAGFHLSIDHLQSA+I
Subjt:  MDHQNPNNDPAPLPSPSSTTAILDAPLNAVGFEIETLSPHRVTGRIVVSPKCCQAFKVLHGGVSAMIAEALASAGAQMASGFKRVAGFHLSIDHLQSAKI

Query:  GDLVLAEATPLSIGNAIQF--------------------SNYIT--TNMPSTNHSPPSKSPVLDATLQAFGFEVDHVSSQKVTGRLLV------------
        GDLVLAEATPLS+G AIQ                     S+ +T   NMP   H+ P     L  TL+     V   SS++   RL +            
Subjt:  GDLVLAEATPLSIGNAIQF--------------------SNYIT--TNMPSTNHSPPSKSPVLDATLQAFGFEVDHVSSQKVTGRLLV------------

Query:  -SPICCQPFKVLHGGVSALIAESLASMGAHTASGYQRVAGIHLSINHLKSAALGDFVHAEAAP
         SP       VLHGGVSALIAESLAS+GAHTASGYQRVAGIHLSINHLKSAALGD V AEAAP
Subjt:  -SPICCQPFKVLHGGVSALIAESLASMGAHTASGYQRVAGIHLSINHLKSAALGDFVHAEAAP

KNA13827.1 hypothetical protein SOVF_113040 [Spinacia oleracea]8.1e-6760.83Show/hide
Query:  MDHQNPNNDPAPLPSPSST-------TAILDAPLNAVGFEIETLSPHRVTGRIVVSPKCCQAFKVLHGGVSAMIAEALASAGAQMASGFKRVAGFHLSID
        M++Q P   P+  PS +ST       TA +DAPLN +GFE + ++P  V+G ++V+PKCCQ FKVLHGGVSAMIAEALAS GA +A G KRVAG HLSID
Subjt:  MDHQNPNNDPAPLPSPSST-------TAILDAPLNAVGFEIETLSPHRVTGRIVVSPKCCQAFKVLHGGVSAMIAEALASAGAQMASGFKRVAGFHLSID

Query:  HLQSAKIGDLVLAEATPLSIGNAIQ-----FSNYITTNMPSTNHSPPSKSPVLDATLQAFGFEVDHVSSQKVTGRLLVSPICCQPFKVLHGGVSALIAES
        H++SA++GDL+LA+ATPLS G  IQ     F     T+M   +    S++  LD  L A GFE+D VS  KVTGRLLV+  CCQPFKVLHGGVSALIAES
Subjt:  HLQSAKIGDLVLAEATPLSIGNAIQ-----FSNYITTNMPSTNHSPPSKSPVLDATLQAFGFEVDHVSSQKVTGRLLVSPICCQPFKVLHGGVSALIAES

Query:  LASMGAHTASGYQRVAGIHLSINHLKSAALGDFVHAEAAP
        LASMGAH ASG +RVAGIHLSINH+K A LGD V A+A P
Subjt:  LASMGAHTASGYQRVAGIHLSINHLKSAALGDFVHAEAAP

OEL31688.1 1,4-dihydroxy-2-naphthoyl-CoA thioesterase 1 [Dichanthelium oligosanthes]4.4e-6554.31Show/hide
Query:  SPPSKSPVLDATLQAFGFEVDHVSSQKVTGRLLVSPICCQPFKVLHGGVSALIAESLASMGAHTASGYQRVAGIHLSINHLKSAALGDFVHAEAAPHYPF
        S  +K+  LD  LQA GFEV+ +S  ++TGRLLV+P+CCQPFKVLHGGVSAL+AESLASMGAH ASGY+RVAG+ LSINH +SA+LGD V A A      
Subjt:  SPPSKSPVLDATLQAFGFEVDHVSSQKVTGRLLVSPICCQPFKVLHGGVSALIAESLASMGAHTASGYQRVAGIHLSINHLKSAALGDFVHAEAAPHYPF

Query:  IPPQLSFTQYQPLILHSGHPIPIRNQMSEKSLPPPPSQPADPDAPLKAVGFKLEHESAQRVSGRILVSPICCQPFNVLHGGVSALIAEALASKGAYVASG
        +P  L  +   P    +G  +    Q + +         A+ DAPL A+GF++E  S  R++GR+LV+P CCQPF VLHGGVSAL+AEALAS GA++ASG
Subjt:  IPPQLSFTQYQPLILHSGHPIPIRNQMSEKSLPPPPSQPADPDAPLKAVGFKLEHESAQRVSGRILVSPICCQPFNVLHGGVSALIAEALASKGAYVASG

Query:  YGKIVGIQLSINHLKSAEMGAVVLAEATPVTVGRTIQVWNVELWK-----DLKERKIVVSTVTLLCN
        + ++VG+QLSI+H +SA +G  VLA A PV VGR+ QVW V+LWK       K  +I  S VTLL N
Subjt:  YGKIVGIQLSINHLKSAEMGAVVLAEATPVTVGRTIQVWNVELWK-----DLKERKIVVSTVTLLCN

XP_022955679.1 1,4-dihydroxy-2-naphthoyl-CoA thioesterase 1-like [Cucurbita moschata]6.4e-5685.61Show/hide
Query:  PPPPSQPADPDAPLKAVGFKLEHESAQRVSGRILVSPICCQPFNVLHGGVSALIAEALASKGAYVASGYGKIVGIQLSINHLKSAEMGAVVLAEATPVTV
        PPPP +P   DAPL+AVGF LEHESAQRVSGRILVSPICCQPF VLHGGVSALIAEALASKGAYVASGY K+VGI LSINHLKSAEMGAVVLAEATPVTV
Subjt:  PPPPSQPADPDAPLKAVGFKLEHESAQRVSGRILVSPICCQPFNVLHGGVSALIAEALASKGAYVASGYGKIVGIQLSINHLKSAEMGAVVLAEATPVTV

Query:  GRTIQVWNVELWKDLKERKIVVST--VTLLCNSSVQKDP
        GRTIQVW+VELWKDLKE K++VST  VTLLC SSV  DP
Subjt:  GRTIQVWNVELWKDLKERKIVVST--VTLLCNSSVQKDP

XP_022979953.1 1,4-dihydroxy-2-naphthoyl-CoA thioesterase 1-like [Cucurbita maxima]4.9e-5682.31Show/hide
Query:  MSEKSLPPPPSQPADP--DAPLKAVGFKLEHESAQRVSGRILVSPICCQPFNVLHGGVSALIAEALASKGAYVASGYGKIVGIQLSINHLKSAEMGAVVL
        MS  + PPPP  P  P  DAPL+AVGF +EHESAQRVSGRILVSPICCQPF VLHGGVSALIAEALASKGAYVASGY K+VGI LSINHLKSAEMGAVVL
Subjt:  MSEKSLPPPPSQPADP--DAPLKAVGFKLEHESAQRVSGRILVSPICCQPFNVLHGGVSALIAEALASKGAYVASGYGKIVGIQLSINHLKSAEMGAVVL

Query:  AEATPVTVGRTIQVWNVELWKDLKERKIVVST--VTLLCNSSVQKDP
        AEATPVTVGRTIQVW+VELWKDLKE K++VST  VTLLC SSV  DP
Subjt:  AEATPVTVGRTIQVWNVELWKDLKERKIVVST--VTLLCNSSVQKDP

TrEMBL top hitse value%identityAlignment
A0A0K9R4T6 Uncharacterized protein3.9e-6760.83Show/hide
Query:  MDHQNPNNDPAPLPSPSST-------TAILDAPLNAVGFEIETLSPHRVTGRIVVSPKCCQAFKVLHGGVSAMIAEALASAGAQMASGFKRVAGFHLSID
        M++Q P   P+  PS +ST       TA +DAPLN +GFE + ++P  V+G ++V+PKCCQ FKVLHGGVSAMIAEALAS GA +A G KRVAG HLSID
Subjt:  MDHQNPNNDPAPLPSPSST-------TAILDAPLNAVGFEIETLSPHRVTGRIVVSPKCCQAFKVLHGGVSAMIAEALASAGAQMASGFKRVAGFHLSID

Query:  HLQSAKIGDLVLAEATPLSIGNAIQ-----FSNYITTNMPSTNHSPPSKSPVLDATLQAFGFEVDHVSSQKVTGRLLVSPICCQPFKVLHGGVSALIAES
        H++SA++GDL+LA+ATPLS G  IQ     F     T+M   +    S++  LD  L A GFE+D VS  KVTGRLLV+  CCQPFKVLHGGVSALIAES
Subjt:  HLQSAKIGDLVLAEATPLSIGNAIQ-----FSNYITTNMPSTNHSPPSKSPVLDATLQAFGFEVDHVSSQKVTGRLLVSPICCQPFKVLHGGVSALIAES

Query:  LASMGAHTASGYQRVAGIHLSINHLKSAALGDFVHAEAAP
        LASMGAH ASG +RVAGIHLSINH+K A LGD V A+A P
Subjt:  LASMGAHTASGYQRVAGIHLSINHLKSAALGDFVHAEAAP

A0A1E5W2Y5 1,4-dihydroxy-2-naphthoyl-CoA thioesterase 12.1e-6554.31Show/hide
Query:  SPPSKSPVLDATLQAFGFEVDHVSSQKVTGRLLVSPICCQPFKVLHGGVSALIAESLASMGAHTASGYQRVAGIHLSINHLKSAALGDFVHAEAAPHYPF
        S  +K+  LD  LQA GFEV+ +S  ++TGRLLV+P+CCQPFKVLHGGVSAL+AESLASMGAH ASGY+RVAG+ LSINH +SA+LGD V A A      
Subjt:  SPPSKSPVLDATLQAFGFEVDHVSSQKVTGRLLVSPICCQPFKVLHGGVSALIAESLASMGAHTASGYQRVAGIHLSINHLKSAALGDFVHAEAAPHYPF

Query:  IPPQLSFTQYQPLILHSGHPIPIRNQMSEKSLPPPPSQPADPDAPLKAVGFKLEHESAQRVSGRILVSPICCQPFNVLHGGVSALIAEALASKGAYVASG
        +P  L  +   P    +G  +    Q + +         A+ DAPL A+GF++E  S  R++GR+LV+P CCQPF VLHGGVSAL+AEALAS GA++ASG
Subjt:  IPPQLSFTQYQPLILHSGHPIPIRNQMSEKSLPPPPSQPADPDAPLKAVGFKLEHESAQRVSGRILVSPICCQPFNVLHGGVSALIAEALASKGAYVASG

Query:  YGKIVGIQLSINHLKSAEMGAVVLAEATPVTVGRTIQVWNVELWK-----DLKERKIVVSTVTLLCN
        + ++VG+QLSI+H +SA +G  VLA A PV VGR+ QVW V+LWK       K  +I  S VTLL N
Subjt:  YGKIVGIQLSINHLKSAEMGAVVLAEATPVTVGRTIQVWNVELWK-----DLKERKIVVSTVTLLCN

A0A1E5W707 1,4-dihydroxy-2-naphthoyl-CoA thioesterase 12.2e-4946.72Show/hide
Query:  MPSTNHSPPSKSP------VLDATLQAFGFEVDHVSSQKVTGRLLVSPICCQPFKVLHGGVSALIAESLASMGAHTASGYQRVAGIHLSINHLKSAALGD
        M S    PP   P      V D +L A GFE   ++ ++V GRL V+  CCQPF VL+GGVSAL+AESLAS+GA+ ASGY+RVAG+ LS+NHL+ A LG+
Subjt:  MPSTNHSPPSKSP------VLDATLQAFGFEVDHVSSQKVTGRLLVSPICCQPFKVLHGGVSALIAESLASMGAHTASGYQRVAGIHLSINHLKSAALGD

Query:  FVHAEAAPHYPFIPPQLSFTQYQPLILHSGHPIPIRNQMSEKSLPPPP------SQPADPDAPLKAVGFKLEHESAQRVSGRILVSPICCQPFNVLHGGV
         V A A+      P QL  +      LH+  P       +  S PP P      ++P   D  L+A+GF+    +A  V+GR+ V+  CCQPF+ L+GGV
Subjt:  FVHAEAAPHYPFIPPQLSFTQYQPLILHSGHPIPIRNQMSEKSLPPPP------SQPADPDAPLKAVGFKLEHESAQRVSGRILVSPICCQPFNVLHGGV

Query:  SALIAEALASKGAYVASGYGKIVGIQLSINHLKSAEMGAVVLAEATPVTVGRTIQVWNV
        SAL+AE  AS G YVA+GY ++ G+QLSINH+  A +G +V A ATPV +GR IQV N+
Subjt:  SALIAEALASKGAYVASGYGKIVGIQLSINHLKSAEMGAVVLAEATPVTVGRTIQVWNV

A0A6J1GWZ2 1,4-dihydroxy-2-naphthoyl-CoA thioesterase 1-like3.1e-5685.61Show/hide
Query:  PPPPSQPADPDAPLKAVGFKLEHESAQRVSGRILVSPICCQPFNVLHGGVSALIAEALASKGAYVASGYGKIVGIQLSINHLKSAEMGAVVLAEATPVTV
        PPPP +P   DAPL+AVGF LEHESAQRVSGRILVSPICCQPF VLHGGVSALIAEALASKGAYVASGY K+VGI LSINHLKSAEMGAVVLAEATPVTV
Subjt:  PPPPSQPADPDAPLKAVGFKLEHESAQRVSGRILVSPICCQPFNVLHGGVSALIAEALASKGAYVASGYGKIVGIQLSINHLKSAEMGAVVLAEATPVTV

Query:  GRTIQVWNVELWKDLKERKIVVST--VTLLCNSSVQKDP
        GRTIQVW+VELWKDLKE K++VST  VTLLC SSV  DP
Subjt:  GRTIQVWNVELWKDLKERKIVVST--VTLLCNSSVQKDP

A0A6J1IXT0 1,4-dihydroxy-2-naphthoyl-CoA thioesterase 1-like2.4e-5682.31Show/hide
Query:  MSEKSLPPPPSQPADP--DAPLKAVGFKLEHESAQRVSGRILVSPICCQPFNVLHGGVSALIAEALASKGAYVASGYGKIVGIQLSINHLKSAEMGAVVL
        MS  + PPPP  P  P  DAPL+AVGF +EHESAQRVSGRILVSPICCQPF VLHGGVSALIAEALASKGAYVASGY K+VGI LSINHLKSAEMGAVVL
Subjt:  MSEKSLPPPPSQPADP--DAPLKAVGFKLEHESAQRVSGRILVSPICCQPFNVLHGGVSALIAEALASKGAYVASGYGKIVGIQLSINHLKSAEMGAVVL

Query:  AEATPVTVGRTIQVWNVELWKDLKERKIVVST--VTLLCNSSVQKDP
        AEATPVTVGRTIQVW+VELWKDLKE K++VST  VTLLC SSV  DP
Subjt:  AEATPVTVGRTIQVWNVELWKDLKERKIVVST--VTLLCNSSVQKDP

SwissProt top hitse value%identityAlignment
P14205 Putative esterase ComA25.5e-1037.93Show/hide
Query:  LKAVGFKLEHESAQRVSGRILVSPICCQPFNVLHGGVSALIAEALASKGA--YVASGYGKIVGIQLSINHLKSAEMGAVVLAEATPVTVGRTIQVWNVEL
        L+A+G ++   +A+R    + V     QPF  LHGG S  +AE  AS GA   +       VG++++ NHLKS + G  V A A PV +GRT  V+++ +
Subjt:  LKAVGFKLEHESAQRVSGRILVSPICCQPFNVLHGGVSALIAEALASKGA--YVASGYGKIVGIQLSINHLKSAEMGAVVLAEATPVTVGRTIQVWNVEL

Query:  WKDLKERKIVVSTVTL
        + D +ER I +S  TL
Subjt:  WKDLKERKIVVSTVTL

P45083 Putative esterase HI_11613.6e-0940.91Show/hide
Query:  QPFNVLHGGVSALIAEALASKGAYVASGYGK-IVGIQLSINHLKSAEMGAVVLAEATPVTVGRTIQVWNVELWKDLKERKIVVSTVTL
        QPF VLHGGVS  +AE + S    +    GK +VG+ ++ NHL+    G V  A ATP+ +GR IQVW +++ +  + +   VS +TL
Subjt:  QPFNVLHGGVSALIAEALASKGAYVASGYGK-IVGIQLSINHLKSAEMGAVVLAEATPVTVGRTIQVWNVELWKDLKERKIVVSTVTL

P77781 1,4-dihydroxy-2-naphthoyl-CoA hydrolase4.0e-0834.21Show/hide
Query:  VGF---KLEHESAQRVSGRILVSPICCQPFNVLHGGVSALIAEALASKGAYVAS-GYGKIVGIQLSINHLKSAEMGAVVLAEATPVTVGRTIQVWNVELW
        VGF   + EH     +   + V     QPF +LHGG S ++AE++ S   Y+ + G  K+VG++++ NH++SA  G  V     P+ +G   QVW +E++
Subjt:  VGF---KLEHESAQRVSGRILVSPICCQPFNVLHGGVSALIAEALASKGAYVAS-GYGKIVGIQLSINHLKSAEMGAVVLAEATPVTVGRTIQVWNVELW

Query:  KDLKERKIVVSTVT
         D K R    S +T
Subjt:  KDLKERKIVVSTVT

Q9FI76 1,4-dihydroxy-2-naphthoyl-CoA thioesterase 27.4e-3157.6Show/hide
Query:  DAPLKAVGFKLEHESAQRVSGRILVSPICCQPFNVLHGGVSALIAEALASKGAYVASGYGKIVGIQLSINHLKSAEMGAVVLAEATPVTVGRTIQVWNVE
        D PLK +GF  +  SA RVSG + ++  CCQPF VLHGGVSALIAEALAS GA +ASG+ ++ GI LSI+HL+ A +G +V AE+ PV+VG+ IQVW V 
Subjt:  DAPLKAVGFKLEHESAQRVSGRILVSPICCQPFNVLHGGVSALIAEALASKGAYVASGYGKIVGIQLSINHLKSAEMGAVVLAEATPVTVGRTIQVWNVE

Query:  LWKDLK----ERKIVVST--VTLLC
        LWK  K    + KI+VST  VTL C
Subjt:  LWKDLK----ERKIVVST--VTLLC

Q9SX65 1,4-dihydroxy-2-naphthoyl-CoA thioesterase 14.5e-3660.8Show/hide
Query:  DAPLKAVGFKLEHESAQRVSGRILVSPICCQPFNVLHGGVSALIAEALASKGAYVASGYGKIVGIQLSINHLKSAEMGAVVLAEATPVTVGRTIQVWNVE
        D PL  +GF+ +  S  R++GR+ VSP+CCQPF VLHGGVSALIAE+LAS GA++ASG+ ++ GIQLSINHLKSA++G +V AEATPV+ G+TIQVW V+
Subjt:  DAPLKAVGFKLEHESAQRVSGRILVSPICCQPFNVLHGGVSALIAEALASKGAYVASGYGKIVGIQLSINHLKSAEMGAVVLAEATPVTVGRTIQVWNVE

Query:  LWKDL---KERKIVVST--VTLLCN
        LWK     K  KI++S+  VTL+CN
Subjt:  LWKDL---KERKIVVST--VTLLCN

Arabidopsis top hitse value%identityAlignment
AT1G48320.1 Thioesterase superfamily protein3.2e-3760.8Show/hide
Query:  DAPLKAVGFKLEHESAQRVSGRILVSPICCQPFNVLHGGVSALIAEALASKGAYVASGYGKIVGIQLSINHLKSAEMGAVVLAEATPVTVGRTIQVWNVE
        D PL  +GF+ +  S  R++GR+ VSP+CCQPF VLHGGVSALIAE+LAS GA++ASG+ ++ GIQLSINHLKSA++G +V AEATPV+ G+TIQVW V+
Subjt:  DAPLKAVGFKLEHESAQRVSGRILVSPICCQPFNVLHGGVSALIAEALASKGAYVASGYGKIVGIQLSINHLKSAEMGAVVLAEATPVTVGRTIQVWNVE

Query:  LWKDL---KERKIVVST--VTLLCN
        LWK     K  KI++S+  VTL+CN
Subjt:  LWKDL---KERKIVVST--VTLLCN

AT5G48950.1 Thioesterase superfamily protein5.3e-3257.6Show/hide
Query:  DAPLKAVGFKLEHESAQRVSGRILVSPICCQPFNVLHGGVSALIAEALASKGAYVASGYGKIVGIQLSINHLKSAEMGAVVLAEATPVTVGRTIQVWNVE
        D PLK +GF  +  SA RVSG + ++  CCQPF VLHGGVSALIAEALAS GA +ASG+ ++ GI LSI+HL+ A +G +V AE+ PV+VG+ IQVW V 
Subjt:  DAPLKAVGFKLEHESAQRVSGRILVSPICCQPFNVLHGGVSALIAEALASKGAYVASGYGKIVGIQLSINHLKSAEMGAVVLAEATPVTVGRTIQVWNVE

Query:  LWKDLK----ERKIVVST--VTLLC
        LWK  K    + KI+VST  VTL C
Subjt:  LWKDLK----ERKIVVST--VTLLC

AT5G48950.2 Thioesterase superfamily protein7.9e-2859.22Show/hide
Query:  PSSTTAILDAPLNAVGFEIETLSPHRVTGRIVVSPKCCQAFKVLHGGVSAMIAEALASAGAQMASGFKRVAGFHLSIDHLQSAKIGDLVLAEATPLSIGN
        P S   I+D PL  +GF  + LS  RV+G + ++ KCCQ FKVLHGGVSA+IAEALAS GA +ASGFKRVAG HLSI HL+ A +G++V AE+ P+S+G 
Subjt:  PSSTTAILDAPLNAVGFEIETLSPHRVTGRIVVSPKCCQAFKVLHGGVSAMIAEALASAGAQMASGFKRVAGFHLSIDHLQSAKIGDLVLAEATPLSIGN

Query:  AIQ
         IQ
Subjt:  AIQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATCATCAAAATCCGAATAATGATCCGGCGCCGCTTCCGTCGCCGTCATCCACGACGGCGATTCTGGACGCTCCGCTTAACGCCGTCGGGTTTGAGATCGAAACCTT
ATCTCCTCACAGAGTAACCGGCCGCATTGTGGTTTCCCCAAAGTGCTGCCAGGCGTTTAAGGTGCTGCACGGCGGCGTATCGGCGATGATAGCGGAGGCTCTGGCGAGCG
CCGGAGCACAGATGGCGTCGGGATTCAAAAGAGTGGCTGGATTTCATCTCAGTATTGATCATTTGCAGAGCGCCAAAATCGGTGACCTTGTTCTTGCCGAAGCTACCCCT
CTTTCCATCGGCAATGCTATTCAGTTCTCCAACTACATCACAACCAACATGCCGTCTACCAATCATTCTCCGCCGTCGAAGTCGCCAGTTCTCGACGCTACGCTTCAGGC
ATTTGGATTTGAGGTCGACCATGTCTCTTCTCAAAAAGTTACCGGTCGTCTCCTCGTTTCTCCAATCTGCTGCCAGCCGTTCAAAGTGTTGCACGGCGGAGTATCGGCGT
TGATCGCAGAGTCTTTGGCGAGTATGGGCGCGCATACGGCCTCCGGCTACCAGAGAGTCGCTGGAATTCATCTCAGTATCAACCACTTGAAAAGCGCCGCCCTCGGCGAC
TTTGTTCATGCCGAAGCCGCTCCGCATTATCCATTTATTCCCCCTCAACTCTCTTTCACTCAATACCAACCTCTCATTCTCCACTCCGGCCATCCGATACCGATTCGCAA
CCAAATGTCTGAAAAGAGTCTCCCTCCTCCGCCGTCTCAGCCCGCGGATCCGGATGCTCCGCTTAAGGCAGTCGGATTCAAGCTGGAACACGAATCTGCTCAGAGAGTGA
GCGGCCGCATCCTCGTTTCCCCAATCTGCTGCCAGCCGTTTAATGTGTTGCACGGAGGAGTATCGGCGTTGATTGCGGAGGCTTTGGCGAGTAAGGGGGCTTATGTAGCG
TCGGGTTACGGGAAAATTGTCGGAATCCAACTCAGTATCAATCACTTGAAGAGTGCTGAGATGGGCGCCGTCGTTCTCGCCGAAGCTACTCCGGTCACCGTCGGCAGAAC
CATTCAGGTATGGAATGTGGAATTGTGGAAGGATTTGAAAGAAAGGAAAATAGTTGTGTCTACGGTGACTCTTCTATGCAATTCCTCTGTCCAAAAAGACCCAAATGCCT
AA
mRNA sequenceShow/hide mRNA sequence
TTTGAATACAATTCTGACGCCTAACTTCAGACCGATTTCTTTTCAAATTCCGATCGGATCCCTTCATCTCTAACCAAAAAAAAAAGAAAAGAAACAAAATCAAATTCGCA
TTCATTCCGTTCCATATGGATCATCAAAATCCGAATAATGATCCGGCGCCGCTTCCGTCGCCGTCATCCACGACGGCGATTCTGGACGCTCCGCTTAACGCCGTCGGGTT
TGAGATCGAAACCTTATCTCCTCACAGAGTAACCGGCCGCATTGTGGTTTCCCCAAAGTGCTGCCAGGCGTTTAAGGTGCTGCACGGCGGCGTATCGGCGATGATAGCGG
AGGCTCTGGCGAGCGCCGGAGCACAGATGGCGTCGGGATTCAAAAGAGTGGCTGGATTTCATCTCAGTATTGATCATTTGCAGAGCGCCAAAATCGGTGACCTTGTTCTT
GCCGAAGCTACCCCTCTTTCCATCGGCAATGCTATTCAGTTCTCCAACTACATCACAACCAACATGCCGTCTACCAATCATTCTCCGCCGTCGAAGTCGCCAGTTCTCGA
CGCTACGCTTCAGGCATTTGGATTTGAGGTCGACCATGTCTCTTCTCAAAAAGTTACCGGTCGTCTCCTCGTTTCTCCAATCTGCTGCCAGCCGTTCAAAGTGTTGCACG
GCGGAGTATCGGCGTTGATCGCAGAGTCTTTGGCGAGTATGGGCGCGCATACGGCCTCCGGCTACCAGAGAGTCGCTGGAATTCATCTCAGTATCAACCACTTGAAAAGC
GCCGCCCTCGGCGACTTTGTTCATGCCGAAGCCGCTCCGCATTATCCATTTATTCCCCCTCAACTCTCTTTCACTCAATACCAACCTCTCATTCTCCACTCCGGCCATCC
GATACCGATTCGCAACCAAATGTCTGAAAAGAGTCTCCCTCCTCCGCCGTCTCAGCCCGCGGATCCGGATGCTCCGCTTAAGGCAGTCGGATTCAAGCTGGAACACGAAT
CTGCTCAGAGAGTGAGCGGCCGCATCCTCGTTTCCCCAATCTGCTGCCAGCCGTTTAATGTGTTGCACGGAGGAGTATCGGCGTTGATTGCGGAGGCTTTGGCGAGTAAG
GGGGCTTATGTAGCGTCGGGTTACGGGAAAATTGTCGGAATCCAACTCAGTATCAATCACTTGAAGAGTGCTGAGATGGGCGCCGTCGTTCTCGCCGAAGCTACTCCGGT
CACCGTCGGCAGAACCATTCAGGTATGGAATGTGGAATTGTGGAAGGATTTGAAAGAAAGGAAAATAGTTGTGTCTACGGTGACTCTTCTATGCAATTCCTCTGTCCAAA
AAGACCCAAATGCCTAAAATGCCCCTATTGCGCTCAAAGAAGCTTGCAAAGTTGTGATAAATTGTGTGCCATAAACCATCTACTATATCCCCATTTCATTTCATAGGGGA
GGTTATTTCTGTAATTCAACTTTGGGAGTAACCCCAAACTAAGCCTAATTAACAGCCTAAATAAGAATTTAACCTATGGGTTATGGACAAACTCACTTTTTAGGTGGATG
AGATGAACTCCTCATTTTTATTTCATAATATATATTAATAATATTAGGATTAAAATAGCATTTTGATCCC
Protein sequenceShow/hide protein sequence
MDHQNPNNDPAPLPSPSSTTAILDAPLNAVGFEIETLSPHRVTGRIVVSPKCCQAFKVLHGGVSAMIAEALASAGAQMASGFKRVAGFHLSIDHLQSAKIGDLVLAEATP
LSIGNAIQFSNYITTNMPSTNHSPPSKSPVLDATLQAFGFEVDHVSSQKVTGRLLVSPICCQPFKVLHGGVSALIAESLASMGAHTASGYQRVAGIHLSINHLKSAALGD
FVHAEAAPHYPFIPPQLSFTQYQPLILHSGHPIPIRNQMSEKSLPPPPSQPADPDAPLKAVGFKLEHESAQRVSGRILVSPICCQPFNVLHGGVSALIAEALASKGAYVA
SGYGKIVGIQLSINHLKSAEMGAVVLAEATPVTVGRTIQVWNVELWKDLKERKIVVSTVTLLCNSSVQKDPNA