; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10004599 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10004599
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Descriptionprotein ENL-like
Genome locationChr08:18753653..18754508
RNA-Seq ExpressionHG10004599
SyntenyHG10004599
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6598807.1 hypothetical protein SDJN03_08585, partial [Cucurbita argyrosperma subsp. sororia]1.8e-6370.75Show/hide
Query:  IRDHGGSLPSSLFPQ-RSDRRFSSTSAAAVGSIGVPDDNSDDESVSSTGGDREEVESKLNGGFVSLGSLEESLPIKRGLSSHFSGKSKSFVNLAEAKSVK
        I+   G   S   P+   D R  S+S     SIGVPDD+SD+ESVSSTGGDR EV+SKLN G  S+ SLE SLPIKRGLSSHFSGKSKSF NLAEAKSVK
Subjt:  IRDHGGSLPSSLFPQ-RSDRRFSSTSAAAVGSIGVPDDNSDDESVSSTGGDREEVESKLNGGFVSLGSLEESLPIKRGLSSHFSGKSKSFVNLAEAKSVK

Query:  DIEKPENSFNKRRRVLIASKLAKKSSFYTWPNPKSMPLLALREEDDDDDDDDEEEESHAPYSS----DEDEEPKERRVTDFHGRRFMSFKSRSLSMADLQ
        DIEKPENSFNKRRR LIASKLA+K+SFYTWPNPKSMPLLALREE+  DD D +E+ES APYSS    DE+EEPKE+R +DF  RR MSFKSRS S+ADLQ
Subjt:  DIEKPENSFNKRRRVLIASKLAKKSSFYTWPNPKSMPLLALREEDDDDDDDDEEEESHAPYSS----DEDEEPKERRVTDFHGRRFMSFKSRSLSMADLQ

Query:  QQHHGIDQQQEQ
        Q+HH ID Q+EQ
Subjt:  QQHHGIDQQQEQ

KAG7029745.1 hypothetical protein SDJN02_08087 [Cucurbita argyrosperma subsp. argyrosperma]6.3e-6470.75Show/hide
Query:  IRDHGGSLPSSLFPQ-RSDRRFSSTSAAAVGSIGVPDDNSDDESVSSTGGDREEVESKLNGGFVSLGSLEESLPIKRGLSSHFSGKSKSFVNLAEAKSVK
        I+   G   S   P+   D R  S+S     SIGVPDD++D+ESVSSTGGDREEV+SKLN G  S+ SLE SLPIKRGLSSHFSGKSKSF NLAEAKSVK
Subjt:  IRDHGGSLPSSLFPQ-RSDRRFSSTSAAAVGSIGVPDDNSDDESVSSTGGDREEVESKLNGGFVSLGSLEESLPIKRGLSSHFSGKSKSFVNLAEAKSVK

Query:  DIEKPENSFNKRRRVLIASKLAKKSSFYTWPNPKSMPLLALREEDDDDDDDDEEEESHAPYSS----DEDEEPKERRVTDFHGRRFMSFKSRSLSMADLQ
        DIEKPENSFNKRRR LIASKLA+K+SFYTWPNPKSMPLLALREE+  DD D +E+ES APYSS    DE+EEPKE+R +DF  RR MSFKSRS S+ADLQ
Subjt:  DIEKPENSFNKRRRVLIASKLAKKSSFYTWPNPKSMPLLALREEDDDDDDDDEEEESHAPYSS----DEDEEPKERRVTDFHGRRFMSFKSRSLSMADLQ

Query:  QQHHGIDQQQEQ
        Q+HH ID Q+EQ
Subjt:  QQHHGIDQQQEQ

XP_022997371.1 uncharacterized protein LOC111492307 [Cucurbita maxima]2.2e-6468.47Show/hide
Query:  RHFLLRPILIIRDHGGSLPSSLFPQ-RSDRRFSSTSAAAVGSIGVPDDNSDDESVSSTGGDREEVESKLNGGFVSLGSLEESLPIKRGLSSHFSGKSKSF
        +H +  P   I+   G   S   P+   D R  S+S     SIGVPDD+SD+ESVSSTGGDREEV+SKLN G  S+GSLE SLPIKRGLSSHFSGKSKSF
Subjt:  RHFLLRPILIIRDHGGSLPSSLFPQ-RSDRRFSSTSAAAVGSIGVPDDNSDDESVSSTGGDREEVESKLNGGFVSLGSLEESLPIKRGLSSHFSGKSKSF

Query:  VNLAEAKSVKDIEKPENSFNKRRRVLIASKLAKKSSFYTWPNPKSMPLLALREEDDDDDDDDEEEESHAPYSS----DEDEEPKERRVTDFHGRRFMSFK
         NLAEAKSVKDIEKPENSFNKRRR  IASKLA+K+SFYTWPNPKSMPLLALREE+  DD D +++ES APYSS    DE EEPKE+R +DF+ RR MSFK
Subjt:  VNLAEAKSVKDIEKPENSFNKRRRVLIASKLAKKSSFYTWPNPKSMPLLALREEDDDDDDDDEEEESHAPYSS----DEDEEPKERRVTDFHGRRFMSFK

Query:  SRSLSMADLQQQHHGIDQQQEQ
        SRS S+ADLQQ+HH ID+Q+EQ
Subjt:  SRSLSMADLQQQHHGIDQQQEQ

XP_023546188.1 uncharacterized protein LOC111805363 [Cucurbita pepo subsp. pepo]4.4e-6571.7Show/hide
Query:  IRDHGGSLPSSLFPQ-RSDRRFSSTSAAAVGSIGVPDDNSDDESVSSTGGDREEVESKLNGGFVSLGSLEESLPIKRGLSSHFSGKSKSFVNLAEAKSVK
        I+   G   S   P+   D R  S+S     SIGVPDD+SD+ESVSSTGGDREEV+SKLN G  S+ SLE SLPIKRGLSSHFSGKSKSF NLAEAKSVK
Subjt:  IRDHGGSLPSSLFPQ-RSDRRFSSTSAAAVGSIGVPDDNSDDESVSSTGGDREEVESKLNGGFVSLGSLEESLPIKRGLSSHFSGKSKSFVNLAEAKSVK

Query:  DIEKPENSFNKRRRVLIASKLAKKSSFYTWPNPKSMPLLALREEDDDDDDDDEEEESHAPYSS----DEDEEPKERRVTDFHGRRFMSFKSRSLSMADLQ
        DIEKPENSFNKRRR LIASKLA+K+SFYTWPNPKSMPLLALREE   DD D +E+ES APYSS    DE+EEPKE+R +DFH RR MSFKSRS S+ADLQ
Subjt:  DIEKPENSFNKRRRVLIASKLAKKSSFYTWPNPKSMPLLALREEDDDDDDDDEEEESHAPYSS----DEDEEPKERRVTDFHGRRFMSFKSRSLSMADLQ

Query:  QQHHGIDQQQEQ
        Q+HH ID Q+EQ
Subjt:  QQHHGIDQQQEQ

XP_038884123.1 protein virilizer homolog [Benincasa hispida]9.1e-7180.73Show/hide
Query:  PQRSDRRFSSTSAAAVGSIGVPDDNSDDESVSSTGGDREEVESKLNGGFVSLGSLEESLPIKRGLSSHFSGKSKSFVNLAEAKSVKDIEKPENSFNKRRR
        P+  +  + S S++   SIGVPDD+SD+ESVSSTGGDREEV SKLN GFVSLGSLEESLPIKRGLS+HFSGKSKSFVNLAEAKSVKDIEKPENSFNKRRR
Subjt:  PQRSDRRFSSTSAAAVGSIGVPDDNSDDESVSSTGGDREEVESKLNGGFVSLGSLEESLPIKRGLSSHFSGKSKSFVNLAEAKSVKDIEKPENSFNKRRR

Query:  VLIASKLAKK-SSFYTWPNPKSMPLLALREE--DDDDDDDDEEEESHAPYSSDEDEEPKERRVTDFHGRRFMSFKSRSLSMADLQQQHHGID
        VLIA+KLAKK SSFYTWPNPKSMPLLALREE  DDD+++++EEEES APYSS+EDEEPK +RVTDFH R+ MSFKSRS SMADLQQQHHG+D
Subjt:  VLIASKLAKK-SSFYTWPNPKSMPLLALREE--DDDDDDDDEEEESHAPYSSDEDEEPKERRVTDFHGRRFMSFKSRSLSMADLQQQHHGID

TrEMBL top hitse value%identityAlignment
A0A6J1BUU2 uncharacterized protein LOC1110049431.2e-6376.65Show/hide
Query:  RSDRRFSSTSAAAVGSIGVPDDNSDDESVSST-GGDREEVESKLNGGFVSLGSLEESLPIKRGLSSHFSGKSKSFVNLAEAKSVKDIEKPENSFNKRRRV
        R D  + S S++   SIGVPDD SDDESVSST GGD EEV+SKLN G  SLGSLEESLPIKRGLS+HFSGKSKSF NLAEAKSVK+IEKPEN FNKRRRV
Subjt:  RSDRRFSSTSAAAVGSIGVPDDNSDDESVSST-GGDREEVESKLNGGFVSLGSLEESLPIKRGLSSHFSGKSKSFVNLAEAKSVKDIEKPENSFNKRRRV

Query:  LIASKLAKKSSFYTWPNPKSMPLLALREEDDDDDDDDEEEESHAPYSS----DEDEEPKERRVTDFHGRRFMSFKSRSLSMADLQQQHHGIDQQQEQ
        LIASKLA++SSFY WPNPKSMPLLALREE   DDDD+EEEESHAPYSS    DEDE+ K +RV+DFH RR MSFKSRS S+ADLQQQHH  D Q+EQ
Subjt:  LIASKLAKKSSFYTWPNPKSMPLLALREEDDDDDDDDEEEESHAPYSS----DEDEEPKERRVTDFHGRRFMSFKSRSLSMADLQQQHHGIDQQQEQ

A0A6J1GJ08 uncharacterized protein LOC1114543111.1e-6179.89Show/hide
Query:  SIGVPD-DNSDDESVSSTGGDR-EEVESKLNGGFVSLGSLEESLPIKRGLSSHFSGKSKSFVNLAEAKSVKDIEKPENSFNKRRRVLIASKLAKKSSFYT
        SIGVPD D S+DES+SSTGGD+ EEV SKL+ GFVSLGSLEESLPIKRGLSSHFSGK KSF NLAEAKSVKDI KPENSFNKRRR+LIASKLAKKSSFYT
Subjt:  SIGVPD-DNSDDESVSSTGGDR-EEVESKLNGGFVSLGSLEESLPIKRGLSSHFSGKSKSFVNLAEAKSVKDIEKPENSFNKRRRVLIASKLAKKSSFYT

Query:  WPNPKSMPLLALREEDDDDDDDDEEEESHAP--YSS----DEDEEPKERRVTDFHGRRFMSFKSRSLSMADLQQQHHGIDQQQE
        WPNPKSMPLLALRE DDDDDDD +EEE  +P  YSS    DEDEE K +RV+DFH RRFMSFKSR  S+ADLQQ+HH  DQ QE
Subjt:  WPNPKSMPLLALREEDDDDDDDDEEEESHAP--YSS----DEDEEPKERRVTDFHGRRFMSFKSRSLSMADLQQQHHGIDQQQE

A0A6J1HC13 uncharacterized protein LOC1114626331.2e-6068.22Show/hide
Query:  RHFLLRPILIIRDHGGSLPSSLFPQ-RSDRRFSSTSAAAVGSIGVPDDNSDDESVSSTGGDREEVESKLNGGFVSLGSLEESLPIKRGLSSHFSGKSKSF
        +H +  P   I+   G   S   P+   D R  S+S     SIGVPDD+SD+ESVS TGGD EEV+SKLN G  S+ SLE SLPIKRGLSSHFSGKSKSF
Subjt:  RHFLLRPILIIRDHGGSLPSSLFPQ-RSDRRFSSTSAAAVGSIGVPDDNSDDESVSSTGGDREEVESKLNGGFVSLGSLEESLPIKRGLSSHFSGKSKSF

Query:  VNLAEAKSVKDIEKPENSFNKRRRVLIASKLAKKSSFYTWPNPKSMPLLALREEDDDDDDDDEEEESHAPYSS----DEDEEPKERRVTDFHGRRFMSFK
         NLAEAKSVKDIEKPENSFNKRRR LIASKLA+K+SFYTWPNPKSMPLLALREE+  DD D +E+ES APYSS    DE+EEPKE+R +DF  RR MSFK
Subjt:  VNLAEAKSVKDIEKPENSFNKRRRVLIASKLAKKSSFYTWPNPKSMPLLALREEDDDDDDDDEEEESHAPYSS----DEDEEPKERRVTDFHGRRFMSFK

Query:  SRSLSMADLQQQHH
        SRS S+ADLQQ+HH
Subjt:  SRSLSMADLQQQHH

A0A6J1K7A7 uncharacterized protein LOC1114923071.0e-6468.47Show/hide
Query:  RHFLLRPILIIRDHGGSLPSSLFPQ-RSDRRFSSTSAAAVGSIGVPDDNSDDESVSSTGGDREEVESKLNGGFVSLGSLEESLPIKRGLSSHFSGKSKSF
        +H +  P   I+   G   S   P+   D R  S+S     SIGVPDD+SD+ESVSSTGGDREEV+SKLN G  S+GSLE SLPIKRGLSSHFSGKSKSF
Subjt:  RHFLLRPILIIRDHGGSLPSSLFPQ-RSDRRFSSTSAAAVGSIGVPDDNSDDESVSSTGGDREEVESKLNGGFVSLGSLEESLPIKRGLSSHFSGKSKSF

Query:  VNLAEAKSVKDIEKPENSFNKRRRVLIASKLAKKSSFYTWPNPKSMPLLALREEDDDDDDDDEEEESHAPYSS----DEDEEPKERRVTDFHGRRFMSFK
         NLAEAKSVKDIEKPENSFNKRRR  IASKLA+K+SFYTWPNPKSMPLLALREE+  DD D +++ES APYSS    DE EEPKE+R +DF+ RR MSFK
Subjt:  VNLAEAKSVKDIEKPENSFNKRRRVLIASKLAKKSSFYTWPNPKSMPLLALREEDDDDDDDDEEEESHAPYSS----DEDEEPKERRVTDFHGRRFMSFK

Query:  SRSLSMADLQQQHHGIDQQQEQ
        SRS S+ADLQQ+HH ID+Q+EQ
Subjt:  SRSLSMADLQQQHHGIDQQQEQ

A0A6J1KP27 uncharacterized protein LOC1114963052.9e-5978.57Show/hide
Query:  SIGVPD-DNSDDESVSSTGGD-REEVESKLNGGFVSLGSLEESLPIKRGLSSHFSGKSKSFVNLAEAKSVKDIEKPENSFNKRRRVLIASKLAKKSSFYT
        SIGVPD D S+DES+SSTGGD  EEV SKL+ GF SLGSLEESLPIKRGLSSHFSGK KSF NLAEAKSVKDI KPENSFNKRRR+LIASKLAKKSSFYT
Subjt:  SIGVPD-DNSDDESVSSTGGD-REEVESKLNGGFVSLGSLEESLPIKRGLSSHFSGKSKSFVNLAEAKSVKDIEKPENSFNKRRRVLIASKLAKKSSFYT

Query:  WPNPKSMPLLALREEDDDDDDDDEEEESHAPYSSD----EDEEPKERRVTDFHGRRFMSFKSRSLSMADLQQQHHGIDQQQE
        WPNPKSMPLLALRE  DDDD D+EE++S A YSS+    EDEEPK + V+DFH RRFMSFKSR  S+ADLQQ+HH  DQ QE
Subjt:  WPNPKSMPLLALREEDDDDDDDDEEEESHAPYSSD----EDEEPKERRVTDFHGRRFMSFKSRSLSMADLQQQHHGIDQQQE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G24550.1 unknown protein9.2e-2146.9Show/hide
Query:  PDDNSDDESVSSTGGDREEVESKLNGGFVSLG-------SLEESLPIKRGLSSHFSGKSKSFVNLAEAKS-VKDIEKPENSFNKRRRVLIASKLAKK---
        P+++SD  S      + EE E + +      G       SLE+SLPIKRGLS+H+ GKSKSF NL EA S  KD+EK EN FNKRRR++IA+KL ++   
Subjt:  PDDNSDDESVSSTGGDREEVESKLNGGFVSLG-------SLEESLPIKRGLSSHFSGKSKSFVNLAEAKS-VKDIEKPENSFNKRRRVLIASKLAKK---

Query:  ---SSFYTWPNPKSMPLLALRE--------EDDDDDDDDEEEESH
           S+FY+W NP SMPLLAL+E         +DD +DDD + + H
Subjt:  ---SSFYTWPNPKSMPLLALRE--------EDDDDDDDDEEEESH

AT3G43850.1 unknown protein1.4e-1340.15Show/hide
Query:  SSTSAAAVGSIGVPDDNSDDESVSSTGGDREEVESKLNGGFVSLGSLEESLPIKRGLSSHFSGKSKSFVNLAEAKS--VKDIEKPENSFNKRRRVLIASK
        SSTS+ ++G      +NSDD+      G   E+ES  NG    + SLEE+LPIKR +S  + GKSKSF++L+E  S  VKD+ KPEN +++RRR L++ +
Subjt:  SSTSAAAVGSIGVPDDNSDDESVSSTGGDREEVESKLNGGFVSLGSLEESLPIKRGLSSHFSGKSKSFVNLAEAKS--VKDIEKPENSFNKRRRVLIASK

Query:  LAKKSSFYTWPNPKSMPLLALREEDDDDDDDD
        +  +      P  KS+  ++ RE D     DD
Subjt:  LAKKSSFYTWPNPKSMPLLALREEDDDDDDDD

AT4G31510.1 unknown protein6.3e-2242.5Show/hide
Query:  IRDHGGSLPSSLFPQRSDRRFS-STSAAAVGSIGVPDDNSDDE--SVSSTGGDREEVESKLNGGFVSLGSLEESLPIKRGLSSHFSGKSKSFVNLAEAKS
        +  H  ++P+SL  +   RR   S    +  S+G   +N +DE  +VSS+ G       +    F S  SLE+SLPIKRGLS+H+ GKSKSF NL EA +
Subjt:  IRDHGGSLPSSLFPQRSDRRFS-STSAAAVGSIGVPDDNSDDE--SVSSTGGDREEVESKLNGGFVSLGSLEESLPIKRGLSSHFSGKSKSFVNLAEAKS

Query:  VKDIEKPENSFNKRRRVLIASKLAKKS-----SFYTWPNPKSMPLLALREEDDDD----DDDDEEEESHAPYSSDEDEEPKERRVTDFHGRRFMSFKSRS
          D+ K E+  NKRRR+LIA+KL ++S     S YT  NP SMPLLAL+E D++D    DDDD+++ S    S DE  + KE+R+   + R FM  +++S
Subjt:  VKDIEKPENSFNKRRRVLIASKLAKKS-----SFYTWPNPKSMPLLALREEDDDD----DDDDEEEESHAPYSSDEDEEPKERRVTDFHGRRFMSFKSRS

AT5G21940.1 unknown protein4.1e-1342.96Show/hide
Query:  PQRSDRRFSSTSAAAVGSIGVPDDNSDD-ESVSSTGGD---REEVESKLNGGFVSLGSLEESLPIKRGLSSHFSGKSKSFVNL-AEA-------KSVKDI
        P  SD   SS S++A  SIG    NSDD E  S  GGD     EVES   G    + SLE+ LP+++G+S ++SGKSKSF NL AEA        S+KD+
Subjt:  PQRSDRRFSSTSAAAVGSIGVPDDNSDD-ESVSSTGGD---REEVESKLNGGFVSLGSLEESLPIKRGLSSHFSGKSKSFVNL-AEA-------KSVKDI

Query:  EKPENSFNKRRRVLIASKLAKKSSFYTWPNPKSMP
         KPEN +++RRR L+  ++        W N K+ P
Subjt:  EKPENSFNKRRRVLIASKLAKKSSFYTWPNPKSMP

AT5G24890.1 unknown protein4.1e-2947.22Show/hide
Query:  SIGVPDDNSDDESVSSTGGDREEVESKLNG--GFVSLGSLEESLPIKRGLSSHFSGKSKSFVNLAEAKSVKDIEKPENSFNKRRRVLIASKLAKKSSFYT
        SIG P D+ +DE  S    + ++V SK  G  G  S+ SLE+SLP KRGLS+H+ GKSKSF NL E  SVK++ K EN  NKRRR+ I +KLA+K SFY+
Subjt:  SIGVPDDNSDDESVSSTGGDREEVESKLNG--GFVSLGSLEESLPIKRGLSSHFSGKSKSFVNLAEAKSVKDIEKPENSFNKRRRVLIASKLAKKSSFYT

Query:  WPNPKSMPLLALREEDDDDDDDDEEEESHAPY---SSDEDEEPKERRVTDFHGRRFMSFKSRS-LSMADLQQQHHGIDQQ
        W NPKSMPLL + E++DDDD+DD+EE+  + +    S  DEE  ++ V      +  ++KSRS  +++DL ++    D Q
Subjt:  WPNPKSMPLLALREEDDDDDDDDEEEESHAPY---SSDEDEEPKERRVTDFHGRRFMSFKSRS-LSMADLQQQHHGIDQQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACAACGAAGCTTCTACCGGCGACCTCTAGAAGAACCACTCTCCGCCACTTTCTCCTCCGTCCGATTCTCATTATTCGCGATCATGGAGGTTCTCTTCCCTCCTCCCT
CTTTCCTCAGAGATCGGACCGCCGCTTCTCCTCCACCTCCGCCGCCGCCGTCGGATCGATTGGAGTTCCTGATGATAATAGCGATGACGAGAGCGTTTCATCCACTGGCG
GAGATCGTGAGGAAGTTGAGAGTAAATTAAATGGAGGATTCGTTTCTCTTGGATCGTTGGAAGAGTCTCTTCCGATTAAGAGAGGATTATCGAGTCATTTTTCTGGAAAA
TCGAAATCGTTTGTGAATCTAGCAGAGGCGAAATCAGTAAAAGATATTGAGAAACCTGAAAATTCATTCAATAAGAGAAGACGGGTTTTGATTGCGTCGAAATTAGCTAA
GAAATCGTCGTTCTACACCTGGCCAAACCCTAAGTCGATGCCTCTGTTAGCGCTGAGAGAAGAAGACGATGATGACGACGACGACGACGAAGAAGAAGAATCTCATGCTC
CATATTCTTCCGATGAAGATGAAGAACCGAAAGAAAGAAGAGTTACAGATTTTCATGGGAGAAGGTTTATGAGCTTCAAGTCGAGAAGCTTGTCTATGGCGGATCTGCAA
CAGCAGCACCATGGTATTGATCAACAACAAGAACAATAA
mRNA sequenceShow/hide mRNA sequence
ATGACAACGAAGCTTCTACCGGCGACCTCTAGAAGAACCACTCTCCGCCACTTTCTCCTCCGTCCGATTCTCATTATTCGCGATCATGGAGGTTCTCTTCCCTCCTCCCT
CTTTCCTCAGAGATCGGACCGCCGCTTCTCCTCCACCTCCGCCGCCGCCGTCGGATCGATTGGAGTTCCTGATGATAATAGCGATGACGAGAGCGTTTCATCCACTGGCG
GAGATCGTGAGGAAGTTGAGAGTAAATTAAATGGAGGATTCGTTTCTCTTGGATCGTTGGAAGAGTCTCTTCCGATTAAGAGAGGATTATCGAGTCATTTTTCTGGAAAA
TCGAAATCGTTTGTGAATCTAGCAGAGGCGAAATCAGTAAAAGATATTGAGAAACCTGAAAATTCATTCAATAAGAGAAGACGGGTTTTGATTGCGTCGAAATTAGCTAA
GAAATCGTCGTTCTACACCTGGCCAAACCCTAAGTCGATGCCTCTGTTAGCGCTGAGAGAAGAAGACGATGATGACGACGACGACGACGAAGAAGAAGAATCTCATGCTC
CATATTCTTCCGATGAAGATGAAGAACCGAAAGAAAGAAGAGTTACAGATTTTCATGGGAGAAGGTTTATGAGCTTCAAGTCGAGAAGCTTGTCTATGGCGGATCTGCAA
CAGCAGCACCATGGTATTGATCAACAACAAGAACAATAA
Protein sequenceShow/hide protein sequence
MTTKLLPATSRRTTLRHFLLRPILIIRDHGGSLPSSLFPQRSDRRFSSTSAAAVGSIGVPDDNSDDESVSSTGGDREEVESKLNGGFVSLGSLEESLPIKRGLSSHFSGK
SKSFVNLAEAKSVKDIEKPENSFNKRRRVLIASKLAKKSSFYTWPNPKSMPLLALREEDDDDDDDDEEEESHAPYSSDEDEEPKERRVTDFHGRRFMSFKSRSLSMADLQ
QQHHGIDQQQEQ