CuGenDBv2

HG10016319 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10016319
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Arabidopsis protein of unknown function (DUF241)
Genome location: Chr03:4181700..4182377
RNA-Seq Expression: HG10016319
Synteny: HG10016319
Gene Ontology terms: NA
InterPro domains: IPR004320 - Protein of unknown function DUF241, plant


Homology
GenBank top hits (e value, % identity, alignment)
KAG6603870.1 hypothetical protein SDJN03_04479, partial [Cucurbita argyrosperma subsp. sororia]; e value 6.0e-96; identity 85.78%
Query:  MGSTQKVLSCPQHKQLVEELLDGSMKLLDVCSLAKDMTLETQQHVGALHSAVRRRKGDSAIKTTTAAYTCYRKRMKKEAKKLITSMKKMNEKFNLITSME
        MGSTQ+VLSCPQ+KQLV+ELLD SMKLLDVCSLAKD+ LETQQHVGALHSA RRRKGDSA+KTTTAAY  YRK+MKKE K LITSMKKMNEKFN  T M+
Subjt:  MGSTQKVLSCPQHKQLVEELLDGSMKLLDVCSLAKDMTLETQQHVGALHSAVRRRKGDSAIKTTTAAYTCYRKRMKKEAKKLITSMKKMNEKFNLITSME

Query:  NLDHHLSSVIGALRQACSTNSSIFESVLLYLTPLTKSKARGWSLVSKWVHKGAIACESNSGLNEFENVDVALSSVVEEMEVEKSQIAQKRLETLEMAAQE
        N D+HLSSV+ ALRQ CSTNSSIFE+VLLYLTPLTK KARGWSLV+KWVHKGAIACESNS +NEFENVDVALSSVVE MEVEK QIAQ+RLE+LEMAAQE
Subjt:  NLDHHLSSVIGALRQACSTNSSIFESVLLYLTPLTKSKARGWSLVSKWVHKGAIACESNSGLNEFENVDVALSSVVEEMEVEKSQIAQKRLETLEMAAQE

Query:  IESGLDGLFRRLIKTRASLLNIISQ
        IESGLDG+FRRLIKTRASLLNIISQ
Subjt:  IESGLDGLFRRLIKTRASLLNIISQ

XP_008440893.1 PREDICTED: uncharacterized protein LOC103485177 [Cucumis melo]; e value 5.8e-107; identity 93.33%
Query:  MGSTQKVLSCPQHKQLVEELLDGSMKLLDVCSLAKDMTLETQQHVGALHSAVRRRKGDSAIKTTTAAYTCYRKRMKKEAKKLITSMKKMNEKFNLITSME
        MGSTQKVLSCPQHKQLVEELLDGSMKLLDVCSLAK++TLETQQHVGALHSAVRRRKGDSA+KT TAAY CYRKRMKKEAKKLITSMKKMNEKFN  T ME
Subjt:  MGSTQKVLSCPQHKQLVEELLDGSMKLLDVCSLAKDMTLETQQHVGALHSAVRRRKGDSAIKTTTAAYTCYRKRMKKEAKKLITSMKKMNEKFNLITSME

Query:  NLDHHLSSVIGALRQACSTNSSIFESVLLYLTPLTKSKARGWSLVSKWVHKGAIACESNSGLNEFENVDVALSSVVEEMEVEKSQIAQKRLETLEMAAQE
        N DHHLSSVIGALRQACSTNS IFESVL+YLTPLTKSKARGWSLVSKWVHKGAIACESNSGLNEFENVDVALSS+VEEMEVEKSQIAQKRLE+LEMAAQE
Subjt:  NLDHHLSSVIGALRQACSTNSSIFESVLLYLTPLTKSKARGWSLVSKWVHKGAIACESNSGLNEFENVDVALSSVVEEMEVEKSQIAQKRLETLEMAAQE

Query:  IESGLDGLFRRLIKTRASLLNIISQ
        IESGLDG+FRRLIKTRAS+LNIISQ
Subjt:  IESGLDGLFRRLIKTRASLLNIISQ

XP_011658068.1 uncharacterized protein LOC101217795 [Cucumis sativus]; e value 1.9e-105; identity 92%
Query:  MGSTQKVLSCPQHKQLVEELLDGSMKLLDVCSLAKDMTLETQQHVGALHSAVRRRKGDSAIKTTTAAYTCYRKRMKKEAKKLITSMKKMNEKFNLITSME
        MGSTQKVLSCPQHKQ VEELLDGSMKLLDVCSLAK++TLETQQHVGALHSAVRRRKGDSA+KT T AY CYRKRMKKEAKKLITSMKKMNEKFN  T ME
Subjt:  MGSTQKVLSCPQHKQLVEELLDGSMKLLDVCSLAKDMTLETQQHVGALHSAVRRRKGDSAIKTTTAAYTCYRKRMKKEAKKLITSMKKMNEKFNLITSME

Query:  NLDHHLSSVIGALRQACSTNSSIFESVLLYLTPLTKSKARGWSLVSKWVHKGAIACESNSGLNEFENVDVALSSVVEEMEVEKSQIAQKRLETLEMAAQE
        N DHHLSSVIGALRQACSTN+ IFESVL+YLTPLTKSKARGWSLVSKWVHKGAIACESNSGLNEFENVDVALSSVV+EMEVEKSQIAQKRLE+LEMAAQE
Subjt:  NLDHHLSSVIGALRQACSTNSSIFESVLLYLTPLTKSKARGWSLVSKWVHKGAIACESNSGLNEFENVDVALSSVVEEMEVEKSQIAQKRLETLEMAAQE

Query:  IESGLDGLFRRLIKTRASLLNIISQ
        IESGLDG+FRRLIKTRAS+LNIISQ
Subjt:  IESGLDGLFRRLIKTRASLLNIISQ

XP_023544003.1 uncharacterized protein LOC111803711 [Cucurbita pepo subsp. pepo]; e value 3.0e-95; identity 84.89%
Query:  MGSTQKVLSCPQHKQLVEELLDGSMKLLDVCSLAKDMTLETQQHVGALHSAVRRRKGDSAIKTTTAAYTCYRKRMKKEAKKLITSMKKMNEKFNLITSME
        MGSTQ+VLSCPQ+KQLV+ELLD SMKLLDVCSLAKD+ LETQ HVGALHSA RRRKGDSA+KTTTAAY  YRK+MKKE K LITSMKKMNEKFN  + M+
Subjt:  MGSTQKVLSCPQHKQLVEELLDGSMKLLDVCSLAKDMTLETQQHVGALHSAVRRRKGDSAIKTTTAAYTCYRKRMKKEAKKLITSMKKMNEKFNLITSME

Query:  NLDHHLSSVIGALRQACSTNSSIFESVLLYLTPLTKSKARGWSLVSKWVHKGAIACESNSGLNEFENVDVALSSVVEEMEVEKSQIAQKRLETLEMAAQE
        N D+HLSSV+ ALRQ CSTNSSIFESVLLYLTPLTK KARGWSLV+KWVHKGAIACESNS +NEFENVDVALSSVVEEMEVEK Q+AQ+RLE+LEMAAQE
Subjt:  NLDHHLSSVIGALRQACSTNSSIFESVLLYLTPLTKSKARGWSLVSKWVHKGAIACESNSGLNEFENVDVALSSVVEEMEVEKSQIAQKRLETLEMAAQE

Query:  IESGLDGLFRRLIKTRASLLNIISQ
        IESGLDG+FRRLIK RASLLNIISQ
Subjt:  IESGLDGLFRRLIKTRASLLNIISQ

XP_038882236.1 uncharacterized protein LOC120073459 [Benincasa hispida]; e value 4.9e-106; identity 93.78%
Query:  MGSTQKVLSCPQHKQLVEELLDGSMKLLDVCSLAKDMTLETQQHVGALHSAVRRRKGDSAIKTTTAAYTCYRKRMKKEAKKLITSMKKMNEKFNLITSME
        MGSTQKVLSCPQHK+LV+ELLDGSMKLLDVCSLAKDMTLETQQHVGAL+SAVRRRKGDSAI+TTTAAYTCYRKRMKKEAKKLITSMKKMNEKFN   SME
Subjt:  MGSTQKVLSCPQHKQLVEELLDGSMKLLDVCSLAKDMTLETQQHVGALHSAVRRRKGDSAIKTTTAAYTCYRKRMKKEAKKLITSMKKMNEKFNLITSME

Query:  NLDHHLSSVIGALRQACSTNSSIFESVLLYLTPLTKSKARGWSLVSKWVHKGAIACESNSGLNEFENVDVALSSVVEEMEVEKSQIAQKRLETLEMAAQE
          DHHLSSVIGALRQACSTNSSIFESVLLYL PLTKSKARGWSLVSKW HKGAIACESNSGLNEFENVDVALS VVEEMEVEKSQIAQKRLE+LEMA QE
Subjt:  NLDHHLSSVIGALRQACSTNSSIFESVLLYLTPLTKSKARGWSLVSKWVHKGAIACESNSGLNEFENVDVALSSVVEEMEVEKSQIAQKRLETLEMAAQE

Query:  IESGLDGLFRRLIKTRASLLNIISQ
        IESGLDGLFRRLIKTRASLLNIISQ
Subjt:  IESGLDGLFRRLIKTRASLLNIISQ

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KGU4 Uncharacterized protein; e value 9.1e-106; identity 92%
Query:  MGSTQKVLSCPQHKQLVEELLDGSMKLLDVCSLAKDMTLETQQHVGALHSAVRRRKGDSAIKTTTAAYTCYRKRMKKEAKKLITSMKKMNEKFNLITSME
        MGSTQKVLSCPQHKQ VEELLDGSMKLLDVCSLAK++TLETQQHVGALHSAVRRRKGDSA+KT T AY CYRKRMKKEAKKLITSMKKMNEKFN  T ME
Subjt:  MGSTQKVLSCPQHKQLVEELLDGSMKLLDVCSLAKDMTLETQQHVGALHSAVRRRKGDSAIKTTTAAYTCYRKRMKKEAKKLITSMKKMNEKFNLITSME

Query:  NLDHHLSSVIGALRQACSTNSSIFESVLLYLTPLTKSKARGWSLVSKWVHKGAIACESNSGLNEFENVDVALSSVVEEMEVEKSQIAQKRLETLEMAAQE
        N DHHLSSVIGALRQACSTN+ IFESVL+YLTPLTKSKARGWSLVSKWVHKGAIACESNSGLNEFENVDVALSSVV+EMEVEKSQIAQKRLE+LEMAAQE
Subjt:  NLDHHLSSVIGALRQACSTNSSIFESVLLYLTPLTKSKARGWSLVSKWVHKGAIACESNSGLNEFENVDVALSSVVEEMEVEKSQIAQKRLETLEMAAQE

Query:  IESGLDGLFRRLIKTRASLLNIISQ
        IESGLDG+FRRLIKTRAS+LNIISQ
Subjt:  IESGLDGLFRRLIKTRASLLNIISQ

A0A1S3B1Q6 uncharacterized protein LOC103485177; e value 2.8e-107; identity 93.33%
Query:  MGSTQKVLSCPQHKQLVEELLDGSMKLLDVCSLAKDMTLETQQHVGALHSAVRRRKGDSAIKTTTAAYTCYRKRMKKEAKKLITSMKKMNEKFNLITSME
        MGSTQKVLSCPQHKQLVEELLDGSMKLLDVCSLAK++TLETQQHVGALHSAVRRRKGDSA+KT TAAY CYRKRMKKEAKKLITSMKKMNEKFN  T ME
Subjt:  MGSTQKVLSCPQHKQLVEELLDGSMKLLDVCSLAKDMTLETQQHVGALHSAVRRRKGDSAIKTTTAAYTCYRKRMKKEAKKLITSMKKMNEKFNLITSME

Query:  NLDHHLSSVIGALRQACSTNSSIFESVLLYLTPLTKSKARGWSLVSKWVHKGAIACESNSGLNEFENVDVALSSVVEEMEVEKSQIAQKRLETLEMAAQE
        N DHHLSSVIGALRQACSTNS IFESVL+YLTPLTKSKARGWSLVSKWVHKGAIACESNSGLNEFENVDVALSS+VEEMEVEKSQIAQKRLE+LEMAAQE
Subjt:  NLDHHLSSVIGALRQACSTNSSIFESVLLYLTPLTKSKARGWSLVSKWVHKGAIACESNSGLNEFENVDVALSSVVEEMEVEKSQIAQKRLETLEMAAQE

Query:  IESGLDGLFRRLIKTRASLLNIISQ
        IESGLDG+FRRLIKTRAS+LNIISQ
Subjt:  IESGLDGLFRRLIKTRASLLNIISQ

A0A5D3CMX4 DUF241 domain protein; e value 2.8e-107; identity 93.33%
Query:  MGSTQKVLSCPQHKQLVEELLDGSMKLLDVCSLAKDMTLETQQHVGALHSAVRRRKGDSAIKTTTAAYTCYRKRMKKEAKKLITSMKKMNEKFNLITSME
        MGSTQKVLSCPQHKQLVEELLDGSMKLLDVCSLAK++TLETQQHVGALHSAVRRRKGDSA+KT TAAY CYRKRMKKEAKKLITSMKKMNEKFN  T ME
Subjt:  MGSTQKVLSCPQHKQLVEELLDGSMKLLDVCSLAKDMTLETQQHVGALHSAVRRRKGDSAIKTTTAAYTCYRKRMKKEAKKLITSMKKMNEKFNLITSME

Query:  NLDHHLSSVIGALRQACSTNSSIFESVLLYLTPLTKSKARGWSLVSKWVHKGAIACESNSGLNEFENVDVALSSVVEEMEVEKSQIAQKRLETLEMAAQE
        N DHHLSSVIGALRQACSTNS IFESVL+YLTPLTKSKARGWSLVSKWVHKGAIACESNSGLNEFENVDVALSS+VEEMEVEKSQIAQKRLE+LEMAAQE
Subjt:  NLDHHLSSVIGALRQACSTNSSIFESVLLYLTPLTKSKARGWSLVSKWVHKGAIACESNSGLNEFENVDVALSSVVEEMEVEKSQIAQKRLETLEMAAQE

Query:  IESGLDGLFRRLIKTRASLLNIISQ
        IESGLDG+FRRLIKTRAS+LNIISQ
Subjt:  IESGLDGLFRRLIKTRASLLNIISQ

A0A6J1BTB3 uncharacterized protein LOC111005532; e value 4.4e-92; identity 80.44%
Query:  MGSTQKVLSCPQHKQLVEELLDGSMKLLDVCSLAKDMTLETQQHVGALHSAVRRRKGDSAIKTTTAAYTCYRKRMKKEAKKLITSMKKMNEKFNLITSME
        MGSTQ+VL+  +++QLV+ELLDGSMKLLD+CSLAK+MTLETQ HVGAL SAVRRRKGDSA++T  AAYTC+RK+MKKEAKKLITSM+KM+EK N +T + 
Subjt:  MGSTQKVLSCPQHKQLVEELLDGSMKLLDVCSLAKDMTLETQQHVGALHSAVRRRKGDSAIKTTTAAYTCYRKRMKKEAKKLITSMKKMNEKFNLITSME

Query:  NLDHHLSSVIGALRQACSTNSSIFESVLLYLTPLTKSKARGWSLVSKWVHKGAIACESNSGLNEFENVDVALSSVVEEMEVEKSQIAQKRLETLEMAAQE
        NLDHHLSSVIGALRQACSTNSSIFESV LYLTPL K +ARGWSLVSKWVHKGAIACE NSG+NEFENVD AL SV E +E+EK QIAQ+RLE LEMAAQ+
Subjt:  NLDHHLSSVIGALRQACSTNSSIFESVLLYLTPLTKSKARGWSLVSKWVHKGAIACESNSGLNEFENVDVALSSVVEEMEVEKSQIAQKRLETLEMAAQE

Query:  IESGLDGLFRRLIKTRASLLNIISQ
        IESGLDG+FRRLI+TRASLLNIISQ
Subjt:  IESGLDGLFRRLIKTRASLLNIISQ

A0A6J1EF12 uncharacterized protein LOC111433572; e value 1.7e-91; identity 81.78%
Query:  MGSTQKVLSCPQHKQLVEELLDGSMKLLDVCSLAKDMTLETQQHVGALHSAVRRRKGDSAIKTTTAAYTCYRKRMKKEAKKLITSMKKMNEKFNLITSME
        MGST ++LSCPQ KQLV+ELLDGS+KLLDVCSLAKDM  +TQQHVGAL+SAVRRRKGDSAIKTT AAYTCYRK+MKKEAKKL+ SMKKMNEKFN  T M 
Subjt:  MGSTQKVLSCPQHKQLVEELLDGSMKLLDVCSLAKDMTLETQQHVGALHSAVRRRKGDSAIKTTTAAYTCYRKRMKKEAKKLITSMKKMNEKFNLITSME

Query:  NLDHHLSSVIGALRQACSTNSSIFESVLLYLTPLTKSKARGWSLVSKWVHKGAIACESNSGLNEFENVDVALSSVVEEMEVEKSQIAQKRLETLEMAAQE
        NLD+HLSSVIGA+RQA S NSSI ESVLLYLTPLTK KA GWSLVSKWVH GAIACES SGLNEFE+VD+ALSS VEE EVEK  +AQ+RLE+LEM A+E
Subjt:  NLDHHLSSVIGALRQACSTNSSIFESVLLYLTPLTKSKARGWSLVSKWVHKGAIACESNSGLNEFENVDVALSSVVEEMEVEKSQIAQKRLETLEMAAQE

Query:  IESGLDGLFRRLIKTRASLLNIISQ
        IESGLDG+FRR IKTRASLLNI+SQ
Subjt:  IESGLDGLFRRLIKTRASLLNIISQ

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT2G17080.1 Arabidopsis protein of unknown function (DUF241); e value 1.5e-20; identity 33.94%
Query:  TQKVLSCPQHKQLVEELLDGSMKLLDVCSLAKDMTLETQQHVGALHSAVRRRKGDSAIKTTTAAYTCYRKRMKKEAKKLITSMKKMNEKFNLITSMENLD
        TQ+ LS   +K+ VE+LLDGS+++LD+C+++KD   E ++ +  + S +RR++GD  +      Y   RK +KK  +K+  S+K        +T  E+ +
Subjt:  TQKVLSCPQHKQLVEELLDGSMKLLDVCSLAKDMTLETQQHVGALHSAVRRRKGDSAIKTTTAAYTCYRKRMKKEAKKLITSMKKMNEKFNLITSMENLD

Query:  HHLSSVIGALRQACSTNSSIFESVLLYLT-PLTKSKARGWSLVSKWVHKGAIACESNSGLNEFENVDVALSSVVEEMEVEKSQIAQKRLETLEMAAQEIE
            +V G   +A +   S+F+S+L Y++   T SK   WS+VSK ++K  + CE+    NEF  VD        E + EK+ +    ++ LE   Q++E
Subjt:  HHLSSVIGALRQACSTNSSIFESVLLYLT-PLTKSKARGWSLVSKWVHKGAIACESNSGLNEFENVDVALSSVVEEMEVEKSQIAQKRLETLEMAAQEIE

Query:  SGLDGLFRRLIKTRASLLNII
         GL+ L + LIK R S LNI+
Subjt:  SGLDGLFRRLIKTRASLLNII

AT2G17680.1 Arabidopsis protein of unknown function (DUF241); e value 1.1e-23; identity 35.29%
Query:  MGSTQKVLSCPQHK---------QLVEELLDGSMKLLDVCSLAKDMTLETQQHVGALHSAVRRRKGDSAIKTTTAAYTCYRKRMKKEAKKLITSMKKMNE
        MGSTQ+VLS    K         + +EE+LDGS++L+D+C++++D+ +ET +HV  L S VRRRK         + Y  +RK M+KE KKL+ S+K +N 
Subjt:  MGSTQKVLSCPQHK---------QLVEELLDGSMKLLDVCSLAKDMTLETQQHVGALHSAVRRRKGDSAIKTTTAAYTCYRKRMKKEAKKLITSMKKMNE

Query:  KFNLITSMENLDH----HLSSVIGALRQACSTNSSIFESVLLYLTPLTKSKARGWSLVSKWVHKGAIACESNSGLNEFENVDVALSSVVEEMEVEKSQIA
           L+      D     H  +VI A+R+      S+ +S   +L+           L    ++K           NE ENVD A+       +       
Subjt:  KFNLITSMENLDH----HLSSVIGALRQACSTNSSIFESVLLYLTPLTKSKARGWSLVSKWVHKGAIACESNSGLNEFENVDVALSSVVEEMEVEKSQIA

Query:  QKRLETLEMAAQEIESGLDGLFRRLIKTRASLLNIISQ
         ++LE +E+   + E  L+GLFR LIKTRASLLNIISQ
Subjt:  QKRLETLEMAAQEIESGLDGLFRRLIKTRASLLNIISQ

AT4G35210.1 Arabidopsis protein of unknown function (DUF241); e value 5.4e-18; identity 32.37%
Query:  VEELLDGSMKLLDVCSLAKDMTLETQQHVGALHSAVRRRKGDSAIKTTTAAYTCYRKRMKKEAKKLITSMKKMNEKFNLITSMENLDHHLSSVIGALRQA
        +E+LLDGS+K+LD+CS++KD   + ++ +  + S VRR++GD  +      Y   RK +KK  +K++ S+K    K              +  +    +A
Subjt:  VEELLDGSMKLLDVCSLAKDMTLETQQHVGALHSAVRRRKGDSAIKTTTAAYTCYRKRMKKEAKKLITSMKKMNEKFNLITSMENLDHHLSSVIGALRQA

Query:  CSTNSSIFESVLLYLTPLTKSKARG-WSLVSKWVHKGAIACESNSGLNEFENVDVALSSVVEEMEVEKSQIAQKRLETLEMAAQEIESGLDGLFRRLIKT
         +   ++FES+  +   ++ SKA G WSLVSK + +    CE+ +  NEF  VD+       E + EKS +  + ++ LE+  Q++E G+  L + LIK 
Subjt:  CSTNSSIFESVLLYLTPLTKSKARG-WSLVSKWVHKGAIACESNSGLNEFENVDVALSSVVEEMEVEKSQIAQKRLETLEMAAQEIESGLDGLFRRLIKT

Query:  RASLLNI
        R S+LNI
Subjt:  RASLLNI

AT4G35690.1 Arabidopsis protein of unknown function (DUF241); e value 1.0e-32; identity 38.43%
Query:  MGSTQKVLSCPQHKQLVEELLDGSMKLLDVCSLAKDMTLETQQHVGALHSAVRRRK---GDSAIKTTTAAYTCYRKRMKKEAKKLITSMKKMNEKFNLIT
        MGSTQ+V+S     + +EE+LDGS++L+D+CS+++D+ +ETQ+HV  + S VRR+K   G+  +    A Y  +RK M+KEAK+L+ S+K ++   +  +
Subjt:  MGSTQKVLSCPQHKQLVEELLDGSMKLLDVCSLAKDMTLETQQHVGALHSAVRRRK---GDSAIKTTTAAYTCYRKRMKKEAKKLITSMKKMNEKFNLIT

Query:  SMEN--LDHHLSSVIGALRQACSTNSSIFESVLLYLTPLTKSKARGWSLVSKWVHKGAIACESNSGLNEFENVDVALSSVVEEMEVEKSQIAQKRLETLE
        S+ N   + HL  V+ A+RQ  S + ++  S L +L+   +S  +   L S    K     E     NE EN+D+ +     ++        QK+LE +E
Subjt:  SMEN--LDHHLSSVIGALRQACSTNSSIFESVLLYLTPLTKSKARGWSLVSKWVHKGAIACESNSGLNEFENVDVALSSVVEEMEVEKSQIAQKRLETLE

Query:  MAAQEIESGLDGLFRRLIKTRASLLNIIS
        M+    E  L+GLFRRLI+TRASLLNIIS
Subjt:  MAAQEIESGLDGLFRRLIKTRASLLNIIS

AT4G35710.1 Arabidopsis protein of unknown function (DUF241); e value 5.6e-23; identity 35.78%
Query:  MGSTQKVLSCPQHKQLVEELLDGSMKLLDVCSLAKDMTLETQQHVGALHSAVRRRK-----GDSAIKTTTAAYTCYRKRMKKEAKKLITSMKKMNEKFNL
        MGS Q+V+S     + +EE+LDGS++L+D+CS+++D+ +ET +HV  + S VRR+K     G   I    + Y  +RK M+KEAKKL+ S+KK++     
Subjt:  MGSTQKVLSCPQHKQLVEELLDGSMKLLDVCSLAKDMTLETQQHVGALHSAVRRRK-----GDSAIKTTTAAYTCYRKRMKKEAKKLITSMKKMNEKFNL

Query:  ITSMENLDHHLSSVIGALRQACSTNSSIFESVLLYLT---PLTKSKARGWSLVSKWVHKGAIACESNSGLNEFENVDVALSSVVEEMEVEKSQIAQKRLE
          + ++ D  L +VI  +R+  S +  + +S L  L+      KSK      + K  H  A         N  E +D A+       +       Q  LE
Subjt:  ITSMENLDHHLSSVIGALRQACSTNSSIFESVLLYLT---PLTKSKARGWSLVSKWVHKGAIACESNSGLNEFENVDVALSSVVEEMEVEKSQIAQKRLE

Query:  TLEMAAQEIESGLDGLFRRLIKTRASLLNIIS
         +EM     E  L+GLFRRLI+TRAS+LNIIS
Subjt:  TLEMAAQEIESGLDGLFRRLIKTRASLLNIIS


Sequences
CDS sequence
ATGGGGTCTACCCAGAAGGTTTTGTCTTGTCCCCAACATAAACAGTTGGTGGAGGAGTTGTTGGATGGTTCTATGAAGCTTTTGGATGTTTGCAGCTTAGCAAAGGACAT
GACATTGGAAACCCAACAACATGTTGGGGCTCTTCATTCTGCCGTTCGTCGGAGGAAAGGCGATTCCGCTATTAAAACTACCACCGCTGCTTACACTTGTTACAGAAAAA
GGATGAAGAAAGAAGCTAAGAAGTTAATAACATCAATGAAGAAGATGAATGAGAAATTCAATTTAATAACCTCAATGGAAAATCTGGATCATCACCTGAGCTCTGTGATT
GGTGCGTTAAGACAAGCTTGCTCAACCAACAGCTCTATTTTCGAATCCGTGTTGTTGTACTTGACGCCATTGACGAAGTCAAAAGCTCGAGGATGGTCTCTTGTTTCCAA
GTGGGTGCACAAGGGGGCGATTGCCTGCGAATCAAACAGTGGCTTGAACGAATTTGAGAATGTGGATGTGGCTTTGAGCTCTGTTGTTGAAGAAATGGAGGTTGAGAAGT
CGCAGATTGCTCAAAAAAGATTGGAGACTTTGGAAATGGCAGCACAAGAAATTGAGAGTGGCTTGGATGGTTTGTTCAGGAGATTGATCAAAACAAGAGCTTCTCTTTTG
AACATAATCTCTCAATAG
mRNA sequence
ATGGGGTCTACCCAGAAGGTTTTGTCTTGTCCCCAACATAAACAGTTGGTGGAGGAGTTGTTGGATGGTTCTATGAAGCTTTTGGATGTTTGCAGCTTAGCAAAGGACAT
GACATTGGAAACCCAACAACATGTTGGGGCTCTTCATTCTGCCGTTCGTCGGAGGAAAGGCGATTCCGCTATTAAAACTACCACCGCTGCTTACACTTGTTACAGAAAAA
GGATGAAGAAAGAAGCTAAGAAGTTAATAACATCAATGAAGAAGATGAATGAGAAATTCAATTTAATAACCTCAATGGAAAATCTGGATCATCACCTGAGCTCTGTGATT
GGTGCGTTAAGACAAGCTTGCTCAACCAACAGCTCTATTTTCGAATCCGTGTTGTTGTACTTGACGCCATTGACGAAGTCAAAAGCTCGAGGATGGTCTCTTGTTTCCAA
GTGGGTGCACAAGGGGGCGATTGCCTGCGAATCAAACAGTGGCTTGAACGAATTTGAGAATGTGGATGTGGCTTTGAGCTCTGTTGTTGAAGAAATGGAGGTTGAGAAGT
CGCAGATTGCTCAAAAAAGATTGGAGACTTTGGAAATGGCAGCACAAGAAATTGAGAGTGGCTTGGATGGTTTGTTCAGGAGATTGATCAAAACAAGAGCTTCTCTTTTG
AACATAATCTCTCAATAG
Protein sequence
MGSTQKVLSCPQHKQLVEELLDGSMKLLDVCSLAKDMTLETQQHVGALHSAVRRRKGDSAIKTTTAAYTCYRKRMKKEAKKLITSMKKMNEKFNLITSMENLDHHLSSVI
GALRQACSTNSSIFESVLLYLTPLTKSKARGWSLVSKWVHKGAIACESNSGLNEFENVDVALSSVVEEMEVEKSQIAQKRLETLEMAAQEIESGLDGLFRRLIKTRASLL
NIISQ
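
The sequences above can be cross-checked against each other. The following is a minimal Python sketch, not part of the CuGenDB record, that translates a CDS with the standard genetic code and computes the length of a genomic interval. The function names `translate` and `locus_length` are hypothetical, and the use of the standard nuclear code (NCBI translation table 1) is an assumption. Applied to the listed location Chr03:4181700..4182377, the interval length is 678 bp, which equals 3 × (225 + 1): 225 residues plus a stop codon. That would be consistent with the CDS and mRNA sequences being identical, although the record itself does not state the gene structure.

```python
# Illustrative cross-check of a CDS against its protein translation.
# Assumption: standard nuclear genetic code (NCBI translation table 1).

BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Build the 64-entry codon table from the compact TCAG-ordered string.
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # tolerate line breaks in pasted sequence
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def locus_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive genomic interval."""
    return end - start + 1


if __name__ == "__main__":
    # First 21 nt and last 18 nt of the CDS sequence listed in the record.
    print(translate("ATGGGGTCTACCCAGAAGGTT"))
    print(translate("AACATAATCTCTCAATAG"))
    print(locus_length(4181700, 4182377))
```

Pasting the full CDS sequence into `translate` should reproduce the protein sequence listed above if the standard-code assumption holds.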