CuGenDBv2

Moc09g38650 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc09g38650
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Adenine nucleotide alpha hydrolases-like superfamily protein
Genome location: chr9:29712907..29714051
RNA-Seq Expression: Moc09g38650
Synteny: Moc09g38650
Gene Ontology terms: NA
InterPro domains: IPR006016 - UspA
                  IPR014729 - Rossmann-like alpha/beta/alpha sandwich fold


Homology
GenBank top hits (e value, % identity, alignment)
KAG6601483.1 hypothetical protein SDJN03_06716, partial [Cucurbita argyrosperma subsp. sororia]; e value: 3.0e-84; % identity: 76.29
Query:  MGKTGAKLPSFCLNRIRPHVRVPIQSKPDSAG--TAEKKGGGSNSGDGNVGA-----NGRKIVIVVDSSFEAEGALQWALSHTVQNQDKILLLHVIKPSK
        MGKTG KLPSFCLNRIRPHVRVPIQSK DS    T  KK      G G VGA     NGRKI+IVVDS+ EAEGALQWALSHTVQNQDKI+LLHVIKPS+
Subjt:  MGKTGAKLPSFCLNRIRPHVRVPIQSKPDSAG--TAEKKGGGSNSGDGNVGA-----NGRKIVIVVDSSFEAEGALQWALSHTVQNQDKILLLHVIKPSK

Query:  KAGELG--TNKRAFEVVHSFRDLCQLKRPEVEIEVAVVE-GKEKGPVIVEEARKQGAALLVLGQRKRSTTWRLVMVWAGHRL-------GGGVVEYCIQN
        K   L   T+ RA+E+VHS R+LCQLKRPEVE+EVAVVE GKEKG VIVEEARKQ A+LLVLGQ+KRSTTWRL+MVWAGHR        GGG VEYCIQN
Subjt:  KAGELG--TNKRAFEVVHSFRDLCQLKRPEVEIEVAVVE-GKEKGPVIVEEARKQGAALLVLGQRKRSTTWRLVMVWAGHRL-------GGGVVEYCIQN

Query:  ASCMAIAVRRKSKKVGGYLITTKRQKDFWLLA
        ASCMAIAVRRKSKK+GGYLITTKRQKDFWLLA
Subjt:  ASCMAIAVRRKSKKVGGYLITTKRQKDFWLLA

KAG7032262.1 hypothetical protein SDJN02_06306 [Cucurbita argyrosperma subsp. argyrosperma]; e value: 3.0e-84; % identity: 76.29
Query:  MGKTGAKLPSFCLNRIRPHVRVPIQSKPDSAG--TAEKKGGGSNSGDGNVGA-----NGRKIVIVVDSSFEAEGALQWALSHTVQNQDKILLLHVIKPSK
        MGKTG KLPSFCLNRIRPHVRVPIQSK DS    T  KK      G G VGA     NGRKI+IVVDS+ EAEGALQWALSHTVQNQDKI+LLHVIKPS+
Subjt:  MGKTGAKLPSFCLNRIRPHVRVPIQSKPDSAG--TAEKKGGGSNSGDGNVGA-----NGRKIVIVVDSSFEAEGALQWALSHTVQNQDKILLLHVIKPSK

Query:  KAGELG--TNKRAFEVVHSFRDLCQLKRPEVEIEVAVVE-GKEKGPVIVEEARKQGAALLVLGQRKRSTTWRLVMVWAGHRL-------GGGVVEYCIQN
        K   L   T+ RA+E+VHS R+LCQLKRPEVE+EVAVVE GKEKG VIVEEARKQ A+LLVLGQ+KRSTTWRL+MVWAGHR        GGG VEYCIQN
Subjt:  KAGELG--TNKRAFEVVHSFRDLCQLKRPEVEIEVAVVE-GKEKGPVIVEEARKQGAALLVLGQRKRSTTWRLVMVWAGHRL-------GGGVVEYCIQN

Query:  ASCMAIAVRRKSKKVGGYLITTKRQKDFWLLA
        ASCMAIAVRRKSKK+GGYLITTKRQKDFWLLA
Subjt:  ASCMAIAVRRKSKKVGGYLITTKRQKDFWLLA

XP_022148999.1 universal stress protein PHOS32-like isoform X1 [Momordica charantia]; e value: 1.6e-114; % identity: 99.07
Query:  MGKTGAKLPSFCLNRIRPHVRVPIQSKPDSAGTAEKKGGGSNSGDGNVGANGRKIVIVVDSSFEAEGALQWALSHTVQNQDKILLLHVIKPSKK-AGELG
        MGKTGAKLPSFCLNRIRPHVRVPIQSKPDSAGTAEKKGGGSNSGDGNVGANGRKIVIVVDSSFEAEGALQWALSHTVQNQDKILLLHVIKPSKK AGELG
Subjt:  MGKTGAKLPSFCLNRIRPHVRVPIQSKPDSAGTAEKKGGGSNSGDGNVGANGRKIVIVVDSSFEAEGALQWALSHTVQNQDKILLLHVIKPSKK-AGELG

Query:  TNKRAFEVVHSFRDLCQLKRPEVEIEVAVVEGKEKGPVIVEEARKQGAALLVLGQRKRSTTWRLVMVWAGHRLGGGVVEYCIQNASCMAIAVRRKSKKVG
        TNKRAFEVVHSFRDLCQLKRPEVEIEVAVVEGKEKGPVIVEEARKQGAALLVLGQRKRSTTWRL MVWAGHRLGGGVVEYCIQNASCMAIAVRRKSKKVG
Subjt:  TNKRAFEVVHSFRDLCQLKRPEVEIEVAVVEGKEKGPVIVEEARKQGAALLVLGQRKRSTTWRLVMVWAGHRLGGGVVEYCIQNASCMAIAVRRKSKKVG

Query:  GYLITTKRQKDFWLLA
        GYLITTKRQKDFWLLA
Subjt:  GYLITTKRQKDFWLLA

XP_022149000.1 uncharacterized protein LOC111017526 isoform X2 [Momordica charantia]; e value: 6.6e-116; % identity: 99.53
Query:  MGKTGAKLPSFCLNRIRPHVRVPIQSKPDSAGTAEKKGGGSNSGDGNVGANGRKIVIVVDSSFEAEGALQWALSHTVQNQDKILLLHVIKPSKKAGELGT
        MGKTGAKLPSFCLNRIRPHVRVPIQSKPDSAGTAEKKGGGSNSGDGNVGANGRKIVIVVDSSFEAEGALQWALSHTVQNQDKILLLHVIKPSKKAGELGT
Subjt:  MGKTGAKLPSFCLNRIRPHVRVPIQSKPDSAGTAEKKGGGSNSGDGNVGANGRKIVIVVDSSFEAEGALQWALSHTVQNQDKILLLHVIKPSKKAGELGT

Query:  NKRAFEVVHSFRDLCQLKRPEVEIEVAVVEGKEKGPVIVEEARKQGAALLVLGQRKRSTTWRLVMVWAGHRLGGGVVEYCIQNASCMAIAVRRKSKKVGG
        NKRAFEVVHSFRDLCQLKRPEVEIEVAVVEGKEKGPVIVEEARKQGAALLVLGQRKRSTTWRL MVWAGHRLGGGVVEYCIQNASCMAIAVRRKSKKVGG
Subjt:  NKRAFEVVHSFRDLCQLKRPEVEIEVAVVEGKEKGPVIVEEARKQGAALLVLGQRKRSTTWRLVMVWAGHRLGGGVVEYCIQNASCMAIAVRRKSKKVGG

Query:  YLITTKRQKDFWLLA
        YLITTKRQKDFWLLA
Subjt:  YLITTKRQKDFWLLA

XP_022956495.1 uncharacterized protein LOC111458216 [Cucurbita moschata]; e value: 1.3e-84; % identity: 76.39
Query:  MGKTGAKLPSFCLNRIRPHVRVPIQSKPDSAG--TAEKKGGGSNSGDGNVGA-----NGRKIVIVVDSSFEAEGALQWALSHTVQNQDKILLLHVIKPSK
        MGKTG KLPSFCLNRIRPHVRVPIQSK DS    T  KK      G G VGA     NGRKI+IVVDS+ EAEGALQWALSHTVQNQDKI+LLHVIKPS+
Subjt:  MGKTGAKLPSFCLNRIRPHVRVPIQSKPDSAG--TAEKKGGGSNSGDGNVGA-----NGRKIVIVVDSSFEAEGALQWALSHTVQNQDKILLLHVIKPSK

Query:  KAGELG--TNKRAFEVVHSFRDLCQLKRPEVEIEVAVVE-GKEKGPVIVEEARKQGAALLVLGQRKRSTTWRLVMVWAGHRL--------GGGVVEYCIQ
        K   L   T+ RA+E+VHS R+LCQLKRPEVE+EVAVVE GKEKG VIVEEARKQ A+LLVLGQ+KRSTTWRL+MVWAGHR         GGGVVEYCIQ
Subjt:  KAGELG--TNKRAFEVVHSFRDLCQLKRPEVEIEVAVVE-GKEKGPVIVEEARKQGAALLVLGQRKRSTTWRLVMVWAGHRL--------GGGVVEYCIQ

Query:  NASCMAIAVRRKSKKVGGYLITTKRQKDFWLLA
        NASCMAIAVRRKSKK+GGYLITTKRQKDFWLLA
Subjt:  NASCMAIAVRRKSKKVGGYLITTKRQKDFWLLA

TrEMBL top hits (e value, % identity, alignment)
A0A6J1D4I6 uncharacterized protein LOC111017526 isoform X2; e value: 3.2e-116; % identity: 99.53
Query:  MGKTGAKLPSFCLNRIRPHVRVPIQSKPDSAGTAEKKGGGSNSGDGNVGANGRKIVIVVDSSFEAEGALQWALSHTVQNQDKILLLHVIKPSKKAGELGT
        MGKTGAKLPSFCLNRIRPHVRVPIQSKPDSAGTAEKKGGGSNSGDGNVGANGRKIVIVVDSSFEAEGALQWALSHTVQNQDKILLLHVIKPSKKAGELGT
Subjt:  MGKTGAKLPSFCLNRIRPHVRVPIQSKPDSAGTAEKKGGGSNSGDGNVGANGRKIVIVVDSSFEAEGALQWALSHTVQNQDKILLLHVIKPSKKAGELGT

Query:  NKRAFEVVHSFRDLCQLKRPEVEIEVAVVEGKEKGPVIVEEARKQGAALLVLGQRKRSTTWRLVMVWAGHRLGGGVVEYCIQNASCMAIAVRRKSKKVGG
        NKRAFEVVHSFRDLCQLKRPEVEIEVAVVEGKEKGPVIVEEARKQGAALLVLGQRKRSTTWRL MVWAGHRLGGGVVEYCIQNASCMAIAVRRKSKKVGG
Subjt:  NKRAFEVVHSFRDLCQLKRPEVEIEVAVVEGKEKGPVIVEEARKQGAALLVLGQRKRSTTWRLVMVWAGHRLGGGVVEYCIQNASCMAIAVRRKSKKVGG

Query:  YLITTKRQKDFWLLA
        YLITTKRQKDFWLLA
Subjt:  YLITTKRQKDFWLLA

A0A6J1D5Q4 universal stress protein PHOS32-like isoform X1; e value: 7.8e-115; % identity: 99.07
Query:  MGKTGAKLPSFCLNRIRPHVRVPIQSKPDSAGTAEKKGGGSNSGDGNVGANGRKIVIVVDSSFEAEGALQWALSHTVQNQDKILLLHVIKPSKK-AGELG
        MGKTGAKLPSFCLNRIRPHVRVPIQSKPDSAGTAEKKGGGSNSGDGNVGANGRKIVIVVDSSFEAEGALQWALSHTVQNQDKILLLHVIKPSKK AGELG
Subjt:  MGKTGAKLPSFCLNRIRPHVRVPIQSKPDSAGTAEKKGGGSNSGDGNVGANGRKIVIVVDSSFEAEGALQWALSHTVQNQDKILLLHVIKPSKK-AGELG

Query:  TNKRAFEVVHSFRDLCQLKRPEVEIEVAVVEGKEKGPVIVEEARKQGAALLVLGQRKRSTTWRLVMVWAGHRLGGGVVEYCIQNASCMAIAVRRKSKKVG
        TNKRAFEVVHSFRDLCQLKRPEVEIEVAVVEGKEKGPVIVEEARKQGAALLVLGQRKRSTTWRL MVWAGHRLGGGVVEYCIQNASCMAIAVRRKSKKVG
Subjt:  TNKRAFEVVHSFRDLCQLKRPEVEIEVAVVEGKEKGPVIVEEARKQGAALLVLGQRKRSTTWRLVMVWAGHRLGGGVVEYCIQNASCMAIAVRRKSKKVG

Query:  GYLITTKRQKDFWLLA
        GYLITTKRQKDFWLLA
Subjt:  GYLITTKRQKDFWLLA

A0A6J1G0H0 uncharacterized protein LOC111449594; e value: 2.5e-84; % identity: 76.21
Query:  MGKTGAKLPSFCLNRIRPHVRVPIQSKPDSAG--TAEKKGG---GSNSGDGNVGANGRKIVIVVDSSFEAEGALQWALSHTVQNQDKILLLHVIKPSKKA
        MGKTG KLPSFCLNRIR HVRVPIQSKPDS    T E+KGG    SN G       GRKI+IVVDS+ EAEGALQWALS+TVQNQD I+LLH+  PSKK 
Subjt:  MGKTGAKLPSFCLNRIRPHVRVPIQSKPDSAG--TAEKKGG---GSNSGDGNVGANGRKIVIVVDSSFEAEGALQWALSHTVQNQDKILLLHVIKPSKKA

Query:  -GELGTNKRAFEVVHSFRDLCQLKRPEVEIEVAVVEGKEKGPVIVEEARKQGAALLVLGQ-RKRSTTWRLVMVWAGHRL-----GGGVVEYCIQNASCMA
         G   T  R +E+VHS R+LCQLKRPEVEIEVAVVEGKEKGPVIVEEARKQGA+LLVLGQ +KRSTTWRL+M+WAGHR      GGG VEYCIQNASCMA
Subjt:  -GELGTNKRAFEVVHSFRDLCQLKRPEVEIEVAVVEGKEKGPVIVEEARKQGAALLVLGQ-RKRSTTWRLVMVWAGHRL-----GGGVVEYCIQNASCMA

Query:  IAVRRKSKKVGGYLITTKRQKDFWLLA
        IAVRRKSKK+GGYLITTKRQKDFWLLA
Subjt:  IAVRRKSKKVGGYLITTKRQKDFWLLA

A0A6J1GXX9 uncharacterized protein LOC111458216; e value: 6.5e-85; % identity: 76.39
Query:  MGKTGAKLPSFCLNRIRPHVRVPIQSKPDSAG--TAEKKGGGSNSGDGNVGA-----NGRKIVIVVDSSFEAEGALQWALSHTVQNQDKILLLHVIKPSK
        MGKTG KLPSFCLNRIRPHVRVPIQSK DS    T  KK      G G VGA     NGRKI+IVVDS+ EAEGALQWALSHTVQNQDKI+LLHVIKPS+
Subjt:  MGKTGAKLPSFCLNRIRPHVRVPIQSKPDSAG--TAEKKGGGSNSGDGNVGA-----NGRKIVIVVDSSFEAEGALQWALSHTVQNQDKILLLHVIKPSK

Query:  KAGELG--TNKRAFEVVHSFRDLCQLKRPEVEIEVAVVE-GKEKGPVIVEEARKQGAALLVLGQRKRSTTWRLVMVWAGHRL--------GGGVVEYCIQ
        K   L   T+ RA+E+VHS R+LCQLKRPEVE+EVAVVE GKEKG VIVEEARKQ A+LLVLGQ+KRSTTWRL+MVWAGHR         GGGVVEYCIQ
Subjt:  KAGELG--TNKRAFEVVHSFRDLCQLKRPEVEIEVAVVE-GKEKGPVIVEEARKQGAALLVLGQRKRSTTWRLVMVWAGHRL--------GGGVVEYCIQ

Query:  NASCMAIAVRRKSKKVGGYLITTKRQKDFWLLA
        NASCMAIAVRRKSKK+GGYLITTKRQKDFWLLA
Subjt:  NASCMAIAVRRKSKKVGGYLITTKRQKDFWLLA

A0A6J1IT49 uncharacterized protein LOC111478423; e value: 3.2e-84; % identity: 75.86
Query:  MGKTGAKLPSFCLNRIRPHVRVPIQSKPDSAG--TAEKKGGGSNSGDGNVGA-----NGRKIVIVVDSSFEAEGALQWALSHTVQNQDKILLLHVIKPSK
        MGKTG KLPSFCLNRIRPHVRVPIQSK DS    T  KK      G G VGA     NGRKI+IV+DS+ EAEGALQWALSHTVQNQDKI+LLHVIKPS+
Subjt:  MGKTGAKLPSFCLNRIRPHVRVPIQSKPDSAG--TAEKKGGGSNSGDGNVGA-----NGRKIVIVVDSSFEAEGALQWALSHTVQNQDKILLLHVIKPSK

Query:  KAGELG--TNKRAFEVVHSFRDLCQLKRPEVEIEVAVVE-GKEKGPVIVEEARKQGAALLVLGQRKRSTTWRLVMVWAGHRL-------GGGVVEYCIQN
        K   L   T+ RA+E+VHS R+LCQLKRPEVE+EVAVVE GKEKG VIVEEARKQ A+LLVLGQ+KRSTTWRL+MVWAGHR        GGG VEYCIQN
Subjt:  KAGELG--TNKRAFEVVHSFRDLCQLKRPEVEIEVAVVE-GKEKGPVIVEEARKQGAALLVLGQRKRSTTWRLVMVWAGHRL-------GGGVVEYCIQN

Query:  ASCMAIAVRRKSKKVGGYLITTKRQKDFWLLA
        ASCMAIAVRRKSKK+GGYLITTKRQKDFWLLA
Subjt:  ASCMAIAVRRKSKKVGGYLITTKRQKDFWLLA

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G69080.1 Adenine nucleotide alpha hydrolases-like superfamily protein; e value: 2.4e-47; % identity: 46.55
Query:  MGKTGAKLPSFCLNRIRPHVRVPIQSKPDSAGTAEKKGGGSNSGDGNVGANGRKIVIVVDSSFEAEGALQWALSHTVQNQDKILLLHVIK-PSKKAGELG
        MGK G     F ++R+R +VRV    +P +  T  + G        ++   GR+I++VVDS  EA+ AL W LSH  Q QD ILLLH +K  + ++G+L 
Subjt:  MGKTGAKLPSFCLNRIRPHVRVPIQSKPDSAGTAEKKGGGSNSGDGNVGANGRKIVIVVDSSFEAEGALQWALSHTVQNQDKILLLHVIK-PSKKAGELG

Query:  -------------TNKRAFEVVHSFRDLCQLKRPEVEIEVAVVEGKEKGPVIVEEARKQGAALLVLGQRKRSTTWRLVMVWAGHR---LGGGVVEYCIQN
                     T  RA + V + + +C+LKRPEV+ EV  V+G EKGP IV+EAR++ A+LLVLGQ+K+  TWRL+MVWA           VEYCI N
Subjt:  -------------TNKRAFEVVHSFRDLCQLKRPEVEIEVAVVEGKEKGPVIVEEARKQGAALLVLGQRKRSTTWRLVMVWAGHR---LGGGVVEYCIQN

Query:  ASCMAIAVRRKSKKVGGYLITTKRQKDFWLLA
        + CMAIAVR++ KK+GGY +TTKR KDFWLLA
Subjt:  ASCMAIAVRRKSKKVGGYLITTKRQKDFWLLA

AT1G69080.2 Adenine nucleotide alpha hydrolases-like superfamily protein; e value: 5.1e-42; % identity: 46.12
Query:  MGKTGAKLPSFCLNRIRPHVRVPIQSKPDSAGTAEKKGGGSNSGDGNVGANGRKIVIVVDSSFEAEGALQWALSHTVQNQDKILLLHVIK-PSKKAGELG
        MGK G     F ++R+R +VRV    +P +  T  + G        ++   GR+I++VVDS  EA+ AL W LSH  Q QD ILLLH +K  + ++G+L 
Subjt:  MGKTGAKLPSFCLNRIRPHVRVPIQSKPDSAGTAEKKGGGSNSGDGNVGANGRKIVIVVDSSFEAEGALQWALSHTVQNQDKILLLHVIK-PSKKAGELG

Query:  TNKRAFEVVHSFRDLCQLKRPEVEIEVAVVEGKEKGPVIVEEARKQGAALLVLGQRKRSTTWRLVMVWAGHR---LGGGVVEYCIQNASCMAIAVRRKSK
         NK   E     +        +V+ EV  V+G EKGP IV+EAR++ A+LLVLGQ+K+  TWRL+MVWA           VEYCI N+ CMAIAVR++ K
Subjt:  TNKRAFEVVHSFRDLCQLKRPEVEIEVAVVEGKEKGPVIVEEARKQGAALLVLGQRKRSTTWRLVMVWAGHR---LGGGVVEYCIQNASCMAIAVRRKSK

Query:  KVGGYLITTKRQKDFWLLA
        K+GGY +TTKR KDFWLLA
Subjt:  KVGGYLITTKRQKDFWLLA

AT2G03720.1 Adenine nucleotide alpha hydrolases-like superfamily protein; e value: 6.5e-45; % identity: 56.02
Query:  VIVVDSSFEAEGALQWALSHTVQNQDKILLLHVIK-PSKKA---GELGTNKRAFEVVHSFRDLCQLKRPEVEIEVAVVE-GKEKGPVIVEEARKQGAALL
        ++VVD++ + + ALQWAL+H VQ++D I LLHV + P  +A    +   N RA E+VH  ++ CQLK+P V+ E+ VVE  +EKG  IVEE++KQGA +L
Subjt:  VIVVDSSFEAEGALQWALSHTVQNQDKILLLHVIK-PSKKA---GELGTNKRAFEVVHSFRDLCQLKRPEVEIEVAVVE-GKEKGPVIVEEARKQGAALL

Query:  VLGQRKRSTTWRLVMVW-AGHRLGGGVVEYCIQNASCMAIAVRRKSKKVGGYLITTKRQKDFWLLA
        VLGQRKR++ WR++  W     +GGGVVEYCI N+ CMAIAVR+KS   GGYLITTKR KDFWLLA
Subjt:  VLGQRKRSTTWRLVMVW-AGHRLGGGVVEYCIQNASCMAIAVRRKSKKVGGYLITTKRQKDFWLLA

AT3G03290.1 Adenine nucleotide alpha hydrolases-like superfamily protein; e value: 3.7e-40; % identity: 46.47
Query:  GRKIVIVVDSSFEAEGALQWALSHTVQNQDKILLLHVIKPSKKAGELG--TNKRAFEVVHSFRDLCQLKRPEVEIEVAVVEG--KEKGPVIVEEARKQGA
        G ++++VVD    + GAL+WAL HT+Q+QD + LL+  KP +K       +  +  E+VH+ + LCQ KRP +E+E+  ++G  KEKG  IVEEA++Q  
Subjt:  GRKIVIVVDSSFEAEGALQWALSHTVQNQDKILLLHVIKPSKKAGELG--TNKRAFEVVHSFRDLCQLKRPEVEIEVAVVEG--KEKGPVIVEEARKQGA

Query:  ALLVLGQRKRSTTWRLVMVWAGHRLGG--GVVEYCIQNASCMAIAVRRKSKKVGGYLITTKRQKDFWLLA
        +LLV+G+ K+   WRL+  W   +  G  G ++YC++ ASCM IAV+ K++K+GGYLITTKR K+FWLLA
Subjt:  ALLVLGQRKRSTTWRLVMVWAGHRLGG--GVVEYCIQNASCMAIAVRRKSKKVGGYLITTKRQKDFWLLA

AT5G17390.1 Adenine nucleotide alpha hydrolases-like superfamily protein; e value: 3.5e-43; % identity: 48.82
Query:  GRKIVIVVDSSFEAEGALQWALSHTVQNQDKILLLHVIKPSKKAGELGTNK--RAFEVVHSFRDLCQLKRPEVEIEVAVVEG--KEKGPVIVEEARKQGA
        G ++++VVD +  + GAL+WA++HT+Q QD + LL+  KP +K+      +  +  E+VH+ + LCQ KRP +E+E+  +EG  K+KG  IVEE++KQ  
Subjt:  GRKIVIVVDSSFEAEGALQWALSHTVQNQDKILLLHVIKPSKKAGELGTNK--RAFEVVHSFRDLCQLKRPEVEIEVAVVEG--KEKGPVIVEEARKQGA

Query:  ALLVLGQRKRSTTWRLVMVWAGHRLGG--GVVEYCIQNASCMAIAVRRKSKKVGGYLITTKRQKDFWLLA
        +LLV+GQ K+   WRL+  WA  R  G  GV++YC++NASCM IAV+ K++K+GGYLITTKR K+FWLLA
Subjt:  ALLVLGQRKRSTTWRLVMVWAGHRLGG--GVVEYCIQNASCMAIAVRRKSKKVGGYLITTKRQKDFWLLA


Sequences
CDS sequence
ATGGGGAAAACCGGCGCCAAGTTGCCAAGTTTCTGCCTGAACCGGATCCGGCCGCACGTTCGGGTGCCCATACAGTCCAAACCGGACTCTGCCGGCACCGCCGAGAAGAA
GGGCGGCGGTTCAAATTCCGGCGACGGAAATGTTGGAGCAAATGGAAGGAAGATTGTGATTGTGGTTGATTCGAGCTTTGAAGCAGAGGGGGCTCTTCAATGGGCGCTCT
CCCACACCGTACAGAACCAGGACAAGATTCTTCTTCTCCATGTCATAAAGCCTTCTAAAAAAGCAGGGGAATTAGGAACCAACAAGAGAGCGTTTGAAGTGGTTCACTCT
TTCAGAGACTTGTGCCAGTTGAAGAGACCAGAGGTGGAAATTGAAGTGGCGGTGGTGGAAGGGAAAGAGAAGGGGCCGGTGATAGTGGAGGAAGCGAGAAAGCAGGGGGC
GGCGCTGCTGGTGCTCGGGCAGAGGAAGCGGTCGACGACGTGGCGGCTTGTGATGGTGTGGGCGGGGCACCGGCTCGGCGGCGGGGTGGTGGAGTATTGCATCCAGAATG
CGAGCTGTATGGCGATTGCGGTGCGGCGGAAGAGCAAGAAAGTGGGTGGATATTTGATCACCACCAAGCGTCAGAAGGATTTCTGGCTTTTGGCCTAA
mRNA sequence
ATGGGGAAAACCGGCGCCAAGTTGCCAAGTTTCTGCCTGAACCGGATCCGGCCGCACGTTCGGGTGCCCATACAGTCCAAACCGGACTCTGCCGGCACCGCCGAGAAGAA
GGGCGGCGGTTCAAATTCCGGCGACGGAAATGTTGGAGCAAATGGAAGGAAGATTGTGATTGTGGTTGATTCGAGCTTTGAAGCAGAGGGGGCTCTTCAATGGGCGCTCT
CCCACACCGTACAGAACCAGGACAAGATTCTTCTTCTCCATGTCATAAAGCCTTCTAAAAAAGCAGGGGAATTAGGAACCAACAAGAGAGCGTTTGAAGTGGTTCACTCT
TTCAGAGACTTGTGCCAGTTGAAGAGACCAGAGGTGGAAATTGAAGTGGCGGTGGTGGAAGGGAAAGAGAAGGGGCCGGTGATAGTGGAGGAAGCGAGAAAGCAGGGGGC
GGCGCTGCTGGTGCTCGGGCAGAGGAAGCGGTCGACGACGTGGCGGCTTGTGATGGTGTGGGCGGGGCACCGGCTCGGCGGCGGGGTGGTGGAGTATTGCATCCAGAATG
CGAGCTGTATGGCGATTGCGGTGCGGCGGAAGAGCAAGAAAGTGGGTGGATATTTGATCACCACCAAGCGTCAGAAGGATTTCTGGCTTTTGGCCTAA
Protein sequence
MGKTGAKLPSFCLNRIRPHVRVPIQSKPDSAGTAEKKGGGSNSGDGNVGANGRKIVIVVDSSFEAEGALQWALSHTVQNQDKILLLHVIKPSKKAGELGTNKRAFEVVHS
FRDLCQLKRPEVEIEVAVVEGKEKGPVIVEEARKQGAALLVLGQRKRSTTWRLVMVWAGHRLGGGVVEYCIQNASCMAIAVRRKSKKVGGYLITTKRQKDFWLLA