; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC09g1605 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC09g1605
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionAdenine nucleotide alpha hydrolases-like superfamily protein
Genome locationMC09:21717092..21719789
RNA-Seq ExpressionMC09g1605
SyntenyMC09g1605
Gene Ontology termsNA
InterPro domainsIPR006016 - UspA
IPR014729 - Rossmann-like alpha/beta/alpha sandwich fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6574035.1 hypothetical protein SDJN03_27922, partial [Cucurbita argyrosperma subsp. sororia]6.39e-11076.86Show/hide
Query:  MGKTGAKLPSFCLNRIRPHVRVPIQSKPDSAG--TAEKKGGG---SNSGDGNVGANGRKIVIVVDSSFEAEGALQWALSHTVQNQDKILLLHVIKPSKKG
        MGKTG KLPSFCLNRIR HVRVPIQSKPDS    T E+KGG    SN G       GRKI+IVVDS+ EAEGALQWALS+TVQNQD I+LLH+  PSKKG
Subjt:  MGKTGAKLPSFCLNRIRPHVRVPIQSKPDSAG--TAEKKGGG---SNSGDGNVGANGRKIVIVVDSSFEAEGALQWALSHTVQNQDKILLLHVIKPSKKG

Query:  ELGTNK----RAFEVVHSFRDLCQLKRPEVEIEVAVVEGKEKGPVIVEEARKQGAALLVLGQRK-RSTTWRLLMVWAGHRL-----GGGVVEYCIQNASC
        E G+ +    RA+E+VHS R+LCQLKRPEVEIEVAVVEGKEKGPVIVEEARKQGA+LLVLGQ+K RSTTWRLLM+WAGHR      GGG VEYCIQNASC
Subjt:  ELGTNK----RAFEVVHSFRDLCQLKRPEVEIEVAVVEGKEKGPVIVEEARKQGAALLVLGQRK-RSTTWRLLMVWAGHRL-----GGGVVEYCIQNASC

Query:  MAIAVRRKSKKVGGYLITTKRQKDFWLLA
        MAIAVRRKSKK+GGYLITTKRQKDFWLLA
Subjt:  MAIAVRRKSKKVGGYLITTKRQKDFWLLA

KAG6601483.1 hypothetical protein SDJN03_06716, partial [Cucurbita argyrosperma subsp. sororia]1.75e-11077.16Show/hide
Query:  MGKTGAKLPSFCLNRIRPHVRVPIQSKPDSAG--TAEKKGGGSNSGDGNVGA-----NGRKIVIVVDSSFEAEGALQWALSHTVQNQDKILLLHVIKPSK
        MGKTG KLPSFCLNRIRPHVRVPIQSK DS    T  KK      G G VGA     NGRKI+IVVDS+ EAEGALQWALSHTVQNQDKI+LLHVIKPS+
Subjt:  MGKTGAKLPSFCLNRIRPHVRVPIQSKPDSAG--TAEKKGGGSNSGDGNVGA-----NGRKIVIVVDSSFEAEGALQWALSHTVQNQDKILLLHVIKPSK

Query:  KGE---LGTNKRAFEVVHSFRDLCQLKRPEVEIEVAVVEG-KEKGPVIVEEARKQGAALLVLGQRKRSTTWRLLMVWAGHRLGGG-------VVEYCIQN
        KGE     T+ RA+E+VHS R+LCQLKRPEVE+EVAVVEG KEKG VIVEEARKQ A+LLVLGQ+KRSTTWRLLMVWAGHR GGG        VEYCIQN
Subjt:  KGE---LGTNKRAFEVVHSFRDLCQLKRPEVEIEVAVVEG-KEKGPVIVEEARKQGAALLVLGQRKRSTTWRLLMVWAGHRLGGG-------VVEYCIQN

Query:  ASCMAIAVRRKSKKVGGYLITTKRQKDFWLLA
        ASCMAIAVRRKSKK+GGYLITTKRQKDFWLLA
Subjt:  ASCMAIAVRRKSKKVGGYLITTKRQKDFWLLA

XP_022148999.1 universal stress protein PHOS32-like isoform X1 [Momordica charantia]2.37e-14898.61Show/hide
Query:  MGKTGAKLPSFCLNRIRPHVRVPIQSKPDSAGTAEKKGGGSNSGDGNVGANGRKIVIVVDSSFEAEGALQWALSHTVQNQDKILLLHVIKPSKK--GELG
        MGKTGAKLPSFCLNRIRPHVRVPIQSKPDSAGTAEKKGGGSNSGDGNVGANGRKIVIVVDSSFEAEGALQWALSHTVQNQDKILLLHVIKPSKK  GELG
Subjt:  MGKTGAKLPSFCLNRIRPHVRVPIQSKPDSAGTAEKKGGGSNSGDGNVGANGRKIVIVVDSSFEAEGALQWALSHTVQNQDKILLLHVIKPSKK--GELG

Query:  TNKRAFEVVHSFRDLCQLKRPEVEIEVAVVEGKEKGPVIVEEARKQGAALLVLGQRKRSTTWRLLMVWAGHRLGGGVVEYCIQNASCMAIAVRRKSKKVG
        TNKRAFEVVHSFRDLCQLKRPEVEIEVAVVEGKEKGPVIVEEARKQGAALLVLGQRKRSTTWRL MVWAGHRLGGGVVEYCIQNASCMAIAVRRKSKKVG
Subjt:  TNKRAFEVVHSFRDLCQLKRPEVEIEVAVVEGKEKGPVIVEEARKQGAALLVLGQRKRSTTWRLLMVWAGHRLGGGVVEYCIQNASCMAIAVRRKSKKVG

Query:  GYLITTKRQKDFWLLA
        GYLITTKRQKDFWLLA
Subjt:  GYLITTKRQKDFWLLA

XP_022149000.1 uncharacterized protein LOC111017526 isoform X2 [Momordica charantia]1.61e-14899.07Show/hide
Query:  MGKTGAKLPSFCLNRIRPHVRVPIQSKPDSAGTAEKKGGGSNSGDGNVGANGRKIVIVVDSSFEAEGALQWALSHTVQNQDKILLLHVIKPSKK-GELGT
        MGKTGAKLPSFCLNRIRPHVRVPIQSKPDSAGTAEKKGGGSNSGDGNVGANGRKIVIVVDSSFEAEGALQWALSHTVQNQDKILLLHVIKPSKK GELGT
Subjt:  MGKTGAKLPSFCLNRIRPHVRVPIQSKPDSAGTAEKKGGGSNSGDGNVGANGRKIVIVVDSSFEAEGALQWALSHTVQNQDKILLLHVIKPSKK-GELGT

Query:  NKRAFEVVHSFRDLCQLKRPEVEIEVAVVEGKEKGPVIVEEARKQGAALLVLGQRKRSTTWRLLMVWAGHRLGGGVVEYCIQNASCMAIAVRRKSKKVGG
        NKRAFEVVHSFRDLCQLKRPEVEIEVAVVEGKEKGPVIVEEARKQGAALLVLGQRKRSTTWRL MVWAGHRLGGGVVEYCIQNASCMAIAVRRKSKKVGG
Subjt:  NKRAFEVVHSFRDLCQLKRPEVEIEVAVVEGKEKGPVIVEEARKQGAALLVLGQRKRSTTWRLLMVWAGHRLGGGVVEYCIQNASCMAIAVRRKSKKVGG

Query:  YLITTKRQKDFWLLA
        YLITTKRQKDFWLLA
Subjt:  YLITTKRQKDFWLLA

XP_022945318.1 uncharacterized protein LOC111449594 [Cucurbita moschata]6.17e-11077.09Show/hide
Query:  MGKTGAKLPSFCLNRIRPHVRVPIQSKPDSAG--TAEKKGGG---SNSGDGNVGANGRKIVIVVDSSFEAEGALQWALSHTVQNQDKILLLHVIKPSKKG
        MGKTG KLPSFCLNRIR HVRVPIQSKPDS    T E+KGG    SN G       GRKI+IVVDS+ EAEGALQWALS+TVQNQD I+LLH+  PSKKG
Subjt:  MGKTGAKLPSFCLNRIRPHVRVPIQSKPDSAG--TAEKKGGG---SNSGDGNVGANGRKIVIVVDSSFEAEGALQWALSHTVQNQDKILLLHVIKPSKKG

Query:  ELG--TNKRAFEVVHSFRDLCQLKRPEVEIEVAVVEGKEKGPVIVEEARKQGAALLVLGQRK-RSTTWRLLMVWAGHRLGGG-----VVEYCIQNASCMA
        E    T  R +E+VHS R+LCQLKRPEVEIEVAVVEGKEKGPVIVEEARKQGA+LLVLGQ+K RSTTWRLLM+WAGHR GGG      VEYCIQNASCMA
Subjt:  ELG--TNKRAFEVVHSFRDLCQLKRPEVEIEVAVVEGKEKGPVIVEEARKQGAALLVLGQRK-RSTTWRLLMVWAGHRLGGG-----VVEYCIQNASCMA

Query:  IAVRRKSKKVGGYLITTKRQKDFWLLA
        IAVRRKSKK+GGYLITTKRQKDFWLLA
Subjt:  IAVRRKSKKVGGYLITTKRQKDFWLLA

TrEMBL top hitse value%identityAlignment
A0A6J1D4I6 uncharacterized protein LOC111017526 isoform X27.80e-14999.07Show/hide
Query:  MGKTGAKLPSFCLNRIRPHVRVPIQSKPDSAGTAEKKGGGSNSGDGNVGANGRKIVIVVDSSFEAEGALQWALSHTVQNQDKILLLHVIKPSKK-GELGT
        MGKTGAKLPSFCLNRIRPHVRVPIQSKPDSAGTAEKKGGGSNSGDGNVGANGRKIVIVVDSSFEAEGALQWALSHTVQNQDKILLLHVIKPSKK GELGT
Subjt:  MGKTGAKLPSFCLNRIRPHVRVPIQSKPDSAGTAEKKGGGSNSGDGNVGANGRKIVIVVDSSFEAEGALQWALSHTVQNQDKILLLHVIKPSKK-GELGT

Query:  NKRAFEVVHSFRDLCQLKRPEVEIEVAVVEGKEKGPVIVEEARKQGAALLVLGQRKRSTTWRLLMVWAGHRLGGGVVEYCIQNASCMAIAVRRKSKKVGG
        NKRAFEVVHSFRDLCQLKRPEVEIEVAVVEGKEKGPVIVEEARKQGAALLVLGQRKRSTTWRL MVWAGHRLGGGVVEYCIQNASCMAIAVRRKSKKVGG
Subjt:  NKRAFEVVHSFRDLCQLKRPEVEIEVAVVEGKEKGPVIVEEARKQGAALLVLGQRKRSTTWRLLMVWAGHRLGGGVVEYCIQNASCMAIAVRRKSKKVGG

Query:  YLITTKRQKDFWLLA
        YLITTKRQKDFWLLA
Subjt:  YLITTKRQKDFWLLA

A0A6J1D5Q4 universal stress protein PHOS32-like isoform X11.15e-14898.61Show/hide
Query:  MGKTGAKLPSFCLNRIRPHVRVPIQSKPDSAGTAEKKGGGSNSGDGNVGANGRKIVIVVDSSFEAEGALQWALSHTVQNQDKILLLHVIKPSKK--GELG
        MGKTGAKLPSFCLNRIRPHVRVPIQSKPDSAGTAEKKGGGSNSGDGNVGANGRKIVIVVDSSFEAEGALQWALSHTVQNQDKILLLHVIKPSKK  GELG
Subjt:  MGKTGAKLPSFCLNRIRPHVRVPIQSKPDSAGTAEKKGGGSNSGDGNVGANGRKIVIVVDSSFEAEGALQWALSHTVQNQDKILLLHVIKPSKK--GELG

Query:  TNKRAFEVVHSFRDLCQLKRPEVEIEVAVVEGKEKGPVIVEEARKQGAALLVLGQRKRSTTWRLLMVWAGHRLGGGVVEYCIQNASCMAIAVRRKSKKVG
        TNKRAFEVVHSFRDLCQLKRPEVEIEVAVVEGKEKGPVIVEEARKQGAALLVLGQRKRSTTWRL MVWAGHRLGGGVVEYCIQNASCMAIAVRRKSKKVG
Subjt:  TNKRAFEVVHSFRDLCQLKRPEVEIEVAVVEGKEKGPVIVEEARKQGAALLVLGQRKRSTTWRLLMVWAGHRLGGGVVEYCIQNASCMAIAVRRKSKKVG

Query:  GYLITTKRQKDFWLLA
        GYLITTKRQKDFWLLA
Subjt:  GYLITTKRQKDFWLLA

A0A6J1G0H0 uncharacterized protein LOC1114495942.99e-11077.09Show/hide
Query:  MGKTGAKLPSFCLNRIRPHVRVPIQSKPDSAG--TAEKKGGG---SNSGDGNVGANGRKIVIVVDSSFEAEGALQWALSHTVQNQDKILLLHVIKPSKKG
        MGKTG KLPSFCLNRIR HVRVPIQSKPDS    T E+KGG    SN G       GRKI+IVVDS+ EAEGALQWALS+TVQNQD I+LLH+  PSKKG
Subjt:  MGKTGAKLPSFCLNRIRPHVRVPIQSKPDSAG--TAEKKGGG---SNSGDGNVGANGRKIVIVVDSSFEAEGALQWALSHTVQNQDKILLLHVIKPSKKG

Query:  ELG--TNKRAFEVVHSFRDLCQLKRPEVEIEVAVVEGKEKGPVIVEEARKQGAALLVLGQRK-RSTTWRLLMVWAGHRLGGG-----VVEYCIQNASCMA
        E    T  R +E+VHS R+LCQLKRPEVEIEVAVVEGKEKGPVIVEEARKQGA+LLVLGQ+K RSTTWRLLM+WAGHR GGG      VEYCIQNASCMA
Subjt:  ELG--TNKRAFEVVHSFRDLCQLKRPEVEIEVAVVEGKEKGPVIVEEARKQGAALLVLGQRK-RSTTWRLLMVWAGHRLGGG-----VVEYCIQNASCMA

Query:  IAVRRKSKKVGGYLITTKRQKDFWLLA
        IAVRRKSKK+GGYLITTKRQKDFWLLA
Subjt:  IAVRRKSKKVGGYLITTKRQKDFWLLA

A0A6J1GXX9 uncharacterized protein LOC1114582167.81e-11077.25Show/hide
Query:  MGKTGAKLPSFCLNRIRPHVRVPIQSKPDSAG--TAEKKGGGSNSGDGNVGA-----NGRKIVIVVDSSFEAEGALQWALSHTVQNQDKILLLHVIKPSK
        MGKTG KLPSFCLNRIRPHVRVPIQSK DS    T  KK      G G VGA     NGRKI+IVVDS+ EAEGALQWALSHTVQNQDKI+LLHVIKPS+
Subjt:  MGKTGAKLPSFCLNRIRPHVRVPIQSKPDSAG--TAEKKGGGSNSGDGNVGA-----NGRKIVIVVDSSFEAEGALQWALSHTVQNQDKILLLHVIKPSK

Query:  KGE---LGTNKRAFEVVHSFRDLCQLKRPEVEIEVAVVEG-KEKGPVIVEEARKQGAALLVLGQRKRSTTWRLLMVWAGHRLGGG--------VVEYCIQ
        KGE     T+ RA+E+VHS R+LCQLKRPEVE+EVAVVEG KEKG VIVEEARKQ A+LLVLGQ+KRSTTWRLLMVWAGHR GGG        VVEYCIQ
Subjt:  KGE---LGTNKRAFEVVHSFRDLCQLKRPEVEIEVAVVEG-KEKGPVIVEEARKQGAALLVLGQRKRSTTWRLLMVWAGHRLGGG--------VVEYCIQ

Query:  NASCMAIAVRRKSKKVGGYLITTKRQKDFWLLA
        NASCMAIAVRRKSKK+GGYLITTKRQKDFWLLA
Subjt:  NASCMAIAVRRKSKKVGGYLITTKRQKDFWLLA

A0A6J1IT49 uncharacterized protein LOC1114784236.16e-10976.72Show/hide
Query:  MGKTGAKLPSFCLNRIRPHVRVPIQSKPDSAG--TAEKKGGGSNSGDGNVGA-----NGRKIVIVVDSSFEAEGALQWALSHTVQNQDKILLLHVIKPSK
        MGKTG KLPSFCLNRIRPHVRVPIQSK DS    T  KK      G G VGA     NGRKI+IV+DS+ EAEGALQWALSHTVQNQDKI+LLHVIKPS+
Subjt:  MGKTGAKLPSFCLNRIRPHVRVPIQSKPDSAG--TAEKKGGGSNSGDGNVGA-----NGRKIVIVVDSSFEAEGALQWALSHTVQNQDKILLLHVIKPSK

Query:  KGE---LGTNKRAFEVVHSFRDLCQLKRPEVEIEVAVVEG-KEKGPVIVEEARKQGAALLVLGQRKRSTTWRLLMVWAGHRLGGG-------VVEYCIQN
        KGE     T+ RA+E+VHS R+LCQLKRPEVE+EVAVVEG KEKG VIVEEARKQ A+LLVLGQ+KRSTTWRLLMVWAGHR GGG        VEYCIQN
Subjt:  KGE---LGTNKRAFEVVHSFRDLCQLKRPEVEIEVAVVEG-KEKGPVIVEEARKQGAALLVLGQRKRSTTWRLLMVWAGHRLGGG-------VVEYCIQN

Query:  ASCMAIAVRRKSKKVGGYLITTKRQKDFWLLA
        ASCMAIAVRRKSKK+GGYLITTKRQKDFWLLA
Subjt:  ASCMAIAVRRKSKKVGGYLITTKRQKDFWLLA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G69080.1 Adenine nucleotide alpha hydrolases-like superfamily protein6.2e-4847.41Show/hide
Query:  MGKTGAKLPSFCLNRIRPHVRVPIQSKPDSAGTAEKKGGGSNSGDGNVGANGRKIVIVVDSSFEAEGALQWALSHTVQNQDKILLLHVI--KPSKKGELG
        MGK G     F ++R+R +VRV    +P +  T  + G        ++   GR+I++VVDS  EA+ AL W LSH  Q QD ILLLH +  K S+ G+L 
Subjt:  MGKTGAKLPSFCLNRIRPHVRVPIQSKPDSAGTAEKKGGGSNSGDGNVGANGRKIVIVVDSSFEAEGALQWALSHTVQNQDKILLLHVI--KPSKKGELG

Query:  -------------TNKRAFEVVHSFRDLCQLKRPEVEIEVAVVEGKEKGPVIVEEARKQGAALLVLGQRKRSTTWRLLMVWAGHR---LGGGVVEYCIQN
                     T  RA + V + + +C+LKRPEV+ EV  V+G EKGP IV+EAR++ A+LLVLGQ+K+  TWRLLMVWA           VEYCI N
Subjt:  -------------TNKRAFEVVHSFRDLCQLKRPEVEIEVAVVEGKEKGPVIVEEARKQGAALLVLGQRKRSTTWRLLMVWAGHR---LGGGVVEYCIQN

Query:  ASCMAIAVRRKSKKVGGYLITTKRQKDFWLLA
        + CMAIAVR++ KK+GGY +TTKR KDFWLLA
Subjt:  ASCMAIAVRRKSKKVGGYLITTKRQKDFWLLA

AT1G69080.2 Adenine nucleotide alpha hydrolases-like superfamily protein1.7e-4247.03Show/hide
Query:  MGKTGAKLPSFCLNRIRPHVRVPIQSKPDSAGTAEKKGGGSNSGDGNVGANGRKIVIVVDSSFEAEGALQWALSHTVQNQDKILLLHVI--KPSKKGELG
        MGK G     F ++R+R +VRV    +P +  T  + G        ++   GR+I++VVDS  EA+ AL W LSH  Q QD ILLLH +  K S+ G+L 
Subjt:  MGKTGAKLPSFCLNRIRPHVRVPIQSKPDSAGTAEKKGGGSNSGDGNVGANGRKIVIVVDSSFEAEGALQWALSHTVQNQDKILLLHVI--KPSKKGELG

Query:  TNKRAFEVVHSFRDLCQLKRPEVEIEVAVVEGKEKGPVIVEEARKQGAALLVLGQRKRSTTWRLLMVWAGHR---LGGGVVEYCIQNASCMAIAVRRKSK
         NK   E     +        +V+ EV  V+G EKGP IV+EAR++ A+LLVLGQ+K+  TWRLLMVWA           VEYCI N+ CMAIAVR++ K
Subjt:  TNKRAFEVVHSFRDLCQLKRPEVEIEVAVVEGKEKGPVIVEEARKQGAALLVLGQRKRSTTWRLLMVWAGHR---LGGGVVEYCIQNASCMAIAVRRKSK

Query:  KVGGYLITTKRQKDFWLLA
        K+GGY +TTKR KDFWLLA
Subjt:  KVGGYLITTKRQKDFWLLA

AT2G03720.1 Adenine nucleotide alpha hydrolases-like superfamily protein8.4e-4554.82Show/hide
Query:  VIVVDSSFEAEGALQWALSHTVQNQDKILLLHVIKPS-----KKGELGTNKRAFEVVHSFRDLCQLKRPEVEIEVAVVE-GKEKGPVIVEEARKQGAALL
        ++VVD++ + + ALQWAL+H VQ++D I LLHV +        + +   N RA E+VH  ++ CQLK+P V+ E+ VVE  +EKG  IVEE++KQGA +L
Subjt:  VIVVDSSFEAEGALQWALSHTVQNQDKILLLHVIKPS-----KKGELGTNKRAFEVVHSFRDLCQLKRPEVEIEVAVVE-GKEKGPVIVEEARKQGAALL

Query:  VLGQRKRSTTWRLLMVW-AGHRLGGGVVEYCIQNASCMAIAVRRKSKKVGGYLITTKRQKDFWLLA
        VLGQRKR++ WR++  W     +GGGVVEYCI N+ CMAIAVR+KS   GGYLITTKR KDFWLLA
Subjt:  VLGQRKRSTTWRLLMVW-AGHRLGGGVVEYCIQNASCMAIAVRRKSKKVGGYLITTKRQKDFWLLA

AT3G03290.1 Adenine nucleotide alpha hydrolases-like superfamily protein6.6e-4248.24Show/hide
Query:  GRKIVIVVDSSFEAEGALQWALSHTVQNQDKILLLHVIKPSKKGELGTNK---RAFEVVHSFRDLCQLKRPEVEIEVAVVEG--KEKGPVIVEEARKQGA
        G ++++VVD    + GAL+WAL HT+Q+QD + LL+  KP +KG+    K   +  E+VH+ + LCQ KRP +E+E+  ++G  KEKG  IVEEA++Q  
Subjt:  GRKIVIVVDSSFEAEGALQWALSHTVQNQDKILLLHVIKPSKKGELGTNK---RAFEVVHSFRDLCQLKRPEVEIEVAVVEG--KEKGPVIVEEARKQGA

Query:  ALLVLGQRKRSTTWRLLMVWAGHRLGG--GVVEYCIQNASCMAIAVRRKSKKVGGYLITTKRQKDFWLLA
        +LLV+G+ K+   WRLL  W   +  G  G ++YC++ ASCM IAV+ K++K+GGYLITTKR K+FWLLA
Subjt:  ALLVLGQRKRSTTWRLLMVWAGHRLGG--GVVEYCIQNASCMAIAVRRKSKKVGGYLITTKRQKDFWLLA

AT5G17390.1 Adenine nucleotide alpha hydrolases-like superfamily protein2.4e-4450.59Show/hide
Query:  GRKIVIVVDSSFEAEGALQWALSHTVQNQDKILLLHVIKPSKKGELGTNKRAF---EVVHSFRDLCQLKRPEVEIEVAVVEG--KEKGPVIVEEARKQGA
        G ++++VVD +  + GAL+WA++HT+Q QD + LL+  KP +K +    KR     E+VH+ + LCQ KRP +E+E+  +EG  K+KG  IVEE++KQ  
Subjt:  GRKIVIVVDSSFEAEGALQWALSHTVQNQDKILLLHVIKPSKKGELGTNKRAF---EVVHSFRDLCQLKRPEVEIEVAVVEG--KEKGPVIVEEARKQGA

Query:  ALLVLGQRKRSTTWRLLMVWAGHRLGG--GVVEYCIQNASCMAIAVRRKSKKVGGYLITTKRQKDFWLLA
        +LLV+GQ K+   WRLL  WA  R  G  GV++YC++NASCM IAV+ K++K+GGYLITTKR K+FWLLA
Subjt:  ALLVLGQRKRSTTWRLLMVWAGHRLGG--GVVEYCIQNASCMAIAVRRKSKKVGGYLITTKRQKDFWLLA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGAAAACCGGCGCCAAGTTGCCAAGTTTCTGCCTGAACCGGATCCGGCCGCACGTTCGGGTGCCCATACAGTCCAAACCGGACTCTGCCGGCACCGCCGAGAAGAA
GGGCGGCGGTTCAAATTCCGGCGACGGAAATGTTGGAGCAAATGGAAGGAAGATTGTGATTGTGGTTGATTCGAGCTTTGAAGCAGAGGGGGCTCTTCAATGGGCGCTCT
CCCACACCGTACAGAACCAGGACAAGATTCTTCTTCTCCATGTCATAAAGCCTTCTAAAAAAGGGGAATTAGGAACCAACAAGAGAGCGTTTGAAGTGGTTCACTCTTTC
AGAGACTTGTGCCAGTTGAAGAGACCAGAGGTGGAAATTGAAGTGGCGGTGGTGGAAGGGAAAGAGAAGGGGCCGGTGATAGTGGAGGAAGCGAGAAAGCAGGGGGCGGC
GCTGCTGGTGCTCGGGCAGAGGAAGCGGTCGACGACGTGGCGGCTTCTGATGGTGTGGGCGGGGCACCGGCTCGGCGGCGGGGTGGTGGAGTATTGCATCCAGAATGCGA
GCTGTATGGCGATTGCGGTGCGGCGGAAGAGCAAGAAAGTGGGTGGATATTTGATCACCACCAAGCGTCAGAAGGATTTCTGGCTTTTGGCCTAA
mRNA sequenceShow/hide mRNA sequence
CCAAACCCAAAAATGACCAAAAGAAAAAACCCCCCTTGGATTCCAACTAAATAACCTCAATTTCGGTTAATTTCTAGAGTGATGGACTTAAAAGGACTCATAATTCCAAC
TTCTTTTGAAGGCAATTAAGCTTTTAATCAATCCAAATTAATTACTCCAACAATCATGACTCATCAGTTTCTTTTGAATTACGCTAAAATACTCTCAACACGAAAAAAGG
AGTTGTAGGGGTCAAGTTCTCAATAATAAAGCTTCTAATACTTTAATTGAATCCCTCATTTTCTCCGATCCCATTGCCATATCACAGCCTCCTCTCTCTCCCAATTTTGC
ATCGAAAATAAAGAATTGAACTCCCAGAAGTGAAGAAAACAGCTGGCCCTGGCGAGATTGAAGTTGAAGAGGATGATGGGCCTACACTATTTCATGACCACCACGTTCTT
GTACTAAGCCCCCACACACCCAACAAAATATAAAATAAATAAATAAAAAGGAAGCCAAACCCACTCAAATCATTACAAAGAAAAAAGAAAAAAAAAAGAAACCAAAACCA
TGCGACTCACAATCCCACAGTGCTCTGTTTACCGACATATAGAAATCTGTTGAGTTGAAACGAAAGGCTTTGGTTTTCTTCTGCCCTTTTGCCAGAAATGGGGAAAACCG
GCGCCAAGTTGCCAAGTTTCTGCCTGAACCGGATCCGGCCGCACGTTCGGGTGCCCATACAGTCCAAACCGGACTCTGCCGGCACCGCCGAGAAGAAGGGCGGCGGTTCA
AATTCCGGCGACGGAAATGTTGGAGCAAATGGAAGGAAGATTGTGATTGTGGTTGATTCGAGCTTTGAAGCAGAGGGGGCTCTTCAATGGGCGCTCTCCCACACCGTACA
GAACCAGGACAAGATTCTTCTTCTCCATGTCATAAAGCCTTCTAAAAAAGGGGAATTAGGAACCAACAAGAGAGCGTTTGAAGTGGTTCACTCTTTCAGAGACTTGTGCC
AGTTGAAGAGACCAGAGGTGGAAATTGAAGTGGCGGTGGTGGAAGGGAAAGAGAAGGGGCCGGTGATAGTGGAGGAAGCGAGAAAGCAGGGGGCGGCGCTGCTGGTGCTC
GGGCAGAGGAAGCGGTCGACGACGTGGCGGCTTCTGATGGTGTGGGCGGGGCACCGGCTCGGCGGCGGGGTGGTGGAGTATTGCATCCAGAATGCGAGCTGTATGGCGAT
TGCGGTGCGGCGGAAGAGCAAGAAAGTGGGTGGATATTTGATCACCACCAAGCGTCAGAAGGATTTCTGGCTTTTGGCCTAATTAATTAATCAAGCCTGTAATGTAATGT
AAATCTAAATAATTGAGAAATATAATGGGATTAGAATTGTAAGTGATTTTAATTGTTTGGAAAAGAAAAAAAAGAGTGTTGGTTGAATTTGAATAGGTGCAGTGGCACTC
ATGAGGTTTCCCAAAAAACTGTGGGTGGCATGTTCTGAAACTGTTGGAAATGTAATGATTTGGGGGTAACTTACTTAGCCTGTCCTGTCCATAAGTTTGGATCTAATTTT
CTAAAATGATTGATCTGAGGTTTTGTTTTTTTTTTTTTTTTTTTTCATTTTTCTCTCTCTTTTTGAATTATAATTTGGTCATGTACTTTCGTGATTTAGTATTTCAACTT
TTAATGGTAACGATTTAGTCTTTTTACTTTTATATTGATTAAATTTAGTTACTAAACTTTATAGGTAACAATATAGTTCCTAAAATTTAAAATTTGTAACAATTTTAGTT
CCTACGACGAAAATTTACATCAAAATTAAGTTTAAAGTTTTATCCTCTAACGATTTAATTTTTGTAATTTACAAATAAATTTGATCTTTGATTGGATTATCGATTTACAT
ATATAAATGATTTAACAAAAAAATTCACAAAATGAACTAAATGGTTACCAATTTAAATGATTACTTGATGGACTAAATGATTACCAAAAATATAAAGATCAAAATTTTTA
TTTCATAGCGTCCTGGACCCTTTAATTTTTATACTATATCAATTATATTAGCTACTCAAATGACAAAAGTAAGGTTACCCAACTCACAAATTGTTTACTCAATTTTCATA
TACTAATTTGTGAGGGAATGATCAAACCATTAACCTCTATGGTAACAACTGCATTATGTTAAAATTGTCTTTCTTATAAGTTAGGAATTTTTTATTTTTTATTTTTTT
Protein sequenceShow/hide protein sequence
MGKTGAKLPSFCLNRIRPHVRVPIQSKPDSAGTAEKKGGGSNSGDGNVGANGRKIVIVVDSSFEAEGALQWALSHTVQNQDKILLLHVIKPSKKGELGTNKRAFEVVHSF
RDLCQLKRPEVEIEVAVVEGKEKGPVIVEEARKQGAALLVLGQRKRSTTWRLLMVWAGHRLGGGVVEYCIQNASCMAIAVRRKSKKVGGYLITTKRQKDFWLLA