; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0037339 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0037339
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Descriptiontrihelix transcription factor ASR3
Genome locationchr2:5280372..5282789
RNA-Seq ExpressionLag0037339
SyntenyLag0037339
Gene Ontology termsNA
InterPro domainsIPR001005 - SANT/Myb domain
IPR044822 - Myb/SANT-like DNA-binding domain 4


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6608217.1 Trihelix transcription factor ASR3, partial [Cucurbita argyrosperma subsp. sororia]4.9e-12179.04Show/hide
Query:  MKKENGNRGPGVSGSRRTRSQIAPDWTAADCLVLVNVIAAVEADCLKALSSYQKWKIVAENCTSLDVARTSNQCRRKWDCLLIEHDVIKQWELEMPDDDS
        MK+E+G RG  VSGSRRTRS+IAPDWTAADCLVLVNVIAAVEADC KALSS+QKWKI+AENCTSLDVAR SNQCRRKWDCLLIEHDVIKQWELEMPDDDS
Subjt:  MKKENGNRGPGVSGSRRTRSQIAPDWTAADCLVLVNVIAAVEADCLKALSSYQKWKIVAENCTSLDVARTSNQCRRKWDCLLIEHDVIKQWELEMPDDDS

Query:  YWCLESGRRKELGLPENFDEGLFKAIDNVATMRANQSDTEPDSDPEAIVEMVDEIAEPGSKRQRRRSMSKQNQAPEKSLKCEDEEEEEKKLPVLCLEAQP
        YWCLESGRRKELGLP+NFDE +FKAIDNV +MRANQSDTEPDSDPEA VE VDE AEPG KRQRR SMS +NQ+ EKS+ CE+E+EE    PV   E + 
Subjt:  YWCLESGRRKELGLPENFDEGLFKAIDNVATMRANQSDTEPDSDPEAIVEMVDEIAEPGSKRQRRRSMSKQNQAPEKSLKCEDEEEEEKKLPVLCLEAQP

Query:  RECYIKSYGEKATDNHEPEEKMMVKKLLENAEKIQAIVSENAEYATSDEKNVDH---RIDLVRRQGSKLIRCLGDFLNTINDLRGLLEDHE
        R CYIKS+GEKATD+ EPEE+ MVKKLLE AEK+QAIVSENAEYATSDEK  D+   R + +RRQGSKLI+CL DFLNTINDL  LLED E
Subjt:  RECYIKSYGEKATDNHEPEEKMMVKKLLENAEKIQAIVSENAEYATSDEKNVDH---RIDLVRRQGSKLIRCLGDFLNTINDLRGLLEDHE

KAG6608224.1 Trihelix transcription factor ASR3, partial [Cucurbita argyrosperma subsp. sororia]2.2e-12179.31Show/hide
Query:  MKKENGNRGPGVSGSRRTRSQIAPDWTAADCLVLVNVIAAVEADCLKALSSYQKWKIVAENCTSLDVARTSNQCRRKWDCLLIEHDVIKQWELEMPDDDS
        MK+E+G RG  VSGSRRTRS+IAPDWTAADCLVLVNVIAAVEADC KALSS+QKWKI+AENCTSLDVAR SNQCRRKWDCLLIEHDVIKQWELEMPDDDS
Subjt:  MKKENGNRGPGVSGSRRTRSQIAPDWTAADCLVLVNVIAAVEADCLKALSSYQKWKIVAENCTSLDVARTSNQCRRKWDCLLIEHDVIKQWELEMPDDDS

Query:  YWCLESGRRKELGLPENFDEGLFKAIDNVATMRANQSDTEPDSDPEAIVEMVDEIAEPGSKRQRRRSMSKQNQAPEKSLKCEDEEEEEKKLPVLCLEAQP
        YWCLESGRRKELGLP+NFDE +FKAIDNV +MRANQSDTEPDSDPEA VE VDE AEPG KRQRR SMS +NQ+ EKS+ CE+E+EE    PV   E + 
Subjt:  YWCLESGRRKELGLPENFDEGLFKAIDNVATMRANQSDTEPDSDPEAIVEMVDEIAEPGSKRQRRRSMSKQNQAPEKSLKCEDEEEEEKKLPVLCLEAQP

Query:  RECYIKSYGEKATDNHEPEEKMMVKKLLENAEKIQAIVSENAEYATSDEKNVDH--RIDLVRRQGSKLIRCLGDFLNTINDLRGLLEDHE
        R CYIKS+GEKATD+ EPEE+ MVKKLLE AEK+QAIVSENAEYATSDEKN ++  R + +RRQGSKLI+CL DFLNTINDL  LLED E
Subjt:  RECYIKSYGEKATDNHEPEEKMMVKKLLENAEKIQAIVSENAEYATSDEKNVDH--RIDLVRRQGSKLIRCLGDFLNTINDLRGLLEDHE

XP_022139752.1 trihelix transcription factor ASR3 [Momordica charantia]2.0e-12783.39Show/hide
Query:  MKKENGNRGPGVSGSRRTRSQIAPDWTAADCLVLVNVIAAVEADCLKALSSYQKWKIVAENCTSLDVARTSNQCRRKWDCLLIEHDVIKQWELEMPDDDS
        MK+E+GNRG GVSGSRRTRSQIAPDWTAA+CLVLVNVIAAVEADCLKALSSYQKWKIVAENCTSLDVARTSNQCRRKWDCLLIEHDVIKQWELEMPDDDS
Subjt:  MKKENGNRGPGVSGSRRTRSQIAPDWTAADCLVLVNVIAAVEADCLKALSSYQKWKIVAENCTSLDVARTSNQCRRKWDCLLIEHDVIKQWELEMPDDDS

Query:  YWCLESGRRKELGLPENFDEGLFKAIDNVATMRANQSDTEPDSDPEAIVEMVDEIAEPGSKRQRRRSMSKQNQAPEKSLKCEDEEEEEKKLPVLCLEAQP
        YW LESGRRKELGLPENFD+ LFKAIDNVATMRANQSDTEPDSDPEA VEM+DEI+EPG KRQRRRS+SK++QA EKSL+CE+E+EE  K P+   EA+P
Subjt:  YWCLESGRRKELGLPENFDEGLFKAIDNVATMRANQSDTEPDSDPEAIVEMVDEIAEPGSKRQRRRSMSKQNQAPEKSLKCEDEEEEEKKLPVLCLEAQP

Query:  RECYIKSYGEKATDNHE-PEEKMMVKKLLENAEKIQAIVSENAEYATSDEKNVDHRIDLVRRQGSKLIRCLGDFLNTINDLRGLLEDHE
        REC+IKS GEK  D+ E  EE+MM KKLLEN E+IQAIVSENAEYATSDEKN +HRID VRRQG+ LIRCLGD LN INDL GL ED E
Subjt:  RECYIKSYGEKATDNHE-PEEKMMVKKLLENAEKIQAIVSENAEYATSDEKNVDHRIDLVRRQGSKLIRCLGDFLNTINDLRGLLEDHE

XP_022976249.1 trihelix transcription factor ASR3-like [Cucurbita maxima]2.9e-12178.97Show/hide
Query:  MKKENGNRGPGVSGSRRTRSQIAPDWTAADCLVLVNVIAAVEADCLKALSSYQKWKIVAENCTSLDVARTSNQCRRKWDCLLIEHDVIKQWELEMPDDDS
        MKKENGNRG GVSGSRRTRSQIAP+WTAA+CLVLVNVI AVEADC+KALSSYQKWKIVAE+CT+L+VARTSNQCR+KW+CLLIEHDVI+QWEL MP+DDS
Subjt:  MKKENGNRGPGVSGSRRTRSQIAPDWTAADCLVLVNVIAAVEADCLKALSSYQKWKIVAENCTSLDVARTSNQCRRKWDCLLIEHDVIKQWELEMPDDDS

Query:  YWCLESGRRKELGLPENFDEGLFKAIDNVATMRANQSDTEPDSDPEAIVEMVDEIAEPGSKRQRRRSMSKQNQAPEKSLKC-EDEEEEEKKLPVLCL-EA
        YWCLESGRRKELGLP+NFDE LFKAI NV++MRANQSDTEPD+DPEA VE  DEI+EPG KRQRR SMSK+NQ  EKSL+C EDEEEE ++ P+L   EA
Subjt:  YWCLESGRRKELGLPENFDEGLFKAIDNVATMRANQSDTEPDSDPEAIVEMVDEIAEPGSKRQRRRSMSKQNQAPEKSLKC-EDEEEEEKKLPVLCL-EA

Query:  QPRECYIKSYGEKATDNHEPEEKMMVKKLLENAEKIQAIVSENAEYATSDEKNVDHRIDLVRRQGSKLIRCLGDFLNTINDLRGLLEDHE
          R+CYIK+ G KATD+ EPEE+MMVKKLLENAE +Q IVSENAE  TSDEKN   + +L+RRQGSKLIRCLGDFLNTINDLR LLED E
Subjt:  QPRECYIKSYGEKATDNHEPEEKMMVKKLLENAEKIQAIVSENAEYATSDEKNVDHRIDLVRRQGSKLIRCLGDFLNTINDLRGLLEDHE

XP_038897371.1 trihelix transcription factor ASR3 [Benincasa hispida]4.3e-12579.29Show/hide
Query:  MKKEN-GNRGPGVSGSRRTRSQI--APDWTAADCLVLVNVIAAVEADCLKALSSYQKWKIVAENCTSLDVARTSNQCRRKWDCLLIEHDVIKQWELEMPD
        MKKEN GNRG GVSGSRRTRSQI  APDWTAADCLVLVNVIAAVEADCLKALSSYQKWKIVAENCTSLDV RTSNQCRRKWDCLLIEHDVIKQWEL+MP+
Subjt:  MKKEN-GNRGPGVSGSRRTRSQI--APDWTAADCLVLVNVIAAVEADCLKALSSYQKWKIVAENCTSLDVARTSNQCRRKWDCLLIEHDVIKQWELEMPD

Query:  DDSYWCLESGRRKELGLPENFDEGLFKAIDNVATMRANQSDTEPDSDPEAIVEMVDEIAEPGSKRQRRRSMSKQNQAPEKSLKC----------------
        DDSYWCLESGRRKELGLP+NFDE LFKAIDNVATMRANQSDTEPDSDPEA VE +DEIAEPG KRQRRRSMSK NQ  EKSL+C                
Subjt:  DDSYWCLESGRRKELGLPENFDEGLFKAIDNVATMRANQSDTEPDSDPEAIVEMVDEIAEPGSKRQRRRSMSKQNQAPEKSLKC----------------

Query:  --EDEEEEEKKLPVLCLEAQPRECYIKSYGEKATDNHEPEEKMMVKKLLENAEKIQAIVSENAEYATSDEKNVDHRIDLVRRQGSKLIRCLGDFLNTIND
          ED EEEE+K  +   E +PRECYIK+ G K TDN EP+E+MM K LLENAEK+QAIVSENAEYATSDEKN   + +LVR QGSKLIRCLGD LNTIND
Subjt:  --EDEEEEEKKLPVLCLEAQPRECYIKSYGEKATDNHEPEEKMMVKKLLENAEKIQAIVSENAEYATSDEKNVDHRIDLVRRQGSKLIRCLGDFLNTIND

Query:  LRGLLEDHE
        LRGLLED E
Subjt:  LRGLLEDHE

TrEMBL top hitse value%identityAlignment
A0A0A0LDW0 Myb-like domain-containing protein7.6e-12076.92Show/hide
Query:  MKKEN-GNRGPGVSGSRRTRSQI--APDWTAADCLVLVNVIAAVEADCLKALSSYQKWKIVAENCTSLDVARTSNQCRRKWDCLLIEHDVIKQWELEMPD
        MKKEN GNRG GVSGSRRTRSQI  AP WTAADCLVLVNVIAAVEADCLKALSSYQKWKIVAENCTSLDV RTSNQCRRKWDCLLIEHDVIKQWEL+MPD
Subjt:  MKKEN-GNRGPGVSGSRRTRSQI--APDWTAADCLVLVNVIAAVEADCLKALSSYQKWKIVAENCTSLDVARTSNQCRRKWDCLLIEHDVIKQWELEMPD

Query:  DDSYWCLESGRRKELGLPENFDEGLFKAIDNVATMRANQSDTEPDSDPEAIVEMVDEIAEPGSKRQRRRSMSKQNQAPEKSLKCE---------------
        DDSYWCL SGRRKELGLPENFDE LFKAIDNVA+MRANQSDTEPDSDPEA +   DEIAEPG KRQRRRSMSK NQ  EKSL+CE               
Subjt:  DDSYWCLESGRRKELGLPENFDEGLFKAIDNVATMRANQSDTEPDSDPEAIVEMVDEIAEPGSKRQRRRSMSKQNQAPEKSLKCE---------------

Query:  ------DEEEEEKKLPVLCLEAQPRECYIKSYGEKATDNHEPEEKMMVKKLLENAEKIQAIVSENAEYATSDEKNVDHRIDLVRRQGSKLIRCLGDFLNT
               EE EEK L +   E +PRECYIKS   K TDN EP+E+MM K LLENAEK+QAIVSENAEY TSDEK    + +LVR QGSKLIRCLGD LNT
Subjt:  ------DEEEEEKKLPVLCLEAQPRECYIKSYGEKATDNHEPEEKMMVKKLLENAEKIQAIVSENAEYATSDEKNVDHRIDLVRRQGSKLIRCLGDFLNT

Query:  INDLRGLLEDHE
        INDLRGLLED E
Subjt:  INDLRGLLEDHE

A0A6J1CEU7 trihelix transcription factor ASR39.9e-12883.39Show/hide
Query:  MKKENGNRGPGVSGSRRTRSQIAPDWTAADCLVLVNVIAAVEADCLKALSSYQKWKIVAENCTSLDVARTSNQCRRKWDCLLIEHDVIKQWELEMPDDDS
        MK+E+GNRG GVSGSRRTRSQIAPDWTAA+CLVLVNVIAAVEADCLKALSSYQKWKIVAENCTSLDVARTSNQCRRKWDCLLIEHDVIKQWELEMPDDDS
Subjt:  MKKENGNRGPGVSGSRRTRSQIAPDWTAADCLVLVNVIAAVEADCLKALSSYQKWKIVAENCTSLDVARTSNQCRRKWDCLLIEHDVIKQWELEMPDDDS

Query:  YWCLESGRRKELGLPENFDEGLFKAIDNVATMRANQSDTEPDSDPEAIVEMVDEIAEPGSKRQRRRSMSKQNQAPEKSLKCEDEEEEEKKLPVLCLEAQP
        YW LESGRRKELGLPENFD+ LFKAIDNVATMRANQSDTEPDSDPEA VEM+DEI+EPG KRQRRRS+SK++QA EKSL+CE+E+EE  K P+   EA+P
Subjt:  YWCLESGRRKELGLPENFDEGLFKAIDNVATMRANQSDTEPDSDPEAIVEMVDEIAEPGSKRQRRRSMSKQNQAPEKSLKCEDEEEEEKKLPVLCLEAQP

Query:  RECYIKSYGEKATDNHE-PEEKMMVKKLLENAEKIQAIVSENAEYATSDEKNVDHRIDLVRRQGSKLIRCLGDFLNTINDLRGLLEDHE
        REC+IKS GEK  D+ E  EE+MM KKLLEN E+IQAIVSENAEYATSDEKN +HRID VRRQG+ LIRCLGD LN INDL GL ED E
Subjt:  RECYIKSYGEKATDNHE-PEEKMMVKKLLENAEKIQAIVSENAEYATSDEKNVDHRIDLVRRQGSKLIRCLGDFLNTINDLRGLLEDHE

A0A6J1FEH7 trihelix transcription factor ASR3-like9.0e-12178.97Show/hide
Query:  MKKENGNRGPGVSGSRRTRSQIAPDWTAADCLVLVNVIAAVEADCLKALSSYQKWKIVAENCTSLDVARTSNQCRRKWDCLLIEHDVIKQWELEMPDDDS
        MKKENGNRG GVSGSRRTRSQIAP+WTAA+CLVLVNVI AVEADCLKALSSYQKWKIVAE+CT+L+VARTSNQCR+KW+CLLIEHDVIKQWEL MP+DDS
Subjt:  MKKENGNRGPGVSGSRRTRSQIAPDWTAADCLVLVNVIAAVEADCLKALSSYQKWKIVAENCTSLDVARTSNQCRRKWDCLLIEHDVIKQWELEMPDDDS

Query:  YWCLESGRRKELGLPENFDEGLFKAIDNVATMRANQSDTEPDSDPEAIVEMVDEIAEPGSKRQRRRSMSKQNQAPEKSLKC-EDEEEEEKKLPVLCL-EA
        YWCLESGRRKELGLP+NFDE LFKAIDNV++MRANQSDTEPD+DPEA VE  DEI+EPG KRQRR SMSK+NQ  EKSL+  EDEE+E ++ P+L   E+
Subjt:  YWCLESGRRKELGLPENFDEGLFKAIDNVATMRANQSDTEPDSDPEAIVEMVDEIAEPGSKRQRRRSMSKQNQAPEKSLKC-EDEEEEEKKLPVLCL-EA

Query:  QPRECYIKSYGEKATDNHEPEEKMMVKKLLENAEKIQAIVSENAEYATSDEKNVDHRIDLVRRQGSKLIRCLGDFLNTINDLRGLLEDHE
          R+CYIK+ G  ATD+ EPEE+MMVKKLLENAE +Q IVSENAE ATSDEKN   + +L+RRQGSKLIRCLGDFLNTINDLR LLED E
Subjt:  QPRECYIKSYGEKATDNHEPEEKMMVKKLLENAEKIQAIVSENAEYATSDEKNVDHRIDLVRRQGSKLIRCLGDFLNTINDLRGLLEDHE

A0A6J1FMB3 trihelix transcription factor ASR3-like1.2e-12078.97Show/hide
Query:  MKKENGNRGPGVSGSRRTRSQIAPDWTAADCLVLVNVIAAVEADCLKALSSYQKWKIVAENCTSLDVARTSNQCRRKWDCLLIEHDVIKQWELEMPDDDS
        MK+E+G RG  VSGSRRTRS+IAPDWTAADCLVLVNVIAAVEADC KALSS+QKWKIVAENCTSLDVAR SNQCRRKWDCLLIEHDVIKQWELEMPDDDS
Subjt:  MKKENGNRGPGVSGSRRTRSQIAPDWTAADCLVLVNVIAAVEADCLKALSSYQKWKIVAENCTSLDVARTSNQCRRKWDCLLIEHDVIKQWELEMPDDDS

Query:  YWCLESGRRKELGLPENFDEGLFKAIDNVATMRANQSDTEPDSDPEAIVEMVDEIAEPGSKRQRRRSMSKQNQAPEKSLKCEDEEEEEKKLPVLCLEAQP
        YWCLESGRRKELGLP+NFDE +FKAIDNV +MRANQSDTEPDSDPEA VE VDE AEPG KRQRR SMS +NQ+ EKS+KCE+E+EE +   V   E + 
Subjt:  YWCLESGRRKELGLPENFDEGLFKAIDNVATMRANQSDTEPDSDPEAIVEMVDEIAEPGSKRQRRRSMSKQNQAPEKSLKCEDEEEEEKKLPVLCLEAQP

Query:  RECYIKSYGEKATDNHEPEEKMMVKKLLENAEKIQAIVSENAEYATSDEKNVDH--RIDLVRRQGSKLIRCLGDFLNTINDLRGLLEDHE
        R CYIKS+GEKATD+ EPEE+ M KKLLE AEK+QAIVSENAEYATSDEKN ++  R + +R QGSKLI+CL DFLNTINDL  LLED E
Subjt:  RECYIKSYGEKATDNHEPEEKMMVKKLLENAEKIQAIVSENAEYATSDEKNVDH--RIDLVRRQGSKLIRCLGDFLNTINDLRGLLEDHE

A0A6J1IN02 trihelix transcription factor ASR3-like1.4e-12178.97Show/hide
Query:  MKKENGNRGPGVSGSRRTRSQIAPDWTAADCLVLVNVIAAVEADCLKALSSYQKWKIVAENCTSLDVARTSNQCRRKWDCLLIEHDVIKQWELEMPDDDS
        MKKENGNRG GVSGSRRTRSQIAP+WTAA+CLVLVNVI AVEADC+KALSSYQKWKIVAE+CT+L+VARTSNQCR+KW+CLLIEHDVI+QWEL MP+DDS
Subjt:  MKKENGNRGPGVSGSRRTRSQIAPDWTAADCLVLVNVIAAVEADCLKALSSYQKWKIVAENCTSLDVARTSNQCRRKWDCLLIEHDVIKQWELEMPDDDS

Query:  YWCLESGRRKELGLPENFDEGLFKAIDNVATMRANQSDTEPDSDPEAIVEMVDEIAEPGSKRQRRRSMSKQNQAPEKSLKC-EDEEEEEKKLPVLCL-EA
        YWCLESGRRKELGLP+NFDE LFKAI NV++MRANQSDTEPD+DPEA VE  DEI+EPG KRQRR SMSK+NQ  EKSL+C EDEEEE ++ P+L   EA
Subjt:  YWCLESGRRKELGLPENFDEGLFKAIDNVATMRANQSDTEPDSDPEAIVEMVDEIAEPGSKRQRRRSMSKQNQAPEKSLKC-EDEEEEEKKLPVLCL-EA

Query:  QPRECYIKSYGEKATDNHEPEEKMMVKKLLENAEKIQAIVSENAEYATSDEKNVDHRIDLVRRQGSKLIRCLGDFLNTINDLRGLLEDHE
          R+CYIK+ G KATD+ EPEE+MMVKKLLENAE +Q IVSENAE  TSDEKN   + +L+RRQGSKLIRCLGDFLNTINDLR LLED E
Subjt:  QPRECYIKSYGEKATDNHEPEEKMMVKKLLENAEKIQAIVSENAEYATSDEKNVDHRIDLVRRQGSKLIRCLGDFLNTINDLRGLLEDHE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G31310.1 hydroxyproline-rich glycoprotein family protein7.3e-0625.24Show/hide
Query:  KWKIVAENCTSLDVARTSNQCRRKWDCLLIEHDVIKQWELEMPDDD-----------------SYWCLESGRRKELGLPENFDEGLFKAIDNVATMRANQ
        +WK + + C      R+ NQC  KWD L+ ++  ++++E    +                   SYW +E   RKE  LP N     ++A+  V   +   
Subjt:  KWKIVAENCTSLDVARTSNQCRRKWDCLLIEHDVIKQWELEMPDDD-----------------SYWCLESGRRKELGLPENFDEGLFKAIDNVATMRANQ

Query:  SDT
        S T
Subjt:  SDT

AT2G35640.1 Homeodomain-like superfamily protein7.3e-0624.63Show/hide
Query:  DWTAADCLVLVNVIAAVEADCLKALSSYQK------------WKIVAENCTSLDVARTSNQCRRKWDCLLIEHDVIKQWELEMPD-------DDSYWCLE
        +WT ++ LVL   I A + D  + +   +K            WK + E C      R  NQC  KWD L+ ++  I+++E    +         SYW ++
Subjt:  DWTAADCLVLVNVIAAVEADCLKALSSYQK------------WKIVAENCTSLDVARTSNQCRRKWDCLLIEHDVIKQWELEMPD-------DDSYWCLE

Query:  SGRRKELGLPENFDEGLFKAIDNVATMRANQSDT
           RKE  LP N    ++  +  +   +   S +
Subjt:  SGRRKELGLPENFDEGLFKAIDNVATMRANQSDT

AT4G31270.1 sequence-specific DNA binding transcription factors5.7e-5142.27Show/hide
Query:  GVSGSRRTRSQIAPDWTAADCLVLVNVIAAVEADCLKALSSYQKWKIVAENCTSLDVARTSNQCRRKWDCLLIEHDVIKQWELEMPDDD-SYWCLESGRR
        G SGSRRTRSQ+AP+W   DCLVLVN IAAVEADC  ALSS+QKW ++ ENC +LDV+R  NQCRRKWD L+ +++ IK+WE +      SYW L S +R
Subjt:  GVSGSRRTRSQIAPDWTAADCLVLVNVIAAVEADCLKALSSYQKWKIVAENCTSLDVARTSNQCRRKWDCLLIEHDVIKQWELEMPDDD-SYWCLESGRR

Query:  KELGLPENFDEGLFKAIDNVATMRANQSDTEPDSDPEA--IVEMVDEIAEPGSKRQRRRSM-SKQNQAPEKSLKCEDEEEEEKKLPVLCL-------EAQ
        K L LP + D  LF+AI+ V  ++  ++ TE DSDPEA  +V++  E+A  GSKR R+R+M  K+ +  E           EK +            E +
Subjt:  KELGLPENFDEGLFKAIDNVATMRANQSDTEPDSDPEA--IVEMVDEIAEPGSKRQRRRSM-SKQNQAPEKSLKCEDEEEEEKKLPVLCL-------EAQ

Query:  PRECYIKSYGEKATDNHEPEEKMMVKKLLENAEKIQAIVSEN--AEYATSDEKNVDHRIDLVRRQGSKLIRCLGDFLNTINDLRGLLEDHE
        P E       E  T N E + ++M  KL    + I AIV  N   +  T D  ++D ++  VR+QG +LI CL + ++T+N L  + ++ E
Subjt:  PRECYIKSYGEKATDNHEPEEKMMVKKLLENAEKIQAIVSEN--AEYATSDEKNVDHRIDLVRRQGSKLIRCLGDFLNTINDLRGLLEDHE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGCATTATACATTGTGGGTTCTTCTATTCTACCGCGAATATTGGGCATCGAAGCCAGCAGAGAATGGGCAATGCAGAAGGTCAAGACTCAAGGCTATGGCTTCTTC
ACTTCATCTCAAATTGAAAACGGAAGAATCTCGAGACTCGAAACGACGAACGTTTTTGTGCGAAATGAAGAAGGAGAACGGCAATCGAGGACCGGGGGTTTCAGGTTCTC
GTCGGACGCGGTCTCAGATAGCGCCGGATTGGACGGCGGCGGATTGCCTCGTTCTTGTTAATGTGATTGCGGCTGTGGAGGCTGATTGTTTGAAAGCTTTGTCTAGCTAT
CAGAAATGGAAGATTGTTGCAGAGAACTGCACGTCTTTGGATGTGGCTCGGACTTCGAATCAGTGCAGGAGGAAGTGGGACTGTTTGTTGATTGAACATGATGTTATCAA
GCAATGGGAGTTAGAGATGCCGGATGATGATTCGTATTGGTGTTTGGAGAGTGGAAGGAGAAAGGAATTGGGACTTCCTGAGAATTTCGACGAGGGGCTGTTCAAAGCAA
TTGATAATGTCGCCACGATGAGGGCGAATCAGTCGGATACGGAGCCGGATAGCGATCCCGAGGCTATCGTTGAGATGGTTGATGAAATTGCAGAGCCTGGCTCTAAAAGG
CAAAGACGTCGTTCAATGTCTAAGCAAAATCAAGCCCCAGAGAAATCTTTGAAATGTGAAGATGAAGAAGAAGAAGAAAAAAAACTTCCAGTACTCTGTCTGGAAGCACA
GCCTCGTGAATGCTACATCAAAAGCTACGGAGAAAAGGCGACCGATAACCATGAACCCGAAGAGAAAATGATGGTGAAGAAATTGCTTGAAAATGCAGAAAAAATTCAAG
CAATTGTGTCTGAGAATGCAGAGTATGCAACTTCTGATGAAAAGAACGTCGACCACCGAATTGATTTGGTAAGGCGTCAAGGGAGCAAGCTTATCAGATGTCTTGGAGAT
TTTCTCAACACCATTAATGACCTCCGTGGCCTGCTCGAAGATCACGAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGGCATTATACATTGTGGGTTCTTCTATTCTACCGCGAATATTGGGCATCGAAGCCAGCAGAGAATGGGCAATGCAGAAGGTCAAGACTCAAGGCTATGGCTTCTTC
ACTTCATCTCAAATTGAAAACGGAAGAATCTCGAGACTCGAAACGACGAACGTTTTTGTGCGAAATGAAGAAGGAGAACGGCAATCGAGGACCGGGGGTTTCAGGTTCTC
GTCGGACGCGGTCTCAGATAGCGCCGGATTGGACGGCGGCGGATTGCCTCGTTCTTGTTAATGTGATTGCGGCTGTGGAGGCTGATTGTTTGAAAGCTTTGTCTAGCTAT
CAGAAATGGAAGATTGTTGCAGAGAACTGCACGTCTTTGGATGTGGCTCGGACTTCGAATCAGTGCAGGAGGAAGTGGGACTGTTTGTTGATTGAACATGATGTTATCAA
GCAATGGGAGTTAGAGATGCCGGATGATGATTCGTATTGGTGTTTGGAGAGTGGAAGGAGAAAGGAATTGGGACTTCCTGAGAATTTCGACGAGGGGCTGTTCAAAGCAA
TTGATAATGTCGCCACGATGAGGGCGAATCAGTCGGATACGGAGCCGGATAGCGATCCCGAGGCTATCGTTGAGATGGTTGATGAAATTGCAGAGCCTGGCTCTAAAAGG
CAAAGACGTCGTTCAATGTCTAAGCAAAATCAAGCCCCAGAGAAATCTTTGAAATGTGAAGATGAAGAAGAAGAAGAAAAAAAACTTCCAGTACTCTGTCTGGAAGCACA
GCCTCGTGAATGCTACATCAAAAGCTACGGAGAAAAGGCGACCGATAACCATGAACCCGAAGAGAAAATGATGGTGAAGAAATTGCTTGAAAATGCAGAAAAAATTCAAG
CAATTGTGTCTGAGAATGCAGAGTATGCAACTTCTGATGAAAAGAACGTCGACCACCGAATTGATTTGGTAAGGCGTCAAGGGAGCAAGCTTATCAGATGTCTTGGAGAT
TTTCTCAACACCATTAATGACCTCCGTGGCCTGCTCGAAGATCACGAGTGA
Protein sequenceShow/hide protein sequence
MGHYTLWVLLFYREYWASKPAENGQCRRSRLKAMASSLHLKLKTEESRDSKRRTFLCEMKKENGNRGPGVSGSRRTRSQIAPDWTAADCLVLVNVIAAVEADCLKALSSY
QKWKIVAENCTSLDVARTSNQCRRKWDCLLIEHDVIKQWELEMPDDDSYWCLESGRRKELGLPENFDEGLFKAIDNVATMRANQSDTEPDSDPEAIVEMVDEIAEPGSKR
QRRRSMSKQNQAPEKSLKCEDEEEEEKKLPVLCLEAQPRECYIKSYGEKATDNHEPEEKMMVKKLLENAEKIQAIVSENAEYATSDEKNVDHRIDLVRRQGSKLIRCLGD
FLNTINDLRGLLEDHE