CuGenDBv2

Tan0009744 (gene) of Snake gourd v1 genome

Gene ID: Tan0009744
Organism: Trichosanthes anguina (Snake gourd v1)
Description: trihelix transcription factor ASR3
Genome location: LG01:102346657..102349585
RNA-Seq Expression: Tan0009744
Synteny: Tan0009744
Gene Ontology terms: NA
InterPro domains: IPR001005 - SANT/Myb domain


Homology
GenBank top hits (e value, % identity, alignment)
KAG6608224.1 Trihelix transcription factor ASR3, partial [Cucurbita argyrosperma subsp. sororia]; e value: 1.4e-120; % identity: 80.42
Query:  MKKGDGNRGPGVSGSRRTRSQIEPDWTAADCLVLVNVIAAVEADCLKALSSYQKWKIVAENCTSLDVVRNSNQCRRKWDCLLIEHDVIKQWELEMPDDDS
        MK+ DG RG  VSGSRRTRS+I PDWTAADCLVLVNVIAAVEADC KALSS+QKWKI+AENCTSLDV RNSNQCRRKWDCLLIEHDVIKQWELEMPDDDS
Subjt:  MKKGDGNRGPGVSGSRRTRSQIEPDWTAADCLVLVNVIAAVEADCLKALSSYQKWKIVAENCTSLDVVRNSNQCRRKWDCLLIEHDVIKQWELEMPDDDS

Query:  YWCLESGRRKELGLPDNFDEELFKAIDNVATMRENQSDTEPDSDPEAAVEIVDEVAEPGPKRQRRRSMSKRNQVLEKSLKCEEEEEEKPPLSFPEVEPRE
        YWCLESGRRKELGLPDNFDEE+FKAIDNV +MR NQSDTEPDSDPEAAVE VDE AEPGPKRQRR SMS RNQ LEKS+ CEEE+EE PP+S PEVE R 
Subjt:  YWCLESGRRKELGLPDNFDEELFKAIDNVATMRENQSDTEPDSDPEAAVEIVDEVAEPGPKRQRRRSMSKRNQVLEKSLKCEEEEEEKPPLSFPEVEPRE

Query:  CHLKSNGEKATNNAEPKEQMMAKNLLETAAKVQAIVSENAEYVASDAKNVGH--RIDLVRRQGSKLIRSLGDFLNTINDLSGLLED
        C++KS+GEKAT++ EP+EQ M K LLETA KVQAIVSENAEY  SD KN  +  R + +RRQGSKLI+ L DFLNTINDL  LLED
Subjt:  CHLKSNGEKATNNAEPKEQMMAKNLLETAAKVQAIVSENAEYVASDAKNVGH--RIDLVRRQGSKLIRSLGDFLNTINDLSGLLED

KAG7037576.1 Trihelix transcription factor ASR3, partial [Cucurbita argyrosperma subsp. argyrosperma]; e value: 4.8e-121; % identity: 81.12
Query:  MKKGDGNRGPGVSGSRRTRSQIEPDWTAADCLVLVNVIAAVEADCLKALSSYQKWKIVAENCTSLDVVRNSNQCRRKWDCLLIEHDVIKQWELEMPDDDS
        MK+ DG RG  VSGSRRTRS+I PDWTAADCLVLVNVIAAVEADC KALSS+QKWKIVAENCTSLDV RNSNQCRRKWDCLLIEHDVIKQWELEMPDDDS
Subjt:  MKKGDGNRGPGVSGSRRTRSQIEPDWTAADCLVLVNVIAAVEADCLKALSSYQKWKIVAENCTSLDVVRNSNQCRRKWDCLLIEHDVIKQWELEMPDDDS

Query:  YWCLESGRRKELGLPDNFDEELFKAIDNVATMRENQSDTEPDSDPEAAVEIVDEVAEPGPKRQRRRSMSKRNQVLEKSLKCEEEEEEKPPLSFPEVEPRE
        YWCLESGRRKELGLPDNFDEE+FKAIDNV +MR NQSDTEPDSDPEAAVE VDE AEPGPKRQRR SMS RN+ LEKS+KCEEE+EE PP+S PEVE R 
Subjt:  YWCLESGRRKELGLPDNFDEELFKAIDNVATMRENQSDTEPDSDPEAAVEIVDEVAEPGPKRQRRRSMSKRNQVLEKSLKCEEEEEEKPPLSFPEVEPRE

Query:  CHLKSNGEKATNNAEPKEQMMAKNLLETAAKVQAIVSENAEYVASDAKNVGH--RIDLVRRQGSKLIRSLGDFLNTINDLSGLLED
        C++KS+GEKAT++ EP+EQ MAK LLETA KVQAIVSENAEY  SD KN  +  R + +RRQGSKLI+ L DFLNTINDL  LLED
Subjt:  CHLKSNGEKATNNAEPKEQMMAKNLLETAAKVQAIVSENAEYVASDAKNVGH--RIDLVRRQGSKLIRSLGDFLNTINDLSGLLED

XP_022139752.1 trihelix transcription factor ASR3 [Momordica charantia]; e value: 9.4e-125; % identity: 81.05
Query:  MKKGDGNRGPGVSGSRRTRSQIEPDWTAADCLVLVNVIAAVEADCLKALSSYQKWKIVAENCTSLDVVRNSNQCRRKWDCLLIEHDVIKQWELEMPDDDS
        MK+ DGNRG GVSGSRRTRSQI PDWTAA+CLVLVNVIAAVEADCLKALSSYQKWKIVAENCTSLDV R SNQCRRKWDCLLIEHDVIKQWELEMPDDDS
Subjt:  MKKGDGNRGPGVSGSRRTRSQIEPDWTAADCLVLVNVIAAVEADCLKALSSYQKWKIVAENCTSLDVVRNSNQCRRKWDCLLIEHDVIKQWELEMPDDDS

Query:  YWCLESGRRKELGLPDNFDEELFKAIDNVATMRENQSDTEPDSDPEAAVEIVDEVAEPGPKRQRRRSMSKRNQVLEKSLKCEEEEEEKPPLSFPEVEPRE
        YW LESGRRKELGLP+NFD+ELFKAIDNVATMR NQSDTEPDSDPEA VE++DE++EPGPKRQRRRS+SKR+Q LEKSL+CEEE+EEKPPL+ PE EPRE
Subjt:  YWCLESGRRKELGLPDNFDEELFKAIDNVATMRENQSDTEPDSDPEAAVEIVDEVAEPGPKRQRRRSMSKRNQVLEKSLKCEEEEEEKPPLSFPEVEPRE

Query:  CHLKSNGEKATNNAE-PKEQMMAKNLLETAAKVQAIVSENAEYVASDAKNVGHRIDLVRRQGSKLIRSLGDFLNTINDLSGLLED
        C +KSNGEK  ++ E  +EQMM K LLE   ++QAIVSENAEY  SD KN  HRID VRRQG+ LIR LGD LN INDL GL ED
Subjt:  CHLKSNGEKATNNAE-PKEQMMAKNLLETAAKVQAIVSENAEYVASDAKNVGHRIDLVRRQGSKLIRSLGDFLNTINDLSGLLED

XP_023524323.1 trihelix transcription factor ASR3-like [Cucurbita pepo subsp. pepo]; e value: 6.3e-121; % identity: 80.21
Query:  MKKGDGNRGPGVSGSRRTRSQIEPDWTAADCLVLVNVIAAVEADCLKALSSYQKWKIVAENCTSLDVVRNSNQCRRKWDCLLIEHDVIKQWELEMPDDDS
        MK+ DG RG  VSGSRRTRS+I PDWTAADCLVLVNVIAAVEADC KALSS+QKWKIVAENCTSLDV RNSNQCRRKWDCLLIEHDVI+QWELEMPDDDS
Subjt:  MKKGDGNRGPGVSGSRRTRSQIEPDWTAADCLVLVNVIAAVEADCLKALSSYQKWKIVAENCTSLDVVRNSNQCRRKWDCLLIEHDVIKQWELEMPDDDS

Query:  YWCLESGRRKELGLPDNFDEELFKAIDNVATMRENQSDTEPDSDPEAAVEIVDEVAEPGPKRQRRRSMSKRNQVLEKSLKCEEEEEEKPPLSFPEVEPRE
        YWCLES RRKELGLPDNFDEELFKAIDNV +MR NQSDTEPDSDPEAAVE VDE AEPGPKRQRR SMS RNQ LEKS+KCE EEEE+PP+S PEVE R 
Subjt:  YWCLESGRRKELGLPDNFDEELFKAIDNVATMRENQSDTEPDSDPEAAVEIVDEVAEPGPKRQRRRSMSKRNQVLEKSLKCEEEEEEKPPLSFPEVEPRE

Query:  CHLKSNGEKATNNAEPKEQMMAKNLLETAAKVQAIVSENAEYVASDAK----NVGHRIDLVRRQGSKLIRSLGDFLNTINDLSGLLED
        C++KS+GEK T++ EP+EQ MAK LLETA KVQAIVSENAEY  SD K    N   R + +RRQGSKLI+ L DFLNTINDL  LLED
Subjt:  CHLKSNGEKATNNAEPKEQMMAKNLLETAAKVQAIVSENAEYVASDAK----NVGHRIDLVRRQGSKLIRSLGDFLNTINDLSGLLED

XP_038897371.1 trihelix transcription factor ASR3 [Benincasa hispida]; e value: 3.8e-126; % identity: 80.33
Query:  KKGDGNRGPGVSGSRRTRSQI--EPDWTAADCLVLVNVIAAVEADCLKALSSYQKWKIVAENCTSLDVVRNSNQCRRKWDCLLIEHDVIKQWELEMPDDD
        K+  GNRG GVSGSRRTRSQI   PDWTAADCLVLVNVIAAVEADCLKALSSYQKWKIVAENCTSLDVVR SNQCRRKWDCLLIEHDVIKQWEL+MP+DD
Subjt:  KKGDGNRGPGVSGSRRTRSQI--EPDWTAADCLVLVNVIAAVEADCLKALSSYQKWKIVAENCTSLDVVRNSNQCRRKWDCLLIEHDVIKQWELEMPDDD

Query:  SYWCLESGRRKELGLPDNFDEELFKAIDNVATMRENQSDTEPDSDPEAAVEIVDEVAEPGPKRQRRRSMSKRNQVLEKSLKCE-----------------
        SYWCLESGRRKELGLPDNFDEELFKAIDNVATMR NQSDTEPDSDPEAAVE +DE+AEPGPKRQRRRSMSK NQVLEKSL+CE                 
Subjt:  SYWCLESGRRKELGLPDNFDEELFKAIDNVATMRENQSDTEPDSDPEAAVEIVDEVAEPGPKRQRRRSMSKRNQVLEKSLKCE-----------------

Query:  ---EEEEEKPPLSFPEVEPRECHLKSNGEKATNNAEPKEQMMAKNLLETAAKVQAIVSENAEYVASDAKNVGHRIDLVRRQGSKLIRSLGDFLNTINDLS
           EEEEEKP LSFPEVEPREC++K+NG K T+N EPKEQMMAK LLE A KVQAIVSENAEY  SD KN   + +LVR QGSKLIR LGD LNTINDL 
Subjt:  ---EEEEEKPPLSFPEVEPRECHLKSNGEKATNNAEPKEQMMAKNLLETAAKVQAIVSENAEYVASDAKNVGHRIDLVRRQGSKLIRSLGDFLNTINDLS

Query:  GLLED
        GLLED
Subjt:  GLLED

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LDW0 Myb-like domain-containing protein; e value: 7.5e-120; % identity: 76.87
Query:  KKGDGNRGPGVSGSRRTRSQI--EPDWTAADCLVLVNVIAAVEADCLKALSSYQKWKIVAENCTSLDVVRNSNQCRRKWDCLLIEHDVIKQWELEMPDDD
        K+  GNRG GVSGSRRTRSQI   P WTAADCLVLVNVIAAVEADCLKALSSYQKWKIVAENCTSLDVVR SNQCRRKWDCLLIEHDVIKQWEL+MPDDD
Subjt:  KKGDGNRGPGVSGSRRTRSQI--EPDWTAADCLVLVNVIAAVEADCLKALSSYQKWKIVAENCTSLDVVRNSNQCRRKWDCLLIEHDVIKQWELEMPDDD

Query:  SYWCLESGRRKELGLPDNFDEELFKAIDNVATMRENQSDTEPDSDPEAAVEIVDEVAEPGPKRQRRRSMSKRNQVLEKSLKCE-----------------
        SYWCL SGRRKELGLP+NFDEELFKAIDNVA+MR NQSDTEPDSDPEAA+   DE+AEPGPKRQRRRSMSK NQVLEKSL+CE                 
Subjt:  SYWCLESGRRKELGLPDNFDEELFKAIDNVATMRENQSDTEPDSDPEAAVEIVDEVAEPGPKRQRRRSMSKRNQVLEKSLKCE-----------------

Query:  -----EEEEEKPPLSFPEVEPRECHLKSNGEKATNNAEPKEQMMAKNLLETAAKVQAIVSENAEYVASDAKNVGHRIDLVRRQGSKLIRSLGDFLNTIND
             EE EEKP LS PE+EPREC++KSN  K T+N EPKEQMMAK LLE A KVQAIVSENAEY  SD K    + +LVR QGSKLIR LGD LNTIND
Subjt:  -----EEEEEKPPLSFPEVEPRECHLKSNGEKATNNAEPKEQMMAKNLLETAAKVQAIVSENAEYVASDAKNVGHRIDLVRRQGSKLIRSLGDFLNTIND

Query:  LSGLLED
        L GLLED
Subjt:  LSGLLED

A0A6J1CEU7 trihelix transcription factor ASR3; e value: 4.5e-125; % identity: 81.05
Query:  MKKGDGNRGPGVSGSRRTRSQIEPDWTAADCLVLVNVIAAVEADCLKALSSYQKWKIVAENCTSLDVVRNSNQCRRKWDCLLIEHDVIKQWELEMPDDDS
        MK+ DGNRG GVSGSRRTRSQI PDWTAA+CLVLVNVIAAVEADCLKALSSYQKWKIVAENCTSLDV R SNQCRRKWDCLLIEHDVIKQWELEMPDDDS
Subjt:  MKKGDGNRGPGVSGSRRTRSQIEPDWTAADCLVLVNVIAAVEADCLKALSSYQKWKIVAENCTSLDVVRNSNQCRRKWDCLLIEHDVIKQWELEMPDDDS

Query:  YWCLESGRRKELGLPDNFDEELFKAIDNVATMRENQSDTEPDSDPEAAVEIVDEVAEPGPKRQRRRSMSKRNQVLEKSLKCEEEEEEKPPLSFPEVEPRE
        YW LESGRRKELGLP+NFD+ELFKAIDNVATMR NQSDTEPDSDPEA VE++DE++EPGPKRQRRRS+SKR+Q LEKSL+CEEE+EEKPPL+ PE EPRE
Subjt:  YWCLESGRRKELGLPDNFDEELFKAIDNVATMRENQSDTEPDSDPEAAVEIVDEVAEPGPKRQRRRSMSKRNQVLEKSLKCEEEEEEKPPLSFPEVEPRE

Query:  CHLKSNGEKATNNAE-PKEQMMAKNLLETAAKVQAIVSENAEYVASDAKNVGHRIDLVRRQGSKLIRSLGDFLNTINDLSGLLED
        C +KSNGEK  ++ E  +EQMM K LLE   ++QAIVSENAEY  SD KN  HRID VRRQG+ LIR LGD LN INDL GL ED
Subjt:  CHLKSNGEKATNNAE-PKEQMMAKNLLETAAKVQAIVSENAEYVASDAKNVGHRIDLVRRQGSKLIRSLGDFLNTINDLSGLLED

A0A6J1FMB3 trihelix transcription factor ASR3-like; e value: 1.5e-120; % identity: 80.77
Query:  MKKGDGNRGPGVSGSRRTRSQIEPDWTAADCLVLVNVIAAVEADCLKALSSYQKWKIVAENCTSLDVVRNSNQCRRKWDCLLIEHDVIKQWELEMPDDDS
        MK+ DG RG  VSGSRRTRS+I PDWTAADCLVLVNVIAAVEADC KALSS+QKWKIVAENCTSLDV RNSNQCRRKWDCLLIEHDVIKQWELEMPDDDS
Subjt:  MKKGDGNRGPGVSGSRRTRSQIEPDWTAADCLVLVNVIAAVEADCLKALSSYQKWKIVAENCTSLDVVRNSNQCRRKWDCLLIEHDVIKQWELEMPDDDS

Query:  YWCLESGRRKELGLPDNFDEELFKAIDNVATMRENQSDTEPDSDPEAAVEIVDEVAEPGPKRQRRRSMSKRNQVLEKSLKCEEEEEEKPPLSFPEVEPRE
        YWCLESGRRKELGLPDNFDEE+FKAIDNV +MR NQSDTEPDSDPEAAVE VDE AEPGPKRQRR SMS RNQ LEKS+KCEEE+EE P +S PEVE R 
Subjt:  YWCLESGRRKELGLPDNFDEELFKAIDNVATMRENQSDTEPDSDPEAAVEIVDEVAEPGPKRQRRRSMSKRNQVLEKSLKCEEEEEEKPPLSFPEVEPRE

Query:  CHLKSNGEKATNNAEPKEQMMAKNLLETAAKVQAIVSENAEYVASDAKNVGH--RIDLVRRQGSKLIRSLGDFLNTINDLSGLLED
        C++KS+GEKAT++ EP+EQ MAK LLETA KVQAIVSENAEY  SD KN  +  R + +R QGSKLI+ L DFLNTINDL  LLED
Subjt:  CHLKSNGEKATNNAEPKEQMMAKNLLETAAKVQAIVSENAEYVASDAKNVGH--RIDLVRRQGSKLIRSLGDFLNTINDLSGLLED

A0A6J1IN02 trihelix transcription factor ASR3-like; e value: 2.3e-116; % identity: 75.52
Query:  MKKGDGNRGPGVSGSRRTRSQIEPDWTAADCLVLVNVIAAVEADCLKALSSYQKWKIVAENCTSLDVVRNSNQCRRKWDCLLIEHDVIKQWELEMPDDDS
        MKK +GNRG GVSGSRRTRSQI P+WTAA+CLVLVNVI AVEADC+KALSSYQKWKIVAE+CT+L+V R SNQCR+KW+CLLIEHDVI+QWEL MP+DDS
Subjt:  MKKGDGNRGPGVSGSRRTRSQIEPDWTAADCLVLVNVIAAVEADCLKALSSYQKWKIVAENCTSLDVVRNSNQCRRKWDCLLIEHDVIKQWELEMPDDDS

Query:  YWCLESGRRKELGLPDNFDEELFKAIDNVATMRENQSDTEPDSDPEAAVEIVDEVAEPGPKRQRRRSMSKRNQVLEKSLKC----EEEEEEKPPLSFPEV
        YWCLESGRRKELGLPDNFDEELFKAI NV++MR NQSDTEPD+DPEAAVE  DE++EPGPKRQRR SMSKRNQ LEKSL+C    EEE EE+P LS PE 
Subjt:  YWCLESGRRKELGLPDNFDEELFKAIDNVATMRENQSDTEPDSDPEAAVEIVDEVAEPGPKRQRRRSMSKRNQVLEKSLKC----EEEEEEKPPLSFPEV

Query:  EPRECHLKSNGEKATNNAEPKEQMMAKNLLETAAKVQAIVSENAEYVASDAKNVGHRIDLVRRQGSKLIRSLGDFLNTINDLSGLLEDYK
        + R+C++K+NG KAT++ EP+EQMM K LLE A  VQ IVSENAE V SD KN   + +L+RRQGSKLIR LGDFLNTINDL  LLED++
Subjt:  EPRECHLKSNGEKATNNAEPKEQMMAKNLLETAAKVQAIVSENAEYVASDAKNVGHRIDLVRRQGSKLIRSLGDFLNTINDLSGLLEDYK

A0A6J1J3M2 trihelix transcription factor ASR3; e value: 4.1e-118; % identity: 77.97
Query:  MKKGDGNRGPGVSGSRRTRSQIEPDWTAADCLVLVNVIAAVEADCLKALSSYQKWKIVAENCTSLDVVRNSNQCRRKWDCLLIEHDVIKQWELEMPDDDS
        MK+ DG RG  VSGSRRTRS+I PDWTAA+CLVLVNVIAAVEADC KALSS+QKWKIVAENCTSLDV RNSNQCRRKWDCLLIEHDVIKQWELEMPDDDS
Subjt:  MKKGDGNRGPGVSGSRRTRSQIEPDWTAADCLVLVNVIAAVEADCLKALSSYQKWKIVAENCTSLDVVRNSNQCRRKWDCLLIEHDVIKQWELEMPDDDS

Query:  YWCLESGRRKELGLPDNFDEELFKAIDNVATMRENQSDTEPDSDPEAAVEIVDEVAEPGPKRQRRRSMSKRNQVLEKSLKC-------EEEEEEKPPLSF
        YWCLESGRRKELGLPDNFDEELFKAIDNV  MR NQSDTEPDSDPEAAVE VDE AEPGPKRQRR SMS RNQ LEKS+KC       EEEEEE+P +S 
Subjt:  YWCLESGRRKELGLPDNFDEELFKAIDNVATMRENQSDTEPDSDPEAAVEIVDEVAEPGPKRQRRRSMSKRNQVLEKSLKC-------EEEEEEKPPLSF

Query:  PEVEPRECHLKSNGEKATNNAEPKEQMMAKNLLETAAKVQAIVSENAEYVAS----DAKNVGHRIDLVRRQGSKLIRSLGDFLNTINDLSGLLED
        PEVE R C++KS+GEKAT+N EP+EQ M K LLETA KVQAIVSENA+Y  S    D  N  +R + +RRQGSKLI+ L DFLNTINDL  L ED
Subjt:  PEVEPRECHLKSNGEKATNNAEPKEQMMAKNLLETAAKVQAIVSENAEYVAS----DAKNVGHRIDLVRRQGSKLIRSLGDFLNTINDLSGLLED

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G31310.1 hydroxyproline-rich glycoprotein family protein; e value: 5.5e-06; % identity: 26.55
Query:  KWKIVAENCTSLDVVRNSNQCRRKWDCLLIEHDVIKQWELEMPDDD-----------------SYWCLESGRRKELGLPDNFDEELFKAIDNVATMRENQ
        +WK + + C     +R+ NQC  KWD L+ ++  ++++E    +                   SYW +E   RKE  LP N   + ++A+  V      +
Subjt:  KWKIVAENCTSLDVVRNSNQCRRKWDCLLIEHDVIKQWELEMPDDD-----------------SYWCLESGRRKELGLPDNFDEELFKAIDNVATMRENQ

Query:  SDTEPDSDPEAAV
        S T P S    AV
Subjt:  SDTEPDSDPEAAV

AT2G35640.1 Homeodomain-like superfamily protein; e value: 6.5e-07; % identity: 26.57
Query:  DWTAADCLVLVNVIAAVEADCLKALSSYQK------------WKIVAENCTSLDVVRNSNQCRRKWDCLLIEHDVIKQWELEMPD-------DDSYWCLE
        +WT ++ LVL   I A + D  + +   +K            WK + E C      RN NQC  KWD L+ ++  I+++E    +         SYW ++
Subjt:  DWTAADCLVLVNVIAAVEADCLKALSSYQK------------WKIVAENCTSLDVVRNSNQCRRKWDCLLIEHDVIKQWELEMPD-------DDSYWCLE

Query:  SGRRKELGLPDNFDEELFKAIDNVATMRENQSDTEPDSDPEAA
           RKE  LP N   +++  +  +   +     T P S   AA
Subjt:  SGRRKELGLPDNFDEELFKAIDNVATMRENQSDTEPDSDPEAA

AT4G31270.1 sequence-specific DNA binding transcription factors; e value: 6.4e-47; % identity: 41.34
Query:  GVSGSRRTRSQIEPDWTAADCLVLVNVIAAVEADCLKALSSYQKWKIVAENCTSLDVVRNSNQCRRKWDCLLIEHDVIKQWELEMPDDD-SYWCLESGRR
        G SGSRRTRSQ+ P+W   DCLVLVN IAAVEADC  ALSS+QKW ++ ENC +LDV RN NQCRRKWD L+ +++ IK+WE +      SYW L S +R
Subjt:  GVSGSRRTRSQIEPDWTAADCLVLVNVIAAVEADCLKALSSYQKWKIVAENCTSLDVVRNSNQCRRKWDCLLIEHDVIKQWELEMPDDD-SYWCLESGRR

Query:  KELGLPDNFDEELFKAIDNVATMRENQSDTEPDSDPEA--AVEIVDEVAEPGPKRQRRRSMSKRNQVLE--KSLKCEEEEEEKP--------PLSFPEVE
        K L LP + D ELF+AI+ V  +++ ++ TE DSDPEA   V++  E+A  G KR R+R+M  +    E  ++ + +    EKP          +  E +
Subjt:  KELGLPDNFDEELFKAIDNVATMRENQSDTEPDSDPEA--AVEIVDEVAEPGPKRQRRRSMSKRNQVLE--KSLKCEEEEEEKP--------PLSFPEVE

Query:  PRECHLKSNGEKATNNAEPKEQMMAKNLLETAAKVQAIVSEN--AEYVASDAKNVGHRIDLVRRQGSKLIRSLGDFLNTINDL
        P E       E  T N E   ++M   L      + AIV  N   +    D  ++  ++  VR+QG +LI  L + ++T+N L
Subjt:  PRECHLKSNGEKATNNAEPKEQMMAKNLLETAAKVQAIVSEN--AEYVASDAKNVGHRIDLVRRQGSKLIRSLGDFLNTINDL


Sequences
CDS sequence
ATGAAGAAGGGGGACGGCAACCGTGGACCGGGGGTTTCAGGTTCTCGTCGGACGCGGTCTCAGATAGAACCGGATTGGACGGCGGCGGATTGCCTCGTTCTTGTTAATGT
GATTGCGGCTGTGGAGGCCGATTGTTTGAAAGCTTTGTCTAGCTATCAGAAATGGAAGATTGTTGCTGAGAACTGCACGTCTTTAGATGTGGTTCGGAATTCGAATCAGT
GCAGGAGGAAGTGGGACTGTTTGCTGATTGAACATGATGTTATCAAGCAATGGGAGTTGGAGATGCCGGATGATGATTCGTATTGGTGTTTGGAGAGTGGAAGGAGAAAG
GAATTGGGACTTCCTGATAACTTTGACGAGGAGCTGTTCAAAGCAATTGATAATGTTGCCACAATGAGGGAAAATCAGTCGGATACCGAGCCAGATAGCGATCCCGAGGC
TGCAGTTGAGATTGTTGATGAAGTTGCAGAGCCTGGCCCTAAAAGACAAAGACGGCGTTCAATGTCTAAGAGAAATCAAGTCCTTGAGAAATCTTTAAAATGTGAAGAAG
AAGAAGAAGAAAAACCTCCATTGAGCTTTCCCGAAGTAGAGCCTCGTGAATGCCACCTCAAAAGCAACGGAGAAAAGGCAACCAATAATGCTGAACCCAAAGAGCAAATG
ATGGCGAAGAATTTGCTTGAAACTGCAGCAAAAGTTCAAGCAATTGTGTCTGAGAATGCAGAATATGTGGCTTCTGATGCAAAGAACGTCGGCCACCGAATTGATTTGGT
AAGGCGTCAAGGGAGCAAGCTTATCAGATCCCTTGGAGATTTTCTCAACACCATTAATGATCTCAGTGGCCTGCTCGAAGATTACAAACATGTCAAGTTGGCCAATTCGA
GCAACAGCAAAACGTTGCCGACATTTTCCAGGTTACAGATTAACCCAATCCAATGTTTATTTATGAAACCCCTTCTGTCTTATCTCTCTGTTATTCTCTGTTATCATTAT
TATTTGGTTAAAATATTGTTTTGGTCCGTGTAA
mRNA sequence
GAATGGGACATTTGTGCGTTCTTCTATTCTACCGCGAAGATTGTGCATCGAAGCGAATGGGCAATGCAGAAGGTTATGGCTTCATCTCAAATTGATAACGCGTTTGGATT
GAAATAGAGAACTGAAGAATCTCGAAACTCGAAACGAAGAACATTTTAGTGCGAAATGAAGAAGGGGGACGGCAACCGTGGACCGGGGGTTTCAGGTTCTCGTCGGACGC
GGTCTCAGATAGAACCGGATTGGACGGCGGCGGATTGCCTCGTTCTTGTTAATGTGATTGCGGCTGTGGAGGCCGATTGTTTGAAAGCTTTGTCTAGCTATCAGAAATGG
AAGATTGTTGCTGAGAACTGCACGTCTTTAGATGTGGTTCGGAATTCGAATCAGTGCAGGAGGAAGTGGGACTGTTTGCTGATTGAACATGATGTTATCAAGCAATGGGA
GTTGGAGATGCCGGATGATGATTCGTATTGGTGTTTGGAGAGTGGAAGGAGAAAGGAATTGGGACTTCCTGATAACTTTGACGAGGAGCTGTTCAAAGCAATTGATAATG
TTGCCACAATGAGGGAAAATCAGTCGGATACCGAGCCAGATAGCGATCCCGAGGCTGCAGTTGAGATTGTTGATGAAGTTGCAGAGCCTGGCCCTAAAAGACAAAGACGG
CGTTCAATGTCTAAGAGAAATCAAGTCCTTGAGAAATCTTTAAAATGTGAAGAAGAAGAAGAAGAAAAACCTCCATTGAGCTTTCCCGAAGTAGAGCCTCGTGAATGCCA
CCTCAAAAGCAACGGAGAAAAGGCAACCAATAATGCTGAACCCAAAGAGCAAATGATGGCGAAGAATTTGCTTGAAACTGCAGCAAAAGTTCAAGCAATTGTGTCTGAGA
ATGCAGAATATGTGGCTTCTGATGCAAAGAACGTCGGCCACCGAATTGATTTGGTAAGGCGTCAAGGGAGCAAGCTTATCAGATCCCTTGGAGATTTTCTCAACACCATT
AATGATCTCAGTGGCCTGCTCGAAGATTACAAACATGTCAAGTTGGCCAATTCGAGCAACAGCAAAACGTTGCCGACATTTTCCAGGTTACAGATTAACCCAATCCAATG
TTTATTTATGAAACCCCTTCTGTCTTATCTCTCTGTTATTCTCTGTTATCATTATTATTTGGTTAAAATATTGTTTTGGTCCGTGTAATTAGAAAGTACGCATTCCATTT
TAGTTTTTATACTTTC
Protein sequence
MKKGDGNRGPGVSGSRRTRSQIEPDWTAADCLVLVNVIAAVEADCLKALSSYQKWKIVAENCTSLDVVRNSNQCRRKWDCLLIEHDVIKQWELEMPDDDSYWCLESGRRK
ELGLPDNFDEELFKAIDNVATMRENQSDTEPDSDPEAAVEIVDEVAEPGPKRQRRRSMSKRNQVLEKSLKCEEEEEEKPPLSFPEVEPRECHLKSNGEKATNNAEPKEQM
MAKNLLETAAKVQAIVSENAEYVASDAKNVGHRIDLVRRQGSKLIRSLGDFLNTINDLSGLLEDYKHVKLANSSNSKTLPTFSRLQINPIQCLFMKPLLSYLSVILCYHY
YLVKILFWSV