CuGenDBv2

CaUC02G037720 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene ID: CaUC02G037720
Organism: Citrullus amarus (Watermelon (USVL246-FR2) v1)
Description: homeobox protein 6
Genome location: Ciama_Chr02:23974352..23975501
RNA-Seq Expression: CaUC02G037720
Synteny: CaUC02G037720
Gene Ontology terms: GO:0003677 - DNA binding (molecular function)
InterPro domains: NA
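
The genome location in this record follows a seqid:start..end pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not say), the string can be split into its parts in Python. The names parse_location and span_length are hypothetical helpers for illustration, not part of CuGenDB.

```python
import re

# Pattern for strings such as "Ciama_Chr02:23974352..23975501".
LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(location):
    """Split 'seqid:start..end' into (seqid, start, end) with integer coordinates."""
    match = LOCATION_RE.match(location.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match["seqid"], int(match["start"]), int(match["end"])


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("Ciama_Chr02:23974352..23975501")
print(seqid, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the gene spans 1150 bp on Ciama_Chr02; with a 0-based or half-open convention the length would differ by one.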


Homology
GenBank top hits | e-value | % identity | alignment
KAG6605199.1 hypothetical protein SDJN03_02516, partial [Cucurbita argyrosperma subsp. sororia] | 4.2e-81 | 64.17
Query:  MGRSCHGDEIWETEIEEEEEEEEALSFCDLPVKEKQQAMRTSSAAVETEEFDFKQWRP---APPMLAADELFFQGQMLPLRLSFSSENRQSNINELFGGN
        M  + HG E  + + +E+E+EEEALS CDLPVKEKQQ +  +   + TE+FDF  W P    PPM AAD++FFQG +LPLRLS SS+N     N  F   
Subjt:  MGRSCHGDEIWETEIEEEEEEEEALSFCDLPVKEKQQAMRTSSAAVETEEFDFKQWRP---APPMLAADELFFQGQMLPLRLSFSSENRQSNINELFGGN

Query:  LWNRSESMDHNMLRFRNGSCSSS--SSRSHYSRSSSHSNNSVSIPTNSKPRTQKNVFHSHPSPTPQIRSFSTSSHRSRSRSSSRWEFFRLGLLRTPGMEL
        L  RSESMDHNMLRFRNGS SSS  SSRSHYSR SS SNNS+SIPTNSKPRTQ NVFHSHPSPTPQIRSFST   RSRSRSSSRW+FFR+GLLRTPGMEL
Subjt:  LWNRSESMDHNMLRFRNGSCSSS--SSRSHYSRSSSHSNNSVSIPTNSKPRTQKNVFHSHPSPTPQIRSFSTSSHRSRSRSSSRWEFFRLGLLRTPGMEL

Query:  QDLKTRTTTTAATGTAAHKTTASILGVVSCKKSVDAVPA-------TAAKNRIRNEKVLNNNNNNNNNNNSVEIREKEKEKERRVSHRRTFEWLKQLSHA
         DLKTRTT T A  T   KTTAS LGVVSCKKSV+ +PA       +    + RNEK           N++VEIREKEKEK  R+SHRRTFEWLKQLSHA
Subjt:  QDLKTRTTTTAATGTAAHKTTASILGVVSCKKSVDAVPA-------TAAKNRIRNEKVLNNNNNNNNNNNSVEIREKEKEKERRVSHRRTFEWLKQLSHA

Query:  -TFGEEQ
         TF ++Q
Subjt:  -TFGEEQ

XP_008457772.1 PREDICTED: homeobox protein 6 [Cucumis melo] | 7.7e-99 | 73.97
Query:  MKMMG-RSCHGDE-------IWETEI-EEEEEEEEALSFCDLPVKEKQQAMRTSSAA-VETEEFDFKQWRPAP-PMLAADELFFQGQMLPLRLSFSSENR
        MK++G RSCHGDE         ET+I ++++EEEEALS CDLPVKEKQQ  R+ SA  VETE+FDF  WRP P PML AD+LFFQG MLPLRLSFSSEN 
Subjt:  MKMMG-RSCHGDE-------IWETEI-EEEEEEEEALSFCDLPVKEKQQAMRTSSAA-VETEEFDFKQWRPAP-PMLAADELFFQGQMLPLRLSFSSENR

Query:  QSNINELFGGNLWNRSESMD-HNMLRFRNGSCSSSSSRSHYSRSSSHSNNSVSIP-TNSKPR-TQKNVFHSHPSPTPQIRSFSTSSHRSRSRSSSRWEFF
        Q+N      GNLW+RSESMD +NMLRFRNGS SSSSSRSHYSRSSS SNNS+SIP TN+KPR +  NVFHSHPSPTPQIRSFSTSSH  RSRSSSRWEFF
Subjt:  QSNINELFGGNLWNRSESMD-HNMLRFRNGSCSSSSSRSHYSRSSSHSNNSVSIP-TNSKPR-TQKNVFHSHPSPTPQIRSFSTSSHRSRSRSSSRWEFF

Query:  RLGLLRTPGMELQDLKTRTTTTAATGTAAHKTTASILGVVSCKKSVDAVPAT--AAKNRIRNEKVLNNNNNNNNNNNSVEIR--EKEKEKERRVSHRRTF
        RLGLLRTPGMEL DLKTRTTTT  T    HKTTASILGVVSCK+SVD VP T  ++ NRIR E VL       NNNN VEIR  EKEKEKERRVSHRRTF
Subjt:  RLGLLRTPGMELQDLKTRTTTTAATGTAAHKTTASILGVVSCKKSVDAVPAT--AAKNRIRNEKVLNNNNNNNNNNNSVEIR--EKEKEKERRVSHRRTF

Query:  EWLKQLSHATFGEEQ
        EWLKQLSHATFGEEQ
Subjt:  EWLKQLSHATFGEEQ

XP_011649316.1 homeobox protein 6 [Cucumis sativus] | 4.1e-100 | 74.53
Query:  MKMMG-RSCHGDEIWETE-----------IEEEEEEEEALSFCDLPVKEKQQAMRT-SSAAVET-EEFDFKQWRPAP-PMLAADELFFQGQMLPLRLSFS
        MK++G RSCHGDE  E E            EEEEEEEEALS CDLPVKEKQQ  R+ S+  VET ++FDF  WRP P PML AD+LFFQG MLPLRLSFS
Subjt:  MKMMG-RSCHGDEIWETE-----------IEEEEEEEEALSFCDLPVKEKQQAMRT-SSAAVET-EEFDFKQWRPAP-PMLAADELFFQGQMLPLRLSFS

Query:  SENRQSNINELFGGNLWNRSESMD-HNMLRFRNGSCSSSSSRSHYSRSSSHSNNSVSIPTNSKPR-TQKNVFHSHPSPTPQIRSFSTSSHRSRSRSSSRW
        SEN Q+N      GNLW RSESMD +NMLRFRN S SSSSSRSHYSRSSS SNNS+SIPTNSKPR +  NVFHSHPSPTPQIRSFSTSSH  RSRSSSRW
Subjt:  SENRQSNINELFGGNLWNRSESMD-HNMLRFRNGSCSSSSSRSHYSRSSSHSNNSVSIPTNSKPR-TQKNVFHSHPSPTPQIRSFSTSSHRSRSRSSSRW

Query:  EFFRLGLLRTPGMELQDLKTRTTTTAATGT---AAHKTTASILGVVSCKKSVDAVP-ATAAKNRIRNEKVLNNNNNNNNNNNSVEIREKEKEKERRVSHR
        EFFRLGLLRTPGMEL DLKTRTTTT  T T    AHKTTASILGVVSCK+SV+ VP  T +KNRIR E VL  NN  NN++N VEIREKEKEKERRVSHR
Subjt:  EFFRLGLLRTPGMELQDLKTRTTTTAATGT---AAHKTTASILGVVSCKKSVDAVP-ATAAKNRIRNEKVLNNNNNNNNNNNSVEIREKEKEKERRVSHR

Query:  RTFEWLKQLSHATFGEEQ
        RTFEWLKQLSHATFGEEQ
Subjt:  RTFEWLKQLSHATFGEEQ

XP_022947585.1 uncharacterized protein LOC111451408 [Cucurbita moschata] | 1.9e-81 | 64.38
Query:  MGRSCHGDEIWETEIEEEEEEEEALSFCDLPVKEKQQAMRTSSAAVETEEFDFKQWRP---APPMLAADELFFQGQMLPLRLSFSSENRQSNINELFGGN
        M  + HG+E      +++E+EEEALS CDLPVKEKQQ    +   + TE+FDF  W P    PPM AAD++FFQG +LPLRLS SS+N     N  F   
Subjt:  MGRSCHGDEIWETEIEEEEEEEEALSFCDLPVKEKQQAMRTSSAAVETEEFDFKQWRP---APPMLAADELFFQGQMLPLRLSFSSENRQSNINELFGGN

Query:  LWNRSESMDHNMLRFRNGSCSSS--SSRSHYSRSSSHSNNSVSIPTNSKPRTQKNVFHSHPSPTPQIRSFSTSSHRSRSRSSSRWEFFRLGLLRTPGMEL
        L  RSESMDHNMLRFRNGS SSS  SSRSHYSR SS SNNS+SIPTNSKPRTQ NVFHSHPSPTPQIRSFST   RSRSRSSSRW+FFR+GLLRTPGMEL
Subjt:  LWNRSESMDHNMLRFRNGSCSSS--SSRSHYSRSSSHSNNSVSIPTNSKPRTQKNVFHSHPSPTPQIRSFSTSSHRSRSRSSSRWEFFRLGLLRTPGMEL

Query:  QDLKTRTTTTAATGTAAHKTTASILGVVSCKKSVDAVPAT-AAKN------RIRNEKVLNNNNNNNNNNNSVEIREKEKEKERRVSHRRTFEWLKQLSHA
         DLKTRTT + A  T   KTTAS LGVVSCKKSV+ +PA    KN      + RNEK           N++VEIREKEKEK  R+SHRRTFEWLKQLSHA
Subjt:  QDLKTRTTTTAATGTAAHKTTASILGVVSCKKSVDAVPAT-AAKN------RIRNEKVLNNNNNNNNNNNSVEIREKEKEKERRVSHRRTFEWLKQLSHA

Query:  TFGEEQ
        TF ++Q
Subjt:  TFGEEQ

XP_038902148.1 uncharacterized protein LOC120088781 [Benincasa hispida] | 3.1e-108 | 79.47
Query:  MKMMGRSCHGDEIWETEIEEE---EEEEEALSFCDLPVKEKQQAMRTSSAAVETEEFDFKQWR-PAPPMLAADELFFQGQMLPLRLSFSSEN-RQSNINE
        MK+MGRS HGDE WE   +EE   EEEEEALSFCDLPVKEKQQ MR++SAAVETE+FDF  WR P PPM AADELFFQGQMLPLRLSFSSEN   +NI+ 
Subjt:  MKMMGRSCHGDEIWETEIEEE---EEEEEALSFCDLPVKEKQQAMRTSSAAVETEEFDFKQWR-PAPPMLAADELFFQGQMLPLRLSFSSEN-RQSNINE

Query:  LFGGNLWNRSESMDHNMLRFRNGSCSSSSSRSHYSRSSSHSNNSVSIPTNSKPRTQKNVFHSHPSPTPQIRSFSTSSHRSRSRSSSRWEFFRLGLLRTPG
        LFGGNLW RSESMDHNMLRF NGS SSSSSRSHYSRSSS SNNSVSIPTNSK R QKNVFHSHPSPTPQIRSFS SSH  RSRSSSRWEFFRLGLLRTPG
Subjt:  LFGGNLWNRSESMDHNMLRFRNGSCSSSSSRSHYSRSSSHSNNSVSIPTNSKPRTQKNVFHSHPSPTPQIRSFSTSSHRSRSRSSSRWEFFRLGLLRTPG

Query:  MELQDLKTRTTTTAATGTAAHKTTASILGVVSCKKSVDAVPATAAKNRIRNEKVLNNNNNNNNNNNSVEIREKEKEKERRVSHRRTFEWLKQLSHATFGE
        MEL DLKTR TTTAA  T  HKTT SILGVVSCK+SVD V  T AKNR RNE V  NN          E +EKEKEKERRVSHRRTFEWLKQLSHATFGE
Subjt:  MELQDLKTRTTTTAATGTAAHKTTASILGVVSCKKSVDAVPATAAKNRIRNEKVLNNNNNNNNNNNSVEIREKEKEKERRVSHRRTFEWLKQLSHATFGE

Query:  EQ
        EQ
Subjt:  EQ

TrEMBL top hits | e-value | % identity | alignment
A0A0A0LPT6 Uncharacterized protein | 2.0e-100 | 74.53
Query:  MKMMG-RSCHGDEIWETE-----------IEEEEEEEEALSFCDLPVKEKQQAMRT-SSAAVET-EEFDFKQWRPAP-PMLAADELFFQGQMLPLRLSFS
        MK++G RSCHGDE  E E            EEEEEEEEALS CDLPVKEKQQ  R+ S+  VET ++FDF  WRP P PML AD+LFFQG MLPLRLSFS
Subjt:  MKMMG-RSCHGDEIWETE-----------IEEEEEEEEALSFCDLPVKEKQQAMRT-SSAAVET-EEFDFKQWRPAP-PMLAADELFFQGQMLPLRLSFS

Query:  SENRQSNINELFGGNLWNRSESMD-HNMLRFRNGSCSSSSSRSHYSRSSSHSNNSVSIPTNSKPR-TQKNVFHSHPSPTPQIRSFSTSSHRSRSRSSSRW
        SEN Q+N      GNLW RSESMD +NMLRFRN S SSSSSRSHYSRSSS SNNS+SIPTNSKPR +  NVFHSHPSPTPQIRSFSTSSH  RSRSSSRW
Subjt:  SENRQSNINELFGGNLWNRSESMD-HNMLRFRNGSCSSSSSRSHYSRSSSHSNNSVSIPTNSKPR-TQKNVFHSHPSPTPQIRSFSTSSHRSRSRSSSRW

Query:  EFFRLGLLRTPGMELQDLKTRTTTTAATGT---AAHKTTASILGVVSCKKSVDAVP-ATAAKNRIRNEKVLNNNNNNNNNNNSVEIREKEKEKERRVSHR
        EFFRLGLLRTPGMEL DLKTRTTTT  T T    AHKTTASILGVVSCK+SV+ VP  T +KNRIR E VL  NN  NN++N VEIREKEKEKERRVSHR
Subjt:  EFFRLGLLRTPGMELQDLKTRTTTTAATGT---AAHKTTASILGVVSCKKSVDAVP-ATAAKNRIRNEKVLNNNNNNNNNNNSVEIREKEKEKERRVSHR

Query:  RTFEWLKQLSHATFGEEQ
        RTFEWLKQLSHATFGEEQ
Subjt:  RTFEWLKQLSHATFGEEQ

A0A1S3C7L1 homeobox protein 6 | 3.7e-99 | 73.97
Query:  MKMMG-RSCHGDE-------IWETEI-EEEEEEEEALSFCDLPVKEKQQAMRTSSAA-VETEEFDFKQWRPAP-PMLAADELFFQGQMLPLRLSFSSENR
        MK++G RSCHGDE         ET+I ++++EEEEALS CDLPVKEKQQ  R+ SA  VETE+FDF  WRP P PML AD+LFFQG MLPLRLSFSSEN 
Subjt:  MKMMG-RSCHGDE-------IWETEI-EEEEEEEEALSFCDLPVKEKQQAMRTSSAA-VETEEFDFKQWRPAP-PMLAADELFFQGQMLPLRLSFSSENR

Query:  QSNINELFGGNLWNRSESMD-HNMLRFRNGSCSSSSSRSHYSRSSSHSNNSVSIP-TNSKPR-TQKNVFHSHPSPTPQIRSFSTSSHRSRSRSSSRWEFF
        Q+N      GNLW+RSESMD +NMLRFRNGS SSSSSRSHYSRSSS SNNS+SIP TN+KPR +  NVFHSHPSPTPQIRSFSTSSH  RSRSSSRWEFF
Subjt:  QSNINELFGGNLWNRSESMD-HNMLRFRNGSCSSSSSRSHYSRSSSHSNNSVSIP-TNSKPR-TQKNVFHSHPSPTPQIRSFSTSSHRSRSRSSSRWEFF

Query:  RLGLLRTPGMELQDLKTRTTTTAATGTAAHKTTASILGVVSCKKSVDAVPAT--AAKNRIRNEKVLNNNNNNNNNNNSVEIR--EKEKEKERRVSHRRTF
        RLGLLRTPGMEL DLKTRTTTT  T    HKTTASILGVVSCK+SVD VP T  ++ NRIR E VL       NNNN VEIR  EKEKEKERRVSHRRTF
Subjt:  RLGLLRTPGMELQDLKTRTTTTAATGTAAHKTTASILGVVSCKKSVDAVPAT--AAKNRIRNEKVLNNNNNNNNNNNSVEIR--EKEKEKERRVSHRRTF

Query:  EWLKQLSHATFGEEQ
        EWLKQLSHATFGEEQ
Subjt:  EWLKQLSHATFGEEQ

A0A6J1F3H1 probable membrane-associated kinase regulator 1 | 2.8e-78 | 63.19
Query:  MGRSCHGDEIWETEIE-EEEEEEEALSFCDLPVKEKQQAMR------TSSAAVETEEFDFKQWRP--APPMLAADELFFQGQMLPLRLSFSSENRQSNIN
        MGRSCHGDE WE + + +E+E+EEALSFCDLP+KE Q  +        SSAAV++E+FDF    P    PM AADE+FFQG +LPLR SFSSEN  S+ N
Subjt:  MGRSCHGDEIWETEIE-EEEEEEEALSFCDLPVKEKQQAMR------TSSAAVETEEFDFKQWRP--APPMLAADELFFQGQMLPLRLSFSSENRQSNIN

Query:  ELFGGNLWNRSESMDHNMLRFRNGSCSSSSSRSHYSRSSSHSNNSVSIPTNSKPRTQKNVFHSHPSPTPQIRSFSTSSHRSRSRSSSRWEFFRLGLLRTP
          F  N   RSES D  MLRFRNGS SSSSSRSHYSRSSS SNNS+SIPTNSKPR   NVFHSHPSPTPQIRS STS    RSRSSSRW+FFRLGLLRTP
Subjt:  ELFGGNLWNRSESMDHNMLRFRNGSCSSSSSRSHYSRSSSHSNNSVSIPTNSKPRTQKNVFHSHPSPTPQIRSFSTSSHRSRSRSSSRWEFFRLGLLRTP

Query:  GMELQDLKTRT----TTTAATGTAAHKTTASILGVVSCKKSVDAVPATAAKNRIRNEKVLNNNNNNNNNNNSVEIREKEKEKERRVSHRRTFEWLKQLSH
        GMEL DLKTRT    + TAA   AAH T  S LGVVSCKKSVD V A   K R  N K                  E EKE+  RVSHRRTFEW+KQLSH
Subjt:  GMELQDLKTRT----TTTAATGTAAHKTTASILGVVSCKKSVDAVPATAAKNRIRNEKVLNNNNNNNNNNNSVEIREKEKEKERRVSHRRTFEWLKQLSH

Query:  ATFGEEQ
        A+ G+EQ
Subjt:  ATFGEEQ

A0A6J1G7B7 uncharacterized protein LOC111451408 | 9.2e-82 | 64.38
Query:  MGRSCHGDEIWETEIEEEEEEEEALSFCDLPVKEKQQAMRTSSAAVETEEFDFKQWRP---APPMLAADELFFQGQMLPLRLSFSSENRQSNINELFGGN
        M  + HG+E      +++E+EEEALS CDLPVKEKQQ    +   + TE+FDF  W P    PPM AAD++FFQG +LPLRLS SS+N     N  F   
Subjt:  MGRSCHGDEIWETEIEEEEEEEEALSFCDLPVKEKQQAMRTSSAAVETEEFDFKQWRP---APPMLAADELFFQGQMLPLRLSFSSENRQSNINELFGGN

Query:  LWNRSESMDHNMLRFRNGSCSSS--SSRSHYSRSSSHSNNSVSIPTNSKPRTQKNVFHSHPSPTPQIRSFSTSSHRSRSRSSSRWEFFRLGLLRTPGMEL
        L  RSESMDHNMLRFRNGS SSS  SSRSHYSR SS SNNS+SIPTNSKPRTQ NVFHSHPSPTPQIRSFST   RSRSRSSSRW+FFR+GLLRTPGMEL
Subjt:  LWNRSESMDHNMLRFRNGSCSSS--SSRSHYSRSSSHSNNSVSIPTNSKPRTQKNVFHSHPSPTPQIRSFSTSSHRSRSRSSSRWEFFRLGLLRTPGMEL

Query:  QDLKTRTTTTAATGTAAHKTTASILGVVSCKKSVDAVPAT-AAKN------RIRNEKVLNNNNNNNNNNNSVEIREKEKEKERRVSHRRTFEWLKQLSHA
         DLKTRTT + A  T   KTTAS LGVVSCKKSV+ +PA    KN      + RNEK           N++VEIREKEKEK  R+SHRRTFEWLKQLSHA
Subjt:  QDLKTRTTTTAATGTAAHKTTASILGVVSCKKSVDAVPAT-AAKN------RIRNEKVLNNNNNNNNNNNSVEIREKEKEKERRVSHRRTFEWLKQLSHA

Query:  TFGEEQ
        TF ++Q
Subjt:  TFGEEQ

A0A6J1KZ54 uncharacterized protein LOC111499574 | 4.3e-79 | 64.71
Query:  MGRSCHGDE--IWETEIEEEEEEEEALSFCDLPVKEKQQAMRTSSAAVETEEFDFKQWRPAPPMLAADELFFQGQMLPLRLSFSSENRQSNINELFGGNL
        M  + HGDE    + E EEEEEEEEALS CDLPVKEKQQ +      + TE+FDF  W P PPM AAD++FFQG +LPLRLS SS+N     N  F   L
Subjt:  MGRSCHGDE--IWETEIEEEEEEEEALSFCDLPVKEKQQAMRTSSAAVETEEFDFKQWRPAPPMLAADELFFQGQMLPLRLSFSSENRQSNINELFGGNL

Query:  WNRSESMDHNMLRFRNGSCSSS-SSRSHYSRSSSHSNNSVSIPTNSKPRTQKNVFHSHPSPTPQIRSFSTSSHRSRSRSSSRWEFFRLGLLRTPGMELQD
          RSESMDHNMLRFRNGS SSS SSRSHYSR SS SNNS+SIPTNSKPRTQ NVFHSHPSPTPQIRSFST       RSSSRW+FFR+GLLRTPGMEL D
Subjt:  WNRSESMDHNMLRFRNGSCSSS-SSRSHYSRSSSHSNNSVSIPTNSKPRTQKNVFHSHPSPTPQIRSFSTSSHRSRSRSSSRWEFFRLGLLRTPGMELQD

Query:  LKTRTTTTA---ATGTAAHKTTASILGVVSCKKSVDAVPAT-AAKN------RIRNEKVLNNNNNNNNNNNSVEIREKEKEKERRVSHRRTFEWLKQLSH
        LKTRTT  A   A  T   KT  + LGVVSCKKSVD +PA    KN      + RNEK        N  N++VEIREKEKEK  R+SHRRTFEWLKQLSH
Subjt:  LKTRTTTTA---ATGTAAHKTTASILGVVSCKKSVDAVPAT-AAKN------RIRNEKVLNNNNNNNNNNNSVEIREKEKEKERRVSHRRTFEWLKQLSH

Query:  ATFGEE
        ATF ++
Subjt:  ATFGEE

SwissProt top hits | e-value | % identity | alignment
No hits found

Arabidopsis top hits | e-value | % identity | alignment
AT5G67350.1 unknown protein | 9.9e-12 | 30.92
Query:  EEEEEEEALSFCDLPVKEKQQAMRTSSAAVETEEFD--------------FKQWRPAPPMLAADELFFQGQMLPLRLSFSSENRQSNINELFGGNLWNRS
        EEEEEEEALS CDLP    ++    S    E EEFD                   PAP M  ADELFF+G++LPLR S S +   + +NE     L  RS
Subjt:  EEEEEEEALSFCDLPVKEKQQAMRTSSAAVETEEFD--------------FKQWRPAPPMLAADELFFQGQMLPLRLSFSSENRQSNINELFGGNLWNRS

Query:  ESMDHNMLRFRNGSCSSSSSRSHYSRSSSHSNNSVSIPTNSKPRTQKNVFHSHPSPTPQIRSFSTSSHRSRS----RSSSRWEFFRLGLLRTPGMELQDL
        ES++     FR         R+   RS     N+              + +S PSP PQIR  S+ + R  S    +SSS W+F RLGL+RTP +EL   
Subjt:  ESMDHNMLRFRNGSCSSSSSRSHYSRSSSHSNNSVSIPTNSKPRTQKNVFHSHPSPTPQIRSFSTSSHRSRS----RSSSRWEFFRLGLLRTPGMELQDL

Query:  KTRTTTTAATGTAAHKTTASILGVVSCKKSV-----------------DAVPATAAKNRIRNEKVLNNNNNNNNNNNSVEIREKEKEKERRVSHRRTFEW
          RTT   A  + +  ++ S     S  K +                 D   + + + ++   K+  ++         +E +  +KE++  ++ +RTFEW
Subjt:  KTRTTTTAATGTAAHKTTASILGVVSCKKSV-----------------DAVPATAAKNRIRNEKVLNNNNNNNNNNNSVEIREKEKEKERRVSHRRTFEW

Query:  LKQL
        L Q+
Subjt:  LKQL


Sequences
CDS sequence
ATGAAGATGATGGGAAGAAGCTGCCATGGAGATGAAATATGGGAGACTGAAATTGAAGAAGAAGAAGAAGAAGAAGAAGCATTGTCTTTCTGTGATCTTCCTGTG
AAAGAAAAGCAGCAGGCGATGAGAACGTCGTCGGCCGCCGTGGAAACAGAGGAATTTGATTTCAAGCAGTGGCGACCGGCGCCGCCGATGTTAGCGGCGGATGAG
CTGTTCTTCCAAGGCCAAATGCTCCCTCTTCGTCTGTCCTTCAGCTCTGAAAATAGACAGAGTAATATTAATGAGTTGTTTGGTGGAAATTTGTGGAATAGGTCG
GAGTCTATGGATCATAATATGTTGAGGTTTAGAAATGGAAGCTGTAGTAGTAGTAGCAGTAGAAGCCATTATTCCAGGTCATCGAGTCATAGCAATAATTCCGTT
TCAATTCCCACGAACTCAAAGCCAAGAACTCAGAAGAACGTTTTCCACTCTCACCCAAGTCCCACGCCTCAAATCAGATCCTTCTCAACTTCCAGCCACCGGAGC
CGGAGCCGGAGTTCCTCCCGGTGGGAATTTTTCCGACTGGGTCTTCTTCGAACGCCGGGAATGGAGCTTCAAGACCTCAAAACTCGCACCACCACCACCGCCGCC
ACGGGCACGGCGGCGCATAAAACAACGGCCTCGATTCTAGGTGTGGTTAGCTGCAAAAAATCGGTGGATGCAGTACCGGCGACGGCGGCGAAGAATAGAATTAGG
AATGAAAAAGTTTTGAATAATAATAATAATAATAATAATAATAATAATAGTGTTGAAATTAGGGAAAAGGAAAAAGAAAAGGAAAGAAGGGTGTCACATCGTCGA
ACATTTGAATGGCTAAAGCAGCTCTCGCATGCAACCTTTGGAGAGGAACAATAA
mRNA sequence
ATGAAGATGATGGGAAGAAGCTGCCATGGAGATGAAATATGGGAGACTGAAATTGAAGAAGAAGAAGAAGAAGAAGAAGCATTGTCTTTCTGTGATCTTCCTGTG
AAAGAAAAGCAGCAGGCGATGAGAACGTCGTCGGCCGCCGTGGAAACAGAGGAATTTGATTTCAAGCAGTGGCGACCGGCGCCGCCGATGTTAGCGGCGGATGAG
CTGTTCTTCCAAGGCCAAATGCTCCCTCTTCGTCTGTCCTTCAGCTCTGAAAATAGACAGAGTAATATTAATGAGTTGTTTGGTGGAAATTTGTGGAATAGGTCG
GAGTCTATGGATCATAATATGTTGAGGTTTAGAAATGGAAGCTGTAGTAGTAGTAGCAGTAGAAGCCATTATTCCAGGTCATCGAGTCATAGCAATAATTCCGTT
TCAATTCCCACGAACTCAAAGCCAAGAACTCAGAAGAACGTTTTCCACTCTCACCCAAGTCCCACGCCTCAAATCAGATCCTTCTCAACTTCCAGCCACCGGAGC
CGGAGCCGGAGTTCCTCCCGGTGGGAATTTTTCCGACTGGGTCTTCTTCGAACGCCGGGAATGGAGCTTCAAGACCTCAAAACTCGCACCACCACCACCGCCGCC
ACGGGCACGGCGGCGCATAAAACAACGGCCTCGATTCTAGGTGTGGTTAGCTGCAAAAAATCGGTGGATGCAGTACCGGCGACGGCGGCGAAGAATAGAATTAGG
AATGAAAAAGTTTTGAATAATAATAATAATAATAATAATAATAATAATAGTGTTGAAATTAGGGAAAAGGAAAAAGAAAAGGAAAGAAGGGTGTCACATCGTCGA
ACATTTGAATGGCTAAAGCAGCTCTCGCATGCAACCTTTGGAGAGGAACAATAA
Protein sequence
MKMMGRSCHGDEIWETEIEEEEEEEEALSFCDLPVKEKQQAMRTSSAAVETEEFDFKQWRPAPPMLAADELFFQGQMLPLRLSFSSENRQSNINELFGGNLWNRS
ESMDHNMLRFRNGSCSSSSSRSHYSRSSSHSNNSVSIPTNSKPRTQKNVFHSHPSPTPQIRSFSTSSHRSRSRSSSRWEFFRLGLLRTPGMELQDLKTRTTTTAA
TGTAAHKTTASILGVVSCKKSVDAVPATAAKNRIRNEKVLNNNNNNNNNNNSVEIREKEKEKERRVSHRRTFEWLKQLSHATFGEEQ
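
The CDS and protein sequences in this record can be cross-checked by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code applies to this gene. The helper name translate is hypothetical, and only the first 21 nucleotides and the last 9 nucleotides of the CDS are used here for brevity.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence codon by codon.

    Stop codons are written as '*'; codons containing characters other than
    A, C, G, T are written as 'X'. Trailing bases that do not fill a codon
    are ignored.
    """
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    usable = len(cds) - len(cds) % 3
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3)
    )


print(translate("ATGAAGATGATGGGAAGAAGC"))  # MKMMGRS
print(translate("GAACAATAA"))              # EQ*
```

Passing the full CDS (with line breaks left in) and stripping the trailing '*' gives a string that can be compared directly against the protein sequence listed in the record.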