; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG05G019190 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG05G019190
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionSAP30_Sin3_bdg domain-containing protein
Genome locationCG_Chr05:31426221..31428789
RNA-Seq ExpressionClCG05G019190
SyntenyClCG05G019190
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0000118 - histone deacetylase complex (cellular component)
GO:0003712 - transcription coregulator activity (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR024145 - Histone deacetylase complex subunit SAP30/SAP30-like
IPR025718 - Histone deacetylase complex subunit SAP30, Sin3 binding domain
IPR038291 - SAP30, C-terminal domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004147794.1 uncharacterized protein LOC101206163 [Cucumis sativus]9.2e-10490.13Show/hide
Query:  MLEAVESSVNGGFSQLHSSGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLRGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEDDDDLEFE
        MLEAVESSVNG FS LHSSGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGL+GVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEDDDDLEFE
Subjt:  MLEAVESSVNGGFSQLHSSGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLRGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEDDDDLEFE

Query:  NLHWNGSDMASDDTLKSHRPRQRTHKSSGSSHKTISRSFSYESQSKGSISTPRGSMKVDFGKLEMSALWRYWRHFNLVDAFPNPSKEQLVDVVQRHFMSQ
        NLHWNGSDMASDDTLKSHRPRQRTHKSSGSS KTISRSFSYESQSKGSISTPRGSMKVD GKLEMSALWRYWRHFNLVDAFPNPSKEQLVDVVQRHFMSQ
Subjt:  NLHWNGSDMASDDTLKSHRPRQRTHKSSGSSHKTISRSFSYESQSKGSISTPRGSMKVDFGKLEMSALWRYWRHFNLVDAFPNPSKEQLVDVVQRHFMSQ

Query:  VLSFLATRRVTGDCWVCPRCKET
         L  L    + G      R K T
Subjt:  VLSFLATRRVTGDCWVCPRCKET

XP_008466608.1 PREDICTED: uncharacterized protein LOC103503976 [Cucumis melo]3.0e-10295.12Show/hide
Query:  MLEAVESSVNGGFSQLHSSGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLRGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEDDDDLEFE
        MLEAVESSVNG FS LHSSGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGL+GVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEDDDDLEFE
Subjt:  MLEAVESSVNGGFSQLHSSGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLRGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEDDDDLEFE

Query:  NLHWNGSDMASDDTLKSHRPRQRTHKSSGSSHKTISRSFSYESQSKGSISTPRGSMKVDFGKLEMSALWRYWRHFNLVDAFPNPSKEQLVDVVQRHFMSQ
        NLHWNGSDMASDDTLKSHRPRQR HK SGSS KTISRSFSYESQSKGSISTPRGSMKVD GKLEMSALWRYWRHFNLVDAFPNPSKEQLVDVVQRHFMSQ
Subjt:  NLHWNGSDMASDDTLKSHRPRQRTHKSSGSSHKTISRSFSYESQSKGSISTPRGSMKVDFGKLEMSALWRYWRHFNLVDAFPNPSKEQLVDVVQRHFMSQ

Query:  VLSFL
         L  L
Subjt:  VLSFL

XP_022144205.1 uncharacterized protein LOC111013658 isoform X1 [Momordica charantia]1.0e-10295.12Show/hide
Query:  MLEAVESSVNGGFSQLHSSGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLRGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEDDDDLEFE
        MLEAVESSVNGGFSQL SSGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGL+GVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTG+EDDDDLEFE
Subjt:  MLEAVESSVNGGFSQLHSSGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLRGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEDDDDLEFE

Query:  NLHWNGSDMASDDTLKSHRPRQRTHKSSGSSHKTISRSFSYESQSKGSISTPRGSMKVDFGKLEMSALWRYWRHFNLVDAFPNPSKEQLVDVVQRHFMSQ
        NL WNGSDMASDDTLKSHRPR RTHKSSGSSHKTISRSFSYESQSKGSISTPRGSMKVD GKLEMSALWRYWRHFNLVDAFPNPSKEQLVD+VQRHFMSQ
Subjt:  NLHWNGSDMASDDTLKSHRPRQRTHKSSGSSHKTISRSFSYESQSKGSISTPRGSMKVDFGKLEMSALWRYWRHFNLVDAFPNPSKEQLVDVVQRHFMSQ

Query:  VLSFL
         L  L
Subjt:  VLSFL

XP_022939981.1 uncharacterized protein LOC111445752 isoform X1 [Cucurbita moschata]2.3e-10295.61Show/hide
Query:  MLEAVESSVNGGFSQLHSSGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLRGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEDDDDLEFE
        MLEAVESSVNGGFSQL SSGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGL+GVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEDDDDLEFE
Subjt:  MLEAVESSVNGGFSQLHSSGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLRGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEDDDDLEFE

Query:  NLHWNGSDMASDDTLKSHRPRQRTHKSSGSSHKTISRSFSYESQSKGSISTPRGSMKVDFGKLEMSALWRYWRHFNLVDAFPNPSKEQLVDVVQRHFMSQ
        NL WNGSDMASDDTLKSHR R RTHKSSGSSHKTISRSFSYESQSKGSISTPRGSMKVD GKLEMSALWRYWRHFNLVDAFPNPSKEQLVDVVQRHFMSQ
Subjt:  NLHWNGSDMASDDTLKSHRPRQRTHKSSGSSHKTISRSFSYESQSKGSISTPRGSMKVDFGKLEMSALWRYWRHFNLVDAFPNPSKEQLVDVVQRHFMSQ

Query:  VLSFL
         L  L
Subjt:  VLSFL

XP_038903295.1 uncharacterized protein LOC120089924 [Benincasa hispida]2.4e-10496.1Show/hide
Query:  MLEAVESSVNGGFSQLHSSGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLRGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEDDDDLEFE
        MLEAVESSVNGGFS L SSGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGL+GVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEDDDDLEFE
Subjt:  MLEAVESSVNGGFSQLHSSGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLRGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEDDDDLEFE

Query:  NLHWNGSDMASDDTLKSHRPRQRTHKSSGSSHKTISRSFSYESQSKGSISTPRGSMKVDFGKLEMSALWRYWRHFNLVDAFPNPSKEQLVDVVQRHFMSQ
        N+HWNGSDMASDDTLKSHRPRQRTHKSSGSSHKTISRSFSYESQSKGSISTPRGSMKVD GKLEMSALWRYWRHFNLVDAFPNPSKEQLVDVVQRHFMSQ
Subjt:  NLHWNGSDMASDDTLKSHRPRQRTHKSSGSSHKTISRSFSYESQSKGSISTPRGSMKVDFGKLEMSALWRYWRHFNLVDAFPNPSKEQLVDVVQRHFMSQ

Query:  VLSFL
         L  L
Subjt:  VLSFL

TrEMBL top hitse value%identityAlignment
A0A0A0LDS8 SAP30_Sin3_bdg domain-containing protein4.5e-10490.13Show/hide
Query:  MLEAVESSVNGGFSQLHSSGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLRGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEDDDDLEFE
        MLEAVESSVNG FS LHSSGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGL+GVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEDDDDLEFE
Subjt:  MLEAVESSVNGGFSQLHSSGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLRGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEDDDDLEFE

Query:  NLHWNGSDMASDDTLKSHRPRQRTHKSSGSSHKTISRSFSYESQSKGSISTPRGSMKVDFGKLEMSALWRYWRHFNLVDAFPNPSKEQLVDVVQRHFMSQ
        NLHWNGSDMASDDTLKSHRPRQRTHKSSGSS KTISRSFSYESQSKGSISTPRGSMKVD GKLEMSALWRYWRHFNLVDAFPNPSKEQLVDVVQRHFMSQ
Subjt:  NLHWNGSDMASDDTLKSHRPRQRTHKSSGSSHKTISRSFSYESQSKGSISTPRGSMKVDFGKLEMSALWRYWRHFNLVDAFPNPSKEQLVDVVQRHFMSQ

Query:  VLSFLATRRVTGDCWVCPRCKET
         L  L    + G      R K T
Subjt:  VLSFLATRRVTGDCWVCPRCKET

A0A1S3CRQ8 uncharacterized protein LOC1035039761.4e-10295.12Show/hide
Query:  MLEAVESSVNGGFSQLHSSGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLRGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEDDDDLEFE
        MLEAVESSVNG FS LHSSGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGL+GVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEDDDDLEFE
Subjt:  MLEAVESSVNGGFSQLHSSGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLRGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEDDDDLEFE

Query:  NLHWNGSDMASDDTLKSHRPRQRTHKSSGSSHKTISRSFSYESQSKGSISTPRGSMKVDFGKLEMSALWRYWRHFNLVDAFPNPSKEQLVDVVQRHFMSQ
        NLHWNGSDMASDDTLKSHRPRQR HK SGSS KTISRSFSYESQSKGSISTPRGSMKVD GKLEMSALWRYWRHFNLVDAFPNPSKEQLVDVVQRHFMSQ
Subjt:  NLHWNGSDMASDDTLKSHRPRQRTHKSSGSSHKTISRSFSYESQSKGSISTPRGSMKVDFGKLEMSALWRYWRHFNLVDAFPNPSKEQLVDVVQRHFMSQ

Query:  VLSFL
         L  L
Subjt:  VLSFL

A0A5A7UBV0 SAP30_Sin3_bdg domain-containing protein1.4e-10295.12Show/hide
Query:  MLEAVESSVNGGFSQLHSSGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLRGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEDDDDLEFE
        MLEAVESSVNG FS LHSSGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGL+GVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEDDDDLEFE
Subjt:  MLEAVESSVNGGFSQLHSSGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLRGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEDDDDLEFE

Query:  NLHWNGSDMASDDTLKSHRPRQRTHKSSGSSHKTISRSFSYESQSKGSISTPRGSMKVDFGKLEMSALWRYWRHFNLVDAFPNPSKEQLVDVVQRHFMSQ
        NLHWNGSDMASDDTLKSHRPRQR HK SGSS KTISRSFSYESQSKGSISTPRGSMKVD GKLEMSALWRYWRHFNLVDAFPNPSKEQLVDVVQRHFMSQ
Subjt:  NLHWNGSDMASDDTLKSHRPRQRTHKSSGSSHKTISRSFSYESQSKGSISTPRGSMKVDFGKLEMSALWRYWRHFNLVDAFPNPSKEQLVDVVQRHFMSQ

Query:  VLSFL
         L  L
Subjt:  VLSFL

A0A6J1CR00 uncharacterized protein LOC111013658 isoform X14.9e-10395.12Show/hide
Query:  MLEAVESSVNGGFSQLHSSGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLRGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEDDDDLEFE
        MLEAVESSVNGGFSQL SSGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGL+GVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTG+EDDDDLEFE
Subjt:  MLEAVESSVNGGFSQLHSSGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLRGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEDDDDLEFE

Query:  NLHWNGSDMASDDTLKSHRPRQRTHKSSGSSHKTISRSFSYESQSKGSISTPRGSMKVDFGKLEMSALWRYWRHFNLVDAFPNPSKEQLVDVVQRHFMSQ
        NL WNGSDMASDDTLKSHRPR RTHKSSGSSHKTISRSFSYESQSKGSISTPRGSMKVD GKLEMSALWRYWRHFNLVDAFPNPSKEQLVD+VQRHFMSQ
Subjt:  NLHWNGSDMASDDTLKSHRPRQRTHKSSGSSHKTISRSFSYESQSKGSISTPRGSMKVDFGKLEMSALWRYWRHFNLVDAFPNPSKEQLVDVVQRHFMSQ

Query:  VLSFL
         L  L
Subjt:  VLSFL

A0A6J1FHA1 uncharacterized protein LOC111445752 isoform X11.1e-10295.61Show/hide
Query:  MLEAVESSVNGGFSQLHSSGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLRGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEDDDDLEFE
        MLEAVESSVNGGFSQL SSGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGL+GVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEDDDDLEFE
Subjt:  MLEAVESSVNGGFSQLHSSGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLRGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEDDDDLEFE

Query:  NLHWNGSDMASDDTLKSHRPRQRTHKSSGSSHKTISRSFSYESQSKGSISTPRGSMKVDFGKLEMSALWRYWRHFNLVDAFPNPSKEQLVDVVQRHFMSQ
        NL WNGSDMASDDTLKSHR R RTHKSSGSSHKTISRSFSYESQSKGSISTPRGSMKVD GKLEMSALWRYWRHFNLVDAFPNPSKEQLVDVVQRHFMSQ
Subjt:  NLHWNGSDMASDDTLKSHRPRQRTHKSSGSSHKTISRSFSYESQSKGSISTPRGSMKVDFGKLEMSALWRYWRHFNLVDAFPNPSKEQLVDVVQRHFMSQ

Query:  VLSFL
         L  L
Subjt:  VLSFL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G19330.1 unknown protein1.4e-7874.77Show/hide
Query:  MLEAVESS--VNGGFSQLHS-SGD-SSEEELSVLPRHTKVVVTGNNRTKSVLVGLRGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEDDDD
        MLEAV+SS  VNGGF Q+ S  GD SSEEELSVLPRHTKVVVTGNNRTKSVLVGL+GVVKKAVGLGGWHWLVLTNGIEVKLQRNALSV+E PTGNE+DDD
Subjt:  MLEAVESS--VNGGFSQLHS-SGD-SSEEELSVLPRHTKVVVTGNNRTKSVLVGLRGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEDDDD

Query:  LEFENLHWNGSDM-----ASDDTLKSHRPRQRTHKSSGSSHKTISRSFSYESQSKGSISTPRGSMKVDFGKLEMSALWRYWRHFNLVDAFPNPSKEQLVD
        L+FEN   NGSDM     AS+DTLK H+ + R  +SS SSHKT+SRS S +SQSK S  TP  +MKVD  KLEM AL  YWRHFNLVDA PNPSKEQL+D
Subjt:  LEFENLHWNGSDM-----ASDDTLKSHRPRQRTHKSSGSSHKTISRSFSYESQSKGSISTPRGSMKVDFGKLEMSALWRYWRHFNLVDAFPNPSKEQLVD

Query:  VVQRHFMSQVLSFL
        +VQRHFMSQ +  L
Subjt:  VVQRHFMSQVLSFL

AT1G19330.2 unknown protein5.8e-8076.08Show/hide
Query:  MLEAVESS--VNGGFSQLHS-SGD-SSEEELSVLPRHTKVVVTGNNRTKSVLVGLRGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEDDDD
        MLEAV+SS  VNGGF Q+ S  GD SSEEELSVLPRHTKVVVTGNNRTKSVLVGL+GVVKKAVGLGGWHWLVLTNGIEVKLQRNALSV+E PTGNE+DDD
Subjt:  MLEAVESS--VNGGFSQLHS-SGD-SSEEELSVLPRHTKVVVTGNNRTKSVLVGLRGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEDDDD

Query:  LEFENLHWNGSDMASDDTLKSHRPRQRTHKSSGSSHKTISRSFSYESQSKGSISTPRGSMKVDFGKLEMSALWRYWRHFNLVDAFPNPSKEQLVDVVQRH
        L+FEN   NGSDM S+DTLK H+ + R  +SS SSHKT+SRS S +SQSK S  TP  +MKVD  KLEM AL  YWRHFNLVDA PNPSKEQL+D+VQRH
Subjt:  LEFENLHWNGSDMASDDTLKSHRPRQRTHKSSGSSHKTISRSFSYESQSKGSISTPRGSMKVDFGKLEMSALWRYWRHFNLVDAFPNPSKEQLVDVVQRH

Query:  FMSQVLSFL
        FMSQ +  L
Subjt:  FMSQVLSFL

AT1G19330.3 unknown protein3.5e-7774.42Show/hide
Query:  MLEAVESS--VNGGFSQLHS-SGD-SSEEELSVLPRHTKVVVTGNNRTKSVLVGLRGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEDDDD
        MLEAV+SS  VNGGF Q+ S  GD SSEEELSVLPRHTKVVVTGNNRTKSVLVGL+GVVKKAVGLGGWHWLVLTNGIEVKLQRNALSV+E PTGNE+DDD
Subjt:  MLEAVESS--VNGGFSQLHS-SGD-SSEEELSVLPRHTKVVVTGNNRTKSVLVGLRGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEDDDD

Query:  LEFENLHWNGSDM-----ASDDTLKSHRPRQRTHKSSGSSHKTISRSFSYESQSKGSISTPRGSM-KVDFGKLEMSALWRYWRHFNLVDAFPNPSKEQLV
        L+FEN   NGSDM     AS+DTLK H+ + R  +SS SSHKT+SRS S +SQSK S  TP  +M KVD  KLEM AL  YWRHFNLVDA PNPSKEQL+
Subjt:  LEFENLHWNGSDM-----ASDDTLKSHRPRQRTHKSSGSSHKTISRSFSYESQSKGSISTPRGSM-KVDFGKLEMSALWRYWRHFNLVDAFPNPSKEQLV

Query:  DVVQRHFMSQVLSFL
        D+VQRHFMSQ +  L
Subjt:  DVVQRHFMSQVLSFL

AT1G75060.1 unknown protein1.4e-7071.36Show/hide
Query:  GGFSQLHSS-GD-SSEEELSVLPRHTKVVVTGNNRTKSVLVGLRGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEDDDDLEFE-NLHWN-G
        GGFSQL S  GD SSEEELSVLPRHTKVVVTGNNRTKSVLVGL+GVVKKAVGLGGWHWLVLTNGIEVKLQRNALSV+E PTGNE+D+DLE + +  WN  
Subjt:  GGFSQLHSS-GD-SSEEELSVLPRHTKVVVTGNNRTKSVLVGLRGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEDDDDLEFE-NLHWN-G

Query:  SDMASDDTLKSHRPRQRTHKSSGSSHKTISRSFSYESQSKGSISTPRGSMKVDFGKLEMSALWRYWRHFNLVDAFPNPSKEQLVDVVQRHFMSQVLSFL
        SDM ++DTLK H+ ++R H+SS  S K + R  S +S SK S  TPR +MKVD  KL+M+AL RYWRHFNLVDA PNP+KEQL+D++QRHFMSQ +  L
Subjt:  SDMASDDTLKSHRPRQRTHKSSGSSHKTISRSFSYESQSKGSISTPRGSMKVDFGKLEMSALWRYWRHFNLVDAFPNPSKEQLVDVVQRHFMSQVLSFL

AT1G75060.2 unknown protein1.3e-6870.85Show/hide
Query:  GGFSQLHSS-GD-SSEEELSVLPRHTKVVVTGNNRTKSVLVGLRGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEDDDDLEFE-NLHWN-G
        GGFSQL S  GD SSEEELSVLPRHTKVVVTGNNRTKSVLVGL+GVVKKAVGLGGWHWLVLTNGIEVKLQRNALSV+E PTGNE+D+DLE + +  WN  
Subjt:  GGFSQLHSS-GD-SSEEELSVLPRHTKVVVTGNNRTKSVLVGLRGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEDDDDLEFE-NLHWN-G

Query:  SDMASDDTLKSHRPRQRTHKSSGSSHKTISRSFSYESQSKGSISTPRGSMKVDFGKLEMSALWRYWRHFNLVDAFPNPSKEQLVDVVQRHFMSQVLSFL
        SDM ++DTLK H+ ++R H+SS  S K + R  S +S SK S  TPR +M VD  KL+M+AL RYWRHFNLVDA PNP+KEQL+D++QRHFMSQ +  L
Subjt:  SDMASDDTLKSHRPRQRTHKSSGSSHKTISRSFSYESQSKGSISTPRGSMKVDFGKLEMSALWRYWRHFNLVDAFPNPSKEQLVDVVQRHFMSQVLSFL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTTGAAGCAGTGGAGAGCTCCGTCAATGGCGGTTTCTCGCAGCTACACAGCAGTGGTGACAGTAGCGAGGAGGAGCTTTCTGTTCTCCCTCGTCATACGAAAGTGGT
TGTCACCGGAAACAACCGCACCAAATCGGTTCTCGTTGGACTTCGAGGCGTTGTTAAGAAGGCTGTCGGTTTGGGTGGATGGCATTGGCTGGTTCTCACGAATGGCATTG
AAGTTAAGCTACAGAGAAATGCCCTCAGTGTAATTGAAGCGCCAACTGGCAATGAGGATGATGATGATCTAGAATTCGAAAACTTACACTGGAATGGATCCGATATGGCA
TCTGATGACACCCTAAAGTCCCATAGACCGAGACAAAGGACACACAAATCGTCGGGATCATCTCACAAAACAATCAGCCGATCGTTCTCTTATGAATCACAGTCCAAGGG
ATCTATTTCCACCCCTCGTGGGTCCATGAAAGTTGACTTTGGGAAACTTGAGATGTCTGCCTTATGGAGATATTGGCGACACTTCAATCTTGTGGATGCTTTTCCTAACC
CATCAAAAGAGCAGTTGGTGGATGTTGTTCAAAGGCATTTCATGTCCCAGGTACTCAGTTTTCTTGCAACTAGACGAGTTACAGGTGATTGTTGGGTTTGTCCACGCTGC
AAAGAGACTCAAGACCGTCTGCAAATGACAAGTAAAGGCTGTAACACCAAGTTCATTCATCCATTGCTAACAAAAAACACCGGTACCACCCGAAATTCCCACTTTCCCGC
TACATTAATGTTCTCTGTAATATAA
mRNA sequenceShow/hide mRNA sequence
CTCTCTCTCTCTCTCTCTCTCTTACTCTGCTATGTGTTTGTTGTCAACGAAAATTCCCAAATTTCTCTGCGATTTCTCTGTCTTTTGCCGCTGAAATTCCCAAAAACATG
CTCCCAGTTTCATGATCTTTCTGTATCCTCCCATTTCTCACCTCCGTTTCTATTCAATTTTCCTCCAATTTGATCGCCCACGGAATCGGGAATGCTTGAAGCAGTGGAGA
GCTCCGTCAATGGCGGTTTCTCGCAGCTACACAGCAGTGGTGACAGTAGCGAGGAGGAGCTTTCTGTTCTCCCTCGTCATACGAAAGTGGTTGTCACCGGAAACAACCGC
ACCAAATCGGTTCTCGTTGGACTTCGAGGCGTTGTTAAGAAGGCTGTCGGTTTGGGTGGATGGCATTGGCTGGTTCTCACGAATGGCATTGAAGTTAAGCTACAGAGAAA
TGCCCTCAGTGTAATTGAAGCGCCAACTGGCAATGAGGATGATGATGATCTAGAATTCGAAAACTTACACTGGAATGGATCCGATATGGCATCTGATGACACCCTAAAGT
CCCATAGACCGAGACAAAGGACACACAAATCGTCGGGATCATCTCACAAAACAATCAGCCGATCGTTCTCTTATGAATCACAGTCCAAGGGATCTATTTCCACCCCTCGT
GGGTCCATGAAAGTTGACTTTGGGAAACTTGAGATGTCTGCCTTATGGAGATATTGGCGACACTTCAATCTTGTGGATGCTTTTCCTAACCCATCAAAAGAGCAGTTGGT
GGATGTTGTTCAAAGGCATTTCATGTCCCAGGTACTCAGTTTTCTTGCAACTAGACGAGTTACAGGTGATTGTTGGGTTTGTCCACGCTGCAAAGAGACTCAAGACCGTC
TGCAAATGACAAGTAAAGGCTGTAACACCAAGTTCATTCATCCATTGCTAACAAAAAACACCGGTACCACCCGAAATTCCCACTTTCCCGCTACATTAATGTTCTCTGTA
ATATAAGTATCCTGGCTTGTTATCGAGGTATTGTGAGCCTTTTGTTAGTGTAATTATGGTAGGTATATATAGTATATAATTTATATAATGTATGTACGACTGGCTGGCTA
ATAGACCTTCGAAAACTGAGAGAAGTCCTTACCGACTTCATTTTCCTAAACCTCGATCGAATGCTGACTTGGGAAAGTAGGTATCTTGAAATGAATGTGTATATTCTTTA
AATGCTTTATCTTGTTTATTATTGTCTAGTTGAATTGTATGAAATGAGATAGTATTGAATCA
Protein sequenceShow/hide protein sequence
MLEAVESSVNGGFSQLHSSGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLRGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEDDDDLEFENLHWNGSDMA
SDDTLKSHRPRQRTHKSSGSSHKTISRSFSYESQSKGSISTPRGSMKVDFGKLEMSALWRYWRHFNLVDAFPNPSKEQLVDVVQRHFMSQVLSFLATRRVTGDCWVCPRC
KETQDRLQMTSKGCNTKFIHPLLTKNTGTTRNSHFPATLMFSVI