; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi07G002730 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi07G002730
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionSAP30_Sin3_bdg domain-containing protein
Genome locationchr07:2962333..2966314
RNA-Seq ExpressionLsi07G002730
SyntenyLsi07G002730
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0000118 - histone deacetylase complex (cellular component)
GO:0003712 - transcription coregulator activity (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR024145 - Histone deacetylase complex subunit SAP30/SAP30-like
IPR025718 - Histone deacetylase complex subunit SAP30, Sin3 binding domain
IPR038291 - SAP30, C-terminal domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0044820.1 uncharacterized protein E6C27_scaffold74G001110 [Cucumis melo var. makuwa]7.2e-9785.46Show/hide
Query:  RSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVFKLEACDWKEFMDYVLKNGVEVKLQRNALSVLEHPTGNENDEHDLD
        +SPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWL               VLKNGVEVKLQRNALSVLEHPTGNENDE DLD
Subjt:  RSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVFKLEACDWKEFMDYVLKNGVEVKLQRNALSVLEHPTGNENDEHDLD

Query:  NSSCSSEIGEKDNDFSSSIVFHKLSKPKVRQIRPLAPSSSAKSN-----GEIQSIHMPKPGTRVKLGKLGTESLWRYIKHFNLVNSISNASRDQILQVVQ
        NSSCSS+IGEKDNDFSSS+VFHKLSKPKVRQI+  APSSSAKS      GEIQSIHMPK GTRVKLGKLGTESLWRYIKHFNLVNSISN SRDQIL VVQ
Subjt:  NSSCSSEIGEKDNDFSSSIVFHKLSKPKVRQIRPLAPSSSAKSN-----GEIQSIHMPKPGTRVKLGKLGTESLWRYIKHFNLVNSISNASRDQILQVVQ

Query:  RHFALQTSLNEASVMTEFIRAVKRRRD
        RHFALQTSLNEASVMTEFIRAVK+RR+
Subjt:  RHFALQTSLNEASVMTEFIRAVKRRRD

TYK16644.1 uncharacterized protein E5676_scaffold21G004740 [Cucumis melo var. makuwa]7.2e-9785.46Show/hide
Query:  RSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVFKLEACDWKEFMDYVLKNGVEVKLQRNALSVLEHPTGNENDEHDLD
        +SPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWL               VLKNGVEVKLQRNALSVLEHPTGNENDE DLD
Subjt:  RSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVFKLEACDWKEFMDYVLKNGVEVKLQRNALSVLEHPTGNENDEHDLD

Query:  NSSCSSEIGEKDNDFSSSIVFHKLSKPKVRQIRPLAPSSSAKSN-----GEIQSIHMPKPGTRVKLGKLGTESLWRYIKHFNLVNSISNASRDQILQVVQ
        NSSCSS+IGEKDNDFSSS+VFHKLSKPKVRQI+  APSSSAKS      GEIQSIHMPK GTRVKLGKLGTESLWRYIKHFNLVNSISN SRDQIL VVQ
Subjt:  NSSCSSEIGEKDNDFSSSIVFHKLSKPKVRQIRPLAPSSSAKSN-----GEIQSIHMPKPGTRVKLGKLGTESLWRYIKHFNLVNSISNASRDQILQVVQ

Query:  RHFALQTSLNEASVMTEFIRAVKRRRD
        RHFALQTSLNEASVMTEFIRAVK+RR+
Subjt:  RHFALQTSLNEASVMTEFIRAVKRRRD

XP_004146519.1 uncharacterized protein LOC101209634 [Cucumis sativus]3.4e-9986.34Show/hide
Query:  RSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVFKLEACDWKEFMDYVLKNGVEVKLQRNALSVLEHPTGNENDEHDLD
        +SPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWL               VLKNGVEVKLQRNALSVLEHPTGNENDEHDLD
Subjt:  RSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVFKLEACDWKEFMDYVLKNGVEVKLQRNALSVLEHPTGNENDEHDLD

Query:  NSSCSSEIGEKDNDFSSSIVFHKLSKPKVRQIRPLAPSSSAKSN-----GEIQSIHMPKPGTRVKLGKLGTESLWRYIKHFNLVNSISNASRDQILQVVQ
        NSSCSS+IGEKDNDFSSS+VFHKLSKPKVRQI+P APSSSAKS      GEIQSIHMPK GTRV+LGKLGTESLWRYIKHFNLVNSISN SRDQILQVVQ
Subjt:  NSSCSSEIGEKDNDFSSSIVFHKLSKPKVRQIRPLAPSSSAKSN-----GEIQSIHMPKPGTRVKLGKLGTESLWRYIKHFNLVNSISNASRDQILQVVQ

Query:  RHFALQTSLNEASVMTEFIRAVKRRRD
        RHFALQTSLNEASVMTEFIRAVK+RR+
Subjt:  RHFALQTSLNEASVMTEFIRAVKRRRD

XP_008452044.1 PREDICTED: uncharacterized protein LOC103493170 [Cucumis melo]7.2e-9785.46Show/hide
Query:  RSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVFKLEACDWKEFMDYVLKNGVEVKLQRNALSVLEHPTGNENDEHDLD
        +SPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWL               VLKNGVEVKLQRNALSVLEHPTGNENDE DLD
Subjt:  RSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVFKLEACDWKEFMDYVLKNGVEVKLQRNALSVLEHPTGNENDEHDLD

Query:  NSSCSSEIGEKDNDFSSSIVFHKLSKPKVRQIRPLAPSSSAKSN-----GEIQSIHMPKPGTRVKLGKLGTESLWRYIKHFNLVNSISNASRDQILQVVQ
        NSSCSS+IGEKDNDFSSS+VFHKLSKPKVRQI+  APSSSAKS      GEIQSIHMPK GTRVKLGKLGTESLWRYIKHFNLVNSISN SRDQIL VVQ
Subjt:  NSSCSSEIGEKDNDFSSSIVFHKLSKPKVRQIRPLAPSSSAKSN-----GEIQSIHMPKPGTRVKLGKLGTESLWRYIKHFNLVNSISNASRDQILQVVQ

Query:  RHFALQTSLNEASVMTEFIRAVKRRRD
        RHFALQTSLNEASVMTEFIRAVK+RR+
Subjt:  RHFALQTSLNEASVMTEFIRAVKRRRD

XP_022136863.1 uncharacterized protein LOC111008453 isoform X2 [Momordica charantia]6.1e-9684.65Show/hide
Query:  RSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVFKLEACDWKEFMDYVLKNGVEVKLQRNALSVLEHPTGNENDEHDLD
        +SPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVF            +VLKNGVEVKLQRNALSVLEHPTGNEND+HD D
Subjt:  RSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVFKLEACDWKEFMDYVLKNGVEVKLQRNALSVLEHPTGNENDEHDLD

Query:  NSSCSSEIGEKDNDFSSSIVFHKLSKPKVRQIRPLAPSSSAK------SNGEIQSIHMPKPGTRVKLGKLGTESLWRYIKHFNLVNSISNASRDQILQVV
        NSSCSS+IGEKDNDFSSSIVFHKLSK KVRQIRP APSSSAK      S GEIQSIH+PK   RVKLGKLGTESLWRYI+HFNLVNSISNASRDQ+L VV
Subjt:  NSSCSSEIGEKDNDFSSSIVFHKLSKPKVRQIRPLAPSSSAK------SNGEIQSIHMPKPGTRVKLGKLGTESLWRYIKHFNLVNSISNASRDQILQVV

Query:  QRHFALQTSLNEASVMTEFIRAVKRRRD
        QRHFALQT+LNEASVMTEFIRAVKRRR+
Subjt:  QRHFALQTSLNEASVMTEFIRAVKRRRD

TrEMBL top hitse value%identityAlignment
A0A0A0KUM7 SAP30_Sin3_bdg domain-containing protein1.7e-9986.34Show/hide
Query:  RSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVFKLEACDWKEFMDYVLKNGVEVKLQRNALSVLEHPTGNENDEHDLD
        +SPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWL               VLKNGVEVKLQRNALSVLEHPTGNENDEHDLD
Subjt:  RSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVFKLEACDWKEFMDYVLKNGVEVKLQRNALSVLEHPTGNENDEHDLD

Query:  NSSCSSEIGEKDNDFSSSIVFHKLSKPKVRQIRPLAPSSSAKSN-----GEIQSIHMPKPGTRVKLGKLGTESLWRYIKHFNLVNSISNASRDQILQVVQ
        NSSCSS+IGEKDNDFSSS+VFHKLSKPKVRQI+P APSSSAKS      GEIQSIHMPK GTRV+LGKLGTESLWRYIKHFNLVNSISN SRDQILQVVQ
Subjt:  NSSCSSEIGEKDNDFSSSIVFHKLSKPKVRQIRPLAPSSSAKSN-----GEIQSIHMPKPGTRVKLGKLGTESLWRYIKHFNLVNSISNASRDQILQVVQ

Query:  RHFALQTSLNEASVMTEFIRAVKRRRD
        RHFALQTSLNEASVMTEFIRAVK+RR+
Subjt:  RHFALQTSLNEASVMTEFIRAVKRRRD

A0A1S3BU28 uncharacterized protein LOC1034931703.5e-9785.46Show/hide
Query:  RSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVFKLEACDWKEFMDYVLKNGVEVKLQRNALSVLEHPTGNENDEHDLD
        +SPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWL               VLKNGVEVKLQRNALSVLEHPTGNENDE DLD
Subjt:  RSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVFKLEACDWKEFMDYVLKNGVEVKLQRNALSVLEHPTGNENDEHDLD

Query:  NSSCSSEIGEKDNDFSSSIVFHKLSKPKVRQIRPLAPSSSAKSN-----GEIQSIHMPKPGTRVKLGKLGTESLWRYIKHFNLVNSISNASRDQILQVVQ
        NSSCSS+IGEKDNDFSSS+VFHKLSKPKVRQI+  APSSSAKS      GEIQSIHMPK GTRVKLGKLGTESLWRYIKHFNLVNSISN SRDQIL VVQ
Subjt:  NSSCSSEIGEKDNDFSSSIVFHKLSKPKVRQIRPLAPSSSAKSN-----GEIQSIHMPKPGTRVKLGKLGTESLWRYIKHFNLVNSISNASRDQILQVVQ

Query:  RHFALQTSLNEASVMTEFIRAVKRRRD
        RHFALQTSLNEASVMTEFIRAVK+RR+
Subjt:  RHFALQTSLNEASVMTEFIRAVKRRRD

A0A5A7TNP9 SAP30_Sin3_bdg domain-containing protein3.5e-9785.46Show/hide
Query:  RSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVFKLEACDWKEFMDYVLKNGVEVKLQRNALSVLEHPTGNENDEHDLD
        +SPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWL               VLKNGVEVKLQRNALSVLEHPTGNENDE DLD
Subjt:  RSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVFKLEACDWKEFMDYVLKNGVEVKLQRNALSVLEHPTGNENDEHDLD

Query:  NSSCSSEIGEKDNDFSSSIVFHKLSKPKVRQIRPLAPSSSAKSN-----GEIQSIHMPKPGTRVKLGKLGTESLWRYIKHFNLVNSISNASRDQILQVVQ
        NSSCSS+IGEKDNDFSSS+VFHKLSKPKVRQI+  APSSSAKS      GEIQSIHMPK GTRVKLGKLGTESLWRYIKHFNLVNSISN SRDQIL VVQ
Subjt:  NSSCSSEIGEKDNDFSSSIVFHKLSKPKVRQIRPLAPSSSAKSN-----GEIQSIHMPKPGTRVKLGKLGTESLWRYIKHFNLVNSISNASRDQILQVVQ

Query:  RHFALQTSLNEASVMTEFIRAVKRRRD
        RHFALQTSLNEASVMTEFIRAVK+RR+
Subjt:  RHFALQTSLNEASVMTEFIRAVKRRRD

A0A5D3D093 SAP30_Sin3_bdg domain-containing protein3.5e-9785.46Show/hide
Query:  RSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVFKLEACDWKEFMDYVLKNGVEVKLQRNALSVLEHPTGNENDEHDLD
        +SPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWL               VLKNGVEVKLQRNALSVLEHPTGNENDE DLD
Subjt:  RSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVFKLEACDWKEFMDYVLKNGVEVKLQRNALSVLEHPTGNENDEHDLD

Query:  NSSCSSEIGEKDNDFSSSIVFHKLSKPKVRQIRPLAPSSSAKSN-----GEIQSIHMPKPGTRVKLGKLGTESLWRYIKHFNLVNSISNASRDQILQVVQ
        NSSCSS+IGEKDNDFSSS+VFHKLSKPKVRQI+  APSSSAKS      GEIQSIHMPK GTRVKLGKLGTESLWRYIKHFNLVNSISN SRDQIL VVQ
Subjt:  NSSCSSEIGEKDNDFSSSIVFHKLSKPKVRQIRPLAPSSSAKSN-----GEIQSIHMPKPGTRVKLGKLGTESLWRYIKHFNLVNSISNASRDQILQVVQ

Query:  RHFALQTSLNEASVMTEFIRAVKRRRD
        RHFALQTSLNEASVMTEFIRAVK+RR+
Subjt:  RHFALQTSLNEASVMTEFIRAVKRRRD

A0A6J1C8P1 uncharacterized protein LOC111008453 isoform X22.9e-9684.65Show/hide
Query:  RSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVFKLEACDWKEFMDYVLKNGVEVKLQRNALSVLEHPTGNENDEHDLD
        +SPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVF            +VLKNGVEVKLQRNALSVLEHPTGNEND+HD D
Subjt:  RSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVFKLEACDWKEFMDYVLKNGVEVKLQRNALSVLEHPTGNENDEHDLD

Query:  NSSCSSEIGEKDNDFSSSIVFHKLSKPKVRQIRPLAPSSSAK------SNGEIQSIHMPKPGTRVKLGKLGTESLWRYIKHFNLVNSISNASRDQILQVV
        NSSCSS+IGEKDNDFSSSIVFHKLSK KVRQIRP APSSSAK      S GEIQSIH+PK   RVKLGKLGTESLWRYI+HFNLVNSISNASRDQ+L VV
Subjt:  NSSCSSEIGEKDNDFSSSIVFHKLSKPKVRQIRPLAPSSSAK------SNGEIQSIHMPKPGTRVKLGKLGTESLWRYIKHFNLVNSISNASRDQILQVV

Query:  QRHFALQTSLNEASVMTEFIRAVKRRRD
        QRHFALQT+LNEASVMTEFIRAVKRRR+
Subjt:  QRHFALQTSLNEASVMTEFIRAVKRRRD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G19330.1 unknown protein1.1e-4248Show/hide
Query:  SGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVFKLEACDWKEFMDYVLKNGVEVKLQRNALSVLEHPTGNENDEHDLDNSSCSSE
        S +EELSVLPRHTKV+VTGNNRTKSVL+GLQGVVKKAVGLGGWHWL               VL NG+EVKLQRNALSVLE PTGNE D+ DLD  +    
Subjt:  SGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVFKLEACDWKEFMDYVLKNGVEVKLQRNALSVLEHPTGNENDEHDLDNSSCSSE

Query:  IGEKDNDFSSSIVFHKLSKPKVR-----------QIRPLAPSSSAKSNGEIQSIHMPKPGTRVKLGKLGTESLWRYIKHFNLVNSISNASRDQILQVVQR
          +    F +S    K  K K+R             R L+  S +KS+G       P    +V L KL   +L  Y +HFNLV++I N S++Q++ +VQR
Subjt:  IGEKDNDFSSSIVFHKLSKPKVR-----------QIRPLAPSSSAKSNGEIQSIHMPKPGTRVKLGKLGTESLWRYIKHFNLVNSISNASRDQILQVVQR

Query:  HFALQTSLNEASVMTEFIRAVKRRR
        HF  Q  ++E  V+  F++A KR +
Subjt:  HFALQTSLNEASVMTEFIRAVKRRR

AT1G19330.2 unknown protein4.1e-4247.95Show/hide
Query:  SGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVFKLEACDWKEFMDYVLKNGVEVKLQRNALSVLEHPTGNENDEH-DLDNSSCS-
        S +EELSVLPRHTKV+VTGNNRTKSVL+GLQGVVKKAVGLGGWHWL               VL NG+EVKLQRNALSVLE PTGNE D+  D +N+  + 
Subjt:  SGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVFKLEACDWKEFMDYVLKNGVEVKLQRNALSVLEHPTGNENDEH-DLDNSSCS-

Query:  SEIGEKD--NDFSSSIVFHKLSKPKVRQI-RPLAPSSSAKSNGEIQSIHMPKPGTRVKLGKLGTESLWRYIKHFNLVNSISNASRDQILQVVQRHFALQT
        S++  +D      S +   + S+   + + R L+  S +KS+G       P    +V L KL   +L  Y +HFNLV++I N S++Q++ +VQRHF  Q 
Subjt:  SEIGEKD--NDFSSSIVFHKLSKPKVRQI-RPLAPSSSAKSNGEIQSIHMPKPGTRVKLGKLGTESLWRYIKHFNLVNSISNASRDQILQVVQRHFALQT

Query:  SLNEASVMTEFIRAVKRRR
         ++E  V+  F++A KR +
Subjt:  SLNEASVMTEFIRAVKRRR

AT1G19330.3 unknown protein1.4e-4247.96Show/hide
Query:  SGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVFKLEACDWKEFMDYVLKNGVEVKLQRNALSVLEHPTGNENDEHDLDNSSCSSE
        S +EELSVLPRHTKV+VTGNNRTKSVL+GLQGVVKKAVGLGGWHWL               VL NG+EVKLQRNALSVLE PTGNE D+ DLD  +    
Subjt:  SGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVFKLEACDWKEFMDYVLKNGVEVKLQRNALSVLEHPTGNENDEHDLDNSSCSSE

Query:  IGEKDNDFSSSIVFHKLSKPKVRQIR-------PLAPSSSAKSNGEIQSIHMPKPGTRVKLGKLGTESLWRYIKHFNLVNSISNASRDQILQVVQRHFAL
          +    F +S    K  K K+R  R        ++ S S+ S  +      P+   +V L KL   +L  Y +HFNLV++I N S++Q++ +VQRHF  
Subjt:  IGEKDNDFSSSIVFHKLSKPKVRQIR-------PLAPSSSAKSNGEIQSIHMPKPGTRVKLGKLGTESLWRYIKHFNLVNSISNASRDQILQVVQRHFAL

Query:  QTSLNEASVMTEFIRAVKRRR
        Q  ++E  V+  F++A KR +
Subjt:  QTSLNEASVMTEFIRAVKRRR

AT1G75060.1 unknown protein8.3e-4348.18Show/hide
Query:  RSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVFKLEACDWKEFMDYVLKNGVEVKLQRNALSVLEHPTGNENDEHDLD
        +S F + S +EELSVLPRHTKV+VTGNNRTKSVL+GLQGVVKKAVGLGGWHWL               VL NG+EVKLQRNALSVLEHPTGNE D +DL+
Subjt:  RSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVFKLEACDWKEFMDYVLKNGVEVKLQRNALSVLEHPTGNENDEHDLD

Query:  NSSCSSEIGEKDNDFSSSIVFHKLSKPKVRQIR----PLAPSSSAKSNGEIQSIHMPKPGTRVKLGKLGTESLWRYIKHFNLVNSISNASRDQILQVVQR
            +      D     ++  HK  K   R  R     L    S  S+ +I SI  P+   +V L KL   +L RY +HFNLV+++ N +++Q++ ++QR
Subjt:  NSSCSSEIGEKDNDFSSSIVFHKLSKPKVRQIR----PLAPSSSAKSNGEIQSIHMPKPGTRVKLGKLGTESLWRYIKHFNLVNSISNASRDQILQVVQR

Query:  HFALQTSLNEASVMTEFIRA
        HF  Q  ++E  V+  F++A
Subjt:  HFALQTSLNEASVMTEFIRA

AT1G75060.2 unknown protein2.7e-4148.18Show/hide
Query:  RSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVFKLEACDWKEFMDYVLKNGVEVKLQRNALSVLEHPTGNENDEHDLD
        +S F + S +EELSVLPRHTKV+VTGNNRTKSVL+GLQGVVKKAVGLGGWHWL               VL NG+EVKLQRNALSVLEHPTGNE D +DL+
Subjt:  RSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVFKLEACDWKEFMDYVLKNGVEVKLQRNALSVLEHPTGNENDEHDLD

Query:  NSSCSSEIGEKDNDFSSSIVFHKLSKPKVRQIR----PLAPSSSAKSNGEIQSIHMPKPGTRVKLGKLGTESLWRYIKHFNLVNSISNASRDQILQVVQR
            +      D     ++  HK  K   R  R     L    S  S+ +I SI  P+    V L KL   +L RY +HFNLV+++ N +++Q++ ++QR
Subjt:  NSSCSSEIGEKDNDFSSSIVFHKLSKPKVRQIR----PLAPSSSAKSNGEIQSIHMPKPGTRVKLGKLGTESLWRYIKHFNLVNSISNASRDQILQVVQR

Query:  HFALQTSLNEASVMTEFIRA
        HF  Q  ++E  V+  F++A
Subjt:  HFALQTSLNEASVMTEFIRA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGAGTTCAACATCTGTGAAGTTAGATTCCACTGAGGCAAGCAAAATTCATAATGCCTTACTGATCAGCGAAAGAGTGCTGTTGGTTATTATAGCTCAAATTGAGCT
TGAATTCTTGCACTATATGGACCACAGGGCTTCAGGGTTCGATTTAAGTTTTAACTTTCGTCATCGCTACCTGGAGGAAAGACAGTTAGGAAGGAAGAAAAATGCTGGAA
CCAGATCTCCTTTCCGTGAGGAGAGTGGGGATGAAGAGCTTTCAGTTCTTCCAAGGCACACTAAAGTTATTGTCACTGGAAATAACAGAACAAAGTCTGTTTTATTGGGA
TTGCAAGGCGTGGTTAAGAAGGCCGTTGGCCTTGGAGGCTGGCATTGGCTGGTTTTTAAGCTAGAAGCCTGTGACTGGAAGGAATTCATGGATTACGTTTTGAAAAATGG
GGTTGAAGTGAAGCTGCAAAGGAATGCATTGAGTGTGCTGGAACATCCCACAGGAAACGAAAATGATGAACATGATTTGGACAACTCAAGTTGTAGCTCCGAGATCGGGG
AGAAGGATAATGATTTCTCTAGCAGCATTGTTTTCCACAAACTTAGCAAACCAAAAGTTAGGCAGATTAGGCCATTGGCTCCATCTTCATCAGCAAAGTCAAATGGAGAA
ATTCAGTCTATTCACATGCCAAAGCCAGGGACGCGGGTCAAACTGGGGAAACTAGGAACAGAGTCTTTGTGGAGATATATCAAGCACTTCAATCTAGTGAATAGTATTTC
AAATGCGTCAAGGGACCAAATCCTTCAAGTTGTGCAACGGCATTTTGCATTACAGACAAGTTTGAACGAGGCATCGGTGATGACTGAATTCATTCGAGCTGTTAAGAGAC
GAAGAGACTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGGAGTTCAACATCTGTGAAGTTAGATTCCACTGAGGCAAGCAAAATTCATAATGCCTTACTGATCAGCGAAAGAGTGCTGTTGGTTATTATAGCTCAAATTGAGCT
TGAATTCTTGCACTATATGGACCACAGGGCTTCAGGGTTCGATTTAAGTTTTAACTTTCGTCATCGCTACCTGGAGGAAAGACAGTTAGGAAGGAAGAAAAATGCTGGAA
CCAGATCTCCTTTCCGTGAGGAGAGTGGGGATGAAGAGCTTTCAGTTCTTCCAAGGCACACTAAAGTTATTGTCACTGGAAATAACAGAACAAAGTCTGTTTTATTGGGA
TTGCAAGGCGTGGTTAAGAAGGCCGTTGGCCTTGGAGGCTGGCATTGGCTGGTTTTTAAGCTAGAAGCCTGTGACTGGAAGGAATTCATGGATTACGTTTTGAAAAATGG
GGTTGAAGTGAAGCTGCAAAGGAATGCATTGAGTGTGCTGGAACATCCCACAGGAAACGAAAATGATGAACATGATTTGGACAACTCAAGTTGTAGCTCCGAGATCGGGG
AGAAGGATAATGATTTCTCTAGCAGCATTGTTTTCCACAAACTTAGCAAACCAAAAGTTAGGCAGATTAGGCCATTGGCTCCATCTTCATCAGCAAAGTCAAATGGAGAA
ATTCAGTCTATTCACATGCCAAAGCCAGGGACGCGGGTCAAACTGGGGAAACTAGGAACAGAGTCTTTGTGGAGATATATCAAGCACTTCAATCTAGTGAATAGTATTTC
AAATGCGTCAAGGGACCAAATCCTTCAAGTTGTGCAACGGCATTTTGCATTACAGACAAGTTTGAACGAGGCATCGGTGATGACTGAATTCATTCGAGCTGTTAAGAGAC
GAAGAGACTGAGGGAATGAGTAATGATATTGAAAAGTGATTCACGTAAATGTATGTTTGTATAGTGATTAGTGATTATGCCTGTTAACTATAGCCTATAACTATGTAAAC
TTCTAACTCTCTAACATTGAGTTGATAACTTCCTAGACCGATACTATACTTTTAACTTCTCTCTCCCGAGTTTCGATGAAATTCTAAGGGGTATTTAGGGCACTAATTTC
GAATGGGTTAAGTGGGCTATAATAGTCCATCCAGCGTTTGGGACATGTGTTAAGATAATAGGAGTAAAGGTTATTTTAGCTCCCTA
Protein sequenceShow/hide protein sequence
MGSSTSVKLDSTEASKIHNALLISERVLLVIIAQIELEFLHYMDHRASGFDLSFNFRHRYLEERQLGRKKNAGTRSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLG
LQGVVKKAVGLGGWHWLVFKLEACDWKEFMDYVLKNGVEVKLQRNALSVLEHPTGNENDEHDLDNSSCSSEIGEKDNDFSSSIVFHKLSKPKVRQIRPLAPSSSAKSNGE
IQSIHMPKPGTRVKLGKLGTESLWRYIKHFNLVNSISNASRDQILQVVQRHFALQTSLNEASVMTEFIRAVKRRRD