CuGenDBv2

Lsi03G011260 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi03G011260
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: CCHC-type domain-containing protein
Genome location: chr03:21489131..21493636
RNA-Seq Expression: Lsi03G011260
Synteny: Lsi03G011260
Gene Ontology terms:
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR001878 - Zinc finger, CCHC-type
IPR036875 - Zinc finger, CCHC-type superfamily
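The genome location above follows a `chromosome:start..end` pattern. As a minimal sketch of how such a string might be split into its parts, the hypothetical helper `parse_location` below (not part of CuGenDB) extracts the three fields and computes the locus span. The span calculation assumes 1-based, inclusive coordinates, which the record itself does not state.

```python
import re


def parse_location(location: str):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("chr03:21489131..21493636")

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the locus covers 4506 bp on chr03, which is longer than the CDS listed in this record and would be consistent with the gene containing introns and untranslated regions.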


Homology
GenBank top hits (with e value, % identity and alignment)
KAA0038119.1 Zinc knuckle family protein isoform 1 [Cucumis melo var. makuwa] | e value: 2.9e-127 | % identity: 94.4
Query:  MANSPDVDADDDFSELYKEYTGPPRSTTVTSQEKTNTNKRSHAGSDGEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT
        MANSPDVD DDDFSELYKEYTGPPRSTTV SQEKTNTNKRSHAGSD EDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT
Subjt:  MANSPDVDADDDFSELYKEYTGPPRSTTVTSQEKTNTNKRSHAGSDGEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT

Query:  QGCPSTLGSNRKSQDFFERVPARDKHVRAIFTDRVVQKIEKDVGCKIKMDEKFIIVSGKDRLILVKGVDAVHKLIKEEGDQKGSSSSHLSRSRSPDRSPV
        QGCPSTLGS+RKSQDFFERVPARDKHVRAIFTDRV+QKIEKDVGCKIKMDEKFIIVSGKDRLIL+KG+DAV+KLIKE+GDQKGSSSSH+S+SRSPDRSP 
Subjt:  QGCPSTLGSNRKSQDFFERVPARDKHVRAIFTDRVVQKIEKDVGCKIKMDEKFIIVSGKDRLILVKGVDAVHKLIKEEGDQKGSSSSHLSRSRSPDRSPV

Query:  GSRSQRSDVHRSHSGPTNASQFQPRFSRQEKVVENRVRDDLQKYSRSSVQ
        GSRSQRSDVHRSHSGPTNASQFQPRFSRQEKVVENR RDDLQKY RSSVQ
Subjt:  GSRSQRSDVHRSHSGPTNASQFQPRFSRQEKVVENRVRDDLQKYSRSSVQ

XP_008447472.1 PREDICTED: uncharacterized protein LOC103489910 [Cucumis melo] | e value: 2.9e-127 | % identity: 94.4
Query:  MANSPDVDADDDFSELYKEYTGPPRSTTVTSQEKTNTNKRSHAGSDGEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT
        MANSPDVD DDDFSELYKEYTGPPRSTTV SQEKTNTNKRSHAGSD EDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT
Subjt:  MANSPDVDADDDFSELYKEYTGPPRSTTVTSQEKTNTNKRSHAGSDGEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT

Query:  QGCPSTLGSNRKSQDFFERVPARDKHVRAIFTDRVVQKIEKDVGCKIKMDEKFIIVSGKDRLILVKGVDAVHKLIKEEGDQKGSSSSHLSRSRSPDRSPV
        QGCPSTLGS+RKSQDFFERVPARDKHVRAIFTDRV+QKIEKDVGCKIKMDEKFIIVSGKDRLIL+KG+DAV+KLIKE+GDQKGSSSSH+S+SRSPDRSP 
Subjt:  QGCPSTLGSNRKSQDFFERVPARDKHVRAIFTDRVVQKIEKDVGCKIKMDEKFIIVSGKDRLILVKGVDAVHKLIKEEGDQKGSSSSHLSRSRSPDRSPV

Query:  GSRSQRSDVHRSHSGPTNASQFQPRFSRQEKVVENRVRDDLQKYSRSSVQ
        GSRSQRSDVHRSHSGPTNASQFQPRFSRQEKVVENR RDDLQKY RSSVQ
Subjt:  GSRSQRSDVHRSHSGPTNASQFQPRFSRQEKVVENRVRDDLQKYSRSSVQ

XP_031738009.1 uncharacterized protein LOC101215062 isoform X1 [Cucumis sativus] | e value: 8.4e-127 | % identity: 93.65
Query:  MANSPDVDADDDFSELYKEYTGPPRSTTVTSQEKTNTNKRSHAGSDGEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT
        MANSPDVDADDDFSELYKEYTGPPRSTTV  QEKTNTNKRSHAGSD EDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICK CGESGHFT
Subjt:  MANSPDVDADDDFSELYKEYTGPPRSTTVTSQEKTNTNKRSHAGSDGEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT

Query:  QGCPSTLGSNRKSQDFFERVPARDKHVRAIFTDRVVQKIEKDVGCKIKMDEKFIIVSGKDRLILVKGVDAVHKLIKEEGDQKGSSSSHLSRSRSPDRSPV
        QGCPSTLGS+RKSQDFFERVPARDKHVRAIFTDRV+QKIEKDVGCKIKMDEKFIIVSGKDRLIL+KG+DAV+KLIKE+GDQKGSSSSH+SRSRSPDRSP 
Subjt:  QGCPSTLGSNRKSQDFFERVPARDKHVRAIFTDRVVQKIEKDVGCKIKMDEKFIIVSGKDRLILVKGVDAVHKLIKEEGDQKGSSSSHLSRSRSPDRSPV

Query:  GSRSQRSDVHRSHSGPTNASQFQPRFSRQEKVVENRVRDDLQKYSRSSVQGR
        GSRSQRSDVHRSHSGPTNASQFQPRFSRQEKVVENR RDDLQKY RSSVQ +
Subjt:  GSRSQRSDVHRSHSGPTNASQFQPRFSRQEKVVENRVRDDLQKYSRSSVQGR

XP_038905438.1 uncharacterized protein LOC120091471 isoform X1 [Benincasa hispida] | e value: 7.3e-131 | % identity: 97.22
Query:  MANSPDVDADDDFSELYKEYTGPPRSTTVTSQEKTNTNKRSHAGSDGEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT
        MANSPDVDADDDFSELYKEYTGPPRSTTV SQEKTNTNKRSHAGSD E EPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT
Subjt:  MANSPDVDADDDFSELYKEYTGPPRSTTVTSQEKTNTNKRSHAGSDGEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT

Query:  QGCPSTLGSNRKSQDFFERVPARDKHVRAIFTDRVVQKIEKDVGCKIKMDEKFIIVSGKDRLILVKGVDAVHKLIKEEGDQKGSSSSHLSRSRSPDRSPV
        QGCPSTLGSNRKSQDFFERVPARDKHVRAIFTDRVVQKIEKDVGCKIKMDEKFIIVSGKDRLILVKGVDAVHKLIKEEGDQKGSSSSH+SRSRSPDRSPV
Subjt:  QGCPSTLGSNRKSQDFFERVPARDKHVRAIFTDRVVQKIEKDVGCKIKMDEKFIIVSGKDRLILVKGVDAVHKLIKEEGDQKGSSSSHLSRSRSPDRSPV

Query:  GSRSQRSDVHRSHSGPTNASQFQPRFSRQEKVVENRVRDDLQKYSRSSVQGR
        GSRSQRSDVHRSHSGPTN+SQFQPRFSRQEKVVENRVRDDLQKYSRSSVQ +
Subjt:  GSRSQRSDVHRSHSGPTNASQFQPRFSRQEKVVENRVRDDLQKYSRSSVQGR

XP_038905439.1 uncharacterized protein LOC120091471 isoform X2 [Benincasa hispida] | e value: 9.6e-131 | % identity: 98
Query:  MANSPDVDADDDFSELYKEYTGPPRSTTVTSQEKTNTNKRSHAGSDGEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT
        MANSPDVDADDDFSELYKEYTGPPRSTTV SQEKTNTNKRSHAGSD E EPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT
Subjt:  MANSPDVDADDDFSELYKEYTGPPRSTTVTSQEKTNTNKRSHAGSDGEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT

Query:  QGCPSTLGSNRKSQDFFERVPARDKHVRAIFTDRVVQKIEKDVGCKIKMDEKFIIVSGKDRLILVKGVDAVHKLIKEEGDQKGSSSSHLSRSRSPDRSPV
        QGCPSTLGSNRKSQDFFERVPARDKHVRAIFTDRVVQKIEKDVGCKIKMDEKFIIVSGKDRLILVKGVDAVHKLIKEEGDQKGSSSSH+SRSRSPDRSPV
Subjt:  QGCPSTLGSNRKSQDFFERVPARDKHVRAIFTDRVVQKIEKDVGCKIKMDEKFIIVSGKDRLILVKGVDAVHKLIKEEGDQKGSSSSHLSRSRSPDRSPV

Query:  GSRSQRSDVHRSHSGPTNASQFQPRFSRQEKVVENRVRDDLQKYSRSSVQ
        GSRSQRSDVHRSHSGPTN+SQFQPRFSRQEKVVENRVRDDLQKYSRSSVQ
Subjt:  GSRSQRSDVHRSHSGPTNASQFQPRFSRQEKVVENRVRDDLQKYSRSSVQ

TrEMBL top hits (with e value, % identity and alignment)
A0A0A0L866 CCHC-type domain-containing protein | e value: 5.3e-127 | % identity: 94.4
Query:  MANSPDVDADDDFSELYKEYTGPPRSTTVTSQEKTNTNKRSHAGSDGEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT
        MANSPDVDADDDFSELYKEYTGPPRSTTV  QEKTNTNKRSHAGSD EDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICK CGESGHFT
Subjt:  MANSPDVDADDDFSELYKEYTGPPRSTTVTSQEKTNTNKRSHAGSDGEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT

Query:  QGCPSTLGSNRKSQDFFERVPARDKHVRAIFTDRVVQKIEKDVGCKIKMDEKFIIVSGKDRLILVKGVDAVHKLIKEEGDQKGSSSSHLSRSRSPDRSPV
        QGCPSTLGS+RKSQDFFERVPARDKHVRAIFTDRV+QKIEKDVGCKIKMDEKFIIVSGKDRLIL+KG+DAV+KLIKE+GDQKGSSSSH+SRSRSPDRSP 
Subjt:  QGCPSTLGSNRKSQDFFERVPARDKHVRAIFTDRVVQKIEKDVGCKIKMDEKFIIVSGKDRLILVKGVDAVHKLIKEEGDQKGSSSSHLSRSRSPDRSPV

Query:  GSRSQRSDVHRSHSGPTNASQFQPRFSRQEKVVENRVRDDLQKYSRSSVQ
        GSRSQRSDVHRSHSGPTNASQFQPRFSRQEKVVENR RDDLQKY RSSVQ
Subjt:  GSRSQRSDVHRSHSGPTNASQFQPRFSRQEKVVENRVRDDLQKYSRSSVQ

A0A1S3BHI0 uncharacterized protein LOC103489910 | e value: 1.4e-127 | % identity: 94.4
Query:  MANSPDVDADDDFSELYKEYTGPPRSTTVTSQEKTNTNKRSHAGSDGEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT
        MANSPDVD DDDFSELYKEYTGPPRSTTV SQEKTNTNKRSHAGSD EDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT
Subjt:  MANSPDVDADDDFSELYKEYTGPPRSTTVTSQEKTNTNKRSHAGSDGEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT

Query:  QGCPSTLGSNRKSQDFFERVPARDKHVRAIFTDRVVQKIEKDVGCKIKMDEKFIIVSGKDRLILVKGVDAVHKLIKEEGDQKGSSSSHLSRSRSPDRSPV
        QGCPSTLGS+RKSQDFFERVPARDKHVRAIFTDRV+QKIEKDVGCKIKMDEKFIIVSGKDRLIL+KG+DAV+KLIKE+GDQKGSSSSH+S+SRSPDRSP 
Subjt:  QGCPSTLGSNRKSQDFFERVPARDKHVRAIFTDRVVQKIEKDVGCKIKMDEKFIIVSGKDRLILVKGVDAVHKLIKEEGDQKGSSSSHLSRSRSPDRSPV

Query:  GSRSQRSDVHRSHSGPTNASQFQPRFSRQEKVVENRVRDDLQKYSRSSVQ
        GSRSQRSDVHRSHSGPTNASQFQPRFSRQEKVVENR RDDLQKY RSSVQ
Subjt:  GSRSQRSDVHRSHSGPTNASQFQPRFSRQEKVVENRVRDDLQKYSRSSVQ

A0A5D3DAF0 Zinc knuckle family protein isoform 1 | e value: 1.4e-127 | % identity: 94.4
Query:  MANSPDVDADDDFSELYKEYTGPPRSTTVTSQEKTNTNKRSHAGSDGEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT
        MANSPDVD DDDFSELYKEYTGPPRSTTV SQEKTNTNKRSHAGSD EDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT
Subjt:  MANSPDVDADDDFSELYKEYTGPPRSTTVTSQEKTNTNKRSHAGSDGEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT

Query:  QGCPSTLGSNRKSQDFFERVPARDKHVRAIFTDRVVQKIEKDVGCKIKMDEKFIIVSGKDRLILVKGVDAVHKLIKEEGDQKGSSSSHLSRSRSPDRSPV
        QGCPSTLGS+RKSQDFFERVPARDKHVRAIFTDRV+QKIEKDVGCKIKMDEKFIIVSGKDRLIL+KG+DAV+KLIKE+GDQKGSSSSH+S+SRSPDRSP 
Subjt:  QGCPSTLGSNRKSQDFFERVPARDKHVRAIFTDRVVQKIEKDVGCKIKMDEKFIIVSGKDRLILVKGVDAVHKLIKEEGDQKGSSSSHLSRSRSPDRSPV

Query:  GSRSQRSDVHRSHSGPTNASQFQPRFSRQEKVVENRVRDDLQKYSRSSVQ
        GSRSQRSDVHRSHSGPTNASQFQPRFSRQEKVVENR RDDLQKY RSSVQ
Subjt:  GSRSQRSDVHRSHSGPTNASQFQPRFSRQEKVVENRVRDDLQKYSRSSVQ

A0A6J1DRY4 uncharacterized protein LOC111023835 isoform X1 | e value: 9.1e-127 | % identity: 92.06
Query:  MANSPDVDADDDFSELYKEYTGPPRSTTVTSQEKTNTNKRSHAGSDGEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT
        MANSPDVDADDDFSELYKEYTGPPRST+VT+QEKT T KRSHAGSD EDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT
Subjt:  MANSPDVDADDDFSELYKEYTGPPRSTTVTSQEKTNTNKRSHAGSDGEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT

Query:  QGCPSTLGSNRKSQDFFERVPARDKHVRAIFTDRVVQKIEKDVGCKIKMDEKFIIVSGKDRLILVKGVDAVHKLIKEEGDQKGSSSSHLSRSRSPDRSPV
        QGCPSTLGSNRKSQDFFER+PARDKHVRA+FTD+VVQKIEKD+GCKIK+DEKFIIVSGKDRLIL+KGVDAVHK+IKEEGDQKGSSSSH+SRSRSP+RSPV
Subjt:  QGCPSTLGSNRKSQDFFERVPARDKHVRAIFTDRVVQKIEKDVGCKIKMDEKFIIVSGKDRLILVKGVDAVHKLIKEEGDQKGSSSSHLSRSRSPDRSPV

Query:  GSRSQRSDVHRSHSGPTNASQFQPRFSRQEKVVENRVRDDLQKYSRSSVQGR
        GSRSQRS+VHRSHSGPTNASQFQPRFSR+EKVVENRVRDDLQKY R S+Q R
Subjt:  GSRSQRSDVHRSHSGPTNASQFQPRFSRQEKVVENRVRDDLQKYSRSSVQGR

A0A6J1DTF4 uncharacterized protein LOC111023835 isoform X3 | e value: 9.1e-127 | % identity: 92.06
Query:  MANSPDVDADDDFSELYKEYTGPPRSTTVTSQEKTNTNKRSHAGSDGEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT
        MANSPDVDADDDFSELYKEYTGPPRST+VT+QEKT T KRSHAGSD EDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT
Subjt:  MANSPDVDADDDFSELYKEYTGPPRSTTVTSQEKTNTNKRSHAGSDGEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT

Query:  QGCPSTLGSNRKSQDFFERVPARDKHVRAIFTDRVVQKIEKDVGCKIKMDEKFIIVSGKDRLILVKGVDAVHKLIKEEGDQKGSSSSHLSRSRSPDRSPV
        QGCPSTLGSNRKSQDFFER+PARDKHVRA+FTD+VVQKIEKD+GCKIK+DEKFIIVSGKDRLIL+KGVDAVHK+IKEEGDQKGSSSSH+SRSRSP+RSPV
Subjt:  QGCPSTLGSNRKSQDFFERVPARDKHVRAIFTDRVVQKIEKDVGCKIKMDEKFIIVSGKDRLILVKGVDAVHKLIKEEGDQKGSSSSHLSRSRSPDRSPV

Query:  GSRSQRSDVHRSHSGPTNASQFQPRFSRQEKVVENRVRDDLQKYSRSSVQGR
        GSRSQRS+VHRSHSGPTNASQFQPRFSR+EKVVENRVRDDLQKY R S+Q R
Subjt:  GSRSQRSDVHRSHSGPTNASQFQPRFSRQEKVVENRVRDDLQKYSRSSVQGR

SwissProt top hits (with e value, % identity and alignment)
No hits found
Arabidopsis top hits (with e value, % identity and alignment)
AT3G62330.1 Zinc knuckle (CCHC-type) family protein | e value: 3.2e-76 | % identity: 63.18
Query:  DVDADDDFSELYKEYTGPPRSTT---VTSQEKTNTNKRSHAGSDGEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFTQG
        D + DDDFSE+YKEYTGP  + T   +  ++K    +      + E++  DPN+VPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFTQG
Subjt:  DVDADDDFSELYKEYTGPPRSTT---VTSQEKTNTNKRSHAGSDGEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFTQG

Query:  CPSTLGSNRKSQDFFERVPARDKHVRAIFTDRVVQKIEKDVGCKIKMDEKFIIVSGKDRLILVKGVDAVHKLIKEEGDQKGSSSSHLSRSRSPDRSPVG-
        CPSTLG+NRKSQ+FFERVPARD +VR +FT++V++ IE++  CKIK+DEKFIIVSGKDRLIL KGVDAVHK +KE+G+ K SS SH SRSRSP R+ VG 
Subjt:  CPSTLGSNRKSQDFFERVPARDKHVRAIFTDRVVQKIEKDVGCKIKMDEKFIIVSGKDRLILVKGVDAVHKLIKEEGDQKGSSSSHLSRSRSPDRSPVG-

Query:  SRSQRSDVHRSHSGPTNASQFQPRFSRQEKVVENRVRDD
        SR++ S+  R       +S F  R  RQ+K V+NR R++
Subjt:  SRSQRSDVHRSHSGPTNASQFQPRFSRQEKVVENRVRDD


Sequences
CDS sequence
ATGGCAAATTCACCAGATGTGGATGCAGATGATGACTTTAGTGAACTCTACAAGGAGTACACAGGCCCTCCAAGATCGACCACTGTCACTTCACAAGAGAAGACAAATAC
AAATAAAAGGTCTCATGCTGGTTCTGATGGGGAGGATGAACCTCGTGATCCCAATGCTGTGCCAACTGATTTTACCAGCCGAGAAGCCAAGGTTTGGGAGGCCAAGTCGA
AAGCTACAGAGAGGAATTGGAAGAAGAGAAAAGAGGAAGAAATGATCTGCAAAATATGTGGTGAATCAGGCCATTTTACTCAGGGATGCCCCTCAACCTTGGGATCAAAT
CGTAAATCCCAAGATTTTTTTGAAAGGGTACCAGCCAGGGATAAACATGTGAGAGCAATTTTCACTGATAGAGTAGTACAAAAAATTGAAAAGGATGTTGGTTGTAAGAT
CAAGATGGATGAGAAATTCATAATTGTTAGTGGCAAGGACAGGTTAATTTTGGTAAAGGGAGTGGATGCTGTCCACAAGCTAATTAAGGAGGAAGGCGATCAAAAGGGTT
CTTCTAGTTCTCATTTGAGTAGATCCAGGTCACCTGATCGAAGCCCTGTTGGTTCAAGATCACAACGTTCTGATGTCCATAGATCACATTCTGGCCCTACAAATGCATCA
CAATTTCAACCTAGGTTTAGCAGACAGGAGAAGGTTGTTGAAAACCGCGTTCGTGATGATCTGCAGAAATATTCAAGGAGCTCGGTTCAAGGTAGGAAAAGCATAGGATA
TGTTTACGGCATCAATAGAAAGCTGCACAAGGTAAGTTCTGTTCAAGCTTATTTATGCGACTCTGAAGTAATTTTCAGTGATAGGATCTGA
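The CDS above can be checked against the listed protein by translating it codon by codon. The following is a minimal sketch that assumes the standard genetic code; the `translate` function and the `CODON_TABLE` name are illustrative only, and just the first 30 and last 12 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed to apply here).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a nucleotide string in frame 0; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


cds_start = "ATGGCAAATTCACCAGATGTGGATGCAGAT"  # first 30 nt of the CDS
cds_end = "GATAGGATCTGA"                      # last 12 nt of the CDS

print(translate(cds_start))  # N-terminal residues
print(translate(cds_end))    # C-terminal residues followed by the stop
```

The two fragments translate to `MANSPDVDAD` and `DRI*`, which agree with the start and end of the protein sequence given in this record.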
mRNA sequence
CTCTAAATCACAGGGGCATATATGTAATTTAACTAATACATCAAACCGTTCAAACCCTATCGGAGTCCCTGGTTTTATTCCGCGGCAGTTCGAGCCATAAAACACAGACC
CCCTACCTCTTCTCCGGCAGCAGCCTCCGCCTCCGGCAGCTTCTTCACTCCGGCGACTAGGAGCTCAACTGAATTGGCTCTTCTTGTGATGGCAAATTCACCAGATGTGG
ATGCAGATGATGACTTTAGTGAACTCTACAAGGAGTACACAGGCCCTCCAAGATCGACCACTGTCACTTCACAAGAGAAGACAAATACAAATAAAAGGTCTCATGCTGGT
TCTGATGGGGAGGATGAACCTCGTGATCCCAATGCTGTGCCAACTGATTTTACCAGCCGAGAAGCCAAGGTTTGGGAGGCCAAGTCGAAAGCTACAGAGAGGAATTGGAA
GAAGAGAAAAGAGGAAGAAATGATCTGCAAAATATGTGGTGAATCAGGCCATTTTACTCAGGGATGCCCCTCAACCTTGGGATCAAATCGTAAATCCCAAGATTTTTTTG
AAAGGGTACCAGCCAGGGATAAACATGTGAGAGCAATTTTCACTGATAGAGTAGTACAAAAAATTGAAAAGGATGTTGGTTGTAAGATCAAGATGGATGAGAAATTCATA
ATTGTTAGTGGCAAGGACAGGTTAATTTTGGTAAAGGGAGTGGATGCTGTCCACAAGCTAATTAAGGAGGAAGGCGATCAAAAGGGTTCTTCTAGTTCTCATTTGAGTAG
ATCCAGGTCACCTGATCGAAGCCCTGTTGGTTCAAGATCACAACGTTCTGATGTCCATAGATCACATTCTGGCCCTACAAATGCATCACAATTTCAACCTAGGTTTAGCA
GACAGGAGAAGGTTGTTGAAAACCGCGTTCGTGATGATCTGCAGAAATATTCAAGGAGCTCGGTTCAAGGTAGGAAAAGCATAGGATATGTTTACGGCATCAATAGAAAG
CTGCACAAGGTAAGTTCTGTTCAAGCTTATTTATGCGACTCTGAAGTAATTTTCAGTGATAGGATCTGA
Protein sequence
MANSPDVDADDDFSELYKEYTGPPRSTTVTSQEKTNTNKRSHAGSDGEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFTQGCPSTLGSN
RKSQDFFERVPARDKHVRAIFTDRVVQKIEKDVGCKIKMDEKFIIVSGKDRLILVKGVDAVHKLIKEEGDQKGSSSSHLSRSRSPDRSPVGSRSQRSDVHRSHSGPTNAS
QFQPRFSRQEKVVENRVRDDLQKYSRSSVQGRKSIGYVYGINRKLHKVSSVQAYLCDSEVIFSDRI
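The record annotates this protein with a CCHC-type zinc finger (IPR001878). As a rough illustration, the sketch below searches a fragment of the protein for the commonly cited zinc-knuckle spacing C-X2-C-X4-H-X4-C using a regular expression. The pattern is a simplification chosen for this example and is not how InterPro assigns the domain; the variable names are hypothetical.

```python
import re

# Simplified zinc-knuckle spacing: C-X2-C-X4-H-X4-C (an assumption for this sketch).
CCHC_PATTERN = re.compile(r"C.{2}C.{4}H.{4}C")

# Fragment copied from the first line of the protein sequence in this record.
protein_fragment = "RKEEEMICKICGESGHFTQGCPSTLGSN"

match = CCHC_PATTERN.search(protein_fragment)
if match:
    print(match.group(), "at offset", match.start(), "within the fragment")
else:
    print("no CCHC-like motif found")
```

The match `CKICGESGHFTQGC` lies in a stretch that is unchanged in the Arabidopsis AT3G62330.1 alignment shown earlier in this record.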