CuGenDBv2

Spg038704 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg038704
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: CCHC-type domain-containing protein
Genome location: scaffold12:4484151..4490406
RNA-Seq Expression: Spg038704
Synteny: Spg038704
Gene Ontology terms:
    GO:0003676 - nucleic acid binding (molecular function)
    GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
    IPR001878 - Zinc finger, CCHC-type
    IPR036875 - Zinc finger, CCHC-type superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAA0038119.1 Zinc knuckle family protein isoform 1 [Cucumis melo var. makuwa] (e value: 4.3e-126; % identity: 90.31)
Query:  SSEFALCVMANSPDVDADDDFSELYKEYTGPPRSTTVAAQEKTNTNKRSLAGSDEEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKI
        S+EFAL VMANSPDVD DDDFSELYKEYTGPPRSTTV +QEKTNTNKRS AGSDEEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKI
Subjt:  SSEFALCVMANSPDVDADDDFSELYKEYTGPPRSTTVAAQEKTNTNKRSLAGSDEEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKI

Query:  CGESGHFTQGCPSTLGSNRKSQDFFERVPARDKHVRALFSDKVVQKIEKDIGCKIKMDEKFIIVSGKDRLILVKGVDAVHKVIKEEGDQKGSSSSRMSRS
        CGESGHFTQGCPSTLGS+RKSQDFFERVPARDKHVRA+F+D+V+QKIEKD+GCKIKMDEKFIIVSGKDRLIL+KG+DAV+K+IKE+GDQKGSSSS MS+S
Subjt:  CGESGHFTQGCPSTLGSNRKSQDFFERVPARDKHVRALFSDKVVQKIEKDIGCKIKMDEKFIIVSGKDRLILVKGVDAVHKVIKEEGDQKGSSSSRMSRS

Query:  RSPERSPVGSRSQRSDVHRSHSGPANASQFQPRFSRQEKVVENRVRDDLQKYSRGSVQ
        RSP+RSP GSRSQRSDVHRSHSGP NASQFQPRFSRQEKVVENR RDDLQKY R SVQ
Subjt:  RSPERSPVGSRSQRSDVHRSHSGPANASQFQPRFSRQEKVVENRVRDDLQKYSRGSVQ

XP_022157005.1 uncharacterized protein LOC111023835 isoform X1 [Momordica charantia] (e value: 7.3e-126; % identity: 93.25)
Query:  MANSPDVDADDDFSELYKEYTGPPRSTTVAAQEKTNTNKRSLAGSDEEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT
        MANSPDVDADDDFSELYKEYTGPPRST+V AQEKT T KRS AGSDEEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT
Subjt:  MANSPDVDADDDFSELYKEYTGPPRSTTVAAQEKTNTNKRSLAGSDEEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT

Query:  QGCPSTLGSNRKSQDFFERVPARDKHVRALFSDKVVQKIEKDIGCKIKMDEKFIIVSGKDRLILVKGVDAVHKVIKEEGDQKGSSSSRMSRSRSPERSPV
        QGCPSTLGSNRKSQDFFER+PARDKHVRA+F+DKVVQKIEKDIGCKIK+DEKFIIVSGKDRLIL+KGVDAVHKVIKEEGDQKGSSSS MSRSRSPERSPV
Subjt:  QGCPSTLGSNRKSQDFFERVPARDKHVRALFSDKVVQKIEKDIGCKIKMDEKFIIVSGKDRLILVKGVDAVHKVIKEEGDQKGSSSSRMSRSRSPERSPV

Query:  GSRSQRSDVHRSHSGPANASQFQPRFSRQEKVVENRVRDDLQKYSRGSVQGR
        GSRSQRS+VHRSHSGP NASQFQPRFSR+EKVVENRVRDDLQKY RGS+Q R
Subjt:  GSRSQRSDVHRSHSGPANASQFQPRFSRQEKVVENRVRDDLQKYSRGSVQGR

XP_022157007.1 uncharacterized protein LOC111023835 isoform X3 [Momordica charantia] (e value: 7.3e-126; % identity: 93.25)
Query:  MANSPDVDADDDFSELYKEYTGPPRSTTVAAQEKTNTNKRSLAGSDEEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT
        MANSPDVDADDDFSELYKEYTGPPRST+V AQEKT T KRS AGSDEEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT
Subjt:  MANSPDVDADDDFSELYKEYTGPPRSTTVAAQEKTNTNKRSLAGSDEEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT

Query:  QGCPSTLGSNRKSQDFFERVPARDKHVRALFSDKVVQKIEKDIGCKIKMDEKFIIVSGKDRLILVKGVDAVHKVIKEEGDQKGSSSSRMSRSRSPERSPV
        QGCPSTLGSNRKSQDFFER+PARDKHVRA+F+DKVVQKIEKDIGCKIK+DEKFIIVSGKDRLIL+KGVDAVHKVIKEEGDQKGSSSS MSRSRSPERSPV
Subjt:  QGCPSTLGSNRKSQDFFERVPARDKHVRALFSDKVVQKIEKDIGCKIKMDEKFIIVSGKDRLILVKGVDAVHKVIKEEGDQKGSSSSRMSRSRSPERSPV

Query:  GSRSQRSDVHRSHSGPANASQFQPRFSRQEKVVENRVRDDLQKYSRGSVQGR
        GSRSQRS+VHRSHSGP NASQFQPRFSR+EKVVENRVRDDLQKY RGS+Q R
Subjt:  GSRSQRSDVHRSHSGPANASQFQPRFSRQEKVVENRVRDDLQKYSRGSVQGR

XP_038905438.1 uncharacterized protein LOC120091471 isoform X1 [Benincasa hispida] (e value: 1.5e-126; % identity: 93.65)
Query:  MANSPDVDADDDFSELYKEYTGPPRSTTVAAQEKTNTNKRSLAGSDEEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT
        MANSPDVDADDDFSELYKEYTGPPRSTTV +QEKTNTNKRS AGSDEE EPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT
Subjt:  MANSPDVDADDDFSELYKEYTGPPRSTTVAAQEKTNTNKRSLAGSDEEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT

Query:  QGCPSTLGSNRKSQDFFERVPARDKHVRALFSDKVVQKIEKDIGCKIKMDEKFIIVSGKDRLILVKGVDAVHKVIKEEGDQKGSSSSRMSRSRSPERSPV
        QGCPSTLGSNRKSQDFFERVPARDKHVRA+F+D+VVQKIEKD+GCKIKMDEKFIIVSGKDRLILVKGVDAVHK+IKEEGDQKGSSSS MSRSRSP+RSPV
Subjt:  QGCPSTLGSNRKSQDFFERVPARDKHVRALFSDKVVQKIEKDIGCKIKMDEKFIIVSGKDRLILVKGVDAVHKVIKEEGDQKGSSSSRMSRSRSPERSPV

Query:  GSRSQRSDVHRSHSGPANASQFQPRFSRQEKVVENRVRDDLQKYSRGSVQGR
        GSRSQRSDVHRSHSGP N+SQFQPRFSRQEKVVENRVRDDLQKYSR SVQ +
Subjt:  GSRSQRSDVHRSHSGPANASQFQPRFSRQEKVVENRVRDDLQKYSRGSVQGR

XP_038905439.1 uncharacterized protein LOC120091471 isoform X2 [Benincasa hispida] (e value: 1.9e-126; % identity: 94.4)
Query:  MANSPDVDADDDFSELYKEYTGPPRSTTVAAQEKTNTNKRSLAGSDEEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT
        MANSPDVDADDDFSELYKEYTGPPRSTTV +QEKTNTNKRS AGSDEE EPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT
Subjt:  MANSPDVDADDDFSELYKEYTGPPRSTTVAAQEKTNTNKRSLAGSDEEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT

Query:  QGCPSTLGSNRKSQDFFERVPARDKHVRALFSDKVVQKIEKDIGCKIKMDEKFIIVSGKDRLILVKGVDAVHKVIKEEGDQKGSSSSRMSRSRSPERSPV
        QGCPSTLGSNRKSQDFFERVPARDKHVRA+F+D+VVQKIEKD+GCKIKMDEKFIIVSGKDRLILVKGVDAVHK+IKEEGDQKGSSSS MSRSRSP+RSPV
Subjt:  QGCPSTLGSNRKSQDFFERVPARDKHVRALFSDKVVQKIEKDIGCKIKMDEKFIIVSGKDRLILVKGVDAVHKVIKEEGDQKGSSSSRMSRSRSPERSPV

Query:  GSRSQRSDVHRSHSGPANASQFQPRFSRQEKVVENRVRDDLQKYSRGSVQ
        GSRSQRSDVHRSHSGP N+SQFQPRFSRQEKVVENRVRDDLQKYSR SVQ
Subjt:  GSRSQRSDVHRSHSGPANASQFQPRFSRQEKVVENRVRDDLQKYSRGSVQ

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L866 CCHC-type domain-containing protein (e value: 7.9e-126; % identity: 90.7)
Query:  SSEFALCVMANSPDVDADDDFSELYKEYTGPPRSTTVAAQEKTNTNKRSLAGSDEEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKI
        S EFAL VMANSPDVDADDDFSELYKEYTGPPRSTTV  QEKTNTNKRS AGSDEEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICK 
Subjt:  SSEFALCVMANSPDVDADDDFSELYKEYTGPPRSTTVAAQEKTNTNKRSLAGSDEEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKI

Query:  CGESGHFTQGCPSTLGSNRKSQDFFERVPARDKHVRALFSDKVVQKIEKDIGCKIKMDEKFIIVSGKDRLILVKGVDAVHKVIKEEGDQKGSSSSRMSRS
        CGESGHFTQGCPSTLGS+RKSQDFFERVPARDKHVRA+F+D+V+QKIEKD+GCKIKMDEKFIIVSGKDRLIL+KG+DAV+K+IKE+GDQKGSSSS MSRS
Subjt:  CGESGHFTQGCPSTLGSNRKSQDFFERVPARDKHVRALFSDKVVQKIEKDIGCKIKMDEKFIIVSGKDRLILVKGVDAVHKVIKEEGDQKGSSSSRMSRS

Query:  RSPERSPVGSRSQRSDVHRSHSGPANASQFQPRFSRQEKVVENRVRDDLQKYSRGSVQ
        RSP+RSP GSRSQRSDVHRSHSGP NASQFQPRFSRQEKVVENR RDDLQKY R SVQ
Subjt:  RSPERSPVGSRSQRSDVHRSHSGPANASQFQPRFSRQEKVVENRVRDDLQKYSRGSVQ

A0A5D3DAF0 Zinc knuckle family protein isoform 1 (e value: 2.1e-126; % identity: 90.31)
Query:  SSEFALCVMANSPDVDADDDFSELYKEYTGPPRSTTVAAQEKTNTNKRSLAGSDEEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKI
        S+EFAL VMANSPDVD DDDFSELYKEYTGPPRSTTV +QEKTNTNKRS AGSDEEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKI
Subjt:  SSEFALCVMANSPDVDADDDFSELYKEYTGPPRSTTVAAQEKTNTNKRSLAGSDEEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKI

Query:  CGESGHFTQGCPSTLGSNRKSQDFFERVPARDKHVRALFSDKVVQKIEKDIGCKIKMDEKFIIVSGKDRLILVKGVDAVHKVIKEEGDQKGSSSSRMSRS
        CGESGHFTQGCPSTLGS+RKSQDFFERVPARDKHVRA+F+D+V+QKIEKD+GCKIKMDEKFIIVSGKDRLIL+KG+DAV+K+IKE+GDQKGSSSS MS+S
Subjt:  CGESGHFTQGCPSTLGSNRKSQDFFERVPARDKHVRALFSDKVVQKIEKDIGCKIKMDEKFIIVSGKDRLILVKGVDAVHKVIKEEGDQKGSSSSRMSRS

Query:  RSPERSPVGSRSQRSDVHRSHSGPANASQFQPRFSRQEKVVENRVRDDLQKYSRGSVQ
        RSP+RSP GSRSQRSDVHRSHSGP NASQFQPRFSRQEKVVENR RDDLQKY R SVQ
Subjt:  RSPERSPVGSRSQRSDVHRSHSGPANASQFQPRFSRQEKVVENRVRDDLQKYSRGSVQ

A0A6J1DRY4 uncharacterized protein LOC111023835 isoform X1 (e value: 3.5e-126; % identity: 93.25)
Query:  MANSPDVDADDDFSELYKEYTGPPRSTTVAAQEKTNTNKRSLAGSDEEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT
        MANSPDVDADDDFSELYKEYTGPPRST+V AQEKT T KRS AGSDEEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT
Subjt:  MANSPDVDADDDFSELYKEYTGPPRSTTVAAQEKTNTNKRSLAGSDEEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT

Query:  QGCPSTLGSNRKSQDFFERVPARDKHVRALFSDKVVQKIEKDIGCKIKMDEKFIIVSGKDRLILVKGVDAVHKVIKEEGDQKGSSSSRMSRSRSPERSPV
        QGCPSTLGSNRKSQDFFER+PARDKHVRA+F+DKVVQKIEKDIGCKIK+DEKFIIVSGKDRLIL+KGVDAVHKVIKEEGDQKGSSSS MSRSRSPERSPV
Subjt:  QGCPSTLGSNRKSQDFFERVPARDKHVRALFSDKVVQKIEKDIGCKIKMDEKFIIVSGKDRLILVKGVDAVHKVIKEEGDQKGSSSSRMSRSRSPERSPV

Query:  GSRSQRSDVHRSHSGPANASQFQPRFSRQEKVVENRVRDDLQKYSRGSVQGR
        GSRSQRS+VHRSHSGP NASQFQPRFSR+EKVVENRVRDDLQKY RGS+Q R
Subjt:  GSRSQRSDVHRSHSGPANASQFQPRFSRQEKVVENRVRDDLQKYSRGSVQGR

A0A6J1DTF4 uncharacterized protein LOC111023835 isoform X3 (e value: 3.5e-126; % identity: 93.25)
Query:  MANSPDVDADDDFSELYKEYTGPPRSTTVAAQEKTNTNKRSLAGSDEEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT
        MANSPDVDADDDFSELYKEYTGPPRST+V AQEKT T KRS AGSDEEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT
Subjt:  MANSPDVDADDDFSELYKEYTGPPRSTTVAAQEKTNTNKRSLAGSDEEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT

Query:  QGCPSTLGSNRKSQDFFERVPARDKHVRALFSDKVVQKIEKDIGCKIKMDEKFIIVSGKDRLILVKGVDAVHKVIKEEGDQKGSSSSRMSRSRSPERSPV
        QGCPSTLGSNRKSQDFFER+PARDKHVRA+F+DKVVQKIEKDIGCKIK+DEKFIIVSGKDRLIL+KGVDAVHKVIKEEGDQKGSSSS MSRSRSPERSPV
Subjt:  QGCPSTLGSNRKSQDFFERVPARDKHVRALFSDKVVQKIEKDIGCKIKMDEKFIIVSGKDRLILVKGVDAVHKVIKEEGDQKGSSSSRMSRSRSPERSPV

Query:  GSRSQRSDVHRSHSGPANASQFQPRFSRQEKVVENRVRDDLQKYSRGSVQGR
        GSRSQRS+VHRSHSGP NASQFQPRFSR+EKVVENRVRDDLQKY RGS+Q R
Subjt:  GSRSQRSDVHRSHSGPANASQFQPRFSRQEKVVENRVRDDLQKYSRGSVQGR

A0A6J1DWP5 uncharacterized protein LOC111023835 isoform X2 (e value: 1.3e-125; % identity: 93.6)
Query:  MANSPDVDADDDFSELYKEYTGPPRSTTVAAQEKTNTNKRSLAGSDEEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT
        MANSPDVDADDDFSELYKEYTGPPRST+V AQEKT T KRS AGSDEEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT
Subjt:  MANSPDVDADDDFSELYKEYTGPPRSTTVAAQEKTNTNKRSLAGSDEEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFT

Query:  QGCPSTLGSNRKSQDFFERVPARDKHVRALFSDKVVQKIEKDIGCKIKMDEKFIIVSGKDRLILVKGVDAVHKVIKEEGDQKGSSSSRMSRSRSPERSPV
        QGCPSTLGSNRKSQDFFER+PARDKHVRA+F+DKVVQKIEKDIGCKIK+DEKFIIVSGKDRLIL+KGVDAVHKVIKEEGDQKGSSSS MSRSRSPERSPV
Subjt:  QGCPSTLGSNRKSQDFFERVPARDKHVRALFSDKVVQKIEKDIGCKIKMDEKFIIVSGKDRLILVKGVDAVHKVIKEEGDQKGSSSSRMSRSRSPERSPV

Query:  GSRSQRSDVHRSHSGPANASQFQPRFSRQEKVVENRVRDDLQKYSRGSVQ
        GSRSQRS+VHRSHSGP NASQFQPRFSR+EKVVENRVRDDLQKY RGS+Q
Subjt:  GSRSQRSDVHRSHSGPANASQFQPRFSRQEKVVENRVRDDLQKYSRGSVQ

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT3G62330.1 Zinc knuckle (CCHC-type) family protein (e value: 3.3e-76; % identity: 61.96)
Query:  DVDADDDFSELYKEYTGPPRSTT---VAAQEKTNTNKRSLAGSDEEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFTQG
        D + DDDFSE+YKEYTGP  + T   +  ++K    +      +EE++  DPN+VPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFTQG
Subjt:  DVDADDDFSELYKEYTGPPRSTT---VAAQEKTNTNKRSLAGSDEEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFTQG

Query:  CPSTLGSNRKSQDFFERVPARDKHVRALFSDKVVQKIEKDIGCKIKMDEKFIIVSGKDRLILVKGVDAVHKVIKEEGDQKGSSSSRMSRSRSPERSPVG-
        CPSTLG+NRKSQ+FFERVPARD +VR LF++KV++ IE++  CKIK+DEKFIIVSGKDRLIL KGVDAVHKV KE+G+ K SS S  SRSRSP R+ VG 
Subjt:  CPSTLGSNRKSQDFFERVPARDKHVRALFSDKVVQKIEKDIGCKIKMDEKFIIVSGKDRLILVKGVDAVHKVIKEEGDQKGSSSSRMSRSRSPERSPVG-

Query:  SRSQRSDVHRSHSGPANASQFQPRFSRQEKVVEN------RVRDDLQKYSRGSVQ
        SR++ S+  R       +S F  R  RQ+K V+N      RVR++ +   RGS Q
Subjt:  SRSQRSDVHRSHSGPANASQFQPRFSRQEKVVEN------RVRDDLQKYSRGSVQ
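
The alignment blocks above follow the usual BLAST pairwise layout, where the unlabeled middle row marks each column: a letter for an identical residue, `+` for a conservative substitution, and a space for a mismatch or gap. A minimal Python sketch of how those rows could be tallied is given here; the function name `midline_stats` and its return keys are hypothetical, it uses only the Query and middle rows, and the result for a single block need not match the % identity reported for the full hit.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Tally one BLAST-style alignment block from its Query row and middle row.

    Assumes both strings have had the 8-character row label stripped, so that
    column i of `midline` describes column i of `query`.
    """
    width = len(query)
    # Trailing spaces on the middle row are often lost in copied text, so pad
    # it back to the width of the Query row before counting.
    padded = midline.ljust(width)[:width]
    identical = sum(1 for ch in padded if ch.isalpha())
    positive = identical + padded.count("+")
    return {
        "columns": width,
        "identical": identical,
        "positive": positive,
        "percent_identity": round(100.0 * identical / width, 2),
    }


# First ten columns of the first GenBank alignment block on this page.
print(midline_stats("SSEFALCVMA", "S+EFAL VMA"))
```

Summing `identical` and `columns` over all blocks of a hit before dividing gives a per-hit figure rather than a per-block one.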


Sequences
CDS sequence
CTTCCTCTCATAAACAAATACTCACGTACATCATTGTTGTGGAGAGGAGTCAAATGTCAATACCCCAAATCACAGGGGCATATAAGTAATTTAACAATAGGGTTCAAACC
CTATCGGAGTCGCTGGTTTTTATTCCGCGGCAGTTCGAGCCATAAAACACAGACCACCTCCTTCTCCTCCGGCAGCAGCCTCCGCCTCCGGCAGCTTCTTCACTCCGGCG
ACAAGTGTGCTTTGTTTTACGTTGACGATGTGTACCACGCTCTCTCTCGCCTTGGCGGCCACAGGGAGTTGTGGCGTGGCGTTCGAGTGGGCTCATCTGAATTTGCTCTT
TGTGTGATGGCAAATTCACCAGATGTGGATGCGGATGATGACTTTAGTGAACTCTACAAGGAGTACACTGGTCCCCCACGATCGACCACTGTTGCTGCACAAGAGAAGAC
GAATACGAATAAAAGGTCTCTTGCAGGTTCTGATGAGGAGGATGAACCTCGTGATCCCAATGCTGTGCCAACTGATTTTACCAGCCGAGAAGCCAAGGTTTGGGAGGCCA
AGTCAAAAGCTACAGAGAGGAATTGGAAGAAGAGAAAAGAGGAAGAAATGATCTGCAAAATATGTGGAGAATCCGGCCATTTTACTCAGGGATGCCCCTCTACCTTGGGA
TCAAATCGTAAATCTCAAGATTTTTTTGAAAGGGTACCAGCTAGGGATAAACATGTGAGAGCACTTTTCAGTGATAAAGTAGTACAGAAAATTGAAAAGGATATTGGTTG
TAAGATCAAAATGGACGAGAAATTCATAATTGTTAGTGGCAAGGACAGATTAATTTTGGTAAAGGGAGTGGATGCTGTACACAAGGTAATTAAGGAGGAAGGTGATCAAA
AGGGTTCTTCTAGTTCTCGTATGAGTAGATCCAGGTCACCCGAGCGAAGCCCAGTTGGTTCAAGATCACAACGTTCTGATGTCCATAGATCACATTCTGGTCCTGCAAAT
GCATCACAATTTCAACCTAGGTTTAGCAGACAGGAGAAGGTTGTTGAAAACCGTGTTCGTGATGATCTGCAGAAATATTCAAGGGGTTCGGTTCAAGGTAGGAAAAGCAT
AGGATATGTTTACGGCATCAATAGAAAGCTGGTAATTCTATGTTGTAGCTTGTAG
mRNA sequence
CTTCCTCTCATAAACAAATACTCACGTACATCATTGTTGTGGAGAGGAGTCAAATGTCAATACCCCAAATCACAGGGGCATATAAGTAATTTAACAATAGGGTTCAAACC
CTATCGGAGTCGCTGGTTTTTATTCCGCGGCAGTTCGAGCCATAAAACACAGACCACCTCCTTCTCCTCCGGCAGCAGCCTCCGCCTCCGGCAGCTTCTTCACTCCGGCG
ACAAGTGTGCTTTGTTTTACGTTGACGATGTGTACCACGCTCTCTCTCGCCTTGGCGGCCACAGGGAGTTGTGGCGTGGCGTTCGAGTGGGCTCATCTGAATTTGCTCTT
TGTGTGATGGCAAATTCACCAGATGTGGATGCGGATGATGACTTTAGTGAACTCTACAAGGAGTACACTGGTCCCCCACGATCGACCACTGTTGCTGCACAAGAGAAGAC
GAATACGAATAAAAGGTCTCTTGCAGGTTCTGATGAGGAGGATGAACCTCGTGATCCCAATGCTGTGCCAACTGATTTTACCAGCCGAGAAGCCAAGGTTTGGGAGGCCA
AGTCAAAAGCTACAGAGAGGAATTGGAAGAAGAGAAAAGAGGAAGAAATGATCTGCAAAATATGTGGAGAATCCGGCCATTTTACTCAGGGATGCCCCTCTACCTTGGGA
TCAAATCGTAAATCTCAAGATTTTTTTGAAAGGGTACCAGCTAGGGATAAACATGTGAGAGCACTTTTCAGTGATAAAGTAGTACAGAAAATTGAAAAGGATATTGGTTG
TAAGATCAAAATGGACGAGAAATTCATAATTGTTAGTGGCAAGGACAGATTAATTTTGGTAAAGGGAGTGGATGCTGTACACAAGGTAATTAAGGAGGAAGGTGATCAAA
AGGGTTCTTCTAGTTCTCGTATGAGTAGATCCAGGTCACCCGAGCGAAGCCCAGTTGGTTCAAGATCACAACGTTCTGATGTCCATAGATCACATTCTGGTCCTGCAAAT
GCATCACAATTTCAACCTAGGTTTAGCAGACAGGAGAAGGTTGTTGAAAACCGTGTTCGTGATGATCTGCAGAAATATTCAAGGGGTTCGGTTCAAGGTAGGAAAAGCAT
AGGATATGTTTACGGCATCAATAGAAAGCTGGTAATTCTATGTTGTAGCTTGTAG
Protein sequence
LPLINKYSRTSLLWRGVKCQYPKSQGHISNLTIGFKPYRSRWFLFRGSSSHKTQTTSFSSGSSLRLRQLLHSGDKCALFYVDDVYHALSRLGGHRELWRGVRVGSSEFAL
CVMANSPDVDADDDFSELYKEYTGPPRSTTVAAQEKTNTNKRSLAGSDEEDEPRDPNAVPTDFTSREAKVWEAKSKATERNWKKRKEEEMICKICGESGHFTQGCPSTLG
SNRKSQDFFERVPARDKHVRALFSDKVVQKIEKDIGCKIKMDEKFIIVSGKDRLILVKGVDAVHKVIKEEGDQKGSSSSRMSRSRSPERSPVGSRSQRSDVHRSHSGPAN
ASQFQPRFSRQEKVVENRVRDDLQKYSRGSVQGRKSIGYVYGINRKLVILCCSL
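
The protein sequence above reads as a frame-1 translation of the CDS sequence under the standard genetic code. A minimal Python sketch of that translation is given here as a way to check the correspondence; the helper name `translate` and its `to_stop` parameter are hypothetical, and nothing here is taken from how the database itself derives the protein.

```python
# Standard genetic code, with codons enumerated in T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(nucleotides: str, to_stop: bool = True) -> str:
    """Translate a DNA string in frame 1 using the standard genetic code.

    Whitespace and line breaks are ignored, so the wrapped sequence lines can
    be pasted in directly. Codons containing anything other than A, C, G, T
    become 'X'. With to_stop=True, translation ends before the first stop.
    """
    seq = "".join(nucleotides.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE.get(seq[pos:pos + 3], "X")
        if residue == "*" and to_stop:
            break
        protein.append(residue)
    return "".join(protein)


# First 27 nucleotides of the CDS sequence on this page.
print(translate("CTTCCTCTCATAAACAAATACTCACGT"))  # prints LPLINKYSR
```

The printed residues match the first nine residues of the protein sequence listed above (LPLINKYSR).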