; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi08G003230 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi08G003230
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionGag/pol protein
Genome locationchr08:8692842..8735410
RNA-Seq ExpressionLsi08G003230
SyntenyLsi08G003230
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035879.1 gag/pol protein [Cucumis melo var. makuwa]1.4e-7067.23Show/hide
Query:  KHEAMFTAREIMESLQDMFGQPSTQIRHEAFKYVYNARMKKGQSVREHVFDLMVHFNVAEMNGAVIDEQSHVSFILESLPTSFLQFRSNAVMNKIHYTIT
        KHE+M TAREIM+SLQ+MFGQ S QI+H+A KY+YNARM +G SVREHV ++MVHFNVAEMNGAVIDE S VSFILESLP SFLQFRSNAVMNKI YT+T
Subjt:  KHEAMFTAREIMESLQDMFGQPSTQIRHEAFKYVYNARMKKGQSVREHVFDLMVHFNVAEMNGAVIDEQSHVSFILESLPTSFLQFRSNAVMNKIHYTIT

Query:  TLLNELQTFQSLMKNKRTKEGEANVAHS-KKFHKGSTSRTKS------------KKDGKG-KAPAAGKGKSKAKKADKGKCFHCNVDGHWKRNCPKYVTE
        TLLNELQTF+SLMK K  K GEANVA S +KFH+GSTS TKS            KK G+G KA  A    +K  KA KG CFHCN +GHWKRNCPKY+ E
Subjt:  TLLNELQTFQSLMKNKRTKEGEANVAHS-KKFHKGSTSRTKS------------KKDGKG-KAPAAGKGKSKAKKADKGKCFHCNVDGHWKRNCPKYVTE

Query:  NKEKE---------ETCLVENDYNAWILDSGATNH
         K+ +         ETCLVEND +AWI+DSGATNH
Subjt:  NKEKE---------ETCLVENDYNAWILDSGATNH

KAA0044955.1 gag/pol protein [Cucumis melo var. makuwa]1.4e-7067.23Show/hide
Query:  KHEAMFTAREIMESLQDMFGQPSTQIRHEAFKYVYNARMKKGQSVREHVFDLMVHFNVAEMNGAVIDEQSHVSFILESLPTSFLQFRSNAVMNKIHYTIT
        KHE+M TAREIM+SLQ+MFGQ S QI+H+A KY+YNARM +G SVREHV ++MVHFNVAEMNGAVIDE S VSFILESLP SFLQFRSNAVMNKI YT+T
Subjt:  KHEAMFTAREIMESLQDMFGQPSTQIRHEAFKYVYNARMKKGQSVREHVFDLMVHFNVAEMNGAVIDEQSHVSFILESLPTSFLQFRSNAVMNKIHYTIT

Query:  TLLNELQTFQSLMKNKRTKEGEANVAHS-KKFHKGSTSRTKS------------KKDGKG-KAPAAGKGKSKAKKADKGKCFHCNVDGHWKRNCPKYVTE
        TLLNELQTF+SLMK K  K GEANVA S +KFH+GSTS TKS            KK G+G KA  A    +K  KA KG CFHCN +GHWKRNCPKY+ E
Subjt:  TLLNELQTFQSLMKNKRTKEGEANVAHS-KKFHKGSTSRTKS------------KKDGKG-KAPAAGKGKSKAKKADKGKCFHCNVDGHWKRNCPKYVTE

Query:  NKEKE---------ETCLVENDYNAWILDSGATNH
         K+ +         ETCLVEND +AWI+DSGATNH
Subjt:  NKEKE---------ETCLVENDYNAWILDSGATNH

KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa]1.4e-7067.23Show/hide
Query:  KHEAMFTAREIMESLQDMFGQPSTQIRHEAFKYVYNARMKKGQSVREHVFDLMVHFNVAEMNGAVIDEQSHVSFILESLPTSFLQFRSNAVMNKIHYTIT
        KHE+M TAREIM+SLQ+MFGQ S QI+H+A KY+YNARM +G SVREHV ++MVHFNVAEMNGAVIDE S VSFILESLP SFLQFRSNAVMNKI YT+T
Subjt:  KHEAMFTAREIMESLQDMFGQPSTQIRHEAFKYVYNARMKKGQSVREHVFDLMVHFNVAEMNGAVIDEQSHVSFILESLPTSFLQFRSNAVMNKIHYTIT

Query:  TLLNELQTFQSLMKNKRTKEGEANVAHS-KKFHKGSTSRTKS------------KKDGKG-KAPAAGKGKSKAKKADKGKCFHCNVDGHWKRNCPKYVTE
        TLLNELQTF+SLMK K  K GEANVA S +KFH+GSTS TKS            KK G+G KA  A    +K  KA KG CFHCN +GHWKRNCPKY+ E
Subjt:  TLLNELQTFQSLMKNKRTKEGEANVAHS-KKFHKGSTSRTKS------------KKDGKG-KAPAAGKGKSKAKKADKGKCFHCNVDGHWKRNCPKYVTE

Query:  NKEKE---------ETCLVENDYNAWILDSGATNH
         K+ +         ETCLVEND +AWI+DSGATNH
Subjt:  NKEKE---------ETCLVENDYNAWILDSGATNH

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa]1.4e-7067.23Show/hide
Query:  KHEAMFTAREIMESLQDMFGQPSTQIRHEAFKYVYNARMKKGQSVREHVFDLMVHFNVAEMNGAVIDEQSHVSFILESLPTSFLQFRSNAVMNKIHYTIT
        KHE+M TAREIM+SLQ+MFGQ S QI+H+A KY+YNARM +G SVREHV ++MVHFNVAEMNGAVIDE S VSFILESLP SFLQFRSNAVMNKI YT+T
Subjt:  KHEAMFTAREIMESLQDMFGQPSTQIRHEAFKYVYNARMKKGQSVREHVFDLMVHFNVAEMNGAVIDEQSHVSFILESLPTSFLQFRSNAVMNKIHYTIT

Query:  TLLNELQTFQSLMKNKRTKEGEANVAHS-KKFHKGSTSRTKS------------KKDGKG-KAPAAGKGKSKAKKADKGKCFHCNVDGHWKRNCPKYVTE
        TLLNELQTF+SLMK K  K GEANVA S +KFH+GSTS TKS            KK G+G KA  A    +K  KA KG CFHCN +GHWKRNCPKY+ E
Subjt:  TLLNELQTFQSLMKNKRTKEGEANVAHS-KKFHKGSTSRTKS------------KKDGKG-KAPAAGKGKSKAKKADKGKCFHCNVDGHWKRNCPKYVTE

Query:  NKEKE---------ETCLVENDYNAWILDSGATNH
         K+ +         ETCLVEND +AWI+DSGATNH
Subjt:  NKEKE---------ETCLVENDYNAWILDSGATNH

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]1.4e-7067.23Show/hide
Query:  KHEAMFTAREIMESLQDMFGQPSTQIRHEAFKYVYNARMKKGQSVREHVFDLMVHFNVAEMNGAVIDEQSHVSFILESLPTSFLQFRSNAVMNKIHYTIT
        KHE+M TAREIM+SLQ+MFGQ S QI+H+A KY+YNARM +G SVREHV ++MVHFNVAEMNGAVIDE S VSFILESLP SFLQFRSNAVMNKI YT+T
Subjt:  KHEAMFTAREIMESLQDMFGQPSTQIRHEAFKYVYNARMKKGQSVREHVFDLMVHFNVAEMNGAVIDEQSHVSFILESLPTSFLQFRSNAVMNKIHYTIT

Query:  TLLNELQTFQSLMKNKRTKEGEANVAHS-KKFHKGSTSRTKS------------KKDGKG-KAPAAGKGKSKAKKADKGKCFHCNVDGHWKRNCPKYVTE
        TLLNELQTF+SLMK K  K GEANVA S +KFH+GSTS TKS            KK G+G KA  A    +K  KA KG CFHCN +GHWKRNCPKY+ E
Subjt:  TLLNELQTFQSLMKNKRTKEGEANVAHS-KKFHKGSTSRTKS------------KKDGKG-KAPAAGKGKSKAKKADKGKCFHCNVDGHWKRNCPKYVTE

Query:  NKEKE---------ETCLVENDYNAWILDSGATNH
         K+ +         ETCLVEND +AWI+DSGATNH
Subjt:  NKEKE---------ETCLVENDYNAWILDSGATNH

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein6.7e-7167.23Show/hide
Query:  KHEAMFTAREIMESLQDMFGQPSTQIRHEAFKYVYNARMKKGQSVREHVFDLMVHFNVAEMNGAVIDEQSHVSFILESLPTSFLQFRSNAVMNKIHYTIT
        KHE+M TAREIM+SLQ+MFGQ S QI+H+A KY+YNARM +G SVREHV ++MVHFNVAEMNGAVIDE S VSFILESLP SFLQFRSNAVMNKI YT+T
Subjt:  KHEAMFTAREIMESLQDMFGQPSTQIRHEAFKYVYNARMKKGQSVREHVFDLMVHFNVAEMNGAVIDEQSHVSFILESLPTSFLQFRSNAVMNKIHYTIT

Query:  TLLNELQTFQSLMKNKRTKEGEANVAHS-KKFHKGSTSRTKS------------KKDGKG-KAPAAGKGKSKAKKADKGKCFHCNVDGHWKRNCPKYVTE
        TLLNELQTF+SLMK K  K GEANVA S +KFH+GSTS TKS            KK G+G KA  A    +K  KA KG CFHCN +GHWKRNCPKY+ E
Subjt:  TLLNELQTFQSLMKNKRTKEGEANVAHS-KKFHKGSTSRTKS------------KKDGKG-KAPAAGKGKSKAKKADKGKCFHCNVDGHWKRNCPKYVTE

Query:  NKEKE---------ETCLVENDYNAWILDSGATNH
         K+ +         ETCLVEND +AWI+DSGATNH
Subjt:  NKEKE---------ETCLVENDYNAWILDSGATNH

A0A5A7TU93 Gag/pol protein6.7e-7167.23Show/hide
Query:  KHEAMFTAREIMESLQDMFGQPSTQIRHEAFKYVYNARMKKGQSVREHVFDLMVHFNVAEMNGAVIDEQSHVSFILESLPTSFLQFRSNAVMNKIHYTIT
        KHE+M TAREIM+SLQ+MFGQ S QI+H+A KY+YNARM +G SVREHV ++MVHFNVAEMNGAVIDE S VSFILESLP SFLQFRSNAVMNKI YT+T
Subjt:  KHEAMFTAREIMESLQDMFGQPSTQIRHEAFKYVYNARMKKGQSVREHVFDLMVHFNVAEMNGAVIDEQSHVSFILESLPTSFLQFRSNAVMNKIHYTIT

Query:  TLLNELQTFQSLMKNKRTKEGEANVAHS-KKFHKGSTSRTKS------------KKDGKG-KAPAAGKGKSKAKKADKGKCFHCNVDGHWKRNCPKYVTE
        TLLNELQTF+SLMK K  K GEANVA S +KFH+GSTS TKS            KK G+G KA  A    +K  KA KG CFHCN +GHWKRNCPKY+ E
Subjt:  TLLNELQTFQSLMKNKRTKEGEANVAHS-KKFHKGSTSRTKS------------KKDGKG-KAPAAGKGKSKAKKADKGKCFHCNVDGHWKRNCPKYVTE

Query:  NKEKE---------ETCLVENDYNAWILDSGATNH
         K+ +         ETCLVEND +AWI+DSGATNH
Subjt:  NKEKE---------ETCLVENDYNAWILDSGATNH

A0A5A7TWB9 Gag/pol protein6.7e-7167.23Show/hide
Query:  KHEAMFTAREIMESLQDMFGQPSTQIRHEAFKYVYNARMKKGQSVREHVFDLMVHFNVAEMNGAVIDEQSHVSFILESLPTSFLQFRSNAVMNKIHYTIT
        KHE+M TAREIM+SLQ+MFGQ S QI+H+A KY+YNARM +G SVREHV ++MVHFNVAEMNGAVIDE S VSFILESLP SFLQFRSNAVMNKI YT+T
Subjt:  KHEAMFTAREIMESLQDMFGQPSTQIRHEAFKYVYNARMKKGQSVREHVFDLMVHFNVAEMNGAVIDEQSHVSFILESLPTSFLQFRSNAVMNKIHYTIT

Query:  TLLNELQTFQSLMKNKRTKEGEANVAHS-KKFHKGSTSRTKS------------KKDGKG-KAPAAGKGKSKAKKADKGKCFHCNVDGHWKRNCPKYVTE
        TLLNELQTF+SLMK K  K GEANVA S +KFH+GSTS TKS            KK G+G KA  A    +K  KA KG CFHCN +GHWKRNCPKY+ E
Subjt:  TLLNELQTFQSLMKNKRTKEGEANVAHS-KKFHKGSTSRTKS------------KKDGKG-KAPAAGKGKSKAKKADKGKCFHCNVDGHWKRNCPKYVTE

Query:  NKEKE---------ETCLVENDYNAWILDSGATNH
         K+ +         ETCLVEND +AWI+DSGATNH
Subjt:  NKEKE---------ETCLVENDYNAWILDSGATNH

A0A5A7V4M1 Gag/pol protein6.7e-7167.23Show/hide
Query:  KHEAMFTAREIMESLQDMFGQPSTQIRHEAFKYVYNARMKKGQSVREHVFDLMVHFNVAEMNGAVIDEQSHVSFILESLPTSFLQFRSNAVMNKIHYTIT
        KHE+M TAREIM+SLQ+MFGQ S QI+H+A KY+YNARM +G SVREHV ++MVHFNVAEMNGAVIDE S VSFILESLP SFLQFRSNAVMNKI YT+T
Subjt:  KHEAMFTAREIMESLQDMFGQPSTQIRHEAFKYVYNARMKKGQSVREHVFDLMVHFNVAEMNGAVIDEQSHVSFILESLPTSFLQFRSNAVMNKIHYTIT

Query:  TLLNELQTFQSLMKNKRTKEGEANVAHS-KKFHKGSTSRTKS------------KKDGKG-KAPAAGKGKSKAKKADKGKCFHCNVDGHWKRNCPKYVTE
        TLLNELQTF+SLMK K  K GEANVA S +KFH+GSTS TKS            KK G+G KA  A    +K  KA KG CFHCN +GHWKRNCPKY+ E
Subjt:  TLLNELQTFQSLMKNKRTKEGEANVAHS-KKFHKGSTSRTKS------------KKDGKG-KAPAAGKGKSKAKKADKGKCFHCNVDGHWKRNCPKYVTE

Query:  NKEKE---------ETCLVENDYNAWILDSGATNH
         K+ +         ETCLVEND +AWI+DSGATNH
Subjt:  NKEKE---------ETCLVENDYNAWILDSGATNH

A0A5D3CPJ6 Gag/pol protein6.7e-7167.23Show/hide
Query:  KHEAMFTAREIMESLQDMFGQPSTQIRHEAFKYVYNARMKKGQSVREHVFDLMVHFNVAEMNGAVIDEQSHVSFILESLPTSFLQFRSNAVMNKIHYTIT
        KHE+M TAREIM+SLQ+MFGQ S QI+H+A KY+YNARM +G SVREHV ++MVHFNVAEMNGAVIDE S VSFILESLP SFLQFRSNAVMNKI YT+T
Subjt:  KHEAMFTAREIMESLQDMFGQPSTQIRHEAFKYVYNARMKKGQSVREHVFDLMVHFNVAEMNGAVIDEQSHVSFILESLPTSFLQFRSNAVMNKIHYTIT

Query:  TLLNELQTFQSLMKNKRTKEGEANVAHS-KKFHKGSTSRTKS------------KKDGKG-KAPAAGKGKSKAKKADKGKCFHCNVDGHWKRNCPKYVTE
        TLLNELQTF+SLMK K  K GEANVA S +KFH+GSTS TKS            KK G+G KA  A    +K  KA KG CFHCN +GHWKRNCPKY+ E
Subjt:  TLLNELQTFQSLMKNKRTKEGEANVAHS-KKFHKGSTSRTKS------------KKDGKG-KAPAAGKGKSKAKKADKGKCFHCNVDGHWKRNCPKYVTE

Query:  NKEKE---------ETCLVENDYNAWILDSGATNH
         K+ +         ETCLVEND +AWI+DSGATNH
Subjt:  NKEKE---------ETCLVENDYNAWILDSGATNH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATATTGTTGAATATATATATGAATGTCAACTTGTATACTTTTCAACCATGTACAAGAATAATATTTCTGGAATTATGGAGTTCCATTGGGTGATTATGTTGTTTTT
GGTGTGTGACTATAATTTGGCTCGCATGTCAATGTTAAGGGAGAGGAGCTTAGACAGTTCATTGGTTGTAGTGACTTCTTGTAAAGATTGGATACCAAGAATGTTTGATA
AAGATTGGATTCATTTGAACAGGATGCATAAAAAGGAATTGCTACTCTCGAATAAAAGACTTGCCAATGATGAGTCTCCTAATAAAGGTTCACTCGGAACAGTTTCCCCT
TTGGTTGAAAGAGTTTCATTAGACGTTGGGAACAAAAACGACTCAAATTTATTGAAATGGCTTGCAAATGGACCTCGCAAGATTGCAATGTCATATTTAGGACAAATATG
GAGAATGTTGACACAAACTTTGTGGGTGAAGGATCCTTCATATCGCTTGACGATATTGATGATGAGTTTGAACGTCGTAACAATGCACAAAATGATTGTGAAGGCATCTT
GTTATGATAACGTAGAACGTATGGATTTGGATAAGCAAAGAACTGAACCAAACAAGGGTAAGACAACCTTGGTGAAAACATCTTGGAGATTTGCAAAGTCATTTAGTATG
GACTTAACCAATATGGCCAGTGAAGACGAGGAGGATGAGGACCAAGCTGCTTATTCAAAGAATGTAAAAAGGAATATAGCATATGAACTAGATAATGTCGTGAAGGCTAG
TGTAGAGAGAGAGGTTATGGAAAAATCTCCTTTTTCTGACCCACTTCACCACAACAATAGGGATCCTTCACTTGTTACTACTCCACTTCCTTCAAGTGTCAATGAAATTG
TTTGTCCCGAGCCGAAAAGACCACAAATTAACCCATTAGATTCCCCTGCTATTCGTACTAGATCAACAATGCCTATAACTTTCGTTGAAAAAATCAGCGAGCAAAATTCT
CATAATACTACTGTTAGAGTACCACCATCAACAAATGCTCCAGCTCCAATCTTAAATTCTCCACCGTGGGAGAAAAATGGTATATATGATATGTCAATTTTTATCTCTCA
CATCCCTAAAAGTTCACACCGTGAGATCCATGCTCGACTTCGTGTCGCCTTGGGCGTGGCCTCCCTTCGAAAAGTGTTTGCATGGGTCAATACCAAGGTGAATAGGAAAA
ATGTTCATAATAAGTGGGAGGAGGACACGTCTCAACGTTTTCTACGGTCTCCTTCATTAGGTTGCAACAAACATGAGGCCATGTTCACTGCTCGTGAGATCATGGAGTCG
CTGCAGGATATGTTTGGGCAACCGTCCACACAGATCCGGCACGAGGCCTTCAAGTACGTTTATAACGCGCGTATGAAGAAAGGCCAATCAGTGAGAGAACATGTTTTCGA
CCTCATGGTTCACTTCAACGTCGCTGAAATGAACGGTGCAGTCATAGATGAGCAAAGTCACGTGTCCTTTATTTTGGAATCTCTTCCAACGAGTTTCCTTCAATTCCGCA
GCAATGCGGTAATGAACAAAATACATTACACCATAACTACCCTCCTAAATGAGTTACAGACTTTTCAGTCTCTTATGAAAAATAAGAGAACGAAAGAAGGAGAGGCAAAT
GTTGCCCATTCCAAGAAGTTCCACAAGGGTTCAACCTCTAGAACTAAGTCTAAGAAAGATGGAAAGGGGAAGGCTCCCGCTGCTGGCAAAGGCAAGAGCAAGGCCAAGAA
GGCTGACAAGGGCAAGTGCTTCCACTGCAATGTGGATGGTCACTGGAAGCGCAATTGCCCCAAGTACGTCACCGAGAATAAGGAAAAGGAAGAAACTTGTTTAGTGGAAA
ATGACTATAATGCCTGGATACTTGATTCAGGAGCCACTAATCATGAACATATTGTTGAGGGGGATGTTCATGTTATGGACACACATGCAACAATATTAGATGAGATTGCA
ACCTTAGCAACCTCTCGCTACATTCGATCGTTTATCTACAACCTCGACATCAACATTTTCTACCTTCAATTTGCAAGCATCATACTCCCCTTGCGCAAACTTGTTCTCAG
GATAGAATACAGCCATCACTTGGTTCCCTCCAAATTTTCTTCCATTCAAACCGGCTCGAGCTTTTGTTGCGCTGTCAATATCTGCATACTCCAAAAACACCTGGACCCAA
TCAACTGGCAGATTCAAATTAGTTTTCAAAATGGAAAAATAAGAAAAACGTGGTTTAAAGATGTAAGAATTCATGCCTTTCCAACTCCGGGTGCTGCTTCATTGGGCCTC
GGACGCGGGATAACAACGTTCACTAATGTACCTGCAAAGACACAATAA
mRNA sequenceShow/hide mRNA sequence
ATGGATATTGTTGAATATATATATGAATGTCAACTTGTATACTTTTCAACCATGTACAAGAATAATATTTCTGGAATTATGGAGTTCCATTGGGTGATTATGTTGTTTTT
GGTGTGTGACTATAATTTGGCTCGCATGTCAATGTTAAGGGAGAGGAGCTTAGACAGTTCATTGGTTGTAGTGACTTCTTGTAAAGATTGGATACCAAGAATGTTTGATA
AAGATTGGATTCATTTGAACAGGATGCATAAAAAGGAATTGCTACTCTCGAATAAAAGACTTGCCAATGATGAGTCTCCTAATAAAGGTTCACTCGGAACAGTTTCCCCT
TTGGTTGAAAGAGTTTCATTAGACGTTGGGAACAAAAACGACTCAAATTTATTGAAATGGCTTGCAAATGGACCTCGCAAGATTGCAATGTCATATTTAGGACAAATATG
GAGAATGTTGACACAAACTTTGTGGGTGAAGGATCCTTCATATCGCTTGACGATATTGATGATGAGTTTGAACGTCGTAACAATGCACAAAATGATTGTGAAGGCATCTT
GTTATGATAACGTAGAACGTATGGATTTGGATAAGCAAAGAACTGAACCAAACAAGGGTAAGACAACCTTGGTGAAAACATCTTGGAGATTTGCAAAGTCATTTAGTATG
GACTTAACCAATATGGCCAGTGAAGACGAGGAGGATGAGGACCAAGCTGCTTATTCAAAGAATGTAAAAAGGAATATAGCATATGAACTAGATAATGTCGTGAAGGCTAG
TGTAGAGAGAGAGGTTATGGAAAAATCTCCTTTTTCTGACCCACTTCACCACAACAATAGGGATCCTTCACTTGTTACTACTCCACTTCCTTCAAGTGTCAATGAAATTG
TTTGTCCCGAGCCGAAAAGACCACAAATTAACCCATTAGATTCCCCTGCTATTCGTACTAGATCAACAATGCCTATAACTTTCGTTGAAAAAATCAGCGAGCAAAATTCT
CATAATACTACTGTTAGAGTACCACCATCAACAAATGCTCCAGCTCCAATCTTAAATTCTCCACCGTGGGAGAAAAATGGTATATATGATATGTCAATTTTTATCTCTCA
CATCCCTAAAAGTTCACACCGTGAGATCCATGCTCGACTTCGTGTCGCCTTGGGCGTGGCCTCCCTTCGAAAAGTGTTTGCATGGGTCAATACCAAGGTGAATAGGAAAA
ATGTTCATAATAAGTGGGAGGAGGACACGTCTCAACGTTTTCTACGGTCTCCTTCATTAGGTTGCAACAAACATGAGGCCATGTTCACTGCTCGTGAGATCATGGAGTCG
CTGCAGGATATGTTTGGGCAACCGTCCACACAGATCCGGCACGAGGCCTTCAAGTACGTTTATAACGCGCGTATGAAGAAAGGCCAATCAGTGAGAGAACATGTTTTCGA
CCTCATGGTTCACTTCAACGTCGCTGAAATGAACGGTGCAGTCATAGATGAGCAAAGTCACGTGTCCTTTATTTTGGAATCTCTTCCAACGAGTTTCCTTCAATTCCGCA
GCAATGCGGTAATGAACAAAATACATTACACCATAACTACCCTCCTAAATGAGTTACAGACTTTTCAGTCTCTTATGAAAAATAAGAGAACGAAAGAAGGAGAGGCAAAT
GTTGCCCATTCCAAGAAGTTCCACAAGGGTTCAACCTCTAGAACTAAGTCTAAGAAAGATGGAAAGGGGAAGGCTCCCGCTGCTGGCAAAGGCAAGAGCAAGGCCAAGAA
GGCTGACAAGGGCAAGTGCTTCCACTGCAATGTGGATGGTCACTGGAAGCGCAATTGCCCCAAGTACGTCACCGAGAATAAGGAAAAGGAAGAAACTTGTTTAGTGGAAA
ATGACTATAATGCCTGGATACTTGATTCAGGAGCCACTAATCATGAACATATTGTTGAGGGGGATGTTCATGTTATGGACACACATGCAACAATATTAGATGAGATTGCA
ACCTTAGCAACCTCTCGCTACATTCGATCGTTTATCTACAACCTCGACATCAACATTTTCTACCTTCAATTTGCAAGCATCATACTCCCCTTGCGCAAACTTGTTCTCAG
GATAGAATACAGCCATCACTTGGTTCCCTCCAAATTTTCTTCCATTCAAACCGGCTCGAGCTTTTGTTGCGCTGTCAATATCTGCATACTCCAAAAACACCTGGACCCAA
TCAACTGGCAGATTCAAATTAGTTTTCAAAATGGAAAAATAAGAAAAACGTGGTTTAAAGATGTAAGAATTCATGCCTTTCCAACTCCGGGTGCTGCTTCATTGGGCCTC
GGACGCGGGATAACAACGTTCACTAATGTACCTGCAAAGACACAATAA
Protein sequenceShow/hide protein sequence
MDIVEYIYECQLVYFSTMYKNNISGIMEFHWVIMLFLVCDYNLARMSMLRERSLDSSLVVVTSCKDWIPRMFDKDWIHLNRMHKKELLLSNKRLANDESPNKGSLGTVSP
LVERVSLDVGNKNDSNLLKWLANGPRKIAMSYLGQIWRMLTQTLWVKDPSYRLTILMMSLNVVTMHKMIVKASCYDNVERMDLDKQRTEPNKGKTTLVKTSWRFAKSFSM
DLTNMASEDEEDEDQAAYSKNVKRNIAYELDNVVKASVEREVMEKSPFSDPLHHNNRDPSLVTTPLPSSVNEIVCPEPKRPQINPLDSPAIRTRSTMPITFVEKISEQNS
HNTTVRVPPSTNAPAPILNSPPWEKNGIYDMSIFISHIPKSSHREIHARLRVALGVASLRKVFAWVNTKVNRKNVHNKWEEDTSQRFLRSPSLGCNKHEAMFTAREIMES
LQDMFGQPSTQIRHEAFKYVYNARMKKGQSVREHVFDLMVHFNVAEMNGAVIDEQSHVSFILESLPTSFLQFRSNAVMNKIHYTITTLLNELQTFQSLMKNKRTKEGEAN
VAHSKKFHKGSTSRTKSKKDGKGKAPAAGKGKSKAKKADKGKCFHCNVDGHWKRNCPKYVTENKEKEETCLVENDYNAWILDSGATNHEHIVEGDVHVMDTHATILDEIA
TLATSRYIRSFIYNLDINIFYLQFASIILPLRKLVLRIEYSHHLVPSKFSSIQTGSSFCCAVNICILQKHLDPINWQIQISFQNGKIRKTWFKDVRIHAFPTPGAASLGL
GRGITTFTNVPAKTQ