CuGenDBv2

Moc04g17360 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc04g17360
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Integrase catalytic domain-containing protein
Genome location: chr4:12794751..12799859
RNA-Seq Expression: Moc04g17360
Synteny: Moc04g17360
Gene Ontology terms: NA
InterPro domains: NA
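The genome location field uses a chromosome:start..end form. The sketch below shows one way to split such a string and compute the span it covers. It assumes the coordinates are 1-based and inclusive, which the page does not state, and `parse_location` is a hypothetical helper name rather than anything provided by CuGenDB.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("chr4:12794751..12799859")
# Assumption: both endpoints are included in the interval.
span = end - start + 1
print(chrom, start, end, span)
```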


Homology
GenBank top hits (e value, % identity, alignment)
KAE8645659.1 hypothetical protein Csa_020439 [Cucumis sativus]; e value: 1.9e-46; % identity: 41.56
Query:  PQLNPAFEDWIVKDHALMTVINATLSLAALAYVVGCETSKEVWDTFGE------------------------------------------ALLIQLLNEY
        PQ NP +EDWI KD ALMTVINATLS  ALAYVVG  +SK+VWD   +                                          A +   +NE 
Subjt:  PQLNPAFEDWIVKDHALMTVINATLSLAALAYVVGCETSKEVWDTFGE------------------------------------------ALLIQLLNEY

Query:  -LVIYALNGLTADYNTFRTSMRTREKSVSFEDLHVLLVSEEAATEKQNKRDEVFASPTALLAA--SLLPLAPT-----------GKSSQEG---------
         L+IYALNGL  +YNTFRTSMRTR + V+FE+LHVLL +EE+A  KQ+K D+ +  PT LL++  SLL  APT           GK+   G         
Subjt:  -LVIYALNGLTADYNTFRTSMRTREKSVSFEDLHVLLVSEEAATEKQNKRDEVFASPTALLAA--SLLPLAPT-----------GKSSQEG---------

Query:  -----------FSSRIVCQICLKTGHSTLDCFNIMNYSFQGHHPLVQLAVMVANHNYATLASSNPSWLTDSGCNAHI-------------------GVGS
                     +   CQIC + GH+ LDCFN MNY+FQG HP  QLA MVA+ N A L+  N S LTDSGCN HI                   GVG+
Subjt:  -----------FSSRIVCQICLKTGHSTLDCFNIMNYSFQGHHPLVQLAVMVANHNYATLASSNPSWLTDSGCNAHI-------------------GVGS

Query:  GQSLPIAHIGSGILHTSTSS
        GQ+ PI+H G   L  ++ S
Subjt:  GQSLPIAHIGSGILHTSTSS

XP_008448007.1 PREDICTED: uncharacterized protein LOC103490319 isoform X2 [Cucumis melo]; e value: 6.2e-45; % identity: 39.88
Query:  PQLNPAFEDWIVKDHALMTVINATLSLAALAYVVGCETSKEVWDTFGE------------------------------------------ALLIQLLNEY
        PQ NP++EDWI KD ALMTVINATLS  ALAYVVG  +SK+VWD   +                                          A +   +NE 
Subjt:  PQLNPAFEDWIVKDHALMTVINATLSLAALAYVVGCETSKEVWDTFGE------------------------------------------ALLIQLLNEY

Query:  -LVIYALNGLTADYNTFRTSMRTREKSVSFEDLHVLLVSEEAATEKQNKRDEVFASPTALLAA--SLLPLAPT-----------GKSSQEG---------
         L+IYALNGL  +YNTFRTSMRTR + V+FE+LHVLL +EE+A  KQ+K D+ +  PT LL++  SLL  APT           GK    G         
Subjt:  -LVIYALNGLTADYNTFRTSMRTREKSVSFEDLHVLLVSEEAATEKQNKRDEVFASPTALLAA--SLLPLAPT-----------GKSSQEG---------

Query:  -----------FSSRIVCQICLKTGHSTLDCFNIMNYSFQGHHPLVQLAVMVANHNYATLASSNPSWLTDSGCNAHI-------------------GVGS
                     +   CQIC + GH+ LDCFN MNY+FQG HP  QLA MVA+ N A L+  N S LTDSGCN  I                   G+G+
Subjt:  -----------FSSRIVCQICLKTGHSTLDCFNIMNYSFQGHHPLVQLAVMVANHNYATLASSNPSWLTDSGCNAHI-------------------GVGS

Query:  GQSLPIAHIGSGILHTSTSSLKLCNL
        GQ+ P++H G      ++ S  +  L
Subjt:  GQSLPIAHIGSGILHTSTSSLKLCNL

XP_008448008.1 PREDICTED: uncharacterized protein LOC103490319 isoform X3 [Cucumis melo]; e value: 2.8e-45; % identity: 41.29
Query:  PQLNPAFEDWIVKDHALMTVINATLSLAALAYVVGCETSKEVWDTFGE------------------------------------------ALLIQLLNEY
        PQ NP++EDWI KD ALMTVINATLS  ALAYVVG  +SK+VWD   +                                          A +   +NE 
Subjt:  PQLNPAFEDWIVKDHALMTVINATLSLAALAYVVGCETSKEVWDTFGE------------------------------------------ALLIQLLNEY

Query:  -LVIYALNGLTADYNTFRTSMRTREKSVSFEDLHVLLVSEEAATEKQNKRDEVFASPTALLAA--SLLPLAPT-----------GKSSQEG---------
         L+IYALNGL  +YNTFRTSMRTR + V+FE+LHVLL +EE+A  KQ+K D+ +  PT LL++  SLL  APT           GK    G         
Subjt:  -LVIYALNGLTADYNTFRTSMRTREKSVSFEDLHVLLVSEEAATEKQNKRDEVFASPTALLAA--SLLPLAPT-----------GKSSQEG---------

Query:  -----------FSSRIVCQICLKTGHSTLDCFNIMNYSFQGHHPLVQLAVMVANHNYATLASSNPSWLTDSGCNAHI-------------------GVGS
                     +   CQIC + GH+ LDCFN MNY+FQG HP  QLA MVA+ N A L+  N S LTDSGCN  I                   G+G+
Subjt:  -----------FSSRIVCQICLKTGHSTLDCFNIMNYSFQGHHPLVQLAVMVANHNYATLASSNPSWLTDSGCNAHI-------------------GVGS

Query:  GQSLPIAHIG
        GQ+ P++H G
Subjt:  GQSLPIAHIG

XP_011658579.1 uncharacterized protein LOC105436058 [Cucumis sativus]; e value: 1.1e-46; % identity: 42.26
Query:  PQLNPAFEDWIVKDHALMTVINATLSLAALAYVVGCETSKEVWDTFGE------------------------------------------ALLIQLLNEY
        PQ NP +EDWI KD ALMTVINATLS  ALAYVVG  +SK+VWD   +                                          A +   +NE 
Subjt:  PQLNPAFEDWIVKDHALMTVINATLSLAALAYVVGCETSKEVWDTFGE------------------------------------------ALLIQLLNEY

Query:  -LVIYALNGLTADYNTFRTSMRTREKSVSFEDLHVLLVSEEAATEKQNKRDEVFASPTALLAA--SLLPLAPT-----------GKSSQEG---------
         L+IYALNGL  +YNTFRTSMRTR + V+FE+LHVLL +EE+A  KQ+K D+ +  PT LL++  SLL  APT           GK+   G         
Subjt:  -LVIYALNGLTADYNTFRTSMRTREKSVSFEDLHVLLVSEEAATEKQNKRDEVFASPTALLAA--SLLPLAPT-----------GKSSQEG---------

Query:  -----------FSSRIVCQICLKTGHSTLDCFNIMNYSFQGHHPLVQLAVMVANHNYATLASSNPSWLTDSGCNAHI-------------------GVGS
                     +   CQIC + GH+ LDCFN MNY+FQG HP  QLA MVA+ N A L+  N S LTDSGCN HI                   GVG+
Subjt:  -----------FSSRIVCQICLKTGHSTLDCFNIMNYSFQGHHPLVQLAVMVANHNYATLASSNPSWLTDSGCNAHI-------------------GVGS

Query:  GQSLPIAHIG
        GQ+ PI+H G
Subjt:  GQSLPIAHIG

XP_022156563.1 uncharacterized protein LOC111023438 [Momordica charantia]; e value: 5.7e-75; % identity: 60.63
Query:  VPQLNPAFEDWIVKDHALMTVINATLSL-AALAYVVG--------CETSKEVWDTFGEALLIQLLNEYLVIYALNGLTADYNTFRTSMRTREKSVSFEDL
        VPQLNP FEDWI KDHALMTVINAT ++ + L +V           +  KE  D   +   + + +E L+IYALNGLTA YNTFRTSM TREKS +F   
Subjt:  VPQLNPAFEDWIVKDHALMTVINATLSL-AALAYVVG--------CETSKEVWDTFGEALLIQLLNEYLVIYALNGLTADYNTFRTSMRTREKSVSFEDL

Query:  HVLLVSEEAATEKQ--NKRDEVFASPTALLAAS------LLPLAPTGKSSQEGFSSRIVCQICLKTGHSTLDCFNIMNYSFQGHHPLVQLAVMVANHNYA
            V    A+ K   +K      +PT   + +      LLPLAPTGKSSQEGFSS I CQICLK GHS LDCFN MNYSFQG HP +QL  MVANHNYA
Subjt:  HVLLVSEEAATEKQ--NKRDEVFASPTALLAAS------LLPLAPTGKSSQEGFSSRIVCQICLKTGHSTLDCFNIMNYSFQGHHPLVQLAVMVANHNYA

Query:  TLASSNPSWLTDSGCNAH-------------------IGVGSGQSLPIAHIGSGILHTSTSSLKLCNLLHVPTISSNLLSVHQLCVE
        TLASSNPSWLTDSGCNAH                   IGVG+GQSLPIAH  SGILHTSTSSLKLCNLLHVPTISSNLLSVHQLCV+
Subjt:  TLASSNPSWLTDSGCNAH-------------------IGVGSGQSLPIAHIGSGILHTSTSSLKLCNLLHVPTISSNLLSVHQLCVE

TrEMBL top hits (e value, % identity, alignment)
A0A1S3BI58 uncharacterized protein LOC103490319 isoform X2; e value: 3.0e-45; % identity: 39.88
Query:  PQLNPAFEDWIVKDHALMTVINATLSLAALAYVVGCETSKEVWDTFGE------------------------------------------ALLIQLLNEY
        PQ NP++EDWI KD ALMTVINATLS  ALAYVVG  +SK+VWD   +                                          A +   +NE 
Subjt:  PQLNPAFEDWIVKDHALMTVINATLSLAALAYVVGCETSKEVWDTFGE------------------------------------------ALLIQLLNEY

Query:  -LVIYALNGLTADYNTFRTSMRTREKSVSFEDLHVLLVSEEAATEKQNKRDEVFASPTALLAA--SLLPLAPT-----------GKSSQEG---------
         L+IYALNGL  +YNTFRTSMRTR + V+FE+LHVLL +EE+A  KQ+K D+ +  PT LL++  SLL  APT           GK    G         
Subjt:  -LVIYALNGLTADYNTFRTSMRTREKSVSFEDLHVLLVSEEAATEKQNKRDEVFASPTALLAA--SLLPLAPT-----------GKSSQEG---------

Query:  -----------FSSRIVCQICLKTGHSTLDCFNIMNYSFQGHHPLVQLAVMVANHNYATLASSNPSWLTDSGCNAHI-------------------GVGS
                     +   CQIC + GH+ LDCFN MNY+FQG HP  QLA MVA+ N A L+  N S LTDSGCN  I                   G+G+
Subjt:  -----------FSSRIVCQICLKTGHSTLDCFNIMNYSFQGHHPLVQLAVMVANHNYATLASSNPSWLTDSGCNAHI-------------------GVGS

Query:  GQSLPIAHIGSGILHTSTSSLKLCNL
        GQ+ P++H G      ++ S  +  L
Subjt:  GQSLPIAHIGSGILHTSTSSLKLCNL

A0A1S3BIR3 uncharacterized protein LOC103490319 isoform X3; e value: 1.3e-45; % identity: 41.29
Query:  PQLNPAFEDWIVKDHALMTVINATLSLAALAYVVGCETSKEVWDTFGE------------------------------------------ALLIQLLNEY
        PQ NP++EDWI KD ALMTVINATLS  ALAYVVG  +SK+VWD   +                                          A +   +NE 
Subjt:  PQLNPAFEDWIVKDHALMTVINATLSLAALAYVVGCETSKEVWDTFGE------------------------------------------ALLIQLLNEY

Query:  -LVIYALNGLTADYNTFRTSMRTREKSVSFEDLHVLLVSEEAATEKQNKRDEVFASPTALLAA--SLLPLAPT-----------GKSSQEG---------
         L+IYALNGL  +YNTFRTSMRTR + V+FE+LHVLL +EE+A  KQ+K D+ +  PT LL++  SLL  APT           GK    G         
Subjt:  -LVIYALNGLTADYNTFRTSMRTREKSVSFEDLHVLLVSEEAATEKQNKRDEVFASPTALLAA--SLLPLAPT-----------GKSSQEG---------

Query:  -----------FSSRIVCQICLKTGHSTLDCFNIMNYSFQGHHPLVQLAVMVANHNYATLASSNPSWLTDSGCNAHI-------------------GVGS
                     +   CQIC + GH+ LDCFN MNY+FQG HP  QLA MVA+ N A L+  N S LTDSGCN  I                   G+G+
Subjt:  -----------FSSRIVCQICLKTGHSTLDCFNIMNYSFQGHHPLVQLAVMVANHNYATLASSNPSWLTDSGCNAHI-------------------GVGS

Query:  GQSLPIAHIG
        GQ+ P++H G
Subjt:  GQSLPIAHIG

A0A1S4DWT9 uncharacterized protein LOC103490319 isoform X1; e value: 3.0e-45; % identity: 39.88
Query:  PQLNPAFEDWIVKDHALMTVINATLSLAALAYVVGCETSKEVWDTFGE------------------------------------------ALLIQLLNEY
        PQ NP++EDWI KD ALMTVINATLS  ALAYVVG  +SK+VWD   +                                          A +   +NE 
Subjt:  PQLNPAFEDWIVKDHALMTVINATLSLAALAYVVGCETSKEVWDTFGE------------------------------------------ALLIQLLNEY

Query:  -LVIYALNGLTADYNTFRTSMRTREKSVSFEDLHVLLVSEEAATEKQNKRDEVFASPTALLAA--SLLPLAPT-----------GKSSQEG---------
         L+IYALNGL  +YNTFRTSMRTR + V+FE+LHVLL +EE+A  KQ+K D+ +  PT LL++  SLL  APT           GK    G         
Subjt:  -LVIYALNGLTADYNTFRTSMRTREKSVSFEDLHVLLVSEEAATEKQNKRDEVFASPTALLAA--SLLPLAPT-----------GKSSQEG---------

Query:  -----------FSSRIVCQICLKTGHSTLDCFNIMNYSFQGHHPLVQLAVMVANHNYATLASSNPSWLTDSGCNAHI-------------------GVGS
                     +   CQIC + GH+ LDCFN MNY+FQG HP  QLA MVA+ N A L+  N S LTDSGCN  I                   G+G+
Subjt:  -----------FSSRIVCQICLKTGHSTLDCFNIMNYSFQGHHPLVQLAVMVANHNYATLASSNPSWLTDSGCNAHI-------------------GVGS

Query:  GQSLPIAHIGSGILHTSTSSLKLCNL
        GQ+ P++H G      ++ S  +  L
Subjt:  GQSLPIAHIGSGILHTSTSSLKLCNL

A0A5D3CLI6 T4.5; e value: 3.9e-45; % identity: 41.23
Query:  PQLNPAFEDWIVKDHALMTVINATLSLAALAYVVGCETSKEVWDTFGE------------------------------------------ALLIQLLNEY
        PQ NP++EDWI KD ALMTVINATLS  ALAYVVG  +SK+VWD   +                                          A +   +NE 
Subjt:  PQLNPAFEDWIVKDHALMTVINATLSLAALAYVVGCETSKEVWDTFGE------------------------------------------ALLIQLLNEY

Query:  -LVIYALNGLTADYNTFRTSMRTREKSVSFEDLHVLLVSEEAATEKQNKRDEVFASPTALLAA--SLLPLAPT-----------GKSSQEG---------
         L+IYALNGL  +YNTFRTSMRTR + V+FE+LHVLL +EE+A  KQ+K D+ +  PT LL++  SLL  APT           GK    G         
Subjt:  -LVIYALNGLTADYNTFRTSMRTREKSVSFEDLHVLLVSEEAATEKQNKRDEVFASPTALLAA--SLLPLAPT-----------GKSSQEG---------

Query:  -----------FSSRIVCQICLKTGHSTLDCFNIMNYSFQGHHPLVQLAVMVANHNYATLASSNPSWLTDSGCNAHI-------------------GVGS
                     +   CQIC + GH+ LDCFN MNY+FQG HP  QLA MVA+ N A L+  N S LTDSGCN  I                   G+G+
Subjt:  -----------FSSRIVCQICLKTGHSTLDCFNIMNYSFQGHHPLVQLAVMVANHNYATLASSNPSWLTDSGCNAHI-------------------GVGS

Query:  GQSLPIAH
        GQ+ P++H
Subjt:  GQSLPIAH

A0A6J1DQZ0 uncharacterized protein LOC111023438; e value: 2.8e-75; % identity: 60.63
Query:  VPQLNPAFEDWIVKDHALMTVINATLSL-AALAYVVG--------CETSKEVWDTFGEALLIQLLNEYLVIYALNGLTADYNTFRTSMRTREKSVSFEDL
        VPQLNP FEDWI KDHALMTVINAT ++ + L +V           +  KE  D   +   + + +E L+IYALNGLTA YNTFRTSM TREKS +F   
Subjt:  VPQLNPAFEDWIVKDHALMTVINATLSL-AALAYVVG--------CETSKEVWDTFGEALLIQLLNEYLVIYALNGLTADYNTFRTSMRTREKSVSFEDL

Query:  HVLLVSEEAATEKQ--NKRDEVFASPTALLAAS------LLPLAPTGKSSQEGFSSRIVCQICLKTGHSTLDCFNIMNYSFQGHHPLVQLAVMVANHNYA
            V    A+ K   +K      +PT   + +      LLPLAPTGKSSQEGFSS I CQICLK GHS LDCFN MNYSFQG HP +QL  MVANHNYA
Subjt:  HVLLVSEEAATEKQ--NKRDEVFASPTALLAAS------LLPLAPTGKSSQEGFSSRIVCQICLKTGHSTLDCFNIMNYSFQGHHPLVQLAVMVANHNYA

Query:  TLASSNPSWLTDSGCNAH-------------------IGVGSGQSLPIAHIGSGILHTSTSSLKLCNLLHVPTISSNLLSVHQLCVE
        TLASSNPSWLTDSGCNAH                   IGVG+GQSLPIAH  SGILHTSTSSLKLCNLLHVPTISSNLLSVHQLCV+
Subjt:  TLASSNPSWLTDSGCNAH-------------------IGVGSGQSLPIAHIGSGILHTSTSSLKLCNLLHVPTISSNLLSVHQLCVE

SwissProt top hits (e value, % identity, alignment)
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1; e value: 4.1e-07; % identity: 32.56
Query:  CQICLKTGHSTLDCFNIMNY--SFQGHHPLVQLAVMVANHNYATLAS-SNPSWLTDSGCNAHIG-------------------VGSGQSLPIAHIGSGIL
        CQIC   GHS   C  + ++  S     P           N A  +  S+ +WL DSG   HI                    V  G ++PI+H GS  L
Subjt:  CQICLKTGHSTLDCFNIMNY--SFQGHHPLVQLAVMVANHNYATLAS-SNPSWLTDSGCNAHIG-------------------VGSGQSLPIAHIGSGIL

Query:  HTSTSSLKLCNLLHVPTISSNLLSVHQLC
         T +  L L N+L+VP I  NL+SV++LC
Subjt:  HTSTSSLKLCNLLHVPTISSNLLSVHQLC

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2; e value: 2.0e-06; % identity: 33.59
Query:  CQICLKTGHSTLDCFNIMNYSFQGHHPLVQ-LAVMVANHNYATLASSNP----SWLTDSGCNAHIG-------------------VGSGQSLPIAHIGSG
        CQIC   GHS   C  +  + FQ      Q  +        A LA ++P    +WL DSG   HI                    +  G ++PI H GS 
Subjt:  CQICLKTGHSTLDCFNIMNYSFQGHHPLVQ-LAVMVANHNYATLASSNP----SWLTDSGCNAHIG-------------------VGSGQSLPIAHIGSG

Query:  ILHTSTSSLKLCNLLHVPTISSNLLSVHQLC
         L TS+ SL L  +L+VP I  NL+SV++LC
Subjt:  ILHTSTSSLKLCNLLHVPTISSNLLSVHQLC

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGCCTAAGGAGTGGAAGTATGCTCCATCCCATCCTAAGGATTTAATTCTTGGTGATCCCGAACAAGGGGAGTGGGAGGTTGCTGTGACGTGCTACCACAAGCACGATCC
TGAGACCCAAGAGGATAGCGAGGAAGATCCGGTGCTGGTGTTCGAGGGGAACTTACCGAAGAAACATGTTCCTCAATTGAATCCGGCGTTTGAAGACTGGATTGTGAAAG
ACCACGCTCTTATGACTGTGATTAACGCCACACTTTCACTGGCTGCTCTAGCCTACGTTGTTGGTTGTGAGACTTCTAAGGAAGTTTGGGATACGTTTGGTGAAGCATTA
CTCATCCAACTCTTGAACGAATACCTTGTCATCTATGCTCTAAACGGCCTCACTGCTGACTATAACACTTTTCGAACATCGATGCGTACTCGTGAGAAGTCTGTGAGTTT
TGAAGATCTTCATGTGTTACTCGTCTCAGAAGAAGCAGCTACTGAAAAGCAGAATAAACGTGATGAAGTTTTCGCTTCTCCTACTGCTCTATTGGCTGCAAGTCTTCTCC
CTCTTGCTCCTACTGGCAAATCATCTCAAGAAGGTTTCTCTTCAAGGATTGTTTGTCAGATTTGCCTGAAAACTGGTCACTCTACATTGGATTGCTTTAATATAATGAAT
TACAGCTTCCAAGGGCATCATCCTCTTGTTCAGTTGGCAGTAATGGTGGCTAATCACAACTATGCCACTCTTGCATCCTCTAATCCCTCGTGGCTTACTGATTCAGGGTG
CAATGCTCATATTGGAGTAGGTAGTGGTCAGTCATTGCCAATTGCCCACATAGGCAGTGGTATTCTTCATACCTCTACCTCTTCTCTAAAACTTTGCAACCTTCTTCATG
TTCCAACTATTTCATCTAATCTTCTTTCCGTTCACCAATTATGTGTTGAAATAATTGCTTTGTTGTTTTCGATTCTACCTCCTCCTTTTTACACTGTTCCTCATTCTTAC
CCGGATGTTCCTGAGTCTGGACTTGAATCTCAGCTTGCACCACATGCTCCTGATATTAGTTCTAATACTACTGCACCTCCTACTAGTTCTCTACATGTTATGTCTGAGCC
TATTGTTGCCCCTATTGCTTCTTAG
mRNA sequence
ATGCCTAAGGAGTGGAAGTATGCTCCATCCCATCCTAAGGATTTAATTCTTGGTGATCCCGAACAAGGGGAGTGGGAGGTTGCTGTGACGTGCTACCACAAGCACGATCC
TGAGACCCAAGAGGATAGCGAGGAAGATCCGGTGCTGGTGTTCGAGGGGAACTTACCGAAGAAACATGTTCCTCAATTGAATCCGGCGTTTGAAGACTGGATTGTGAAAG
ACCACGCTCTTATGACTGTGATTAACGCCACACTTTCACTGGCTGCTCTAGCCTACGTTGTTGGTTGTGAGACTTCTAAGGAAGTTTGGGATACGTTTGGTGAAGCATTA
CTCATCCAACTCTTGAACGAATACCTTGTCATCTATGCTCTAAACGGCCTCACTGCTGACTATAACACTTTTCGAACATCGATGCGTACTCGTGAGAAGTCTGTGAGTTT
TGAAGATCTTCATGTGTTACTCGTCTCAGAAGAAGCAGCTACTGAAAAGCAGAATAAACGTGATGAAGTTTTCGCTTCTCCTACTGCTCTATTGGCTGCAAGTCTTCTCC
CTCTTGCTCCTACTGGCAAATCATCTCAAGAAGGTTTCTCTTCAAGGATTGTTTGTCAGATTTGCCTGAAAACTGGTCACTCTACATTGGATTGCTTTAATATAATGAAT
TACAGCTTCCAAGGGCATCATCCTCTTGTTCAGTTGGCAGTAATGGTGGCTAATCACAACTATGCCACTCTTGCATCCTCTAATCCCTCGTGGCTTACTGATTCAGGGTG
CAATGCTCATATTGGAGTAGGTAGTGGTCAGTCATTGCCAATTGCCCACATAGGCAGTGGTATTCTTCATACCTCTACCTCTTCTCTAAAACTTTGCAACCTTCTTCATG
TTCCAACTATTTCATCTAATCTTCTTTCCGTTCACCAATTATGTGTTGAAATAATTGCTTTGTTGTTTTCGATTCTACCTCCTCCTTTTTACACTGTTCCTCATTCTTAC
CCGGATGTTCCTGAGTCTGGACTTGAATCTCAGCTTGCACCACATGCTCCTGATATTAGTTCTAATACTACTGCACCTCCTACTAGTTCTCTACATGTTATGTCTGAGCC
TATTGTTGCCCCTATTGCTTCTTAG
Protein sequence
MPKEWKYAPSHPKDLILGDPEQGEWEVAVTCYHKHDPETQEDSEEDPVLVFEGNLPKKHVPQLNPAFEDWIVKDHALMTVINATLSLAALAYVVGCETSKEVWDTFGEAL
LIQLLNEYLVIYALNGLTADYNTFRTSMRTREKSVSFEDLHVLLVSEEAATEKQNKRDEVFASPTALLAASLLPLAPTGKSSQEGFSSRIVCQICLKTGHSTLDCFNIMN
YSFQGHHPLVQLAVMVANHNYATLASSNPSWLTDSGCNAHIGVGSGQSLPIAHIGSGILHTSTSSLKLCNLLHVPTISSNLLSVHQLCVEIIALLFSILPPPFYTVPHSY
PDVPESGLESQLAPHAPDISSNTTAPPTSSLHVMSEPIVAPIAS
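
As a quick consistency check between the CDS and protein sequences listed above, the following sketch translates a nucleotide string using the standard genetic code. The function name `translate` and the choice to translate only the first 39 bases of the CDS are illustrative and not part of the database page. Codons containing anything other than A, C, G or T are rendered as X.

```python
from itertools import product

# Standard genetic code, enumerated with bases in TCAG order
# (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence up to, but not including, the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 39 bases of the CDS sequence shown above.
print(translate("ATGCCTAAGGAGTGGAAGTATGCTCCATCCCATCCTAAG"))  # MPKEWKYAPSHPK
```

The printed peptide matches the first 13 residues of the protein sequence. Passing the full CDS, with line breaks removed, to `translate` applies the same check to the whole record.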