CuGenDBv2

Moc08g18210 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc08g18210
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: ULP_PROTEASE domain-containing protein
Genome location: chr8:13780289..13781552
RNA-Seq Expression: Moc08g18210
Synteny: Moc08g18210
Gene Ontology terms: GO:0016787 - hydrolase activity (molecular function)
InterPro domains: NA
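
The genome location above is given as a `chromosome:start..end` string. As a minimal sketch, the Python below splits such a string into its parts and computes the length of the span. It assumes the coordinates are 1-based and inclusive, which the page does not state; the function names `parse_location` and `span_length` are hypothetical and chosen only for illustration.

```python
import re

# Pattern for strings such as "chr8:13780289..13781552".
LOCATION_PATTERN = re.compile(r"^(?P<chrom>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = LOCATION_PATTERN.match(location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match["chrom"], int(match["start"]), int(match["end"])


def span_length(location: str) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    _, start, end = parse_location(location)
    return end - start + 1


if __name__ == "__main__":
    print(parse_location("chr8:13780289..13781552"))
    print(span_length("chr8:13780289..13781552"))  # 1264 under the inclusive assumption
```

Under that assumption the gene spans 1264 bp on chr8. If the coordinates were 0-based or half-open, the result would differ by one.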


Homology
GenBank top hits (e value, % identity, alignment)
KAE8649224.1 hypothetical protein Csa_014966 [Cucumis sativus] (e value: 6.5e-62; % identity: 47.19)
Query:  LGEFVSPSLYFNVAKRKSNLGLQPQD----------------QAELKTQSSQPKKST-----------TKVSNVSKKKE--KGKEVVNAP-----TKVFI
        +GEFVSP+L+ NVA+    L  Q QD                QAE +TQ SQ +  T           TK   V K K+  KGK VV  P      +V  
Subjt:  LGEFVSPSLYFNVAKRKSNLGLQPQD----------------QAELKTQSSQPKKST-----------TKVSNVSKKKE--KGKEVVNAP-----TKVFI

Query:  PNEQPTMGKLSHMAIGSVENIVVIGTVFDNDVQSPTIHGVPLGVDNVRVMVDIVMGEDTTIPILMKGEIKTLNQTIGSFVAWPRRLVILNEEKNVASPRT
          E    G   H+AIGS++N+V IG +F++DVQ PTIHG+PLG DN+RV VD++M ED  +PI +KGEI+TLNQ IG+FVAWPR+LVIL +EK   S   
Subjt:  PNEQPTMGKLSHMAIGSVENIVVIGTVFDNDVQSPTIHGVPLGVDNVRVMVDIVMGEDTTIPILMKGEIKTLNQTIGSFVAWPRRLVILNEEKNVASPRT

Query:  SKPRTQSSEHIDVHMTIKLINQYDVRSMNNKDVIEFNIGDYIFGMEKIIGLIRDDIMQYCEMIQIGYMCMLTCIAYLWNECEEETHKKFLVVDPDNIVPQ
        ++  TQSS++ DVH+TIKL+N+Y + +M  KD+I+ N+ ++IFG EK I L  DDI+QYC M +IGY C+LT IA LWN C+ E  K+F++VD   I   
Subjt:  SKPRTQSSEHIDVHMTIKLINQYDVRSMNNKDVIEFNIGDYIFGMEKIIGLIRDDIMQYCEMIQIGYMCMLTCIAYLWNECEEETHKKFLVVDPDNIVPQ

Query:  IMS
        I S
Subjt:  IMS

XP_022136076.1 uncharacterized protein LOC111007859 isoform X1 [Momordica charantia] (e value: 7.4e-82; % identity: 60.97)
Query:  LGEFVSPSLYFNVAKRKSNLGLQPQDQAELKTQSSQPKKSTTKVSNVSKKKEKGKEVVNAPTKVFIPNEQPTMGKLSHMAIGSVENIVVIGTVFDNDVQS
        +GEFVSPSLYFNV K KS            KTQ  QP KSTT+ SN SKKK KGKE+VN   ++++ +EQ   GK  H+A+ SV+NIV +GT+FDN+VQ 
Subjt:  LGEFVSPSLYFNVAKRKSNLGLQPQDQAELKTQSSQPKKSTTKVSNVSKKKEKGKEVVNAPTKVFIPNEQPTMGKLSHMAIGSVENIVVIGTVFDNDVQS

Query:  PTIHGVPLGVDNVRVMVDIVMGEDTTIPILMKGEIKTLNQTIGSFVAWPRRLVILNEEKNVASPRTSKPRTQSSEHIDVHMTIKLINQYDVRSMNNKDVI
        PT+HGVPLGVDNVRVMVDIV+ E  TIPI ++GEI+TLNQTIG FVAWPRRLVIL+EEKN++S RTS+ RTQ S+H DVH++IKL+N+Y + SM ++D +
Subjt:  PTIHGVPLGVDNVRVMVDIVMGEDTTIPILMKGEIKTLNQTIGSFVAWPRRLVILNEEKNVASPRTSKPRTQSSEHIDVHMTIKLINQYDVRSMNNKDVI

Query:  EFNIGDYIFGMEKIIGLIRDDIMQYCEMIQIGYMCMLTCIAYLWNECEEETHKKFLVVDPDNIVPQIMS
        E N+   IFG EK I L R+DIMQYC MI+IGY C+LT IAYLW+  E E  KKFL+VDP  I P + S
Subjt:  EFNIGDYIFGMEKIIGLIRDDIMQYCEMIQIGYMCMLTCIAYLWNECEEETHKKFLVVDPDNIVPQIMS

XP_022136077.1 uncharacterized protein LOC111007859 isoform X2 [Momordica charantia] (e value: 7.4e-82; % identity: 60.97)
Query:  LGEFVSPSLYFNVAKRKSNLGLQPQDQAELKTQSSQPKKSTTKVSNVSKKKEKGKEVVNAPTKVFIPNEQPTMGKLSHMAIGSVENIVVIGTVFDNDVQS
        +GEFVSPSLYFNV K KS            KTQ  QP KSTT+ SN SKKK KGKE+VN   ++++ +EQ   GK  H+A+ SV+NIV +GT+FDN+VQ 
Subjt:  LGEFVSPSLYFNVAKRKSNLGLQPQDQAELKTQSSQPKKSTTKVSNVSKKKEKGKEVVNAPTKVFIPNEQPTMGKLSHMAIGSVENIVVIGTVFDNDVQS

Query:  PTIHGVPLGVDNVRVMVDIVMGEDTTIPILMKGEIKTLNQTIGSFVAWPRRLVILNEEKNVASPRTSKPRTQSSEHIDVHMTIKLINQYDVRSMNNKDVI
        PT+HGVPLGVDNVRVMVDIV+ E  TIPI ++GEI+TLNQTIG FVAWPRRLVIL+EEKN++S RTS+ RTQ S+H DVH++IKL+N+Y + SM ++D +
Subjt:  PTIHGVPLGVDNVRVMVDIVMGEDTTIPILMKGEIKTLNQTIGSFVAWPRRLVILNEEKNVASPRTSKPRTQSSEHIDVHMTIKLINQYDVRSMNNKDVI

Query:  EFNIGDYIFGMEKIIGLIRDDIMQYCEMIQIGYMCMLTCIAYLWNECEEETHKKFLVVDPDNIVPQIMS
        E N+   IFG EK I L R+DIMQYC MI+IGY C+LT IAYLW+  E E  KKFL+VDP  I P + S
Subjt:  EFNIGDYIFGMEKIIGLIRDDIMQYCEMIQIGYMCMLTCIAYLWNECEEETHKKFLVVDPDNIVPQIMS

XP_022136079.1 uncharacterized protein LOC111007859 isoform X3 [Momordica charantia] (e value: 7.4e-82; % identity: 60.97)
Query:  LGEFVSPSLYFNVAKRKSNLGLQPQDQAELKTQSSQPKKSTTKVSNVSKKKEKGKEVVNAPTKVFIPNEQPTMGKLSHMAIGSVENIVVIGTVFDNDVQS
        +GEFVSPSLYFNV K KS            KTQ  QP KSTT+ SN SKKK KGKE+VN   ++++ +EQ   GK  H+A+ SV+NIV +GT+FDN+VQ 
Subjt:  LGEFVSPSLYFNVAKRKSNLGLQPQDQAELKTQSSQPKKSTTKVSNVSKKKEKGKEVVNAPTKVFIPNEQPTMGKLSHMAIGSVENIVVIGTVFDNDVQS

Query:  PTIHGVPLGVDNVRVMVDIVMGEDTTIPILMKGEIKTLNQTIGSFVAWPRRLVILNEEKNVASPRTSKPRTQSSEHIDVHMTIKLINQYDVRSMNNKDVI
        PT+HGVPLGVDNVRVMVDIV+ E  TIPI ++GEI+TLNQTIG FVAWPRRLVIL+EEKN++S RTS+ RTQ S+H DVH++IKL+N+Y + SM ++D +
Subjt:  PTIHGVPLGVDNVRVMVDIVMGEDTTIPILMKGEIKTLNQTIGSFVAWPRRLVILNEEKNVASPRTSKPRTQSSEHIDVHMTIKLINQYDVRSMNNKDVI

Query:  EFNIGDYIFGMEKIIGLIRDDIMQYCEMIQIGYMCMLTCIAYLWNECEEETHKKFLVVDPDNIVPQIMS
        E N+   IFG EK I L R+DIMQYC MI+IGY C+LT IAYLW+  E E  KKFL+VDP  I P + S
Subjt:  EFNIGDYIFGMEKIIGLIRDDIMQYCEMIQIGYMCMLTCIAYLWNECEEETHKKFLVVDPDNIVPQIMS

XP_022136080.1 uncharacterized protein LOC111007859 isoform X4 [Momordica charantia] (e value: 7.4e-82; % identity: 60.97)
Query:  LGEFVSPSLYFNVAKRKSNLGLQPQDQAELKTQSSQPKKSTTKVSNVSKKKEKGKEVVNAPTKVFIPNEQPTMGKLSHMAIGSVENIVVIGTVFDNDVQS
        +GEFVSPSLYFNV K KS            KTQ  QP KSTT+ SN SKKK KGKE+VN   ++++ +EQ   GK  H+A+ SV+NIV +GT+FDN+VQ 
Subjt:  LGEFVSPSLYFNVAKRKSNLGLQPQDQAELKTQSSQPKKSTTKVSNVSKKKEKGKEVVNAPTKVFIPNEQPTMGKLSHMAIGSVENIVVIGTVFDNDVQS

Query:  PTIHGVPLGVDNVRVMVDIVMGEDTTIPILMKGEIKTLNQTIGSFVAWPRRLVILNEEKNVASPRTSKPRTQSSEHIDVHMTIKLINQYDVRSMNNKDVI
        PT+HGVPLGVDNVRVMVDIV+ E  TIPI ++GEI+TLNQTIG FVAWPRRLVIL+EEKN++S RTS+ RTQ S+H DVH++IKL+N+Y + SM ++D +
Subjt:  PTIHGVPLGVDNVRVMVDIVMGEDTTIPILMKGEIKTLNQTIGSFVAWPRRLVILNEEKNVASPRTSKPRTQSSEHIDVHMTIKLINQYDVRSMNNKDVI

Query:  EFNIGDYIFGMEKIIGLIRDDIMQYCEMIQIGYMCMLTCIAYLWNECEEETHKKFLVVDPDNIVPQIMS
        E N+   IFG EK I L R+DIMQYC MI+IGY C+LT IAYLW+  E E  KKFL+VDP  I P + S
Subjt:  EFNIGDYIFGMEKIIGLIRDDIMQYCEMIQIGYMCMLTCIAYLWNECEEETHKKFLVVDPDNIVPQIMS

TrEMBL top hits (e value, % identity, alignment)
A0A1S3BRX5 uncharacterized protein LOC103493028 isoform X1 (e value: 1.3e-60; % identity: 45.21)
Query:  LGEFVSPSLYFNVAKRKSNLGLQPQD----------------QAELKTQSSQPKKSTTKV-SNVSKKKEKGKEVVNAPT----KVFIPNEQPTM------
        +GEFVSP+L+ NVA+    L  Q QD                +AE +TQ S  +  T +  S+VS+KK KGK+V         K+ +   + T+      
Subjt:  LGEFVSPSLYFNVAKRKSNLGLQPQD----------------QAELKTQSSQPKKSTTKV-SNVSKKKEKGKEVVNAPT----KVFIPNEQPTM------

Query:  -------GKLSHMAIGSVENIVVIGTVFDNDVQSPTIHGVPLGVDNVRVMVDIVMGEDTTIPILMKGEIKTLNQTIGSFVAWPRRLVILNEEKNVASPRT
               G   H+AIGS++N+V +G +F++DVQ PTIHG+PLG +N+RV VDI M ED  +PI +KG+I+TLNQ IG+FVAWPR+LVI+ +EK   S   
Subjt:  -------GKLSHMAIGSVENIVVIGTVFDNDVQSPTIHGVPLGVDNVRVMVDIVMGEDTTIPILMKGEIKTLNQTIGSFVAWPRRLVILNEEKNVASPRT

Query:  SKPRTQSSEHIDVHMTIKLINQYDVRSMNNKDVIEFNIGDYIFGMEKIIGLIRDDIMQYCEMIQIGYMCMLTCIAYLWNECEEETHKKFLVVDPDNIVPQ
        S+  TQSS++ DVH+TIKL+N+Y +++M  +D+I+ ++ ++IFG EK I L RDDI+QYC M +IGY C+LT IA LWN CE E  K+F++VD   I   
Subjt:  SKPRTQSSEHIDVHMTIKLINQYDVRSMNNKDVIEFNIGDYIFGMEKIIGLIRDDIMQYCEMIQIGYMCMLTCIAYLWNECEEETHKKFLVVDPDNIVPQ

Query:  IMS
        I S
Subjt:  IMS

A0A6J1C2H7 uncharacterized protein LOC111007859 isoform X1 (e value: 3.6e-82; % identity: 60.97)
Query:  LGEFVSPSLYFNVAKRKSNLGLQPQDQAELKTQSSQPKKSTTKVSNVSKKKEKGKEVVNAPTKVFIPNEQPTMGKLSHMAIGSVENIVVIGTVFDNDVQS
        +GEFVSPSLYFNV K KS            KTQ  QP KSTT+ SN SKKK KGKE+VN   ++++ +EQ   GK  H+A+ SV+NIV +GT+FDN+VQ 
Subjt:  LGEFVSPSLYFNVAKRKSNLGLQPQDQAELKTQSSQPKKSTTKVSNVSKKKEKGKEVVNAPTKVFIPNEQPTMGKLSHMAIGSVENIVVIGTVFDNDVQS

Query:  PTIHGVPLGVDNVRVMVDIVMGEDTTIPILMKGEIKTLNQTIGSFVAWPRRLVILNEEKNVASPRTSKPRTQSSEHIDVHMTIKLINQYDVRSMNNKDVI
        PT+HGVPLGVDNVRVMVDIV+ E  TIPI ++GEI+TLNQTIG FVAWPRRLVIL+EEKN++S RTS+ RTQ S+H DVH++IKL+N+Y + SM ++D +
Subjt:  PTIHGVPLGVDNVRVMVDIVMGEDTTIPILMKGEIKTLNQTIGSFVAWPRRLVILNEEKNVASPRTSKPRTQSSEHIDVHMTIKLINQYDVRSMNNKDVI

Query:  EFNIGDYIFGMEKIIGLIRDDIMQYCEMIQIGYMCMLTCIAYLWNECEEETHKKFLVVDPDNIVPQIMS
        E N+   IFG EK I L R+DIMQYC MI+IGY C+LT IAYLW+  E E  KKFL+VDP  I P + S
Subjt:  EFNIGDYIFGMEKIIGLIRDDIMQYCEMIQIGYMCMLTCIAYLWNECEEETHKKFLVVDPDNIVPQIMS

A0A6J1C2V2 uncharacterized protein LOC111007859 isoform X4 (e value: 3.6e-82; % identity: 60.97)
Query:  LGEFVSPSLYFNVAKRKSNLGLQPQDQAELKTQSSQPKKSTTKVSNVSKKKEKGKEVVNAPTKVFIPNEQPTMGKLSHMAIGSVENIVVIGTVFDNDVQS
        +GEFVSPSLYFNV K KS            KTQ  QP KSTT+ SN SKKK KGKE+VN   ++++ +EQ   GK  H+A+ SV+NIV +GT+FDN+VQ 
Subjt:  LGEFVSPSLYFNVAKRKSNLGLQPQDQAELKTQSSQPKKSTTKVSNVSKKKEKGKEVVNAPTKVFIPNEQPTMGKLSHMAIGSVENIVVIGTVFDNDVQS

Query:  PTIHGVPLGVDNVRVMVDIVMGEDTTIPILMKGEIKTLNQTIGSFVAWPRRLVILNEEKNVASPRTSKPRTQSSEHIDVHMTIKLINQYDVRSMNNKDVI
        PT+HGVPLGVDNVRVMVDIV+ E  TIPI ++GEI+TLNQTIG FVAWPRRLVIL+EEKN++S RTS+ RTQ S+H DVH++IKL+N+Y + SM ++D +
Subjt:  PTIHGVPLGVDNVRVMVDIVMGEDTTIPILMKGEIKTLNQTIGSFVAWPRRLVILNEEKNVASPRTSKPRTQSSEHIDVHMTIKLINQYDVRSMNNKDVI

Query:  EFNIGDYIFGMEKIIGLIRDDIMQYCEMIQIGYMCMLTCIAYLWNECEEETHKKFLVVDPDNIVPQIMS
        E N+   IFG EK I L R+DIMQYC MI+IGY C+LT IAYLW+  E E  KKFL+VDP  I P + S
Subjt:  EFNIGDYIFGMEKIIGLIRDDIMQYCEMIQIGYMCMLTCIAYLWNECEEETHKKFLVVDPDNIVPQIMS

A0A6J1C398 uncharacterized protein LOC111007859 isoform X3 (e value: 3.6e-82; % identity: 60.97)
Query:  LGEFVSPSLYFNVAKRKSNLGLQPQDQAELKTQSSQPKKSTTKVSNVSKKKEKGKEVVNAPTKVFIPNEQPTMGKLSHMAIGSVENIVVIGTVFDNDVQS
        +GEFVSPSLYFNV K KS            KTQ  QP KSTT+ SN SKKK KGKE+VN   ++++ +EQ   GK  H+A+ SV+NIV +GT+FDN+VQ 
Subjt:  LGEFVSPSLYFNVAKRKSNLGLQPQDQAELKTQSSQPKKSTTKVSNVSKKKEKGKEVVNAPTKVFIPNEQPTMGKLSHMAIGSVENIVVIGTVFDNDVQS

Query:  PTIHGVPLGVDNVRVMVDIVMGEDTTIPILMKGEIKTLNQTIGSFVAWPRRLVILNEEKNVASPRTSKPRTQSSEHIDVHMTIKLINQYDVRSMNNKDVI
        PT+HGVPLGVDNVRVMVDIV+ E  TIPI ++GEI+TLNQTIG FVAWPRRLVIL+EEKN++S RTS+ RTQ S+H DVH++IKL+N+Y + SM ++D +
Subjt:  PTIHGVPLGVDNVRVMVDIVMGEDTTIPILMKGEIKTLNQTIGSFVAWPRRLVILNEEKNVASPRTSKPRTQSSEHIDVHMTIKLINQYDVRSMNNKDVI

Query:  EFNIGDYIFGMEKIIGLIRDDIMQYCEMIQIGYMCMLTCIAYLWNECEEETHKKFLVVDPDNIVPQIMS
        E N+   IFG EK I L R+DIMQYC MI+IGY C+LT IAYLW+  E E  KKFL+VDP  I P + S
Subjt:  EFNIGDYIFGMEKIIGLIRDDIMQYCEMIQIGYMCMLTCIAYLWNECEEETHKKFLVVDPDNIVPQIMS

A0A6J1C4J7 uncharacterized protein LOC111007859 isoform X2 (e value: 3.6e-82; % identity: 60.97)
Query:  LGEFVSPSLYFNVAKRKSNLGLQPQDQAELKTQSSQPKKSTTKVSNVSKKKEKGKEVVNAPTKVFIPNEQPTMGKLSHMAIGSVENIVVIGTVFDNDVQS
        +GEFVSPSLYFNV K KS            KTQ  QP KSTT+ SN SKKK KGKE+VN   ++++ +EQ   GK  H+A+ SV+NIV +GT+FDN+VQ 
Subjt:  LGEFVSPSLYFNVAKRKSNLGLQPQDQAELKTQSSQPKKSTTKVSNVSKKKEKGKEVVNAPTKVFIPNEQPTMGKLSHMAIGSVENIVVIGTVFDNDVQS

Query:  PTIHGVPLGVDNVRVMVDIVMGEDTTIPILMKGEIKTLNQTIGSFVAWPRRLVILNEEKNVASPRTSKPRTQSSEHIDVHMTIKLINQYDVRSMNNKDVI
        PT+HGVPLGVDNVRVMVDIV+ E  TIPI ++GEI+TLNQTIG FVAWPRRLVIL+EEKN++S RTS+ RTQ S+H DVH++IKL+N+Y + SM ++D +
Subjt:  PTIHGVPLGVDNVRVMVDIVMGEDTTIPILMKGEIKTLNQTIGSFVAWPRRLVILNEEKNVASPRTSKPRTQSSEHIDVHMTIKLINQYDVRSMNNKDVI

Query:  EFNIGDYIFGMEKIIGLIRDDIMQYCEMIQIGYMCMLTCIAYLWNECEEETHKKFLVVDPDNIVPQIMS
        E N+   IFG EK I L R+DIMQYC MI+IGY C+LT IAYLW+  E E  KKFL+VDP  I P + S
Subjt:  EFNIGDYIFGMEKIIGLIRDDIMQYCEMIQIGYMCMLTCIAYLWNECEEETHKKFLVVDPDNIVPQIMS

SwissProt top hits
No hits found
Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGTCTGATAGTGTACAAGAAGAAGTTGAGGATGACGACTTTGATGACACAATAGGGATGTTTGAGGCTGCATATGATTACTTTGATGAAAATCCACAAAATTTCGAGGA
GCTACTGGGTGATGCGAAGAAACCATTATATTCAAATTGTAATAACTTTACCAAAATATCTGTATTGTTGAAGTTATATAATTTGAAGAAACCATTAGGTGAGTTTGTTT
CACCATCCCTATATTTTAATGTTGCCAAAAGGAAGTCAAATTTGGGATTGCAACCTCAAGACCAAGCTGAATTGAAGACTCAGAGCTCACAACCAAAAAAATCCACAACT
AAAGTCAGTAATGTCTCAAAGAAAAAGGAGAAAGGGAAGGAAGTTGTCAATGCACCTACTAAAGTTTTTATCCCAAACGAACAACCAACAATGGGAAAACTGTCTCATAT
GGCTATTGGATCGGTGGAGAACATTGTTGTAATAGGCACAGTATTTGATAATGACGTTCAAAGTCCAACAATTCATGGAGTGCCATTAGGTGTGGACAATGTCAGAGTGA
TGGTAGACATTGTCATGGGCGAAGATACTACGATACCAATTCTTATGAAGGGTGAGATAAAGACATTAAATCAAACCATTGGTAGCTTTGTGGCATGGCCTCGTCGACTA
GTAATTTTAAATGAGGAGAAAAATGTTGCTTCTCCTCGAACATCCAAGCCAAGAACACAATCATCTGAACATATTGATGTCCACATGACTATTAAACTCATAAATCAATA
TGACGTGCGTTCAATGAACAACAAGGATGTAATTGAATTCAATATCGGTGATTACATATTTGGAATGGAGAAGATCATTGGTTTAATACGTGATGACATCATGCAATACT
GCGAAATGATTCAAATAGGCTATATGTGTATGCTCACGTGCATCGCGTATCTTTGGAATGAATGTGAGGAGGAGACACATAAGAAGTTCTTGGTGGTTGACCCTGACAAC
ATTGTACCACAAATTATGTCTTAA
mRNA sequence
ATGTCTGATAGTGTACAAGAAGAAGTTGAGGATGACGACTTTGATGACACAATAGGGATGTTTGAGGCTGCATATGATTACTTTGATGAAAATCCACAAAATTTCGAGGA
GCTACTGGGTGATGCGAAGAAACCATTATATTCAAATTGTAATAACTTTACCAAAATATCTGTATTGTTGAAGTTATATAATTTGAAGAAACCATTAGGTGAGTTTGTTT
CACCATCCCTATATTTTAATGTTGCCAAAAGGAAGTCAAATTTGGGATTGCAACCTCAAGACCAAGCTGAATTGAAGACTCAGAGCTCACAACCAAAAAAATCCACAACT
AAAGTCAGTAATGTCTCAAAGAAAAAGGAGAAAGGGAAGGAAGTTGTCAATGCACCTACTAAAGTTTTTATCCCAAACGAACAACCAACAATGGGAAAACTGTCTCATAT
GGCTATTGGATCGGTGGAGAACATTGTTGTAATAGGCACAGTATTTGATAATGACGTTCAAAGTCCAACAATTCATGGAGTGCCATTAGGTGTGGACAATGTCAGAGTGA
TGGTAGACATTGTCATGGGCGAAGATACTACGATACCAATTCTTATGAAGGGTGAGATAAAGACATTAAATCAAACCATTGGTAGCTTTGTGGCATGGCCTCGTCGACTA
GTAATTTTAAATGAGGAGAAAAATGTTGCTTCTCCTCGAACATCCAAGCCAAGAACACAATCATCTGAACATATTGATGTCCACATGACTATTAAACTCATAAATCAATA
TGACGTGCGTTCAATGAACAACAAGGATGTAATTGAATTCAATATCGGTGATTACATATTTGGAATGGAGAAGATCATTGGTTTAATACGTGATGACATCATGCAATACT
GCGAAATGATTCAAATAGGCTATATGTGTATGCTCACGTGCATCGCGTATCTTTGGAATGAATGTGAGGAGGAGACACATAAGAAGTTCTTGGTGGTTGACCCTGACAAC
ATTGTACCACAAATTATGTCTTAA
Protein sequence
MSDSVQEEVEDDDFDDTIGMFEAAYDYFDENPQNFEELLGDAKKPLYSNCNNFTKISVLLKLYNLKKPLGEFVSPSLYFNVAKRKSNLGLQPQDQAELKTQSSQPKKSTT
KVSNVSKKKEKGKEVVNAPTKVFIPNEQPTMGKLSHMAIGSVENIVVIGTVFDNDVQSPTIHGVPLGVDNVRVMVDIVMGEDTTIPILMKGEIKTLNQTIGSFVAWPRRL
VILNEEKNVASPRTSKPRTQSSEHIDVHMTIKLINQYDVRSMNNKDVIEFNIGDYIFGMEKIIGLIRDDIMQYCEMIQIGYMCMLTCIAYLWNECEEETHKKFLVVDPDN
IVPQIMS
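
The CDS and protein sequences listed above are related by translation. The Python below is a minimal sketch of that step, assuming the standard genetic code (NCBI translation table 1), which the page does not state explicitly. The function name `translate` and the use of `X` for unrecognised codons are illustrative choices, not part of the database.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order: first base varies slowest, third base fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon."""
    # Remove line breaks and spaces so wrapped sequences can be pasted as-is.
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    residues = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks unknown codons
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


if __name__ == "__main__":
    # First 30 and last 24 nucleotides of the CDS shown above.
    print(translate("ATGTCTGATAGTGTACAAGAAGAAGTTGAG"))  # MSDSVQEEVE
    print(translate("ATTGTACCACAAATTATGTCTTAA"))        # IVPQIMS
```

The two fragments reproduce the first ten and last seven residues of the protein sequence listed above, which is consistent with the standard code being used. Passing the full CDS, with or without its line breaks, should work the same way.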