; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg031007 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg031007
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionUlp1-like peptidase
Genome locationscaffold10:23406536..23413864
RNA-Seq ExpressionSpg031007
SyntenySpg031007
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022153201.1 uncharacterized protein LOC111020757 [Momordica charantia]1.7e-4337.92Show/hide
Query:  EFDIIIEIRHSTRHVKSNISSIRLRRIYLNDRTTMKGFELDRLFPNLRFENDDDAVKMTLLYFIELAMMGRERKQQMDMSLLGIIDEWDRFCNENWSKII
        EFD+I  + H    V ++I   RLR  Y  D   +K  EL+++F    F +D+D VK+ ++YFIELAMMG+ERKQ +D +LLG++D W+ FCN +WS +I
Subjt:  EFDIIIEIRHSTRHVKSNISSIRLRRIYLNDRTTMKGFELDRLFPNLRFENDDDAVKMTLLYFIELAMMGRERKQQMDMSLLGIIDEWDRFCNENWSKII

Query:  FDKTIKSLKKALSGKVESYKERLDGKQ---ETYSLYGFPYTFRVWMYETISSLTGRVANRVSENVIPWILRWSCSHSPTLAVLSKEVFASNAARVTLELV
        FD+TI SLK AL  K+  Y+++        ETYSLYGFPY F+VW YETIS+L        S++ IP +LRWSC +S    VL+ EVF +  ++V   L+
Subjt:  FDKTIKSLKKALSGKVESYKERLDGKQ---ETYSLYGFPYTFRVWMYETISSLTGRVANRVSENVIPWILRWSCSHSPTLAVLSKEVFASNAARVTLELV

Query:  ATEEEVQFMDRVMEPPQA-----PPAT--------PPHHPPPAPLP-------------ALIDMHVDDTDAEDTHDRTEDVE---TSYEVSDRVCKKCKL
        AT+ + Q M RV+ PP+      PPA         PP  P  A +P              ++D H  D +A  + +  E +E      +   R+ ++ K 
Subjt:  ATEEEVQFMDRVMEPPQA-----PPAT--------PPHHPPPAPLP-------------ALIDMHVDDTDAEDTHDRTEDVE---TSYEVSDRVCKKCKL

Query:  LDSRVEGIKNGIKELNGRMEGIEGDLK
        LD+ V  I++ + +    ++GI+  LK
Subjt:  LDSRVEGIKNGIKELNGRMEGIEGDLK

XP_022154561.1 uncharacterized protein LOC111021802 [Momordica charantia]1.6e-3632.11Show/hide
Query:  NLDLLAQATQTSEFDIIIEIRHSTRHVKSNISSIRLRRIYLNDRTTMKGFELDRLFPNLRFENDDDAVKMTLLYFIELAMMGRERKQQMDMSLLGIIDEW
        NL     + + ++F +I  +++    V+ N    RL  +Y ND+T +   + ++++   RFE+D D VK+ ++Y + + ++GRER  + D +LLGI+D+W
Subjt:  NLDLLAQATQTSEFDIIIEIRHSTRHVKSNISSIRLRRIYLNDRTTMKGFELDRLFPNLRFENDDDAVKMTLLYFIELAMMGRERKQQMDMSLLGIIDEW

Query:  DRFCNENWSKIIFDKTIKSLKKALSGKVESYKERLDGK-QETYSLYGFPYTFRVWMYETISSLTGRVANRVSENVIPWILRWSCSHSPTLAVLSKEVFAS
        +  CN NW+ + F+KTI SL++         K   DGK +++YSLYGFP+ F+VW Y+TISSL+ RVAN+V  + +P I +W   HS    VL +++F S
Subjt:  DRFCNENWSKIIFDKTIKSLKKALSGKVESYKERLDGK-QETYSLYGFPYTFRVWMYETISSLTGRVANRVSENVIPWILRWSCSHSPTLAVLSKEVFAS

Query:  NAARVTLELVATEEEVQFMDRVMEPPQAPPATPPHHPPPAPLPALI---DMHVDDTDAEDTHDRTEDVETSYEVSDRVCKKCKLLDSRVEGIKNGIKELN
           R T  L  T+ E  F++R  +PP +              P+ +     + D++   D     +D E   E +    K C +   R++ ++  +K ++
Subjt:  NAARVTLELVATEEEVQFMDRVMEPPQAPPATPPHHPPPAPLPALI---DMHVDDTDAEDTHDRTEDVETSYEVSDRVCKKCKLLDSRVEGIKNGIKELN

Query:  GRMEGIEGDLKVIKSIEKDIKAIKKFM
         RM+   GD      IE ++K+IKKF+
Subjt:  GRMEGIEGDLKVIKSIEKDIKAIKKFM

XP_022154965.1 uncharacterized protein LOC111022110 [Momordica charantia]2.7e-3639.41Show/hide
Query:  MMGRERKQQMDMSLLGIIDEWDRFCNENWSKIIFDKTIKSLKKALSGKVESYKERL---DGKQETYSLYGFPYTFRVWMYETISSLTGRVANRVSENVIP
        MMG+ERKQ+MD SLLGI+D W+ FC+ + S +IF++T+ SLK AL  KVE+YK+++       ETYSLYGFPY F+VW YETIS+L+ RVA R++++ IP
Subjt:  MMGRERKQQMDMSLLGIIDEWDRFCNENWSKIIFDKTIKSLKKALSGKVESYKERL---DGKQETYSLYGFPYTFRVWMYETISSLTGRVANRVSENVIP

Query:  WILRWSCSHSPTLAVLSKEVFASNAARVTLELVATEEEVQFMDRVMEPPQAP----------------PATPPHHPPPAPLPALIDMHVDDTDAEDTHDR
         +LRWSC++S    VL +EVF +  ++V + L AT+ E Q M RVM PP AP                 +T    P  + +  L+++     DA    DR
Subjt:  WILRWSCSHSPTLAVLSKEVFASNAARVTLELVATEEEVQFMDRVMEPPQAP----------------PATPPHHPPPAPLPALIDMHVDDTDAEDTHDR

Query:  -TEDVETSYEVSDRVCKKCKLLDSRVEGIKNGIKELNGRMEGIEGDLKVIKSIEKDIKAIKKFMRRLSK
         TED+  +    D++  +      + +      +EL    + +      +  +  DIK IKKFM+RL+K
Subjt:  -TEDVETSYEVSDRVCKKCKLLDSRVEGIKNGIKELNGRMEGIEGDLKVIKSIEKDIKAIKKFMRRLSK

XP_022157020.1 uncharacterized protein LOC111023847 [Momordica charantia]3.2e-5050.24Show/hide
Query:  EFDIIIEIRHSTRHVKSNISSIRLRRIYLNDRTTMKGFELDRLFPNLRFENDDDAVKMTLLYFIELAMMGRERKQQMDMSLLGIIDEWDRFCNENWSKII
        EFD+I  +RH+   V  ++ + RLR +Y  D+ ++K  EL+++F    FEND+DAVK+ ++YFIELAMMG+ERK +MD SLLGI+D W+ FCN +WS +I
Subjt:  EFDIIIEIRHSTRHVKSNISSIRLRRIYLNDRTTMKGFELDRLFPNLRFENDDDAVKMTLLYFIELAMMGRERKQQMDMSLLGIIDEWDRFCNENWSKII

Query:  FDKTIKSLKKALSGKVESYKERL---DGKQETYSLYGFPYTFRVWMYETISSLTGRVANRVSENVIPWILRWSCSHSPTLAVLSKEVFASNAARVTLELV
        F++T+ SLK AL  KVE YK+++       ETYSLY FPY F+VW YETIS+L+ RVA R++++ IP +LRWSC++S    VL +EVF +  ++V + L 
Subjt:  FDKTIKSLKKALSGKVESYKERL---DGKQETYSLYGFPYTFRVWMYETISSLTGRVANRVSENVIPWILRWSCSHSPTLAVLSKEVFASNAARVTLELV

Query:  ATEEE
        AT+ E
Subjt:  ATEEE

XP_022158673.1 uncharacterized protein LOC111025136 [Momordica charantia]2.2e-3542.47Show/hide
Query:  SEFDIIIEIRHSTRHVKSNISSIRLRRIYLNDRTTMKGFELDRLFPNLRFENDDDAVKMTLLYFIELAMMGRERKQQMDMSLLGIIDEWDRFCNENWSKI
        SEF +I  +++S    + + S  RLR++Y +D+  +   E +  +  ++FE+D DAVK+++L F+EL + GR+R  ++D SLLG++D+ +  CN  W+++
Subjt:  SEFDIIIEIRHSTRHVKSNISSIRLRRIYLNDRTTMKGFELDRLFPNLRFENDDDAVKMTLLYFIELAMMGRERKQQMDMSLLGIIDEWDRFCNENWSKI

Query:  IFDKTIKSLKKALSGKVESYKERLDGKQETYSLYGFPYTFRVWMYETISSLTGRVANRVSENVIPWILRWSCSHSPTLAVLSKEVF
         F+KTI+SLK+AL     + K R  G ++TYSLYGFP+ F+VW YETIS LT RVA+ +  + +P IL+W C +SP   V+ KE+F
Subjt:  IFDKTIKSLKKALSGKVESYKERLDGKQETYSLYGFPYTFRVWMYETISSLTGRVANRVSENVIPWILRWSCSHSPTLAVLSKEVF

TrEMBL top hitse value%identityAlignment
A0A6J1DJX9 uncharacterized protein LOC1110207578.3e-4437.92Show/hide
Query:  EFDIIIEIRHSTRHVKSNISSIRLRRIYLNDRTTMKGFELDRLFPNLRFENDDDAVKMTLLYFIELAMMGRERKQQMDMSLLGIIDEWDRFCNENWSKII
        EFD+I  + H    V ++I   RLR  Y  D   +K  EL+++F    F +D+D VK+ ++YFIELAMMG+ERKQ +D +LLG++D W+ FCN +WS +I
Subjt:  EFDIIIEIRHSTRHVKSNISSIRLRRIYLNDRTTMKGFELDRLFPNLRFENDDDAVKMTLLYFIELAMMGRERKQQMDMSLLGIIDEWDRFCNENWSKII

Query:  FDKTIKSLKKALSGKVESYKERLDGKQ---ETYSLYGFPYTFRVWMYETISSLTGRVANRVSENVIPWILRWSCSHSPTLAVLSKEVFASNAARVTLELV
        FD+TI SLK AL  K+  Y+++        ETYSLYGFPY F+VW YETIS+L        S++ IP +LRWSC +S    VL+ EVF +  ++V   L+
Subjt:  FDKTIKSLKKALSGKVESYKERLDGKQ---ETYSLYGFPYTFRVWMYETISSLTGRVANRVSENVIPWILRWSCSHSPTLAVLSKEVFASNAARVTLELV

Query:  ATEEEVQFMDRVMEPPQA-----PPAT--------PPHHPPPAPLP-------------ALIDMHVDDTDAEDTHDRTEDVE---TSYEVSDRVCKKCKL
        AT+ + Q M RV+ PP+      PPA         PP  P  A +P              ++D H  D +A  + +  E +E      +   R+ ++ K 
Subjt:  ATEEEVQFMDRVMEPPQA-----PPAT--------PPHHPPPAPLP-------------ALIDMHVDDTDAEDTHDRTEDVE---TSYEVSDRVCKKCKL

Query:  LDSRVEGIKNGIKELNGRMEGIEGDLK
        LD+ V  I++ + +    ++GI+  LK
Subjt:  LDSRVEGIKNGIKELNGRMEGIEGDLK

A0A6J1DL40 uncharacterized protein LOC1110221101.3e-3639.41Show/hide
Query:  MMGRERKQQMDMSLLGIIDEWDRFCNENWSKIIFDKTIKSLKKALSGKVESYKERL---DGKQETYSLYGFPYTFRVWMYETISSLTGRVANRVSENVIP
        MMG+ERKQ+MD SLLGI+D W+ FC+ + S +IF++T+ SLK AL  KVE+YK+++       ETYSLYGFPY F+VW YETIS+L+ RVA R++++ IP
Subjt:  MMGRERKQQMDMSLLGIIDEWDRFCNENWSKIIFDKTIKSLKKALSGKVESYKERL---DGKQETYSLYGFPYTFRVWMYETISSLTGRVANRVSENVIP

Query:  WILRWSCSHSPTLAVLSKEVFASNAARVTLELVATEEEVQFMDRVMEPPQAP----------------PATPPHHPPPAPLPALIDMHVDDTDAEDTHDR
         +LRWSC++S    VL +EVF +  ++V + L AT+ E Q M RVM PP AP                 +T    P  + +  L+++     DA    DR
Subjt:  WILRWSCSHSPTLAVLSKEVFASNAARVTLELVATEEEVQFMDRVMEPPQAP----------------PATPPHHPPPAPLPALIDMHVDDTDAEDTHDR

Query:  -TEDVETSYEVSDRVCKKCKLLDSRVEGIKNGIKELNGRMEGIEGDLKVIKSIEKDIKAIKKFMRRLSK
         TED+  +    D++  +      + +      +EL    + +      +  +  DIK IKKFM+RL+K
Subjt:  -TEDVETSYEVSDRVCKKCKLLDSRVEGIKNGIKELNGRMEGIEGDLKVIKSIEKDIKAIKKFMRRLSK

A0A6J1DP34 uncharacterized protein LOC1110218027.5e-3732.11Show/hide
Query:  NLDLLAQATQTSEFDIIIEIRHSTRHVKSNISSIRLRRIYLNDRTTMKGFELDRLFPNLRFENDDDAVKMTLLYFIELAMMGRERKQQMDMSLLGIIDEW
        NL     + + ++F +I  +++    V+ N    RL  +Y ND+T +   + ++++   RFE+D D VK+ ++Y + + ++GRER  + D +LLGI+D+W
Subjt:  NLDLLAQATQTSEFDIIIEIRHSTRHVKSNISSIRLRRIYLNDRTTMKGFELDRLFPNLRFENDDDAVKMTLLYFIELAMMGRERKQQMDMSLLGIIDEW

Query:  DRFCNENWSKIIFDKTIKSLKKALSGKVESYKERLDGK-QETYSLYGFPYTFRVWMYETISSLTGRVANRVSENVIPWILRWSCSHSPTLAVLSKEVFAS
        +  CN NW+ + F+KTI SL++         K   DGK +++YSLYGFP+ F+VW Y+TISSL+ RVAN+V  + +P I +W   HS    VL +++F S
Subjt:  DRFCNENWSKIIFDKTIKSLKKALSGKVESYKERLDGK-QETYSLYGFPYTFRVWMYETISSLTGRVANRVSENVIPWILRWSCSHSPTLAVLSKEVFAS

Query:  NAARVTLELVATEEEVQFMDRVMEPPQAPPATPPHHPPPAPLPALI---DMHVDDTDAEDTHDRTEDVETSYEVSDRVCKKCKLLDSRVEGIKNGIKELN
           R T  L  T+ E  F++R  +PP +              P+ +     + D++   D     +D E   E +    K C +   R++ ++  +K ++
Subjt:  NAARVTLELVATEEEVQFMDRVMEPPQAPPATPPHHPPPAPLPALI---DMHVDDTDAEDTHDRTEDVETSYEVSDRVCKKCKLLDSRVEGIKNGIKELN

Query:  GRMEGIEGDLKVIKSIEKDIKAIKKFM
         RM+   GD      IE ++K+IKKF+
Subjt:  GRMEGIEGDLKVIKSIEKDIKAIKKFM

A0A6J1DRZ7 uncharacterized protein LOC1110238471.6e-5050.24Show/hide
Query:  EFDIIIEIRHSTRHVKSNISSIRLRRIYLNDRTTMKGFELDRLFPNLRFENDDDAVKMTLLYFIELAMMGRERKQQMDMSLLGIIDEWDRFCNENWSKII
        EFD+I  +RH+   V  ++ + RLR +Y  D+ ++K  EL+++F    FEND+DAVK+ ++YFIELAMMG+ERK +MD SLLGI+D W+ FCN +WS +I
Subjt:  EFDIIIEIRHSTRHVKSNISSIRLRRIYLNDRTTMKGFELDRLFPNLRFENDDDAVKMTLLYFIELAMMGRERKQQMDMSLLGIIDEWDRFCNENWSKII

Query:  FDKTIKSLKKALSGKVESYKERL---DGKQETYSLYGFPYTFRVWMYETISSLTGRVANRVSENVIPWILRWSCSHSPTLAVLSKEVFASNAARVTLELV
        F++T+ SLK AL  KVE YK+++       ETYSLY FPY F+VW YETIS+L+ RVA R++++ IP +LRWSC++S    VL +EVF +  ++V + L 
Subjt:  FDKTIKSLKKALSGKVESYKERL---DGKQETYSLYGFPYTFRVWMYETISSLTGRVANRVSENVIPWILRWSCSHSPTLAVLSKEVFASNAARVTLELV

Query:  ATEEE
        AT+ E
Subjt:  ATEEE

A0A6J1DWS4 uncharacterized protein LOC1110251361.1e-3542.47Show/hide
Query:  SEFDIIIEIRHSTRHVKSNISSIRLRRIYLNDRTTMKGFELDRLFPNLRFENDDDAVKMTLLYFIELAMMGRERKQQMDMSLLGIIDEWDRFCNENWSKI
        SEF +I  +++S    + + S  RLR++Y +D+  +   E +  +  ++FE+D DAVK+++L F+EL + GR+R  ++D SLLG++D+ +  CN  W+++
Subjt:  SEFDIIIEIRHSTRHVKSNISSIRLRRIYLNDRTTMKGFELDRLFPNLRFENDDDAVKMTLLYFIELAMMGRERKQQMDMSLLGIIDEWDRFCNENWSKI

Query:  IFDKTIKSLKKALSGKVESYKERLDGKQETYSLYGFPYTFRVWMYETISSLTGRVANRVSENVIPWILRWSCSHSPTLAVLSKEVF
         F+KTI+SLK+AL     + K R  G ++TYSLYGFP+ F+VW YETIS LT RVA+ +  + +P IL+W C +SP   V+ KE+F
Subjt:  IFDKTIKSLKKALSGKVESYKERLDGKQETYSLYGFPYTFRVWMYETISSLTGRVANRVSENVIPWILRWSCSHSPTLAVLSKEVF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCATCAGCCCCCCTACCTCAGCTTTTCTCTCCTTCTTCGACAGACGACAGTGGTTTGAGGAGCTTTGATGTGCGGATTTTCCTCTTCTCCAGTGAGCAATTTGAAGT
CGCGGCGACACTTCTCTCCCTTCAGCCAGCAGCCGATGGTGACGCCCACGAACAAGCAAGCTTTTTCACTTCAAGACAAGATTCAATGAATCTTTTGGATAAACAACTTT
TCAATTCGTTGGTCAATAAATTGCACAACATTATTCTGAAACACATAGGATATGGATTCTTGTTGAGTTCTGTCGCCTTGACAATGTCTTCTAAGATACCTCGGGAGGTC
GTAAGTGAAGCAAGTAAATTAAGAATTAGGGAGAAGGTTCAATACCTAGTTGAGTTCCCCTACCTTAGGCCTGGTGAGCCATTAGTGATTGAGGAGATAATTGTGCCGCC
GCCGATGGCTCGGGAAGAAATGGTTGATTGGAGCTATAGCGCAATATTGGAGTTAATCGGATGCTCGGGACGTGAAAAGATGCAAAGAAATGAAAAAGAATCAAAGAGAA
CAAAGTCAACATTCGGTCAACACTTGACCAGCGTCGAGACGCCATCCCTTGAGCGTCAAGACGCTGGCATTCCATATCAGAATAGGCGCAAAAAGGAGTGGGTTTTTCAT
ATGCGGAATGATGCATCACGGTCAGGATGCTTAGGTGGAAAGCGAGACGGGACCGGGAAAACAATTTCTTGTTGGAAATGCTTTTCCCATGCCCGATCTCGATCGGGCGA
TCCGACTGGGAATCTCGATCTTCTCGCTCAGGCGACCCAAACGAGTGAATTCGATATCATTATCGAGATTAGGCATAGTACTAGACACGTTAAGAGCAATATCAGTAGTA
TTAGGCTTAGAAGAATCTACCTGAACGACAGAACGACGATGAAAGGATTTGAGTTAGATAGATTATTCCCTAACCTCCGATTTGAGAATGATGACGACGCAGTTAAGATG
ACCCTGCTTTATTTCATCGAGCTTGCGATGATGGGGAGAGAGAGGAAACAACAAATGGATATGAGCCTGCTAGGTATAATTGACGAGTGGGATAGATTTTGCAATGAAAA
TTGGAGCAAAATCATATTTGATAAGACCATTAAATCATTGAAGAAAGCTTTGTCTGGAAAAGTGGAATCTTATAAGGAGAGATTAGATGGTAAACAAGAGACGTATAGTC
TTTACGGCTTCCCATACACGTTTCGGGTATGGATGTATGAGACTATTTCGTCGTTAACTGGACGTGTTGCTAACCGTGTCAGCGAAAATGTCATCCCGTGGATTCTTAGA
TGGTCATGTTCCCACTCGCCCACTCTGGCAGTGCTTAGTAAAGAGGTTTTTGCATCAAACGCAGCGAGGGTCACATTGGAACTTGTGGCCACAGAGGAGGAGGTTCAATT
TATGGACCGTGTGATGGAGCCGCCTCAAGCCCCACCTGCCACCCCACCTCATCATCCACCTCCAGCTCCACTCCCAGCACTTATCGATATGCATGTTGATGATACAGATG
CTGAGGATACCCATGATAGGACGGAGGATGTTGAGACTAGTTATGAGGTTTCTGACAGAGTTTGTAAGAAGTGTAAACTTCTTGATAGCCGTGTGGAGGGCATTAAAAAT
GGCATCAAGGAGTTAAATGGAAGGATGGAGGGAATCGAAGGAGACCTGAAGGTGATCAAGTCAATAGAGAAAGATATCAAGGCAATAAAGAAGTTCATGCGTCGATTGTC
TAAGTGCGAGATGATGGTCCAGACGATCAGGATGGGGCAGGATCTAGATCTGACGCGAAAGGAGCAGACGTTGAATTTGGCAACTGGTTCAGCCGTTCAGACCCAACAAA
AAGGTCCAGTAATGGAAGGGACTGATGGTGGTGATCCACTTCAAGGGGTGGAAAAGGGAATCGTTGATCTAGTCGAGGGGGATGATTGCACTGAGAGTGGTGAAGTTGTA
GTGGGCAAGGAACTGGAAGTCACCGAGGCTCATAATGCGCTGGTTAATGGTCGTAATGTGCTGGGTGTCCAATCTACTTCTCAACAAAACGAGCCCATAGAACGACGGGA
GACTCGTAAGAGGAAGACTGCATGGAAGTCGAGAACTCCATGGAAAGACACACGGGAAGATGGGAAGAAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGCCATCAGCCCCCCTACCTCAGCTTTTCTCTCCTTCTTCGACAGACGACAGTGGTTTGAGGAGCTTTGATGTGCGGATTTTCCTCTTCTCCAGTGAGCAATTTGAAGT
CGCGGCGACACTTCTCTCCCTTCAGCCAGCAGCCGATGGTGACGCCCACGAACAAGCAAGCTTTTTCACTTCAAGACAAGATTCAATGAATCTTTTGGATAAACAACTTT
TCAATTCGTTGGTCAATAAATTGCACAACATTATTCTGAAACACATAGGATATGGATTCTTGTTGAGTTCTGTCGCCTTGACAATGTCTTCTAAGATACCTCGGGAGGTC
GTAAGTGAAGCAAGTAAATTAAGAATTAGGGAGAAGGTTCAATACCTAGTTGAGTTCCCCTACCTTAGGCCTGGTGAGCCATTAGTGATTGAGGAGATAATTGTGCCGCC
GCCGATGGCTCGGGAAGAAATGGTTGATTGGAGCTATAGCGCAATATTGGAGTTAATCGGATGCTCGGGACGTGAAAAGATGCAAAGAAATGAAAAAGAATCAAAGAGAA
CAAAGTCAACATTCGGTCAACACTTGACCAGCGTCGAGACGCCATCCCTTGAGCGTCAAGACGCTGGCATTCCATATCAGAATAGGCGCAAAAAGGAGTGGGTTTTTCAT
ATGCGGAATGATGCATCACGGTCAGGATGCTTAGGTGGAAAGCGAGACGGGACCGGGAAAACAATTTCTTGTTGGAAATGCTTTTCCCATGCCCGATCTCGATCGGGCGA
TCCGACTGGGAATCTCGATCTTCTCGCTCAGGCGACCCAAACGAGTGAATTCGATATCATTATCGAGATTAGGCATAGTACTAGACACGTTAAGAGCAATATCAGTAGTA
TTAGGCTTAGAAGAATCTACCTGAACGACAGAACGACGATGAAAGGATTTGAGTTAGATAGATTATTCCCTAACCTCCGATTTGAGAATGATGACGACGCAGTTAAGATG
ACCCTGCTTTATTTCATCGAGCTTGCGATGATGGGGAGAGAGAGGAAACAACAAATGGATATGAGCCTGCTAGGTATAATTGACGAGTGGGATAGATTTTGCAATGAAAA
TTGGAGCAAAATCATATTTGATAAGACCATTAAATCATTGAAGAAAGCTTTGTCTGGAAAAGTGGAATCTTATAAGGAGAGATTAGATGGTAAACAAGAGACGTATAGTC
TTTACGGCTTCCCATACACGTTTCGGGTATGGATGTATGAGACTATTTCGTCGTTAACTGGACGTGTTGCTAACCGTGTCAGCGAAAATGTCATCCCGTGGATTCTTAGA
TGGTCATGTTCCCACTCGCCCACTCTGGCAGTGCTTAGTAAAGAGGTTTTTGCATCAAACGCAGCGAGGGTCACATTGGAACTTGTGGCCACAGAGGAGGAGGTTCAATT
TATGGACCGTGTGATGGAGCCGCCTCAAGCCCCACCTGCCACCCCACCTCATCATCCACCTCCAGCTCCACTCCCAGCACTTATCGATATGCATGTTGATGATACAGATG
CTGAGGATACCCATGATAGGACGGAGGATGTTGAGACTAGTTATGAGGTTTCTGACAGAGTTTGTAAGAAGTGTAAACTTCTTGATAGCCGTGTGGAGGGCATTAAAAAT
GGCATCAAGGAGTTAAATGGAAGGATGGAGGGAATCGAAGGAGACCTGAAGGTGATCAAGTCAATAGAGAAAGATATCAAGGCAATAAAGAAGTTCATGCGTCGATTGTC
TAAGTGCGAGATGATGGTCCAGACGATCAGGATGGGGCAGGATCTAGATCTGACGCGAAAGGAGCAGACGTTGAATTTGGCAACTGGTTCAGCCGTTCAGACCCAACAAA
AAGGTCCAGTAATGGAAGGGACTGATGGTGGTGATCCACTTCAAGGGGTGGAAAAGGGAATCGTTGATCTAGTCGAGGGGGATGATTGCACTGAGAGTGGTGAAGTTGTA
GTGGGCAAGGAACTGGAAGTCACCGAGGCTCATAATGCGCTGGTTAATGGTCGTAATGTGCTGGGTGTCCAATCTACTTCTCAACAAAACGAGCCCATAGAACGACGGGA
GACTCGTAAGAGGAAGACTGCATGGAAGTCGAGAACTCCATGGAAAGACACACGGGAAGATGGGAAGAAGTGA
Protein sequenceShow/hide protein sequence
MPSAPLPQLFSPSSTDDSGLRSFDVRIFLFSSEQFEVAATLLSLQPAADGDAHEQASFFTSRQDSMNLLDKQLFNSLVNKLHNIILKHIGYGFLLSSVALTMSSKIPREV
VSEASKLRIREKVQYLVEFPYLRPGEPLVIEEIIVPPPMAREEMVDWSYSAILELIGCSGREKMQRNEKESKRTKSTFGQHLTSVETPSLERQDAGIPYQNRRKKEWVFH
MRNDASRSGCLGGKRDGTGKTISCWKCFSHARSRSGDPTGNLDLLAQATQTSEFDIIIEIRHSTRHVKSNISSIRLRRIYLNDRTTMKGFELDRLFPNLRFENDDDAVKM
TLLYFIELAMMGRERKQQMDMSLLGIIDEWDRFCNENWSKIIFDKTIKSLKKALSGKVESYKERLDGKQETYSLYGFPYTFRVWMYETISSLTGRVANRVSENVIPWILR
WSCSHSPTLAVLSKEVFASNAARVTLELVATEEEVQFMDRVMEPPQAPPATPPHHPPPAPLPALIDMHVDDTDAEDTHDRTEDVETSYEVSDRVCKKCKLLDSRVEGIKN
GIKELNGRMEGIEGDLKVIKSIEKDIKAIKKFMRRLSKCEMMVQTIRMGQDLDLTRKEQTLNLATGSAVQTQQKGPVMEGTDGGDPLQGVEKGIVDLVEGDDCTESGEVV
VGKELEVTEAHNALVNGRNVLGVQSTSQQNEPIERRETRKRKTAWKSRTPWKDTREDGKK