; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg020003 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg020003
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionUlp1-like peptidase
Genome locationscaffold5:36999352..37001917
RNA-Seq ExpressionSpg020003
SyntenySpg020003
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domainsIPR003653 - Ulp1 protease family, C-terminal catalytic domain
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022153201.1 uncharacterized protein LOC111020757 [Momordica charantia]2.6e-3939.3Show/hide
Query:  ELDRLFPNINFESDEDAVKMAIFYFIELAMMGSERKQQMDTSLLGIIDDWQRFCNEDWSKLIFYKTIKGLKKAVGGKEVSYKERTDGKQ---ETYSLYGF
        EL+++F    F  DED VK+ I YFIELAMMG ERKQ +DT+LLG++D W+ FCN DWS +IF +TI  LK A+  K   Y+++        ETYSLYGF
Subjt:  ELDRLFPNINFESDEDAVKMAIFYFIELAMMGSERKQQMDTSLLGIIDDWQRFCNEDWSKLIFYKTIKGLKKAVGGKEVSYKERTDGKQ---ETYSLYGF

Query:  PYAFQVWTYETASSLTGRVANRLNDNAIPRILRWSCSHSPTLAALSREVFSSDMARVTTELVASEKEIQFMDHVMQPPRA---PSPPPPP-----PPPPL
        PYAFQVW YET S+        L+D+AIPR+LRWSC +S     L+ EVF +  ++V   L+A++ + Q M  V+ PP     P PP  P     P PP 
Subjt:  PYAFQVWTYETASSLTGRVANRLNDNAIPRILRWSCSHSPTLAALSREVFSSDMARVTTELVASEKEIQFMDHVMQPPRA---PSPPPPP-----PPPPL

Query:  PPPTALEDIPDEDTVIEDLE--TKNTNEVVEGVGTYVTNDRICKRCKVLEDEIKVIKEDVKVIKEDVKVIKSIEKD----LKAIRKFMRRLSKGKFVDAS
         P  A    P  D  +  LE    + + V E   +    + + KR K  + + K I   +K +   V  I+    D    LK I+ ++++L+KGKF D+S
Subjt:  PPPTALEDIPDEDTVIEDLE--TKNTNEVVEGVGTYVTNDRICKRCKVLEDEIKVIKEDVKVIKEDVKVIKSIEKD----LKAIRKFMRRLSKGKFVDAS

Query:  KYIDPDDG-SDDGGAGSRPHSKGQDDDGGPAFGSQAKSNDD
        KY     G  DDG +  RP    + D G  +     +S++D
Subjt:  KYIDPDDG-SDDGGAGSRPHSKGQDDDGGPAFGSQAKSNDD

XP_022154965.1 uncharacterized protein LOC111022110 [Momordica charantia]1.5e-3438.48Show/hide
Query:  MMGSERKQQMDTSLLGIIDDWQRFCNEDWSKLIFYKTIKGLKKAVGGKEVSYKERT---DGKQETYSLYGFPYAFQVWTYETASSLTGRVANRLNDNAIP
        MMG ERKQ+MDTSLLGI+D W+ FC+ D S +IF +T+  LK A+  K  +YK++        ETYSLYGFPYAFQVW YET S+L+ RVA RLND+AIP
Subjt:  MMGSERKQQMDTSLLGIIDDWQRFCNEDWSKLIFYKTIKGLKKAVGGKEVSYKERT---DGKQETYSLYGFPYAFQVWTYETASSLTGRVANRLNDNAIP

Query:  RILRWSCSHSPTLAALSREVFSSDMARVTTELVASEKEIQFMDHVMQPPRAPSPPPPP---PPPPL---------PPPTALEDIPDEDTVIEDLETKNTN
        R+LRWSC++S     L REVF +  ++V   L A++ E Q M  VM PP AP  PP P      PL         P  + + D+ + D V +D       
Subjt:  RILRWSCSHSPTLAALSREVFSSDMARVTTELVASEKEIQFMDHVMQPPRAPSPPPPP---PPPPL---------PPPTALEDIPDEDTVIEDLETKNTN

Query:  EVVEGVGTYVTNDRICKRCKVLEDEIKVIK----EDVKVIKEDVKVIKS----IEKDLKAIRKFMRRLSKGKFVDASKY-----IDPDDGSDDGGAGSRP
           + +GT    D++  + K  E + K  K     +++ + + V  I++    +  D+K I+KFM+RL+K      +KY     +   DGS  G   S  
Subjt:  EVVEGVGTYVTNDRICKRCKVLEDEIKVIK----EDVKVIKEDVKVIKS----IEKDLKAIRKFMRRLSKGKFVDASKY-----IDPDDGSDDGGAGSRP

Query:  HSKGQDDDGGPAFGSQAKSNDDTPMADHADPMDTTEQHGQVEE
        + +  D D  P  G + K+ D+  M +  DP  T E+   V E
Subjt:  HSKGQDDDGGPAFGSQAKSNDDTPMADHADPMDTTEQHGQVEE

XP_022157020.1 uncharacterized protein LOC111023847 [Momordica charantia]5.9e-4456.89Show/hide
Query:  ELDRLFPNINFESDEDAVKMAIFYFIELAMMGSERKQQMDTSLLGIIDDWQRFCNEDWSKLIFYKTIKGLKKAVGGKEVSYKERT---DGKQETYSLYGF
        EL+++F    FE+DEDAVK+AI YFIELAMMG ERK +MDTSLLGI+D W+ FCN DWS +IF +T+  LK A+  K   YK++        ETYSLY F
Subjt:  ELDRLFPNINFESDEDAVKMAIFYFIELAMMGSERKQQMDTSLLGIIDDWQRFCNEDWSKLIFYKTIKGLKKAVGGKEVSYKERT---DGKQETYSLYGF

Query:  PYAFQVWTYETASSLTGRVANRLNDNAIPRILRWSCSHSPTLAALSREVFSSDMARVTTELVASEKE
        PYAFQVW YET S+L+ RVA RLND+AIPR+LRWSC++S     L REVF +  ++V   L A++ E
Subjt:  PYAFQVWTYETASSLTGRVANRLNDNAIPRILRWSCSHSPTLAALSREVFSSDMARVTTELVASEKE

XP_022158673.1 uncharacterized protein LOC111025136 [Momordica charantia]2.4e-2944.22Show/hide
Query:  ELDRLFPNINFESDEDAVKMAIFYFIELAMMGSERKQQMDTSLLGIIDDWQRFCNEDWSKLIFYKTIKGLKKAVGGKEVSYKERTDGKQETYSLYGFPYA
        E +  +  + FE D DAVK+++  F+EL + G +R  ++D SLLG++DD +  CN  W+++ F KTI+ LK+A     ++ K R  G ++TYSLYGFP+A
Subjt:  ELDRLFPNINFESDEDAVKMAIFYFIELAMMGSERKQQMDTSLLGIIDDWQRFCNEDWSKLIFYKTIKGLKKAVGGKEVSYKERTDGKQETYSLYGFPYA

Query:  FQVWTYETASSLTGRVANRLNDNAIPRILRWSCSHSPTLAALSREVF
        FQVW YET S LT RVA+ +  + +PRIL+W C +SP    + +E+F
Subjt:  FQVWTYETASSLTGRVANRLNDNAIPRILRWSCSHSPTLAALSREVF

XP_022158744.1 uncharacterized protein LOC111025209 [Momordica charantia]6.4e-3045.64Show/hide
Query:  ELDRLFPNINFESDEDAVKMAIFYFIELAMMGSERKQQMDTSLLGIIDDWQRFCNEDWSKLIFYKTIKGLKKAVGGKEVSYKERTDGKQETYSLYGFPYA
        EL++++ +I FE D DAVK+ + YF+EL ++G ER  + D  LLGI+DDW+  CN DW+ L F KTI  L++       S K +  G +++YSLYGFP+A
Subjt:  ELDRLFPNINFESDEDAVKMAIFYFIELAMMGSERKQQMDTSLLGIIDDWQRFCNEDWSKLIFYKTIKGLKKAVGGKEVSYKERTDGKQETYSLYGFPYA

Query:  FQVWTYETASSLTGRVANRLNDNAIPRILRWSCSHSPTLAALSREVFSS
        FQVW YE  SSL+G +   ++ + +PRIL+W   HS     L+RE+F S
Subjt:  FQVWTYETASSLTGRVANRLNDNAIPRILRWSCSHSPTLAALSREVFSS

TrEMBL top hitse value%identityAlignment
A0A1S3ATU8 uncharacterized protein LOC103482899 isoform X16.4e-2840.45Show/hide
Query:  MEGFELDRLFPNINFESDEDAVKMAIFYFIELAMMGSER-KQQMDTSLLGIIDDWQRFCNEDWSKLIFYKTIKGLKKAVGGKEVSYKERTDGKQE---TY
        M     +  + N+NF  D DAVK+ + Y+ ELAMMG +R K  ++ SLL  ++D + + + DW  +++ +T++GL+ A+  K   YK++    +     Y
Subjt:  MEGFELDRLFPNINFESDEDAVKMAIFYFIELAMMGSER-KQQMDTSLLGIIDDWQRFCNEDWSKLIFYKTIKGLKKAVGGKEVSYKERTDGKQE---TY

Query:  SLYGFPYAFQVWTYETASSL-TGRVANRLNDNAIPRILRWSCSHSPTLAALSREVFSSDMARVTTELVASEKEIQFMD
        SL GFP+AFQVW YE  SS+  GR   RLN++A+PR LRWSCS+S     L R++F+S    ++  LV S+ E QF D
Subjt:  SLYGFPYAFQVWTYETASSL-TGRVANRLNDNAIPRILRWSCSHSPTLAALSREVFSSDMARVTTELVASEKEIQFMD

A0A1S3AUB0 uncharacterized protein LOC103482899 isoform X26.4e-2840.45Show/hide
Query:  MEGFELDRLFPNINFESDEDAVKMAIFYFIELAMMGSER-KQQMDTSLLGIIDDWQRFCNEDWSKLIFYKTIKGLKKAVGGKEVSYKERTDGKQE---TY
        M     +  + N+NF  D DAVK+ + Y+ ELAMMG +R K  ++ SLL  ++D + + + DW  +++ +T++GL+ A+  K   YK++    +     Y
Subjt:  MEGFELDRLFPNINFESDEDAVKMAIFYFIELAMMGSER-KQQMDTSLLGIIDDWQRFCNEDWSKLIFYKTIKGLKKAVGGKEVSYKERTDGKQE---TY

Query:  SLYGFPYAFQVWTYETASSL-TGRVANRLNDNAIPRILRWSCSHSPTLAALSREVFSSDMARVTTELVASEKEIQFMD
        SL GFP+AFQVW YE  SS+  GR   RLN++A+PR LRWSCS+S     L R++F+S    ++  LV S+ E QF D
Subjt:  SLYGFPYAFQVWTYETASSL-TGRVANRLNDNAIPRILRWSCSHSPTLAALSREVFSSDMARVTTELVASEKEIQFMD

A0A6J1DJX9 uncharacterized protein LOC1110207571.2e-3939.3Show/hide
Query:  ELDRLFPNINFESDEDAVKMAIFYFIELAMMGSERKQQMDTSLLGIIDDWQRFCNEDWSKLIFYKTIKGLKKAVGGKEVSYKERTDGKQ---ETYSLYGF
        EL+++F    F  DED VK+ I YFIELAMMG ERKQ +DT+LLG++D W+ FCN DWS +IF +TI  LK A+  K   Y+++        ETYSLYGF
Subjt:  ELDRLFPNINFESDEDAVKMAIFYFIELAMMGSERKQQMDTSLLGIIDDWQRFCNEDWSKLIFYKTIKGLKKAVGGKEVSYKERTDGKQ---ETYSLYGF

Query:  PYAFQVWTYETASSLTGRVANRLNDNAIPRILRWSCSHSPTLAALSREVFSSDMARVTTELVASEKEIQFMDHVMQPPRA---PSPPPPP-----PPPPL
        PYAFQVW YET S+        L+D+AIPR+LRWSC +S     L+ EVF +  ++V   L+A++ + Q M  V+ PP     P PP  P     P PP 
Subjt:  PYAFQVWTYETASSLTGRVANRLNDNAIPRILRWSCSHSPTLAALSREVFSSDMARVTTELVASEKEIQFMDHVMQPPRA---PSPPPPP-----PPPPL

Query:  PPPTALEDIPDEDTVIEDLE--TKNTNEVVEGVGTYVTNDRICKRCKVLEDEIKVIKEDVKVIKEDVKVIKSIEKD----LKAIRKFMRRLSKGKFVDAS
         P  A    P  D  +  LE    + + V E   +    + + KR K  + + K I   +K +   V  I+    D    LK I+ ++++L+KGKF D+S
Subjt:  PPPTALEDIPDEDTVIEDLE--TKNTNEVVEGVGTYVTNDRICKRCKVLEDEIKVIKEDVKVIKEDVKVIKSIEKD----LKAIRKFMRRLSKGKFVDAS

Query:  KYIDPDDG-SDDGGAGSRPHSKGQDDDGGPAFGSQAKSNDD
        KY     G  DDG +  RP    + D G  +     +S++D
Subjt:  KYIDPDDG-SDDGGAGSRPHSKGQDDDGGPAFGSQAKSNDD

A0A6J1DL40 uncharacterized protein LOC1110221107.1e-3538.48Show/hide
Query:  MMGSERKQQMDTSLLGIIDDWQRFCNEDWSKLIFYKTIKGLKKAVGGKEVSYKERT---DGKQETYSLYGFPYAFQVWTYETASSLTGRVANRLNDNAIP
        MMG ERKQ+MDTSLLGI+D W+ FC+ D S +IF +T+  LK A+  K  +YK++        ETYSLYGFPYAFQVW YET S+L+ RVA RLND+AIP
Subjt:  MMGSERKQQMDTSLLGIIDDWQRFCNEDWSKLIFYKTIKGLKKAVGGKEVSYKERT---DGKQETYSLYGFPYAFQVWTYETASSLTGRVANRLNDNAIP

Query:  RILRWSCSHSPTLAALSREVFSSDMARVTTELVASEKEIQFMDHVMQPPRAPSPPPPP---PPPPL---------PPPTALEDIPDEDTVIEDLETKNTN
        R+LRWSC++S     L REVF +  ++V   L A++ E Q M  VM PP AP  PP P      PL         P  + + D+ + D V +D       
Subjt:  RILRWSCSHSPTLAALSREVFSSDMARVTTELVASEKEIQFMDHVMQPPRAPSPPPPP---PPPPL---------PPPTALEDIPDEDTVIEDLETKNTN

Query:  EVVEGVGTYVTNDRICKRCKVLEDEIKVIK----EDVKVIKEDVKVIKS----IEKDLKAIRKFMRRLSKGKFVDASKY-----IDPDDGSDDGGAGSRP
           + +GT    D++  + K  E + K  K     +++ + + V  I++    +  D+K I+KFM+RL+K      +KY     +   DGS  G   S  
Subjt:  EVVEGVGTYVTNDRICKRCKVLEDEIKVIK----EDVKVIKEDVKVIKS----IEKDLKAIRKFMRRLSKGKFVDASKY-----IDPDDGSDDGGAGSRP

Query:  HSKGQDDDGGPAFGSQAKSNDDTPMADHADPMDTTEQHGQVEE
        + +  D D  P  G + K+ D+  M +  DP  T E+   V E
Subjt:  HSKGQDDDGGPAFGSQAKSNDDTPMADHADPMDTTEQHGQVEE

A0A6J1DRZ7 uncharacterized protein LOC1110238472.9e-4456.89Show/hide
Query:  ELDRLFPNINFESDEDAVKMAIFYFIELAMMGSERKQQMDTSLLGIIDDWQRFCNEDWSKLIFYKTIKGLKKAVGGKEVSYKERT---DGKQETYSLYGF
        EL+++F    FE+DEDAVK+AI YFIELAMMG ERK +MDTSLLGI+D W+ FCN DWS +IF +T+  LK A+  K   YK++        ETYSLY F
Subjt:  ELDRLFPNINFESDEDAVKMAIFYFIELAMMGSERKQQMDTSLLGIIDDWQRFCNEDWSKLIFYKTIKGLKKAVGGKEVSYKERT---DGKQETYSLYGF

Query:  PYAFQVWTYETASSLTGRVANRLNDNAIPRILRWSCSHSPTLAALSREVFSSDMARVTTELVASEKE
        PYAFQVW YET S+L+ RVA RLND+AIPR+LRWSC++S     L REVF +  ++V   L A++ E
Subjt:  PYAFQVWTYETASSLTGRVANRLNDNAIPRILRWSCSHSPTLAALSREVFSSDMARVTTELVASEKE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G07240.1 cysteine-type peptidases;cysteine-type peptidases1.3e-0436.36Show/hide
Query:  DAVYMPYNIGGLYWVLVCIDFEVGEVVVSDSMVVLNKDEVVEKELRVLCQVLPAL
        D VYMP+N    +WV +C+D +  ++ + DS + L +D  +  EL+ L  +LP L
Subjt:  DAVYMPYNIGGLYWVLVCIDFEVGEVVVSDSMVVLNKDEVVEKELRVLCQVLPAL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGGGTTTGAACTAGATAGATTATTCCCTAACATTAATTTTGAGAGCGACGAGGATGCTGTGAAGATGGCCATATTTTATTTCATTGAGTTGGCTATGATGGGGAG
TGAGAGAAAGCAGCAGATGGACACTAGCCTGCTCGGCATTATTGATGATTGGCAAAGGTTTTGTAATGAGGATTGGAGTAAGCTCATTTTTTATAAGACCATCAAGGGAC
TTAAGAAGGCAGTAGGTGGGAAGGAAGTGTCCTATAAAGAGAGGACGGATGGCAAACAGGAAACGTACAGTCTGTATGGCTTCCCATACGCGTTTCAGGTATGGACATAC
GAGACTGCATCTTCGTTGACCGGGCGTGTAGCTAATCGCCTGAATGACAATGCCATTCCACGTATATTAAGATGGTCATGTAGCCACTCACCTACACTTGCAGCGCTGAG
TCGGGAGGTGTTTTCTTCAGATATGGCTAGGGTCACAACCGAACTTGTGGCCTCAGAAAAGGAGATCCAATTTATGGATCATGTGATGCAGCCACCTCGAGCACCATCTC
CACCTCCACCTCCACCTCCACCTCCACTTCCACCCCCAACAGCTTTGGAAGATATTCCAGATGAAGATACTGTCATTGAGGATCTCGAGACTAAGAATACAAATGAAGTG
GTGGAGGGTGTTGGGACGTATGTTACGAATGACAGAATCTGCAAGAGGTGCAAAGTCCTCGAAGATGAGATCAAAGTGATTAAAGAAGATGTGAAGGTGATTAAGGAGGA
TGTGAAGGTCATTAAGTCCATCGAAAAAGACCTCAAGGCGATAAGGAAGTTCATGCGTCGACTTTCGAAGGGAAAATTTGTTGACGCCAGCAAGTACATAGATCCGGATG
ACGGTTCGGACGATGGTGGTGCTGGATCCAGACCACATTCGAAAGGTCAGGATGATGATGGTGGTCCTGCATTCGGGTCACAAGCAAAATCAAATGACGACACCCCAATG
GCTGACCATGCGGATCCGATGGATACAACAGAACAACATGGTCAAGTCGAGGAAGTAAATGACTCGATAGAGGGTGTGGGGAAAGACATGCAGATGGATACAACAGAACA
AGAAGTCACTGAAATAGGAGAACATGTAGATGACCCGATAGAGGGTGTGGGAAAGGATATGTCTGTTGTCGAAAGTCAAAATTCGCTGGGTGTCCAGTCCATTTCTGAAC
AGAACGAGCCGATAGAAAGACAGAGGACTCGTGGTCGACTGGATCCTCATGTTCATCCAAAAGAAATGGGGAGAACGACCAGAGTTATGCCGCAGGAAGTTCACCACTGG
GGACCTGTGTGTAACCGCAACGTGTTGAAGTACGAAATGGGCAAGCTTGCAGACCATAACATACCATGGAACACGGTTGATGCGGTGTACATGCCGTATAACATCGGTGG
GCTGTATTGGGTCCTCGTATGCATTGACTTCGAAGTGGGGGAGGTCGTTGTATCGGATTCCATGGTGGTGTTGAATAAGGACGAGGTGGTTGAGAAGGAGTTAAGGGTCC
TTTGCCAAGTCCTGCCAGCTCTGCTTTGGAAGATCGGGGTCATGGATGTGAGGAAGGATCTATCCGTTCAAAGATGGCCTGTGCGTCAGCAATTGTCAAGGTCGCAGCAG
AAACGTAGTGGCGAC
mRNA sequenceShow/hide mRNA sequence
ATGGAAGGGTTTGAACTAGATAGATTATTCCCTAACATTAATTTTGAGAGCGACGAGGATGCTGTGAAGATGGCCATATTTTATTTCATTGAGTTGGCTATGATGGGGAG
TGAGAGAAAGCAGCAGATGGACACTAGCCTGCTCGGCATTATTGATGATTGGCAAAGGTTTTGTAATGAGGATTGGAGTAAGCTCATTTTTTATAAGACCATCAAGGGAC
TTAAGAAGGCAGTAGGTGGGAAGGAAGTGTCCTATAAAGAGAGGACGGATGGCAAACAGGAAACGTACAGTCTGTATGGCTTCCCATACGCGTTTCAGGTATGGACATAC
GAGACTGCATCTTCGTTGACCGGGCGTGTAGCTAATCGCCTGAATGACAATGCCATTCCACGTATATTAAGATGGTCATGTAGCCACTCACCTACACTTGCAGCGCTGAG
TCGGGAGGTGTTTTCTTCAGATATGGCTAGGGTCACAACCGAACTTGTGGCCTCAGAAAAGGAGATCCAATTTATGGATCATGTGATGCAGCCACCTCGAGCACCATCTC
CACCTCCACCTCCACCTCCACCTCCACTTCCACCCCCAACAGCTTTGGAAGATATTCCAGATGAAGATACTGTCATTGAGGATCTCGAGACTAAGAATACAAATGAAGTG
GTGGAGGGTGTTGGGACGTATGTTACGAATGACAGAATCTGCAAGAGGTGCAAAGTCCTCGAAGATGAGATCAAAGTGATTAAAGAAGATGTGAAGGTGATTAAGGAGGA
TGTGAAGGTCATTAAGTCCATCGAAAAAGACCTCAAGGCGATAAGGAAGTTCATGCGTCGACTTTCGAAGGGAAAATTTGTTGACGCCAGCAAGTACATAGATCCGGATG
ACGGTTCGGACGATGGTGGTGCTGGATCCAGACCACATTCGAAAGGTCAGGATGATGATGGTGGTCCTGCATTCGGGTCACAAGCAAAATCAAATGACGACACCCCAATG
GCTGACCATGCGGATCCGATGGATACAACAGAACAACATGGTCAAGTCGAGGAAGTAAATGACTCGATAGAGGGTGTGGGGAAAGACATGCAGATGGATACAACAGAACA
AGAAGTCACTGAAATAGGAGAACATGTAGATGACCCGATAGAGGGTGTGGGAAAGGATATGTCTGTTGTCGAAAGTCAAAATTCGCTGGGTGTCCAGTCCATTTCTGAAC
AGAACGAGCCGATAGAAAGACAGAGGACTCGTGGTCGACTGGATCCTCATGTTCATCCAAAAGAAATGGGGAGAACGACCAGAGTTATGCCGCAGGAAGTTCACCACTGG
GGACCTGTGTGTAACCGCAACGTGTTGAAGTACGAAATGGGCAAGCTTGCAGACCATAACATACCATGGAACACGGTTGATGCGGTGTACATGCCGTATAACATCGGTGG
GCTGTATTGGGTCCTCGTATGCATTGACTTCGAAGTGGGGGAGGTCGTTGTATCGGATTCCATGGTGGTGTTGAATAAGGACGAGGTGGTTGAGAAGGAGTTAAGGGTCC
TTTGCCAAGTCCTGCCAGCTCTGCTTTGGAAGATCGGGGTCATGGATGTGAGGAAGGATCTATCCGTTCAAAGATGGCCTGTGCGTCAGCAATTGTCAAGGTCGCAGCAG
AAACGTAGTGGCGAC
Protein sequenceShow/hide protein sequence
MEGFELDRLFPNINFESDEDAVKMAIFYFIELAMMGSERKQQMDTSLLGIIDDWQRFCNEDWSKLIFYKTIKGLKKAVGGKEVSYKERTDGKQETYSLYGFPYAFQVWTY
ETASSLTGRVANRLNDNAIPRILRWSCSHSPTLAALSREVFSSDMARVTTELVASEKEIQFMDHVMQPPRAPSPPPPPPPPPLPPPTALEDIPDEDTVIEDLETKNTNEV
VEGVGTYVTNDRICKRCKVLEDEIKVIKEDVKVIKEDVKVIKSIEKDLKAIRKFMRRLSKGKFVDASKYIDPDDGSDDGGAGSRPHSKGQDDDGGPAFGSQAKSNDDTPM
ADHADPMDTTEQHGQVEEVNDSIEGVGKDMQMDTTEQEVTEIGEHVDDPIEGVGKDMSVVESQNSLGVQSISEQNEPIERQRTRGRLDPHVHPKEMGRTTRVMPQEVHHW
GPVCNRNVLKYEMGKLADHNIPWNTVDAVYMPYNIGGLYWVLVCIDFEVGEVVVSDSMVVLNKDEVVEKELRVLCQVLPALLWKIGVMDVRKDLSVQRWPVRQQLSRSQQ
KRSGD