; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg011206 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg011206
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionUlp1-like peptidase
Genome locationscaffold5:7993994..7997871
RNA-Seq ExpressionSpg011206
SyntenySpg011206
Gene Ontology termsNA
InterPro domainsIPR015410 - Domain of unknown function DUF1985


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022146372.1 uncharacterized protein LOC111015600 [Momordica charantia]8.3e-3447.9Show/hide
Query:  VSFEILGEKVSFGRREFDLITGIRHRTQHVRGNVSSTRLRRLYLNDSISMKGFELDRLFPSINFESDEDAVKMVIFYFIELAMMGRERKQQMDTSLLGFI
        +SF++  ++VSFG+REFDLITG+ H+   V  ++   RLR  Y  DS+ +K  EL+++F    F  DED VK+ I YFIELAMMG+ERKQ +DT  +G +
Subjt:  VSFEILGEKVSFGRREFDLITGIRHRTQHVRGNVSSTRLRRLYLNDSISMKGFELDRLFPSINFESDEDAVKMVIFYFIELAMMGRERKQQMDTSLLGFI

Query:  DDWQRFCNEDWSKLIFDKTIKGLKKAVGGKAVSYKERTDGKQ---ETYSLYGFPYAFQVRWFRVIDS
        D W+ FCN DWS +IFD+TI  LK  +  K  +Y+++        ETYSLYGFPY  ++R  RV+ S
Subjt:  DDWQRFCNEDWSKLIFDKTIKGLKKAVGGKAVSYKERTDGKQ---ETYSLYGFPYAFQVRWFRVIDS

XP_022153201.1 uncharacterized protein LOC111020757 [Momordica charantia]7.5e-5137.19Show/hide
Query:  VSFEILGEKVSFGRREFDLITGIRHRTQHVRGNVSSTRLRRLYLNDSISMKGFELDRLFPSINFESDEDAVKMVIFYFIELAMMGRERKQQMDTSLLGFI
        +SF++ G++VSFG+REFDLITG+ HR   V  ++   RLR  Y  D + +K  EL+++F    F  DED VK+ I YFIELAMMG+ERKQ +DT+LLG +
Subjt:  VSFEILGEKVSFGRREFDLITGIRHRTQHVRGNVSSTRLRRLYLNDSISMKGFELDRLFPSINFESDEDAVKMVIFYFIELAMMGRERKQQMDTSLLGFI

Query:  DDWQRFCNEDWSKLIFDKTIKGLKKAVGGKAVSYKERTDGKQ---ETYSLYGFPYAFQVRWFRVIDSVYIRRIYCISNNILSLLTQVWTYETVSSLTGRV
        D W+ FCN DWS +IFD+TI  LK A+  K   Y+++        ETYSLYGFPYAF                            QVW YET+S+     
Subjt:  DDWQRFCNEDWSKLIFDKTIKGLKKAVGGKAVSYKERTDGKQ---ETYSLYGFPYAFQVRWFRVIDSVYIRRIYCISNNILSLLTQVWTYETVSSLTGRV

Query:  ANRLNDNAIPRILRWSCNHSPTLAALSREVFSSDMARVTTKLVASEEEIQFMDRVMQPPRA---PSPPPPP-----PPPPPPPP----PPPPAALGDILV
           L+D+AIPR+LRWSC +S     L+ EVF +  ++V   L+A++ + Q M RV+ PP     P PP  P     P PP  P     P PPA +    +
Subjt:  ANRLNDNAIPRILRWSCNHSPTLAALSREVFSSDMARVTTKLVASEEEIQFMDRVMQPPRA---PSPPPPP-----PPPPPPPP----PPPPAALGDILV

Query:  EDNIVEDLGTENPNEVA-EGVG------TSGTNDRVCKRCKVLEDEVKVIKDDVKEIKEDLKVIKSMEKDLKAIRKFMRRLSKGKFVDASKYIEPDDGTD
        ED +V+    +     A +G G       +    R+ +R K L++ V  I+D + +              LK I+ ++++L+KGKF D+SKY     G D
Subjt:  EDNIVEDLGTENPNEVA-EGVG------TSGTNDRVCKRCKVLEDEVKVIKDDVKEIKEDLKVIKSMEKDLKAIRKFMRRLSKGKFVDASKYIEPDDGTD

Query:  DGG-GGSRPHSKGQDDGGGPDPSGSQGKADDNTPMADHEDPMDTTEQHG
        D G    RP    + DGG       Q   +D     D E   + T  HG
Subjt:  DGG-GGSRPHSKGQDDGGGPDPSGSQGKADDNTPMADHEDPMDTTEQHG

XP_022157020.1 uncharacterized protein LOC111023847 [Momordica charantia]2.3e-5548.39Show/hide
Query:  VSFEILGEKVSFGRREFDLITGIRHRTQHVRGNVSSTRLRRLYLNDSISMKGFELDRLFPSINFESDEDAVKMVIFYFIELAMMGRERKQQMDTSLLGFI
        +SF + G +VSFG+REFDLITG+RH    V  +V + RLR LY  D  S+K  EL+++F    FE+DEDAVK+ I YFIELAMMG+ERK +MDTSLLG +
Subjt:  VSFEILGEKVSFGRREFDLITGIRHRTQHVRGNVSSTRLRRLYLNDSISMKGFELDRLFPSINFESDEDAVKMVIFYFIELAMMGRERKQQMDTSLLGFI

Query:  DDWQRFCNEDWSKLIFDKTIKGLKKAVGGKAVSYKERT---DGKQETYSLYGFPYAFQVRWFRVIDSVYIRRIYCISNNILSLLTQVWTYETVSSLTGRV
        D W+ FCN DWS +IF++T+  LK A+  K   YK++        ETYSLY FPYAF                            QVW YET+S+L+ RV
Subjt:  DDWQRFCNEDWSKLIFDKTIKGLKKAVGGKAVSYKERT---DGKQETYSLYGFPYAFQVRWFRVIDSVYIRRIYCISNNILSLLTQVWTYETVSSLTGRV

Query:  ANRLNDNAIPRILRWSCNHSPTLAALSREVFSSDMARVTTKLVASEEE
        A RLND+AIPR+LRWSC +S     L REVF +  ++V  +L A++ E
Subjt:  ANRLNDNAIPRILRWSCNHSPTLAALSREVFSSDMARVTTKLVASEEE

XP_022158673.1 uncharacterized protein LOC111025136 [Momordica charantia]2.1e-3736.55Show/hide
Query:  KDNSGERFLAVSFEILGEKVSFGRREFDLITGIRHRTQHVRGNVSSTRLRRLYLNDSISMKGFELDRLFPSINFESDEDAVKMVIFYFIELAMMGRERKQ
        ++    R   ++ ++LG +VSFG  EF LITG+++     R + S  RLR+LY +D + +   E +  +  + FE D DAVK+ +  F+EL + GR+R  
Subjt:  KDNSGERFLAVSFEILGEKVSFGRREFDLITGIRHRTQHVRGNVSSTRLRRLYLNDSISMKGFELDRLFPSINFESDEDAVKMVIFYFIELAMMGRERKQ

Query:  QMDTSLLGFIDDWQRFCNEDWSKLIFDKTIKGLKKAVGGKAVSYKERTDGKQETYSLYGFPYAFQVRWFRVIDSVYIRRIYCISNNILSLLTQVWTYETV
        ++D SLLG +DD +  CN  W+++ F+KTI+ LK     +A++ K R  G ++TYSLYGFP+AF                            QVW YET+
Subjt:  QMDTSLLGFIDDWQRFCNEDWSKLIFDKTIKGLKKAVGGKAVSYKERTDGKQETYSLYGFPYAFQVRWFRVIDSVYIRRIYCISNNILSLLTQVWTYETV

Query:  SSLTGRVANRLNDNAIPRILRWSCNHSPTLAALSREVF
        S LT RVA+ +  + +PRIL+W C +SP    + +E+F
Subjt:  SSLTGRVANRLNDNAIPRILRWSCNHSPTLAALSREVF

XP_022158744.1 uncharacterized protein LOC111025209 [Momordica charantia]3.3e-3840.71Show/hide
Query:  ILGEKVSFGRREFDLITGIRHRTQHVRGNVSSTRLRRLYLNDSISMKGFELDRLFPSINFESDEDAVKMVIFYFIELAMMGRERKQQMDTSLLGFIDDWQ
        +LG KVSFGRREFD+I+G+++    VR      R   LY N+S  +   EL++++ SI FE D DAVK V+ YF+EL ++GRER  + D  LLG +DDW+
Subjt:  ILGEKVSFGRREFDLITGIRHRTQHVRGNVSSTRLRRLYLNDSISMKGFELDRLFPSINFESDEDAVKMVIFYFIELAMMGRERKQQMDTSLLGFIDDWQ

Query:  RFCNEDWSKLIFDKTIKGLKKAVGGKAVSYKERTDGKQETYSLYGFPYAFQVRWFRVIDSVYIRRIYCISNNILSLLTQVWTYETVSSLTGRVANRLNDN
          CN DW+ L FDKTI  L+     +  S K +  G +++YSLYGFP+AF                            QVW YE +SSL+G +   ++ +
Subjt:  RFCNEDWSKLIFDKTIKGLKKAVGGKAVSYKERTDGKQETYSLYGFPYAFQVRWFRVIDSVYIRRIYCISNNILSLLTQVWTYETVSSLTGRVANRLNDN

Query:  AIPRILRWSCNHSPTLAALSREVFSS
         +PRIL+W   HS     L+RE+F S
Subjt:  AIPRILRWSCNHSPTLAALSREVFSS

TrEMBL top hitse value%identityAlignment
A0A6J1DJX9 uncharacterized protein LOC1110207573.6e-5137.19Show/hide
Query:  VSFEILGEKVSFGRREFDLITGIRHRTQHVRGNVSSTRLRRLYLNDSISMKGFELDRLFPSINFESDEDAVKMVIFYFIELAMMGRERKQQMDTSLLGFI
        +SF++ G++VSFG+REFDLITG+ HR   V  ++   RLR  Y  D + +K  EL+++F    F  DED VK+ I YFIELAMMG+ERKQ +DT+LLG +
Subjt:  VSFEILGEKVSFGRREFDLITGIRHRTQHVRGNVSSTRLRRLYLNDSISMKGFELDRLFPSINFESDEDAVKMVIFYFIELAMMGRERKQQMDTSLLGFI

Query:  DDWQRFCNEDWSKLIFDKTIKGLKKAVGGKAVSYKERTDGKQ---ETYSLYGFPYAFQVRWFRVIDSVYIRRIYCISNNILSLLTQVWTYETVSSLTGRV
        D W+ FCN DWS +IFD+TI  LK A+  K   Y+++        ETYSLYGFPYAF                            QVW YET+S+     
Subjt:  DDWQRFCNEDWSKLIFDKTIKGLKKAVGGKAVSYKERTDGKQ---ETYSLYGFPYAFQVRWFRVIDSVYIRRIYCISNNILSLLTQVWTYETVSSLTGRV

Query:  ANRLNDNAIPRILRWSCNHSPTLAALSREVFSSDMARVTTKLVASEEEIQFMDRVMQPPRA---PSPPPPP-----PPPPPPPP----PPPPAALGDILV
           L+D+AIPR+LRWSC +S     L+ EVF +  ++V   L+A++ + Q M RV+ PP     P PP  P     P PP  P     P PPA +    +
Subjt:  ANRLNDNAIPRILRWSCNHSPTLAALSREVFSSDMARVTTKLVASEEEIQFMDRVMQPPRA---PSPPPPP-----PPPPPPPP----PPPPAALGDILV

Query:  EDNIVEDLGTENPNEVA-EGVG------TSGTNDRVCKRCKVLEDEVKVIKDDVKEIKEDLKVIKSMEKDLKAIRKFMRRLSKGKFVDASKYIEPDDGTD
        ED +V+    +     A +G G       +    R+ +R K L++ V  I+D + +              LK I+ ++++L+KGKF D+SKY     G D
Subjt:  EDNIVEDLGTENPNEVA-EGVG------TSGTNDRVCKRCKVLEDEVKVIKDDVKEIKEDLKVIKSMEKDLKAIRKFMRRLSKGKFVDASKYIEPDDGTD

Query:  DGG-GGSRPHSKGQDDGGGPDPSGSQGKADDNTPMADHEDPMDTTEQHG
        D G    RP    + DGG       Q   +D     D E   + T  HG
Subjt:  DGG-GGSRPHSKGQDDGGGPDPSGSQGKADDNTPMADHEDPMDTTEQHG

A0A6J1DRZ7 uncharacterized protein LOC1110238471.1e-5548.39Show/hide
Query:  VSFEILGEKVSFGRREFDLITGIRHRTQHVRGNVSSTRLRRLYLNDSISMKGFELDRLFPSINFESDEDAVKMVIFYFIELAMMGRERKQQMDTSLLGFI
        +SF + G +VSFG+REFDLITG+RH    V  +V + RLR LY  D  S+K  EL+++F    FE+DEDAVK+ I YFIELAMMG+ERK +MDTSLLG +
Subjt:  VSFEILGEKVSFGRREFDLITGIRHRTQHVRGNVSSTRLRRLYLNDSISMKGFELDRLFPSINFESDEDAVKMVIFYFIELAMMGRERKQQMDTSLLGFI

Query:  DDWQRFCNEDWSKLIFDKTIKGLKKAVGGKAVSYKERT---DGKQETYSLYGFPYAFQVRWFRVIDSVYIRRIYCISNNILSLLTQVWTYETVSSLTGRV
        D W+ FCN DWS +IF++T+  LK A+  K   YK++        ETYSLY FPYAF                            QVW YET+S+L+ RV
Subjt:  DDWQRFCNEDWSKLIFDKTIKGLKKAVGGKAVSYKERT---DGKQETYSLYGFPYAFQVRWFRVIDSVYIRRIYCISNNILSLLTQVWTYETVSSLTGRV

Query:  ANRLNDNAIPRILRWSCNHSPTLAALSREVFSSDMARVTTKLVASEEE
        A RLND+AIPR+LRWSC +S     L REVF +  ++V  +L A++ E
Subjt:  ANRLNDNAIPRILRWSCNHSPTLAALSREVFSSDMARVTTKLVASEEE

A0A6J1DSS5 uncharacterized protein LOC1110239692.2e-3237.9Show/hide
Query:  FEILGEKVSFGRREFDLITGI-RHRTQHVRGNVSSTRLRRLYLNDSISMKGFELDRLFPSINFESDEDAVKMVIFYFIELAMMGRER-KQQMDTSLLGFI
        F+ILG  V+F + EF L+TG+ R   + ++  VS  RLRR Y  D + ++  E ++ +  I F +D+DAVK+ + Y+ E+ MMG+ + K  +D  L G +
Subjt:  FEILGEKVSFGRREFDLITGI-RHRTQHVRGNVSSTRLRRLYLNDSISMKGFELDRLFPSINFESDEDAVKMVIFYFIELAMMGRER-KQQMDTSLLGFI

Query:  DDWQRFCNEDWSKLIFDKTIKGLKKAVGGKAVSYKER--TDGK-QETYSLYGFPYAFQVRWFRVIDSVYIRRIYCISNNILSLLTQVWTYETVSSLTGRV
        +D   F N DW   I+ +T+KGL+ A+  K V+YK +  T+ K Q  YSL GFP AF                            QVW YE + SL    
Subjt:  DDWQRFCNEDWSKLIFDKTIKGLKKAVGGKAVSYKER--TDGK-QETYSLYGFPYAFQVRWFRVIDSVYIRRIYCISNNILSLLTQVWTYETVSSLTGRV

Query:  ANRLNDNAIPRILRWSCNHSPTLAALSREVFSSDMARVTTKLVASEEE
         NRL+D A+PRI R+SC+ S T   L R+VF+S    +T  LV SE E
Subjt:  ANRLNDNAIPRILRWSCNHSPTLAALSREVFSSDMARVTTKLVASEEE

A0A6J1DWS4 uncharacterized protein LOC1110251361.0e-3736.55Show/hide
Query:  KDNSGERFLAVSFEILGEKVSFGRREFDLITGIRHRTQHVRGNVSSTRLRRLYLNDSISMKGFELDRLFPSINFESDEDAVKMVIFYFIELAMMGRERKQ
        ++    R   ++ ++LG +VSFG  EF LITG+++     R + S  RLR+LY +D + +   E +  +  + FE D DAVK+ +  F+EL + GR+R  
Subjt:  KDNSGERFLAVSFEILGEKVSFGRREFDLITGIRHRTQHVRGNVSSTRLRRLYLNDSISMKGFELDRLFPSINFESDEDAVKMVIFYFIELAMMGRERKQ

Query:  QMDTSLLGFIDDWQRFCNEDWSKLIFDKTIKGLKKAVGGKAVSYKERTDGKQETYSLYGFPYAFQVRWFRVIDSVYIRRIYCISNNILSLLTQVWTYETV
        ++D SLLG +DD +  CN  W+++ F+KTI+ LK     +A++ K R  G ++TYSLYGFP+AF                            QVW YET+
Subjt:  QMDTSLLGFIDDWQRFCNEDWSKLIFDKTIKGLKKAVGGKAVSYKERTDGKQETYSLYGFPYAFQVRWFRVIDSVYIRRIYCISNNILSLLTQVWTYETV

Query:  SSLTGRVANRLNDNAIPRILRWSCNHSPTLAALSREVF
        S LT RVA+ +  + +PRIL+W C +SP    + +E+F
Subjt:  SSLTGRVANRLNDNAIPRILRWSCNHSPTLAALSREVF

A0A6J1E0A9 uncharacterized protein LOC1110252091.6e-3840.71Show/hide
Query:  ILGEKVSFGRREFDLITGIRHRTQHVRGNVSSTRLRRLYLNDSISMKGFELDRLFPSINFESDEDAVKMVIFYFIELAMMGRERKQQMDTSLLGFIDDWQ
        +LG KVSFGRREFD+I+G+++    VR      R   LY N+S  +   EL++++ SI FE D DAVK V+ YF+EL ++GRER  + D  LLG +DDW+
Subjt:  ILGEKVSFGRREFDLITGIRHRTQHVRGNVSSTRLRRLYLNDSISMKGFELDRLFPSINFESDEDAVKMVIFYFIELAMMGRERKQQMDTSLLGFIDDWQ

Query:  RFCNEDWSKLIFDKTIKGLKKAVGGKAVSYKERTDGKQETYSLYGFPYAFQVRWFRVIDSVYIRRIYCISNNILSLLTQVWTYETVSSLTGRVANRLNDN
          CN DW+ L FDKTI  L+     +  S K +  G +++YSLYGFP+AF                            QVW YE +SSL+G +   ++ +
Subjt:  RFCNEDWSKLIFDKTIKGLKKAVGGKAVSYKERTDGKQETYSLYGFPYAFQVRWFRVIDSVYIRRIYCISNNILSLLTQVWTYETVSSLTGRVANRLNDN

Query:  AIPRILRWSCNHSPTLAALSREVFSS
         +PRIL+W   HS     L+RE+F S
Subjt:  AIPRILRWSCNHSPTLAALSREVFSS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTCCATTGATTGCTTGCTTGCTTGCTTCAAGACCAGTCGATTGTTCAGTTCAACAGAAACCGAACGATCGAAACAATGGAATGTATTCCAGATTGCTGATTTCCCT
CTCGTTCTCCCTCAGATTTTTTCCTTCTCCTTGGTTAAGGCTTCATCGATTTCTTGCTTCTATTCGGTGTAAAGACAACTCAGGAGAACGATTTCTAGCTGTAAGTTTTG
AGATTCTGGGAGAGAAAGTTTCATTTGGTCGGAGGGAATTTGACCTTATTACTGGAATTAGGCATAGAACCCAACATGTTAGGGGTAATGTATCTAGTACTAGACTGAGA
AGACTGTACCTTAACGATAGCATCAGCATGAAAGGGTTTGAACTAGATAGATTATTCCCTTCCATTAATTTTGAGAGCGATGAGGATGCTGTGAAGATGGTCATATTTTA
TTTCATTGAGTTGGCTATGATGGGGAGGGAGAGAAAGCAGCAGATGGACACTAGCCTGCTCGGCTTTATTGATGATTGGCAAAGGTTTTGTAATGAGGATTGGAGTAAGT
TAATTTTTGATAAGACCATAAAGGGACTCAAGAAGGCTGTAGGTGGGAAGGCAGTGTCCTATAAAGAGAGGACGGATGGAAAACAGGAAACATACAGTCTATATGGCTTC
CCATACGCGTTTCAGGTACGTTGGTTTAGAGTGATTGATAGTGTATACATTCGCCGTATATATTGTATCTCTAACAATATACTTTCTTTACTTACACAGGTATGGACATA
CGAGACAGTATCTTCTTTGACCGGGCGTGTGGCTAATCGCTTGAATGACAATGCCATTCCACGCATATTAAGATGGTCATGTAACCACTCACCTACACTTGCAGCGCTGA
GTCGGGAGGTGTTTTCTTCAGATATGGCTCGGGTCACAACTAAACTTGTGGCCTCAGAAGAGGAGATCCAATTTATGGATCGTGTGATGCAGCCACCTCGAGCCCCATCT
CCACCGCCACCGCCACCTCCACCTCCACCTCCACCTCCACCTCCGCCCCCAGCAGCTTTGGGAGATATTCTAGTTGAAGATAATATCGTTGAGGATCTCGGGACTGAGAA
TCCAAATGAAGTGGCAGAGGGTGTTGGGACGTCTGGTACGAATGACAGAGTCTGCAAGAGGTGCAAAGTCCTCGAAGACGAGGTGAAGGTGATTAAAGACGATGTGAAGG
AGATTAAGGAGGATTTGAAGGTCATTAAGTCCATGGAAAAAGACTTGAAGGCGATAAGGAAGTTCATGCGTCGACTTTCGAAGGGTAAATTCGTCGACGCCAGTAAGTAC
ATAGAACCAGATGACGGTACAGACGATGGTGGTGGTGGATCTCGACCACATTCAAAAGGTCAGGATGATGGTGGTGGTCCTGATCCATCCGGGTCACAAGGAAAAGCAGA
TGACAACACCCCAATGGCTGACCATGAGGATCCGATGGATACAACAGAACAACATGGTGGTGCTGAGGAAGTAACTGAAATAGGAGAACATGAAGTAATTGAAATAGGCG
AACATGTAGAGGCCCCGATAGAGGGCGTGGGAAAGGATATTCTTGTTGTCGAAAGTCAACATTCTCTGGGTGTCCAATCCATTTCTGAACAGAACGAGCCGATAGAAAGA
CGGGGGACTCGTAAGAGGAAGACTGCATGGAAGTTGAGAAGTCCATGGAAAGACACACGGGAAGAACGTAAGAAACGCAAGGCTGTGAAGTACGATCCTCTTCCCCAGAT
CCCCCATGATCTGGATGCTCCATTCAAAAGATGGCTTGACACTGAGGATCCAGAAGATAATGTTCGGACAACTGCGTATGCTGTCCGAGATAAGACGTGGTTTCGTGATC
TTATCACTCCATCGAAATGGATGACCGATGAGGTATGA
mRNA sequenceShow/hide mRNA sequence
ATGATTCCATTGATTGCTTGCTTGCTTGCTTCAAGACCAGTCGATTGTTCAGTTCAACAGAAACCGAACGATCGAAACAATGGAATGTATTCCAGATTGCTGATTTCCCT
CTCGTTCTCCCTCAGATTTTTTCCTTCTCCTTGGTTAAGGCTTCATCGATTTCTTGCTTCTATTCGGTGTAAAGACAACTCAGGAGAACGATTTCTAGCTGTAAGTTTTG
AGATTCTGGGAGAGAAAGTTTCATTTGGTCGGAGGGAATTTGACCTTATTACTGGAATTAGGCATAGAACCCAACATGTTAGGGGTAATGTATCTAGTACTAGACTGAGA
AGACTGTACCTTAACGATAGCATCAGCATGAAAGGGTTTGAACTAGATAGATTATTCCCTTCCATTAATTTTGAGAGCGATGAGGATGCTGTGAAGATGGTCATATTTTA
TTTCATTGAGTTGGCTATGATGGGGAGGGAGAGAAAGCAGCAGATGGACACTAGCCTGCTCGGCTTTATTGATGATTGGCAAAGGTTTTGTAATGAGGATTGGAGTAAGT
TAATTTTTGATAAGACCATAAAGGGACTCAAGAAGGCTGTAGGTGGGAAGGCAGTGTCCTATAAAGAGAGGACGGATGGAAAACAGGAAACATACAGTCTATATGGCTTC
CCATACGCGTTTCAGGTACGTTGGTTTAGAGTGATTGATAGTGTATACATTCGCCGTATATATTGTATCTCTAACAATATACTTTCTTTACTTACACAGGTATGGACATA
CGAGACAGTATCTTCTTTGACCGGGCGTGTGGCTAATCGCTTGAATGACAATGCCATTCCACGCATATTAAGATGGTCATGTAACCACTCACCTACACTTGCAGCGCTGA
GTCGGGAGGTGTTTTCTTCAGATATGGCTCGGGTCACAACTAAACTTGTGGCCTCAGAAGAGGAGATCCAATTTATGGATCGTGTGATGCAGCCACCTCGAGCCCCATCT
CCACCGCCACCGCCACCTCCACCTCCACCTCCACCTCCACCTCCGCCCCCAGCAGCTTTGGGAGATATTCTAGTTGAAGATAATATCGTTGAGGATCTCGGGACTGAGAA
TCCAAATGAAGTGGCAGAGGGTGTTGGGACGTCTGGTACGAATGACAGAGTCTGCAAGAGGTGCAAAGTCCTCGAAGACGAGGTGAAGGTGATTAAAGACGATGTGAAGG
AGATTAAGGAGGATTTGAAGGTCATTAAGTCCATGGAAAAAGACTTGAAGGCGATAAGGAAGTTCATGCGTCGACTTTCGAAGGGTAAATTCGTCGACGCCAGTAAGTAC
ATAGAACCAGATGACGGTACAGACGATGGTGGTGGTGGATCTCGACCACATTCAAAAGGTCAGGATGATGGTGGTGGTCCTGATCCATCCGGGTCACAAGGAAAAGCAGA
TGACAACACCCCAATGGCTGACCATGAGGATCCGATGGATACAACAGAACAACATGGTGGTGCTGAGGAAGTAACTGAAATAGGAGAACATGAAGTAATTGAAATAGGCG
AACATGTAGAGGCCCCGATAGAGGGCGTGGGAAAGGATATTCTTGTTGTCGAAAGTCAACATTCTCTGGGTGTCCAATCCATTTCTGAACAGAACGAGCCGATAGAAAGA
CGGGGGACTCGTAAGAGGAAGACTGCATGGAAGTTGAGAAGTCCATGGAAAGACACACGGGAAGAACGTAAGAAACGCAAGGCTGTGAAGTACGATCCTCTTCCCCAGAT
CCCCCATGATCTGGATGCTCCATTCAAAAGATGGCTTGACACTGAGGATCCAGAAGATAATGTTCGGACAACTGCGTATGCTGTCCGAGATAAGACGTGGTTTCGTGATC
TTATCACTCCATCGAAATGGATGACCGATGAGGTATGA
Protein sequenceShow/hide protein sequence
MIPLIACLLASRPVDCSVQQKPNDRNNGMYSRLLISLSFSLRFFPSPWLRLHRFLASIRCKDNSGERFLAVSFEILGEKVSFGRREFDLITGIRHRTQHVRGNVSSTRLR
RLYLNDSISMKGFELDRLFPSINFESDEDAVKMVIFYFIELAMMGRERKQQMDTSLLGFIDDWQRFCNEDWSKLIFDKTIKGLKKAVGGKAVSYKERTDGKQETYSLYGF
PYAFQVRWFRVIDSVYIRRIYCISNNILSLLTQVWTYETVSSLTGRVANRLNDNAIPRILRWSCNHSPTLAALSREVFSSDMARVTTKLVASEEEIQFMDRVMQPPRAPS
PPPPPPPPPPPPPPPPPAALGDILVEDNIVEDLGTENPNEVAEGVGTSGTNDRVCKRCKVLEDEVKVIKDDVKEIKEDLKVIKSMEKDLKAIRKFMRRLSKGKFVDASKY
IEPDDGTDDGGGGSRPHSKGQDDGGGPDPSGSQGKADDNTPMADHEDPMDTTEQHGGAEEVTEIGEHEVIEIGEHVEAPIEGVGKDILVVESQHSLGVQSISEQNEPIER
RGTRKRKTAWKLRSPWKDTREERKKRKAVKYDPLPQIPHDLDAPFKRWLDTEDPEDNVRTTAYAVRDKTWFRDLITPSKWMTDEV