; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0024364 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0024364
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionUlp1-like peptidase
Genome locationchr10:2479320..2481291
RNA-Seq ExpressionLag0024364
SyntenyLag0024364
Gene Ontology termsNA
InterPro domainsIPR015410 - Domain of unknown function DUF1985


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022153201.1 uncharacterized protein LOC111020757 [Momordica charantia]3.3e-6740.97Show/hide
Query:  FWHFLDTSLMFNGQLIHYFLLREVNEPRIDVISFEILGEKVSFGRREFDLITGIRHRTQHVRGNVSSTRLRRLYLNDSISMKGFELDRLFPSINFESDED
        F   LD  ++FNG LIH+ LLREV EPR DVISF++ G++VSFG+REFDLITG+ HR   V  ++   RLR  Y  D + +K  EL+++F    F  DED
Subjt:  FWHFLDTSLMFNGQLIHYFLLREVNEPRIDVISFEILGEKVSFGRREFDLITGIRHRTQHVRGNVSSTRLRRLYLNDSISMKGFELDRLFPSINFESDED

Query:  AVKMAIFYFIELAMMGRERKQQMDTSLLGFIDDWQRFCNEDWSKLIFDKTIKGLKKAVGGKAVSYKERTDGKQ---ETYSLYGFPYAFQVWTYETVSSLT
         VK+ I YFIELAMMG+ERKQ +DT+LLG +D W+ FCN DWS +IFD+TI  LK A+  K   Y+++        ETYSLYGFPYAFQVW YET+S+  
Subjt:  AVKMAIFYFIELAMMGRERKQQMDTSLLGFIDDWQRFCNEDWSKLIFDKTIKGLKKAVGGKAVSYKERTDGKQ---ETYSLYGFPYAFQVWTYETVSSLT

Query:  GRVANRLNDNAIPRILRWSCNHSPTLAALSREVFSSDMARVTTELVASEEEIQFMDRVMQPPRA---PSPP--------PPPPPPP-----PPPPPPAAL
              L+D+AIPR+LRWSC +S     L+ EVF +  ++V   L+A++ + Q M RV+ PP     P PP        P PP  P     P PP    +
Subjt:  GRVANRLNDNAIPRILRWSCNHSPTLAALSREVFSSDMARVTTELVASEEEIQFMDRVMQPPRA---PSPP--------PPPPPPP-----PPPPPPAAL

Query:  GDILVEDNIVEDLGTENPNEVA-EGVG------TSGTNDRVCKRCKVLEDEVKVIKDDVKEIKEDLKVIKSMEKDLKAIRKFMRRLSKGKFVDASKYIEP
        G +  ED +V+    +     A +G G       +    R+ +R K L++ V  I+D + +              LK I+ ++++L+KGKF D+SKY   
Subjt:  GDILVEDNIVEDLGTENPNEVA-EGVG------TSGTNDRVCKRCKVLEDEVKVIKDDVKEIKEDLKVIKSMEKDLKAIRKFMRRLSKGKFVDASKYIEP

Query:  DDGTDDGG-GGSRPHSKGQDDGGGPDPSGSQGKADDNTPMADHEDPMDTTEQHG
          G DD G    RP    + DGG       Q   +D     D E   + T  HG
Subjt:  DDGTDDGG-GGSRPHSKGQDDGGGPDPSGSQGKADDNTPMADHEDPMDTTEQHG

XP_022157020.1 uncharacterized protein LOC111023847 [Momordica charantia]2.7e-6954.58Show/hide
Query:  FWHFLDTSLMFNGQLIHYFLLREVNEPRIDVISFEILGEKVSFGRREFDLITGIRHRTQHVRGNVSSTRLRRLYLNDSISMKGFELDRLFPSINFESDED
        F   L  +++FNG L+H+ LLREV EP+ D+ISF + G +VSFG+REFDLITG+RH    V  +V + RLR LY  D  S+K  EL+++F    FE+DED
Subjt:  FWHFLDTSLMFNGQLIHYFLLREVNEPRIDVISFEILGEKVSFGRREFDLITGIRHRTQHVRGNVSSTRLRRLYLNDSISMKGFELDRLFPSINFESDED

Query:  AVKMAIFYFIELAMMGRERKQQMDTSLLGFIDDWQRFCNEDWSKLIFDKTIKGLKKAVGGKAVSYKERT---DGKQETYSLYGFPYAFQVWTYETVSSLT
        AVK+AI YFIELAMMG+ERK +MDTSLLG +D W+ FCN DWS +IF++T+  LK A+  K   YK++        ETYSLY FPYAFQVW YET+S+L+
Subjt:  AVKMAIFYFIELAMMGRERKQQMDTSLLGFIDDWQRFCNEDWSKLIFDKTIKGLKKAVGGKAVSYKERT---DGKQETYSLYGFPYAFQVWTYETVSSLT

Query:  GRVANRLNDNAIPRILRWSCNHSPTLAALSREVFSSDMARVTTELVASEEE
         RVA RLND+AIPR+LRWSC +S     L REVF +  ++V   L A++ E
Subjt:  GRVANRLNDNAIPRILRWSCNHSPTLAALSREVFSSDMARVTTELVASEEE

XP_022157199.1 uncharacterized protein LOC111023969 [Momordica charantia]1.1e-4643.7Show/hide
Query:  MFWHFLDTSLMFNGQLIHYFLLREVNEPRIDVISFEILGEKVSFGRREFDLITGI-RHRTQHVRGNVSSTRLRRLYLNDSISMKGFELDRLFPSINFESD
        +F  F+D  +MF   L+HYFLLREV + R DV+ F+ILG  V+F + EF L+TG+ R   + ++  VS  RLRR Y  D + ++  E ++ +  I F +D
Subjt:  MFWHFLDTSLMFNGQLIHYFLLREVNEPRIDVISFEILGEKVSFGRREFDLITGI-RHRTQHVRGNVSSTRLRRLYLNDSISMKGFELDRLFPSINFESD

Query:  EDAVKMAIFYFIELAMMGRER-KQQMDTSLLGFIDDWQRFCNEDWSKLIFDKTIKGLKKAVGGKAVSYKER--TDGK-QETYSLYGFPYAFQVWTYETVS
        +DAVK+++ Y+ E+ MMG+ + K  +D  L G ++D   F N DW   I+ +T+KGL+ A+  K V+YK +  T+ K Q  YSL GFP AFQVW YE + 
Subjt:  EDAVKMAIFYFIELAMMGRER-KQQMDTSLLGFIDDWQRFCNEDWSKLIFDKTIKGLKKAVGGKAVSYKER--TDGK-QETYSLYGFPYAFQVWTYETVS

Query:  SLTGRVANRLNDNAIPRILRWSCNHSPTLAALSREVFSSDMARVTTELVASEEE
        SL     NRL+D A+PRI R+SC+ S T   L R+VF+S    +T  LV SE E
Subjt:  SLTGRVANRLNDNAIPRILRWSCNHSPTLAALSREVFSSDMARVTTELVASEEE

XP_022158673.1 uncharacterized protein LOC111025136 [Momordica charantia]1.1e-4644.34Show/hide
Query:  LLREVNEPRIDVISFEILGEKVSFGRREFDLITGIRHRTQHVRGNVSSTRLRRLYLNDSISMKGFELDRLFPSINFESDEDAVKMAIFYFIELAMMGRER
        LLRE+   R+DVI+ ++LG +VSFG  EF LITG+++     R + S  RLR+LY +D + +   E +  +  + FE D DAVK+++  F+EL + GR+R
Subjt:  LLREVNEPRIDVISFEILGEKVSFGRREFDLITGIRHRTQHVRGNVSSTRLRRLYLNDSISMKGFELDRLFPSINFESDEDAVKMAIFYFIELAMMGRER

Query:  KQQMDTSLLGFIDDWQRFCNEDWSKLIFDKTIKGLKKAVGGKAVSYKERTDGKQETYSLYGFPYAFQVWTYETVSSLTGRVANRLNDNAIPRILRWSCNH
          ++D SLLG +DD +  CN  W+++ F+KTI+ LK     +A++ K R  G ++TYSLYGFP+AFQVW YET+S LT RVA+ +  + +PRIL+W C +
Subjt:  KQQMDTSLLGFIDDWQRFCNEDWSKLIFDKTIKGLKKAVGGKAVSYKERTDGKQETYSLYGFPYAFQVWTYETVSSLTGRVANRLNDNAIPRILRWSCNH

Query:  SPTLAALSREVF
        SP    + +E+F
Subjt:  SPTLAALSREVF

XP_022158744.1 uncharacterized protein LOC111025209 [Momordica charantia]3.9e-4442.74Show/hide
Query:  MFWHFLDTSLMFNGQLIHYFLLREVNEPRIDVISFEILGEKVSFGRREFDLITGIRHRTQHVRGNVSSTRLRRLYLNDSISMKGFELDRLFPSINFESDE
        +F H LD  L+FNG L                     LG KVSFGRREFD+I+G+++    VR      R   LY N+S  +   EL++++ SI FE D 
Subjt:  MFWHFLDTSLMFNGQLIHYFLLREVNEPRIDVISFEILGEKVSFGRREFDLITGIRHRTQHVRGNVSSTRLRRLYLNDSISMKGFELDRLFPSINFESDE

Query:  DAVKMAIFYFIELAMMGRERKQQMDTSLLGFIDDWQRFCNEDWSKLIFDKTIKGLKKAVGGKAVSYKERTDGKQETYSLYGFPYAFQVWTYETVSSLTGR
        DAVK+ + YF+EL ++GRER  + D  LLG +DDW+  CN DW+ L FDKTI  L+     +  S K +  G +++YSLYGFP+AFQVW YE +SSL+G 
Subjt:  DAVKMAIFYFIELAMMGRERKQQMDTSLLGFIDDWQRFCNEDWSKLIFDKTIKGLKKAVGGKAVSYKERTDGKQETYSLYGFPYAFQVWTYETVSSLTGR

Query:  VANRLNDNAIPRILRWSCNHSPTLAALSREVFSS
        +   ++ + +PRIL+W   HS     L+RE+F S
Subjt:  VANRLNDNAIPRILRWSCNHSPTLAALSREVFSS

TrEMBL top hitse value%identityAlignment
A0A6J1DJX9 uncharacterized protein LOC1110207571.6e-6740.97Show/hide
Query:  FWHFLDTSLMFNGQLIHYFLLREVNEPRIDVISFEILGEKVSFGRREFDLITGIRHRTQHVRGNVSSTRLRRLYLNDSISMKGFELDRLFPSINFESDED
        F   LD  ++FNG LIH+ LLREV EPR DVISF++ G++VSFG+REFDLITG+ HR   V  ++   RLR  Y  D + +K  EL+++F    F  DED
Subjt:  FWHFLDTSLMFNGQLIHYFLLREVNEPRIDVISFEILGEKVSFGRREFDLITGIRHRTQHVRGNVSSTRLRRLYLNDSISMKGFELDRLFPSINFESDED

Query:  AVKMAIFYFIELAMMGRERKQQMDTSLLGFIDDWQRFCNEDWSKLIFDKTIKGLKKAVGGKAVSYKERTDGKQ---ETYSLYGFPYAFQVWTYETVSSLT
         VK+ I YFIELAMMG+ERKQ +DT+LLG +D W+ FCN DWS +IFD+TI  LK A+  K   Y+++        ETYSLYGFPYAFQVW YET+S+  
Subjt:  AVKMAIFYFIELAMMGRERKQQMDTSLLGFIDDWQRFCNEDWSKLIFDKTIKGLKKAVGGKAVSYKERTDGKQ---ETYSLYGFPYAFQVWTYETVSSLT

Query:  GRVANRLNDNAIPRILRWSCNHSPTLAALSREVFSSDMARVTTELVASEEEIQFMDRVMQPPRA---PSPP--------PPPPPPP-----PPPPPPAAL
              L+D+AIPR+LRWSC +S     L+ EVF +  ++V   L+A++ + Q M RV+ PP     P PP        P PP  P     P PP    +
Subjt:  GRVANRLNDNAIPRILRWSCNHSPTLAALSREVFSSDMARVTTELVASEEEIQFMDRVMQPPRA---PSPP--------PPPPPPP-----PPPPPPAAL

Query:  GDILVEDNIVEDLGTENPNEVA-EGVG------TSGTNDRVCKRCKVLEDEVKVIKDDVKEIKEDLKVIKSMEKDLKAIRKFMRRLSKGKFVDASKYIEP
        G +  ED +V+    +     A +G G       +    R+ +R K L++ V  I+D + +              LK I+ ++++L+KGKF D+SKY   
Subjt:  GDILVEDNIVEDLGTENPNEVA-EGVG------TSGTNDRVCKRCKVLEDEVKVIKDDVKEIKEDLKVIKSMEKDLKAIRKFMRRLSKGKFVDASKYIEP

Query:  DDGTDDGG-GGSRPHSKGQDDGGGPDPSGSQGKADDNTPMADHEDPMDTTEQHG
          G DD G    RP    + DGG       Q   +D     D E   + T  HG
Subjt:  DDGTDDGG-GGSRPHSKGQDDGGGPDPSGSQGKADDNTPMADHEDPMDTTEQHG

A0A6J1DP34 uncharacterized protein LOC1110218021.4e-4234.92Show/hide
Query:  FWHFLDTSLMFNGQLIHYFLLREVNEPRIDVISFEILGEKVSFGRREFDLITGIRHRTQHVRGNVSSTRLRRLYLNDSISMKGFELDRLFPSINFESDED
        F H LD  L+FNG LIH  LLREV E   + ISF +   ++SF R +F LI+G+++    VR N    RL  LY ND   +   + ++++ +  FE D D
Subjt:  FWHFLDTSLMFNGQLIHYFLLREVNEPRIDVISFEILGEKVSFGRREFDLITGIRHRTQHVRGNVSSTRLRRLYLNDSISMKGFELDRLFPSINFESDED

Query:  AVKMAIFYFIELAMMGRERKQQMDTSLLGFIDDWQRFCNEDWSKLIFDKTIKGLKKAVGGKAVSYKERTDGK-QETYSLYGFPYAFQVWTYETVSSLTGR
         VK+ I Y + + ++GRER  + D +LLG +DDW+  CN +W+ L F+KTI  L++         K   DGK +++YSLYGFP+ FQVW Y+T+SSL+ R
Subjt:  AVKMAIFYFIELAMMGRERKQQMDTSLLGFIDDWQRFCNEDWSKLIFDKTIKGLKKAVGGKAVSYKERTDGK-QETYSLYGFPYAFQVWTYETVSSLTGR

Query:  VANRLNDNAIPRILRWSCNHSPTLAALSREVFSSDMARVTTELVASEEEIQFMDRVMQPPRAPSPPPPPPPPPPPPPPPAALGDILVEDNIVEDLGTENP
        VAN++  + +P I +W  +HS     L R++F S   R  T L  ++ E  F++R   PP +              P     G    +++   D+     
Subjt:  VANRLNDNAIPRILRWSCNHSPTLAALSREVFSSDMARVTTELVASEEEIQFMDRVMQPPRAPSPPPPPPPPPPPPPPPAALGDILVEDNIVEDLGTENP

Query:  NEVAEGVGTSGTNDRVCKRCKVLEDEVKVIKDDVKEIKEDLKVIKSMEKDLKAIRKFM
        +   E   T G N +VC     L+   K +K   K + E +     +E +LK+I+KF+
Subjt:  NEVAEGVGTSGTNDRVCKRCKVLEDEVKVIKDDVKEIKEDLKVIKSMEKDLKAIRKFM

A0A6J1DRZ7 uncharacterized protein LOC1110238471.3e-6954.58Show/hide
Query:  FWHFLDTSLMFNGQLIHYFLLREVNEPRIDVISFEILGEKVSFGRREFDLITGIRHRTQHVRGNVSSTRLRRLYLNDSISMKGFELDRLFPSINFESDED
        F   L  +++FNG L+H+ LLREV EP+ D+ISF + G +VSFG+REFDLITG+RH    V  +V + RLR LY  D  S+K  EL+++F    FE+DED
Subjt:  FWHFLDTSLMFNGQLIHYFLLREVNEPRIDVISFEILGEKVSFGRREFDLITGIRHRTQHVRGNVSSTRLRRLYLNDSISMKGFELDRLFPSINFESDED

Query:  AVKMAIFYFIELAMMGRERKQQMDTSLLGFIDDWQRFCNEDWSKLIFDKTIKGLKKAVGGKAVSYKERT---DGKQETYSLYGFPYAFQVWTYETVSSLT
        AVK+AI YFIELAMMG+ERK +MDTSLLG +D W+ FCN DWS +IF++T+  LK A+  K   YK++        ETYSLY FPYAFQVW YET+S+L+
Subjt:  AVKMAIFYFIELAMMGRERKQQMDTSLLGFIDDWQRFCNEDWSKLIFDKTIKGLKKAVGGKAVSYKERT---DGKQETYSLYGFPYAFQVWTYETVSSLT

Query:  GRVANRLNDNAIPRILRWSCNHSPTLAALSREVFSSDMARVTTELVASEEE
         RVA RLND+AIPR+LRWSC +S     L REVF +  ++V   L A++ E
Subjt:  GRVANRLNDNAIPRILRWSCNHSPTLAALSREVFSSDMARVTTELVASEEE

A0A6J1DSS5 uncharacterized protein LOC1110239695.4e-4743.7Show/hide
Query:  MFWHFLDTSLMFNGQLIHYFLLREVNEPRIDVISFEILGEKVSFGRREFDLITGI-RHRTQHVRGNVSSTRLRRLYLNDSISMKGFELDRLFPSINFESD
        +F  F+D  +MF   L+HYFLLREV + R DV+ F+ILG  V+F + EF L+TG+ R   + ++  VS  RLRR Y  D + ++  E ++ +  I F +D
Subjt:  MFWHFLDTSLMFNGQLIHYFLLREVNEPRIDVISFEILGEKVSFGRREFDLITGI-RHRTQHVRGNVSSTRLRRLYLNDSISMKGFELDRLFPSINFESD

Query:  EDAVKMAIFYFIELAMMGRER-KQQMDTSLLGFIDDWQRFCNEDWSKLIFDKTIKGLKKAVGGKAVSYKER--TDGK-QETYSLYGFPYAFQVWTYETVS
        +DAVK+++ Y+ E+ MMG+ + K  +D  L G ++D   F N DW   I+ +T+KGL+ A+  K V+YK +  T+ K Q  YSL GFP AFQVW YE + 
Subjt:  EDAVKMAIFYFIELAMMGRER-KQQMDTSLLGFIDDWQRFCNEDWSKLIFDKTIKGLKKAVGGKAVSYKER--TDGK-QETYSLYGFPYAFQVWTYETVS

Query:  SLTGRVANRLNDNAIPRILRWSCNHSPTLAALSREVFSSDMARVTTELVASEEE
        SL     NRL+D A+PRI R+SC+ S T   L R+VF+S    +T  LV SE E
Subjt:  SLTGRVANRLNDNAIPRILRWSCNHSPTLAALSREVFSSDMARVTTELVASEEE

A0A6J1DWS4 uncharacterized protein LOC1110251365.4e-4744.34Show/hide
Query:  LLREVNEPRIDVISFEILGEKVSFGRREFDLITGIRHRTQHVRGNVSSTRLRRLYLNDSISMKGFELDRLFPSINFESDEDAVKMAIFYFIELAMMGRER
        LLRE+   R+DVI+ ++LG +VSFG  EF LITG+++     R + S  RLR+LY +D + +   E +  +  + FE D DAVK+++  F+EL + GR+R
Subjt:  LLREVNEPRIDVISFEILGEKVSFGRREFDLITGIRHRTQHVRGNVSSTRLRRLYLNDSISMKGFELDRLFPSINFESDEDAVKMAIFYFIELAMMGRER

Query:  KQQMDTSLLGFIDDWQRFCNEDWSKLIFDKTIKGLKKAVGGKAVSYKERTDGKQETYSLYGFPYAFQVWTYETVSSLTGRVANRLNDNAIPRILRWSCNH
          ++D SLLG +DD +  CN  W+++ F+KTI+ LK     +A++ K R  G ++TYSLYGFP+AFQVW YET+S LT RVA+ +  + +PRIL+W C +
Subjt:  KQQMDTSLLGFIDDWQRFCNEDWSKLIFDKTIKGLKKAVGGKAVSYKERTDGKQETYSLYGFPYAFQVWTYETVSSLTGRVANRLNDNAIPRILRWSCNH

Query:  SPTLAALSREVF
        SP    + +E+F
Subjt:  SPTLAALSREVF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTTGGCATTTCTTAGATACGTCCTTGATGTTTAATGGACAACTTATTCACTATTTTCTTTTGAGGGAAGTGAATGAGCCTAGGATTGATGTTATTAGTTTTGAGAT
TCTGGGAGAGAAAGTTTCATTTGGTCGGAGGGAATTTGACCTTATTACTGGAATTAGGCATAGAACCCAACATGTTAGGGGTAATGTATCTAGTACTAGACTGAGAAGAC
TGTACCTTAACGATAGCATCAGCATGAAAGGGTTTGAACTAGATAGATTATTCCCTTCCATTAATTTTGAGAGCGATGAGGATGCTGTGAAGATGGCCATATTTTATTTC
ATTGAGTTGGCTATGATGGGGAGGGAGAGAAAGCAGCAGATGGACACTAGCCTGCTCGGCTTTATTGATGATTGGCAAAGGTTTTGTAATGAGGATTGGAGTAAGTTAAT
TTTTGATAAGACCATAAAGGGACTCAAGAAGGCTGTAGGTGGGAAGGCAGTGTCCTATAAAGAGAGGACGGATGGAAAACAGGAAACATACAGTCTATATGGCTTCCCAT
ACGCGTTTCAGGTATGGACATACGAGACAGTATCTTCTTTGACCGGGCGTGTGGCTAATCGCTTGAATGACAATGCCATTCCACGCATATTAAGATGGTCATGTAACCAC
TCACCTACACTTGCAGCGCTGAGTCGGGAGGTGTTTTCTTCAGATATGGCTCGGGTCACAACTGAACTTGTGGCCTCAGAAGAGGAGATCCAATTTATGGATCGTGTGAT
GCAGCCACCTCGAGCCCCATCTCCACCGCCACCGCCACCTCCACCTCCACCTCCACCTCCGCCCCCAGCAGCTTTGGGAGATATTCTAGTTGAAGATAATATCGTTGAGG
ATCTCGGGACTGAGAATCCAAATGAAGTGGCAGAGGGTGTTGGGACGTCTGGTACGAATGACAGAGTCTGCAAGAGGTGCAAAGTCCTCGAAGACGAGGTGAAGGTGATT
AAAGACGATGTGAAGGAGATTAAGGAGGATTTGAAGGTCATTAAGTCCATGGAAAAAGACCTGAAGGCGATAAGGAAGTTCATGCGTCGACTTTCGAAGGGTAAATTCGT
CGACGCCAGTAAGTACATAGAACCAGATGACGGTACAGACGATGGTGGTGGTGGATCTCGACCACATTCAAAAGGTCAGGATGATGGTGGTGGTCCTGATCCATCCGGGT
CACAAGGAAAAGCAGATGACAACACCCCAATGGCTGACCATGAGGATCCGATGGATACAACAGAACAACATGGTGGTGCTGAGGAAGTAACTGAAATAGGAGAACATACG
TGTAATGGTATGGTTGATGATTTGGATCCATATACGAAAAGTACTGCACTCCCATATATGGAGTATTTGGTACAGTATATGGAATGGGATATGGACGAAACCCTTGGTGA
CTGGACTGGGGGATGA
mRNA sequenceShow/hide mRNA sequence
ATGTTTTGGCATTTCTTAGATACGTCCTTGATGTTTAATGGACAACTTATTCACTATTTTCTTTTGAGGGAAGTGAATGAGCCTAGGATTGATGTTATTAGTTTTGAGAT
TCTGGGAGAGAAAGTTTCATTTGGTCGGAGGGAATTTGACCTTATTACTGGAATTAGGCATAGAACCCAACATGTTAGGGGTAATGTATCTAGTACTAGACTGAGAAGAC
TGTACCTTAACGATAGCATCAGCATGAAAGGGTTTGAACTAGATAGATTATTCCCTTCCATTAATTTTGAGAGCGATGAGGATGCTGTGAAGATGGCCATATTTTATTTC
ATTGAGTTGGCTATGATGGGGAGGGAGAGAAAGCAGCAGATGGACACTAGCCTGCTCGGCTTTATTGATGATTGGCAAAGGTTTTGTAATGAGGATTGGAGTAAGTTAAT
TTTTGATAAGACCATAAAGGGACTCAAGAAGGCTGTAGGTGGGAAGGCAGTGTCCTATAAAGAGAGGACGGATGGAAAACAGGAAACATACAGTCTATATGGCTTCCCAT
ACGCGTTTCAGGTATGGACATACGAGACAGTATCTTCTTTGACCGGGCGTGTGGCTAATCGCTTGAATGACAATGCCATTCCACGCATATTAAGATGGTCATGTAACCAC
TCACCTACACTTGCAGCGCTGAGTCGGGAGGTGTTTTCTTCAGATATGGCTCGGGTCACAACTGAACTTGTGGCCTCAGAAGAGGAGATCCAATTTATGGATCGTGTGAT
GCAGCCACCTCGAGCCCCATCTCCACCGCCACCGCCACCTCCACCTCCACCTCCACCTCCGCCCCCAGCAGCTTTGGGAGATATTCTAGTTGAAGATAATATCGTTGAGG
ATCTCGGGACTGAGAATCCAAATGAAGTGGCAGAGGGTGTTGGGACGTCTGGTACGAATGACAGAGTCTGCAAGAGGTGCAAAGTCCTCGAAGACGAGGTGAAGGTGATT
AAAGACGATGTGAAGGAGATTAAGGAGGATTTGAAGGTCATTAAGTCCATGGAAAAAGACCTGAAGGCGATAAGGAAGTTCATGCGTCGACTTTCGAAGGGTAAATTCGT
CGACGCCAGTAAGTACATAGAACCAGATGACGGTACAGACGATGGTGGTGGTGGATCTCGACCACATTCAAAAGGTCAGGATGATGGTGGTGGTCCTGATCCATCCGGGT
CACAAGGAAAAGCAGATGACAACACCCCAATGGCTGACCATGAGGATCCGATGGATACAACAGAACAACATGGTGGTGCTGAGGAAGTAACTGAAATAGGAGAACATACG
TGTAATGGTATGGTTGATGATTTGGATCCATATACGAAAAGTACTGCACTCCCATATATGGAGTATTTGGTACAGTATATGGAATGGGATATGGACGAAACCCTTGGTGA
CTGGACTGGGGGATGA
Protein sequenceShow/hide protein sequence
MFWHFLDTSLMFNGQLIHYFLLREVNEPRIDVISFEILGEKVSFGRREFDLITGIRHRTQHVRGNVSSTRLRRLYLNDSISMKGFELDRLFPSINFESDEDAVKMAIFYF
IELAMMGRERKQQMDTSLLGFIDDWQRFCNEDWSKLIFDKTIKGLKKAVGGKAVSYKERTDGKQETYSLYGFPYAFQVWTYETVSSLTGRVANRLNDNAIPRILRWSCNH
SPTLAALSREVFSSDMARVTTELVASEEEIQFMDRVMQPPRAPSPPPPPPPPPPPPPPPAALGDILVEDNIVEDLGTENPNEVAEGVGTSGTNDRVCKRCKVLEDEVKVI
KDDVKEIKEDLKVIKSMEKDLKAIRKFMRRLSKGKFVDASKYIEPDDGTDDGGGGSRPHSKGQDDGGGPDPSGSQGKADDNTPMADHEDPMDTTEQHGGAEEVTEIGEHT
CNGMVDDLDPYTKSTALPYMEYLVQYMEWDMDETLGDWTGG