CuGenDBv2

Lag0001738 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0001738
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Ulp1-like peptidase
Genome location: chr4:34854367..34857160
RNA-Seq Expression: Lag0001738
Synteny: Lag0001738
Gene Ontology terms: NA
InterPro domains: IPR015410 - Domain of unknown function DUF1985
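For scripted use of this record, the genome location string can be split into chromosome, start, and end. The following is a minimal sketch only: `GenomeLocation` and `parse_location` are hypothetical names, not part of CuGenDB, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """Parsed form of a 'chrom:start..end' location string (hypothetical helper)."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumes 1-based, inclusive coordinates (an assumption, not stated in the record).
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Split a location such as 'chr4:34854367..34857160' into its parts."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    return GenomeLocation(match.group(1), int(match.group(2)), int(match.group(3)))


loc = parse_location("chr4:34854367..34857160")
print(loc.chrom, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 2794 bp on chr4.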


Homology
GenBank top hits (e value, % identity, alignment)
XP_022146372.1 uncharacterized protein LOC111015600 [Momordica charantia]; e value: 1.5e-58; % identity: 53.15
Query:  DYFPAALTCCSHLEKTVKNIKDKLTDTQLGMFRQTCFGHFLDTSLMFNGQLIHYFLLREVNEPRIDVISFEILGEKVSFGRREFDLITGIRHRTQHVRGN
        D+FPA LT  +H++KT   IK +LT TQL MFRQTCFG  LD  ++FNG LIH+ LL EV EPR DVISF++  ++VSFG+REFDLITG+ H+   V  +
Subjt:  DYFPAALTCCSHLEKTVKNIKDKLTDTQLGMFRQTCFGHFLDTSLMFNGQLIHYFLLREVNEPRIDVISFEILGEKVSFGRREFDLITGIRHRTQHVRGN

Query:  VSSTRLRRLYLNDSISMKGFELDRLFPTINFESDEDAVKMAIFYFIELAMMGRERKQQMDTSLLGFIDDWQRFCNEDWSKLIFDKTIKGLKKAVGGKAVS
        +   RLR  Y  DS+ +K  EL+++F    F  DED VK+ I YFIELAMMG+ERKQ +DT  +G +D W+ FCN DWS +IFD+TI  LK  +  K  +
Subjt:  VSSTRLRRLYLNDSISMKGFELDRLFPTINFESDEDAVKMAIFYFIELAMMGRERKQQMDTSLLGFIDDWQRFCNEDWSKLIFDKTIKGLKKAVGGKAVS

Query:  YKERTNCKQ---ETYSLYGFPY
        Y+++        ETYSLYGFPY
Subjt:  YKERTNCKQ---ETYSLYGFPY

XP_022153201.1 uncharacterized protein LOC111020757 [Momordica charantia]; e value: 1.3e-70; % identity: 38.02
Query:  DYFPAALTCCSHLEKTVKNIKDKLTDTQLGMFRQTCFGHFLDTSLMFNGQLIHYFLLREVNEPRIDVISFEILGEKVSFGRREFDLITGIRHRTQHVRGN
        D+FPA LT  +H++KT   IK +LT TQL MFRQTCFG  LD  ++FNG LIH+ LLREV EPR DVISF++ G++VSFG+REFDLITG+ HR   V  +
Subjt:  DYFPAALTCCSHLEKTVKNIKDKLTDTQLGMFRQTCFGHFLDTSLMFNGQLIHYFLLREVNEPRIDVISFEILGEKVSFGRREFDLITGIRHRTQHVRGN

Query:  VSSTRLRRLYLNDSISMKGFELDRLFPTINFESDEDAVKMAIFYFIELAMMGRERKQQMDTSLLGFIDDWQRFCNEDWSKLIFDKTIKGLKKAVGGKAVS
        +   RLR  Y  D + +K  EL+++F    F  DED VK+ I YFIELAMMG+ERKQ +DT+LLG +D W+ FCN DWS +IFD+TI  LK A+  K   
Subjt:  VSSTRLRRLYLNDSISMKGFELDRLFPTINFESDEDAVKMAIFYFIELAMMGRERKQQMDTSLLGFIDDWQRFCNEDWSKLIFDKTIKGLKKAVGGKAVS

Query:  YKERTNCKQ---ETYSLYGFPYAFQVWTYETVSSLTGRVANRLNDNAIPRILRWSSTSS-----------------------------------------
        Y+++        ETYSLYGFPYAFQVW YET+S+        L+D+AIPR+LRWS   S                                         
Subjt:  YKERTNCKQ---ETYSLYGFPYAFQVWTYETVSSLTGRVANRLNDNAIPRILRWSSTSS-----------------------------------------

Query:  ------------------PISTSTSTSTSAPAALEDIPVEDTIV------EDLGTENPNEVVE-GVGTSGTNDRVCKRCKVLEDEMKVIKDDVKEIKEDL
                          P S   +     PA +E  P+ED +V      E   + N  E +E  +  +    R+ +R K L++ +  I+D + +     
Subjt:  ------------------PISTSTSTSTSAPAALEDIPVEDTIV------EDLGTENPNEVVE-GVGTSGTNDRVCKRCKVLEDEMKVIKDDVKEIKEDL

Query:  KVIKSMEKDLKAIRKFMRRLSKGKFVDANKYIEPDDGTDDGGGGSRPHSKGQDDGGGPPSGSQGKANDNTPMADHADPMDTTKQ
                 LK I+ ++++L+KGKF D++KY     G DD G   +   +     GG  S  + + +D     D  + ++T K+
Subjt:  KVIKSMEKDLKAIRKFMRRLSKGKFVDANKYIEPDDGTDDGGGGSRPHSKGQDDGGGPPSGSQGKANDNTPMADHADPMDTTKQ

XP_022155158.1 uncharacterized protein LOC111022300 [Momordica charantia]; e value: 4.1e-56; % identity: 53.47
Query:  DYFPAALTCCSHLEKTVKNIKDKLTDTQLGMFRQTCFGHFLDTSLMFNGQLIHYFLLREVNEPRIDVISFEILGEKVSFGRREFDLITGIRHRTQHVRGN
        D+FP  LT  +H +KT   +K +LT TQ+ MFRQTCFG  LD  ++FNG LIH+ LLREV EPR D+ISF++ G++VSFG+REFDLITG+ +R   V  +
Subjt:  DYFPAALTCCSHLEKTVKNIKDKLTDTQLGMFRQTCFGHFLDTSLMFNGQLIHYFLLREVNEPRIDVISFEILGEKVSFGRREFDLITGIRHRTQHVRGN

Query:  VSSTRLRRLYLNDSISMKGFELDRLFPTINFESDEDAVKMAIFYFIELAMMGRERKQQMDTSLLGFIDDWQRFCNEDWSKLIFDKTIKGLKKAVGGKAVS
        +   RLR  Y  DS+ +K  EL+++F    F  DEDAVK+ I YF+ELAMMG+ERKQ +D +LLG +D W+ FCN DWS LIF++T+  LK AV  K  +
Subjt:  VSSTRLRRLYLNDSISMKGFELDRLFPTINFESDEDAVKMAIFYFIELAMMGRERKQQMDTSLLGFIDDWQRFCNEDWSKLIFDKTIKGLKKAVGGKAVS

Query:  YK
        Y+
Subjt:  YK

XP_022157020.1 uncharacterized protein LOC111023847 [Momordica charantia]; e value: 1.5e-77; % identity: 56.13
Query:  MALVPKIAPADYFPAALTCCSHLEKTVKNIKDKLTDTQLGMFRQTCFGHFLDTSLMFNGQLIHYFLLREVNEPRIDVISFEILGEKVSFGRREFDLITGI
        M +  KI   D+FPAAL+  +H+ KT   +K +LT +QL MF QTCFG  L  +++FNG L+H+ LLREV EP+ D+ISF + G +VSFG+REFDLITG+
Subjt:  MALVPKIAPADYFPAALTCCSHLEKTVKNIKDKLTDTQLGMFRQTCFGHFLDTSLMFNGQLIHYFLLREVNEPRIDVISFEILGEKVSFGRREFDLITGI

Query:  RHRTQHVRGNVSSTRLRRLYLNDSISMKGFELDRLFPTINFESDEDAVKMAIFYFIELAMMGRERKQQMDTSLLGFIDDWQRFCNEDWSKLIFDKTIKGL
        RH    V  +V + RLR LY  D  S+K  EL+++F    FE+DEDAVK+AI YFIELAMMG+ERK +MDTSLLG +D W+ FCN DWS +IF++T+  L
Subjt:  RHRTQHVRGNVSSTRLRRLYLNDSISMKGFELDRLFPTINFESDEDAVKMAIFYFIELAMMGRERKQQMDTSLLGFIDDWQRFCNEDWSKLIFDKTIKGL

Query:  KKAVGGKAVSYKERT---NCKQETYSLYGFPYAFQVWTYETVSSLTGRVANRLNDNAIPRILRWSSTSS
        K A+  K   YK++    +   ETYSLY FPYAFQVW YET+S+L+ RVA RLND+AIPR+LRWS T S
Subjt:  KKAVGGKAVSYKERT---NCKQETYSLYGFPYAFQVWTYETVSSLTGRVANRLNDNAIPRILRWSSTSS

XP_022158744.1 uncharacterized protein LOC111025209 [Momordica charantia]; e value: 6.6e-54; % identity: 44.4
Query:  LVPKIAPADYFPAALTCCSHLEKTVKNIKDKLTDTQLGMFRQTCFGHFLDTSLMFNGQLIHYFLLREVNEPRIDVISFEILGEKVSFGRREFDLITGIRH
        ++PKI PA Y  A L C SH+ KT  +IK KLT  QL MFR+T F H LD  L+FNG L                     LG KVSFGRREFD+I+G+++
Subjt:  LVPKIAPADYFPAALTCCSHLEKTVKNIKDKLTDTQLGMFRQTCFGHFLDTSLMFNGQLIHYFLLREVNEPRIDVISFEILGEKVSFGRREFDLITGIRH

Query:  RTQHVRGNVSSTRLRRLYLNDSISMKGFELDRLFPTINFESDEDAVKMAIFYFIELAMMGRERKQQMDTSLLGFIDDWQRFCNEDWSKLIFDKTIKGLKK
            VR      R   LY N+S  +   EL++++ +I FE D DAVK+ + YF+EL ++GRER  + D  LLG +DDW+  CN DW+ L FDKTI  L+ 
Subjt:  RTQHVRGNVSSTRLRRLYLNDSISMKGFELDRLFPTINFESDEDAVKMAIFYFIELAMMGRERKQQMDTSLLGFIDDWQRFCNEDWSKLIFDKTIKGLKK

Query:  AVGGKAVSYKERTNCKQETYSLYGFPYAFQVWTYETVSSLTGRVANRLNDNAIPRILRW
            +  S K +    +++YSLYGFP+AFQVW YE +SSL+G +   ++ + +PRIL+W
Subjt:  AVGGKAVSYKERTNCKQETYSLYGFPYAFQVWTYETVSSLTGRVANRLNDNAIPRILRW

TrEMBL top hits (e value, % identity, alignment)
A0A6J1CZE8 uncharacterized protein LOC111015600; e value: 7.3e-59; % identity: 53.15
Query:  DYFPAALTCCSHLEKTVKNIKDKLTDTQLGMFRQTCFGHFLDTSLMFNGQLIHYFLLREVNEPRIDVISFEILGEKVSFGRREFDLITGIRHRTQHVRGN
        D+FPA LT  +H++KT   IK +LT TQL MFRQTCFG  LD  ++FNG LIH+ LL EV EPR DVISF++  ++VSFG+REFDLITG+ H+   V  +
Subjt:  DYFPAALTCCSHLEKTVKNIKDKLTDTQLGMFRQTCFGHFLDTSLMFNGQLIHYFLLREVNEPRIDVISFEILGEKVSFGRREFDLITGIRHRTQHVRGN

Query:  VSSTRLRRLYLNDSISMKGFELDRLFPTINFESDEDAVKMAIFYFIELAMMGRERKQQMDTSLLGFIDDWQRFCNEDWSKLIFDKTIKGLKKAVGGKAVS
        +   RLR  Y  DS+ +K  EL+++F    F  DED VK+ I YFIELAMMG+ERKQ +DT  +G +D W+ FCN DWS +IFD+TI  LK  +  K  +
Subjt:  VSSTRLRRLYLNDSISMKGFELDRLFPTINFESDEDAVKMAIFYFIELAMMGRERKQQMDTSLLGFIDDWQRFCNEDWSKLIFDKTIKGLKKAVGGKAVS

Query:  YKERTNCKQ---ETYSLYGFPY
        Y+++        ETYSLYGFPY
Subjt:  YKERTNCKQ---ETYSLYGFPY

A0A6J1DJX9 uncharacterized protein LOC111020757; e value: 6.4e-71; % identity: 38.02
Query:  DYFPAALTCCSHLEKTVKNIKDKLTDTQLGMFRQTCFGHFLDTSLMFNGQLIHYFLLREVNEPRIDVISFEILGEKVSFGRREFDLITGIRHRTQHVRGN
        D+FPA LT  +H++KT   IK +LT TQL MFRQTCFG  LD  ++FNG LIH+ LLREV EPR DVISF++ G++VSFG+REFDLITG+ HR   V  +
Subjt:  DYFPAALTCCSHLEKTVKNIKDKLTDTQLGMFRQTCFGHFLDTSLMFNGQLIHYFLLREVNEPRIDVISFEILGEKVSFGRREFDLITGIRHRTQHVRGN

Query:  VSSTRLRRLYLNDSISMKGFELDRLFPTINFESDEDAVKMAIFYFIELAMMGRERKQQMDTSLLGFIDDWQRFCNEDWSKLIFDKTIKGLKKAVGGKAVS
        +   RLR  Y  D + +K  EL+++F    F  DED VK+ I YFIELAMMG+ERKQ +DT+LLG +D W+ FCN DWS +IFD+TI  LK A+  K   
Subjt:  VSSTRLRRLYLNDSISMKGFELDRLFPTINFESDEDAVKMAIFYFIELAMMGRERKQQMDTSLLGFIDDWQRFCNEDWSKLIFDKTIKGLKKAVGGKAVS

Query:  YKERTNCKQ---ETYSLYGFPYAFQVWTYETVSSLTGRVANRLNDNAIPRILRWSSTSS-----------------------------------------
        Y+++        ETYSLYGFPYAFQVW YET+S+        L+D+AIPR+LRWS   S                                         
Subjt:  YKERTNCKQ---ETYSLYGFPYAFQVWTYETVSSLTGRVANRLNDNAIPRILRWSSTSS-----------------------------------------

Query:  ------------------PISTSTSTSTSAPAALEDIPVEDTIV------EDLGTENPNEVVE-GVGTSGTNDRVCKRCKVLEDEMKVIKDDVKEIKEDL
                          P S   +     PA +E  P+ED +V      E   + N  E +E  +  +    R+ +R K L++ +  I+D + +     
Subjt:  ------------------PISTSTSTSTSAPAALEDIPVEDTIV------EDLGTENPNEVVE-GVGTSGTNDRVCKRCKVLEDEMKVIKDDVKEIKEDL

Query:  KVIKSMEKDLKAIRKFMRRLSKGKFVDANKYIEPDDGTDDGGGGSRPHSKGQDDGGGPPSGSQGKANDNTPMADHADPMDTTKQ
                 LK I+ ++++L+KGKF D++KY     G DD G   +   +     GG  S  + + +D     D  + ++T K+
Subjt:  KVIKSMEKDLKAIRKFMRRLSKGKFVDANKYIEPDDGTDDGGGGSRPHSKGQDDGGGPPSGSQGKANDNTPMADHADPMDTTKQ

A0A6J1DM82 uncharacterized protein LOC111022300; e value: 2.0e-56; % identity: 53.47
Query:  DYFPAALTCCSHLEKTVKNIKDKLTDTQLGMFRQTCFGHFLDTSLMFNGQLIHYFLLREVNEPRIDVISFEILGEKVSFGRREFDLITGIRHRTQHVRGN
        D+FP  LT  +H +KT   +K +LT TQ+ MFRQTCFG  LD  ++FNG LIH+ LLREV EPR D+ISF++ G++VSFG+REFDLITG+ +R   V  +
Subjt:  DYFPAALTCCSHLEKTVKNIKDKLTDTQLGMFRQTCFGHFLDTSLMFNGQLIHYFLLREVNEPRIDVISFEILGEKVSFGRREFDLITGIRHRTQHVRGN

Query:  VSSTRLRRLYLNDSISMKGFELDRLFPTINFESDEDAVKMAIFYFIELAMMGRERKQQMDTSLLGFIDDWQRFCNEDWSKLIFDKTIKGLKKAVGGKAVS
        +   RLR  Y  DS+ +K  EL+++F    F  DEDAVK+ I YF+ELAMMG+ERKQ +D +LLG +D W+ FCN DWS LIF++T+  LK AV  K  +
Subjt:  VSSTRLRRLYLNDSISMKGFELDRLFPTINFESDEDAVKMAIFYFIELAMMGRERKQQMDTSLLGFIDDWQRFCNEDWSKLIFDKTIKGLKKAVGGKAVS

Query:  YK
        Y+
Subjt:  YK

A0A6J1DRZ7 uncharacterized protein LOC111023847; e value: 7.0e-78; % identity: 56.13
Query:  MALVPKIAPADYFPAALTCCSHLEKTVKNIKDKLTDTQLGMFRQTCFGHFLDTSLMFNGQLIHYFLLREVNEPRIDVISFEILGEKVSFGRREFDLITGI
        M +  KI   D+FPAAL+  +H+ KT   +K +LT +QL MF QTCFG  L  +++FNG L+H+ LLREV EP+ D+ISF + G +VSFG+REFDLITG+
Subjt:  MALVPKIAPADYFPAALTCCSHLEKTVKNIKDKLTDTQLGMFRQTCFGHFLDTSLMFNGQLIHYFLLREVNEPRIDVISFEILGEKVSFGRREFDLITGI

Query:  RHRTQHVRGNVSSTRLRRLYLNDSISMKGFELDRLFPTINFESDEDAVKMAIFYFIELAMMGRERKQQMDTSLLGFIDDWQRFCNEDWSKLIFDKTIKGL
        RH    V  +V + RLR LY  D  S+K  EL+++F    FE+DEDAVK+AI YFIELAMMG+ERK +MDTSLLG +D W+ FCN DWS +IF++T+  L
Subjt:  RHRTQHVRGNVSSTRLRRLYLNDSISMKGFELDRLFPTINFESDEDAVKMAIFYFIELAMMGRERKQQMDTSLLGFIDDWQRFCNEDWSKLIFDKTIKGL

Query:  KKAVGGKAVSYKERT---NCKQETYSLYGFPYAFQVWTYETVSSLTGRVANRLNDNAIPRILRWSSTSS
        K A+  K   YK++    +   ETYSLY FPYAFQVW YET+S+L+ RVA RLND+AIPR+LRWS T S
Subjt:  KKAVGGKAVSYKERT---NCKQETYSLYGFPYAFQVWTYETVSSLTGRVANRLNDNAIPRILRWSSTSS

A0A6J1E0A9 uncharacterized protein LOC111025209; e value: 3.2e-54; % identity: 44.4
Query:  LVPKIAPADYFPAALTCCSHLEKTVKNIKDKLTDTQLGMFRQTCFGHFLDTSLMFNGQLIHYFLLREVNEPRIDVISFEILGEKVSFGRREFDLITGIRH
        ++PKI PA Y  A L C SH+ KT  +IK KLT  QL MFR+T F H LD  L+FNG L                     LG KVSFGRREFD+I+G+++
Subjt:  LVPKIAPADYFPAALTCCSHLEKTVKNIKDKLTDTQLGMFRQTCFGHFLDTSLMFNGQLIHYFLLREVNEPRIDVISFEILGEKVSFGRREFDLITGIRH

Query:  RTQHVRGNVSSTRLRRLYLNDSISMKGFELDRLFPTINFESDEDAVKMAIFYFIELAMMGRERKQQMDTSLLGFIDDWQRFCNEDWSKLIFDKTIKGLKK
            VR      R   LY N+S  +   EL++++ +I FE D DAVK+ + YF+EL ++GRER  + D  LLG +DDW+  CN DW+ L FDKTI  L+ 
Subjt:  RTQHVRGNVSSTRLRRLYLNDSISMKGFELDRLFPTINFESDEDAVKMAIFYFIELAMMGRERKQQMDTSLLGFIDDWQRFCNEDWSKLIFDKTIKGLKK

Query:  AVGGKAVSYKERTNCKQETYSLYGFPYAFQVWTYETVSSLTGRVANRLNDNAIPRILRW
            +  S K +    +++YSLYGFP+AFQVW YE +SSL+G +   ++ + +PRIL+W
Subjt:  AVGGKAVSYKERTNCKQETYSLYGFPYAFQVWTYETVSSLTGRVANRLNDNAIPRILRW

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGGCATTGGTACCAAAGATTGCACCCGCTGACTACTTTCCTGCTGCTTTGACATGTTGTTCACATCTCGAAAAAACCGTTAAAAATATTAAGGATAAATTAACTGATAC
CCAGTTAGGAATGTTTAGGCAAACATGTTTTGGACATTTCTTAGATACGTCCTTGATGTTTAATGGACAACTTATTCATTATTTTCTTTTGAGGGAAGTGAATGAGCCTA
GGATTGATGTTATTAGCTTTGAGATTCTGGGGGAGAAAGTTTCATTTGGTCGAAGGGAATTTGACCTTATTACTGGAATTAGGCATAGGACCCAACATGTTAGGGGTAAT
GTATCTAGTACTAGGCTGAGAAGACTGTACCTTAACGATAGCATCAGCATGAAAGGGTTTGAACTAGATAGATTATTCCCTACCATTAATTTTGAGAGCGACGAGGATGC
TGTGAAGATGGCCATATTTTATTTCATTGAGTTGGCTATGATGGGGAGGGAGAGAAAGCAGCAGATGGACACTAGCCTGCTCGGCTTTATTGATGATTGGCAAAGGTTTT
GTAATGAGGATTGGAGTAAGTTAATTTTTGATAAGACCATAAAGGGACTCAAGAAGGCTGTAGGTGGGAAGGCAGTGTCCTATAAAGAGAGGACGAATTGCAAACAGGAA
ACGTACAGTCTGTATGGCTTCCCATATGCGTTTCAGGTATGGACATACGAGACAGTATCTTCTTTGACCGGACGTGTGGCTAATCGCTTGAATGACAATGCCATTCCACG
CATATTAAGATGGTCATCCACCTCGAGCCCCATCTCCACCTCCACCTCCACCTCCACCTCCGCCCCAGCAGCTTTGGAAGATATTCCAGTTGAAGATACTATCGTTGAGG
ATCTCGGGACTGAGAATCCAAATGAAGTGGTGGAGGGTGTTGGGACGTCTGGTACGAATGACAGAGTCTGCAAGAGGTGCAAAGTCCTCGAAGACGAGATGAAAGTGATT
AAAGACGATGTGAAGGAGATTAAGGAGGATTTGAAGGTCATTAAGTCCATGGAAAAAGACCTGAAGGCTATAAGGAAGTTCATGCGTCGACTTTCAAAGGGTAAATTCGT
CGACGCCAACAAGTATATAGAACCAGATGACGGTACAGACGATGGTGGTGGTGGATCTCGACCACATTCAAAAGGTCAGGATGATGGTGGTGGTCCTCCATCCGGGTCAC
AAGGAAAAGCAAATGACAACACCCCAATGGCTGACCATGCGGATCCGATGGATACAACAAAACAACTTGGTCGAGTCGAGGAAGTAAATGACCCGATAGAGGGTGTGGGA
AAAGACGTACAGATGGAAGTAACTGAAATAGGAGAACATGAAGTAATTGAGGCCCCGATAGAGGGCGTGGGAAAGGATATTCCTGTTGTCGAAAGTCAAAATTCTCTGGG
TGTCCAGTCCATTTCTGAACAGAACGAGCCGATAGAAAGACGGGGGACTCGTAAGAGGAAGACTGCATGGAAGTTGAGAAGTCCATGGAAAGACACACGGGAAGACCGTA
AGAAACGCAAGGCTCTGAAATACGATCCTCTTCCCCAGATCCCCCACGATCTCGATGCTCCATTCAAAATTTGGCTTGACACTGAGGACCCAGAAGACAATGTTCGAATG
CACTTTAGTGAACAACTTGTTGTGGATTCAAGAGAGAAGATGAGGGGTTGGATGAAGCATCCCGGCCGGGATGCATGA
mRNA sequence
ATGGCATTGGTACCAAAGATTGCACCCGCTGACTACTTTCCTGCTGCTTTGACATGTTGTTCACATCTCGAAAAAACCGTTAAAAATATTAAGGATAAATTAACTGATAC
CCAGTTAGGAATGTTTAGGCAAACATGTTTTGGACATTTCTTAGATACGTCCTTGATGTTTAATGGACAACTTATTCATTATTTTCTTTTGAGGGAAGTGAATGAGCCTA
GGATTGATGTTATTAGCTTTGAGATTCTGGGGGAGAAAGTTTCATTTGGTCGAAGGGAATTTGACCTTATTACTGGAATTAGGCATAGGACCCAACATGTTAGGGGTAAT
GTATCTAGTACTAGGCTGAGAAGACTGTACCTTAACGATAGCATCAGCATGAAAGGGTTTGAACTAGATAGATTATTCCCTACCATTAATTTTGAGAGCGACGAGGATGC
TGTGAAGATGGCCATATTTTATTTCATTGAGTTGGCTATGATGGGGAGGGAGAGAAAGCAGCAGATGGACACTAGCCTGCTCGGCTTTATTGATGATTGGCAAAGGTTTT
GTAATGAGGATTGGAGTAAGTTAATTTTTGATAAGACCATAAAGGGACTCAAGAAGGCTGTAGGTGGGAAGGCAGTGTCCTATAAAGAGAGGACGAATTGCAAACAGGAA
ACGTACAGTCTGTATGGCTTCCCATATGCGTTTCAGGTATGGACATACGAGACAGTATCTTCTTTGACCGGACGTGTGGCTAATCGCTTGAATGACAATGCCATTCCACG
CATATTAAGATGGTCATCCACCTCGAGCCCCATCTCCACCTCCACCTCCACCTCCACCTCCGCCCCAGCAGCTTTGGAAGATATTCCAGTTGAAGATACTATCGTTGAGG
ATCTCGGGACTGAGAATCCAAATGAAGTGGTGGAGGGTGTTGGGACGTCTGGTACGAATGACAGAGTCTGCAAGAGGTGCAAAGTCCTCGAAGACGAGATGAAAGTGATT
AAAGACGATGTGAAGGAGATTAAGGAGGATTTGAAGGTCATTAAGTCCATGGAAAAAGACCTGAAGGCTATAAGGAAGTTCATGCGTCGACTTTCAAAGGGTAAATTCGT
CGACGCCAACAAGTATATAGAACCAGATGACGGTACAGACGATGGTGGTGGTGGATCTCGACCACATTCAAAAGGTCAGGATGATGGTGGTGGTCCTCCATCCGGGTCAC
AAGGAAAAGCAAATGACAACACCCCAATGGCTGACCATGCGGATCCGATGGATACAACAAAACAACTTGGTCGAGTCGAGGAAGTAAATGACCCGATAGAGGGTGTGGGA
AAAGACGTACAGATGGAAGTAACTGAAATAGGAGAACATGAAGTAATTGAGGCCCCGATAGAGGGCGTGGGAAAGGATATTCCTGTTGTCGAAAGTCAAAATTCTCTGGG
TGTCCAGTCCATTTCTGAACAGAACGAGCCGATAGAAAGACGGGGGACTCGTAAGAGGAAGACTGCATGGAAGTTGAGAAGTCCATGGAAAGACACACGGGAAGACCGTA
AGAAACGCAAGGCTCTGAAATACGATCCTCTTCCCCAGATCCCCCACGATCTCGATGCTCCATTCAAAATTTGGCTTGACACTGAGGACCCAGAAGACAATGTTCGAATG
CACTTTAGTGAACAACTTGTTGTGGATTCAAGAGAGAAGATGAGGGGTTGGATGAAGCATCCCGGCCGGGATGCATGA
Protein sequence
MALVPKIAPADYFPAALTCCSHLEKTVKNIKDKLTDTQLGMFRQTCFGHFLDTSLMFNGQLIHYFLLREVNEPRIDVISFEILGEKVSFGRREFDLITGIRHRTQHVRGN
VSSTRLRRLYLNDSISMKGFELDRLFPTINFESDEDAVKMAIFYFIELAMMGRERKQQMDTSLLGFIDDWQRFCNEDWSKLIFDKTIKGLKKAVGGKAVSYKERTNCKQE
TYSLYGFPYAFQVWTYETVSSLTGRVANRLNDNAIPRILRWSSTSSPISTSTSTSTSAPAALEDIPVEDTIVEDLGTENPNEVVEGVGTSGTNDRVCKRCKVLEDEMKVI
KDDVKEIKEDLKVIKSMEKDLKAIRKFMRRLSKGKFVDANKYIEPDDGTDDGGGGSRPHSKGQDDGGGPPSGSQGKANDNTPMADHADPMDTTKQLGRVEEVNDPIEGVG
KDVQMEVTEIGEHEVIEAPIEGVGKDIPVVESQNSLGVQSISEQNEPIERRGTRKRKTAWKLRSPWKDTREDRKKRKALKYDPLPQIPHDLDAPFKIWLDTEDPEDNVRM
HFSEQLVVDSREKMRGWMKHPGRDA
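
The protein sequence above should correspond to a translation of the CDS. As a rough consistency check, the sketch below translates a nucleotide string with the standard genetic code; the assumption that this gene uses the standard nuclear code is ours, and `translate` is a hypothetical helper rather than anything provided by CuGenDB. The two fragments used are the first 30 nt and the final line of the CDS as listed in this record.

```python
from itertools import product

# Standard genetic code, built in the conventional TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and other whitespace
    protein = []
    # Ignore any trailing partial codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt of the CDS and the final CDS line, copied from this record.
cds_start = "ATGGCATTGGTACCAAAGATTGCACCCGCT"
cds_end = ("CACTTTAGTGAACAACTTGTTGTGGATTCAAGAGAGAAGATGAGGGGTTGG"
           "ATGAAGCATCCCGGCCGGGATGCATGA")

print(translate(cds_start))
print(translate(cds_end))
```

The final CDS line starts on a codon boundary because the 15 preceding lines hold 110 nt each (1650 nt, a multiple of three), so it can be translated on its own and compared with the last line of the protein sequence.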