; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0035752 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0035752
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionULP_PROTEASE domain-containing protein
Genome locationchr3:29637639..29639632
RNA-Seq ExpressionLag0035752
SyntenyLag0035752
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domainsIPR003653 - Ulp1 protease family, C-terminal catalytic domain
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008451868.1 PREDICTED: uncharacterized protein LOC103493028 isoform X1 [Cucumis melo]3.9e-7145.95Show/hide
Query:  MYESPSQNATIHGVPLGVENVRVVVDMVIGDDCALLIPVNDELQMLYQAVDNFVGWPRRLVITVGDKEEPPAKA-KPIVQSSKHTDVHVTIRLLNRYAML
        M+ES  Q  TIHG+PLG EN+RV VD+ + +D AL IP+  +++ L QA+ NFV WPR+LVI   +K+ P   A +   QSSK+TDVHVTI+LLNRYAM 
Subjt:  MYESPSQNATIHGVPLGVENVRVVVDMVIGDDCALLIPVNDELQMLYQAVDNFVGWPRRLVITVGDKEEPPAKA-KPIVQSSKHTDVHVTIRLLNRYAML

Query:  SMQQEDTPTINMPERILGKEASVFLHCEDIMQYCGNVEIGYSCILTYITGGVMYYALEVFELMNTCIIFFGYLWTVLDPEITNKFFVVDQATISSYVKSQ
        +MQ ED   I++ E I GKE +++L  +DI+QYCG  EIGYSCILTYI                        LW V + EIT +F +VDQATISS++KSQ
Subjt:  SMQQEDTPTINMPERILGKEASVFLHCEDIMQYCGNVEIGYSCILTYITGGVMYYALEVFELMNTCIIFFGYLWTVLDPEITNKFFVVDQATISSYVKSQ

Query:  ELRSRNLSNRLDMVDLDQLVLIPFNTGH-HWMLIAIQPRENTVYILNSLRSKVEEEFSGTI---------------------------------------
        E RSRNL NRL+M +LDQLVLIP+NTG  HW+LI I  +EN VY+++ LRSK+  EF G I                                       
Subjt:  ELRSRNLSNRLDMVDLDQLVLIPFNTGH-HWMLIAIQPRENTVYILNSLRSKVEEEFSGTI---------------------------------------

Query:  ------------NTVLVNQFNTKNAFTQDEIDKVRIEWANFVGGFV
                    NT + N FNT +A+ Q+EID VR+EWA FV  FV
Subjt:  ------------NTVLVNQFNTKNAFTQDEIDKVRIEWANFVGGFV

XP_022136076.1 uncharacterized protein LOC111007859 isoform X1 [Momordica charantia]2.5e-7346.67Show/hide
Query:  MYESPSQNATIHGVPLGVENVRVVVDMVIGDDCALLIPVNDELQMLYQAVDNFVGWPRRLVITVGDKEEPPAK-AKPIVQSSKHTDVHVTIRLLNRYAML
        ++++  Q  T+HGVPLGV+NVRV+VD+VI +   + IPV  E++ L Q +  FV WPRRLVI   +K    ++ ++   Q SKHTDVHV+I+LLNRY ML
Subjt:  MYESPSQNATIHGVPLGVENVRVVVDMVIGDDCALLIPVNDELQMLYQAVDNFVGWPRRLVITVGDKEEPPAK-AKPIVQSSKHTDVHVTIRLLNRYAML

Query:  SMQQEDTPTINMPERILGKEASVFLHCEDIMQYCGNVEIGYSCILTYITGGVMYYALEVFELMNTCIIFFGYLWTVLDPEITNKFFVVDQATISSYVKSQ
        SMQ EDT  IN+ + I GKE +++L   DIMQYC  +EIGYSCILTYI                       YLW V + EIT KF +VD ATIS YVKSQ
Subjt:  SMQQEDTPTINMPERILGKEASVFLHCEDIMQYCGNVEIGYSCILTYITGGVMYYALEVFELMNTCIIFFGYLWTVLDPEITNKFFVVDQATISSYVKSQ

Query:  ELRSRNLSNRLDMVDLDQLVLIPFNTGHHWMLIAIQPRENTVYILNSLRSKVEEEFSGTINTVL------------------------------------
        E R RNL+NRL+MV+L+QLVLIP+ +G HWMLI I  REN VY+L+SLR K++E++   INT L                                    
Subjt:  ELRSRNLSNRLDMVDLDQLVLIPFNTGHHWMLIAIQPRENTVYILNSLRSKVEEEFSGTINTVL------------------------------------

Query:  ---------------VNQFNTKNAFTQDEIDKVRIEWANFVGGFV
                        N FNTKNA+ Q+EID+VRIEWA+FVGG V
Subjt:  ---------------VNQFNTKNAFTQDEIDKVRIEWANFVGGFV

XP_022136077.1 uncharacterized protein LOC111007859 isoform X2 [Momordica charantia]2.5e-7346.67Show/hide
Query:  MYESPSQNATIHGVPLGVENVRVVVDMVIGDDCALLIPVNDELQMLYQAVDNFVGWPRRLVITVGDKEEPPAK-AKPIVQSSKHTDVHVTIRLLNRYAML
        ++++  Q  T+HGVPLGV+NVRV+VD+VI +   + IPV  E++ L Q +  FV WPRRLVI   +K    ++ ++   Q SKHTDVHV+I+LLNRY ML
Subjt:  MYESPSQNATIHGVPLGVENVRVVVDMVIGDDCALLIPVNDELQMLYQAVDNFVGWPRRLVITVGDKEEPPAK-AKPIVQSSKHTDVHVTIRLLNRYAML

Query:  SMQQEDTPTINMPERILGKEASVFLHCEDIMQYCGNVEIGYSCILTYITGGVMYYALEVFELMNTCIIFFGYLWTVLDPEITNKFFVVDQATISSYVKSQ
        SMQ EDT  IN+ + I GKE +++L   DIMQYC  +EIGYSCILTYI                       YLW V + EIT KF +VD ATIS YVKSQ
Subjt:  SMQQEDTPTINMPERILGKEASVFLHCEDIMQYCGNVEIGYSCILTYITGGVMYYALEVFELMNTCIIFFGYLWTVLDPEITNKFFVVDQATISSYVKSQ

Query:  ELRSRNLSNRLDMVDLDQLVLIPFNTGHHWMLIAIQPRENTVYILNSLRSKVEEEFSGTINTVL------------------------------------
        E R RNL+NRL+MV+L+QLVLIP+ +G HWMLI I  REN VY+L+SLR K++E++   INT L                                    
Subjt:  ELRSRNLSNRLDMVDLDQLVLIPFNTGHHWMLIAIQPRENTVYILNSLRSKVEEEFSGTINTVL------------------------------------

Query:  ---------------VNQFNTKNAFTQDEIDKVRIEWANFVGGFV
                        N FNTKNA+ Q+EID+VRIEWA+FVGG V
Subjt:  ---------------VNQFNTKNAFTQDEIDKVRIEWANFVGGFV

XP_038895921.1 uncharacterized protein LOC120084092 isoform X1 [Benincasa hispida]2.0e-7548.27Show/hide
Query:  MYESPSQNATIHGVPLGVENVRVVVDMVIGDDCALLIPVNDELQMLYQAVDNFVGWPRRLVITVGDKEEP-PAKAKPIVQSSKHTDVHVTIRLLNRYAML
        M+ES +Q  +I+ +PLG +NVR +VD+V+G+D AL IP  D+++ L QA+ NFV WPR+LVIT  +K+ P P  +K I QSSK+TDVHVTI+LLNRYAM 
Subjt:  MYESPSQNATIHGVPLGVENVRVVVDMVIGDDCALLIPVNDELQMLYQAVDNFVGWPRRLVITVGDKEEP-PAKAKPIVQSSKHTDVHVTIRLLNRYAML

Query:  SMQQEDTPTINMPERILGKEASVFLHCEDIMQYCGNVEIGYSCILTYITGGVMYYALEVFELMNTCIIFFGYLWTVLDPEITNKFFVVDQATISSYVKSQ
        SMQ +D   IN+ E+ILGKE +++L  +DI+QYCG  EIGYSCIL YI                        LW   D EIT KF +VDQATISS+VK Q
Subjt:  SMQQEDTPTINMPERILGKEASVFLHCEDIMQYCGNVEIGYSCILTYITGGVMYYALEVFELMNTCIIFFGYLWTVLDPEITNKFFVVDQATISSYVKSQ

Query:  ELRSRNLSNRLDMVDLDQLVLIPFNTGH-HWMLIAIQPRENTVYILNSLRSKVEEEFSGTINTVL-----------------------------------
        ELRS+NL NRL+MV LDQLVLIP+NTG  HW+LI I  +EN VY+++SLRSK+ EEF G INT L                                   
Subjt:  ELRSRNLSNRLDMVDLDQLVLIPFNTGH-HWMLIAIQPRENTVYILNSLRSKVEEEFSGTINTVL-----------------------------------

Query:  ----------------VNQFNTKNAFTQDEIDKVRIEWANFVGGFV
                         N FNT+ A+ Q EID VR+EWA FV  FV
Subjt:  ----------------VNQFNTKNAFTQDEIDKVRIEWANFVGGFV

XP_038895930.1 uncharacterized protein LOC120084092 isoform X2 [Benincasa hispida]1.4e-7648.41Show/hide
Query:  MYESPSQNATIHGVPLGVENVRVVVDMVIGDDCALLIPVNDELQMLYQAVDNFVGWPRRLVITVGDKEEP-PAKAKPIVQSSKHTDVHVTIRLLNRYAML
        M+ES +Q  +I+ +PLG +NVR +VD+V+G+D AL IP  D+++ L QA+ NFV WPR+LVIT  +K+ P P  +K I QSSK+TDVHVTI+LLNRYAM 
Subjt:  MYESPSQNATIHGVPLGVENVRVVVDMVIGDDCALLIPVNDELQMLYQAVDNFVGWPRRLVITVGDKEEP-PAKAKPIVQSSKHTDVHVTIRLLNRYAML

Query:  SMQQEDTPTINMPERILGKEASVFLHCEDIMQYCGNVEIGYSCILTYITGGVMYYALEVFELMNTCIIFFGYLWTVLDPEITNKFFVVDQATISSYVKSQ
        SMQ +D   IN+ E+ILGKE +++L  +DI+QYCG  EIGYSCIL YI                        LW   D EIT KF +VDQATISS+VK Q
Subjt:  SMQQEDTPTINMPERILGKEASVFLHCEDIMQYCGNVEIGYSCILTYITGGVMYYALEVFELMNTCIIFFGYLWTVLDPEITNKFFVVDQATISSYVKSQ

Query:  ELRSRNLSNRLDMVDLDQLVLIPFNTGHHWMLIAIQPRENTVYILNSLRSKVEEEFSGTINTVL------------------------------------
        ELRS+NL NRL+MV LDQLVLIP+NTG HW+LI I  +EN VY+++SLRSK+ EEF G INT L                                    
Subjt:  ELRSRNLSNRLDMVDLDQLVLIPFNTGHHWMLIAIQPRENTVYILNSLRSKVEEEFSGTINTVL------------------------------------

Query:  ---------------VNQFNTKNAFTQDEIDKVRIEWANFVGGFV
                        N FNT+ A+ Q EID VR+EWA FV  FV
Subjt:  ---------------VNQFNTKNAFTQDEIDKVRIEWANFVGGFV

TrEMBL top hitse value%identityAlignment
A0A1S3BRX5 uncharacterized protein LOC103493028 isoform X11.9e-7145.95Show/hide
Query:  MYESPSQNATIHGVPLGVENVRVVVDMVIGDDCALLIPVNDELQMLYQAVDNFVGWPRRLVITVGDKEEPPAKA-KPIVQSSKHTDVHVTIRLLNRYAML
        M+ES  Q  TIHG+PLG EN+RV VD+ + +D AL IP+  +++ L QA+ NFV WPR+LVI   +K+ P   A +   QSSK+TDVHVTI+LLNRYAM 
Subjt:  MYESPSQNATIHGVPLGVENVRVVVDMVIGDDCALLIPVNDELQMLYQAVDNFVGWPRRLVITVGDKEEPPAKA-KPIVQSSKHTDVHVTIRLLNRYAML

Query:  SMQQEDTPTINMPERILGKEASVFLHCEDIMQYCGNVEIGYSCILTYITGGVMYYALEVFELMNTCIIFFGYLWTVLDPEITNKFFVVDQATISSYVKSQ
        +MQ ED   I++ E I GKE +++L  +DI+QYCG  EIGYSCILTYI                        LW V + EIT +F +VDQATISS++KSQ
Subjt:  SMQQEDTPTINMPERILGKEASVFLHCEDIMQYCGNVEIGYSCILTYITGGVMYYALEVFELMNTCIIFFGYLWTVLDPEITNKFFVVDQATISSYVKSQ

Query:  ELRSRNLSNRLDMVDLDQLVLIPFNTGH-HWMLIAIQPRENTVYILNSLRSKVEEEFSGTI---------------------------------------
        E RSRNL NRL+M +LDQLVLIP+NTG  HW+LI I  +EN VY+++ LRSK+  EF G I                                       
Subjt:  ELRSRNLSNRLDMVDLDQLVLIPFNTGH-HWMLIAIQPRENTVYILNSLRSKVEEEFSGTI---------------------------------------

Query:  ------------NTVLVNQFNTKNAFTQDEIDKVRIEWANFVGGFV
                    NT + N FNT +A+ Q+EID VR+EWA FV  FV
Subjt:  ------------NTVLVNQFNTKNAFTQDEIDKVRIEWANFVGGFV

A0A5D3CYL9 ULP_PROTEASE domain-containing protein1.9e-7145.95Show/hide
Query:  MYESPSQNATIHGVPLGVENVRVVVDMVIGDDCALLIPVNDELQMLYQAVDNFVGWPRRLVITVGDKEEPPAKA-KPIVQSSKHTDVHVTIRLLNRYAML
        M+ES  Q  TIHG+PLG EN+RV VD+ + +D AL IP+  +++ L QA+ NFV WPR+LVI   +K+ P   A +   QSSK+TDVHVTI+LLNRYAM 
Subjt:  MYESPSQNATIHGVPLGVENVRVVVDMVIGDDCALLIPVNDELQMLYQAVDNFVGWPRRLVITVGDKEEPPAKA-KPIVQSSKHTDVHVTIRLLNRYAML

Query:  SMQQEDTPTINMPERILGKEASVFLHCEDIMQYCGNVEIGYSCILTYITGGVMYYALEVFELMNTCIIFFGYLWTVLDPEITNKFFVVDQATISSYVKSQ
        +MQ ED   I++ E I GKE +++L  +DI+QYCG  EIGYSCILTYI                        LW V + EIT +F +VDQATISS++KSQ
Subjt:  SMQQEDTPTINMPERILGKEASVFLHCEDIMQYCGNVEIGYSCILTYITGGVMYYALEVFELMNTCIIFFGYLWTVLDPEITNKFFVVDQATISSYVKSQ

Query:  ELRSRNLSNRLDMVDLDQLVLIPFNTGH-HWMLIAIQPRENTVYILNSLRSKVEEEFSGTI---------------------------------------
        E RSRNL NRL+M +LDQLVLIP+NTG  HW+LI I  +EN VY+++ LRSK+  EF G I                                       
Subjt:  ELRSRNLSNRLDMVDLDQLVLIPFNTGH-HWMLIAIQPRENTVYILNSLRSKVEEEFSGTI---------------------------------------

Query:  ------------NTVLVNQFNTKNAFTQDEIDKVRIEWANFVGGFV
                    NT + N FNT +A+ Q+EID VR+EWA FV  FV
Subjt:  ------------NTVLVNQFNTKNAFTQDEIDKVRIEWANFVGGFV

A0A6J1C2H7 uncharacterized protein LOC111007859 isoform X11.2e-7346.67Show/hide
Query:  MYESPSQNATIHGVPLGVENVRVVVDMVIGDDCALLIPVNDELQMLYQAVDNFVGWPRRLVITVGDKEEPPAK-AKPIVQSSKHTDVHVTIRLLNRYAML
        ++++  Q  T+HGVPLGV+NVRV+VD+VI +   + IPV  E++ L Q +  FV WPRRLVI   +K    ++ ++   Q SKHTDVHV+I+LLNRY ML
Subjt:  MYESPSQNATIHGVPLGVENVRVVVDMVIGDDCALLIPVNDELQMLYQAVDNFVGWPRRLVITVGDKEEPPAK-AKPIVQSSKHTDVHVTIRLLNRYAML

Query:  SMQQEDTPTINMPERILGKEASVFLHCEDIMQYCGNVEIGYSCILTYITGGVMYYALEVFELMNTCIIFFGYLWTVLDPEITNKFFVVDQATISSYVKSQ
        SMQ EDT  IN+ + I GKE +++L   DIMQYC  +EIGYSCILTYI                       YLW V + EIT KF +VD ATIS YVKSQ
Subjt:  SMQQEDTPTINMPERILGKEASVFLHCEDIMQYCGNVEIGYSCILTYITGGVMYYALEVFELMNTCIIFFGYLWTVLDPEITNKFFVVDQATISSYVKSQ

Query:  ELRSRNLSNRLDMVDLDQLVLIPFNTGHHWMLIAIQPRENTVYILNSLRSKVEEEFSGTINTVL------------------------------------
        E R RNL+NRL+MV+L+QLVLIP+ +G HWMLI I  REN VY+L+SLR K++E++   INT L                                    
Subjt:  ELRSRNLSNRLDMVDLDQLVLIPFNTGHHWMLIAIQPRENTVYILNSLRSKVEEEFSGTINTVL------------------------------------

Query:  ---------------VNQFNTKNAFTQDEIDKVRIEWANFVGGFV
                        N FNTKNA+ Q+EID+VRIEWA+FVGG V
Subjt:  ---------------VNQFNTKNAFTQDEIDKVRIEWANFVGGFV

A0A6J1C2V2 uncharacterized protein LOC111007859 isoform X44.8e-6752.65Show/hide
Query:  MYESPSQNATIHGVPLGVENVRVVVDMVIGDDCALLIPVNDELQMLYQAVDNFVGWPRRLVITVGDKEEPPAK-AKPIVQSSKHTDVHVTIRLLNRYAML
        ++++  Q  T+HGVPLGV+NVRV+VD+VI +   + IPV  E++ L Q +  FV WPRRLVI   +K    ++ ++   Q SKHTDVHV+I+LLNRY ML
Subjt:  MYESPSQNATIHGVPLGVENVRVVVDMVIGDDCALLIPVNDELQMLYQAVDNFVGWPRRLVITVGDKEEPPAK-AKPIVQSSKHTDVHVTIRLLNRYAML

Query:  SMQQEDTPTINMPERILGKEASVFLHCEDIMQYCGNVEIGYSCILTYITGGVMYYALEVFELMNTCIIFFGYLWTVLDPEITNKFFVVDQATISSYVKSQ
        SMQ EDT  IN+ + I GKE +++L   DIMQYC  +EIGYSCILTYI                       YLW V + EIT KF +VD ATIS YVKSQ
Subjt:  SMQQEDTPTINMPERILGKEASVFLHCEDIMQYCGNVEIGYSCILTYITGGVMYYALEVFELMNTCIIFFGYLWTVLDPEITNKFFVVDQATISSYVKSQ

Query:  ELRSRNLSNRLDMVDLDQLVLIPFNTGHHWMLIAIQPRENTVYILNSLRSKVEEEFSGTINTVL
        E R RNL+NRL+MV+L+QLVLIP+ +G HWMLI I  REN VY+L+SLR K++E++   INT L
Subjt:  ELRSRNLSNRLDMVDLDQLVLIPFNTGHHWMLIAIQPRENTVYILNSLRSKVEEEFSGTINTVL

A0A6J1C4J7 uncharacterized protein LOC111007859 isoform X21.2e-7346.67Show/hide
Query:  MYESPSQNATIHGVPLGVENVRVVVDMVIGDDCALLIPVNDELQMLYQAVDNFVGWPRRLVITVGDKEEPPAK-AKPIVQSSKHTDVHVTIRLLNRYAML
        ++++  Q  T+HGVPLGV+NVRV+VD+VI +   + IPV  E++ L Q +  FV WPRRLVI   +K    ++ ++   Q SKHTDVHV+I+LLNRY ML
Subjt:  MYESPSQNATIHGVPLGVENVRVVVDMVIGDDCALLIPVNDELQMLYQAVDNFVGWPRRLVITVGDKEEPPAK-AKPIVQSSKHTDVHVTIRLLNRYAML

Query:  SMQQEDTPTINMPERILGKEASVFLHCEDIMQYCGNVEIGYSCILTYITGGVMYYALEVFELMNTCIIFFGYLWTVLDPEITNKFFVVDQATISSYVKSQ
        SMQ EDT  IN+ + I GKE +++L   DIMQYC  +EIGYSCILTYI                       YLW V + EIT KF +VD ATIS YVKSQ
Subjt:  SMQQEDTPTINMPERILGKEASVFLHCEDIMQYCGNVEIGYSCILTYITGGVMYYALEVFELMNTCIIFFGYLWTVLDPEITNKFFVVDQATISSYVKSQ

Query:  ELRSRNLSNRLDMVDLDQLVLIPFNTGHHWMLIAIQPRENTVYILNSLRSKVEEEFSGTINTVL------------------------------------
        E R RNL+NRL+MV+L+QLVLIP+ +G HWMLI I  REN VY+L+SLR K++E++   INT L                                    
Subjt:  ELRSRNLSNRLDMVDLDQLVLIPFNTGHHWMLIAIQPRENTVYILNSLRSKVEEEFSGTINTVL------------------------------------

Query:  ---------------VNQFNTKNAFTQDEIDKVRIEWANFVGGFV
                        N FNTKNA+ Q+EID+VRIEWA+FVGG V
Subjt:  ---------------VNQFNTKNAFTQDEIDKVRIEWANFVGGFV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTATGAGTCGCCTTCACAAAATGCAACCATCCATGGAGTTCCATTAGGAGTCGAAAATGTTCGAGTTGTGGTGGACATGGTCATAGGTGATGATTGTGCATTACTGAT
TCCTGTGAACGATGAACTACAAATGTTGTATCAAGCAGTCGATAATTTTGTGGGATGGCCTCGCAGACTTGTTATTACTGTAGGTGACAAAGAGGAGCCTCCTGCCAAAG
CAAAGCCTATAGTACAATCAAGCAAACATACAGATGTCCATGTTACCATTAGGCTCCTAAATAGATACGCGATGCTTTCGATGCAACAAGAAGATACACCAACGATCAAT
ATGCCCGAGCGTATCTTGGGAAAGGAAGCATCAGTATTTTTACATTGCGAAGACATTATGCAATATTGTGGGAATGTTGAGATAGGTTACTCATGCATACTCACGTACAT
TACGGGGGGGGTGATGTATTATGCTTTGGAAGTGTTTGAGTTGATGAATACTTGTATCATTTTCTTTGGGTACCTCTGGACTGTACTTGATCCCGAGATAACAAACAAGT
TTTTTGTGGTTGATCAAGCAACAATCTCGTCGTACGTGAAGTCTCAAGAACTTCGTTCTAGAAATCTATCTAACAGACTAGATATGGTTGATTTGGATCAACTAGTTCTC
ATTCCCTTTAACACTGGTCATCATTGGATGTTGATCGCGATCCAGCCTCGGGAAAACACTGTGTATATATTGAATTCTTTGCGTAGTAAAGTTGAAGAAGAGTTTAGTGG
AACTATAAATACTGTCCTCGTCAATCAGTTTAACACAAAGAATGCATTTACACAAGACGAGATCGACAAGGTTCGTATAGAATGGGCAAATTTTGTTGGAGGATTTGTTT
AA
mRNA sequenceShow/hide mRNA sequence
ATGTATGAGTCGCCTTCACAAAATGCAACCATCCATGGAGTTCCATTAGGAGTCGAAAATGTTCGAGTTGTGGTGGACATGGTCATAGGTGATGATTGTGCATTACTGAT
TCCTGTGAACGATGAACTACAAATGTTGTATCAAGCAGTCGATAATTTTGTGGGATGGCCTCGCAGACTTGTTATTACTGTAGGTGACAAAGAGGAGCCTCCTGCCAAAG
CAAAGCCTATAGTACAATCAAGCAAACATACAGATGTCCATGTTACCATTAGGCTCCTAAATAGATACGCGATGCTTTCGATGCAACAAGAAGATACACCAACGATCAAT
ATGCCCGAGCGTATCTTGGGAAAGGAAGCATCAGTATTTTTACATTGCGAAGACATTATGCAATATTGTGGGAATGTTGAGATAGGTTACTCATGCATACTCACGTACAT
TACGGGGGGGGTGATGTATTATGCTTTGGAAGTGTTTGAGTTGATGAATACTTGTATCATTTTCTTTGGGTACCTCTGGACTGTACTTGATCCCGAGATAACAAACAAGT
TTTTTGTGGTTGATCAAGCAACAATCTCGTCGTACGTGAAGTCTCAAGAACTTCGTTCTAGAAATCTATCTAACAGACTAGATATGGTTGATTTGGATCAACTAGTTCTC
ATTCCCTTTAACACTGGTCATCATTGGATGTTGATCGCGATCCAGCCTCGGGAAAACACTGTGTATATATTGAATTCTTTGCGTAGTAAAGTTGAAGAAGAGTTTAGTGG
AACTATAAATACTGTCCTCGTCAATCAGTTTAACACAAAGAATGCATTTACACAAGACGAGATCGACAAGGTTCGTATAGAATGGGCAAATTTTGTTGGAGGATTTGTTT
AA
Protein sequenceShow/hide protein sequence
MYESPSQNATIHGVPLGVENVRVVVDMVIGDDCALLIPVNDELQMLYQAVDNFVGWPRRLVITVGDKEEPPAKAKPIVQSSKHTDVHVTIRLLNRYAMLSMQQEDTPTIN
MPERILGKEASVFLHCEDIMQYCGNVEIGYSCILTYITGGVMYYALEVFELMNTCIIFFGYLWTVLDPEITNKFFVVDQATISSYVKSQELRSRNLSNRLDMVDLDQLVL
IPFNTGHHWMLIAIQPRENTVYILNSLRSKVEEEFSGTINTVLVNQFNTKNAFTQDEIDKVRIEWANFVGGFV