CuGenDBv2

Lag0017698 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0017698
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: ULP_PROTEASE domain-containing protein
Genome location: chr5:7087902..7099706
RNA-Seq Expression: Lag0017698
Synteny: Lag0017698
Gene Ontology terms:
  GO:0006508 - proteolysis (biological process)
  GO:0016020 - membrane (cellular component)
  GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domains:
  IPR003653 - Ulp1 protease family, C-terminal catalytic domain
  IPR038765 - Papain-like cysteine peptidase superfamily
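The genome location field uses a `chromosome:start..end` layout. As a minimal sketch, it can be split into its parts as follows. The function name `parse_location` is hypothetical, and the span calculation assumes 1-based inclusive coordinates, which this record does not state.

```python
import re


def parse_location(location: str):
    """Split a 'chrom:start..end' string into its parts.

    Assumption: coordinates are 1-based and inclusive, so the
    span is end - start + 1. The record itself does not say so.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return chrom, start, end, end - start + 1


chrom, start, end, span = parse_location("chr5:7087902..7099706")
print(chrom, start, end, span)
```

Under the inclusive-coordinate assumption, the locus spans 11805 positions on chr5.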


Homology
GenBank top hits: e value, % identity, alignment
XP_008451868.1 PREDICTED: uncharacterized protein LOC103493028 isoform X1 [Cucumis melo]; e value: 2.4e-94; % identity: 58.31
Query:  VTDCALPIPVNDELQTLYQAVGNFVGWPRRFVITVDDKEEPP-TKAKPVVQSSKHTEAHVTIKLLNRYAVFSMRQEDTLMITMPERILGKEASVFLHRED
        V D ALPIP+  +++TL QA+GNFV WPR+ VI   +K+ P  T ++   QSSK+T+ HVTIKLLNRYA+ +M+ ED + I++ E I GKE +++L R+D
Subjt:  VTDCALPIPVNDELQTLYQAVGNFVGWPRRFVITVDDKEEPP-TKAKPVVQSSKHTEAHVTIKLLNRYAVFSMRQEDTLMITMPERILGKEASVFLHRED

Query:  IQQYCGNVEIGYSCILTYITYLWTVLDPEITNKFFLVDQTTISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTGR-HWMLIAIQPRENTVYILNSLRS
        I QYCG  EIGYSCILTYI  LW V + EIT +F LVDQ TISS++KSQE RSRNL NRL+M +LDQLVLIP+NTG  HW+LI I  +EN VY+++ LRS
Subjt:  IQQYCGNVEIGYSCILTYITYLWTVLDPEITNKFFLVDQTTISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTGR-HWMLIAIQPRENTVYILNSLRS

Query:  KVEEEFSGTINTGLRMWQAKHSLPQYRSAISWKLVKCPRQTGSTECGYFVQKYIREIMHNSTTPITKLFNTKNAFTQDEIDEVRIEWANFVGGFV
        K+  EF G IN  L+ WQ +HS   YRS I WK +KCPR  GS ECGY+VQKY+RE++ N+ T I  LFNT +A+ Q+EID VR+EWA FV  FV
Subjt:  KVEEEFSGTINTGLRMWQAKHSLPQYRSAISWKLVKCPRQTGSTECGYFVQKYIREIMHNSTTPITKLFNTKNAFTQDEIDEVRIEWANFVGGFV

XP_022136076.1 uncharacterized protein LOC111007859 isoform X1 [Momordica charantia]; e value: 1.5e-96; % identity: 59.17
Query:  LPIPVNDELQTLYQAVGNFVGWPRRFVITVDDKEEPPTKAKPV-VQSSKHTEAHVTIKLLNRYAVFSMRQEDTLMITMPERILGKEASVFLHREDIQQYC
        +PIPV  E++TL Q +G FV WPRR VI  ++K    ++      Q SKHT+ HV+IKLLNRY + SM+ EDT+ I + + I GKE +++L R DI QYC
Subjt:  LPIPVNDELQTLYQAVGNFVGWPRRFVITVDDKEEPPTKAKPV-VQSSKHTEAHVTIKLLNRYAVFSMRQEDTLMITMPERILGKEASVFLHREDIQQYC

Query:  GNVEIGYSCILTYITYLWTVLDPEITNKFFLVDQTTISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTGRHWMLIAIQPRENTVYILNSLRSKVEEEF
          +EIGYSCILTYI YLW V + EIT KF +VD  TIS YVKSQE R RNL+NRL+MV+L+QLVLIP+ +G HWMLI I  REN VY+L+SLR K++E++
Subjt:  GNVEIGYSCILTYITYLWTVLDPEITNKFFLVDQTTISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTGRHWMLIAIQPRENTVYILNSLRSKVEEEF

Query:  SGTINTGLRMWQAKHSLPQYRSAISWKLVKCPRQTGSTECGYFVQKYIREIMHNSTTPITKLFNTKNAFTQDEIDEVRIEWANFVGGFV
           INT L++WQAKHS+ +YR+   WK +KCP Q GS ECGY+VQKYIREI+ N++T I+ +FNTKNA+ Q+EIDEVRIEWA+FVGG V
Subjt:  SGTINTGLRMWQAKHSLPQYRSAISWKLVKCPRQTGSTECGYFVQKYIREIMHNSTTPITKLFNTKNAFTQDEIDEVRIEWANFVGGFV

XP_022136077.1 uncharacterized protein LOC111007859 isoform X2 [Momordica charantia]; e value: 1.5e-96; % identity: 59.17
Query:  LPIPVNDELQTLYQAVGNFVGWPRRFVITVDDKEEPPTKAKPV-VQSSKHTEAHVTIKLLNRYAVFSMRQEDTLMITMPERILGKEASVFLHREDIQQYC
        +PIPV  E++TL Q +G FV WPRR VI  ++K    ++      Q SKHT+ HV+IKLLNRY + SM+ EDT+ I + + I GKE +++L R DI QYC
Subjt:  LPIPVNDELQTLYQAVGNFVGWPRRFVITVDDKEEPPTKAKPV-VQSSKHTEAHVTIKLLNRYAVFSMRQEDTLMITMPERILGKEASVFLHREDIQQYC

Query:  GNVEIGYSCILTYITYLWTVLDPEITNKFFLVDQTTISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTGRHWMLIAIQPRENTVYILNSLRSKVEEEF
          +EIGYSCILTYI YLW V + EIT KF +VD  TIS YVKSQE R RNL+NRL+MV+L+QLVLIP+ +G HWMLI I  REN VY+L+SLR K++E++
Subjt:  GNVEIGYSCILTYITYLWTVLDPEITNKFFLVDQTTISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTGRHWMLIAIQPRENTVYILNSLRSKVEEEF

Query:  SGTINTGLRMWQAKHSLPQYRSAISWKLVKCPRQTGSTECGYFVQKYIREIMHNSTTPITKLFNTKNAFTQDEIDEVRIEWANFVGGFV
           INT L++WQAKHS+ +YR+   WK +KCP Q GS ECGY+VQKYIREI+ N++T I+ +FNTKNA+ Q+EIDEVRIEWA+FVGG V
Subjt:  SGTINTGLRMWQAKHSLPQYRSAISWKLVKCPRQTGSTECGYFVQKYIREIMHNSTTPITKLFNTKNAFTQDEIDEVRIEWANFVGGFV

XP_038895921.1 uncharacterized protein LOC120084092 isoform X1 [Benincasa hispida]; e value: 3.1e-102; % identity: 63.14
Query:  DCALPIPVNDELQTLYQAVGNFVGWPRRFVITVDDKEEP-PTKAKPVVQSSKHTEAHVTIKLLNRYAVFSMRQEDTLMITMPERILGKEASVFLHREDIQ
        D ALPIP  D+++TL QA+GNFV WPR+ VIT  +K+ P PT +K + QSSK+T+ HVTIKLLNRYA+ SM+ +D + I + E+ILGKE +++L R+DI 
Subjt:  DCALPIPVNDELQTLYQAVGNFVGWPRRFVITVDDKEEP-PTKAKPVVQSSKHTEAHVTIKLLNRYAVFSMRQEDTLMITMPERILGKEASVFLHREDIQ

Query:  QYCGNVEIGYSCILTYITYLWTVLDPEITNKFFLVDQTTISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTGR-HWMLIAIQPRENTVYILNSLRSKV
        QYCG  EIGYSCIL YI  LW   D EIT KF +VDQ TISS+VK QELRS+NL NRL+MV LDQLVLIP+NTG  HW+LI I  +EN VY+++SLRSK+
Subjt:  QYCGNVEIGYSCILTYITYLWTVLDPEITNKFFLVDQTTISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTGR-HWMLIAIQPRENTVYILNSLRSKV

Query:  EEEFSGTINTGLRMWQAKHSLPQYRSAISWKLVKCPRQTGSTECGYFVQKYIREIMHNSTTPITKLFNTKNAFTQDEIDEVRIEWANFVGGFV
         EEF G INT L+ WQAKHSL QYR+ I WK +KCPRQ G+ ECGY+VQKYIREI+ NS T I+ LFNT+ A+ Q EID VR+EWA FV  FV
Subjt:  EEEFSGTINTGLRMWQAKHSLPQYRSAISWKLVKCPRQTGSTECGYFVQKYIREIMHNSTTPITKLFNTKNAFTQDEIDEVRIEWANFVGGFV

XP_038895930.1 uncharacterized protein LOC120084092 isoform X2 [Benincasa hispida]; e value: 2.1e-103; % identity: 63.36
Query:  DCALPIPVNDELQTLYQAVGNFVGWPRRFVITVDDKEEP-PTKAKPVVQSSKHTEAHVTIKLLNRYAVFSMRQEDTLMITMPERILGKEASVFLHREDIQ
        D ALPIP  D+++TL QA+GNFV WPR+ VIT  +K+ P PT +K + QSSK+T+ HVTIKLLNRYA+ SM+ +D + I + E+ILGKE +++L R+DI 
Subjt:  DCALPIPVNDELQTLYQAVGNFVGWPRRFVITVDDKEEP-PTKAKPVVQSSKHTEAHVTIKLLNRYAVFSMRQEDTLMITMPERILGKEASVFLHREDIQ

Query:  QYCGNVEIGYSCILTYITYLWTVLDPEITNKFFLVDQTTISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTGRHWMLIAIQPRENTVYILNSLRSKVE
        QYCG  EIGYSCIL YI  LW   D EIT KF +VDQ TISS+VK QELRS+NL NRL+MV LDQLVLIP+NTG HW+LI I  +EN VY+++SLRSK+ 
Subjt:  QYCGNVEIGYSCILTYITYLWTVLDPEITNKFFLVDQTTISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTGRHWMLIAIQPRENTVYILNSLRSKVE

Query:  EEFSGTINTGLRMWQAKHSLPQYRSAISWKLVKCPRQTGSTECGYFVQKYIREIMHNSTTPITKLFNTKNAFTQDEIDEVRIEWANFVGGFV
        EEF G INT L+ WQAKHSL QYR+ I WK +KCPRQ G+ ECGY+VQKYIREI+ NS T I+ LFNT+ A+ Q EID VR+EWA FV  FV
Subjt:  EEFSGTINTGLRMWQAKHSLPQYRSAISWKLVKCPRQTGSTECGYFVQKYIREIMHNSTTPITKLFNTKNAFTQDEIDEVRIEWANFVGGFV

TrEMBL top hits: e value, % identity, alignment
A0A1S3BRX5 uncharacterized protein LOC103493028 isoform X1; e value: 1.1e-94; % identity: 58.31
Query:  VTDCALPIPVNDELQTLYQAVGNFVGWPRRFVITVDDKEEPP-TKAKPVVQSSKHTEAHVTIKLLNRYAVFSMRQEDTLMITMPERILGKEASVFLHRED
        V D ALPIP+  +++TL QA+GNFV WPR+ VI   +K+ P  T ++   QSSK+T+ HVTIKLLNRYA+ +M+ ED + I++ E I GKE +++L R+D
Subjt:  VTDCALPIPVNDELQTLYQAVGNFVGWPRRFVITVDDKEEPP-TKAKPVVQSSKHTEAHVTIKLLNRYAVFSMRQEDTLMITMPERILGKEASVFLHRED

Query:  IQQYCGNVEIGYSCILTYITYLWTVLDPEITNKFFLVDQTTISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTGR-HWMLIAIQPRENTVYILNSLRS
        I QYCG  EIGYSCILTYI  LW V + EIT +F LVDQ TISS++KSQE RSRNL NRL+M +LDQLVLIP+NTG  HW+LI I  +EN VY+++ LRS
Subjt:  IQQYCGNVEIGYSCILTYITYLWTVLDPEITNKFFLVDQTTISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTGR-HWMLIAIQPRENTVYILNSLRS

Query:  KVEEEFSGTINTGLRMWQAKHSLPQYRSAISWKLVKCPRQTGSTECGYFVQKYIREIMHNSTTPITKLFNTKNAFTQDEIDEVRIEWANFVGGFV
        K+  EF G IN  L+ WQ +HS   YRS I WK +KCPR  GS ECGY+VQKY+RE++ N+ T I  LFNT +A+ Q+EID VR+EWA FV  FV
Subjt:  KVEEEFSGTINTGLRMWQAKHSLPQYRSAISWKLVKCPRQTGSTECGYFVQKYIREIMHNSTTPITKLFNTKNAFTQDEIDEVRIEWANFVGGFV

A0A5D3C2U9 Transposase; e value: 1.2e-86; % identity: 56.25
Query:  LPIPVNDELQTLYQAVGNFVGWPRRFVITVDDKEEPPTKAKPVVQSSKHTEAHVTIKLLNRYAVFSMRQEDTLMITMPERILGKEASVFLHREDIQQYCG
        +P PV  +++TL QA+GN + WPRR V T++DK+E  T+ K VV  S +T+ +  IKLLNR+A+ +M   D + I M E I G +  V+L RED+  YCG
Subjt:  LPIPVNDELQTLYQAVGNFVGWPRRFVITVDDKEEPPTKAKPVVQSSKHTEAHVTIKLLNRYAVFSMRQEDTLMITMPERILGKEASVFLHREDIQQYCG

Query:  NVEIGYSCILTYITYLWTVLDPEITNKFFLVDQTTISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTGRHWMLIAIQPRENTVYILNSLRSKVEEEFS
         VEIGY CIL YIT LW   D      FF++DQ+ ISS++K ++LRSRNL+N+L+ V+L+Q VLIP+NTG HWML  I  REN VY+L+SLRSKV E+  
Subjt:  NVEIGYSCILTYITYLWTVLDPEITNKFFLVDQTTISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTGRHWMLIAIQPRENTVYILNSLRSKVEEEFS

Query:  GTINTGLRMWQAKHSLPQYRSAISWKLVKCPRQTGSTECGYFVQKYIREIMHNSTTPITKLFNTKNAFTQDEIDEVRIEWANFVGGFV
        G IN GL+ WQAKH L +YRS   W+ VKCPRQ  S  CGY+VQKYI EI+HNS+T IT LFNTKNA+ Q+EIDE+R EWA FV  FV
Subjt:  GTINTGLRMWQAKHSLPQYRSAISWKLVKCPRQTGSTECGYFVQKYIREIMHNSTTPITKLFNTKNAFTQDEIDEVRIEWANFVGGFV

A0A5D3CYL9 ULP_PROTEASE domain-containing protein; e value: 1.1e-94; % identity: 58.31
Query:  VTDCALPIPVNDELQTLYQAVGNFVGWPRRFVITVDDKEEPP-TKAKPVVQSSKHTEAHVTIKLLNRYAVFSMRQEDTLMITMPERILGKEASVFLHRED
        V D ALPIP+  +++TL QA+GNFV WPR+ VI   +K+ P  T ++   QSSK+T+ HVTIKLLNRYA+ +M+ ED + I++ E I GKE +++L R+D
Subjt:  VTDCALPIPVNDELQTLYQAVGNFVGWPRRFVITVDDKEEPP-TKAKPVVQSSKHTEAHVTIKLLNRYAVFSMRQEDTLMITMPERILGKEASVFLHRED

Query:  IQQYCGNVEIGYSCILTYITYLWTVLDPEITNKFFLVDQTTISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTGR-HWMLIAIQPRENTVYILNSLRS
        I QYCG  EIGYSCILTYI  LW V + EIT +F LVDQ TISS++KSQE RSRNL NRL+M +LDQLVLIP+NTG  HW+LI I  +EN VY+++ LRS
Subjt:  IQQYCGNVEIGYSCILTYITYLWTVLDPEITNKFFLVDQTTISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTGR-HWMLIAIQPRENTVYILNSLRS

Query:  KVEEEFSGTINTGLRMWQAKHSLPQYRSAISWKLVKCPRQTGSTECGYFVQKYIREIMHNSTTPITKLFNTKNAFTQDEIDEVRIEWANFVGGFV
        K+  EF G IN  L+ WQ +HS   YRS I WK +KCPR  GS ECGY+VQKY+RE++ N+ T I  LFNT +A+ Q+EID VR+EWA FV  FV
Subjt:  KVEEEFSGTINTGLRMWQAKHSLPQYRSAISWKLVKCPRQTGSTECGYFVQKYIREIMHNSTTPITKLFNTKNAFTQDEIDEVRIEWANFVGGFV

A0A6J1C2H7 uncharacterized protein LOC111007859 isoform X1; e value: 7.2e-97; % identity: 59.17
Query:  LPIPVNDELQTLYQAVGNFVGWPRRFVITVDDKEEPPTKAKPV-VQSSKHTEAHVTIKLLNRYAVFSMRQEDTLMITMPERILGKEASVFLHREDIQQYC
        +PIPV  E++TL Q +G FV WPRR VI  ++K    ++      Q SKHT+ HV+IKLLNRY + SM+ EDT+ I + + I GKE +++L R DI QYC
Subjt:  LPIPVNDELQTLYQAVGNFVGWPRRFVITVDDKEEPPTKAKPV-VQSSKHTEAHVTIKLLNRYAVFSMRQEDTLMITMPERILGKEASVFLHREDIQQYC

Query:  GNVEIGYSCILTYITYLWTVLDPEITNKFFLVDQTTISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTGRHWMLIAIQPRENTVYILNSLRSKVEEEF
          +EIGYSCILTYI YLW V + EIT KF +VD  TIS YVKSQE R RNL+NRL+MV+L+QLVLIP+ +G HWMLI I  REN VY+L+SLR K++E++
Subjt:  GNVEIGYSCILTYITYLWTVLDPEITNKFFLVDQTTISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTGRHWMLIAIQPRENTVYILNSLRSKVEEEF

Query:  SGTINTGLRMWQAKHSLPQYRSAISWKLVKCPRQTGSTECGYFVQKYIREIMHNSTTPITKLFNTKNAFTQDEIDEVRIEWANFVGGFV
           INT L++WQAKHS+ +YR+   WK +KCP Q GS ECGY+VQKYIREI+ N++T I+ +FNTKNA+ Q+EIDEVRIEWA+FVGG V
Subjt:  SGTINTGLRMWQAKHSLPQYRSAISWKLVKCPRQTGSTECGYFVQKYIREIMHNSTTPITKLFNTKNAFTQDEIDEVRIEWANFVGGFV

A0A6J1C4J7 uncharacterized protein LOC111007859 isoform X2; e value: 7.2e-97; % identity: 59.17
Query:  LPIPVNDELQTLYQAVGNFVGWPRRFVITVDDKEEPPTKAKPV-VQSSKHTEAHVTIKLLNRYAVFSMRQEDTLMITMPERILGKEASVFLHREDIQQYC
        +PIPV  E++TL Q +G FV WPRR VI  ++K    ++      Q SKHT+ HV+IKLLNRY + SM+ EDT+ I + + I GKE +++L R DI QYC
Subjt:  LPIPVNDELQTLYQAVGNFVGWPRRFVITVDDKEEPPTKAKPV-VQSSKHTEAHVTIKLLNRYAVFSMRQEDTLMITMPERILGKEASVFLHREDIQQYC

Query:  GNVEIGYSCILTYITYLWTVLDPEITNKFFLVDQTTISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTGRHWMLIAIQPRENTVYILNSLRSKVEEEF
          +EIGYSCILTYI YLW V + EIT KF +VD  TIS YVKSQE R RNL+NRL+MV+L+QLVLIP+ +G HWMLI I  REN VY+L+SLR K++E++
Subjt:  GNVEIGYSCILTYITYLWTVLDPEITNKFFLVDQTTISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTGRHWMLIAIQPRENTVYILNSLRSKVEEEF

Query:  SGTINTGLRMWQAKHSLPQYRSAISWKLVKCPRQTGSTECGYFVQKYIREIMHNSTTPITKLFNTKNAFTQDEIDEVRIEWANFVGGFV
           INT L++WQAKHS+ +YR+   WK +KCP Q GS ECGY+VQKYIREI+ N++T I+ +FNTKNA+ Q+EIDEVRIEWA+FVGG V
Subjt:  SGTINTGLRMWQAKHSLPQYRSAISWKLVKCPRQTGSTECGYFVQKYIREIMHNSTTPITKLFNTKNAFTQDEIDEVRIEWANFVGGFV

SwissProt top hits: e value, % identity, alignment
No hits found
Arabidopsis top hits: e value, % identity, alignment
No hits found

Sequences
CDS sequence
ATGGTCAGCATTTCAGATCTGTGGAGAGTTTCAGATCTCAAGTGGGGTTTCTCCAAACTTACAGACCGTCGGCCAGCACTTCTTCGTGGGCTAGATCTCTGTTTTTGGGT
AACAGATTGTGCATTACCGATTCCTGTGAACGATGAACTACAAACGTTGTATCAAGCGGTCGGTAATTTTGTGGGATGGCCTCGCAGATTTGTTATTACTGTAGATGACA
AAGAGGAGCCTCCTACCAAAGCTAAGCCCGTAGTACAATCGAGCAAACACACAGAGGCCCATGTTACTATTAAGCTCCTAAATAGATACGCAGTGTTTTCGATGAGACAA
GAAGATACACTAATGATCACAATGCCCGAGCGTATCTTGGGAAAGGAAGCATCGGTATTTTTACATCGCGAAGACATCCAACAATATTGTGGGAATGTGGAGATAGGTTA
CTCATGCATACTCACGTACATTACGTACCTTTGGACTGTACTTGATCCCGAGATAACAAATAAGTTTTTTCTGGTTGATCAAACAACAATCTCATCGTACGTGAAGTCTC
AAGAACTTCGTTCTAGAAATCTATCTAACAGACTAGATATGGTTGATTTGGATCAACTAGTTCTCATTCCCTTTAACACTGGTCGTCATTGGATGTTGATCGCGATCCAG
CCTCGAGAAAACACTGTGTATATATTGAATTCTCTGCGTAGTAAAGTTGAAGAAGAGTTTAGTGGAACTATAAATACGGGGTTGAGAATGTGGCAAGCTAAGCACTCGCT
TCCTCAATATCGATCTGCTATTAGTTGGAAACTAGTGAAGTGTCCTCGTCAAACAGGTTCGACAGAGTGTGGGTATTTTGTGCAAAAATATATAAGAGAAATAATGCACA
ACTCTACTACCCCTATAACTAAACTTTTTAACACAAAGAATGCATTTACACAAGACGAGATTGACGAGGTTCGTATAGAATGGGCAAATTTTGTTGGAGGATTTGTATTA
AATTCACCAACTGCTTGTGAAATAATCACGAACTACTGCGAACGCCACCACTATGGCTACACCGGAGTTCCTCTCGGGCCAGGAGAGGACGGCACGCCTTTGTTCAAGCC
CCGGAATCAGCCCTTAAGGGAACACACATCTACTTACCCCAATAGGGGAATGAGTGAATTTCATCTTGTACTATTATATTCCCAACCCCCATTCAGTCTTGCCCCTGAAA
TGGATACCCCCACACGCATGTCTCCTACATGGATGCTTTGGATCATTGCATCTGTATCAAATACAAGGTGGGTCGTATCACATAGTGTCACCAGGATAAGAGTATCTGCC
TTACAGCATCAAGTACGCTTACCATATCTTATTGCATTGCAAGTCCCAATAGACAATTCCAATTGGTTTCACGTGGTCATTCTAATTTGGTCACGTCTTCCCTTCCAAAC
AAATTTACCATTCCTGTCACGTGAAGGTCAGACGAGTTTCTCCTACTCGGATATTGACATCAACAGGCTTCGTTTTGTTTGGAGAGGACAATCATATTACAATCTCCTAG
CAATTTACTCATTTGTCTACAGAATTCAGGGACTAAGCAGGACAGCGCAACACCAACACGAAAGCTTGACCAGGAAGCCGACCCTGGAGGAGAGCGAGCCGAAGGGATTG
GGTCGTCTCGACCCAATCCCAAGGTCGAGGCTGAGCATATGGTCAGCCTCGGACCAAGGTCGGGGCCGACCACTCGACCCACTCGTGTGGGTCGAGTTCCCTCCCCTTCG
TTCGGTCCCTGGTGCCTCTGACTGTCTCGGTTCCACCTGGTTCAGCCCGAATCGCCTCCGAATGCCTAAAACCCTAGAGTAG
mRNA sequence
ATGGTCAGCATTTCAGATCTGTGGAGAGTTTCAGATCTCAAGTGGGGTTTCTCCAAACTTACAGACCGTCGGCCAGCACTTCTTCGTGGGCTAGATCTCTGTTTTTGGGT
AACAGATTGTGCATTACCGATTCCTGTGAACGATGAACTACAAACGTTGTATCAAGCGGTCGGTAATTTTGTGGGATGGCCTCGCAGATTTGTTATTACTGTAGATGACA
AAGAGGAGCCTCCTACCAAAGCTAAGCCCGTAGTACAATCGAGCAAACACACAGAGGCCCATGTTACTATTAAGCTCCTAAATAGATACGCAGTGTTTTCGATGAGACAA
GAAGATACACTAATGATCACAATGCCCGAGCGTATCTTGGGAAAGGAAGCATCGGTATTTTTACATCGCGAAGACATCCAACAATATTGTGGGAATGTGGAGATAGGTTA
CTCATGCATACTCACGTACATTACGTACCTTTGGACTGTACTTGATCCCGAGATAACAAATAAGTTTTTTCTGGTTGATCAAACAACAATCTCATCGTACGTGAAGTCTC
AAGAACTTCGTTCTAGAAATCTATCTAACAGACTAGATATGGTTGATTTGGATCAACTAGTTCTCATTCCCTTTAACACTGGTCGTCATTGGATGTTGATCGCGATCCAG
CCTCGAGAAAACACTGTGTATATATTGAATTCTCTGCGTAGTAAAGTTGAAGAAGAGTTTAGTGGAACTATAAATACGGGGTTGAGAATGTGGCAAGCTAAGCACTCGCT
TCCTCAATATCGATCTGCTATTAGTTGGAAACTAGTGAAGTGTCCTCGTCAAACAGGTTCGACAGAGTGTGGGTATTTTGTGCAAAAATATATAAGAGAAATAATGCACA
ACTCTACTACCCCTATAACTAAACTTTTTAACACAAAGAATGCATTTACACAAGACGAGATTGACGAGGTTCGTATAGAATGGGCAAATTTTGTTGGAGGATTTGTATTA
AATTCACCAACTGCTTGTGAAATAATCACGAACTACTGCGAACGCCACCACTATGGCTACACCGGAGTTCCTCTCGGGCCAGGAGAGGACGGCACGCCTTTGTTCAAGCC
CCGGAATCAGCCCTTAAGGGAACACACATCTACTTACCCCAATAGGGGAATGAGTGAATTTCATCTTGTACTATTATATTCCCAACCCCCATTCAGTCTTGCCCCTGAAA
TGGATACCCCCACACGCATGTCTCCTACATGGATGCTTTGGATCATTGCATCTGTATCAAATACAAGGTGGGTCGTATCACATAGTGTCACCAGGATAAGAGTATCTGCC
TTACAGCATCAAGTACGCTTACCATATCTTATTGCATTGCAAGTCCCAATAGACAATTCCAATTGGTTTCACGTGGTCATTCTAATTTGGTCACGTCTTCCCTTCCAAAC
AAATTTACCATTCCTGTCACGTGAAGGTCAGACGAGTTTCTCCTACTCGGATATTGACATCAACAGGCTTCGTTTTGTTTGGAGAGGACAATCATATTACAATCTCCTAG
CAATTTACTCATTTGTCTACAGAATTCAGGGACTAAGCAGGACAGCGCAACACCAACACGAAAGCTTGACCAGGAAGCCGACCCTGGAGGAGAGCGAGCCGAAGGGATTG
GGTCGTCTCGACCCAATCCCAAGGTCGAGGCTGAGCATATGGTCAGCCTCGGACCAAGGTCGGGGCCGACCACTCGACCCACTCGTGTGGGTCGAGTTCCCTCCCCTTCG
TTCGGTCCCTGGTGCCTCTGACTGTCTCGGTTCCACCTGGTTCAGCCCGAATCGCCTCCGAATGCCTAAAACCCTAGAGTAG
Protein sequence
MVSISDLWRVSDLKWGFSKLTDRRPALLRGLDLCFWVTDCALPIPVNDELQTLYQAVGNFVGWPRRFVITVDDKEEPPTKAKPVVQSSKHTEAHVTIKLLNRYAVFSMRQ
EDTLMITMPERILGKEASVFLHREDIQQYCGNVEIGYSCILTYITYLWTVLDPEITNKFFLVDQTTISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTGRHWMLIAIQ
PRENTVYILNSLRSKVEEEFSGTINTGLRMWQAKHSLPQYRSAISWKLVKCPRQTGSTECGYFVQKYIREIMHNSTTPITKLFNTKNAFTQDEIDEVRIEWANFVGGFVL
NSPTACEIITNYCERHHYGYTGVPLGPGEDGTPLFKPRNQPLREHTSTYPNRGMSEFHLVLLYSQPPFSLAPEMDTPTRMSPTWMLWIIASVSNTRWVVSHSVTRIRVSA
LQHQVRLPYLIALQVPIDNSNWFHVVILIWSRLPFQTNLPFLSREGQTSFSYSDIDINRLRFVWRGQSYYNLLAIYSFVYRIQGLSRTAQHQHESLTRKPTLEESEPKGL
GRLDPIPRSRLSIWSASDQGRGRPLDPLVWVEFPPLRSVPGASDCLGSTWFSPNRLRMPKTLE
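
The CDS and protein sequences listed above can be checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code (the record does not state which translation table was used); the function name `translate` is hypothetical, and only the first 21 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code (assumed), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing partial codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 21 nucleotides of the CDS sequence listed in this record.
print(translate("ATGGTCAGCATTTCAGATCTG"))  # MVSISDL
```

The output matches the first seven residues of the listed protein sequence. Passing the full CDS (with line breaks removed) to the same function allows the whole protein to be compared.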