CuGenDBv2

Lag0014453 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0014453
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr12:936023..949955
RNA-Seq Expression: Lag0014453
Synteny: Lag0014453
Gene Ontology terms:
  GO:0006508 - proteolysis (biological process)
  GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domains:
  IPR003653 - Ulp1 protease family, C-terminal catalytic domain
  IPR038765 - Papain-like cysteine peptidase superfamily
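The genome location above uses a `chromosome:start..end` notation. A minimal sketch of reading it programmatically follows; the helper names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(location: str):
    """Split 'chrom:start..end' into (chrom, start, end).

    Hypothetical helper, not part of any CuGenDB tooling.
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr12:936023..949955")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the gene spans 13933 bp on chr12; with half-open coordinates the figure would be one less.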


Homology
GenBank top hits | e value | % identity
XP_022141973.1 probable ubiquitin-like-specific protease 2A isoform X2 [Momordica charantia] | 3.8e-151 | 66.38
Query:  YSFASLPY--FWGLETFC--GRGSTREERKRD----ATCGLGGDCEAVDLWGLELENLSYLCEEWKERYGDDDEDLPAKFLALRFVPLELPQQENSFDCG
        Y F  + Y   W L   C  G     +++K D      C L  D       GL     +YLCEEWKER+GD D  + AKFLAL+FVPLELPQQENSFDCG
Subjt:  YSFASLPY--FWGLETFC--GRGSTREERKRD----ATCGLGGDCEAVDLWGLELENLSYLCEEWKERYGDDDEDLPAKFLALRFVPLELPQQENSFDCG

Query:  LFLLHYVELFLDGAPINFNPFKISKFSNFLSQDWFSPAEASLKRAHILQLIYEIMVSDHEKEFSGSIGKYSSSNVFDSDNDLSGQVYLEGAHTFTMTCPD
        LFLLHYVE FL+GAP+NF+PFKIS+FSNFL+QDWF P EASLKRAHILQLIY+IMV+DH KEFSGSIGKY SS V  SD+DLS  VYLE  HT T TC D
Subjt:  LFLLHYVELFLDGAPINFNPFKISKFSNFLSQDWFSPAEASLKRAHILQLIYEIMVSDHEKEFSGSIGKYSSSNVFDSDNDLSGQVYLEGAHTFTMTCPD

Query:  KFSRGGKELENEMSSL-----KRFKEFGSVSKVSSDRNYQRIGGQSRSVMSPIEENENGEMADSPQCLEDRHQASAIVSECSSASSFGQQFRELEISGEG
        KF+ G KE ENEMS+L     KRF+E G VSKVSSD NYQ+IGGQSRSVMSPIEE+ENGE++DSPQCLEDR+ ASAIVSECSSASSFGQQFRELEIS EG
Subjt:  KFSRGGKELENEMSSL-----KRFKEFGSVSKVSSDRNYQRIGGQSRSVMSPIEENENGEMADSPQCLEDRHQASAIVSECSSASSFGQQFRELEISGEG

Query:  RFSRNFEDKGRRLASPPSLGESRTVSELGQDRSPQATKRLNHPTEADELEILSTSSEELATCVVEDSEEYVVEDLEDCVVEDSEVANETNDGIE-IESGS
        RFSRN +DK RR +S PSLGES T+S  G+D SPQA KRL+HPTEADE E L+TSSEELATCVVEDSE       E+ +VEDSE A+E  +GIE ++SG+
Subjt:  RFSRNFEDKGRRLASPPSLGESRTVSELGQDRSPQATKRLNHPTEADELEILSTSSEELATCVVEDSEEYVVEDLEDCVVEDSEVANETNDGIE-IESGS

Query:  TMRDSKEIDVSSSSRNNSFLPRQVVESTANLDDNRQHDLLVSSEHSTHDSSKQHSAKRPGV
        +M DSKEI++SSS +N+SFL R+  ES+A+ +DNRQHDL+VS+ H+ + SSKQHSAKRP V
Subjt:  TMRDSKEIDVSSSSRNNSFLPRQVVESTANLDDNRQHDLLVSSEHSTHDSSKQHSAKRPGV

XP_022141975.1 probable ubiquitin-like-specific protease 2A isoform X4 [Momordica charantia] | 1.9e-150 | 72.46
Query:  SYLCEEWKERYGDDDEDLPAKFLALRFVPLELPQQENSFDCGLFLLHYVELFLDGAPINFNPFKISKFSNFLSQDWFSPAEASLKRAHILQLIYEIMVSD
        +YLCEEWKER+GD D  + AKFLAL+FVPLELPQQENSFDCGLFLLHYVE FL+GAP+NF+PFKIS+FSNFL+QDWF P EASLKRAHILQLIY+IMV+D
Subjt:  SYLCEEWKERYGDDDEDLPAKFLALRFVPLELPQQENSFDCGLFLLHYVELFLDGAPINFNPFKISKFSNFLSQDWFSPAEASLKRAHILQLIYEIMVSD

Query:  HEKEFSGSIGKYSSSNVFDSDNDLSGQVYLEGAHTFTMTCPDKFSRGGKELENEMSSL-----KRFKEFGSVSKVSSDRNYQRIGGQSRSVMSPIEENEN
        H KEFSGSIGKY SS V  SD+DLS  VYLE  HT T TC DKF+ G KE ENEMS+L     KRF+E G VSKVSSD NYQ+IGGQSRSVMSPIEE+EN
Subjt:  HEKEFSGSIGKYSSSNVFDSDNDLSGQVYLEGAHTFTMTCPDKFSRGGKELENEMSSL-----KRFKEFGSVSKVSSDRNYQRIGGQSRSVMSPIEENEN

Query:  GEMADSPQCLEDRHQASAIVSECSSASSFGQQFRELEISGEGRFSRNFEDKGRRLASPPSLGESRTVSELGQDRSPQATKRLNHPTEADELEILSTSSEE
        GE++DSPQCLEDR+ ASAIVSECSSASSFGQQFRELEIS EGRFSRN +DK RR +S PSLGES T+S  G+D SPQA KRL+HPTEADE E L+TSSEE
Subjt:  GEMADSPQCLEDRHQASAIVSECSSASSFGQQFRELEISGEGRFSRNFEDKGRRLASPPSLGESRTVSELGQDRSPQATKRLNHPTEADELEILSTSSEE

Query:  LATCVVEDSEEYVVEDLEDCVVEDSEVANETNDGIE-IESGSTMRDSKEIDVSSSSRNNSFLPRQVVESTANLDDNRQHDLLVSSEHSTHDSSKQHSAKR
        LATCVVEDSE       E+ +VEDSE A+E  +GIE ++SG++M DSKEI++SSS +N+SFL R+  ES+A+ +DNRQHDL+VS+ H+ + SSKQHSAKR
Subjt:  LATCVVEDSEEYVVEDLEDCVVEDSEVANETNDGIE-IESGSTMRDSKEIDVSSSSRNNSFLPRQVVESTANLDDNRQHDLLVSSEHSTHDSSKQHSAKR

Query:  PGV
        P V
Subjt:  PGV

XP_022141976.1 probable ubiquitin-like-specific protease 2A isoform X5 [Momordica charantia] | 1.9e-150 | 72.46
Query:  SYLCEEWKERYGDDDEDLPAKFLALRFVPLELPQQENSFDCGLFLLHYVELFLDGAPINFNPFKISKFSNFLSQDWFSPAEASLKRAHILQLIYEIMVSD
        +YLCEEWKER+GD D  + AKFLAL+FVPLELPQQENSFDCGLFLLHYVE FL+GAP+NF+PFKIS+FSNFL+QDWF P EASLKRAHILQLIY+IMV+D
Subjt:  SYLCEEWKERYGDDDEDLPAKFLALRFVPLELPQQENSFDCGLFLLHYVELFLDGAPINFNPFKISKFSNFLSQDWFSPAEASLKRAHILQLIYEIMVSD

Query:  HEKEFSGSIGKYSSSNVFDSDNDLSGQVYLEGAHTFTMTCPDKFSRGGKELENEMSSL-----KRFKEFGSVSKVSSDRNYQRIGGQSRSVMSPIEENEN
        H KEFSGSIGKY SS V  SD+DLS  VYLE  HT T TC DKF+ G KE ENEMS+L     KRF+E G VSKVSSD NYQ+IGGQSRSVMSPIEE+EN
Subjt:  HEKEFSGSIGKYSSSNVFDSDNDLSGQVYLEGAHTFTMTCPDKFSRGGKELENEMSSL-----KRFKEFGSVSKVSSDRNYQRIGGQSRSVMSPIEENEN

Query:  GEMADSPQCLEDRHQASAIVSECSSASSFGQQFRELEISGEGRFSRNFEDKGRRLASPPSLGESRTVSELGQDRSPQATKRLNHPTEADELEILSTSSEE
        GE++DSPQCLEDR+ ASAIVSECSSASSFGQQFRELEIS EGRFSRN +DK RR +S PSLGES T+S  G+D SPQA KRL+HPTEADE E L+TSSEE
Subjt:  GEMADSPQCLEDRHQASAIVSECSSASSFGQQFRELEISGEGRFSRNFEDKGRRLASPPSLGESRTVSELGQDRSPQATKRLNHPTEADELEILSTSSEE

Query:  LATCVVEDSEEYVVEDLEDCVVEDSEVANETNDGIE-IESGSTMRDSKEIDVSSSSRNNSFLPRQVVESTANLDDNRQHDLLVSSEHSTHDSSKQHSAKR
        LATCVVEDSE       E+ +VEDSE A+E  +GIE ++SG++M DSKEI++SSS +N+SFL R+  ES+A+ +DNRQHDL+VS+ H+ + SSKQHSAKR
Subjt:  LATCVVEDSEEYVVEDLEDCVVEDSEVANETNDGIE-IESGSTMRDSKEIDVSSSSRNNSFLPRQVVESTANLDDNRQHDLLVSSEHSTHDSSKQHSAKR

Query:  PGV
        P V
Subjt:  PGV

XP_022141978.1 probable ubiquitin-like-specific protease 2A isoform X6 [Momordica charantia] | 1.9e-150 | 72.46
Query:  SYLCEEWKERYGDDDEDLPAKFLALRFVPLELPQQENSFDCGLFLLHYVELFLDGAPINFNPFKISKFSNFLSQDWFSPAEASLKRAHILQLIYEIMVSD
        +YLCEEWKER+GD D  + AKFLAL+FVPLELPQQENSFDCGLFLLHYVE FL+GAP+NF+PFKIS+FSNFL+QDWF P EASLKRAHILQLIY+IMV+D
Subjt:  SYLCEEWKERYGDDDEDLPAKFLALRFVPLELPQQENSFDCGLFLLHYVELFLDGAPINFNPFKISKFSNFLSQDWFSPAEASLKRAHILQLIYEIMVSD

Query:  HEKEFSGSIGKYSSSNVFDSDNDLSGQVYLEGAHTFTMTCPDKFSRGGKELENEMSSL-----KRFKEFGSVSKVSSDRNYQRIGGQSRSVMSPIEENEN
        H KEFSGSIGKY SS V  SD+DLS  VYLE  HT T TC DKF+ G KE ENEMS+L     KRF+E G VSKVSSD NYQ+IGGQSRSVMSPIEE+EN
Subjt:  HEKEFSGSIGKYSSSNVFDSDNDLSGQVYLEGAHTFTMTCPDKFSRGGKELENEMSSL-----KRFKEFGSVSKVSSDRNYQRIGGQSRSVMSPIEENEN

Query:  GEMADSPQCLEDRHQASAIVSECSSASSFGQQFRELEISGEGRFSRNFEDKGRRLASPPSLGESRTVSELGQDRSPQATKRLNHPTEADELEILSTSSEE
        GE++DSPQCLEDR+ ASAIVSECSSASSFGQQFRELEIS EGRFSRN +DK RR +S PSLGES T+S  G+D SPQA KRL+HPTEADE E L+TSSEE
Subjt:  GEMADSPQCLEDRHQASAIVSECSSASSFGQQFRELEISGEGRFSRNFEDKGRRLASPPSLGESRTVSELGQDRSPQATKRLNHPTEADELEILSTSSEE

Query:  LATCVVEDSEEYVVEDLEDCVVEDSEVANETNDGIE-IESGSTMRDSKEIDVSSSSRNNSFLPRQVVESTANLDDNRQHDLLVSSEHSTHDSSKQHSAKR
        LATCVVEDSE       E+ +VEDSE A+E  +GIE ++SG++M DSKEI++SSS +N+SFL R+  ES+A+ +DNRQHDL+VS+ H+ + SSKQHSAKR
Subjt:  LATCVVEDSEEYVVEDLEDCVVEDSEVANETNDGIE-IESGSTMRDSKEIDVSSSSRNNSFLPRQVVESTANLDDNRQHDLLVSSEHSTHDSSKQHSAKR

Query:  PGV
        P V
Subjt:  PGV

XP_022141980.1 probable ubiquitin-like-specific protease 2A isoform X8 [Momordica charantia] | 1.9e-150 | 72.46
Query:  SYLCEEWKERYGDDDEDLPAKFLALRFVPLELPQQENSFDCGLFLLHYVELFLDGAPINFNPFKISKFSNFLSQDWFSPAEASLKRAHILQLIYEIMVSD
        +YLCEEWKER+GD D  + AKFLAL+FVPLELPQQENSFDCGLFLLHYVE FL+GAP+NF+PFKIS+FSNFL+QDWF P EASLKRAHILQLIY+IMV+D
Subjt:  SYLCEEWKERYGDDDEDLPAKFLALRFVPLELPQQENSFDCGLFLLHYVELFLDGAPINFNPFKISKFSNFLSQDWFSPAEASLKRAHILQLIYEIMVSD

Query:  HEKEFSGSIGKYSSSNVFDSDNDLSGQVYLEGAHTFTMTCPDKFSRGGKELENEMSSL-----KRFKEFGSVSKVSSDRNYQRIGGQSRSVMSPIEENEN
        H KEFSGSIGKY SS V  SD+DLS  VYLE  HT T TC DKF+ G KE ENEMS+L     KRF+E G VSKVSSD NYQ+IGGQSRSVMSPIEE+EN
Subjt:  HEKEFSGSIGKYSSSNVFDSDNDLSGQVYLEGAHTFTMTCPDKFSRGGKELENEMSSL-----KRFKEFGSVSKVSSDRNYQRIGGQSRSVMSPIEENEN

Query:  GEMADSPQCLEDRHQASAIVSECSSASSFGQQFRELEISGEGRFSRNFEDKGRRLASPPSLGESRTVSELGQDRSPQATKRLNHPTEADELEILSTSSEE
        GE++DSPQCLEDR+ ASAIVSECSSASSFGQQFRELEIS EGRFSRN +DK RR +S PSLGES T+S  G+D SPQA KRL+HPTEADE E L+TSSEE
Subjt:  GEMADSPQCLEDRHQASAIVSECSSASSFGQQFRELEISGEGRFSRNFEDKGRRLASPPSLGESRTVSELGQDRSPQATKRLNHPTEADELEILSTSSEE

Query:  LATCVVEDSEEYVVEDLEDCVVEDSEVANETNDGIE-IESGSTMRDSKEIDVSSSSRNNSFLPRQVVESTANLDDNRQHDLLVSSEHSTHDSSKQHSAKR
        LATCVVEDSE       E+ +VEDSE A+E  +GIE ++SG++M DSKEI++SSS +N+SFL R+  ES+A+ +DNRQHDL+VS+ H+ + SSKQHSAKR
Subjt:  LATCVVEDSEEYVVEDLEDCVVEDSEVANETNDGIE-IESGSTMRDSKEIDVSSSSRNNSFLPRQVVESTANLDDNRQHDLLVSSEHSTHDSSKQHSAKR

Query:  PGV
        P V
Subjt:  PGV

TrEMBL top hits | e value | % identity
A0A6J1CJL7 probable ubiquitin-like-specific protease 2A isoform X4 | 9.1e-151 | 72.46
Query:  SYLCEEWKERYGDDDEDLPAKFLALRFVPLELPQQENSFDCGLFLLHYVELFLDGAPINFNPFKISKFSNFLSQDWFSPAEASLKRAHILQLIYEIMVSD
        +YLCEEWKER+GD D  + AKFLAL+FVPLELPQQENSFDCGLFLLHYVE FL+GAP+NF+PFKIS+FSNFL+QDWF P EASLKRAHILQLIY+IMV+D
Subjt:  SYLCEEWKERYGDDDEDLPAKFLALRFVPLELPQQENSFDCGLFLLHYVELFLDGAPINFNPFKISKFSNFLSQDWFSPAEASLKRAHILQLIYEIMVSD

Query:  HEKEFSGSIGKYSSSNVFDSDNDLSGQVYLEGAHTFTMTCPDKFSRGGKELENEMSSL-----KRFKEFGSVSKVSSDRNYQRIGGQSRSVMSPIEENEN
        H KEFSGSIGKY SS V  SD+DLS  VYLE  HT T TC DKF+ G KE ENEMS+L     KRF+E G VSKVSSD NYQ+IGGQSRSVMSPIEE+EN
Subjt:  HEKEFSGSIGKYSSSNVFDSDNDLSGQVYLEGAHTFTMTCPDKFSRGGKELENEMSSL-----KRFKEFGSVSKVSSDRNYQRIGGQSRSVMSPIEENEN

Query:  GEMADSPQCLEDRHQASAIVSECSSASSFGQQFRELEISGEGRFSRNFEDKGRRLASPPSLGESRTVSELGQDRSPQATKRLNHPTEADELEILSTSSEE
        GE++DSPQCLEDR+ ASAIVSECSSASSFGQQFRELEIS EGRFSRN +DK RR +S PSLGES T+S  G+D SPQA KRL+HPTEADE E L+TSSEE
Subjt:  GEMADSPQCLEDRHQASAIVSECSSASSFGQQFRELEISGEGRFSRNFEDKGRRLASPPSLGESRTVSELGQDRSPQATKRLNHPTEADELEILSTSSEE

Query:  LATCVVEDSEEYVVEDLEDCVVEDSEVANETNDGIE-IESGSTMRDSKEIDVSSSSRNNSFLPRQVVESTANLDDNRQHDLLVSSEHSTHDSSKQHSAKR
        LATCVVEDSE       E+ +VEDSE A+E  +GIE ++SG++M DSKEI++SSS +N+SFL R+  ES+A+ +DNRQHDL+VS+ H+ + SSKQHSAKR
Subjt:  LATCVVEDSEEYVVEDLEDCVVEDSEVANETNDGIE-IESGSTMRDSKEIDVSSSSRNNSFLPRQVVESTANLDDNRQHDLLVSSEHSTHDSSKQHSAKR

Query:  PGV
        P V
Subjt:  PGV

A0A6J1CK96 probable ubiquitin-like-specific protease 2A isoform X7 | 9.1e-151 | 72.46
Query:  SYLCEEWKERYGDDDEDLPAKFLALRFVPLELPQQENSFDCGLFLLHYVELFLDGAPINFNPFKISKFSNFLSQDWFSPAEASLKRAHILQLIYEIMVSD
        +YLCEEWKER+GD D  + AKFLAL+FVPLELPQQENSFDCGLFLLHYVE FL+GAP+NF+PFKIS+FSNFL+QDWF P EASLKRAHILQLIY+IMV+D
Subjt:  SYLCEEWKERYGDDDEDLPAKFLALRFVPLELPQQENSFDCGLFLLHYVELFLDGAPINFNPFKISKFSNFLSQDWFSPAEASLKRAHILQLIYEIMVSD

Query:  HEKEFSGSIGKYSSSNVFDSDNDLSGQVYLEGAHTFTMTCPDKFSRGGKELENEMSSL-----KRFKEFGSVSKVSSDRNYQRIGGQSRSVMSPIEENEN
        H KEFSGSIGKY SS V  SD+DLS  VYLE  HT T TC DKF+ G KE ENEMS+L     KRF+E G VSKVSSD NYQ+IGGQSRSVMSPIEE+EN
Subjt:  HEKEFSGSIGKYSSSNVFDSDNDLSGQVYLEGAHTFTMTCPDKFSRGGKELENEMSSL-----KRFKEFGSVSKVSSDRNYQRIGGQSRSVMSPIEENEN

Query:  GEMADSPQCLEDRHQASAIVSECSSASSFGQQFRELEISGEGRFSRNFEDKGRRLASPPSLGESRTVSELGQDRSPQATKRLNHPTEADELEILSTSSEE
        GE++DSPQCLEDR+ ASAIVSECSSASSFGQQFRELEIS EGRFSRN +DK RR +S PSLGES T+S  G+D SPQA KRL+HPTEADE E L+TSSEE
Subjt:  GEMADSPQCLEDRHQASAIVSECSSASSFGQQFRELEISGEGRFSRNFEDKGRRLASPPSLGESRTVSELGQDRSPQATKRLNHPTEADELEILSTSSEE

Query:  LATCVVEDSEEYVVEDLEDCVVEDSEVANETNDGIE-IESGSTMRDSKEIDVSSSSRNNSFLPRQVVESTANLDDNRQHDLLVSSEHSTHDSSKQHSAKR
        LATCVVEDSE       E+ +VEDSE A+E  +GIE ++SG++M DSKEI++SSS +N+SFL R+  ES+A+ +DNRQHDL+VS+ H+ + SSKQHSAKR
Subjt:  LATCVVEDSEEYVVEDLEDCVVEDSEVANETNDGIE-IESGSTMRDSKEIDVSSSSRNNSFLPRQVVESTANLDDNRQHDLLVSSEHSTHDSSKQHSAKR

Query:  PGV
        P V
Subjt:  PGV

A0A6J1CLA8 probable ubiquitin-like-specific protease 2A isoform X1 | 9.1e-151 | 72.46
Query:  SYLCEEWKERYGDDDEDLPAKFLALRFVPLELPQQENSFDCGLFLLHYVELFLDGAPINFNPFKISKFSNFLSQDWFSPAEASLKRAHILQLIYEIMVSD
        +YLCEEWKER+GD D  + AKFLAL+FVPLELPQQENSFDCGLFLLHYVE FL+GAP+NF+PFKIS+FSNFL+QDWF P EASLKRAHILQLIY+IMV+D
Subjt:  SYLCEEWKERYGDDDEDLPAKFLALRFVPLELPQQENSFDCGLFLLHYVELFLDGAPINFNPFKISKFSNFLSQDWFSPAEASLKRAHILQLIYEIMVSD

Query:  HEKEFSGSIGKYSSSNVFDSDNDLSGQVYLEGAHTFTMTCPDKFSRGGKELENEMSSL-----KRFKEFGSVSKVSSDRNYQRIGGQSRSVMSPIEENEN
        H KEFSGSIGKY SS V  SD+DLS  VYLE  HT T TC DKF+ G KE ENEMS+L     KRF+E G VSKVSSD NYQ+IGGQSRSVMSPIEE+EN
Subjt:  HEKEFSGSIGKYSSSNVFDSDNDLSGQVYLEGAHTFTMTCPDKFSRGGKELENEMSSL-----KRFKEFGSVSKVSSDRNYQRIGGQSRSVMSPIEENEN

Query:  GEMADSPQCLEDRHQASAIVSECSSASSFGQQFRELEISGEGRFSRNFEDKGRRLASPPSLGESRTVSELGQDRSPQATKRLNHPTEADELEILSTSSEE
        GE++DSPQCLEDR+ ASAIVSECSSASSFGQQFRELEIS EGRFSRN +DK RR +S PSLGES T+S  G+D SPQA KRL+HPTEADE E L+TSSEE
Subjt:  GEMADSPQCLEDRHQASAIVSECSSASSFGQQFRELEISGEGRFSRNFEDKGRRLASPPSLGESRTVSELGQDRSPQATKRLNHPTEADELEILSTSSEE

Query:  LATCVVEDSEEYVVEDLEDCVVEDSEVANETNDGIE-IESGSTMRDSKEIDVSSSSRNNSFLPRQVVESTANLDDNRQHDLLVSSEHSTHDSSKQHSAKR
        LATCVVEDSE       E+ +VEDSE A+E  +GIE ++SG++M DSKEI++SSS +N+SFL R+  ES+A+ +DNRQHDL+VS+ H+ + SSKQHSAKR
Subjt:  LATCVVEDSEEYVVEDLEDCVVEDSEVANETNDGIE-IESGSTMRDSKEIDVSSSSRNNSFLPRQVVESTANLDDNRQHDLLVSSEHSTHDSSKQHSAKR

Query:  PGV
        P V
Subjt:  PGV

A0A6J1CM36 probable ubiquitin-like-specific protease 2A isoform X2 | 1.8e-151 | 66.38
Query:  YSFASLPY--FWGLETFC--GRGSTREERKRD----ATCGLGGDCEAVDLWGLELENLSYLCEEWKERYGDDDEDLPAKFLALRFVPLELPQQENSFDCG
        Y F  + Y   W L   C  G     +++K D      C L  D       GL     +YLCEEWKER+GD D  + AKFLAL+FVPLELPQQENSFDCG
Subjt:  YSFASLPY--FWGLETFC--GRGSTREERKRD----ATCGLGGDCEAVDLWGLELENLSYLCEEWKERYGDDDEDLPAKFLALRFVPLELPQQENSFDCG

Query:  LFLLHYVELFLDGAPINFNPFKISKFSNFLSQDWFSPAEASLKRAHILQLIYEIMVSDHEKEFSGSIGKYSSSNVFDSDNDLSGQVYLEGAHTFTMTCPD
        LFLLHYVE FL+GAP+NF+PFKIS+FSNFL+QDWF P EASLKRAHILQLIY+IMV+DH KEFSGSIGKY SS V  SD+DLS  VYLE  HT T TC D
Subjt:  LFLLHYVELFLDGAPINFNPFKISKFSNFLSQDWFSPAEASLKRAHILQLIYEIMVSDHEKEFSGSIGKYSSSNVFDSDNDLSGQVYLEGAHTFTMTCPD

Query:  KFSRGGKELENEMSSL-----KRFKEFGSVSKVSSDRNYQRIGGQSRSVMSPIEENENGEMADSPQCLEDRHQASAIVSECSSASSFGQQFRELEISGEG
        KF+ G KE ENEMS+L     KRF+E G VSKVSSD NYQ+IGGQSRSVMSPIEE+ENGE++DSPQCLEDR+ ASAIVSECSSASSFGQQFRELEIS EG
Subjt:  KFSRGGKELENEMSSL-----KRFKEFGSVSKVSSDRNYQRIGGQSRSVMSPIEENENGEMADSPQCLEDRHQASAIVSECSSASSFGQQFRELEISGEG

Query:  RFSRNFEDKGRRLASPPSLGESRTVSELGQDRSPQATKRLNHPTEADELEILSTSSEELATCVVEDSEEYVVEDLEDCVVEDSEVANETNDGIE-IESGS
        RFSRN +DK RR +S PSLGES T+S  G+D SPQA KRL+HPTEADE E L+TSSEELATCVVEDSE       E+ +VEDSE A+E  +GIE ++SG+
Subjt:  RFSRNFEDKGRRLASPPSLGESRTVSELGQDRSPQATKRLNHPTEADELEILSTSSEELATCVVEDSEEYVVEDLEDCVVEDSEVANETNDGIE-IESGS

Query:  TMRDSKEIDVSSSSRNNSFLPRQVVESTANLDDNRQHDLLVSSEHSTHDSSKQHSAKRPGV
        +M DSKEI++SSS +N+SFL R+  ES+A+ +DNRQHDL+VS+ H+ + SSKQHSAKRP V
Subjt:  TMRDSKEIDVSSSSRNNSFLPRQVVESTANLDDNRQHDLLVSSEHSTHDSSKQHSAKRPGV

A0A6J1CM38 probable ubiquitin-like-specific protease 2A isoform X6 | 9.1e-151 | 72.46
Query:  SYLCEEWKERYGDDDEDLPAKFLALRFVPLELPQQENSFDCGLFLLHYVELFLDGAPINFNPFKISKFSNFLSQDWFSPAEASLKRAHILQLIYEIMVSD
        +YLCEEWKER+GD D  + AKFLAL+FVPLELPQQENSFDCGLFLLHYVE FL+GAP+NF+PFKIS+FSNFL+QDWF P EASLKRAHILQLIY+IMV+D
Subjt:  SYLCEEWKERYGDDDEDLPAKFLALRFVPLELPQQENSFDCGLFLLHYVELFLDGAPINFNPFKISKFSNFLSQDWFSPAEASLKRAHILQLIYEIMVSD

Query:  HEKEFSGSIGKYSSSNVFDSDNDLSGQVYLEGAHTFTMTCPDKFSRGGKELENEMSSL-----KRFKEFGSVSKVSSDRNYQRIGGQSRSVMSPIEENEN
        H KEFSGSIGKY SS V  SD+DLS  VYLE  HT T TC DKF+ G KE ENEMS+L     KRF+E G VSKVSSD NYQ+IGGQSRSVMSPIEE+EN
Subjt:  HEKEFSGSIGKYSSSNVFDSDNDLSGQVYLEGAHTFTMTCPDKFSRGGKELENEMSSL-----KRFKEFGSVSKVSSDRNYQRIGGQSRSVMSPIEENEN

Query:  GEMADSPQCLEDRHQASAIVSECSSASSFGQQFRELEISGEGRFSRNFEDKGRRLASPPSLGESRTVSELGQDRSPQATKRLNHPTEADELEILSTSSEE
        GE++DSPQCLEDR+ ASAIVSECSSASSFGQQFRELEIS EGRFSRN +DK RR +S PSLGES T+S  G+D SPQA KRL+HPTEADE E L+TSSEE
Subjt:  GEMADSPQCLEDRHQASAIVSECSSASSFGQQFRELEISGEGRFSRNFEDKGRRLASPPSLGESRTVSELGQDRSPQATKRLNHPTEADELEILSTSSEE

Query:  LATCVVEDSEEYVVEDLEDCVVEDSEVANETNDGIE-IESGSTMRDSKEIDVSSSSRNNSFLPRQVVESTANLDDNRQHDLLVSSEHSTHDSSKQHSAKR
        LATCVVEDSE       E+ +VEDSE A+E  +GIE ++SG++M DSKEI++SSS +N+SFL R+  ES+A+ +DNRQHDL+VS+ H+ + SSKQHSAKR
Subjt:  LATCVVEDSEEYVVEDLEDCVVEDSEVANETNDGIE-IESGSTMRDSKEIDVSSSSRNNSFLPRQVVESTANLDDNRQHDLLVSSEHSTHDSSKQHSAKR

Query:  PGV
        P V
Subjt:  PGV

SwissProt top hits | e value | % identity
P0C2F6 Putative ribonuclease H protein At1g65750 | 5.3e-15 | 32.39
Query:  PVTVCKSIEKLMRDFLWERVDEGRSAHLVSWETMGKSLSKGGLEVGNLRTRNKALLAKWLWRFSSESTTLWHRIIVSKYGRHPFE-----WVSGGFKGTS
        P ++   +++L R FLW    E +  HLV W  +     +GGL V   ++ N+AL++K  WR   E  +LW  ++  KY  H  E     W+    KG+ 
Subjt:  PVTVCKSIEKLMRDFLWERVDEGRSAHLVSWETMGKSLSKGGLEVGNLRTRNKALLAKWLWRFSSESTTLWHRIIVSKYGRHPFE-----WVSGGFKGTS

Query:  RNPWKEISAELPS-LSLFIHHIVGNGEETYFWEDRWVGDRPL
         + W+ I+  L   +S  +  I G+G++  FW DRWV  +PL
Subjt:  RNPWKEISAELPS-LSLFIHHIVGNGEETYFWEDRWVGDRPL

P93295 Uncharacterized mitochondrial protein AtMg00310 | 2.8e-08 | 29.77
Query:  VCKSIEKLMRDFLWERVDEGRSAHLVSWETMGKSL-SKGGLEVGNLRTRNKALLAKWLWRFSSESTTLWHRIIVSKYGRHPFEWVSGGFKGTSRNPWKEI
        +CK +   M +F W   +  R    V+W+ + KS    GGL   +L   N+ALLAK  +R   +  TL  R++ S+Y  H    +           W+ I
Subjt:  VCKSIEKLMRDFLWERVDEGRSAHLVSWETMGKSL-SKGGLEVGNLRTRNKALLAKWLWRFSSESTTLWHRIIVSKYGRHPFEWVSGGFKGTSRNPWKEI

Query:  SAELPSLSLFIHHIVGNGEETYFWEDRWVGD
              LS  +   +G+G  T  W DRW+ D
Subjt:  SAELPSLSLFIHHIVGNGEETYFWEDRWVGD

Q0WKV8 Probable ubiquitin-like-specific protease 2A | 1.5e-22 | 54.17
Query:  SYLCEEWKERYGDDDEDLPAKFLALRFVPLELPQQENSFDCGLFLLHYVELFLDGAPINFNPFKISKFSNFLSQDWFSPAEASLKRAHILQLIYEI
        SYL EEWK R+ +   D  ++   ++ + LELPQQENSFDCGLFLLHY++LF+  AP  FNP  IS+ +NFL+++WF   EASLKR +IL+L+Y +
Subjt:  SYLCEEWKERYGDDDEDLPAKFLALRFVPLELPQQENSFDCGLFLLHYVELFLDGAPINFNPFKISKFSNFLSQDWFSPAEASLKRAHILQLIYEI

Q8L7S0 Probable ubiquitin-like-specific protease 2B | 1.1e-31 | 35.86
Query:  YSFASLPY--FWGLETFCGRGSTREERKRD------ATCGLGGDCEAVDLWGLELENLSYLCEEWKERYGDDDEDLPAKFLALRFVPLELPQQENSFDCG
        Y F  + Y   W L   C  G        D        C L  D       GL+    +YLCEEWKER+ +  +D+ ++F+ LRFV LELPQQENSFDCG
Subjt:  YSFASLPY--FWGLETFCGRGSTREERKRD------ATCGLGGDCEAVDLWGLELENLSYLCEEWKERYGDDDEDLPAKFLALRFVPLELPQQENSFDCG

Query:  LFLLHYVELFLDGAPINFNPFKISKFSNFLSQDWFSPAEASLKRAHILQLIYEIMVSDHEKEFSGSIGKYSSSNVFDSDNDLSGQVYLEGAHTFTMTCPD
        LFLLHY+ELFL  AP+NF+PFKI   SNFL  +WF PAEASLKR  I +LI+E++  +  +E S    +   S V  + ND  G   L    +  + C  
Subjt:  LFLLHYVELFLDGAPINFNPFKISKFSNFLSQDWFSPAEASLKRAHILQLIYEIMVSDHEKEFSGSIGKYSSSNVFDSDNDLSGQVYLEGAHTFTMTCPD

Query:  KFSRGGKELENEMSSLKRFKEFGSVSKVSSDRNYQRI---GGQSRSVMSPIEENENGEMADSPQCLEDRHQASAIVSECSSASSF----GQQFRELEISG
          ++   +   EM+ L+R          SS R+ Q     G   R +      N    +    Q  ED      + ++ S+        G+QF  L  +G
Subjt:  KFSRGGKELENEMSSLKRFKEFGSVSKVSSDRNYQRI---GGQSRSVMSPIEENENGEMADSPQCLEDRHQASAIVSECSSASSF----GQQFRELEISG

Query:  EGRF
        EG F
Subjt:  EGRF

Q8RWN0 Ubiquitin-like-specific protease 1C | 4.8e-08 | 31.67
Query:  VDLWGLELENL------SYLCEEWKERYGDDDEDLPAKFLALRFVP-------LELPQQENSFDCGLFLLHYVELFLDGAPINFNPFKISKFSNFLSQDW
        +D  GL   NL       +L EEW     D   DLP      R +P       +++PQQ+N FDCGLFLL ++  F++ AP       +      + + W
Subjt:  VDLWGLELENL------SYLCEEWKERYGDDDEDLPAKFLALRFVP-------LELPQQENSFDCGLFLLHYVELFLDGAPINFNPFKISKFSNFLSQDW

Query:  FSPAEASLKRAHILQLIYEI
        F P EAS  R  I  ++ ++
Subjt:  FSPAEASLKRAHILQLIYEI

Arabidopsis top hits | e value | % identity
AT1G09730.1 Cysteine proteinases superfamily protein | 7.6e-33 | 35.86
Query:  YSFASLPY--FWGLETFCGRGSTREERKRD------ATCGLGGDCEAVDLWGLELENLSYLCEEWKERYGDDDEDLPAKFLALRFVPLELPQQENSFDCG
        Y F  + Y   W L   C  G        D        C L  D       GL+    +YLCEEWKER+ +  +D+ ++F+ LRFV LELPQQENSFDCG
Subjt:  YSFASLPY--FWGLETFCGRGSTREERKRD------ATCGLGGDCEAVDLWGLELENLSYLCEEWKERYGDDDEDLPAKFLALRFVPLELPQQENSFDCG

Query:  LFLLHYVELFLDGAPINFNPFKISKFSNFLSQDWFSPAEASLKRAHILQLIYEIMVSDHEKEFSGSIGKYSSSNVFDSDNDLSGQVYLEGAHTFTMTCPD
        LFLLHY+ELFL  AP+NF+PFKI   SNFL  +WF PAEASLKR  I +LI+E++  +  +E S    +   S V  + ND  G   L    +  + C  
Subjt:  LFLLHYVELFLDGAPINFNPFKISKFSNFLSQDWFSPAEASLKRAHILQLIYEIMVSDHEKEFSGSIGKYSSSNVFDSDNDLSGQVYLEGAHTFTMTCPD

Query:  KFSRGGKELENEMSSLKRFKEFGSVSKVSSDRNYQRI---GGQSRSVMSPIEENENGEMADSPQCLEDRHQASAIVSECSSASSF----GQQFRELEISG
          ++   +   EM+ L+R          SS R+ Q     G   R +      N    +    Q  ED      + ++ S+        G+QF  L  +G
Subjt:  KFSRGGKELENEMSSLKRFKEFGSVSKVSSDRNYQRI---GGQSRSVMSPIEENENGEMADSPQCLEDRHQASAIVSECSSASSF----GQQFRELEISG

Query:  EGRF
        EG F
Subjt:  EGRF

AT1G09730.2 Cysteine proteinases superfamily protein | 7.6e-33 | 35.86
Query:  YSFASLPY--FWGLETFCGRGSTREERKRD------ATCGLGGDCEAVDLWGLELENLSYLCEEWKERYGDDDEDLPAKFLALRFVPLELPQQENSFDCG
        Y F  + Y   W L   C  G        D        C L  D       GL+    +YLCEEWKER+ +  +D+ ++F+ LRFV LELPQQENSFDCG
Subjt:  YSFASLPY--FWGLETFCGRGSTREERKRD------ATCGLGGDCEAVDLWGLELENLSYLCEEWKERYGDDDEDLPAKFLALRFVPLELPQQENSFDCG

Query:  LFLLHYVELFLDGAPINFNPFKISKFSNFLSQDWFSPAEASLKRAHILQLIYEIMVSDHEKEFSGSIGKYSSSNVFDSDNDLSGQVYLEGAHTFTMTCPD
        LFLLHY+ELFL  AP+NF+PFKI   SNFL  +WF PAEASLKR  I +LI+E++  +  +E S    +   S V  + ND  G   L    +  + C  
Subjt:  LFLLHYVELFLDGAPINFNPFKISKFSNFLSQDWFSPAEASLKRAHILQLIYEIMVSDHEKEFSGSIGKYSSSNVFDSDNDLSGQVYLEGAHTFTMTCPD

Query:  KFSRGGKELENEMSSLKRFKEFGSVSKVSSDRNYQRI---GGQSRSVMSPIEENENGEMADSPQCLEDRHQASAIVSECSSASSF----GQQFRELEISG
          ++   +   EM+ L+R          SS R+ Q     G   R +      N    +    Q  ED      + ++ S+        G+QF  L  +G
Subjt:  KFSRGGKELENEMSSLKRFKEFGSVSKVSSDRNYQRI---GGQSRSVMSPIEENENGEMADSPQCLEDRHQASAIVSECSSASSF----GQQFRELEISG

Query:  EGRF
        EG F
Subjt:  EGRF

AT1G10570.1 Cysteine proteinases superfamily protein | 3.4e-09 | 31.67
Query:  VDLWGLELENL------SYLCEEWKERYGDDDEDLPAKFLALRFVP-------LELPQQENSFDCGLFLLHYVELFLDGAPINFNPFKISKFSNFLSQDW
        +D  GL   NL       +L EEW     D   DLP      R +P       +++PQQ+N FDCGLFLL ++  F++ AP       +      + + W
Subjt:  VDLWGLELENL------SYLCEEWKERYGDDDEDLPAKFLALRFVP-------LELPQQENSFDCGLFLLHYVELFLDGAPINFNPFKISKFSNFLSQDW

Query:  FSPAEASLKRAHILQLIYEI
        F P EAS  R  I  ++ ++
Subjt:  FSPAEASLKRAHILQLIYEI

AT4G29090.1 Ribonuclease H-like superfamily protein | 7.4e-20 | 22.75
Query:  PVTVCKSIEKLMRDFLWERVDEGRSAHLVSWETMGKSLSKGGLEVGNLRTRNKALLAKWLWRFSSESTTLWHRIIVSKYGRHPFEWVSGGFKGTSRNPWK
        P TVCK I  ++ DF W    E +  H  +W+ +    ++GG+   ++   N ALL K +WR  S   +L  ++  S+Y  H  + ++          WK
Subjt:  PVTVCKSIEKLMRDFLWERVDEGRSAHLVSWETMGKSLSKGGLEVGNLRTRNKALLAKWLWRFSSESTTLWHRIIVSKYGRHPFEWVSGGFKGTSRNPWK

Query:  EISAELPSLSLFIHHIVGNGEETYFWEDRWVGDRPLCVAYPRLYHLSSMKNHFVAEVLNPSGSSISFSFGFSRSLSERETSDVMSLLSLIEEVSFRPGRR
         I A    L      +VGNGE+   W  +W+  +P   A  R+  +   +   V+ +L  S         + + + E    +V     LI E+  RPG R
Subjt:  EISAELPSLSLFIHHIVGNGEETYFWEDRWVGDRPLCVAYPRLYHLSSMKNHFVAEVLNPSGSSISFSFGFSRSLSERETSDVMSLLSLIEEVSFRPGRR

Query:  ---DSRVWNPTPLRVSLAIRSFTV---FWTLLLLLRDRFT----------------------------LW----------------------------MA
           DS  W+ T      +   +TV   +W L  ++  R +                            LW                              
Subjt:  ---DSRVWNPTPLRVSLAIRSFTV---FWTLLLLLRDRFT----------------------------LW----------------------------MA

Query:  EEDLDHILWHCDFARSVWNNLFEIFELQFTRHRDLREMIDEFLPHPPFRDQGNFLWQAG---ICAIIWVLWGERNNRIFRGKERNAEDVWDTIKYSVSLW
        +E ++H+L+ C FAR  W     I  +      +  + I   L        GN  W+     +  ++W LW  RN  +FRG+E NA++V    +  +  W
Subjt:  EEDLDHILWHCDFARSVWNNLFEIFELQFTRHRDLREMIDEFLPHPPFRDQGNFLWQAG---ICAIIWVLWGERNNRIFRGKERNAEDVWDTIKYSVSLW

AT4G33620.1 Cysteine proteinases superfamily protein | 1.1e-23 | 54.17
Query:  SYLCEEWKERYGDDDEDLPAKFLALRFVPLELPQQENSFDCGLFLLHYVELFLDGAPINFNPFKISKFSNFLSQDWFSPAEASLKRAHILQLIYEI
        SYL EEWK R+ +   D  ++   ++ + LELPQQENSFDCGLFLLHY++LF+  AP  FNP  IS+ +NFL+++WF   EASLKR +IL+L+Y +
Subjt:  SYLCEEWKERYGDDDEDLPAKFLALRFVPLELPQQENSFDCGLFLLHYVELFLDGAPINFNPFKISKFSNFLSQDWFSPAEASLKRAHILQLIYEI


Sequences
CDS sequence
ATGAGGCTATTGAAGAGTATAGACGTAGTGGTCGAGAAGGAGTTATCTTCAAGATTGATTTCGAAAAAGCTTATGACCACATCCAGGGGGAGAATTCGTGCTACCCGTGG
CCTTCGTCAGGGTGATCCCCTATCTCCTTTCCTGTTCCTTTTGGTTGTTGACACGCTCAGTAGGTTGGTCTCTAAAGGGGTTGAGGGTATGATTATTGAAGGTTTCGAGG
TGGGGGGAAACAGGATCCTCTTTCTCACCTCCAGTTTGCTGATGACACGATTTTCTTCTGCTCTGGGCTCCCGCTGGGACATAATCCGAGGACCTCTACTTTTTGGGAGC
CAGTGGTGGGCAAGGGCTCCTGTCACCGTGTGCAAATCCATTGAGAAGCTCATGAGAGATTTCTTGTGGGAAAGGGTGGATGAAGGGCGGAGTGCTCACTTGGTCAGTTG
GGAAACTATGGGAAAGTCTTTGAGTAAAGGAGGTCTAGAGGTGGGTAACTTGCGAACTCGCAACAAAGCTCTGTTGGCTAAATGGTTATGGCGCTTCTCCTCTGAGTCCA
CTACCCTTTGGCATAGGATAATTGTAAGCAAATACGGTCGTCATCCTTTTGAGTGGGTGTCGGGTGGGTTCAAAGGCACTTCTAGAAATCCTTGGAAAGAAATTTCAGCT
GAGCTTCCCTCTCTTTCTTTATTCATTCATCATATTGTGGGTAATGGGGAGGAAACCTATTTTTGGGAAGACCGGTGGGTGGGGGATAGACCCCTTTGTGTTGCATACCC
TCGCCTTTATCATTTATCTTCGATGAAGAACCATTTTGTGGCCGAGGTGTTGAACCCTTCGGGAAGCTCCATTTCTTTTTCCTTTGGCTTTTCTCGCTCCCTATCTGAGA
GGGAAACTTCTGATGTTATGTCTCTTCTTTCCTTGATAGAAGAGGTTTCCTTTAGACCGGGTAGGAGGGATAGTCGTGTTTGGAACCCAACCCCTCTAAGGGTTTCTCTT
GCCATTCGTTCTTTCACTGTCTTCTGGACCCTTCTCCTCCTCCTGAGAGATCGGTTTACTCTTTGGATGGCGGAGGAAGACCTCGACCATATTTTATGGCATTGTGACTT
TGCGCGCTCGGTTTGGAACAATTTGTTTGAGATTTTTGAGCTTCAATTCACTAGACACAGGGATCTCAGGGAGATGATCGATGAGTTCCTTCCCCATCCTCCTTTTCGGG
ACCAAGGAAATTTCTTATGGCAAGCTGGGATTTGTGCTATCATTTGGGTTTTGTGGGGGGAGAGAAATAACAGAATTTTTAGAGGGAAGGAGAGGAATGCGGAGGATGTT
TGGGACACTATTAAATATTCTGTCTCTTTATGGGCGTCGGTGACGCGTTTATTTTATCGGCGAGTGGGCTACGTTTTGAGCTGGGCAGTGACAAAAAAATCCTCCGGTGA
CAGACCAAAACGGCGGCGGCGACGGCGGCGGAATCTGGTGGTGGCGGCGGCGGTTTCAAGCAGCAAACGTTTTGGGCAACCCAAAACTCTCAAAAGTGTGTTTTATACAG
TCATACCTTTAAAGGAATTGGAGCCAACGAGCATGCATATGACCAATTTTCTTTTGAGATTGAAGAAACTTCAAAGAAAAGTGATTGATAGTAAAATAAATTCATCGGCA
AAAGTGGAAGAGCTTCGTTGTTGGAAAGTTCGATTAGAGGCTCCTTCTTTGAGGTTGTTGTTGTTGTTTGTCCCACGTAGTTCTCCTTCGTGTCAGTTGGATTATGGCTT
GTTGAACATCAAGGAGTTGCTTCTTTATGGCCTTGGAACTTCTAAGCAACTGGCTGGTAACTCTAAAACTAATAATCTTTACAAAGGAGATTGCCTTGAGGAAAAAACAA
GATTACAATCAAGAAGCTTTTTGGAGGTCGGCGCCCGTGTTCTGGCTCTGGGGGGGCCAAGGGCAAGGAAAAAGAACCGATATAGCTTCGCTTCCTTACCTTACTTCTGG
GGCTTGGAGACTTTTTGTGGGAGGGGGTCGACGAGGGAAGAGCGGAAGAGAGACGCAACTTGTGGGTTAGGAGGTGACTGTGAAGCTGTAGACCTTTGGGGGCTAGAATT
AGAGAACTTAAGTTATTTATGTGAAGAGTGGAAAGAGAGGTATGGAGATGATGATGAGGATCTTCCTGCAAAGTTCCTAGCCTTGCGATTTGTCCCCCTGGAGTTGCCAC
AGCAGGAAAATTCATTTGATTGTGGTCTCTTCTTACTTCATTATGTGGAACTTTTTTTGGACGGTGCGCCAATTAACTTCAACCCATTCAAAATCTCGAAGTTCTCAAAC
TTTCTAAGCCAGGATTGGTTCTCTCCTGCGGAGGCTTCTCTTAAACGTGCACATATCCTACAGTTAATTTATGAAATCATGGTCAGCGACCATGAAAAAGAATTTTCTGG
TAGCATTGGTAAATATTCTTCTTCCAATGTTTTTGACTCGGACAATGATTTATCTGGACAAGTGTATCTTGAAGGGGCACATACTTTCACAATGACTTGCCCTGACAAGT
TCTCAAGAGGTGGAAAAGAACTGGAAAATGAAATGTCCTCTCTAAAACGATTTAAAGAATTTGGATCGGTGTCTAAAGTATCATCTGACAGAAATTATCAACGAATAGGT
GGACAATCAAGGAGTGTCATGTCACCCATTGAGGAAAATGAAAATGGTGAAATGGCTGATTCACCACAATGTTTGGAAGATCGTCACCAAGCTTCTGCAATTGTTTCCGA
ATGTTCATCAGCGTCCAGTTTTGGCCAACAATTTAGAGAATTAGAAATATCTGGTGAGGGCAGATTTTCTAGAAATTTCGAAGATAAAGGTAGAAGACTGGCGTCTCCGC
CATCTCTTGGTGAGTCTCGAACCGTATCTGAATTGGGGCAAGATCGTTCACCTCAAGCAACCAAGAGACTGAACCATCCAACGGAGGCAGATGAACTAGAGATCTTATCC
ACCTCAAGTGAAGAACTTGCAACTTGTGTAGTAGAAGATTCAGAGGAGTATGTAGTTGAAGATCTAGAGGATTGTGTAGTTGAAGATTCAGAGGTGGCAAACGAAACGAA
TGATGGAATCGAAATTGAATCGGGCAGTACTATGCGTGATAGCAAGGAAATTGACGTTTCTTCCTCCTCAAGGAACAACTCGTTCCTACCGAGGCAAGTGGTTGAATCTA
CTGCAAACTTAGATGATAATAGACAACATGATCTGCTTGTAAGCAGTGAACATTCAACACATGATTCTAGCAAGCAGCATTCTGCTAAAAGGCCTGGAGTTTAG
mRNA sequence
ATGAGGCTATTGAAGAGTATAGACGTAGTGGTCGAGAAGGAGTTATCTTCAAGATTGATTTCGAAAAAGCTTATGACCACATCCAGGGGGAGAATTCGTGCTACCCGTGG
CCTTCGTCAGGGTGATCCCCTATCTCCTTTCCTGTTCCTTTTGGTTGTTGACACGCTCAGTAGGTTGGTCTCTAAAGGGGTTGAGGGTATGATTATTGAAGGTTTCGAGG
TGGGGGGAAACAGGATCCTCTTTCTCACCTCCAGTTTGCTGATGACACGATTTTCTTCTGCTCTGGGCTCCCGCTGGGACATAATCCGAGGACCTCTACTTTTTGGGAGC
CAGTGGTGGGCAAGGGCTCCTGTCACCGTGTGCAAATCCATTGAGAAGCTCATGAGAGATTTCTTGTGGGAAAGGGTGGATGAAGGGCGGAGTGCTCACTTGGTCAGTTG
GGAAACTATGGGAAAGTCTTTGAGTAAAGGAGGTCTAGAGGTGGGTAACTTGCGAACTCGCAACAAAGCTCTGTTGGCTAAATGGTTATGGCGCTTCTCCTCTGAGTCCA
CTACCCTTTGGCATAGGATAATTGTAAGCAAATACGGTCGTCATCCTTTTGAGTGGGTGTCGGGTGGGTTCAAAGGCACTTCTAGAAATCCTTGGAAAGAAATTTCAGCT
GAGCTTCCCTCTCTTTCTTTATTCATTCATCATATTGTGGGTAATGGGGAGGAAACCTATTTTTGGGAAGACCGGTGGGTGGGGGATAGACCCCTTTGTGTTGCATACCC
TCGCCTTTATCATTTATCTTCGATGAAGAACCATTTTGTGGCCGAGGTGTTGAACCCTTCGGGAAGCTCCATTTCTTTTTCCTTTGGCTTTTCTCGCTCCCTATCTGAGA
GGGAAACTTCTGATGTTATGTCTCTTCTTTCCTTGATAGAAGAGGTTTCCTTTAGACCGGGTAGGAGGGATAGTCGTGTTTGGAACCCAACCCCTCTAAGGGTTTCTCTT
GCCATTCGTTCTTTCACTGTCTTCTGGACCCTTCTCCTCCTCCTGAGAGATCGGTTTACTCTTTGGATGGCGGAGGAAGACCTCGACCATATTTTATGGCATTGTGACTT
TGCGCGCTCGGTTTGGAACAATTTGTTTGAGATTTTTGAGCTTCAATTCACTAGACACAGGGATCTCAGGGAGATGATCGATGAGTTCCTTCCCCATCCTCCTTTTCGGG
ACCAAGGAAATTTCTTATGGCAAGCTGGGATTTGTGCTATCATTTGGGTTTTGTGGGGGGAGAGAAATAACAGAATTTTTAGAGGGAAGGAGAGGAATGCGGAGGATGTT
TGGGACACTATTAAATATTCTGTCTCTTTATGGGCGTCGGTGACGCGTTTATTTTATCGGCGAGTGGGCTACGTTTTGAGCTGGGCAGTGACAAAAAAATCCTCCGGTGA
CAGACCAAAACGGCGGCGGCGACGGCGGCGGAATCTGGTGGTGGCGGCGGCGGTTTCAAGCAGCAAACGTTTTGGGCAACCCAAAACTCTCAAAAGTGTGTTTTATACAG
TCATACCTTTAAAGGAATTGGAGCCAACGAGCATGCATATGACCAATTTTCTTTTGAGATTGAAGAAACTTCAAAGAAAAGTGATTGATAGTAAAATAAATTCATCGGCA
AAAGTGGAAGAGCTTCGTTGTTGGAAAGTTCGATTAGAGGCTCCTTCTTTGAGGTTGTTGTTGTTGTTTGTCCCACGTAGTTCTCCTTCGTGTCAGTTGGATTATGGCTT
GTTGAACATCAAGGAGTTGCTTCTTTATGGCCTTGGAACTTCTAAGCAACTGGCTGGTAACTCTAAAACTAATAATCTTTACAAAGGAGATTGCCTTGAGGAAAAAACAA
GATTACAATCAAGAAGCTTTTTGGAGGTCGGCGCCCGTGTTCTGGCTCTGGGGGGGCCAAGGGCAAGGAAAAAGAACCGATATAGCTTCGCTTCCTTACCTTACTTCTGG
GGCTTGGAGACTTTTTGTGGGAGGGGGTCGACGAGGGAAGAGCGGAAGAGAGACGCAACTTGTGGGTTAGGAGGTGACTGTGAAGCTGTAGACCTTTGGGGGCTAGAATT
AGAGAACTTAAGTTATTTATGTGAAGAGTGGAAAGAGAGGTATGGAGATGATGATGAGGATCTTCCTGCAAAGTTCCTAGCCTTGCGATTTGTCCCCCTGGAGTTGCCAC
AGCAGGAAAATTCATTTGATTGTGGTCTCTTCTTACTTCATTATGTGGAACTTTTTTTGGACGGTGCGCCAATTAACTTCAACCCATTCAAAATCTCGAAGTTCTCAAAC
TTTCTAAGCCAGGATTGGTTCTCTCCTGCGGAGGCTTCTCTTAAACGTGCACATATCCTACAGTTAATTTATGAAATCATGGTCAGCGACCATGAAAAAGAATTTTCTGG
TAGCATTGGTAAATATTCTTCTTCCAATGTTTTTGACTCGGACAATGATTTATCTGGACAAGTGTATCTTGAAGGGGCACATACTTTCACAATGACTTGCCCTGACAAGT
TCTCAAGAGGTGGAAAAGAACTGGAAAATGAAATGTCCTCTCTAAAACGATTTAAAGAATTTGGATCGGTGTCTAAAGTATCATCTGACAGAAATTATCAACGAATAGGT
GGACAATCAAGGAGTGTCATGTCACCCATTGAGGAAAATGAAAATGGTGAAATGGCTGATTCACCACAATGTTTGGAAGATCGTCACCAAGCTTCTGCAATTGTTTCCGA
ATGTTCATCAGCGTCCAGTTTTGGCCAACAATTTAGAGAATTAGAAATATCTGGTGAGGGCAGATTTTCTAGAAATTTCGAAGATAAAGGTAGAAGACTGGCGTCTCCGC
CATCTCTTGGTGAGTCTCGAACCGTATCTGAATTGGGGCAAGATCGTTCACCTCAAGCAACCAAGAGACTGAACCATCCAACGGAGGCAGATGAACTAGAGATCTTATCC
ACCTCAAGTGAAGAACTTGCAACTTGTGTAGTAGAAGATTCAGAGGAGTATGTAGTTGAAGATCTAGAGGATTGTGTAGTTGAAGATTCAGAGGTGGCAAACGAAACGAA
TGATGGAATCGAAATTGAATCGGGCAGTACTATGCGTGATAGCAAGGAAATTGACGTTTCTTCCTCCTCAAGGAACAACTCGTTCCTACCGAGGCAAGTGGTTGAATCTA
CTGCAAACTTAGATGATAATAGACAACATGATCTGCTTGTAAGCAGTGAACATTCAACACATGATTCTAGCAAGCAGCATTCTGCTAAAAGGCCTGGAGTTTAG
Protein sequence
MRLLKSIDVVVEKELSSRLISKKLMTTSRGRIRATRGLRQGDPLSPFLFLLVVDTLSRLVSKGVEGMIIEGFEVGGNRILFLTSSLLMTRFSSALGSRWDIIRGPLLFGS
QWWARAPVTVCKSIEKLMRDFLWERVDEGRSAHLVSWETMGKSLSKGGLEVGNLRTRNKALLAKWLWRFSSESTTLWHRIIVSKYGRHPFEWVSGGFKGTSRNPWKEISA
ELPSLSLFIHHIVGNGEETYFWEDRWVGDRPLCVAYPRLYHLSSMKNHFVAEVLNPSGSSISFSFGFSRSLSERETSDVMSLLSLIEEVSFRPGRRDSRVWNPTPLRVSL
AIRSFTVFWTLLLLLRDRFTLWMAEEDLDHILWHCDFARSVWNNLFEIFELQFTRHRDLREMIDEFLPHPPFRDQGNFLWQAGICAIIWVLWGERNNRIFRGKERNAEDV
WDTIKYSVSLWASVTRLFYRRVGYVLSWAVTKKSSGDRPKRRRRRRRNLVVAAAVSSSKRFGQPKTLKSVFYTVIPLKELEPTSMHMTNFLLRLKKLQRKVIDSKINSSA
KVEELRCWKVRLEAPSLRLLLLFVPRSSPSCQLDYGLLNIKELLLYGLGTSKQLAGNSKTNNLYKGDCLEEKTRLQSRSFLEVGARVLALGGPRARKKNRYSFASLPYFW
GLETFCGRGSTREERKRDATCGLGGDCEAVDLWGLELENLSYLCEEWKERYGDDDEDLPAKFLALRFVPLELPQQENSFDCGLFLLHYVELFLDGAPINFNPFKISKFSN
FLSQDWFSPAEASLKRAHILQLIYEIMVSDHEKEFSGSIGKYSSSNVFDSDNDLSGQVYLEGAHTFTMTCPDKFSRGGKELENEMSSLKRFKEFGSVSKVSSDRNYQRIG
GQSRSVMSPIEENENGEMADSPQCLEDRHQASAIVSECSSASSFGQQFRELEISGEGRFSRNFEDKGRRLASPPSLGESRTVSELGQDRSPQATKRLNHPTEADELEILS
TSSEELATCVVEDSEEYVVEDLEDCVVEDSEVANETNDGIEIESGSTMRDSKEIDVSSSSRNNSFLPRQVVESTANLDDNRQHDLLVSSEHSTHDSSKQHSAKRPGV
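The protein sequence corresponds to the CDS read codon by codon. As a rough sketch of that relationship, assuming the standard nuclear genetic code (the record does not name the translation table used), the function below translates a CDS string until the first stop codon. The name `translate` and the use of `X` for unrecognised codons are illustrative choices, not CuGenDB conventions.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (TTT, TTC, TTA, TTG, TCT, ...). Assumed here, not stated by the record.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 18 bases of the CDS sequence listed in this record.
print(translate("ATGAGGCTATTGAAGAGT"))  # MRLLKS
```

Pasting the full CDS into `translate` and comparing the result with the protein sequence listed here is a quick consistency check on a downloaded record.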