; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g28760 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g28760
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUlp1-like peptidase
Genome locationchr4:21359090..21367361
RNA-Seq ExpressionMoc04g28760
SyntenyMoc04g28760
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domainsIPR003653 - Ulp1 protease family, C-terminal catalytic domain
IPR015410 - Domain of unknown function DUF1985
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022146372.1 uncharacterized protein LOC111015600 [Momordica charantia]2.9e-9164.72Show/hide
Query:  MDLRVILNRDDWFPAMLTNLAHVDKTTSRLKGRLTPTQLDMFRQTCFGPILDMDVVFNGPLIHHLLLREVEEPRQDIISFDLFGKRVSFGKREFDLITCL
        MDLR+IL+R+DWFPA LTNLAHVDKTT+R+K RLTPTQLDMFRQTCFGPILDM VVFNGPLIHHLLL EVEEPRQD+ISFDLF KRVSFGKREFDLIT L
Subjt:  MDLRVILNRDDWFPAMLTNLAHVDKTTSRLKGRLTPTQLDMFRQTCFGPILDMDVVFNGPLIHHLLLREVEEPRQDIISFDLFGKRVSFGKREFDLITCL

Query:  SHRMIRVDNDIPDRRLRARYFKDSVRVKCSELEKIFMEAVFDDDEDAVKVGIVYFVELAIMGKERKQFIDTTFLGVVDRINYRRTNRRREMTPHTKRLIV
        SH+M RV+N IP RRLRARYFKDSVRVKCSELEKIF+E +F DDED VKVGIVYF+ELA+MGKERKQFIDT  +GVVDR      +    M     R I 
Subjt:  SHRMIRVDNDIPDRRLRARYFKDSVRVKCSELEKIFMEAVFDDDEDAVKVGIVYFVELAIMGKERKQFIDTTFLGVVDRINYRRTNRRREMTPHTKRLIV

Query:  STGFRIGLRDDIDVESAHSLSDDAIPRLLRWSCTYS------RGFLTLLRDVFDNT--MVKEYLVSTNAEAEHMVRIMRPTEARAIPAPPAVPDPPAVPD
        S   +  L+D +   SA+     A P  +     Y       R    L  +VFDNT   VKE+L++T+AE +HMVR++ P E R IP PPAVPD   VPD
Subjt:  STGFRIGLRDDIDVESAHSLSDDAIPRLLRWSCTYS------RGFLTLLRDVFDNT--MVKEYLVSTNAEAEHMVRIMRPTEARAIPAPPAVPDPPAVPD

Query:  PTVVPAPAA
          VVP P A
Subjt:  PTVVPAPAA

XP_022153201.1 uncharacterized protein LOC111020757 [Momordica charantia]1.4e-14661.76Show/hide
Query:  MDLRVILNRDDWFPAMLTNLAHVDKTTSRLKGRLTPTQLDMFRQTCFGPILDMDVVFNGPLIHHLLLREVEEPRQDIISFDLFGKRVSFGKREFDLITCL
        MDLR+I++R+DWFPA LTNLAH+DKT++R+K RLTPTQLDMFRQTCFGPILD+DVVFNGPLIHHLLLREVEEPRQD+ISFDLFGKRVSFGKREFDLIT L
Subjt:  MDLRVILNRDDWFPAMLTNLAHVDKTTSRLKGRLTPTQLDMFRQTCFGPILDMDVVFNGPLIHHLLLREVEEPRQDIISFDLFGKRVSFGKREFDLITCL

Query:  SHRMIRVDNDIPDRRLRARYFKDSVRVKCSELEKIFMEAVFDDDEDAVKVGIVYFVELAIMGKERKQFIDTTFLGVVDR----INY-------------R
        SHRM RVDN IP RRLRARYFKD VRVKCSELEKIF+E VF DDED VKV IVYF+ELA+MGKERKQFIDT  LGVVDR     NY              
Subjt:  SHRMIRVDNDIPDRRLRARYFKDSVRVKCSELEKIFMEAVFDDDEDAVKVGIVYFVELAIMGKERKQFIDTTFLGVVDR----INY-------------R

Query:  RTNRRREMTPHTKRLI-----VSTGFRIGLRDDIDV---ESAHSLSDDAIPRLLRWSCTYSRGFLTLLRDVFDNT--MVKEYLVSTNAEAEHMVRIMRPT
        +   + +++ + ++       V T    G      V   E+  +LSDDAIPRLLRWSC YS GF  L  +VFDNT   VKE+L++T+A+ +HMVR++ P 
Subjt:  RTNRRREMTPHTKRLI-----VSTGFRIGLRDDIDV---ESAHSLSDDAIPRLLRWSCTYSRGFLTLLRDVFDNT--MVKEYLVSTNAEAEHMVRIMRPT

Query:  EARAIPAPPAVPDPPAVPDPTVVPAPAAVRNPPADLERGAEERRVKDKGKNIIDDPVEEAETLDDVALQGPALDDAGPRGNDSEALQKRSKRKKFKNKIS
        E R IP PPAVPD   VPDP   P  AAV +PPAD+E G             ++DPV +A           A+D+A P  ND E L+KR K+ KFK +IS
Subjt:  EARAIPAPPAVPDPPAVPDPTVVPAPAAVRNPPADLERGAEERRVKDKGKNIIDDPVEEAETLDDVALQGPALDDAGPRGNDSEALQKRSKRKKFKNKIS

Query:  RRLKRLDDRVGAIEATLTGFGVALKGIQRYLKKMSKGKFPDPTKYFGRGGGPDDDDPSDQRPDEA--PTRGPKSMDEDRRPEAVPKTDE
        RRLKRLD+ VGAIE  L  FGVALKGIQ YLKK++KGKFPD +KYFG GGGPDDD PSDQRPDE+  P  G KSMDED+R +   +TDE
Subjt:  RRLKRLDDRVGAIEATLTGFGVALKGIQRYLKKMSKGKFPDPTKYFGRGGGPDDDDPSDQRPDEA--PTRGPKSMDEDRRPEAVPKTDE

XP_022156802.1 uncharacterized protein LOC111023635 [Momordica charantia]2.9e-11548.3Show/hide
Query:  MHRLFVIYDDKWNDVGKMYEGGVTGRLDVDVTITYVNLVSALHMLTRIDPDQFDLVILCVYKFDFQYE--VPNYLIF-------DDTSLRFYLNNPPDSS
        M RLFVIY  KWN+ G +YEGG  G LDVD TITY NLVSALHMLTRID DQFDL++ CVY   F+      N +         DD + + Y N      
Subjt:  MHRLFVIYDDKWNDVGKMYEGGVTGRLDVDVTITYVNLVSALHMLTRIDPDQFDLVILCVYKFDFQYE--VPNYLIF-------DDTSLRFYLNNPPDSS

Query:  QNNDESDYDEYHTEYGVEDEDSKAEIENDGGNGYDNGDEDVY------EDHYENESQPIHESHTVSGNAPCQTVEGVVSRPMQNIITGNEVDYVGKLAVK
        Q++DE DYDEY TE GVED         DGGNGYDNG+ED Y      ++HYENE Q +HESHTVSGNAP QTVE VV RPMQNIITGN  D++G++AVK
Subjt:  QNNDESDYDEYHTEYGVEDEDSKAEIENDGGNGYDNGDEDVY------EDHYENESQPIHESHTVSGNAPCQTVEGVVSRPMQNIITGNEVDYVGKLAVK

Query:  GIFHSNEELRFKLSVLTMKLNFEF-------------------------KSIRGGDSFIISMFTDVHKCKREILNHDHRQARSWVVGQLLKFNLDDVSRQ
        GIF S EELRFKLSVL MKLNF+F                         KSI+GGDSFIIS F D HKCKRE+LNHDHRQARSWVVGQL+K N++DVSRQ
Subjt:  GIFHSNEELRFKLSVLTMKLNFEF-------------------------KSIRGGDSFIISMFTDVHKCKREILNHDHRQARSWVVGQLLKFNLDDVSRQ

Query:  YKPKDIINDMRKNYGVNI----------------------------------------------------------------------------------
        Y+PKDIINDMR+NYGVNI                                                                                  
Subjt:  YKPKDIINDMRKNYGVNI----------------------------------------------------------------------------------

Query:  -----------------------TFGIVDRESDQSWTWFLNM---------------------------VFPQAAHGICMQHLSTNLKDKFKDDVIQKMF
                                FG+VD+ES QSWTWFLNM                           VF Q  HGICMQHL TN+KDKFKDD +Q++F
Subjt:  -----------------------TFGIVDRESDQSWTWFLNM---------------------------VFPQAAHGICMQHLSTNLKDKFKDDVIQKMF

Query:  ILAAKECQKSEFRYYFSQLAGFPKVQRYLK
        ILAAK C+KSEFRYYFSQLAGFP+VQRYL+
Subjt:  ILAAKECQKSEFRYYFSQLAGFPKVQRYLK

XP_022156834.1 uncharacterized protein LOC111023667 [Momordica charantia]6.5e-12346.3Show/hide
Query:  MHRLFVIYDDKWNDVGKMYEGGVTGRLDVDVTITYVNLVSALHMLTRIDPDQFDLVILCVYKFDFQYEVPNYLIFDDTSLRFYLNNPPDSS---------
        M RLFVIY  KWN+ G MYEGGV G LDVD TITY NLVSALHMLTRIDPDQFDL++ CVY+FDF+YEVPNYLIFDD+SL+FYLN PPD S         
Subjt:  MHRLFVIYDDKWNDVGKMYEGGVTGRLDVDVTITYVNLVSALHMLTRIDPDQFDLVILCVYKFDFQYEVPNYLIFDDTSLRFYLNNPPDSS---------

Query:  -----------------------------------------------------------------------------------------QNNDESDYDEY
                                                                                                 Q++DE DYDEY
Subjt:  -----------------------------------------------------------------------------------------QNNDESDYDEY

Query:  HTEYGVEDEDSKAEIENDGGNGYDNGDEDVY------EDHYENESQPIHESHTVSGNAPCQTVEGVVSRPMQNIITGNEVDYVGKLAVKGIFHSNEELRF
         TE GV         E DGGNGYDNG+ED Y      ++HYENE Q +HESHT+SGNAP QTVE VVSRPMQNIITGN  D++G++AVKGIFHS  ELRF
Subjt:  HTEYGVEDEDSKAEIENDGGNGYDNGDEDVY------EDHYENESQPIHESHTVSGNAPCQTVEGVVSRPMQNIITGNEVDYVGKLAVKGIFHSNEELRF

Query:  KLSVLTMKLNFEFKSIRGGDSFIISMFTDVHKCKREILNHDHRQARSWVVGQLLKFNLDDVSRQYKPKDIINDMRKNYGVNI------------------
        KL           KSI+GGDSFIIS F DVHKCKRE+LNHDHRQARSWVVGQL+K NL+DVSRQYKPKDIINDMRKNYGVNI                  
Subjt:  KLSVLTMKLNFEFKSIRGGDSFIISMFTDVHKCKREILNHDHRQARSWVVGQLLKFNLDDVSRQYKPKDIINDMRKNYGVNI------------------

Query:  ---------------------------------------------------------------------------------------TFGIVDRESDQSW
                                                                                                F +VD+ESDQSW
Subjt:  ---------------------------------------------------------------------------------------TFGIVDRESDQSW

Query:  TWFLNM---------------------------VFPQAAHGICMQHLSTNLKDKFKDDVIQKMFILAAKECQKSEFRYYFSQLAGFPKVQRYLK
        TWFLNM                           VFPQ AH ICMQHLSTNLKDKFKDDV+Q+MFILA K C+KSEFRYYFSQLAGFP+VQRYL+
Subjt:  TWFLNM---------------------------VFPQAAHGICMQHLSTNLKDKFKDDVIQKMFILAAKECQKSEFRYYFSQLAGFPKVQRYLK

XP_022158807.1 uncharacterized protein LOC111025273 [Momordica charantia]2.7e-8965.4Show/hide
Query:  MDDPKTDGRLRSTATGFQKKEWYRDLLDPSVELKDEVLDGLVLFTAKKLEKCLHLCRKKFAIGDVLLSTLLNRTDGPYAAMKPGVLSTRINYPWREENTI
        MDDP TD   RST+ G + K W+  LLDP  +L DE +D L++ TA+K+EKC HL R +FAIGDVLLS LL RTDGPYAAMKPGVL ++  Y WR+E TI
Subjt:  MDDPKTDGRLRSTATGFQKKEWYRDLLDPSVELKDEVLDGLVLFTAKKLEKCLHLCRKKFAIGDVLLSTLLNRTDGPYAAMKPGVLSTRINYPWREENTI

Query:  WRYVHGRQSDHNVPWSDADIVYTPMNVGGNHWVMLGIDLVQGDITVWDSLQTATPLDELEKELKPMCTILPTLLHHGGIFSVRPDLPVVPWRVRRVRVPQ
        +RYV GRQSD++  WS+ADIVYT MN+GGNHWVM+GIDLV+GD+TVWDSLQ  TPL++LEK LKPMCTI+P +LH  GI ++RP+LP+VPWRVRR  VPQ
Subjt:  WRYVHGRQSDHNVPWSDADIVYTPMNVGGNHWVMLGIDLVQGDITVWDSLQTATPLDELEKELKPMCTILPTLLHHGGIFSVRPDLPVVPWRVRRVRVPQ

Query:  QSSVTDCGIFYVRYFEYDATGSNMDTLTQDNIVYFRR
        Q+  TDC IF VR+FEYD  GS +DTL Q NI  FRR
Subjt:  QSSVTDCGIFYVRYFEYDATGSNMDTLTQDNIVYFRR

TrEMBL top hitse value%identityAlignment
A0A6J1CZE8 uncharacterized protein LOC1110156001.4e-9164.72Show/hide
Query:  MDLRVILNRDDWFPAMLTNLAHVDKTTSRLKGRLTPTQLDMFRQTCFGPILDMDVVFNGPLIHHLLLREVEEPRQDIISFDLFGKRVSFGKREFDLITCL
        MDLR+IL+R+DWFPA LTNLAHVDKTT+R+K RLTPTQLDMFRQTCFGPILDM VVFNGPLIHHLLL EVEEPRQD+ISFDLF KRVSFGKREFDLIT L
Subjt:  MDLRVILNRDDWFPAMLTNLAHVDKTTSRLKGRLTPTQLDMFRQTCFGPILDMDVVFNGPLIHHLLLREVEEPRQDIISFDLFGKRVSFGKREFDLITCL

Query:  SHRMIRVDNDIPDRRLRARYFKDSVRVKCSELEKIFMEAVFDDDEDAVKVGIVYFVELAIMGKERKQFIDTTFLGVVDRINYRRTNRRREMTPHTKRLIV
        SH+M RV+N IP RRLRARYFKDSVRVKCSELEKIF+E +F DDED VKVGIVYF+ELA+MGKERKQFIDT  +GVVDR      +    M     R I 
Subjt:  SHRMIRVDNDIPDRRLRARYFKDSVRVKCSELEKIFMEAVFDDDEDAVKVGIVYFVELAIMGKERKQFIDTTFLGVVDRINYRRTNRRREMTPHTKRLIV

Query:  STGFRIGLRDDIDVESAHSLSDDAIPRLLRWSCTYS------RGFLTLLRDVFDNT--MVKEYLVSTNAEAEHMVRIMRPTEARAIPAPPAVPDPPAVPD
        S   +  L+D +   SA+     A P  +     Y       R    L  +VFDNT   VKE+L++T+AE +HMVR++ P E R IP PPAVPD   VPD
Subjt:  STGFRIGLRDDIDVESAHSLSDDAIPRLLRWSCTYS------RGFLTLLRDVFDNT--MVKEYLVSTNAEAEHMVRIMRPTEARAIPAPPAVPDPPAVPD

Query:  PTVVPAPAA
          VVP P A
Subjt:  PTVVPAPAA

A0A6J1DJX9 uncharacterized protein LOC1110207576.9e-14761.76Show/hide
Query:  MDLRVILNRDDWFPAMLTNLAHVDKTTSRLKGRLTPTQLDMFRQTCFGPILDMDVVFNGPLIHHLLLREVEEPRQDIISFDLFGKRVSFGKREFDLITCL
        MDLR+I++R+DWFPA LTNLAH+DKT++R+K RLTPTQLDMFRQTCFGPILD+DVVFNGPLIHHLLLREVEEPRQD+ISFDLFGKRVSFGKREFDLIT L
Subjt:  MDLRVILNRDDWFPAMLTNLAHVDKTTSRLKGRLTPTQLDMFRQTCFGPILDMDVVFNGPLIHHLLLREVEEPRQDIISFDLFGKRVSFGKREFDLITCL

Query:  SHRMIRVDNDIPDRRLRARYFKDSVRVKCSELEKIFMEAVFDDDEDAVKVGIVYFVELAIMGKERKQFIDTTFLGVVDR----INY-------------R
        SHRM RVDN IP RRLRARYFKD VRVKCSELEKIF+E VF DDED VKV IVYF+ELA+MGKERKQFIDT  LGVVDR     NY              
Subjt:  SHRMIRVDNDIPDRRLRARYFKDSVRVKCSELEKIFMEAVFDDDEDAVKVGIVYFVELAIMGKERKQFIDTTFLGVVDR----INY-------------R

Query:  RTNRRREMTPHTKRLI-----VSTGFRIGLRDDIDV---ESAHSLSDDAIPRLLRWSCTYSRGFLTLLRDVFDNT--MVKEYLVSTNAEAEHMVRIMRPT
        +   + +++ + ++       V T    G      V   E+  +LSDDAIPRLLRWSC YS GF  L  +VFDNT   VKE+L++T+A+ +HMVR++ P 
Subjt:  RTNRRREMTPHTKRLI-----VSTGFRIGLRDDIDV---ESAHSLSDDAIPRLLRWSCTYSRGFLTLLRDVFDNT--MVKEYLVSTNAEAEHMVRIMRPT

Query:  EARAIPAPPAVPDPPAVPDPTVVPAPAAVRNPPADLERGAEERRVKDKGKNIIDDPVEEAETLDDVALQGPALDDAGPRGNDSEALQKRSKRKKFKNKIS
        E R IP PPAVPD   VPDP   P  AAV +PPAD+E G             ++DPV +A           A+D+A P  ND E L+KR K+ KFK +IS
Subjt:  EARAIPAPPAVPDPPAVPDPTVVPAPAAVRNPPADLERGAEERRVKDKGKNIIDDPVEEAETLDDVALQGPALDDAGPRGNDSEALQKRSKRKKFKNKIS

Query:  RRLKRLDDRVGAIEATLTGFGVALKGIQRYLKKMSKGKFPDPTKYFGRGGGPDDDDPSDQRPDEA--PTRGPKSMDEDRRPEAVPKTDE
        RRLKRLD+ VGAIE  L  FGVALKGIQ YLKK++KGKFPD +KYFG GGGPDDD PSDQRPDE+  P  G KSMDED+R +   +TDE
Subjt:  RRLKRLDDRVGAIEATLTGFGVALKGIQRYLKKMSKGKFPDPTKYFGRGGGPDDDDPSDQRPDEA--PTRGPKSMDEDRRPEAVPKTDE

A0A6J1DSY0 uncharacterized protein LOC1110236351.4e-11548.3Show/hide
Query:  MHRLFVIYDDKWNDVGKMYEGGVTGRLDVDVTITYVNLVSALHMLTRIDPDQFDLVILCVYKFDFQYE--VPNYLIF-------DDTSLRFYLNNPPDSS
        M RLFVIY  KWN+ G +YEGG  G LDVD TITY NLVSALHMLTRID DQFDL++ CVY   F+      N +         DD + + Y N      
Subjt:  MHRLFVIYDDKWNDVGKMYEGGVTGRLDVDVTITYVNLVSALHMLTRIDPDQFDLVILCVYKFDFQYE--VPNYLIF-------DDTSLRFYLNNPPDSS

Query:  QNNDESDYDEYHTEYGVEDEDSKAEIENDGGNGYDNGDEDVY------EDHYENESQPIHESHTVSGNAPCQTVEGVVSRPMQNIITGNEVDYVGKLAVK
        Q++DE DYDEY TE GVED         DGGNGYDNG+ED Y      ++HYENE Q +HESHTVSGNAP QTVE VV RPMQNIITGN  D++G++AVK
Subjt:  QNNDESDYDEYHTEYGVEDEDSKAEIENDGGNGYDNGDEDVY------EDHYENESQPIHESHTVSGNAPCQTVEGVVSRPMQNIITGNEVDYVGKLAVK

Query:  GIFHSNEELRFKLSVLTMKLNFEF-------------------------KSIRGGDSFIISMFTDVHKCKREILNHDHRQARSWVVGQLLKFNLDDVSRQ
        GIF S EELRFKLSVL MKLNF+F                         KSI+GGDSFIIS F D HKCKRE+LNHDHRQARSWVVGQL+K N++DVSRQ
Subjt:  GIFHSNEELRFKLSVLTMKLNFEF-------------------------KSIRGGDSFIISMFTDVHKCKREILNHDHRQARSWVVGQLLKFNLDDVSRQ

Query:  YKPKDIINDMRKNYGVNI----------------------------------------------------------------------------------
        Y+PKDIINDMR+NYGVNI                                                                                  
Subjt:  YKPKDIINDMRKNYGVNI----------------------------------------------------------------------------------

Query:  -----------------------TFGIVDRESDQSWTWFLNM---------------------------VFPQAAHGICMQHLSTNLKDKFKDDVIQKMF
                                FG+VD+ES QSWTWFLNM                           VF Q  HGICMQHL TN+KDKFKDD +Q++F
Subjt:  -----------------------TFGIVDRESDQSWTWFLNM---------------------------VFPQAAHGICMQHLSTNLKDKFKDDVIQKMF

Query:  ILAAKECQKSEFRYYFSQLAGFPKVQRYLK
        ILAAK C+KSEFRYYFSQLAGFP+VQRYL+
Subjt:  ILAAKECQKSEFRYYFSQLAGFPKVQRYLK

A0A6J1DUS4 uncharacterized protein LOC1110236673.1e-12346.3Show/hide
Query:  MHRLFVIYDDKWNDVGKMYEGGVTGRLDVDVTITYVNLVSALHMLTRIDPDQFDLVILCVYKFDFQYEVPNYLIFDDTSLRFYLNNPPDSS---------
        M RLFVIY  KWN+ G MYEGGV G LDVD TITY NLVSALHMLTRIDPDQFDL++ CVY+FDF+YEVPNYLIFDD+SL+FYLN PPD S         
Subjt:  MHRLFVIYDDKWNDVGKMYEGGVTGRLDVDVTITYVNLVSALHMLTRIDPDQFDLVILCVYKFDFQYEVPNYLIFDDTSLRFYLNNPPDSS---------

Query:  -----------------------------------------------------------------------------------------QNNDESDYDEY
                                                                                                 Q++DE DYDEY
Subjt:  -----------------------------------------------------------------------------------------QNNDESDYDEY

Query:  HTEYGVEDEDSKAEIENDGGNGYDNGDEDVY------EDHYENESQPIHESHTVSGNAPCQTVEGVVSRPMQNIITGNEVDYVGKLAVKGIFHSNEELRF
         TE GV         E DGGNGYDNG+ED Y      ++HYENE Q +HESHT+SGNAP QTVE VVSRPMQNIITGN  D++G++AVKGIFHS  ELRF
Subjt:  HTEYGVEDEDSKAEIENDGGNGYDNGDEDVY------EDHYENESQPIHESHTVSGNAPCQTVEGVVSRPMQNIITGNEVDYVGKLAVKGIFHSNEELRF

Query:  KLSVLTMKLNFEFKSIRGGDSFIISMFTDVHKCKREILNHDHRQARSWVVGQLLKFNLDDVSRQYKPKDIINDMRKNYGVNI------------------
        KL           KSI+GGDSFIIS F DVHKCKRE+LNHDHRQARSWVVGQL+K NL+DVSRQYKPKDIINDMRKNYGVNI                  
Subjt:  KLSVLTMKLNFEFKSIRGGDSFIISMFTDVHKCKREILNHDHRQARSWVVGQLLKFNLDDVSRQYKPKDIINDMRKNYGVNI------------------

Query:  ---------------------------------------------------------------------------------------TFGIVDRESDQSW
                                                                                                F +VD+ESDQSW
Subjt:  ---------------------------------------------------------------------------------------TFGIVDRESDQSW

Query:  TWFLNM---------------------------VFPQAAHGICMQHLSTNLKDKFKDDVIQKMFILAAKECQKSEFRYYFSQLAGFPKVQRYLK
        TWFLNM                           VFPQ AH ICMQHLSTNLKDKFKDDV+Q+MFILA K C+KSEFRYYFSQLAGFP+VQRYL+
Subjt:  TWFLNM---------------------------VFPQAAHGICMQHLSTNLKDKFKDDVIQKMFILAAKECQKSEFRYYFSQLAGFPKVQRYLK

A0A6J1DY60 uncharacterized protein LOC1110252731.3e-8965.4Show/hide
Query:  MDDPKTDGRLRSTATGFQKKEWYRDLLDPSVELKDEVLDGLVLFTAKKLEKCLHLCRKKFAIGDVLLSTLLNRTDGPYAAMKPGVLSTRINYPWREENTI
        MDDP TD   RST+ G + K W+  LLDP  +L DE +D L++ TA+K+EKC HL R +FAIGDVLLS LL RTDGPYAAMKPGVL ++  Y WR+E TI
Subjt:  MDDPKTDGRLRSTATGFQKKEWYRDLLDPSVELKDEVLDGLVLFTAKKLEKCLHLCRKKFAIGDVLLSTLLNRTDGPYAAMKPGVLSTRINYPWREENTI

Query:  WRYVHGRQSDHNVPWSDADIVYTPMNVGGNHWVMLGIDLVQGDITVWDSLQTATPLDELEKELKPMCTILPTLLHHGGIFSVRPDLPVVPWRVRRVRVPQ
        +RYV GRQSD++  WS+ADIVYT MN+GGNHWVM+GIDLV+GD+TVWDSLQ  TPL++LEK LKPMCTI+P +LH  GI ++RP+LP+VPWRVRR  VPQ
Subjt:  WRYVHGRQSDHNVPWSDADIVYTPMNVGGNHWVMLGIDLVQGDITVWDSLQTATPLDELEKELKPMCTILPTLLHHGGIFSVRPDLPVVPWRVRRVRVPQ

Query:  QSSVTDCGIFYVRYFEYDATGSNMDTLTQDNIVYFRR
        Q+  TDC IF VR+FEYD  GS +DTL Q NI  FRR
Subjt:  QSSVTDCGIFYVRYFEYDATGSNMDTLTQDNIVYFRR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G08430.1 Ulp1 protease family protein1.0e-0930.77Show/hide
Query:  DADIVYTPMNVGGNHWVMLGIDLVQGDITVWDSLQTATPLDELEKELKPMCTILPTLLHHG-GIFSVRPDLPVVPWRVRRVRVPQQSSVTDCGIFYVRYF
        D D +Y  + V GNHWV L IDL +  I V+DS+ + T   E+  +   + T++P +L         R     + W+ R  ++P+     DC I+ ++Y 
Subjt:  DADIVYTPMNVGGNHWVMLGIDLVQGDITVWDSLQTATPLDELEKELKPMCTILPTLLHHG-GIFSVRPDLPVVPWRVRRVRVPQQSSVTDCGIFYVRYF

Query:  EYDATGSNMDTLTQDNI
        E  A G + D L  +N+
Subjt:  EYDATGSNMDTLTQDNI

AT5G28235.1 Ulp1 protease family protein4.9e-0436.21Show/hide
Query:  DADIVYTPMNVGGNHWVMLGIDLVQGDITVWDSLQTATPLDELEKELKPMCTILPTLL
        D D +Y  + V GNHWV L IDL +  + V+DS+ + T   E+  +   + T++P +L
Subjt:  DADIVYTPMNVGGNHWVMLGIDLVQGDITVWDSLQTATPLDELEKELKPMCTILPTLL

AT5G45570.1 Ulp1 protease family protein1.3e-0929.13Show/hide
Query:  DADIVYTPMNVGGNHWVMLGIDLVQGDITVWDSLQTATPLDELEKELKPMCTILPTLLHHG-GIFSVRPDLPVVPWRVRRVRVPQQSSVTDCGIFYVRYF
        D D +Y  + V GNHWV L IDL    + V+DS+ + T   E+  +   + T++P +L         R     + W+ R  ++P+     DC I+ ++Y 
Subjt:  DADIVYTPMNVGGNHWVMLGIDLVQGDITVWDSLQTATPLDELEKELKPMCTILPTLLHHG-GIFSVRPDLPVVPWRVRRVRVPQQSSVTDCGIFYVRYF

Query:  EYDATGSNMDTLTQDNIVYFRRHIIHE
        E  A G + D L  +N+   R  +  E
Subjt:  EYDATGSNMDTLTQDNIVYFRRHIIHE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCATCGACTATTTGTGATCTACGATGACAAGTGGAACGATGTTGGTAAGATGTACGAAGGTGGTGTTACGGGTAGGCTAGATGTTGATGTAACAATAACATATGTCAA
CCTAGTAAGTGCTTTGCACATGCTTACTAGAATTGATCCTGACCAATTTGATCTTGTTATACTGTGTGTATATAAATTTGATTTTCAGTACGAAGTCCCGAATTATCTGA
TATTTGATGATACCAGTTTGAGATTCTATTTGAATAATCCTCCCGATTCATCACAAAACAATGATGAGTCTGATTATGATGAATATCATACTGAGTATGGGGTTGAGGAC
GAGGATAGTAAAGCAGAGATTGAGAACGATGGTGGGAATGGGTATGACAATGGGGACGAGGACGTATATGAAGACCATTATGAGAATGAAAGTCAACCTATTCATGAGTC
GCATACAGTGAGTGGGAATGCACCTTGCCAAACAGTTGAAGGAGTAGTTTCACGTCCGATGCAGAATATCATAACTGGCAATGAGGTAGACTATGTAGGTAAACTTGCTG
TTAAGGGTATTTTCCATTCAAATGAAGAACTACGCTTCAAGCTGTCTGTGTTAACTATGAAGCTTAATTTTGAATTTAAGAGTATCCGAGGTGGTGATTCATTCATCATT
TCGATGTTTACTGATGTTCACAAGTGCAAACGTGAGATACTGAATCATGACCACAGGCAAGCTCGAAGTTGGGTGGTTGGTCAGTTATTAAAGTTCAATCTCGATGATGT
AAGCCGTCAGTACAAACCTAAGGACATTATTAACGACATGCGAAAAAACTATGGGGTGAACATTACGTTTGGAATTGTCGATCGGGAGAGTGATCAATCCTGGACTTGGT
TTCTGAATATGGTCTTTCCTCAAGCGGCTCACGGAATATGCATGCAGCACTTGTCCACGAACCTGAAGGACAAATTCAAGGACGATGTCATTCAAAAAATGTTCATATTA
GCAGCAAAGGAATGCCAGAAGTCAGAGTTTAGGTACTATTTTTCCCAACTAGCAGGATTTCCAAAGGTCCAAAGGTACTTGAAGGAATCGGTTTTGAAAAATGGATTCGT
GCATTTCAACCAGGATAAGATCCGGGATGGAAGACTTTCACAACATAATGCATTCTGCCTTCCATCCCGGACCTTATCCCGGATCATTTCATTCCGGGATGAAAGGCAGA
GAATGAAACCCCATTTTCTGCCTTTCATCCCGGAGGCGAAATGGTCCGGGATACGGTCTGGGATGGAAGGCAGAATGCAGACACTTCGAATTTTTTTTTTAAGACATATA
ATGGATTTGAGAGTAATTCTAAACCGTGATGACTGGTTTCCGGCCATGTTGACTAACCTTGCCCATGTAGATAAAACCACTTCTAGGCTGAAGGGTAGGTTAACCCCAAC
CCAGTTAGACATGTTTAGGCAAACGTGTTTCGGTCCCATTTTGGACATGGACGTAGTTTTTAACGGTCCATTAATACATCATCTATTGTTGAGAGAGGTTGAAGAGCCTA
GGCAGGACATCATTAGTTTCGACCTGTTTGGGAAAAGGGTCTCCTTTGGTAAGCGGGAGTTTGACCTAATCACCTGCCTCAGTCATAGGATGATTAGGGTAGATAACGAT
ATTCCTGACCGACGACTTCGAGCACGTTACTTTAAGGATAGTGTCAGGGTTAAGTGTAGTGAGTTAGAGAAGATTTTTATGGAGGCAGTTTTTGACGATGATGAGGATGC
TGTCAAGGTTGGCATAGTTTACTTCGTCGAGCTTGCCATAATGGGGAAGGAGAGGAAGCAGTTTATAGATACGACCTTTTTAGGGGTTGTGGATAGGATAAACTACCGGC
GTACCAACAGAAGGCGAGAAATGACCCCACACACCAAGAGACTTATAGTCTCTACGGGTTTCCGTATCGGCTTACGAGACGATATCGACGTTGAGTCTGCGCATAGCCTG
AGCGACGACGCCATTCCTCGACTCCTTAGGTGGTCGTGCACTTATTCTCGTGGGTTTCTTACTCTGCTAAGAGATGTGTTCGATAACACGATGGTTAAGGAATATTTGGT
TTCGACTAATGCTGAGGCAGAACACATGGTCCGTATCATGCGTCCAACGGAAGCCCGCGCTATACCTGCCCCGCCGGCTGTACCTGACCCGCCTGCAGTACCTGACCCGA
CTGTTGTACCTGCCCCGGCTGCAGTACGTAACCCGCCTGCAGATTTGGAAAGGGGTGCTGAGGAAAGAAGGGTGAAGGACAAAGGAAAAAACATCATAGATGATCCGGTA
GAAGAGGCCGAGACATTGGACGATGTTGCATTACAGGGTCCTGCATTAGACGATGCTGGACCCAGGGGAAATGACAGCGAAGCCCTACAGAAGAGGTCGAAACGGAAAAA
ATTCAAAAATAAGATCAGTAGAAGGTTGAAGAGGCTCGATGACCGAGTTGGTGCTATCGAGGCCACACTGACTGGCTTCGGGGTCGCCCTAAAAGGTATCCAGAGATACC
TTAAGAAAATGTCGAAGGGTAAATTCCCTGATCCGACCAAATATTTTGGTCGTGGGGGTGGGCCCGATGATGATGATCCATCGGATCAAAGGCCTGATGAGGCCCCAACA
CGAGGTCCGAAGAGTATGGACGAGGACCGGAGGCCGGAAGCGGTCCCTAAGACTGACGAGTATCAGACCATGGACGATAATCTGAAGAGTATGGATGAGGATCCGATGTT
TATGGTTGAAGACCAGGGTACGATAACGGAGCGGGACAATGCATCGGATGCTTACCCCGATCGTCCTGTCGGTTTGTTTCAGGATGCCACTGTTGGAATGCAAGAGCCGG
ACGTTGCATCAGATACGCGACCCGTCAGCCGACGCGTTAGGCGTCCCTATAAGGACTGGGCACCGGACGCAGTCATTAAGCGTGGGATCACCGTGGATAGATACGACCCA
GTATGTCTCATTCCACCGCAGTTGGACGACAAGTTCCAGAGATGGATGGATGACCCGAAGACGGACGGGAGATTGCGGTCCACTGCAACTGGTTTCCAAAAGAAGGAATG
GTATCGCGATCTATTGGACCCTAGTGTTGAATTGAAGGACGAAGTACTTGATGGTCTCGTCCTGTTTACAGCGAAAAAGTTGGAGAAGTGTCTCCATCTATGTCGCAAGA
AGTTTGCGATAGGCGACGTACTTCTTTCGACTCTGCTGAATCGGACAGACGGTCCATATGCGGCCATGAAGCCGGGTGTATTGTCCACTAGGATCAACTACCCCTGGCGC
GAGGAGAATACAATCTGGCGATATGTCCACGGTAGGCAGTCGGACCACAACGTGCCCTGGAGTGATGCAGACATCGTGTACACCCCCATGAACGTAGGCGGGAACCACTG
GGTGATGCTCGGGATCGACCTTGTACAGGGCGACATAACCGTATGGGATTCACTCCAAACGGCCACTCCACTGGATGAACTTGAGAAGGAGTTGAAGCCCATGTGTACAA
TCCTACCTACGCTGCTGCATCATGGCGGGATATTTTCAGTTCGACCCGACTTGCCAGTGGTGCCGTGGAGGGTACGTCGGGTTCGCGTACCACAGCAGAGTAGCGTGACT
GATTGCGGGATTTTCTATGTCCGGTATTTCGAGTACGATGCCACTGGGTCAAATATGGACACTTTAACCCAAGATAATATTGTATATTTTAGGCGTCACATCATACATGA
ATGA
mRNA sequenceShow/hide mRNA sequence
ATGCATCGACTATTTGTGATCTACGATGACAAGTGGAACGATGTTGGTAAGATGTACGAAGGTGGTGTTACGGGTAGGCTAGATGTTGATGTAACAATAACATATGTCAA
CCTAGTAAGTGCTTTGCACATGCTTACTAGAATTGATCCTGACCAATTTGATCTTGTTATACTGTGTGTATATAAATTTGATTTTCAGTACGAAGTCCCGAATTATCTGA
TATTTGATGATACCAGTTTGAGATTCTATTTGAATAATCCTCCCGATTCATCACAAAACAATGATGAGTCTGATTATGATGAATATCATACTGAGTATGGGGTTGAGGAC
GAGGATAGTAAAGCAGAGATTGAGAACGATGGTGGGAATGGGTATGACAATGGGGACGAGGACGTATATGAAGACCATTATGAGAATGAAAGTCAACCTATTCATGAGTC
GCATACAGTGAGTGGGAATGCACCTTGCCAAACAGTTGAAGGAGTAGTTTCACGTCCGATGCAGAATATCATAACTGGCAATGAGGTAGACTATGTAGGTAAACTTGCTG
TTAAGGGTATTTTCCATTCAAATGAAGAACTACGCTTCAAGCTGTCTGTGTTAACTATGAAGCTTAATTTTGAATTTAAGAGTATCCGAGGTGGTGATTCATTCATCATT
TCGATGTTTACTGATGTTCACAAGTGCAAACGTGAGATACTGAATCATGACCACAGGCAAGCTCGAAGTTGGGTGGTTGGTCAGTTATTAAAGTTCAATCTCGATGATGT
AAGCCGTCAGTACAAACCTAAGGACATTATTAACGACATGCGAAAAAACTATGGGGTGAACATTACGTTTGGAATTGTCGATCGGGAGAGTGATCAATCCTGGACTTGGT
TTCTGAATATGGTCTTTCCTCAAGCGGCTCACGGAATATGCATGCAGCACTTGTCCACGAACCTGAAGGACAAATTCAAGGACGATGTCATTCAAAAAATGTTCATATTA
GCAGCAAAGGAATGCCAGAAGTCAGAGTTTAGGTACTATTTTTCCCAACTAGCAGGATTTCCAAAGGTCCAAAGGTACTTGAAGGAATCGGTTTTGAAAAATGGATTCGT
GCATTTCAACCAGGATAAGATCCGGGATGGAAGACTTTCACAACATAATGCATTCTGCCTTCCATCCCGGACCTTATCCCGGATCATTTCATTCCGGGATGAAAGGCAGA
GAATGAAACCCCATTTTCTGCCTTTCATCCCGGAGGCGAAATGGTCCGGGATACGGTCTGGGATGGAAGGCAGAATGCAGACACTTCGAATTTTTTTTTTAAGACATATA
ATGGATTTGAGAGTAATTCTAAACCGTGATGACTGGTTTCCGGCCATGTTGACTAACCTTGCCCATGTAGATAAAACCACTTCTAGGCTGAAGGGTAGGTTAACCCCAAC
CCAGTTAGACATGTTTAGGCAAACGTGTTTCGGTCCCATTTTGGACATGGACGTAGTTTTTAACGGTCCATTAATACATCATCTATTGTTGAGAGAGGTTGAAGAGCCTA
GGCAGGACATCATTAGTTTCGACCTGTTTGGGAAAAGGGTCTCCTTTGGTAAGCGGGAGTTTGACCTAATCACCTGCCTCAGTCATAGGATGATTAGGGTAGATAACGAT
ATTCCTGACCGACGACTTCGAGCACGTTACTTTAAGGATAGTGTCAGGGTTAAGTGTAGTGAGTTAGAGAAGATTTTTATGGAGGCAGTTTTTGACGATGATGAGGATGC
TGTCAAGGTTGGCATAGTTTACTTCGTCGAGCTTGCCATAATGGGGAAGGAGAGGAAGCAGTTTATAGATACGACCTTTTTAGGGGTTGTGGATAGGATAAACTACCGGC
GTACCAACAGAAGGCGAGAAATGACCCCACACACCAAGAGACTTATAGTCTCTACGGGTTTCCGTATCGGCTTACGAGACGATATCGACGTTGAGTCTGCGCATAGCCTG
AGCGACGACGCCATTCCTCGACTCCTTAGGTGGTCGTGCACTTATTCTCGTGGGTTTCTTACTCTGCTAAGAGATGTGTTCGATAACACGATGGTTAAGGAATATTTGGT
TTCGACTAATGCTGAGGCAGAACACATGGTCCGTATCATGCGTCCAACGGAAGCCCGCGCTATACCTGCCCCGCCGGCTGTACCTGACCCGCCTGCAGTACCTGACCCGA
CTGTTGTACCTGCCCCGGCTGCAGTACGTAACCCGCCTGCAGATTTGGAAAGGGGTGCTGAGGAAAGAAGGGTGAAGGACAAAGGAAAAAACATCATAGATGATCCGGTA
GAAGAGGCCGAGACATTGGACGATGTTGCATTACAGGGTCCTGCATTAGACGATGCTGGACCCAGGGGAAATGACAGCGAAGCCCTACAGAAGAGGTCGAAACGGAAAAA
ATTCAAAAATAAGATCAGTAGAAGGTTGAAGAGGCTCGATGACCGAGTTGGTGCTATCGAGGCCACACTGACTGGCTTCGGGGTCGCCCTAAAAGGTATCCAGAGATACC
TTAAGAAAATGTCGAAGGGTAAATTCCCTGATCCGACCAAATATTTTGGTCGTGGGGGTGGGCCCGATGATGATGATCCATCGGATCAAAGGCCTGATGAGGCCCCAACA
CGAGGTCCGAAGAGTATGGACGAGGACCGGAGGCCGGAAGCGGTCCCTAAGACTGACGAGTATCAGACCATGGACGATAATCTGAAGAGTATGGATGAGGATCCGATGTT
TATGGTTGAAGACCAGGGTACGATAACGGAGCGGGACAATGCATCGGATGCTTACCCCGATCGTCCTGTCGGTTTGTTTCAGGATGCCACTGTTGGAATGCAAGAGCCGG
ACGTTGCATCAGATACGCGACCCGTCAGCCGACGCGTTAGGCGTCCCTATAAGGACTGGGCACCGGACGCAGTCATTAAGCGTGGGATCACCGTGGATAGATACGACCCA
GTATGTCTCATTCCACCGCAGTTGGACGACAAGTTCCAGAGATGGATGGATGACCCGAAGACGGACGGGAGATTGCGGTCCACTGCAACTGGTTTCCAAAAGAAGGAATG
GTATCGCGATCTATTGGACCCTAGTGTTGAATTGAAGGACGAAGTACTTGATGGTCTCGTCCTGTTTACAGCGAAAAAGTTGGAGAAGTGTCTCCATCTATGTCGCAAGA
AGTTTGCGATAGGCGACGTACTTCTTTCGACTCTGCTGAATCGGACAGACGGTCCATATGCGGCCATGAAGCCGGGTGTATTGTCCACTAGGATCAACTACCCCTGGCGC
GAGGAGAATACAATCTGGCGATATGTCCACGGTAGGCAGTCGGACCACAACGTGCCCTGGAGTGATGCAGACATCGTGTACACCCCCATGAACGTAGGCGGGAACCACTG
GGTGATGCTCGGGATCGACCTTGTACAGGGCGACATAACCGTATGGGATTCACTCCAAACGGCCACTCCACTGGATGAACTTGAGAAGGAGTTGAAGCCCATGTGTACAA
TCCTACCTACGCTGCTGCATCATGGCGGGATATTTTCAGTTCGACCCGACTTGCCAGTGGTGCCGTGGAGGGTACGTCGGGTTCGCGTACCACAGCAGAGTAGCGTGACT
GATTGCGGGATTTTCTATGTCCGGTATTTCGAGTACGATGCCACTGGGTCAAATATGGACACTTTAACCCAAGATAATATTGTATATTTTAGGCGTCACATCATACATGA
ATGA
Protein sequenceShow/hide protein sequence
MHRLFVIYDDKWNDVGKMYEGGVTGRLDVDVTITYVNLVSALHMLTRIDPDQFDLVILCVYKFDFQYEVPNYLIFDDTSLRFYLNNPPDSSQNNDESDYDEYHTEYGVED
EDSKAEIENDGGNGYDNGDEDVYEDHYENESQPIHESHTVSGNAPCQTVEGVVSRPMQNIITGNEVDYVGKLAVKGIFHSNEELRFKLSVLTMKLNFEFKSIRGGDSFII
SMFTDVHKCKREILNHDHRQARSWVVGQLLKFNLDDVSRQYKPKDIINDMRKNYGVNITFGIVDRESDQSWTWFLNMVFPQAAHGICMQHLSTNLKDKFKDDVIQKMFIL
AAKECQKSEFRYYFSQLAGFPKVQRYLKESVLKNGFVHFNQDKIRDGRLSQHNAFCLPSRTLSRIISFRDERQRMKPHFLPFIPEAKWSGIRSGMEGRMQTLRIFFLRHI
MDLRVILNRDDWFPAMLTNLAHVDKTTSRLKGRLTPTQLDMFRQTCFGPILDMDVVFNGPLIHHLLLREVEEPRQDIISFDLFGKRVSFGKREFDLITCLSHRMIRVDND
IPDRRLRARYFKDSVRVKCSELEKIFMEAVFDDDEDAVKVGIVYFVELAIMGKERKQFIDTTFLGVVDRINYRRTNRRREMTPHTKRLIVSTGFRIGLRDDIDVESAHSL
SDDAIPRLLRWSCTYSRGFLTLLRDVFDNTMVKEYLVSTNAEAEHMVRIMRPTEARAIPAPPAVPDPPAVPDPTVVPAPAAVRNPPADLERGAEERRVKDKGKNIIDDPV
EEAETLDDVALQGPALDDAGPRGNDSEALQKRSKRKKFKNKISRRLKRLDDRVGAIEATLTGFGVALKGIQRYLKKMSKGKFPDPTKYFGRGGGPDDDDPSDQRPDEAPT
RGPKSMDEDRRPEAVPKTDEYQTMDDNLKSMDEDPMFMVEDQGTITERDNASDAYPDRPVGLFQDATVGMQEPDVASDTRPVSRRVRRPYKDWAPDAVIKRGITVDRYDP
VCLIPPQLDDKFQRWMDDPKTDGRLRSTATGFQKKEWYRDLLDPSVELKDEVLDGLVLFTAKKLEKCLHLCRKKFAIGDVLLSTLLNRTDGPYAAMKPGVLSTRINYPWR
EENTIWRYVHGRQSDHNVPWSDADIVYTPMNVGGNHWVMLGIDLVQGDITVWDSLQTATPLDELEKELKPMCTILPTLLHHGGIFSVRPDLPVVPWRVRRVRVPQQSSVT
DCGIFYVRYFEYDATGSNMDTLTQDNIVYFRRHIIHE