; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr020178 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr020178
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionATP-dependent Clp protease proteolytic subunit
Genome locationtig00153449:674969..713855
RNA-Seq ExpressionSgr020178
SyntenySgr020178
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0030488 - tRNA methylation (biological process)
GO:0003723 - RNA binding (molecular function)
GO:0004176 - ATP-dependent peptidase activity (molecular function)
GO:0004252 - serine-type endopeptidase activity (molecular function)
GO:0016423 - tRNA (guanine) methyltransferase activity (molecular function)
InterPro domainsIPR001537 - tRNA/rRNA methyltransferase, SpoU type
IPR001907 - ATP-dependent Clp protease proteolytic subunit
IPR023562 - Clp protease proteolytic subunit /Translocation-enhancing protein TepA
IPR029026 - tRNA (guanine-N1-)-methyltransferase, N-terminal
IPR029028 - Alpha/beta knot methyltransferases
IPR029045 - ClpP/crotonase-like domain superfamily
IPR044748 - Trm3/ TARB1, C-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022155532.1 uncharacterized protein LOC111022655 isoform X1 [Momordica charantia]0.0e+0091.45Show/hide
Query:  MWRLVHSSWILHVSCNKRRVAHIAALLSSVLHSSTFSER-IH-------------RKILEEGKKSPRTFRLAALHLTGMWLSHPWTIKYYVKELKLLSLY
        MWRLVHSSWILHVSCNKRRVAHIAALLSSVLHSSTFSE+ +H              K LEEG KSPRTFRLAALHLTGMWLS+PWTIKYYVKELKLLSLY
Subjt:  MWRLVHSSWILHVSCNKRRVAHIAALLSSVLHSSTFSER-IH-------------RKILEEGKKSPRTFRLAALHLTGMWLSHPWTIKYYVKELKLLSLY

Query:  GSVAFDEDFEAELTDYEARTEVSLLAESPDRELTEVFINTELYARVSVAVLFHKLADLADMAGSSNEYGSCSDAVESGKLFLLELLDSVVNNNDLAKELY
        GSVAFDEDFEAELTDY ARTEVSLLAESPD ELTEVFINTELYARVSVAVLFHKLADLADM G SN+YGSCSDAVESGKLFLLELLDSVVNNNDLAKELY
Subjt:  GSVAFDEDFEAELTDYEARTEVSLLAESPDRELTEVFINTELYARVSVAVLFHKLADLADMAGSSNEYGSCSDAVESGKLFLLELLDSVVNNNDLAKELY

Query:  KKHSAIHRRKIRAWQMICVLSRFVSQDIIQQVTNSLHVSLSRNNLPSVRQFLETFAISIYLKFPTLVKEQLVPILQDYNIRPQVLSSYVFIATNVILHVT
        KKHSAIHRRK+RAWQMICVLSRFV QDIIQQVTNSLHV+L RNNLPSVRQ+LETFAISIYLKFPTLVKEQLVPILQDYN+RPQVLSSYVFIATNVILH T
Subjt:  KKHSAIHRRKIRAWQMICVLSRFVSQDIIQQVTNSLHVSLSRNNLPSVRQFLETFAISIYLKFPTLVKEQLVPILQDYNIRPQVLSSYVFIATNVILHVT

Query:  EAVQSSHLDELLPPLVPQLTSHHHSLRGFTQLLVYHVLCKLFPVVEFRATGNMPLEKRCFEDLKSYLEKNPDCVRLRASMEGYLHAYNPALSVTPSGIFS
        EAVQSSHLDELLPPLVPQLTSHHHSLRGFTQLLVYHVLCK FPVVEFRATG+M LEKRCFEDLKSYLEKNPDCVRLRASMEGYLHAYNPALSVTPSGIFS
Subjt:  EAVQSSHLDELLPPLVPQLTSHHHSLRGFTQLLVYHVLCKLFPVVEFRATGNMPLEKRCFEDLKSYLEKNPDCVRLRASMEGYLHAYNPALSVTPSGIFS

Query:  SRVKDLEFECVPTSLMEQVLNFLNDVREDLRCSMANDLTAIQNESFKTNEDHNLMDMSSDLKKESFTSKLPLATSLDFQKKVTLSKHEKKDTDTSSYLGS
        SRVKDLEFECV  SL+EQVLNFLNDVREDLR SMANDLTAI+NESFKT EDHNLMD+  D+ KES TSKLPLATSLDFQKKVTLSKHEKKDTDTSSYLG 
Subjt:  SRVKDLEFECVPTSLMEQVLNFLNDVREDLRCSMANDLTAIQNESFKTNEDHNLMDMSSDLKKESFTSKLPLATSLDFQKKVTLSKHEKKDTDTSSYLGS

Query:  KEAYKYLNEMEEEDQLLHQLLHSRSLSMEDLRMNRQHFILVASLLDRIPNLAGLARTCEVFKAAGLAIADLNILYDKQFQLISVTAEKWVPIVEVPVNSM
        K+AYKYL+EMEEEDQLLHQLLHSRSLSMEDLR+NRQHFILVASL+DRIPNLAGLARTCEVFKAAGLAIADL+IL DKQFQLISVTAEKWVPIVEVPVNSM
Subjt:  KEAYKYLNEMEEEDQLLHQLLHSRSLSMEDLRMNRQHFILVASLLDRIPNLAGLARTCEVFKAAGLAIADLNILYDKQFQLISVTAEKWVPIVEVPVNSM

Query:  KLFLDKKKREGFSILGLEQTANSVPLDQYTFPKKTVLVLGREKEGIPVDIIHILDACVEIPQLGVVRSLNVHVSGAIALWEFTRQQRHQI
        KLFL+KKK+EGFSILGLEQTANSVPLDQYTFPKKTVLVLGREKEGIPVDIIHILDACVEIPQLGVVRSLNVHVSGAI LWE+TRQQRHQ+
Subjt:  KLFLDKKKREGFSILGLEQTANSVPLDQYTFPKKTVLVLGREKEGIPVDIIHILDACVEIPQLGVVRSLNVHVSGAIALWEFTRQQRHQI

XP_022155534.1 uncharacterized protein LOC111022655 isoform X3 [Momordica charantia]0.0e+0091.45Show/hide
Query:  MWRLVHSSWILHVSCNKRRVAHIAALLSSVLHSSTFSER-IH-------------RKILEEGKKSPRTFRLAALHLTGMWLSHPWTIKYYVKELKLLSLY
        MWRLVHSSWILHVSCNKRRVAHIAALLSSVLHSSTFSE+ +H              K LEEG KSPRTFRLAALHLTGMWLS+PWTIKYYVKELKLLSLY
Subjt:  MWRLVHSSWILHVSCNKRRVAHIAALLSSVLHSSTFSER-IH-------------RKILEEGKKSPRTFRLAALHLTGMWLSHPWTIKYYVKELKLLSLY

Query:  GSVAFDEDFEAELTDYEARTEVSLLAESPDRELTEVFINTELYARVSVAVLFHKLADLADMAGSSNEYGSCSDAVESGKLFLLELLDSVVNNNDLAKELY
        GSVAFDEDFEAELTDY ARTEVSLLAESPD ELTEVFINTELYARVSVAVLFHKLADLADM G SN+YGSCSDAVESGKLFLLELLDSVVNNNDLAKELY
Subjt:  GSVAFDEDFEAELTDYEARTEVSLLAESPDRELTEVFINTELYARVSVAVLFHKLADLADMAGSSNEYGSCSDAVESGKLFLLELLDSVVNNNDLAKELY

Query:  KKHSAIHRRKIRAWQMICVLSRFVSQDIIQQVTNSLHVSLSRNNLPSVRQFLETFAISIYLKFPTLVKEQLVPILQDYNIRPQVLSSYVFIATNVILHVT
        KKHSAIHRRK+RAWQMICVLSRFV QDIIQQVTNSLHV+L RNNLPSVRQ+LETFAISIYLKFPTLVKEQLVPILQDYN+RPQVLSSYVFIATNVILH T
Subjt:  KKHSAIHRRKIRAWQMICVLSRFVSQDIIQQVTNSLHVSLSRNNLPSVRQFLETFAISIYLKFPTLVKEQLVPILQDYNIRPQVLSSYVFIATNVILHVT

Query:  EAVQSSHLDELLPPLVPQLTSHHHSLRGFTQLLVYHVLCKLFPVVEFRATGNMPLEKRCFEDLKSYLEKNPDCVRLRASMEGYLHAYNPALSVTPSGIFS
        EAVQSSHLDELLPPLVPQLTSHHHSLRGFTQLLVYHVLCK FPVVEFRATG+M LEKRCFEDLKSYLEKNPDCVRLRASMEGYLHAYNPALSVTPSGIFS
Subjt:  EAVQSSHLDELLPPLVPQLTSHHHSLRGFTQLLVYHVLCKLFPVVEFRATGNMPLEKRCFEDLKSYLEKNPDCVRLRASMEGYLHAYNPALSVTPSGIFS

Query:  SRVKDLEFECVPTSLMEQVLNFLNDVREDLRCSMANDLTAIQNESFKTNEDHNLMDMSSDLKKESFTSKLPLATSLDFQKKVTLSKHEKKDTDTSSYLGS
        SRVKDLEFECV  SL+EQVLNFLNDVREDLR SMANDLTAI+NESFKT EDHNLMD+  D+ KES TSKLPLATSLDFQKKVTLSKHEKKDTDTSSYLG 
Subjt:  SRVKDLEFECVPTSLMEQVLNFLNDVREDLRCSMANDLTAIQNESFKTNEDHNLMDMSSDLKKESFTSKLPLATSLDFQKKVTLSKHEKKDTDTSSYLGS

Query:  KEAYKYLNEMEEEDQLLHQLLHSRSLSMEDLRMNRQHFILVASLLDRIPNLAGLARTCEVFKAAGLAIADLNILYDKQFQLISVTAEKWVPIVEVPVNSM
        K+AYKYL+EMEEEDQLLHQLLHSRSLSMEDLR+NRQHFILVASL+DRIPNLAGLARTCEVFKAAGLAIADL+IL DKQFQLISVTAEKWVPIVEVPVNSM
Subjt:  KEAYKYLNEMEEEDQLLHQLLHSRSLSMEDLRMNRQHFILVASLLDRIPNLAGLARTCEVFKAAGLAIADLNILYDKQFQLISVTAEKWVPIVEVPVNSM

Query:  KLFLDKKKREGFSILGLEQTANSVPLDQYTFPKKTVLVLGREKEGIPVDIIHILDACVEIPQLGVVRSLNVHVSGAIALWEFTRQQRHQI
        KLFL+KKK+EGFSILGLEQTANSVPLDQYTFPKKTVLVLGREKEGIPVDIIHILDACVEIPQLGVVRSLNVHVSGAI LWE+TRQQRHQ+
Subjt:  KLFLDKKKREGFSILGLEQTANSVPLDQYTFPKKTVLVLGREKEGIPVDIIHILDACVEIPQLGVVRSLNVHVSGAIALWEFTRQQRHQI

XP_022155538.1 uncharacterized protein LOC111022655 isoform X7 [Momordica charantia]0.0e+0091.45Show/hide
Query:  MWRLVHSSWILHVSCNKRRVAHIAALLSSVLHSSTFSER-IH-------------RKILEEGKKSPRTFRLAALHLTGMWLSHPWTIKYYVKELKLLSLY
        MWRLVHSSWILHVSCNKRRVAHIAALLSSVLHSSTFSE+ +H              K LEEG KSPRTFRLAALHLTGMWLS+PWTIKYYVKELKLLSLY
Subjt:  MWRLVHSSWILHVSCNKRRVAHIAALLSSVLHSSTFSER-IH-------------RKILEEGKKSPRTFRLAALHLTGMWLSHPWTIKYYVKELKLLSLY

Query:  GSVAFDEDFEAELTDYEARTEVSLLAESPDRELTEVFINTELYARVSVAVLFHKLADLADMAGSSNEYGSCSDAVESGKLFLLELLDSVVNNNDLAKELY
        GSVAFDEDFEAELTDY ARTEVSLLAESPD ELTEVFINTELYARVSVAVLFHKLADLADM G SN+YGSCSDAVESGKLFLLELLDSVVNNNDLAKELY
Subjt:  GSVAFDEDFEAELTDYEARTEVSLLAESPDRELTEVFINTELYARVSVAVLFHKLADLADMAGSSNEYGSCSDAVESGKLFLLELLDSVVNNNDLAKELY

Query:  KKHSAIHRRKIRAWQMICVLSRFVSQDIIQQVTNSLHVSLSRNNLPSVRQFLETFAISIYLKFPTLVKEQLVPILQDYNIRPQVLSSYVFIATNVILHVT
        KKHSAIHRRK+RAWQMICVLSRFV QDIIQQVTNSLHV+L RNNLPSVRQ+LETFAISIYLKFPTLVKEQLVPILQDYN+RPQVLSSYVFIATNVILH T
Subjt:  KKHSAIHRRKIRAWQMICVLSRFVSQDIIQQVTNSLHVSLSRNNLPSVRQFLETFAISIYLKFPTLVKEQLVPILQDYNIRPQVLSSYVFIATNVILHVT

Query:  EAVQSSHLDELLPPLVPQLTSHHHSLRGFTQLLVYHVLCKLFPVVEFRATGNMPLEKRCFEDLKSYLEKNPDCVRLRASMEGYLHAYNPALSVTPSGIFS
        EAVQSSHLDELLPPLVPQLTSHHHSLRGFTQLLVYHVLCK FPVVEFRATG+M LEKRCFEDLKSYLEKNPDCVRLRASMEGYLHAYNPALSVTPSGIFS
Subjt:  EAVQSSHLDELLPPLVPQLTSHHHSLRGFTQLLVYHVLCKLFPVVEFRATGNMPLEKRCFEDLKSYLEKNPDCVRLRASMEGYLHAYNPALSVTPSGIFS

Query:  SRVKDLEFECVPTSLMEQVLNFLNDVREDLRCSMANDLTAIQNESFKTNEDHNLMDMSSDLKKESFTSKLPLATSLDFQKKVTLSKHEKKDTDTSSYLGS
        SRVKDLEFECV  SL+EQVLNFLNDVREDLR SMANDLTAI+NESFKT EDHNLMD+  D+ KES TSKLPLATSLDFQKKVTLSKHEKKDTDTSSYLG 
Subjt:  SRVKDLEFECVPTSLMEQVLNFLNDVREDLRCSMANDLTAIQNESFKTNEDHNLMDMSSDLKKESFTSKLPLATSLDFQKKVTLSKHEKKDTDTSSYLGS

Query:  KEAYKYLNEMEEEDQLLHQLLHSRSLSMEDLRMNRQHFILVASLLDRIPNLAGLARTCEVFKAAGLAIADLNILYDKQFQLISVTAEKWVPIVEVPVNSM
        K+AYKYL+EMEEEDQLLHQLLHSRSLSMEDLR+NRQHFILVASL+DRIPNLAGLARTCEVFKAAGLAIADL+IL DKQFQLISVTAEKWVPIVEVPVNSM
Subjt:  KEAYKYLNEMEEEDQLLHQLLHSRSLSMEDLRMNRQHFILVASLLDRIPNLAGLARTCEVFKAAGLAIADLNILYDKQFQLISVTAEKWVPIVEVPVNSM

Query:  KLFLDKKKREGFSILGLEQTANSVPLDQYTFPKKTVLVLGREKEGIPVDIIHILDACVEIPQLGVVRSLNVHVSGAIALWEFTRQQRHQI
        KLFL+KKK+EGFSILGLEQTANSVPLDQYTFPKKTVLVLGREKEGIPVDIIHILDACVEIPQLGVVRSLNVHVSGAI LWE+TRQQRHQ+
Subjt:  KLFLDKKKREGFSILGLEQTANSVPLDQYTFPKKTVLVLGREKEGIPVDIIHILDACVEIPQLGVVRSLNVHVSGAIALWEFTRQQRHQI

XP_022155539.1 uncharacterized protein LOC111022655 isoform X8 [Momordica charantia]0.0e+0091.45Show/hide
Query:  MWRLVHSSWILHVSCNKRRVAHIAALLSSVLHSSTFSER-IH-------------RKILEEGKKSPRTFRLAALHLTGMWLSHPWTIKYYVKELKLLSLY
        MWRLVHSSWILHVSCNKRRVAHIAALLSSVLHSSTFSE+ +H              K LEEG KSPRTFRLAALHLTGMWLS+PWTIKYYVKELKLLSLY
Subjt:  MWRLVHSSWILHVSCNKRRVAHIAALLSSVLHSSTFSER-IH-------------RKILEEGKKSPRTFRLAALHLTGMWLSHPWTIKYYVKELKLLSLY

Query:  GSVAFDEDFEAELTDYEARTEVSLLAESPDRELTEVFINTELYARVSVAVLFHKLADLADMAGSSNEYGSCSDAVESGKLFLLELLDSVVNNNDLAKELY
        GSVAFDEDFEAELTDY ARTEVSLLAESPD ELTEVFINTELYARVSVAVLFHKLADLADM G SN+YGSCSDAVESGKLFLLELLDSVVNNNDLAKELY
Subjt:  GSVAFDEDFEAELTDYEARTEVSLLAESPDRELTEVFINTELYARVSVAVLFHKLADLADMAGSSNEYGSCSDAVESGKLFLLELLDSVVNNNDLAKELY

Query:  KKHSAIHRRKIRAWQMICVLSRFVSQDIIQQVTNSLHVSLSRNNLPSVRQFLETFAISIYLKFPTLVKEQLVPILQDYNIRPQVLSSYVFIATNVILHVT
        KKHSAIHRRK+RAWQMICVLSRFV QDIIQQVTNSLHV+L RNNLPSVRQ+LETFAISIYLKFPTLVKEQLVPILQDYN+RPQVLSSYVFIATNVILH T
Subjt:  KKHSAIHRRKIRAWQMICVLSRFVSQDIIQQVTNSLHVSLSRNNLPSVRQFLETFAISIYLKFPTLVKEQLVPILQDYNIRPQVLSSYVFIATNVILHVT

Query:  EAVQSSHLDELLPPLVPQLTSHHHSLRGFTQLLVYHVLCKLFPVVEFRATGNMPLEKRCFEDLKSYLEKNPDCVRLRASMEGYLHAYNPALSVTPSGIFS
        EAVQSSHLDELLPPLVPQLTSHHHSLRGFTQLLVYHVLCK FPVVEFRATG+M LEKRCFEDLKSYLEKNPDCVRLRASMEGYLHAYNPALSVTPSGIFS
Subjt:  EAVQSSHLDELLPPLVPQLTSHHHSLRGFTQLLVYHVLCKLFPVVEFRATGNMPLEKRCFEDLKSYLEKNPDCVRLRASMEGYLHAYNPALSVTPSGIFS

Query:  SRVKDLEFECVPTSLMEQVLNFLNDVREDLRCSMANDLTAIQNESFKTNEDHNLMDMSSDLKKESFTSKLPLATSLDFQKKVTLSKHEKKDTDTSSYLGS
        SRVKDLEFECV  SL+EQVLNFLNDVREDLR SMANDLTAI+NESFKT EDHNLMD+  D+ KES TSKLPLATSLDFQKKVTLSKHEKKDTDTSSYLG 
Subjt:  SRVKDLEFECVPTSLMEQVLNFLNDVREDLRCSMANDLTAIQNESFKTNEDHNLMDMSSDLKKESFTSKLPLATSLDFQKKVTLSKHEKKDTDTSSYLGS

Query:  KEAYKYLNEMEEEDQLLHQLLHSRSLSMEDLRMNRQHFILVASLLDRIPNLAGLARTCEVFKAAGLAIADLNILYDKQFQLISVTAEKWVPIVEVPVNSM
        K+AYKYL+EMEEEDQLLHQLLHSRSLSMEDLR+NRQHFILVASL+DRIPNLAGLARTCEVFKAAGLAIADL+IL DKQFQLISVTAEKWVPIVEVPVNSM
Subjt:  KEAYKYLNEMEEEDQLLHQLLHSRSLSMEDLRMNRQHFILVASLLDRIPNLAGLARTCEVFKAAGLAIADLNILYDKQFQLISVTAEKWVPIVEVPVNSM

Query:  KLFLDKKKREGFSILGLEQTANSVPLDQYTFPKKTVLVLGREKEGIPVDIIHILDACVEIPQLGVVRSLNVHVSGAIALWEFTRQQRHQI
        KLFL+KKK+EGFSILGLEQTANSVPLDQYTFPKKTVLVLGREKEGIPVDIIHILDACVEIPQLGVVRSLNVHVSGAI LWE+TRQQRHQ+
Subjt:  KLFLDKKKREGFSILGLEQTANSVPLDQYTFPKKTVLVLGREKEGIPVDIIHILDACVEIPQLGVVRSLNVHVSGAIALWEFTRQQRHQI

XP_022155540.1 uncharacterized protein LOC111022655 isoform X9 [Momordica charantia]0.0e+0091.45Show/hide
Query:  MWRLVHSSWILHVSCNKRRVAHIAALLSSVLHSSTFSER-IH-------------RKILEEGKKSPRTFRLAALHLTGMWLSHPWTIKYYVKELKLLSLY
        MWRLVHSSWILHVSCNKRRVAHIAALLSSVLHSSTFSE+ +H              K LEEG KSPRTFRLAALHLTGMWLS+PWTIKYYVKELKLLSLY
Subjt:  MWRLVHSSWILHVSCNKRRVAHIAALLSSVLHSSTFSER-IH-------------RKILEEGKKSPRTFRLAALHLTGMWLSHPWTIKYYVKELKLLSLY

Query:  GSVAFDEDFEAELTDYEARTEVSLLAESPDRELTEVFINTELYARVSVAVLFHKLADLADMAGSSNEYGSCSDAVESGKLFLLELLDSVVNNNDLAKELY
        GSVAFDEDFEAELTDY ARTEVSLLAESPD ELTEVFINTELYARVSVAVLFHKLADLADM G SN+YGSCSDAVESGKLFLLELLDSVVNNNDLAKELY
Subjt:  GSVAFDEDFEAELTDYEARTEVSLLAESPDRELTEVFINTELYARVSVAVLFHKLADLADMAGSSNEYGSCSDAVESGKLFLLELLDSVVNNNDLAKELY

Query:  KKHSAIHRRKIRAWQMICVLSRFVSQDIIQQVTNSLHVSLSRNNLPSVRQFLETFAISIYLKFPTLVKEQLVPILQDYNIRPQVLSSYVFIATNVILHVT
        KKHSAIHRRK+RAWQMICVLSRFV QDIIQQVTNSLHV+L RNNLPSVRQ+LETFAISIYLKFPTLVKEQLVPILQDYN+RPQVLSSYVFIATNVILH T
Subjt:  KKHSAIHRRKIRAWQMICVLSRFVSQDIIQQVTNSLHVSLSRNNLPSVRQFLETFAISIYLKFPTLVKEQLVPILQDYNIRPQVLSSYVFIATNVILHVT

Query:  EAVQSSHLDELLPPLVPQLTSHHHSLRGFTQLLVYHVLCKLFPVVEFRATGNMPLEKRCFEDLKSYLEKNPDCVRLRASMEGYLHAYNPALSVTPSGIFS
        EAVQSSHLDELLPPLVPQLTSHHHSLRGFTQLLVYHVLCK FPVVEFRATG+M LEKRCFEDLKSYLEKNPDCVRLRASMEGYLHAYNPALSVTPSGIFS
Subjt:  EAVQSSHLDELLPPLVPQLTSHHHSLRGFTQLLVYHVLCKLFPVVEFRATGNMPLEKRCFEDLKSYLEKNPDCVRLRASMEGYLHAYNPALSVTPSGIFS

Query:  SRVKDLEFECVPTSLMEQVLNFLNDVREDLRCSMANDLTAIQNESFKTNEDHNLMDMSSDLKKESFTSKLPLATSLDFQKKVTLSKHEKKDTDTSSYLGS
        SRVKDLEFECV  SL+EQVLNFLNDVREDLR SMANDLTAI+NESFKT EDHNLMD+  D+ KES TSKLPLATSLDFQKKVTLSKHEKKDTDTSSYLG 
Subjt:  SRVKDLEFECVPTSLMEQVLNFLNDVREDLRCSMANDLTAIQNESFKTNEDHNLMDMSSDLKKESFTSKLPLATSLDFQKKVTLSKHEKKDTDTSSYLGS

Query:  KEAYKYLNEMEEEDQLLHQLLHSRSLSMEDLRMNRQHFILVASLLDRIPNLAGLARTCEVFKAAGLAIADLNILYDKQFQLISVTAEKWVPIVEVPVNSM
        K+AYKYL+EMEEEDQLLHQLLHSRSLSMEDLR+NRQHFILVASL+DRIPNLAGLARTCEVFKAAGLAIADL+IL DKQFQLISVTAEKWVPIVEVPVNSM
Subjt:  KEAYKYLNEMEEEDQLLHQLLHSRSLSMEDLRMNRQHFILVASLLDRIPNLAGLARTCEVFKAAGLAIADLNILYDKQFQLISVTAEKWVPIVEVPVNSM

Query:  KLFLDKKKREGFSILGLEQTANSVPLDQYTFPKKTVLVLGREKEGIPVDIIHILDACVEIPQLGVVRSLNVHVSGAIALWEFTRQQRHQI
        KLFL+KKK+EGFSILGLEQTANSVPLDQYTFPKKTVLVLGREKEGIPVDIIHILDACVEIPQLGVVRSLNVHVSGAI LWE+TRQQRHQ+
Subjt:  KLFLDKKKREGFSILGLEQTANSVPLDQYTFPKKTVLVLGREKEGIPVDIIHILDACVEIPQLGVVRSLNVHVSGAIALWEFTRQQRHQI

TrEMBL top hitse value%identityAlignment
A0A6J1DMQ1 uncharacterized protein LOC111022655 isoform X90.0e+0091.45Show/hide
Query:  MWRLVHSSWILHVSCNKRRVAHIAALLSSVLHSSTFSER-IH-------------RKILEEGKKSPRTFRLAALHLTGMWLSHPWTIKYYVKELKLLSLY
        MWRLVHSSWILHVSCNKRRVAHIAALLSSVLHSSTFSE+ +H              K LEEG KSPRTFRLAALHLTGMWLS+PWTIKYYVKELKLLSLY
Subjt:  MWRLVHSSWILHVSCNKRRVAHIAALLSSVLHSSTFSER-IH-------------RKILEEGKKSPRTFRLAALHLTGMWLSHPWTIKYYVKELKLLSLY

Query:  GSVAFDEDFEAELTDYEARTEVSLLAESPDRELTEVFINTELYARVSVAVLFHKLADLADMAGSSNEYGSCSDAVESGKLFLLELLDSVVNNNDLAKELY
        GSVAFDEDFEAELTDY ARTEVSLLAESPD ELTEVFINTELYARVSVAVLFHKLADLADM G SN+YGSCSDAVESGKLFLLELLDSVVNNNDLAKELY
Subjt:  GSVAFDEDFEAELTDYEARTEVSLLAESPDRELTEVFINTELYARVSVAVLFHKLADLADMAGSSNEYGSCSDAVESGKLFLLELLDSVVNNNDLAKELY

Query:  KKHSAIHRRKIRAWQMICVLSRFVSQDIIQQVTNSLHVSLSRNNLPSVRQFLETFAISIYLKFPTLVKEQLVPILQDYNIRPQVLSSYVFIATNVILHVT
        KKHSAIHRRK+RAWQMICVLSRFV QDIIQQVTNSLHV+L RNNLPSVRQ+LETFAISIYLKFPTLVKEQLVPILQDYN+RPQVLSSYVFIATNVILH T
Subjt:  KKHSAIHRRKIRAWQMICVLSRFVSQDIIQQVTNSLHVSLSRNNLPSVRQFLETFAISIYLKFPTLVKEQLVPILQDYNIRPQVLSSYVFIATNVILHVT

Query:  EAVQSSHLDELLPPLVPQLTSHHHSLRGFTQLLVYHVLCKLFPVVEFRATGNMPLEKRCFEDLKSYLEKNPDCVRLRASMEGYLHAYNPALSVTPSGIFS
        EAVQSSHLDELLPPLVPQLTSHHHSLRGFTQLLVYHVLCK FPVVEFRATG+M LEKRCFEDLKSYLEKNPDCVRLRASMEGYLHAYNPALSVTPSGIFS
Subjt:  EAVQSSHLDELLPPLVPQLTSHHHSLRGFTQLLVYHVLCKLFPVVEFRATGNMPLEKRCFEDLKSYLEKNPDCVRLRASMEGYLHAYNPALSVTPSGIFS

Query:  SRVKDLEFECVPTSLMEQVLNFLNDVREDLRCSMANDLTAIQNESFKTNEDHNLMDMSSDLKKESFTSKLPLATSLDFQKKVTLSKHEKKDTDTSSYLGS
        SRVKDLEFECV  SL+EQVLNFLNDVREDLR SMANDLTAI+NESFKT EDHNLMD+  D+ KES TSKLPLATSLDFQKKVTLSKHEKKDTDTSSYLG 
Subjt:  SRVKDLEFECVPTSLMEQVLNFLNDVREDLRCSMANDLTAIQNESFKTNEDHNLMDMSSDLKKESFTSKLPLATSLDFQKKVTLSKHEKKDTDTSSYLGS

Query:  KEAYKYLNEMEEEDQLLHQLLHSRSLSMEDLRMNRQHFILVASLLDRIPNLAGLARTCEVFKAAGLAIADLNILYDKQFQLISVTAEKWVPIVEVPVNSM
        K+AYKYL+EMEEEDQLLHQLLHSRSLSMEDLR+NRQHFILVASL+DRIPNLAGLARTCEVFKAAGLAIADL+IL DKQFQLISVTAEKWVPIVEVPVNSM
Subjt:  KEAYKYLNEMEEEDQLLHQLLHSRSLSMEDLRMNRQHFILVASLLDRIPNLAGLARTCEVFKAAGLAIADLNILYDKQFQLISVTAEKWVPIVEVPVNSM

Query:  KLFLDKKKREGFSILGLEQTANSVPLDQYTFPKKTVLVLGREKEGIPVDIIHILDACVEIPQLGVVRSLNVHVSGAIALWEFTRQQRHQI
        KLFL+KKK+EGFSILGLEQTANSVPLDQYTFPKKTVLVLGREKEGIPVDIIHILDACVEIPQLGVVRSLNVHVSGAI LWE+TRQQRHQ+
Subjt:  KLFLDKKKREGFSILGLEQTANSVPLDQYTFPKKTVLVLGREKEGIPVDIIHILDACVEIPQLGVVRSLNVHVSGAIALWEFTRQQRHQI

A0A6J1DN81 uncharacterized protein LOC111022655 isoform X70.0e+0091.45Show/hide
Query:  MWRLVHSSWILHVSCNKRRVAHIAALLSSVLHSSTFSER-IH-------------RKILEEGKKSPRTFRLAALHLTGMWLSHPWTIKYYVKELKLLSLY
        MWRLVHSSWILHVSCNKRRVAHIAALLSSVLHSSTFSE+ +H              K LEEG KSPRTFRLAALHLTGMWLS+PWTIKYYVKELKLLSLY
Subjt:  MWRLVHSSWILHVSCNKRRVAHIAALLSSVLHSSTFSER-IH-------------RKILEEGKKSPRTFRLAALHLTGMWLSHPWTIKYYVKELKLLSLY

Query:  GSVAFDEDFEAELTDYEARTEVSLLAESPDRELTEVFINTELYARVSVAVLFHKLADLADMAGSSNEYGSCSDAVESGKLFLLELLDSVVNNNDLAKELY
        GSVAFDEDFEAELTDY ARTEVSLLAESPD ELTEVFINTELYARVSVAVLFHKLADLADM G SN+YGSCSDAVESGKLFLLELLDSVVNNNDLAKELY
Subjt:  GSVAFDEDFEAELTDYEARTEVSLLAESPDRELTEVFINTELYARVSVAVLFHKLADLADMAGSSNEYGSCSDAVESGKLFLLELLDSVVNNNDLAKELY

Query:  KKHSAIHRRKIRAWQMICVLSRFVSQDIIQQVTNSLHVSLSRNNLPSVRQFLETFAISIYLKFPTLVKEQLVPILQDYNIRPQVLSSYVFIATNVILHVT
        KKHSAIHRRK+RAWQMICVLSRFV QDIIQQVTNSLHV+L RNNLPSVRQ+LETFAISIYLKFPTLVKEQLVPILQDYN+RPQVLSSYVFIATNVILH T
Subjt:  KKHSAIHRRKIRAWQMICVLSRFVSQDIIQQVTNSLHVSLSRNNLPSVRQFLETFAISIYLKFPTLVKEQLVPILQDYNIRPQVLSSYVFIATNVILHVT

Query:  EAVQSSHLDELLPPLVPQLTSHHHSLRGFTQLLVYHVLCKLFPVVEFRATGNMPLEKRCFEDLKSYLEKNPDCVRLRASMEGYLHAYNPALSVTPSGIFS
        EAVQSSHLDELLPPLVPQLTSHHHSLRGFTQLLVYHVLCK FPVVEFRATG+M LEKRCFEDLKSYLEKNPDCVRLRASMEGYLHAYNPALSVTPSGIFS
Subjt:  EAVQSSHLDELLPPLVPQLTSHHHSLRGFTQLLVYHVLCKLFPVVEFRATGNMPLEKRCFEDLKSYLEKNPDCVRLRASMEGYLHAYNPALSVTPSGIFS

Query:  SRVKDLEFECVPTSLMEQVLNFLNDVREDLRCSMANDLTAIQNESFKTNEDHNLMDMSSDLKKESFTSKLPLATSLDFQKKVTLSKHEKKDTDTSSYLGS
        SRVKDLEFECV  SL+EQVLNFLNDVREDLR SMANDLTAI+NESFKT EDHNLMD+  D+ KES TSKLPLATSLDFQKKVTLSKHEKKDTDTSSYLG 
Subjt:  SRVKDLEFECVPTSLMEQVLNFLNDVREDLRCSMANDLTAIQNESFKTNEDHNLMDMSSDLKKESFTSKLPLATSLDFQKKVTLSKHEKKDTDTSSYLGS

Query:  KEAYKYLNEMEEEDQLLHQLLHSRSLSMEDLRMNRQHFILVASLLDRIPNLAGLARTCEVFKAAGLAIADLNILYDKQFQLISVTAEKWVPIVEVPVNSM
        K+AYKYL+EMEEEDQLLHQLLHSRSLSMEDLR+NRQHFILVASL+DRIPNLAGLARTCEVFKAAGLAIADL+IL DKQFQLISVTAEKWVPIVEVPVNSM
Subjt:  KEAYKYLNEMEEEDQLLHQLLHSRSLSMEDLRMNRQHFILVASLLDRIPNLAGLARTCEVFKAAGLAIADLNILYDKQFQLISVTAEKWVPIVEVPVNSM

Query:  KLFLDKKKREGFSILGLEQTANSVPLDQYTFPKKTVLVLGREKEGIPVDIIHILDACVEIPQLGVVRSLNVHVSGAIALWEFTRQQRHQI
        KLFL+KKK+EGFSILGLEQTANSVPLDQYTFPKKTVLVLGREKEGIPVDIIHILDACVEIPQLGVVRSLNVHVSGAI LWE+TRQQRHQ+
Subjt:  KLFLDKKKREGFSILGLEQTANSVPLDQYTFPKKTVLVLGREKEGIPVDIIHILDACVEIPQLGVVRSLNVHVSGAIALWEFTRQQRHQI

A0A6J1DPL1 uncharacterized protein LOC111022655 isoform X10.0e+0091.45Show/hide
Query:  MWRLVHSSWILHVSCNKRRVAHIAALLSSVLHSSTFSER-IH-------------RKILEEGKKSPRTFRLAALHLTGMWLSHPWTIKYYVKELKLLSLY
        MWRLVHSSWILHVSCNKRRVAHIAALLSSVLHSSTFSE+ +H              K LEEG KSPRTFRLAALHLTGMWLS+PWTIKYYVKELKLLSLY
Subjt:  MWRLVHSSWILHVSCNKRRVAHIAALLSSVLHSSTFSER-IH-------------RKILEEGKKSPRTFRLAALHLTGMWLSHPWTIKYYVKELKLLSLY

Query:  GSVAFDEDFEAELTDYEARTEVSLLAESPDRELTEVFINTELYARVSVAVLFHKLADLADMAGSSNEYGSCSDAVESGKLFLLELLDSVVNNNDLAKELY
        GSVAFDEDFEAELTDY ARTEVSLLAESPD ELTEVFINTELYARVSVAVLFHKLADLADM G SN+YGSCSDAVESGKLFLLELLDSVVNNNDLAKELY
Subjt:  GSVAFDEDFEAELTDYEARTEVSLLAESPDRELTEVFINTELYARVSVAVLFHKLADLADMAGSSNEYGSCSDAVESGKLFLLELLDSVVNNNDLAKELY

Query:  KKHSAIHRRKIRAWQMICVLSRFVSQDIIQQVTNSLHVSLSRNNLPSVRQFLETFAISIYLKFPTLVKEQLVPILQDYNIRPQVLSSYVFIATNVILHVT
        KKHSAIHRRK+RAWQMICVLSRFV QDIIQQVTNSLHV+L RNNLPSVRQ+LETFAISIYLKFPTLVKEQLVPILQDYN+RPQVLSSYVFIATNVILH T
Subjt:  KKHSAIHRRKIRAWQMICVLSRFVSQDIIQQVTNSLHVSLSRNNLPSVRQFLETFAISIYLKFPTLVKEQLVPILQDYNIRPQVLSSYVFIATNVILHVT

Query:  EAVQSSHLDELLPPLVPQLTSHHHSLRGFTQLLVYHVLCKLFPVVEFRATGNMPLEKRCFEDLKSYLEKNPDCVRLRASMEGYLHAYNPALSVTPSGIFS
        EAVQSSHLDELLPPLVPQLTSHHHSLRGFTQLLVYHVLCK FPVVEFRATG+M LEKRCFEDLKSYLEKNPDCVRLRASMEGYLHAYNPALSVTPSGIFS
Subjt:  EAVQSSHLDELLPPLVPQLTSHHHSLRGFTQLLVYHVLCKLFPVVEFRATGNMPLEKRCFEDLKSYLEKNPDCVRLRASMEGYLHAYNPALSVTPSGIFS

Query:  SRVKDLEFECVPTSLMEQVLNFLNDVREDLRCSMANDLTAIQNESFKTNEDHNLMDMSSDLKKESFTSKLPLATSLDFQKKVTLSKHEKKDTDTSSYLGS
        SRVKDLEFECV  SL+EQVLNFLNDVREDLR SMANDLTAI+NESFKT EDHNLMD+  D+ KES TSKLPLATSLDFQKKVTLSKHEKKDTDTSSYLG 
Subjt:  SRVKDLEFECVPTSLMEQVLNFLNDVREDLRCSMANDLTAIQNESFKTNEDHNLMDMSSDLKKESFTSKLPLATSLDFQKKVTLSKHEKKDTDTSSYLGS

Query:  KEAYKYLNEMEEEDQLLHQLLHSRSLSMEDLRMNRQHFILVASLLDRIPNLAGLARTCEVFKAAGLAIADLNILYDKQFQLISVTAEKWVPIVEVPVNSM
        K+AYKYL+EMEEEDQLLHQLLHSRSLSMEDLR+NRQHFILVASL+DRIPNLAGLARTCEVFKAAGLAIADL+IL DKQFQLISVTAEKWVPIVEVPVNSM
Subjt:  KEAYKYLNEMEEEDQLLHQLLHSRSLSMEDLRMNRQHFILVASLLDRIPNLAGLARTCEVFKAAGLAIADLNILYDKQFQLISVTAEKWVPIVEVPVNSM

Query:  KLFLDKKKREGFSILGLEQTANSVPLDQYTFPKKTVLVLGREKEGIPVDIIHILDACVEIPQLGVVRSLNVHVSGAIALWEFTRQQRHQI
        KLFL+KKK+EGFSILGLEQTANSVPLDQYTFPKKTVLVLGREKEGIPVDIIHILDACVEIPQLGVVRSLNVHVSGAI LWE+TRQQRHQ+
Subjt:  KLFLDKKKREGFSILGLEQTANSVPLDQYTFPKKTVLVLGREKEGIPVDIIHILDACVEIPQLGVVRSLNVHVSGAIALWEFTRQQRHQI

A0A6J1DQJ7 uncharacterized protein LOC111022655 isoform X30.0e+0091.45Show/hide
Query:  MWRLVHSSWILHVSCNKRRVAHIAALLSSVLHSSTFSER-IH-------------RKILEEGKKSPRTFRLAALHLTGMWLSHPWTIKYYVKELKLLSLY
        MWRLVHSSWILHVSCNKRRVAHIAALLSSVLHSSTFSE+ +H              K LEEG KSPRTFRLAALHLTGMWLS+PWTIKYYVKELKLLSLY
Subjt:  MWRLVHSSWILHVSCNKRRVAHIAALLSSVLHSSTFSER-IH-------------RKILEEGKKSPRTFRLAALHLTGMWLSHPWTIKYYVKELKLLSLY

Query:  GSVAFDEDFEAELTDYEARTEVSLLAESPDRELTEVFINTELYARVSVAVLFHKLADLADMAGSSNEYGSCSDAVESGKLFLLELLDSVVNNNDLAKELY
        GSVAFDEDFEAELTDY ARTEVSLLAESPD ELTEVFINTELYARVSVAVLFHKLADLADM G SN+YGSCSDAVESGKLFLLELLDSVVNNNDLAKELY
Subjt:  GSVAFDEDFEAELTDYEARTEVSLLAESPDRELTEVFINTELYARVSVAVLFHKLADLADMAGSSNEYGSCSDAVESGKLFLLELLDSVVNNNDLAKELY

Query:  KKHSAIHRRKIRAWQMICVLSRFVSQDIIQQVTNSLHVSLSRNNLPSVRQFLETFAISIYLKFPTLVKEQLVPILQDYNIRPQVLSSYVFIATNVILHVT
        KKHSAIHRRK+RAWQMICVLSRFV QDIIQQVTNSLHV+L RNNLPSVRQ+LETFAISIYLKFPTLVKEQLVPILQDYN+RPQVLSSYVFIATNVILH T
Subjt:  KKHSAIHRRKIRAWQMICVLSRFVSQDIIQQVTNSLHVSLSRNNLPSVRQFLETFAISIYLKFPTLVKEQLVPILQDYNIRPQVLSSYVFIATNVILHVT

Query:  EAVQSSHLDELLPPLVPQLTSHHHSLRGFTQLLVYHVLCKLFPVVEFRATGNMPLEKRCFEDLKSYLEKNPDCVRLRASMEGYLHAYNPALSVTPSGIFS
        EAVQSSHLDELLPPLVPQLTSHHHSLRGFTQLLVYHVLCK FPVVEFRATG+M LEKRCFEDLKSYLEKNPDCVRLRASMEGYLHAYNPALSVTPSGIFS
Subjt:  EAVQSSHLDELLPPLVPQLTSHHHSLRGFTQLLVYHVLCKLFPVVEFRATGNMPLEKRCFEDLKSYLEKNPDCVRLRASMEGYLHAYNPALSVTPSGIFS

Query:  SRVKDLEFECVPTSLMEQVLNFLNDVREDLRCSMANDLTAIQNESFKTNEDHNLMDMSSDLKKESFTSKLPLATSLDFQKKVTLSKHEKKDTDTSSYLGS
        SRVKDLEFECV  SL+EQVLNFLNDVREDLR SMANDLTAI+NESFKT EDHNLMD+  D+ KES TSKLPLATSLDFQKKVTLSKHEKKDTDTSSYLG 
Subjt:  SRVKDLEFECVPTSLMEQVLNFLNDVREDLRCSMANDLTAIQNESFKTNEDHNLMDMSSDLKKESFTSKLPLATSLDFQKKVTLSKHEKKDTDTSSYLGS

Query:  KEAYKYLNEMEEEDQLLHQLLHSRSLSMEDLRMNRQHFILVASLLDRIPNLAGLARTCEVFKAAGLAIADLNILYDKQFQLISVTAEKWVPIVEVPVNSM
        K+AYKYL+EMEEEDQLLHQLLHSRSLSMEDLR+NRQHFILVASL+DRIPNLAGLARTCEVFKAAGLAIADL+IL DKQFQLISVTAEKWVPIVEVPVNSM
Subjt:  KEAYKYLNEMEEEDQLLHQLLHSRSLSMEDLRMNRQHFILVASLLDRIPNLAGLARTCEVFKAAGLAIADLNILYDKQFQLISVTAEKWVPIVEVPVNSM

Query:  KLFLDKKKREGFSILGLEQTANSVPLDQYTFPKKTVLVLGREKEGIPVDIIHILDACVEIPQLGVVRSLNVHVSGAIALWEFTRQQRHQI
        KLFL+KKK+EGFSILGLEQTANSVPLDQYTFPKKTVLVLGREKEGIPVDIIHILDACVEIPQLGVVRSLNVHVSGAI LWE+TRQQRHQ+
Subjt:  KLFLDKKKREGFSILGLEQTANSVPLDQYTFPKKTVLVLGREKEGIPVDIIHILDACVEIPQLGVVRSLNVHVSGAIALWEFTRQQRHQI

A0A6J1DQK3 uncharacterized protein LOC111022655 isoform X80.0e+0091.45Show/hide
Query:  MWRLVHSSWILHVSCNKRRVAHIAALLSSVLHSSTFSER-IH-------------RKILEEGKKSPRTFRLAALHLTGMWLSHPWTIKYYVKELKLLSLY
        MWRLVHSSWILHVSCNKRRVAHIAALLSSVLHSSTFSE+ +H              K LEEG KSPRTFRLAALHLTGMWLS+PWTIKYYVKELKLLSLY
Subjt:  MWRLVHSSWILHVSCNKRRVAHIAALLSSVLHSSTFSER-IH-------------RKILEEGKKSPRTFRLAALHLTGMWLSHPWTIKYYVKELKLLSLY

Query:  GSVAFDEDFEAELTDYEARTEVSLLAESPDRELTEVFINTELYARVSVAVLFHKLADLADMAGSSNEYGSCSDAVESGKLFLLELLDSVVNNNDLAKELY
        GSVAFDEDFEAELTDY ARTEVSLLAESPD ELTEVFINTELYARVSVAVLFHKLADLADM G SN+YGSCSDAVESGKLFLLELLDSVVNNNDLAKELY
Subjt:  GSVAFDEDFEAELTDYEARTEVSLLAESPDRELTEVFINTELYARVSVAVLFHKLADLADMAGSSNEYGSCSDAVESGKLFLLELLDSVVNNNDLAKELY

Query:  KKHSAIHRRKIRAWQMICVLSRFVSQDIIQQVTNSLHVSLSRNNLPSVRQFLETFAISIYLKFPTLVKEQLVPILQDYNIRPQVLSSYVFIATNVILHVT
        KKHSAIHRRK+RAWQMICVLSRFV QDIIQQVTNSLHV+L RNNLPSVRQ+LETFAISIYLKFPTLVKEQLVPILQDYN+RPQVLSSYVFIATNVILH T
Subjt:  KKHSAIHRRKIRAWQMICVLSRFVSQDIIQQVTNSLHVSLSRNNLPSVRQFLETFAISIYLKFPTLVKEQLVPILQDYNIRPQVLSSYVFIATNVILHVT

Query:  EAVQSSHLDELLPPLVPQLTSHHHSLRGFTQLLVYHVLCKLFPVVEFRATGNMPLEKRCFEDLKSYLEKNPDCVRLRASMEGYLHAYNPALSVTPSGIFS
        EAVQSSHLDELLPPLVPQLTSHHHSLRGFTQLLVYHVLCK FPVVEFRATG+M LEKRCFEDLKSYLEKNPDCVRLRASMEGYLHAYNPALSVTPSGIFS
Subjt:  EAVQSSHLDELLPPLVPQLTSHHHSLRGFTQLLVYHVLCKLFPVVEFRATGNMPLEKRCFEDLKSYLEKNPDCVRLRASMEGYLHAYNPALSVTPSGIFS

Query:  SRVKDLEFECVPTSLMEQVLNFLNDVREDLRCSMANDLTAIQNESFKTNEDHNLMDMSSDLKKESFTSKLPLATSLDFQKKVTLSKHEKKDTDTSSYLGS
        SRVKDLEFECV  SL+EQVLNFLNDVREDLR SMANDLTAI+NESFKT EDHNLMD+  D+ KES TSKLPLATSLDFQKKVTLSKHEKKDTDTSSYLG 
Subjt:  SRVKDLEFECVPTSLMEQVLNFLNDVREDLRCSMANDLTAIQNESFKTNEDHNLMDMSSDLKKESFTSKLPLATSLDFQKKVTLSKHEKKDTDTSSYLGS

Query:  KEAYKYLNEMEEEDQLLHQLLHSRSLSMEDLRMNRQHFILVASLLDRIPNLAGLARTCEVFKAAGLAIADLNILYDKQFQLISVTAEKWVPIVEVPVNSM
        K+AYKYL+EMEEEDQLLHQLLHSRSLSMEDLR+NRQHFILVASL+DRIPNLAGLARTCEVFKAAGLAIADL+IL DKQFQLISVTAEKWVPIVEVPVNSM
Subjt:  KEAYKYLNEMEEEDQLLHQLLHSRSLSMEDLRMNRQHFILVASLLDRIPNLAGLARTCEVFKAAGLAIADLNILYDKQFQLISVTAEKWVPIVEVPVNSM

Query:  KLFLDKKKREGFSILGLEQTANSVPLDQYTFPKKTVLVLGREKEGIPVDIIHILDACVEIPQLGVVRSLNVHVSGAIALWEFTRQQRHQI
        KLFL+KKK+EGFSILGLEQTANSVPLDQYTFPKKTVLVLGREKEGIPVDIIHILDACVEIPQLGVVRSLNVHVSGAI LWE+TRQQRHQ+
Subjt:  KLFLDKKKREGFSILGLEQTANSVPLDQYTFPKKTVLVLGREKEGIPVDIIHILDACVEIPQLGVVRSLNVHVSGAIALWEFTRQQRHQI

SwissProt top hitse value%identityAlignment
P74466 Putative ATP-dependent Clp protease proteolytic subunit-like1.2e-2536.92Show/hide
Query:  AWEQPPPDLASYLYKNRIVYLGMSLVPS----------VTELILAEFLYLQYEDEAKPIYLYINSTGTQ--------------------RYVKPPIFTLC
        A++ PPPDL S L K RIVYLGM L  S          VT+LI+A+ LYLQ++D  KPIY YINSTGT                      Y+KPP+ T+C
Subjt:  AWEQPPPDLASYLYKNRIVYLGMSLVPS----------VTELILAEFLYLQYEDEAKPIYLYINSTGTQ--------------------RYVKPPIFTLC

Query:  VGNAWGEAALLLAAGAKGNRSALPSSTIMIKQ-----------VKLYAKHI---------------GKSTEEIEADIRRPKYFSPSEAVEYDEVD
        +G A G AA++L++G KG R++LP +TI++ Q           +++ AK +               G++ E++  D+ R  Y +P++A EY  +D
Subjt:  VGNAWGEAALLLAAGAKGNRSALPSSTIMIKQ-----------VKLYAKHI---------------GKSTEEIEADIRRPKYFSPSEAVEYDEVD

Q07527 tRNA (guanosine(18)-2'-O)-methyltransferase6.4e-3040.26Show/hide
Query:  MNRQHFILVASLLDRIPNLAGLARTCEVFKAAGLAIADLNILYDKQFQLISVTAEKWVPIVEVPVNSMKLFLDKKKREGFSILGLEQTANSVPLD-QYTF
        + R   I+V+SL+D+ PNL G+ R C+V     L + D+ +    QF+ ++VTA++W+P+ EV ++ +  F+ +KK+EG++++GLEQT  SV LD  + F
Subjt:  MNRQHFILVASLLDRIPNLAGLARTCEVFKAAGLAIADLNILYDKQFQLISVTAEKWVPIVEVPVNSMKLFLDKKKREGFSILGLEQTANSVPLD-QYTF

Query:  PKKTVLVLGREKEGIPVDIIHILDACVEIPQLGVVRSLNVHVSGAIALWEFTRQ
        PKK++++LG E  GIP  ++  LD C+EI Q GV+RS+N+  + A+ +  +T Q
Subjt:  PKKTVLVLGREKEGIPVDIIHILDACVEIPQLGVVRSLNVHVSGAIALWEFTRQ

Q13395 Probable methyltransferase TARBP13.2e-4530.9Show/hide
Query:  SGKLFLLELLDSVVNNNDL---AKELYKKHSAIHRRKIRAWQMICVLSRFVSQDIIQQVTNSLHVSLSRNNLPSVRQFLETFAISIYLKFPTLVKEQLVP
        S KLF+ +L   +++ ++L   +K+ Y  +S  HR K R WQ + VL   + Q+ +  + + +  +   NN  S++ F+E   I I  KFP     Q +P
Subjt:  SGKLFLLELLDSVVNNNDL---AKELYKKHSAIHRRKIRAWQMICVLSRFVSQDIIQQVTNSLHVSLSRNNLPSVRQFLETFAISIYLKFPTLVKEQLVP

Query:  ILQD-YNIRPQVLSSYVFIATNVILH---VTEAVQSSH--LDELLPPLVPQLTSHHHSLR--GFTQLLVYHVLCKLFPVVEFRATGNMPLEKRCFEDLKS
           D ++   + L + +     V+ H   +T+ +      L + L  ++    +H+ S+R      L     +CK+  V EF A             L  
Subjt:  ILQD-YNIRPQVLSSYVFIATNVILH---VTEAVQSSH--LDELLPPLVPQLTSHHHSLR--GFTQLLVYHVLCKLFPVVEFRATGNMPLEKRCFEDLKS

Query:  YLEKNPDCVRLRASMEGYLHAYNPALSVTPSGIFSSRVKDLEFECVPTSLMEQVLNFLNDVREDLRCSMANDLTAIQNESFKTNEDHNLMDMSSDLKKES
         +E +   V    SM G  +A      +     F++    L+  C+ T  +  +L  L+ + ED   ++ +  T   +          L        K  
Subjt:  YLEKNPDCVRLRASMEGYLHAYNPALSVTPSGIFSSRVKDLEFECVPTSLMEQVLNFLNDVREDLRCSMANDLTAIQNESFKTNEDHNLMDMSSDLKKES

Query:  FTSKLPLATSLDFQKKVTLSKHEKKDTDTSSYLGSKEAYKYLNEMEEEDQLLHQLLHSRSLSMEDLRMNRQHFILVASLLDRIPNLAGLARTCEVFKAAG
          S+  + T+L        + ++ + TD       K+   + + + + D  L  L   R+     L  +    I+VASL+D+  NL GL RTCEVF A+ 
Subjt:  FTSKLPLATSLDFQKKVTLSKHEKKDTDTSSYLGSKEAYKYLNEMEEEDQLLHQLLHSRSLSMEDLRMNRQHFILVASLLDRIPNLAGLARTCEVFKAAG

Query:  LAIADLNILYDKQFQLISVTAEKWVPIVEVPVNSMKLFLDKKKREGFSILGLEQTANSVPLDQYTFPKKTVLVLGREKEGIPVDIIHILDACVEIPQLGV
        L +  L  + DKQFQ +SV+AE+W+P+VEV    +  +L +KK EG++I+G+EQTA S+ L QY FP+K++L+LG E+EGIP ++I  LD CVEIPQ G+
Subjt:  LAIADLNILYDKQFQLISVTAEKWVPIVEVPVNSMKLFLDKKKREGFSILGLEQTANSVPLDQYTFPKKTVLVLGREKEGIPVDIIHILDACVEIPQLGV

Query:  VRSLNVHVSGAIALWEFTRQQ
        +RSLNVHVSGA+ +WE+TRQQ
Subjt:  VRSLNVHVSGAIALWEFTRQQ

Q8LB10 ATP-dependent Clp protease proteolytic subunit-related protein 4, chloroplastic2.7e-8162.89Show/hide
Query:  MEV-ATTASSF----ALTKRISPSITSAHNGKPNRTFSWSSSRLRAMPLSSNFLAPFAGGSVAGDFTGVKLRPSSLNPDSFYGSKGKRGVVTMVIPFSRG
        MEV A TA+SF    A T  I PS T     KP   FS SSS LRA  LS+ FL+P+ GGS++ D  G KLR  SLNP +F  SK KRGVVTMVIPFS+G
Subjt:  MEV-ATTASSF----ALTKRISPSITSAHNGKPNRTFSWSSSRLRAMPLSSNFLAPFAGGSVAGDFTGVKLRPSSLNPDSFYGSKGKRGVVTMVIPFSRG

Query:  SAWEQPPPDLASYLYKNRIVYLGMSLVPSVTELILAEFLYLQYEDEAKPIYLYINSTGTQR-------------------YVKPPIFTLCVGNAWGEAAL
        SA EQPPPDLASYL+KNRIVYLGMSLVPSVTELILAEFLYLQYEDE KPIYLYINSTGT +                   YVKPPIFTLCVGNAWGEAAL
Subjt:  SAWEQPPPDLASYLYKNRIVYLGMSLVPSVTELILAEFLYLQYEDEAKPIYLYINSTGTQR-------------------YVKPPIFTLCVGNAWGEAAL

Query:  LLAAGAKGNRSALPSSTIMIKQ--------------------------VKLYAKHIGKSTEEIEADIRRPKYFSPSEAVEYDEVDGFSRNQ
        LL AGAKGNRSALPSSTIMIKQ                          VKLY+KHIGKS E+IEAD++RPKYFSP+EAVEY  +D    N+
Subjt:  LLAAGAKGNRSALPSSTIMIKQ--------------------------VKLYAKHIGKSTEEIEADIRRPKYFSPSEAVEYDEVDGFSRNQ

Q9L4P4 Putative ATP-dependent Clp protease proteolytic subunit-like1.5e-2634.83Show/hide
Query:  PFSRGSAWEQPPPDLASYLYKNRIVYLGMSLVPS----------VTELILAEFLYLQYEDEAKPIYLYINSTG--------------------TQRYVKP
        P+    ++  PPPDL S L K RI+YLGM L  S          VTELI+A+ LYL++++  KPIY YINSTG                    T RY+KP
Subjt:  PFSRGSAWEQPPPDLASYLYKNRIVYLGMSLVPS----------VTELILAEFLYLQYEDEAKPIYLYINSTG--------------------TQRYVKP

Query:  PIFTLCVGNAWGEAALLLAAGAKGNRSALPSSTIMIKQ--------------------------VKLYAKHIGKSTEEIEADIRRPKYFSPSEAVEYDEV
        P+ T+C+G A G AA++L+ G  GNR++LP +TI++ Q                          ++++A++ G+  + +  D  R  Y +P++AVEY  +
Subjt:  PIFTLCVGNAWGEAALLLAAGAKGNRSALPSSTIMIKQ--------------------------VKLYAKHIGKSTEEIEADIRRPKYFSPSEAVEYDEV

Query:  D
        D
Subjt:  D

Arabidopsis top hitse value%identityAlignment
AT1G09130.1 ATP-dependent caseinolytic (Clp) protease/crotonase family protein1.7e-2538.8Show/hide
Query:  PPPDLASYLYKNRIVYLGMSLVPSVTELILAEFLYLQYEDEAKPIYLYINSTGTQR--------------------YVKPPIFTLCVGNAWGEAALLLAA
        PPPDL S L   RIVY+GM LVP+VTEL++AE +YLQ+ D  +PIY+YINSTGT R                     +K  + T+CVG A G+A LLL+A
Subjt:  PPPDLASYLYKNRIVYLGMSLVPSVTELILAEFLYLQYEDEAKPIYLYINSTGTQR--------------------YVKPPIFTLCVGNAWGEAALLLAA

Query:  GAKGNRSALPSSTIMIKQ----------------------------VKLYAKHIGKSTEEIEADIRRPKYFSPSEAVEYDEVD
        G KG R  +P +  MI+Q                            V+L +KH G S E +   +RRP Y    +A E+  +D
Subjt:  GAKGNRSALPSSTIMIKQ----------------------------VKLYAKHIGKSTEEIEADIRRPKYFSPSEAVEYDEVD

AT1G09130.2 ATP-dependent caseinolytic (Clp) protease/crotonase family protein1.7e-2538.8Show/hide
Query:  PPPDLASYLYKNRIVYLGMSLVPSVTELILAEFLYLQYEDEAKPIYLYINSTGTQR--------------------YVKPPIFTLCVGNAWGEAALLLAA
        PPPDL S L   RIVY+GM LVP+VTEL++AE +YLQ+ D  +PIY+YINSTGT R                     +K  + T+CVG A G+A LLL+A
Subjt:  PPPDLASYLYKNRIVYLGMSLVPSVTELILAEFLYLQYEDEAKPIYLYINSTGTQR--------------------YVKPPIFTLCVGNAWGEAALLLAA

Query:  GAKGNRSALPSSTIMIKQ----------------------------VKLYAKHIGKSTEEIEADIRRPKYFSPSEAVEYDEVD
        G KG R  +P +  MI+Q                            V+L +KH G S E +   +RRP Y    +A E+  +D
Subjt:  GAKGNRSALPSSTIMIKQ----------------------------VKLYAKHIGKSTEEIEADIRRPKYFSPSEAVEYDEVD

AT1G09130.3 ATP-dependent caseinolytic (Clp) protease/crotonase family protein1.7e-2538.8Show/hide
Query:  PPPDLASYLYKNRIVYLGMSLVPSVTELILAEFLYLQYEDEAKPIYLYINSTGTQR--------------------YVKPPIFTLCVGNAWGEAALLLAA
        PPPDL S L   RIVY+GM LVP+VTEL++AE +YLQ+ D  +PIY+YINSTGT R                     +K  + T+CVG A G+A LLL+A
Subjt:  PPPDLASYLYKNRIVYLGMSLVPSVTELILAEFLYLQYEDEAKPIYLYINSTGTQR--------------------YVKPPIFTLCVGNAWGEAALLLAA

Query:  GAKGNRSALPSSTIMIKQ----------------------------VKLYAKHIGKSTEEIEADIRRPKYFSPSEAVEYDEVD
        G KG R  +P +  MI+Q                            V+L +KH G S E +   +RRP Y    +A E+  +D
Subjt:  GAKGNRSALPSSTIMIKQ----------------------------VKLYAKHIGKSTEEIEADIRRPKYFSPSEAVEYDEVD

AT4G17040.1 CLP protease R subunit 41.9e-8262.89Show/hide
Query:  MEV-ATTASSF----ALTKRISPSITSAHNGKPNRTFSWSSSRLRAMPLSSNFLAPFAGGSVAGDFTGVKLRPSSLNPDSFYGSKGKRGVVTMVIPFSRG
        MEV A TA+SF    A T  I PS T     KP   FS SSS LRA  LS+ FL+P+ GGS++ D  G KLR  SLNP +F  SK KRGVVTMVIPFS+G
Subjt:  MEV-ATTASSF----ALTKRISPSITSAHNGKPNRTFSWSSSRLRAMPLSSNFLAPFAGGSVAGDFTGVKLRPSSLNPDSFYGSKGKRGVVTMVIPFSRG

Query:  SAWEQPPPDLASYLYKNRIVYLGMSLVPSVTELILAEFLYLQYEDEAKPIYLYINSTGTQR-------------------YVKPPIFTLCVGNAWGEAAL
        SA EQPPPDLASYL+KNRIVYLGMSLVPSVTELILAEFLYLQYEDE KPIYLYINSTGT +                   YVKPPIFTLCVGNAWGEAAL
Subjt:  SAWEQPPPDLASYLYKNRIVYLGMSLVPSVTELILAEFLYLQYEDEAKPIYLYINSTGTQR-------------------YVKPPIFTLCVGNAWGEAAL

Query:  LLAAGAKGNRSALPSSTIMIKQ--------------------------VKLYAKHIGKSTEEIEADIRRPKYFSPSEAVEYDEVDGFSRNQ
        LL AGAKGNRSALPSSTIMIKQ                          VKLY+KHIGKS E+IEAD++RPKYFSP+EAVEY  +D    N+
Subjt:  LLAAGAKGNRSALPSSTIMIKQ--------------------------VKLYAKHIGKSTEEIEADIRRPKYFSPSEAVEYDEVDGFSRNQ

AT4G17610.1 tRNA/rRNA methyltransferase (SpoU) family protein2.7e-25767.34Show/hide
Query:  MWRLVHSSWILHVSCNKRRVAHIAALLSSVLHSSTFSER-IH-------------RKILEEGKKSPRTFRLAALHLTGMWLSHPWTIKYYVKELKLLSLY
        MW+LV S WILH+SC KRRVA IAALLSSVLHSS FS + +H              K+LEEG+KSPRT RLAALHL+G+WL +P TIKYY+KEL+LL+LY
Subjt:  MWRLVHSSWILHVSCNKRRVAHIAALLSSVLHSSTFSER-IH-------------RKILEEGKKSPRTFRLAALHLTGMWLSHPWTIKYYVKELKLLSLY

Query:  GSVAFDEDFEAELTD-YEARTEVSLLAESPDRELTEVFINTELYARVSVAVLFHKLADLADMAGSSNEYGSCSDAVESGKLFLLELLDSVVNNNDLAKEL
        GSVAFDEDFEAEL+D  +ARTEVSLLA+SPD ELTE+FINTELYARVSVA LF KLA+LA M   +++   C DA+ +GKLFLLELLD+ V++ DLAKEL
Subjt:  GSVAFDEDFEAELTD-YEARTEVSLLAESPDRELTEVFINTELYARVSVAVLFHKLADLADMAGSSNEYGSCSDAVESGKLFLLELLDSVVNNNDLAKEL

Query:  YKKHSAIHRRKIRAWQMICVLSRFVSQDIIQQVTNSLHVSL--------SRNNLPSVRQFLETFAISIYLKFPTLVKEQLVPILQDYNIRPQV--LSSYV
        YKK+SAIHRRKIRAWQMIC++SRFV  DI+ QV +S+H+ L         RNNLP+VRQ+LETFAI+IYLKFP LVKEQLVPIL++Y+ + QV    + +
Subjt:  YKKHSAIHRRKIRAWQMICVLSRFVSQDIIQQVTNSLHVSL--------SRNNLPSVRQFLETFAISIYLKFPTLVKEQLVPILQDYNIRPQV--LSSYV

Query:  FIATNVILHVTEAVQSSHLDELLPPLVPQLTSHHHSLRGFTQLLVYHVLCKLFPVVEFRATGNMPLEKRCFEDLKSYLEKNPDCVRLRASMEGYLHAYNP
         +  NVILH  +  Q +HL ELLPP++P LTSHHHSLRGF QLLV+ VL +LFP VE  ++  + LEK  FE+LKSYL+KNPDC RLRASMEG+L AY+P
Subjt:  FIATNVILHVTEAVQSSHLDELLPPLVPQLTSHHHSLRGFTQLLVYHVLCKLFPVVEFRATGNMPLEKRCFEDLKSYLEKNPDCVRLRASMEGYLHAYNP

Query:  ALSVTPSGIFSSRVKDLEFECVPTSLMEQVLNFLNDVREDLRCSMANDLTAIQNESFKTNEDHNLMDMSSDLKKESFTSKLPLATSLDFQKKVTLSKHEK
        + S TP+G+F +RV++ EFECVPT LM+ VL+FLNDVREDLR SMA D+  I+NE FK +E+     + S   +E    +L   +SLDFQ+K+TLSKHEK
Subjt:  ALSVTPSGIFSSRVKDLEFECVPTSLMEQVLNFLNDVREDLRCSMANDLTAIQNESFKTNEDHNLMDMSSDLKKESFTSKLPLATSLDFQKKVTLSKHEK

Query:  KDTDTSSYLGSKEAYKYLNEMEEEDQLLHQLLHSRSLSMEDLRMNRQHFILVASLLDRIPNLAGLARTCEVFKAAGLAIADLNILYDKQFQLISVTAEKW
        +D  ++S L + E YK L EME+ED+L+ QLL SRS+ +E L+  RQ  ILVASL+DRIPNLAGLARTCEVFKA+ LA+AD +I++DKQFQLISVTAEKW
Subjt:  KDTDTSSYLGSKEAYKYLNEMEEEDQLLHQLLHSRSLSMEDLRMNRQHFILVASLLDRIPNLAGLARTCEVFKAAGLAIADLNILYDKQFQLISVTAEKW

Query:  VPIVEVPVNSMKLFLDKKKREGFSILGLEQTANSVPLDQYTFPKKTVLVLGREKEGIPVDIIHILDACVEIPQLGVVRSLNVHVSGAIALWEFTRQQR
        VPI+EVPVNS+KLFL+KKKREGFSILGLEQTANSV LD+Y FPKKTVLVLGREKEGIPVDIIHILDAC+EIPQLGVVRSLNVHVSGAIALWE+TRQQR
Subjt:  VPIVEVPVNSMKLFLDKKKREGFSILGLEQTANSVPLDQYTFPKKTVLVLGREKEGIPVDIIHILDACVEIPQLGVVRSLNVHVSGAIALWEFTRQQR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGGCGTTTGGTGCATTCCTCTTGGATACTGCACGTCAGCTGCAACAAGCGAAGGGTTGCACATATTGCTGCGCTTCTGTCTTCTGTTCTTCATTCTTCCACATTTTC
TGAAAGGATTCATAGAAAAATTCTTGAAGAAGGCAAAAAAAGTCCTCGAACGTTTCGTCTTGCTGCATTACATTTGACCGGCATGTGGCTCAGTCATCCATGGACCATAA
AGTATTATGTTAAAGAGCTGAAACTGCTATCACTATATGGTTCTGTTGCATTTGATGAAGATTTTGAAGCTGAATTAACTGATTATGAAGCAAGGACTGAGGTATCATTA
TTGGCAGAAAGTCCAGACCGTGAGCTCACTGAAGTGTTTATCAATACAGAATTGTATGCACGTGTATCTGTTGCTGTTCTGTTTCATAAGCTAGCTGACTTGGCTGATAT
GGCGGGATCGTCGAATGAATATGGGAGTTGCTCTGATGCTGTGGAATCTGGAAAATTGTTTCTGCTTGAGCTCCTTGATTCTGTGGTAAACAACAATGACCTTGCAAAGG
AATTGTATAAAAAGCATAGTGCGATCCATAGACGCAAAATACGTGCTTGGCAAATGATATGTGTTCTATCCCGGTTTGTCTCCCAAGATATAATTCAGCAAGTTACTAAT
AGCTTGCACGTTTCCCTCTCTAGAAACAACCTACCTTCTGTTCGTCAATTCTTGGAAACATTCGCAATCAGCATTTATTTGAAGTTTCCAACATTGGTTAAGGAGCAATT
GGTTCCTATATTACAAGATTACAATATAAGACCGCAGGTACTCTCTTCTTATGTATTTATAGCTACAAATGTCATCCTCCATGTAACTGAAGCCGTTCAATCCAGCCATT
TGGATGAGTTGCTTCCTCCTCTTGTTCCACAATTGACCTCTCATCACCACAGTTTACGAGGTTTTACTCAGTTACTGGTGTACCATGTTCTTTGTAAACTTTTTCCAGTA
GTGGAATTCAGAGCCACTGGGAATATGCCTTTAGAGAAAAGATGTTTTGAAGATTTGAAATCATACCTTGAGAAAAATCCTGATTGCGTCCGTTTACGGGCATCCATGGA
AGGATATTTGCATGCCTATAATCCTGCACTATCTGTCACACCATCTGGAATTTTCTCCAGCAGAGTTAAGGACCTTGAGTTTGAGTGTGTCCCAACATCTCTCATGGAGC
AAGTTCTTAACTTCCTAAACGATGTCAGAGAAGACCTTCGGTGTTCAATGGCAAATGATTTGACAGCTATACAAAACGAGAGCTTCAAAACTAATGAAGATCACAACCTT
ATGGACATGTCATCTGACTTAAAGAAAGAAAGCTTTACTTCCAAACTGCCTCTAGCAACTTCGTTGGATTTTCAGAAGAAGGTCACTCTCTCAAAACATGAGAAGAAAGA
CACTGATACTAGTTCCTACTTGGGCAGCAAAGAAGCTTACAAGTATCTTAACGAAATGGAGGAAGAGGATCAGCTTCTTCACCAGCTGCTGCATTCCAGAAGTTTGTCAA
TGGAAGATTTAAGAATGAATCGGCAACATTTTATCCTTGTAGCATCTCTGCTTGATCGCATACCAAATCTTGCTGGTTTGGCTAGGACTTGTGAGGTTTTTAAGGCTGCA
GGATTGGCCATTGCTGACTTGAATATTCTATATGACAAACAATTCCAACTCATCAGTGTTACAGCGGAGAAGTGGGTTCCTATTGTAGAAGTCCCAGTGAACAGTATGAA
ACTTTTCCTAGACAAAAAGAAACGGGAAGGCTTCTCTATTTTGGGATTGGAGCAAACAGCTAACAGTGTACCTCTTGACCAGTATACATTTCCCAAAAAGACAGTGTTGG
TCCTTGGGCGTGAAAAAGAAGGTATACCTGTTGACATAATCCATATTCTTGACGCATGTGTTGAGATTCCTCAATTGGGAGTTGTCAGATCTCTTAATGTTCACGTTAGT
GGTGCCATTGCACTCTGGGAGTTTACTCGACAACAAAGGCACCAAATTGAGTTCTTGTCCATATACAAACGATGGTGGTACGCTGCCAAACAATACAAGCGAGAAGAACC
AAGGAAGATGAAGCTTACCGGCATCAATAACAGCATCCCAGTTGGTACTTCCGTCTTTCCGAAACTGATTCAAATCCCAAGTCCCATTAACCCATTTAGGGTCTTCAAAT
TTGCTCACCACTCCACTTCTCCGGCAGCCGCCACAGACCCATTAGATCCGGCGAGGCTTTCCTGCTCTTCCGATACCGCCTTGGCCGGCTCTGGATCTTTCTGTTCGACG
GCGGTTTCTACAGCCGTGGCAGCTGAAATTCCGGCGCCATTGTCGGCAGCGGCCCTGATGGTGAAGAGAGAGGAAGCGGGATTCTGGAGAAGAGACTCAGAAGAACACAG
CGCTCCTCATCTGAGCTATCCTCAGCCTTGGAATTGTGATGGATGGGGAAGATTTATCGCTCTGCTTCCAGGTTTTCCAGTTATGGAGGTCGCGACCACTGCCTCCAGCT
TCGCGCTTACCAAGCGAATATCCCCATCAATTACTTCTGCCCATAATGGAAAGCCGAACCGGACCTTCTCCTGGTCGTCCTCCCGTCTCAGAGCGATGCCGCTTTCCTCT
AATTTTCTCGCCCCATTCGCTGGGGGCAGCGTAGCCGGGGACTTCACCGGCGTGAAGCTCCGACCGTCGTCTCTTAATCCAGATTCTTTCTATGGATCCAAGGGCAAGCG
AGGCGTTGTCACCATGGTTATTCCATTTTCAAGGGGAAGTGCATGGGAGCAACCTCCCCCGGATTTGGCCTCTTATTTATACAAGAACCGAATTGTTTACTTGGGCATGT
CTCTTGTTCCTTCTGTCACAGAGTTGATACTCGCGGAATTTCTTTACCTTCAGTATGAGGATGAAGCAAAGCCTATATATCTTTACATTAACTCCACAGGGACACAAAGG
TATGTTAAGCCTCCTATATTCACATTATGCGTTGGCAATGCTTGGGGAGAAGCAGCACTTCTTTTGGCAGCTGGTGCTAAGGGAAACCGTTCTGCATTGCCCTCATCAAC
AATAATGATCAAGCAGGTTAAACTTTATGCAAAGCATATTGGAAAGTCGACCGAGGAGATTGAAGCTGATATCAGGAGGCCAAAGTATTTTAGTCCCAGTGAAGCAGTTG
AATATGATGAGGTAGACGGGTTCTCAAGGAACCAGAGCACTGCTCTCTCATCACAGTCAAGCCTGTTGTGTAGAGCAAAGCATTCCTTTCGGTATGCAGTACCTCTGACA
GTAATCAGATGGCACGAAAGAGATGATATTACATTTCAGTCATCACAAGCAGGTACCAATTGCGTGTTCTGTTTCCTCAAAGGTGGTTTTCCTGCTGTAATCACCAACAT
GGATTGTAATGAAAAGACCAGAAACGATGATCAAATTACTACACAGTCACATGCTACATGTATTCTCCAGATGCCATATGCTAATGGGTTAACGTTCATATCTGTGTCTC
TGCATTCCCGGAGTTTCGGAGGAAAAATTACGATGTATAAGATGGACCACCTATGA
mRNA sequenceShow/hide mRNA sequence
ATGTGGCGTTTGGTGCATTCCTCTTGGATACTGCACGTCAGCTGCAACAAGCGAAGGGTTGCACATATTGCTGCGCTTCTGTCTTCTGTTCTTCATTCTTCCACATTTTC
TGAAAGGATTCATAGAAAAATTCTTGAAGAAGGCAAAAAAAGTCCTCGAACGTTTCGTCTTGCTGCATTACATTTGACCGGCATGTGGCTCAGTCATCCATGGACCATAA
AGTATTATGTTAAAGAGCTGAAACTGCTATCACTATATGGTTCTGTTGCATTTGATGAAGATTTTGAAGCTGAATTAACTGATTATGAAGCAAGGACTGAGGTATCATTA
TTGGCAGAAAGTCCAGACCGTGAGCTCACTGAAGTGTTTATCAATACAGAATTGTATGCACGTGTATCTGTTGCTGTTCTGTTTCATAAGCTAGCTGACTTGGCTGATAT
GGCGGGATCGTCGAATGAATATGGGAGTTGCTCTGATGCTGTGGAATCTGGAAAATTGTTTCTGCTTGAGCTCCTTGATTCTGTGGTAAACAACAATGACCTTGCAAAGG
AATTGTATAAAAAGCATAGTGCGATCCATAGACGCAAAATACGTGCTTGGCAAATGATATGTGTTCTATCCCGGTTTGTCTCCCAAGATATAATTCAGCAAGTTACTAAT
AGCTTGCACGTTTCCCTCTCTAGAAACAACCTACCTTCTGTTCGTCAATTCTTGGAAACATTCGCAATCAGCATTTATTTGAAGTTTCCAACATTGGTTAAGGAGCAATT
GGTTCCTATATTACAAGATTACAATATAAGACCGCAGGTACTCTCTTCTTATGTATTTATAGCTACAAATGTCATCCTCCATGTAACTGAAGCCGTTCAATCCAGCCATT
TGGATGAGTTGCTTCCTCCTCTTGTTCCACAATTGACCTCTCATCACCACAGTTTACGAGGTTTTACTCAGTTACTGGTGTACCATGTTCTTTGTAAACTTTTTCCAGTA
GTGGAATTCAGAGCCACTGGGAATATGCCTTTAGAGAAAAGATGTTTTGAAGATTTGAAATCATACCTTGAGAAAAATCCTGATTGCGTCCGTTTACGGGCATCCATGGA
AGGATATTTGCATGCCTATAATCCTGCACTATCTGTCACACCATCTGGAATTTTCTCCAGCAGAGTTAAGGACCTTGAGTTTGAGTGTGTCCCAACATCTCTCATGGAGC
AAGTTCTTAACTTCCTAAACGATGTCAGAGAAGACCTTCGGTGTTCAATGGCAAATGATTTGACAGCTATACAAAACGAGAGCTTCAAAACTAATGAAGATCACAACCTT
ATGGACATGTCATCTGACTTAAAGAAAGAAAGCTTTACTTCCAAACTGCCTCTAGCAACTTCGTTGGATTTTCAGAAGAAGGTCACTCTCTCAAAACATGAGAAGAAAGA
CACTGATACTAGTTCCTACTTGGGCAGCAAAGAAGCTTACAAGTATCTTAACGAAATGGAGGAAGAGGATCAGCTTCTTCACCAGCTGCTGCATTCCAGAAGTTTGTCAA
TGGAAGATTTAAGAATGAATCGGCAACATTTTATCCTTGTAGCATCTCTGCTTGATCGCATACCAAATCTTGCTGGTTTGGCTAGGACTTGTGAGGTTTTTAAGGCTGCA
GGATTGGCCATTGCTGACTTGAATATTCTATATGACAAACAATTCCAACTCATCAGTGTTACAGCGGAGAAGTGGGTTCCTATTGTAGAAGTCCCAGTGAACAGTATGAA
ACTTTTCCTAGACAAAAAGAAACGGGAAGGCTTCTCTATTTTGGGATTGGAGCAAACAGCTAACAGTGTACCTCTTGACCAGTATACATTTCCCAAAAAGACAGTGTTGG
TCCTTGGGCGTGAAAAAGAAGGTATACCTGTTGACATAATCCATATTCTTGACGCATGTGTTGAGATTCCTCAATTGGGAGTTGTCAGATCTCTTAATGTTCACGTTAGT
GGTGCCATTGCACTCTGGGAGTTTACTCGACAACAAAGGCACCAAATTGAGTTCTTGTCCATATACAAACGATGGTGGTACGCTGCCAAACAATACAAGCGAGAAGAACC
AAGGAAGATGAAGCTTACCGGCATCAATAACAGCATCCCAGTTGGTACTTCCGTCTTTCCGAAACTGATTCAAATCCCAAGTCCCATTAACCCATTTAGGGTCTTCAAAT
TTGCTCACCACTCCACTTCTCCGGCAGCCGCCACAGACCCATTAGATCCGGCGAGGCTTTCCTGCTCTTCCGATACCGCCTTGGCCGGCTCTGGATCTTTCTGTTCGACG
GCGGTTTCTACAGCCGTGGCAGCTGAAATTCCGGCGCCATTGTCGGCAGCGGCCCTGATGGTGAAGAGAGAGGAAGCGGGATTCTGGAGAAGAGACTCAGAAGAACACAG
CGCTCCTCATCTGAGCTATCCTCAGCCTTGGAATTGTGATGGATGGGGAAGATTTATCGCTCTGCTTCCAGGTTTTCCAGTTATGGAGGTCGCGACCACTGCCTCCAGCT
TCGCGCTTACCAAGCGAATATCCCCATCAATTACTTCTGCCCATAATGGAAAGCCGAACCGGACCTTCTCCTGGTCGTCCTCCCGTCTCAGAGCGATGCCGCTTTCCTCT
AATTTTCTCGCCCCATTCGCTGGGGGCAGCGTAGCCGGGGACTTCACCGGCGTGAAGCTCCGACCGTCGTCTCTTAATCCAGATTCTTTCTATGGATCCAAGGGCAAGCG
AGGCGTTGTCACCATGGTTATTCCATTTTCAAGGGGAAGTGCATGGGAGCAACCTCCCCCGGATTTGGCCTCTTATTTATACAAGAACCGAATTGTTTACTTGGGCATGT
CTCTTGTTCCTTCTGTCACAGAGTTGATACTCGCGGAATTTCTTTACCTTCAGTATGAGGATGAAGCAAAGCCTATATATCTTTACATTAACTCCACAGGGACACAAAGG
TATGTTAAGCCTCCTATATTCACATTATGCGTTGGCAATGCTTGGGGAGAAGCAGCACTTCTTTTGGCAGCTGGTGCTAAGGGAAACCGTTCTGCATTGCCCTCATCAAC
AATAATGATCAAGCAGGTTAAACTTTATGCAAAGCATATTGGAAAGTCGACCGAGGAGATTGAAGCTGATATCAGGAGGCCAAAGTATTTTAGTCCCAGTGAAGCAGTTG
AATATGATGAGGTAGACGGGTTCTCAAGGAACCAGAGCACTGCTCTCTCATCACAGTCAAGCCTGTTGTGTAGAGCAAAGCATTCCTTTCGGTATGCAGTACCTCTGACA
GTAATCAGATGGCACGAAAGAGATGATATTACATTTCAGTCATCACAAGCAGGTACCAATTGCGTGTTCTGTTTCCTCAAAGGTGGTTTTCCTGCTGTAATCACCAACAT
GGATTGTAATGAAAAGACCAGAAACGATGATCAAATTACTACACAGTCACATGCTACATGTATTCTCCAGATGCCATATGCTAATGGGTTAACGTTCATATCTGTGTCTC
TGCATTCCCGGAGTTTCGGAGGAAAAATTACGATGTATAAGATGGACCACCTATGA
Protein sequenceShow/hide protein sequence
MWRLVHSSWILHVSCNKRRVAHIAALLSSVLHSSTFSERIHRKILEEGKKSPRTFRLAALHLTGMWLSHPWTIKYYVKELKLLSLYGSVAFDEDFEAELTDYEARTEVSL
LAESPDRELTEVFINTELYARVSVAVLFHKLADLADMAGSSNEYGSCSDAVESGKLFLLELLDSVVNNNDLAKELYKKHSAIHRRKIRAWQMICVLSRFVSQDIIQQVTN
SLHVSLSRNNLPSVRQFLETFAISIYLKFPTLVKEQLVPILQDYNIRPQVLSSYVFIATNVILHVTEAVQSSHLDELLPPLVPQLTSHHHSLRGFTQLLVYHVLCKLFPV
VEFRATGNMPLEKRCFEDLKSYLEKNPDCVRLRASMEGYLHAYNPALSVTPSGIFSSRVKDLEFECVPTSLMEQVLNFLNDVREDLRCSMANDLTAIQNESFKTNEDHNL
MDMSSDLKKESFTSKLPLATSLDFQKKVTLSKHEKKDTDTSSYLGSKEAYKYLNEMEEEDQLLHQLLHSRSLSMEDLRMNRQHFILVASLLDRIPNLAGLARTCEVFKAA
GLAIADLNILYDKQFQLISVTAEKWVPIVEVPVNSMKLFLDKKKREGFSILGLEQTANSVPLDQYTFPKKTVLVLGREKEGIPVDIIHILDACVEIPQLGVVRSLNVHVS
GAIALWEFTRQQRHQIEFLSIYKRWWYAAKQYKREEPRKMKLTGINNSIPVGTSVFPKLIQIPSPINPFRVFKFAHHSTSPAAATDPLDPARLSCSSDTALAGSGSFCST
AVSTAVAAEIPAPLSAAALMVKREEAGFWRRDSEEHSAPHLSYPQPWNCDGWGRFIALLPGFPVMEVATTASSFALTKRISPSITSAHNGKPNRTFSWSSSRLRAMPLSS
NFLAPFAGGSVAGDFTGVKLRPSSLNPDSFYGSKGKRGVVTMVIPFSRGSAWEQPPPDLASYLYKNRIVYLGMSLVPSVTELILAEFLYLQYEDEAKPIYLYINSTGTQR
YVKPPIFTLCVGNAWGEAALLLAAGAKGNRSALPSSTIMIKQVKLYAKHIGKSTEEIEADIRRPKYFSPSEAVEYDEVDGFSRNQSTALSSQSSLLCRAKHSFRYAVPLT
VIRWHERDDITFQSSQAGTNCVFCFLKGGFPAVITNMDCNEKTRNDDQITTQSHATCILQMPYANGLTFISVSLHSRSFGGKITMYKMDHL