; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0019758 (gene) of Snake gourd v1 genome

Gene IDTan0019758
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionMethyltransferase FkbM
Genome locationLG03:63348524..63350080
RNA-Seq ExpressionTan0019758
SyntenyTan0019758
Gene Ontology termsGO:0032259 - methylation (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0008168 - methyltransferase activity (molecular function)
InterPro domainsIPR006342 - Methyltransferase FkbM
IPR013216 - Methyltransferase type 11
IPR029063 - S-adenosyl-L-methionine-dependent methyltransferase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004148602.1 uncharacterized protein LOC101209474 [Cucumis sativus]1.7e-27992.13Show/hide
Query:  MGSMSGKFASLRNLLVRVLLFSILVILGRFAYIVTIAGESCSIGDFCFLALPENLNFMNPAGGGGRAGSAALFTDNADPRVIGPPRPDLYSSEDWIKTVQ
        MGSM GKFASLRNLLVRVLLFSILVI+GRFAY+VTIAGESCSIGDFCF +LPE  NFM P GGGGR GSAALFTDNADPRVIGPPRPDLYSSEDWIKTVQ
Subjt:  MGSMSGKFASLRNLLVRVLLFSILVILGRFAYIVTIAGESCSIGDFCFLALPENLNFMNPAGGGGRAGSAALFTDNADPRVIGPPRPDLYSSEDWIKTVQ

Query:  FYSSVFQDLITEGYLSPNSKSLCIETPAGADVSALKEIGVSDSVGIFKKALKPLVIKGEAHRIPFEDSTFDFVFSGGSRLDISRRAYDFASEIARILKPE
        FYSSVFQDLITEGYLSPNSKSLCIETPAGADVSALK+IGVS+SVGIFKKALKPLVIKGEAHRIPFED+TFDFVFSGGSRLDISRR +DFASEIARILKPE
Subjt:  FYSSVFQDLITEGYLSPNSKSLCIETPAGADVSALKEIGVSDSVGIFKKALKPLVIKGEAHRIPFEDSTFDFVFSGGSRLDISRRAYDFASEIARILKPE

Query:  GFAVVHVSAKDTYSFNSFVDLFNCCKIVKSKDIDVVLPSMPFIREYVLKKEYRILENGLQKL--DGDSYKKNCSVPGFKISLVRNAEPLILEEPLKPWLT
        GFAVVHVSAKDTYSFNSFVDLFNCCKI+K+KDID+VLPSMP  REYVLKKEYRILENGL KL  DG SYKKNCSVPG+K++L RNAEPLILEEPLKPWLT
Subjt:  GFAVVHVSAKDTYSFNSFVDLFNCCKIVKSKDIDVVLPSMPFIREYVLKKEYRILENGLQKL--DGDSYKKNCSVPGFKISLVRNAEPLILEEPLKPWLT

Query:  LKRNIQNIKYLPSMAEISFKNRYVYVDVGARGYGSSIGSWFKKQYPKQNKTFEVYAIEADQTFHEQYRSKKGVTLLPYAAWVRNETLTFEINKDPGQEKE
        LKRNIQN+KYLPSMAEISFKNRYVYVDVGAR YGSSIGSWFKKQYPKQNKTFEVYAIEADQTFHEQY+SKKGVTLLPYAAWVRNETL FEINKDPGQ KE
Subjt:  LKRNIQNIKYLPSMAEISFKNRYVYVDVGARGYGSSIGSWFKKQYPKQNKTFEVYAIEADQTFHEQYRSKKGVTLLPYAAWVRNETLTFEINKDPGQEKE

Query:  EKG-SRGMGRIQPDQSSGHFNGEVNQIQGFDFANWLKNTVSEKDFVVLKMDVEGTEFELIPRLFETGAICLIDEMFLECHYNRWQRCCPGQRSTKYEKTY
        +KG SRGMGRIQP QS G F+GEVNQIQGFDFANWLKNTVSEKDFVVLKMDVEGTEFELIPRLFETGAICLIDEMFLECHYNRWQRCCPGQRSTKYEKTY
Subjt:  EKG-SRGMGRIQPDQSSGHFNGEVNQIQGFDFANWLKNTVSEKDFVVLKMDVEGTEFELIPRLFETGAICLIDEMFLECHYNRWQRCCPGQRSTKYEKTY

Query:  SQCIELFASLRQSGVLVHQWW
        S+CI+LFASLRQSGVLVHQWW
Subjt:  SQCIELFASLRQSGVLVHQWW

XP_008464242.1 PREDICTED: uncharacterized protein LOC103502172 [Cucumis melo]2.9e-27992.51Show/hide
Query:  MGSMSGKFASLRNLLVRVLLFSILVILGRFAYIVTIAGESCSIGDFCFLALPENLNFMNPAGGGGRAGSAALFTDNADPRVIGPPRPDLYSSEDWIKTVQ
        MGSM GKFASLRNLLVRVLLFSILVI+GRFAY+VTIAGESCSIGDFCF +LPE  NFM P G GGR GSAALFTDN DPRVIGPPRPDLYSSEDWIKTVQ
Subjt:  MGSMSGKFASLRNLLVRVLLFSILVILGRFAYIVTIAGESCSIGDFCFLALPENLNFMNPAGGGGRAGSAALFTDNADPRVIGPPRPDLYSSEDWIKTVQ

Query:  FYSSVFQDLITEGYLSPNSKSLCIETPAGADVSALKEIGVSDSVGIFKKALKPLVIKGEAHRIPFEDSTFDFVFSGGSRLDISRRAYDFASEIARILKPE
        FYSSVFQDLITEGYLSPNSKSLCIETPAGADVSALKEIGVS+SVGIFKKALKPLVIKGEAHRIPFED+TFDFVFSGGSRLDISRR +DFASEIARILKPE
Subjt:  FYSSVFQDLITEGYLSPNSKSLCIETPAGADVSALKEIGVSDSVGIFKKALKPLVIKGEAHRIPFEDSTFDFVFSGGSRLDISRRAYDFASEIARILKPE

Query:  GFAVVHVSAKDTYSFNSFVDLFNCCKIVKSKDIDVVLPSMPFIREYVLKKEYRILENGLQKL--DGDSYKKNCSVPGFKISLVRNAEPLILEEPLKPWLT
        GFAVVHVSAKDTYSFNSFVDLFN CKI+KSKDID+VLPSMP  REYVLKKEYRILENGL KL  DG SYKKNCSVPG+K++L RNAEPLILEEPLKPWLT
Subjt:  GFAVVHVSAKDTYSFNSFVDLFNCCKIVKSKDIDVVLPSMPFIREYVLKKEYRILENGLQKL--DGDSYKKNCSVPGFKISLVRNAEPLILEEPLKPWLT

Query:  LKRNIQNIKYLPSMAEISFKNRYVYVDVGARGYGSSIGSWFKKQYPKQNKTFEVYAIEADQTFHEQYRSKKGVTLLPYAAWVRNETLTFEINKDPGQEKE
        LKRNIQNIKYLPSMAEISFKNRYVYVDVGAR YGSSIGSWFKKQYPKQNKTFEVYAIEADQTFHEQY+SKKGVTLLPYAAWVRNETL FEINKDPGQEKE
Subjt:  LKRNIQNIKYLPSMAEISFKNRYVYVDVGARGYGSSIGSWFKKQYPKQNKTFEVYAIEADQTFHEQYRSKKGVTLLPYAAWVRNETLTFEINKDPGQEKE

Query:  EK-GSRGMGRIQPDQSSGHFNGEVNQIQGFDFANWLKNTVSEKDFVVLKMDVEGTEFELIPRLFETGAICLIDEMFLECHYNRWQRCCPGQRSTKYEKTY
        +K GSRGMGRIQP QSSG F+GEVNQIQGFDFA+WLKNTVSEKDFVVLKMDVEGTEFELIPRLFETGAICLIDEMFLECHYNRWQRCCPGQRSTKYEKTY
Subjt:  EK-GSRGMGRIQPDQSSGHFNGEVNQIQGFDFANWLKNTVSEKDFVVLKMDVEGTEFELIPRLFETGAICLIDEMFLECHYNRWQRCCPGQRSTKYEKTY

Query:  SQCIELFASLRQSGVLVHQWW
        SQCI+LFASLRQSGVLVHQWW
Subjt:  SQCIELFASLRQSGVLVHQWW

XP_022144100.1 uncharacterized protein LOC111013875 [Momordica charantia]3.8e-27991.31Show/hide
Query:  MGSMSGKFASLRNLLVRVLLFSILVILGRFAYIVTIAGESCSIGDFCFLALPENLNFMNPAGGGGRAGSAALFTDNADPRVIGPPRPDLYSSEDWIKTVQ
        MGSMSGKFASLRNLLVRVLLFSIL+I+GRFAYIVTI GESCSIGDFCF +LPEN NFM    G GRAG+AALFTDNADPRVIGPPRPDLYSS+DWIKTVQ
Subjt:  MGSMSGKFASLRNLLVRVLLFSILVILGRFAYIVTIAGESCSIGDFCFLALPENLNFMNPAGGGGRAGSAALFTDNADPRVIGPPRPDLYSSEDWIKTVQ

Query:  FYSSVFQDLITEGYLSPNSKSLCIETPAGADVSALKEIGVSDSVGIFKKALKPLVIKGEAHRIPFEDSTFDFVFSGGSRLDISRRAYDFASEIARILKPE
        F+SSVF+DL+TEGYLSPNSKSLC+ETPAG DVSALK IGVSDSVGIFKKALKPLVIKGEAHRIPF D+TFDFVFSGG RLDISR  Y+FASEIARILKPE
Subjt:  FYSSVFQDLITEGYLSPNSKSLCIETPAGADVSALKEIGVSDSVGIFKKALKPLVIKGEAHRIPFEDSTFDFVFSGGSRLDISRRAYDFASEIARILKPE

Query:  GFAVVHVSAKDTYSFNSFVDLFNCCKIVKSKDIDVVLPSMPFIREYVLKKEYRILENGLQKLDGDSYKKNCSVPGFKISLVRNAEPLILEEPLKPWLTLK
        GFAVVHVSAKDTYSFNSFVDLFNCCKIVKS+DIDVVLPSMP IREYVLKKEYR+LE GL KLDGDS+KKNCSVPG+K++LVRNAEPLILEEPLKPWLTLK
Subjt:  GFAVVHVSAKDTYSFNSFVDLFNCCKIVKSKDIDVVLPSMPFIREYVLKKEYRILENGLQKLDGDSYKKNCSVPGFKISLVRNAEPLILEEPLKPWLTLK

Query:  RNIQNIKYLPSMAEISFKNRYVYVDVGARGYGSSIGSWFKKQYPKQNKTFEVYAIEADQTFHEQYRSKKGVTLLPYAAWVRNETLTFEINKDPGQEKEEK
        RNIQ IKYLPSMAEISFKNRYVYVDVGAR YGSSIGSWFKKQYPKQNKTFEVYAIEAD+TFHEQYRSKK VTLLPYAAWVRNETL FEINKDPGQEKEEK
Subjt:  RNIQNIKYLPSMAEISFKNRYVYVDVGARGYGSSIGSWFKKQYPKQNKTFEVYAIEADQTFHEQYRSKKGVTLLPYAAWVRNETLTFEINKDPGQEKEEK

Query:  GSRGMGRIQPDQSSGHFNGEVNQIQGFDFANWLKNTVSEKDFVVLKMDVEGTEFELIPRLFETGAICLIDEMFLECHYNRWQRCCPGQRSTKYEKTYSQC
        G RGMGRIQP QSSGHF+GEVNQI+GFDFANWLKNTVSEKDFVVLKMDVEGTEFELIPRLFETGAICLIDEMFLECHYNRWQRCCPGQRS+KYEKTYSQC
Subjt:  GSRGMGRIQPDQSSGHFNGEVNQIQGFDFANWLKNTVSEKDFVVLKMDVEGTEFELIPRLFETGAICLIDEMFLECHYNRWQRCCPGQRSTKYEKTYSQC

Query:  IELFASLRQSGVLVHQWW
        IELF SLRQSGVLVHQWW
Subjt:  IELFASLRQSGVLVHQWW

XP_022978145.1 uncharacterized protein LOC111478211 [Cucurbita maxima]2.4e-28192.86Show/hide
Query:  MGSMSGKFASLRNLLVRVLLFSILVILGRFAYIVTIAGESCSIGDFCFLALPENLNFMNPAGGGGRAGSAALFTDNADPRVIGPPRPDLYSSEDWIKTVQ
        MGS+SGKFASLRNLLVRVLLFSILVILGRFAYIVTIAGESC IGDFCF +LPEN NFM    GGGRAGSAALFTDN D RVIGPPRPDLYSSEDWIKTVQ
Subjt:  MGSMSGKFASLRNLLVRVLLFSILVILGRFAYIVTIAGESCSIGDFCFLALPENLNFMNPAGGGGRAGSAALFTDNADPRVIGPPRPDLYSSEDWIKTVQ

Query:  FYSSVFQDLITEGYLSPNSKSLCIETPAGADVSALKEIGVSDSVGIFKKALKPLVIKGEAHRIPFEDSTFDFVFSGGSRLDISRRAYDFASEIARILKPE
        FYSSVFQDL+TEGYLS NSK+LCIETPAGADVSALKEIGVSDSVGIFKKALKP VIKGEAHRIPFED TFDFVFSGG RLDIS+R YDFASEIARILKPE
Subjt:  FYSSVFQDLITEGYLSPNSKSLCIETPAGADVSALKEIGVSDSVGIFKKALKPLVIKGEAHRIPFEDSTFDFVFSGGSRLDISRRAYDFASEIARILKPE

Query:  GFAVVHVSAKDTYSFNSFVDLFNCCKIVKSKDIDVVLPSMPFIREYVLKKEYRILENGLQKLDGDSYKKNCSVPGFKISLVRNAEPLILEEPLKPWLTLK
        GFAVVHVSAKDTYSFNSFVDLFNCCKI+KSKDID  LPSMPF REYVLKKEYRILEN LQKLDGDSYKKNCSVPG+K+SLVRNAEPLILEEPLKPWLTLK
Subjt:  GFAVVHVSAKDTYSFNSFVDLFNCCKIVKSKDIDVVLPSMPFIREYVLKKEYRILENGLQKLDGDSYKKNCSVPGFKISLVRNAEPLILEEPLKPWLTLK

Query:  RNIQNIKYLPSMAEISFKNRYVYVDVGARGYGSSIGSWFKKQYPKQNKTFEVYAIEADQTFHEQYRSKKGVTLLPYAAWVRNETLTFEINKDPGQEKEEK
        RNIQNIKYLPSMAEISFKN+YVYVDVGAR YGSSIGSWFKKQYPKQNKTFEVYAIEADQTFHEQYRSKK VTLLPYAAWVRNETL FEINKDPGQE E+K
Subjt:  RNIQNIKYLPSMAEISFKNRYVYVDVGARGYGSSIGSWFKKQYPKQNKTFEVYAIEADQTFHEQYRSKKGVTLLPYAAWVRNETLTFEINKDPGQEKEEK

Query:  GSRGMGRIQPDQSSGHFNGEVNQIQGFDFANWLKNTVSEKDFVVLKMDVEGTEFELIPRLFETGAICLIDEMFLECHYNRWQRCCPGQRSTKYEKTYSQC
        GSRGMGRIQP QSSGHFNGEVNQIQGFDFANWLKNTVSEKDFVVLKMDVEGTEFELIPRLFETGAICLIDEMFLECHYNRWQRCCPG+RS KYEKTYS+C
Subjt:  GSRGMGRIQPDQSSGHFNGEVNQIQGFDFANWLKNTVSEKDFVVLKMDVEGTEFELIPRLFETGAICLIDEMFLECHYNRWQRCCPGQRSTKYEKTYSQC

Query:  IELFASLRQSGVLVHQWW
        I+LFASLRQSGVLVHQWW
Subjt:  IELFASLRQSGVLVHQWW

XP_038881642.1 uncharacterized protein LOC120073098 [Benincasa hispida]1.6e-28092.72Show/hide
Query:  MGSMSGKFASLRNLLVRVLLFSILVILGRFAYIVTIAGESCSIGDFCFLALPENLNFMNPAGGGGRAGSAALFTDNADPRVIGPPRPDLYSSEDWIKTVQ
        MGSMSGKFASLRNLLVRVLLFSILVI+GRFAYIVTIAGESCSIGDFCF +LPEN NFM  AGGGGRAGSAALFTDNADPRVIGPPRPDLYSSEDWIKTVQ
Subjt:  MGSMSGKFASLRNLLVRVLLFSILVILGRFAYIVTIAGESCSIGDFCFLALPENLNFMNPAGGGGRAGSAALFTDNADPRVIGPPRPDLYSSEDWIKTVQ

Query:  FYSSVFQDLITEGYLSPNSKSLCIETPAGADVSALKEIGVSDSVGIFKKALKPLVIKGEAHRIPFEDSTFDFVFSGGSRLDISRRAYDFASEIARILKPE
        FYSSVFQDLITEGYLSPNSKSLCIETP GADVSALKEIGVSDSVGIFKKA KPLVIKGEAHRIPFE +TFDFVFSGG RLDISRR +DFASEIARILKPE
Subjt:  FYSSVFQDLITEGYLSPNSKSLCIETPAGADVSALKEIGVSDSVGIFKKALKPLVIKGEAHRIPFEDSTFDFVFSGGSRLDISRRAYDFASEIARILKPE

Query:  GFAVVHVSAKDTYSFNSFVDLFNCCKIVKSKDIDVVLPSMPFIREYVLKKEYRILENGLQKLDGD---SYKKNCSVPGFKISLVRNAEPLILEEPLKPWL
        GFAVVHVSAKDTYSFNSFVDLFNCCKI+KSKDID+VLPSMPF REYVLKKEYRILENGLQKLDG+    + KNCSVPG+K++LVRNAEPLILEEPLKPWL
Subjt:  GFAVVHVSAKDTYSFNSFVDLFNCCKIVKSKDIDVVLPSMPFIREYVLKKEYRILENGLQKLDGD---SYKKNCSVPGFKISLVRNAEPLILEEPLKPWL

Query:  TLKRNIQNIKYLPSMAEISFKNRYVYVDVGARGYGSSIGSWFKKQYPKQNKTFEVYAIEADQTFHEQYRSKKGVTLLPYAAWVRNETLTFEINKDPGQEK
        TLKRNIQNIKYLPSMAEISFKNRYVYVDVGAR YGSSIGSWFKKQYPKQNKTFEVYAIEADQTFHEQY+SKKGVTLLPYAAWVRNETL FEINKDP QEK
Subjt:  TLKRNIQNIKYLPSMAEISFKNRYVYVDVGARGYGSSIGSWFKKQYPKQNKTFEVYAIEADQTFHEQYRSKKGVTLLPYAAWVRNETLTFEINKDPGQEK

Query:  EEK-GSRGMGRIQPDQSSGHFNGEVNQIQGFDFANWLKNTVSEKDFVVLKMDVEGTEFELIPRLFETGAICLIDEMFLECHYNRWQRCCPGQRSTKYEKT
        E+K GSRGMGRIQP QSS  F+GEVNQIQGFDFANWLKNTVSEKDFVVLKMDVEGTEFELIPRLFETGAICLIDEMFLECHYNRWQRCCPGQRSTKYEKT
Subjt:  EEK-GSRGMGRIQPDQSSGHFNGEVNQIQGFDFANWLKNTVSEKDFVVLKMDVEGTEFELIPRLFETGAICLIDEMFLECHYNRWQRCCPGQRSTKYEKT

Query:  YSQCIELFASLRQSGVLVHQWW
        YSQCI+L ASLRQSGVLVHQWW
Subjt:  YSQCIELFASLRQSGVLVHQWW

TrEMBL top hitse value%identityAlignment
A0A0A0L2S2 Uncharacterized protein8.3e-28092.13Show/hide
Query:  MGSMSGKFASLRNLLVRVLLFSILVILGRFAYIVTIAGESCSIGDFCFLALPENLNFMNPAGGGGRAGSAALFTDNADPRVIGPPRPDLYSSEDWIKTVQ
        MGSM GKFASLRNLLVRVLLFSILVI+GRFAY+VTIAGESCSIGDFCF +LPE  NFM P GGGGR GSAALFTDNADPRVIGPPRPDLYSSEDWIKTVQ
Subjt:  MGSMSGKFASLRNLLVRVLLFSILVILGRFAYIVTIAGESCSIGDFCFLALPENLNFMNPAGGGGRAGSAALFTDNADPRVIGPPRPDLYSSEDWIKTVQ

Query:  FYSSVFQDLITEGYLSPNSKSLCIETPAGADVSALKEIGVSDSVGIFKKALKPLVIKGEAHRIPFEDSTFDFVFSGGSRLDISRRAYDFASEIARILKPE
        FYSSVFQDLITEGYLSPNSKSLCIETPAGADVSALK+IGVS+SVGIFKKALKPLVIKGEAHRIPFED+TFDFVFSGGSRLDISRR +DFASEIARILKPE
Subjt:  FYSSVFQDLITEGYLSPNSKSLCIETPAGADVSALKEIGVSDSVGIFKKALKPLVIKGEAHRIPFEDSTFDFVFSGGSRLDISRRAYDFASEIARILKPE

Query:  GFAVVHVSAKDTYSFNSFVDLFNCCKIVKSKDIDVVLPSMPFIREYVLKKEYRILENGLQKL--DGDSYKKNCSVPGFKISLVRNAEPLILEEPLKPWLT
        GFAVVHVSAKDTYSFNSFVDLFNCCKI+K+KDID+VLPSMP  REYVLKKEYRILENGL KL  DG SYKKNCSVPG+K++L RNAEPLILEEPLKPWLT
Subjt:  GFAVVHVSAKDTYSFNSFVDLFNCCKIVKSKDIDVVLPSMPFIREYVLKKEYRILENGLQKL--DGDSYKKNCSVPGFKISLVRNAEPLILEEPLKPWLT

Query:  LKRNIQNIKYLPSMAEISFKNRYVYVDVGARGYGSSIGSWFKKQYPKQNKTFEVYAIEADQTFHEQYRSKKGVTLLPYAAWVRNETLTFEINKDPGQEKE
        LKRNIQN+KYLPSMAEISFKNRYVYVDVGAR YGSSIGSWFKKQYPKQNKTFEVYAIEADQTFHEQY+SKKGVTLLPYAAWVRNETL FEINKDPGQ KE
Subjt:  LKRNIQNIKYLPSMAEISFKNRYVYVDVGARGYGSSIGSWFKKQYPKQNKTFEVYAIEADQTFHEQYRSKKGVTLLPYAAWVRNETLTFEINKDPGQEKE

Query:  EKG-SRGMGRIQPDQSSGHFNGEVNQIQGFDFANWLKNTVSEKDFVVLKMDVEGTEFELIPRLFETGAICLIDEMFLECHYNRWQRCCPGQRSTKYEKTY
        +KG SRGMGRIQP QS G F+GEVNQIQGFDFANWLKNTVSEKDFVVLKMDVEGTEFELIPRLFETGAICLIDEMFLECHYNRWQRCCPGQRSTKYEKTY
Subjt:  EKG-SRGMGRIQPDQSSGHFNGEVNQIQGFDFANWLKNTVSEKDFVVLKMDVEGTEFELIPRLFETGAICLIDEMFLECHYNRWQRCCPGQRSTKYEKTY

Query:  SQCIELFASLRQSGVLVHQWW
        S+CI+LFASLRQSGVLVHQWW
Subjt:  SQCIELFASLRQSGVLVHQWW

A0A1S3CL16 uncharacterized protein LOC1035021721.4e-27992.51Show/hide
Query:  MGSMSGKFASLRNLLVRVLLFSILVILGRFAYIVTIAGESCSIGDFCFLALPENLNFMNPAGGGGRAGSAALFTDNADPRVIGPPRPDLYSSEDWIKTVQ
        MGSM GKFASLRNLLVRVLLFSILVI+GRFAY+VTIAGESCSIGDFCF +LPE  NFM P G GGR GSAALFTDN DPRVIGPPRPDLYSSEDWIKTVQ
Subjt:  MGSMSGKFASLRNLLVRVLLFSILVILGRFAYIVTIAGESCSIGDFCFLALPENLNFMNPAGGGGRAGSAALFTDNADPRVIGPPRPDLYSSEDWIKTVQ

Query:  FYSSVFQDLITEGYLSPNSKSLCIETPAGADVSALKEIGVSDSVGIFKKALKPLVIKGEAHRIPFEDSTFDFVFSGGSRLDISRRAYDFASEIARILKPE
        FYSSVFQDLITEGYLSPNSKSLCIETPAGADVSALKEIGVS+SVGIFKKALKPLVIKGEAHRIPFED+TFDFVFSGGSRLDISRR +DFASEIARILKPE
Subjt:  FYSSVFQDLITEGYLSPNSKSLCIETPAGADVSALKEIGVSDSVGIFKKALKPLVIKGEAHRIPFEDSTFDFVFSGGSRLDISRRAYDFASEIARILKPE

Query:  GFAVVHVSAKDTYSFNSFVDLFNCCKIVKSKDIDVVLPSMPFIREYVLKKEYRILENGLQKL--DGDSYKKNCSVPGFKISLVRNAEPLILEEPLKPWLT
        GFAVVHVSAKDTYSFNSFVDLFN CKI+KSKDID+VLPSMP  REYVLKKEYRILENGL KL  DG SYKKNCSVPG+K++L RNAEPLILEEPLKPWLT
Subjt:  GFAVVHVSAKDTYSFNSFVDLFNCCKIVKSKDIDVVLPSMPFIREYVLKKEYRILENGLQKL--DGDSYKKNCSVPGFKISLVRNAEPLILEEPLKPWLT

Query:  LKRNIQNIKYLPSMAEISFKNRYVYVDVGARGYGSSIGSWFKKQYPKQNKTFEVYAIEADQTFHEQYRSKKGVTLLPYAAWVRNETLTFEINKDPGQEKE
        LKRNIQNIKYLPSMAEISFKNRYVYVDVGAR YGSSIGSWFKKQYPKQNKTFEVYAIEADQTFHEQY+SKKGVTLLPYAAWVRNETL FEINKDPGQEKE
Subjt:  LKRNIQNIKYLPSMAEISFKNRYVYVDVGARGYGSSIGSWFKKQYPKQNKTFEVYAIEADQTFHEQYRSKKGVTLLPYAAWVRNETLTFEINKDPGQEKE

Query:  EK-GSRGMGRIQPDQSSGHFNGEVNQIQGFDFANWLKNTVSEKDFVVLKMDVEGTEFELIPRLFETGAICLIDEMFLECHYNRWQRCCPGQRSTKYEKTY
        +K GSRGMGRIQP QSSG F+GEVNQIQGFDFA+WLKNTVSEKDFVVLKMDVEGTEFELIPRLFETGAICLIDEMFLECHYNRWQRCCPGQRSTKYEKTY
Subjt:  EK-GSRGMGRIQPDQSSGHFNGEVNQIQGFDFANWLKNTVSEKDFVVLKMDVEGTEFELIPRLFETGAICLIDEMFLECHYNRWQRCCPGQRSTKYEKTY

Query:  SQCIELFASLRQSGVLVHQWW
        SQCI+LFASLRQSGVLVHQWW
Subjt:  SQCIELFASLRQSGVLVHQWW

A0A6J1CQP9 uncharacterized protein LOC1110138751.9e-27991.31Show/hide
Query:  MGSMSGKFASLRNLLVRVLLFSILVILGRFAYIVTIAGESCSIGDFCFLALPENLNFMNPAGGGGRAGSAALFTDNADPRVIGPPRPDLYSSEDWIKTVQ
        MGSMSGKFASLRNLLVRVLLFSIL+I+GRFAYIVTI GESCSIGDFCF +LPEN NFM    G GRAG+AALFTDNADPRVIGPPRPDLYSS+DWIKTVQ
Subjt:  MGSMSGKFASLRNLLVRVLLFSILVILGRFAYIVTIAGESCSIGDFCFLALPENLNFMNPAGGGGRAGSAALFTDNADPRVIGPPRPDLYSSEDWIKTVQ

Query:  FYSSVFQDLITEGYLSPNSKSLCIETPAGADVSALKEIGVSDSVGIFKKALKPLVIKGEAHRIPFEDSTFDFVFSGGSRLDISRRAYDFASEIARILKPE
        F+SSVF+DL+TEGYLSPNSKSLC+ETPAG DVSALK IGVSDSVGIFKKALKPLVIKGEAHRIPF D+TFDFVFSGG RLDISR  Y+FASEIARILKPE
Subjt:  FYSSVFQDLITEGYLSPNSKSLCIETPAGADVSALKEIGVSDSVGIFKKALKPLVIKGEAHRIPFEDSTFDFVFSGGSRLDISRRAYDFASEIARILKPE

Query:  GFAVVHVSAKDTYSFNSFVDLFNCCKIVKSKDIDVVLPSMPFIREYVLKKEYRILENGLQKLDGDSYKKNCSVPGFKISLVRNAEPLILEEPLKPWLTLK
        GFAVVHVSAKDTYSFNSFVDLFNCCKIVKS+DIDVVLPSMP IREYVLKKEYR+LE GL KLDGDS+KKNCSVPG+K++LVRNAEPLILEEPLKPWLTLK
Subjt:  GFAVVHVSAKDTYSFNSFVDLFNCCKIVKSKDIDVVLPSMPFIREYVLKKEYRILENGLQKLDGDSYKKNCSVPGFKISLVRNAEPLILEEPLKPWLTLK

Query:  RNIQNIKYLPSMAEISFKNRYVYVDVGARGYGSSIGSWFKKQYPKQNKTFEVYAIEADQTFHEQYRSKKGVTLLPYAAWVRNETLTFEINKDPGQEKEEK
        RNIQ IKYLPSMAEISFKNRYVYVDVGAR YGSSIGSWFKKQYPKQNKTFEVYAIEAD+TFHEQYRSKK VTLLPYAAWVRNETL FEINKDPGQEKEEK
Subjt:  RNIQNIKYLPSMAEISFKNRYVYVDVGARGYGSSIGSWFKKQYPKQNKTFEVYAIEADQTFHEQYRSKKGVTLLPYAAWVRNETLTFEINKDPGQEKEEK

Query:  GSRGMGRIQPDQSSGHFNGEVNQIQGFDFANWLKNTVSEKDFVVLKMDVEGTEFELIPRLFETGAICLIDEMFLECHYNRWQRCCPGQRSTKYEKTYSQC
        G RGMGRIQP QSSGHF+GEVNQI+GFDFANWLKNTVSEKDFVVLKMDVEGTEFELIPRLFETGAICLIDEMFLECHYNRWQRCCPGQRS+KYEKTYSQC
Subjt:  GSRGMGRIQPDQSSGHFNGEVNQIQGFDFANWLKNTVSEKDFVVLKMDVEGTEFELIPRLFETGAICLIDEMFLECHYNRWQRCCPGQRSTKYEKTYSQC

Query:  IELFASLRQSGVLVHQWW
        IELF SLRQSGVLVHQWW
Subjt:  IELFASLRQSGVLVHQWW

A0A6J1GE05 uncharacterized protein LOC1114530901.6e-27891.89Show/hide
Query:  MGSMSGKFASLRNLLVRVLLFSILVILGRFAYIVTIAGESCSIGDFCFLALPENLNFMNPAGGGGRAGSAALFTDNADPRVIGPPRPDLYSSEDWIKTVQ
        MGS+SGKFASLRNLLVRVLLFSILVILGRFAYIVTIAGESCSIGDFCF +LPEN NFM    GGGR GSAAL TDN D RVIGPPRPDLYSSEDWIKTVQ
Subjt:  MGSMSGKFASLRNLLVRVLLFSILVILGRFAYIVTIAGESCSIGDFCFLALPENLNFMNPAGGGGRAGSAALFTDNADPRVIGPPRPDLYSSEDWIKTVQ

Query:  FYSSVFQDLITEGYLSPNSKSLCIETPAGADVSALKEIGVSDSVGIFKKALKPLVIKGEAHRIPFEDSTFDFVFSGGSRLDISRRAYDFASEIARILKPE
        FYSSVFQDL+TEGYLS NSK LCIETPAGADVSALKEIGVSDSVGIFKKALKP VIKGEAHRIPFED TFDFVFSGG RLDIS R YDFASEIARILKPE
Subjt:  FYSSVFQDLITEGYLSPNSKSLCIETPAGADVSALKEIGVSDSVGIFKKALKPLVIKGEAHRIPFEDSTFDFVFSGGSRLDISRRAYDFASEIARILKPE

Query:  GFAVVHVSAKDTYSFNSFVDLFNCCKIVKSKDIDVVLPSMPFIREYVLKKEYRILENGLQKLDGDSYKKNCSVPGFKISLVRNAEPLILEEPLKPWLTLK
        GFAVVHVSAKDTYSFNSFVDLFNCCKI+KSKDID  LPSMPF REYVLKKE+RILEN LQKLDGDSYKKNCSVPG+K++LVRNAEPLILEEPLKPWLTLK
Subjt:  GFAVVHVSAKDTYSFNSFVDLFNCCKIVKSKDIDVVLPSMPFIREYVLKKEYRILENGLQKLDGDSYKKNCSVPGFKISLVRNAEPLILEEPLKPWLTLK

Query:  RNIQNIKYLPSMAEISFKNRYVYVDVGARGYGSSIGSWFKKQYPKQNKTFEVYAIEADQTFHEQYRSKKGVTLLPYAAWVRNETLTFEINKDPGQEKEEK
        RNIQNIKYLPSMAEISFKNRYVYVDVGAR YGSSIGSWFKKQYPKQNKTFEVYAIEADQTFHEQYRSKK VTLLPYAAWVRNETL FEINKDPGQE E+K
Subjt:  RNIQNIKYLPSMAEISFKNRYVYVDVGARGYGSSIGSWFKKQYPKQNKTFEVYAIEADQTFHEQYRSKKGVTLLPYAAWVRNETLTFEINKDPGQEKEEK

Query:  GSRGMGRIQPDQSSGHFNGEVNQIQGFDFANWLKNTVSEKDFVVLKMDVEGTEFELIPRLFETGAICLIDEMFLECHYNRWQRCCPGQRSTKYEKTYSQC
        G RGMGRIQP +SSGHFNGEVNQIQGFDFA WLKNTVSEKDFVVLKMDVEGTEFELIPRLFETGAICLIDEMFLECHYNRWQRCCPG+RS KYEKTYS+C
Subjt:  GSRGMGRIQPDQSSGHFNGEVNQIQGFDFANWLKNTVSEKDFVVLKMDVEGTEFELIPRLFETGAICLIDEMFLECHYNRWQRCCPGQRSTKYEKTYSQC

Query:  IELFASLRQSGVLVHQWW
        I+LFASLRQSGVLVHQWW
Subjt:  IELFASLRQSGVLVHQWW

A0A6J1IP93 uncharacterized protein LOC1114782111.2e-28192.86Show/hide
Query:  MGSMSGKFASLRNLLVRVLLFSILVILGRFAYIVTIAGESCSIGDFCFLALPENLNFMNPAGGGGRAGSAALFTDNADPRVIGPPRPDLYSSEDWIKTVQ
        MGS+SGKFASLRNLLVRVLLFSILVILGRFAYIVTIAGESC IGDFCF +LPEN NFM    GGGRAGSAALFTDN D RVIGPPRPDLYSSEDWIKTVQ
Subjt:  MGSMSGKFASLRNLLVRVLLFSILVILGRFAYIVTIAGESCSIGDFCFLALPENLNFMNPAGGGGRAGSAALFTDNADPRVIGPPRPDLYSSEDWIKTVQ

Query:  FYSSVFQDLITEGYLSPNSKSLCIETPAGADVSALKEIGVSDSVGIFKKALKPLVIKGEAHRIPFEDSTFDFVFSGGSRLDISRRAYDFASEIARILKPE
        FYSSVFQDL+TEGYLS NSK+LCIETPAGADVSALKEIGVSDSVGIFKKALKP VIKGEAHRIPFED TFDFVFSGG RLDIS+R YDFASEIARILKPE
Subjt:  FYSSVFQDLITEGYLSPNSKSLCIETPAGADVSALKEIGVSDSVGIFKKALKPLVIKGEAHRIPFEDSTFDFVFSGGSRLDISRRAYDFASEIARILKPE

Query:  GFAVVHVSAKDTYSFNSFVDLFNCCKIVKSKDIDVVLPSMPFIREYVLKKEYRILENGLQKLDGDSYKKNCSVPGFKISLVRNAEPLILEEPLKPWLTLK
        GFAVVHVSAKDTYSFNSFVDLFNCCKI+KSKDID  LPSMPF REYVLKKEYRILEN LQKLDGDSYKKNCSVPG+K+SLVRNAEPLILEEPLKPWLTLK
Subjt:  GFAVVHVSAKDTYSFNSFVDLFNCCKIVKSKDIDVVLPSMPFIREYVLKKEYRILENGLQKLDGDSYKKNCSVPGFKISLVRNAEPLILEEPLKPWLTLK

Query:  RNIQNIKYLPSMAEISFKNRYVYVDVGARGYGSSIGSWFKKQYPKQNKTFEVYAIEADQTFHEQYRSKKGVTLLPYAAWVRNETLTFEINKDPGQEKEEK
        RNIQNIKYLPSMAEISFKN+YVYVDVGAR YGSSIGSWFKKQYPKQNKTFEVYAIEADQTFHEQYRSKK VTLLPYAAWVRNETL FEINKDPGQE E+K
Subjt:  RNIQNIKYLPSMAEISFKNRYVYVDVGARGYGSSIGSWFKKQYPKQNKTFEVYAIEADQTFHEQYRSKKGVTLLPYAAWVRNETLTFEINKDPGQEKEEK

Query:  GSRGMGRIQPDQSSGHFNGEVNQIQGFDFANWLKNTVSEKDFVVLKMDVEGTEFELIPRLFETGAICLIDEMFLECHYNRWQRCCPGQRSTKYEKTYSQC
        GSRGMGRIQP QSSGHFNGEVNQIQGFDFANWLKNTVSEKDFVVLKMDVEGTEFELIPRLFETGAICLIDEMFLECHYNRWQRCCPG+RS KYEKTYS+C
Subjt:  GSRGMGRIQPDQSSGHFNGEVNQIQGFDFANWLKNTVSEKDFVVLKMDVEGTEFELIPRLFETGAICLIDEMFLECHYNRWQRCCPGQRSTKYEKTYSQC

Query:  IELFASLRQSGVLVHQWW
        I+LFASLRQSGVLVHQWW
Subjt:  IELFASLRQSGVLVHQWW

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G24480.1 S-adenosyl-L-methionine-dependent methyltransferases superfamily protein8.0e-2537.2Show/hide
Query:  RVIGPPRPDLYSSEDWIKTVQFYSSVFQDLITEGYLSPNSKSLCIETPAGADVSALKEIGVSDSVGIFKKALKPLVIKGEAHRIPFEDSTFDFVFSGGSR
        + + P    ++ + DW + ++ +S  FQDL  +G LS +SK LC+    G +V ALK +GV+DSVG+      PLV+KG+ H  PF+D TFDF FS  + 
Subjt:  RVIGPPRPDLYSSEDWIKTVQFYSSVFQDLITEGYLSPNSKSLCIETPAGADVSALKEIGVSDSVGIFKKALKPLVIKGEAHRIPFEDSTFDFVFSGGSR

Query:  LDISRRAYDFASEIARILKPEGFAVVHV---------SAKDTYSFNSFVDLFNCCKIVKSKDID
         D +     F  EI R L+P G  V+HV         SA D +S  + V LF   ++V  +++D
Subjt:  LDISRRAYDFASEIARILKPEGFAVVHV---------SAKDTYSFNSFVDLFNCCKIVKSKDID

AT3G53400.1 BEST Arabidopsis thaliana protein match is: conserved peptide upstream open reading frame 47 (TAIR:AT5G03190.1)1.6e-3327.9Show/hide
Query:  SSVFQDLITEGYLSPNSKSLCIETPAGADVSALKEIGVSDSVGIFKKALKPLVIKGEAHRIPFEDSTFDFVFSGGSRLDISRRAY--DFASEIARILKPE
        + V ++L     LS +SK+LCI   + + V A+   G+SD    +   +     +     + +ED++F FVFS    +D+   A       EI RILKP 
Subjt:  SSVFQDLITEGYLSPNSKSLCIETPAGADVSALKEIGVSDSVGIFKKALKPLVIKGEAHRIPFEDSTFDFVFSGGSRLDISRRAY--DFASEIARILKPE

Query:  GFAVVHVSAKDTYSFNSFVDLFNCCKIVKSKDIDVVLPSMPFIREYVLKKEYRILENGLQKLDG-DSYKKNCSVPGF--KISLVRNAEPLILEEPLKPWL
        G   + V        N  V             +  +L +   +    L K+  +    + K DG DS++ + +   F    S V N  P I    L+P L
Subjt:  GFAVVHVSAKDTYSFNSFVDLFNCCKIVKSKDIDVVLPSMPFIREYVLKKEYRILENGLQKLDG-DSYKKNCSVPGF--KISLVRNAEPLILEEPLKPWL

Query:  TLKRN--IQNIKYLPSMAEISFKNRYVYVDVGARGYGSSIGSWFKKQYPKQNKTFEVYAIEADQTFHEQYRSKKGVTLLPYAAWVRNETLTFEINKDPGQ
          KR+   + I YLP   ++S + R VY+D+GA  +  +  +WF   YP   K F  Y +  + +    Y    GVT + +      +T           
Subjt:  TLKRN--IQNIKYLPSMAEISFKNRYVYVDVGARGYGSSIGSWFKKQYPKQNKTFEVYAIEADQTFHEQYRSKKGVTLLPYAAWVRNETLTFEINKDPGQ

Query:  EKEEKGSRGMGRIQPDQSSGHFNGEVNQIQGFDFANWLKNTVSEKDFVVLKMDVEGTEFELIPRLFETGAICLIDEMFLECHYNRWQRCCPGQRSTKYEK
                         ++G       +   FDF  W K T S  DFVVLKM+   TE + +  L +TGAIC +DE+FL C                   
Subjt:  EKEEKGSRGMGRIQPDQSSGHFNGEVNQIQGFDFANWLKNTVSEKDFVVLKMDVEGTEFELIPRLFETGAICLIDEMFLECHYNRWQRCCPGQRSTKYEK

Query:  TYSQCIELFASLRQSGVLVHQWW
         YS C  +  SLR SGV VHQWW
Subjt:  TYSQCIELFASLRQSGVLVHQWW

AT5G01710.1 methyltransferases2.4e-20266.41Show/hide
Query:  MGSMSGKFASLRNLLVRVLLFSILVILGRFAYIVTIAGESCSIGDFCFLALPENLNFMNPAGGGGRAGSAALFTDNADPRVIGPPRPDLYSSEDWIKTVQ
        M +  GK + LRN++VRVLLF +L+I+ RFAY+VTI GESC+ GDFCF +LPENLNF+     G  +GS+A+     D      P  DLY++ DWIK+VQ
Subjt:  MGSMSGKFASLRNLLVRVLLFSILVILGRFAYIVTIAGESCSIGDFCFLALPENLNFMNPAGGGGRAGSAALFTDNADPRVIGPPRPDLYSSEDWIKTVQ

Query:  FYSSVFQDLITEGYLSPNSKSLCIETPAGADVSALKEIGVSDSVGIFKKALKPLVIKGEAHRIPFEDSTFDFVFSGGSRLDISRRAYDFASEIARILKPE
        FYSS+FQDLI +GYLSP SK+LC+ET  G +V +L+EIGV +SVGI KKA +PLV++GE H IPFED+ FDFVFSGG RL  S +  +FA EI R LKPE
Subjt:  FYSSVFQDLITEGYLSPNSKSLCIETPAGADVSALKEIGVSDSVGIFKKALKPLVIKGEAHRIPFEDSTFDFVFSGGSRLDISRRAYDFASEIARILKPE

Query:  GFAVVHVSAKDTYSFNSFVDLFNCCKIVKSKDIDVVLPSMPFIREYVLKKEYRILENGLQKLDGDSYKKNCSVPGFKISLVRNAEPLILEEPLKPWLTLK
        GFAVVHV A DTYSFNSF+DLFN C++VK +DID   PSMP IRE+V++K   I   G     G      C +PG+K  L+R+AEPLI EEPLKPW+TLK
Subjt:  GFAVVHVSAKDTYSFNSFVDLFNCCKIVKSKDIDVVLPSMPFIREYVLKKEYRILENGLQKLDGDSYKKNCSVPGFKISLVRNAEPLILEEPLKPWLTLK

Query:  RNIQNIKYLPSMAEISFKNRYVYVDVGARGYGSSIGSWFKKQYPKQNKTFEVYAIEADQTFHEQYRSKKGVTLLPYAAWVRNETLTFEINKDPGQEKEEK
        RNI+NIKY+PSM +I FK+RYVYVDVGAR YGSSIGSWFKK+YPKQNKTF+V+AIEAD+ FHE+Y+ KK V LLPYAAWVRNETL+FEIN DPG+E E K
Subjt:  RNIQNIKYLPSMAEISFKNRYVYVDVGARGYGSSIGSWFKKQYPKQNKTFEVYAIEADQTFHEQYRSKKGVTLLPYAAWVRNETLTFEINKDPGQEKEEK

Query:  G-SRGMGRIQP--DQSSGHFNGEVNQIQGFDFANWLKNTVSEKDFVVLKMDVEGTEFELIPRLFETGAICLIDEMFLECHYNRWQRCCPGQRSTKYEKTY
           RGMGRIQP     S    GEVN IQGFDFA+WLK +V E+DFVV+KMDVEGTEF+LIPRL +TGAICLIDE+FLECHYNRWQRCCPGQRS KY KTY
Subjt:  G-SRGMGRIQP--DQSSGHFNGEVNQIQGFDFANWLKNTVSEKDFVVLKMDVEGTEFELIPRLFETGAICLIDEMFLECHYNRWQRCCPGQRSTKYEKTY

Query:  SQCIELFASLRQSGVLVHQWW
        +QC+ELF SLRQ GVLVHQWW
Subjt:  SQCIELFASLRQSGVLVHQWW

AT5G03190.1 conserved peptide upstream open reading frame 472.0e-3126.28Show/hide
Query:  IKTVQFYSSVFQDLITE----GYLSPNSKSLCIETPAGADVSALKEIGVSDSVGIFKKALKPLVIKGEAHRIPFE-DSTFDFVFSGGSRLDISRRAYDFA
        I+T ++   V  DL+ E      L  ++K LCI   + + VS  KE+G S   G+ K  L     +   + +    D +FDFV  G   +D         
Subjt:  IKTVQFYSSVFQDLITE----GYLSPNSKSLCIETPAGADVSALKEIGVSDSVGIFKKALKPLVIKGEAHRIPFE-DSTFDFVFSGGSRLDISRRAYDFA

Query:  SEIARILKPEGFAVVHVSAKDTYSFNSFVDLFNCCKIVKSKDIDVVLPSMPFIREYVLKKEYRILENGLQKLDGDSYKKNCSVPGFKISLVRNAEPLILE
         E+ R+LKP G   V VS        S        +IV+  ++D    ++   +  V +  Y I   G  +L  D    + + P          EPL+ +
Subjt:  SEIARILKPEGFAVVHVSAKDTYSFNSFVDLFNCCKIVKSKDIDVVLPSMPFIREYVLKKEYRILENGLQKLDGDSYKKNCSVPGFKISLVRNAEPLILE

Query:  EPLKPWLTLKRNIQNIKYLPSMAEISFKNRYVYVDVGARGY--GSSIGSWFKKQYPKQNKTFEVYAIEADQTFHEQYRSKKGVTLLPYAAWVRNETLTFE
        +P           +++ YLP   ++S K   VY+D+GA  +   +   +WF   YP  +K F VY ++ + +    Y  K GVT + +     N +   +
Subjt:  EPLKPWLTLKRNIQNIKYLPSMAEISFKNRYVYVDVGARGY--GSSIGSWFKKQYPKQNKTFEVYAIEADQTFHEQYRSKKGVTLLPYAAWVRNETLTFE

Query:  INKDPGQEKEEKGSRGMGRIQPDQSSGHFNGEVNQIQGFDFANWLKNTVSEKDFVVLKMDVEGTEFELIPRLFETGAICLIDEMFLECHYNRWQRCCPGQ
        I               + +++P            + + FDF  W + T    DFVVLKM+    E + +  L ETG IC +DE+FL C  ++        
Subjt:  INKDPGQEKEEKGSRGMGRIQPDQSSGHFNGEVNQIQGFDFANWLKNTVSEKDFVVLKMDVEGTEFELIPRLFETGAICLIDEMFLECHYNRWQRCCPGQ

Query:  RSTKYEKTYSQCIELFASLRQSGVLVHQWW
                 S CI +  +LR  GV VHQWW
Subjt:  RSTKYEKTYSQCIELFASLRQSGVLVHQWW

AT5G03190.2 conserved peptide upstream open reading frame 472.0e-3126.28Show/hide
Query:  IKTVQFYSSVFQDLITE----GYLSPNSKSLCIETPAGADVSALKEIGVSDSVGIFKKALKPLVIKGEAHRIPFE-DSTFDFVFSGGSRLDISRRAYDFA
        I+T ++   V  DL+ E      L  ++K LCI   + + VS  KE+G S   G+ K  L     +   + +    D +FDFV  G   +D         
Subjt:  IKTVQFYSSVFQDLITE----GYLSPNSKSLCIETPAGADVSALKEIGVSDSVGIFKKALKPLVIKGEAHRIPFE-DSTFDFVFSGGSRLDISRRAYDFA

Query:  SEIARILKPEGFAVVHVSAKDTYSFNSFVDLFNCCKIVKSKDIDVVLPSMPFIREYVLKKEYRILENGLQKLDGDSYKKNCSVPGFKISLVRNAEPLILE
         E+ R+LKP G   V VS        S        +IV+  ++D    ++   +  V +  Y I   G  +L  D    + + P          EPL+ +
Subjt:  SEIARILKPEGFAVVHVSAKDTYSFNSFVDLFNCCKIVKSKDIDVVLPSMPFIREYVLKKEYRILENGLQKLDGDSYKKNCSVPGFKISLVRNAEPLILE

Query:  EPLKPWLTLKRNIQNIKYLPSMAEISFKNRYVYVDVGARGY--GSSIGSWFKKQYPKQNKTFEVYAIEADQTFHEQYRSKKGVTLLPYAAWVRNETLTFE
        +P           +++ YLP   ++S K   VY+D+GA  +   +   +WF   YP  +K F VY ++ + +    Y  K GVT + +     N +   +
Subjt:  EPLKPWLTLKRNIQNIKYLPSMAEISFKNRYVYVDVGARGY--GSSIGSWFKKQYPKQNKTFEVYAIEADQTFHEQYRSKKGVTLLPYAAWVRNETLTFE

Query:  INKDPGQEKEEKGSRGMGRIQPDQSSGHFNGEVNQIQGFDFANWLKNTVSEKDFVVLKMDVEGTEFELIPRLFETGAICLIDEMFLECHYNRWQRCCPGQ
        I               + +++P            + + FDF  W + T    DFVVLKM+    E + +  L ETG IC +DE+FL C  ++        
Subjt:  INKDPGQEKEEKGSRGMGRIQPDQSSGHFNGEVNQIQGFDFANWLKNTVSEKDFVVLKMDVEGTEFELIPRLFETGAICLIDEMFLECHYNRWQRCCPGQ

Query:  RSTKYEKTYSQCIELFASLRQSGVLVHQWW
                 S CI +  +LR  GV VHQWW
Subjt:  RSTKYEKTYSQCIELFASLRQSGVLVHQWW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGATCCATGTCCGGGAAATTCGCCTCCCTGAGGAACCTTCTGGTTCGAGTGCTTCTGTTCTCTATTCTTGTGATTCTCGGCCGATTTGCCTATATTGTGACCATTGC
CGGAGAATCCTGTAGCATCGGCGACTTTTGTTTTCTCGCTTTGCCGGAAAATCTCAATTTCATGAACCCTGCCGGCGGCGGCGGCCGTGCCGGAAGTGCCGCTTTGTTCA
CGGACAACGCCGACCCTCGCGTCATCGGACCTCCTCGTCCGGATCTGTACTCGAGTGAAGATTGGATCAAGACCGTCCAGTTTTACTCCTCCGTTTTCCAGGATCTGATT
ACCGAGGGCTACTTGTCGCCGAATTCGAAATCCTTGTGCATTGAAACCCCCGCCGGAGCGGACGTGTCGGCGTTAAAGGAGATCGGGGTTTCGGACTCGGTCGGGATTTT
CAAGAAGGCGTTGAAGCCTCTCGTGATTAAGGGCGAGGCTCATCGGATTCCGTTTGAGGACAGTACCTTCGACTTCGTGTTCTCCGGCGGTAGCCGATTGGATATTTCTC
GTCGGGCCTATGATTTTGCTTCAGAGATTGCTCGGATACTGAAACCCGAAGGGTTTGCGGTGGTCCATGTGAGTGCTAAAGATACGTATAGTTTCAATTCATTTGTTGAT
TTATTCAATTGCTGCAAGATAGTTAAATCAAAAGATATCGATGTTGTTCTTCCATCAATGCCTTTTATTCGTGAATATGTATTGAAAAAAGAGTATAGGATTCTTGAAAA
TGGATTACAGAAACTGGATGGGGATTCTTACAAGAAGAATTGTTCTGTTCCTGGGTTTAAGATATCTCTGGTTCGAAATGCTGAGCCTCTGATATTAGAGGAGCCATTGA
AGCCATGGTTAACTTTAAAGAGAAATATACAGAACATTAAGTATTTGCCATCAATGGCTGAAATAAGCTTCAAAAACAGGTATGTCTATGTGGATGTTGGTGCTAGGGGC
TATGGCTCCAGCATTGGGAGTTGGTTCAAAAAGCAATATCCAAAGCAGAACAAAACCTTTGAAGTTTATGCCATTGAAGCTGACCAAACATTTCATGAACAGTATAGATC
AAAGAAAGGAGTGACACTGTTGCCTTATGCTGCTTGGGTGAGGAATGAGACATTGACTTTTGAGATCAATAAAGACCCAGGTCAGGAAAAGGAAGAGAAAGGCAGCAGAG
GGATGGGCAGGATTCAACCCGACCAATCTTCGGGGCATTTCAATGGGGAGGTGAATCAGATTCAGGGGTTTGATTTTGCTAACTGGTTGAAGAACACAGTATCAGAGAAG
GATTTCGTGGTGTTGAAGATGGACGTGGAAGGAACCGAATTTGAATTGATTCCAAGACTCTTTGAGACAGGAGCGATCTGTCTAATAGATGAGATGTTTCTTGAATGCCA
TTACAATAGGTGGCAGAGGTGTTGCCCAGGGCAAAGGAGTACAAAGTATGAGAAAACATATTCTCAATGCATAGAGTTATTTGCTTCTCTCAGACAAAGTGGAGTTCTGG
TTCACCAATGGTGGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGATCCATGTCCGGGAAATTCGCCTCCCTGAGGAACCTTCTGGTTCGAGTGCTTCTGTTCTCTATTCTTGTGATTCTCGGCCGATTTGCCTATATTGTGACCATTGC
CGGAGAATCCTGTAGCATCGGCGACTTTTGTTTTCTCGCTTTGCCGGAAAATCTCAATTTCATGAACCCTGCCGGCGGCGGCGGCCGTGCCGGAAGTGCCGCTTTGTTCA
CGGACAACGCCGACCCTCGCGTCATCGGACCTCCTCGTCCGGATCTGTACTCGAGTGAAGATTGGATCAAGACCGTCCAGTTTTACTCCTCCGTTTTCCAGGATCTGATT
ACCGAGGGCTACTTGTCGCCGAATTCGAAATCCTTGTGCATTGAAACCCCCGCCGGAGCGGACGTGTCGGCGTTAAAGGAGATCGGGGTTTCGGACTCGGTCGGGATTTT
CAAGAAGGCGTTGAAGCCTCTCGTGATTAAGGGCGAGGCTCATCGGATTCCGTTTGAGGACAGTACCTTCGACTTCGTGTTCTCCGGCGGTAGCCGATTGGATATTTCTC
GTCGGGCCTATGATTTTGCTTCAGAGATTGCTCGGATACTGAAACCCGAAGGGTTTGCGGTGGTCCATGTGAGTGCTAAAGATACGTATAGTTTCAATTCATTTGTTGAT
TTATTCAATTGCTGCAAGATAGTTAAATCAAAAGATATCGATGTTGTTCTTCCATCAATGCCTTTTATTCGTGAATATGTATTGAAAAAAGAGTATAGGATTCTTGAAAA
TGGATTACAGAAACTGGATGGGGATTCTTACAAGAAGAATTGTTCTGTTCCTGGGTTTAAGATATCTCTGGTTCGAAATGCTGAGCCTCTGATATTAGAGGAGCCATTGA
AGCCATGGTTAACTTTAAAGAGAAATATACAGAACATTAAGTATTTGCCATCAATGGCTGAAATAAGCTTCAAAAACAGGTATGTCTATGTGGATGTTGGTGCTAGGGGC
TATGGCTCCAGCATTGGGAGTTGGTTCAAAAAGCAATATCCAAAGCAGAACAAAACCTTTGAAGTTTATGCCATTGAAGCTGACCAAACATTTCATGAACAGTATAGATC
AAAGAAAGGAGTGACACTGTTGCCTTATGCTGCTTGGGTGAGGAATGAGACATTGACTTTTGAGATCAATAAAGACCCAGGTCAGGAAAAGGAAGAGAAAGGCAGCAGAG
GGATGGGCAGGATTCAACCCGACCAATCTTCGGGGCATTTCAATGGGGAGGTGAATCAGATTCAGGGGTTTGATTTTGCTAACTGGTTGAAGAACACAGTATCAGAGAAG
GATTTCGTGGTGTTGAAGATGGACGTGGAAGGAACCGAATTTGAATTGATTCCAAGACTCTTTGAGACAGGAGCGATCTGTCTAATAGATGAGATGTTTCTTGAATGCCA
TTACAATAGGTGGCAGAGGTGTTGCCCAGGGCAAAGGAGTACAAAGTATGAGAAAACATATTCTCAATGCATAGAGTTATTTGCTTCTCTCAGACAAAGTGGAGTTCTGG
TTCACCAATGGTGGTAA
Protein sequenceShow/hide protein sequence
MGSMSGKFASLRNLLVRVLLFSILVILGRFAYIVTIAGESCSIGDFCFLALPENLNFMNPAGGGGRAGSAALFTDNADPRVIGPPRPDLYSSEDWIKTVQFYSSVFQDLI
TEGYLSPNSKSLCIETPAGADVSALKEIGVSDSVGIFKKALKPLVIKGEAHRIPFEDSTFDFVFSGGSRLDISRRAYDFASEIARILKPEGFAVVHVSAKDTYSFNSFVD
LFNCCKIVKSKDIDVVLPSMPFIREYVLKKEYRILENGLQKLDGDSYKKNCSVPGFKISLVRNAEPLILEEPLKPWLTLKRNIQNIKYLPSMAEISFKNRYVYVDVGARG
YGSSIGSWFKKQYPKQNKTFEVYAIEADQTFHEQYRSKKGVTLLPYAAWVRNETLTFEINKDPGQEKEEKGSRGMGRIQPDQSSGHFNGEVNQIQGFDFANWLKNTVSEK
DFVVLKMDVEGTEFELIPRLFETGAICLIDEMFLECHYNRWQRCCPGQRSTKYEKTYSQCIELFASLRQSGVLVHQWW