CuGenDBv2

Cmc01g0027621 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc01g0027621
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Fe2OG dioxygenase domain-containing protein
Genome location: CMiso1.1chr01:28811450..28814493
RNA-Seq Expression: Cmc01g0027621
Synteny: Cmc01g0027621
Gene Ontology terms: NA
InterPro domains: IPR027443 - Isopenicillin N synthase-like superfamily
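The genome location above follows a `sequence:start..end` pattern. As a minimal sketch, the string can be split into its parts and the genomic span computed. The names `Location` and `parse_location` are hypothetical, and the 1-based, inclusive coordinate convention is an assumption, since the page does not state it.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumes 1-based, inclusive coordinates (not stated on the page).
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string into a Location."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end = match.groups()
    return Location(seqid, int(start), int(end))


loc = parse_location("CMiso1.1chr01:28811450..28814493")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under that assumption the gene spans 3044 bp on the chromosome, which is longer than the mRNA listed further down; the difference would be accounted for by introns.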


Homology
GenBank top hits | e value | % identity | Alignment
KAA0064033.1 uncharacterized protein E6C27_scaffold99G00310 [Cucumis melo var. makuwa] | 7.2e-250 | 99.33
Query:  MGENPKILDIHELSYSDLLLLSTPYHSPSSLQESERMESITKSILEALGPNGPGLLAIIGVPNSSVLRRALLPLARKLALLNPDHRKRILKDHNVGSDVP
        MGENPKILDIHELSYSDLLLLSTPYHSPSSLQESERMESITKSILEALGPNGPGLLAIIGVPNSSVLRRALLPLARKLALLNPDHRKRILKDHN+GSDVP
Subjt:  MGENPKILDIHELSYSDLLLLSTPYHSPSSLQESERMESITKSILEALGPNGPGLLAIIGVPNSSVLRRALLPLARKLALLNPDHRKRILKDHNVGSDVP

Query:  LRNPERSVSSFAMQLKYTESKEFMQNNQSQIEDKQSSVSEADSFCHSIENKVIDNEFKHLGKSFKELGSCMMELGLRIARICDREIGGQELEESLLESCT
        LRNPERSVSSFAMQLKYTESKEFMQNNQSQIEDKQSSVSEADSFCHSIENKVID+EFKHLG SFKELGSCMMELGLRIARICDREIGGQELEESLLESCT
Subjt:  LRNPERSVSSFAMQLKYTESKEFMQNNQSQIEDKQSSVSEADSFCHSIENKVIDNEFKHLGKSFKELGSCMMELGLRIARICDREIGGQELEESLLESCT

Query:  AKGRLIHYHSALDAQLLSKPANSKGTARNQASSRRNREQSIQSRHDTSNRKGLSQSSTNLWQQWHYDYGIFTVLTTPMFLSPSNTLETGLQDLWCCSERT
        AKGRLIHYHSALDAQLLSKPANSKGTARNQASSRRNREQSIQSRHDTSNRKGLSQSSTNLWQQWHYDYGIFTVLTTPMFLSPSNTLETGLQDLWCCSERT
Subjt:  AKGRLIHYHSALDAQLLSKPANSKGTARNQASSRRNREQSIQSRHDTSNRKGLSQSSTNLWQQWHYDYGIFTVLTTPMFLSPSNTLETGLQDLWCCSERT

Query:  SPSGHLYLQIFDPCKNDVFMVNSPPESFIIQVGESADIISRGKLRSTLHSVSRPSKQEDLCREMFVVFLQPAWNKTFSMSGHLTESSMLPEDRKDLVEEE
        SPSGHLYLQIFDPCKNDVFMVNSPPESFIIQVGESADIISRGKLRSTLHSVSRPSKQEDLCREMFVVFLQPAWNKTFSMSGHLTESSMLPEDRKDLVEEE
Subjt:  SPSGHLYLQIFDPCKNDVFMVNSPPESFIIQVGESADIISRGKLRSTLHSVSRPSKQEDLCREMFVVFLQPAWNKTFSMSGHLTESSMLPEDRKDLVEEE

Query:  GTLITREIQKIVPPLASRLKEGMTFAQFSRETTKQYYGGSGLQSNR
        GTLITREIQKIVPPLASRLKEGMTFAQFSRETTKQYYGGSGLQSNR
Subjt:  GTLITREIQKIVPPLASRLKEGMTFAQFSRETTKQYYGGSGLQSNR

KAG6593098.1 hypothetical protein SDJN03_12574, partial [Cucurbita argyrosperma subsp. sororia] | 3.0e-203 | 83.18
Query:  MGENPKILDIHELSYSDLLLLSTPYHSPSSLQESERMESITKSILEALGPNGPGLLAIIGVPNSSVLRRALLPLARKLALLNPDHRKRILKDHNVGSDVP
        M ENP  L+I+EL YSDL LLST  HS SSLQ +ER+ESITKSI EALGP+GPGLLAI GVPNSSVLRR LLPLARKLALLNPD RKRILKDHN+GSDVP
Subjt:  MGENPKILDIHELSYSDLLLLSTPYHSPSSLQESERMESITKSILEALGPNGPGLLAIIGVPNSSVLRRALLPLARKLALLNPDHRKRILKDHNVGSDVP

Query:  LRNPERSVSSFAMQLKYTESKEFMQNNQSQIEDKQSSVSEADSFCHSIENKVIDNEFKHLGKSFKELGSCMMELGLRIARICDREIGGQELEESLLESCT
        LRNPERSVSSFAMQLKYTESKEFMQNNQSQ  DKQS  SE D FC SIE +V DNEFKHLG SFKELGSCM+ELGLRIARICD +IGGQELE+SLLESCT
Subjt:  LRNPERSVSSFAMQLKYTESKEFMQNNQSQIEDKQSSVSEADSFCHSIENKVIDNEFKHLGKSFKELGSCMMELGLRIARICDREIGGQELEESLLESCT

Query:  AKGRLIHYHSALDAQLLSKPANSKGTARNQASSRRNREQSIQSRHDTSNRKGLSQSSTNLWQQWHYDYGIFTVLTTPMFLSPSNTLETGLQDLWCCSERT
        AKGRLIHYHSALDAQLL K  N KGTARNQA+SRRN+EQSI SR + S+  GL QSSTNLWQQWHYDYGIFTVLTTPMFLSPSNT     QDL C SE  
Subjt:  AKGRLIHYHSALDAQLLSKPANSKGTARNQASSRRNREQSIQSRHDTSNRKGLSQSSTNLWQQWHYDYGIFTVLTTPMFLSPSNTLETGLQDLWCCSERT

Query:  SPSGHLYLQIFDPCKNDVFMVNSPPESFIIQVGESADIISRGKLRSTLHSVSRPSKQEDLCREMFVVFLQPAWNKTFSMSGHLTESSMLPEDRKDLVEEE
        SPSGHLYLQIFDPCKND+FMVN+PPESFIIQVGESADIISRGKLRSTLHSV RPSK EDLCREMFVVFLQPAWNKTFSMS +  ESS L ED++DLVE+E
Subjt:  SPSGHLYLQIFDPCKNDVFMVNSPPESFIIQVGESADIISRGKLRSTLHSVSRPSKQEDLCREMFVVFLQPAWNKTFSMSGHLTESSMLPEDRKDLVEEE

Query:  GTLITREIQKIVPPLASRLKEGMTFAQFSRETTKQYYGGSGLQSNR
         T+IT+EIQKIVPPLASRLKEGMTFA+FSRETTKQYYGG+GLQSNR
Subjt:  GTLITREIQKIVPPLASRLKEGMTFAQFSRETTKQYYGGSGLQSNR

XP_008451313.1 PREDICTED: uncharacterized protein LOC103492644 [Cucumis melo] | 2.2e-251 | 100
Query:  MGENPKILDIHELSYSDLLLLSTPYHSPSSLQESERMESITKSILEALGPNGPGLLAIIGVPNSSVLRRALLPLARKLALLNPDHRKRILKDHNVGSDVP
        MGENPKILDIHELSYSDLLLLSTPYHSPSSLQESERMESITKSILEALGPNGPGLLAIIGVPNSSVLRRALLPLARKLALLNPDHRKRILKDHNVGSDVP
Subjt:  MGENPKILDIHELSYSDLLLLSTPYHSPSSLQESERMESITKSILEALGPNGPGLLAIIGVPNSSVLRRALLPLARKLALLNPDHRKRILKDHNVGSDVP

Query:  LRNPERSVSSFAMQLKYTESKEFMQNNQSQIEDKQSSVSEADSFCHSIENKVIDNEFKHLGKSFKELGSCMMELGLRIARICDREIGGQELEESLLESCT
        LRNPERSVSSFAMQLKYTESKEFMQNNQSQIEDKQSSVSEADSFCHSIENKVIDNEFKHLGKSFKELGSCMMELGLRIARICDREIGGQELEESLLESCT
Subjt:  LRNPERSVSSFAMQLKYTESKEFMQNNQSQIEDKQSSVSEADSFCHSIENKVIDNEFKHLGKSFKELGSCMMELGLRIARICDREIGGQELEESLLESCT

Query:  AKGRLIHYHSALDAQLLSKPANSKGTARNQASSRRNREQSIQSRHDTSNRKGLSQSSTNLWQQWHYDYGIFTVLTTPMFLSPSNTLETGLQDLWCCSERT
        AKGRLIHYHSALDAQLLSKPANSKGTARNQASSRRNREQSIQSRHDTSNRKGLSQSSTNLWQQWHYDYGIFTVLTTPMFLSPSNTLETGLQDLWCCSERT
Subjt:  AKGRLIHYHSALDAQLLSKPANSKGTARNQASSRRNREQSIQSRHDTSNRKGLSQSSTNLWQQWHYDYGIFTVLTTPMFLSPSNTLETGLQDLWCCSERT

Query:  SPSGHLYLQIFDPCKNDVFMVNSPPESFIIQVGESADIISRGKLRSTLHSVSRPSKQEDLCREMFVVFLQPAWNKTFSMSGHLTESSMLPEDRKDLVEEE
        SPSGHLYLQIFDPCKNDVFMVNSPPESFIIQVGESADIISRGKLRSTLHSVSRPSKQEDLCREMFVVFLQPAWNKTFSMSGHLTESSMLPEDRKDLVEEE
Subjt:  SPSGHLYLQIFDPCKNDVFMVNSPPESFIIQVGESADIISRGKLRSTLHSVSRPSKQEDLCREMFVVFLQPAWNKTFSMSGHLTESSMLPEDRKDLVEEE

Query:  GTLITREIQKIVPPLASRLKEGMTFAQFSRETTKQYYGGSGLQSNR
        GTLITREIQKIVPPLASRLKEGMTFAQFSRETTKQYYGGSGLQSNR
Subjt:  GTLITREIQKIVPPLASRLKEGMTFAQFSRETTKQYYGGSGLQSNR

XP_011659287.1 uncharacterized protein LOC101222496 [Cucumis sativus] | 5.9e-236 | 93.72
Query:  MGENPKILDIHELSYSDLLLLSTPYHSPSSLQESERMESITKSILEALGPNGPGLLAIIGVPNSSVLRRALLPLARKLALLNPDHRKRILKDHNVGSDVP
        MG NPK+L+I+EL YSDLLLLS  YHS SSLQE++R+ESITKSILEALGPNGPGLLAI GVPNSSVLRRALLPLARKLALLNPDHRK+ILKDHN+GSDVP
Subjt:  MGENPKILDIHELSYSDLLLLSTPYHSPSSLQESERMESITKSILEALGPNGPGLLAIIGVPNSSVLRRALLPLARKLALLNPDHRKRILKDHNVGSDVP

Query:  LRNPERSVSSFAMQLKYTESKEFMQNNQSQIEDKQSSVSEADSFCHSIENKVIDNEFKHLGKSFKELGSCMMELGLRIARICDREIGGQELEESLLESCT
        LRNPERSVSSFAMQLKYTESKEFMQNNQSQIEDKQSS SE DSFCHSIENK+ DNEF+HLG SFKELGSCMMELGLRIARICDREIGG+ELEESLLESCT
Subjt:  LRNPERSVSSFAMQLKYTESKEFMQNNQSQIEDKQSSVSEADSFCHSIENKVIDNEFKHLGKSFKELGSCMMELGLRIARICDREIGGQELEESLLESCT

Query:  AKGRLIHYHSALDAQLLSKPANSKGTARNQASSRRNREQSIQSRHDTSNRKGLSQSSTNLWQQWHYDYGIFTVLTTPMFLSPSNTLETGLQDLWCCSERT
        AKGRLIHYHSALDAQLL KPANSKGTARNQASSRRNREQSIQSRHD S+RKGL QSSTNLWQQWHYDYGIFTVLTTPMFLSPSNTLE+GLQDLWCCSERT
Subjt:  AKGRLIHYHSALDAQLLSKPANSKGTARNQASSRRNREQSIQSRHDTSNRKGLSQSSTNLWQQWHYDYGIFTVLTTPMFLSPSNTLETGLQDLWCCSERT

Query:  SPSGHLYLQIFDPCKNDVFMVNSPPESFIIQVGESADIISRGKLRSTLHSVSRPSKQEDLCREMFVVFLQPAWNKTFSMSGHLTESSMLPEDRKDLVEEE
        SPSGHLYLQIFDPCKNDVFMVNSPPESFIIQVGESADIISRGKLRSTLHSVSRPSKQEDLCREMFVVFLQPAWNKTFSMSGHLTESSMLPEDRKDLVEEE
Subjt:  SPSGHLYLQIFDPCKNDVFMVNSPPESFIIQVGESADIISRGKLRSTLHSVSRPSKQEDLCREMFVVFLQPAWNKTFSMSGHLTESSMLPEDRKDLVEEE

Query:  GTLITREIQKIVPPLASRLKEGMTFAQFSRETTKQYYGGSGLQSNR
        GTLITREIQKIVPPL SRLKEGMTFA+FSRETTKQYYGGSGLQSNR
Subjt:  GTLITREIQKIVPPLASRLKEGMTFAQFSRETTKQYYGGSGLQSNR

XP_038897661.1 uncharacterized protein LOC120085635 [Benincasa hispida] | 1.0e-216 | 87.67
Query:  MGENPKILDIHELSYSDLLLLSTPYHSPSSLQESERMESITKSILEALGPNGPGLLAIIGVPNSSVLRRALLPLARKLALLNPDHRKRILKDHNVGSDVP
        M EN KIL I+EL YSDLLLLS+PYHS SSLQE ER+ESITKSILEALGPNGPGLLA+ GVPNSSVLRRALLPLARKLALLNPDHRKRILKDHN+GSDVP
Subjt:  MGENPKILDIHELSYSDLLLLSTPYHSPSSLQESERMESITKSILEALGPNGPGLLAIIGVPNSSVLRRALLPLARKLALLNPDHRKRILKDHNVGSDVP

Query:  LRNPERSVSSFAMQLKYTESKEFMQNNQSQIEDKQSSVSEADSFCHSIENKVIDNEFKHLGKSFKELGSCMMELGLRIARICDREIGGQELEESLLESCT
        LRNPERSVSSFAMQLKYTESK+FMQNNQSQ  D+QS  S+ D FC SIE +  DNEFKHLG SFKELGSCMMELGLRIARICD+EIGG+ELE+SLLESCT
Subjt:  LRNPERSVSSFAMQLKYTESKEFMQNNQSQIEDKQSSVSEADSFCHSIENKVIDNEFKHLGKSFKELGSCMMELGLRIARICDREIGGQELEESLLESCT

Query:  AKGRLIHYHSALDAQLLSKPANSKGTARNQASSRRNREQSIQSRHDTSNRKGLSQSSTNLWQQWHYDYGIFTVLTTPMFLSPSNTLETGLQDLWCCSERT
        AKGRLIHYHSALDAQLL KPANSKGTARNQASSRRNREQ I+SRH+TS+  GL QS+TNLWQQWHYDYGIFTVLTTPMFL PSNTLETG QDL C SE T
Subjt:  AKGRLIHYHSALDAQLLSKPANSKGTARNQASSRRNREQSIQSRHDTSNRKGLSQSSTNLWQQWHYDYGIFTVLTTPMFLSPSNTLETGLQDLWCCSERT

Query:  SPSGHLYLQIFDPCKNDVFMVNSPPESFIIQVGESADIISRGKLRSTLHSVSRPSKQEDLCREMFVVFLQPAWNKTFSMSGHLTESSMLPEDRKDLVEEE
        SPSGHLYLQIFDPCKNDVFMVNSPPESFIIQVGESADIIS+GKLRSTLHSV RPSKQEDLCREM+VVFLQPAWNKTFSMSGH TESSML EDRK LVE+E
Subjt:  SPSGHLYLQIFDPCKNDVFMVNSPPESFIIQVGESADIISRGKLRSTLHSVSRPSKQEDLCREMFVVFLQPAWNKTFSMSGHLTESSMLPEDRKDLVEEE

Query:  GTLITREIQKIVPPLASRLKEGMTFAQFSRETTKQYYGGSGLQSNR
          +ITREIQKIVPPLASRLKEGMTFA+FSRETTKQYYGGSGLQSNR
Subjt:  GTLITREIQKIVPPLASRLKEGMTFAQFSRETTKQYYGGSGLQSNR

TrEMBL top hits | e value | % identity | Alignment
A0A0A0K8T7 Uncharacterized protein | 2.9e-236 | 93.72
Query:  MGENPKILDIHELSYSDLLLLSTPYHSPSSLQESERMESITKSILEALGPNGPGLLAIIGVPNSSVLRRALLPLARKLALLNPDHRKRILKDHNVGSDVP
        MG NPK+L+I+EL YSDLLLLS  YHS SSLQE++R+ESITKSILEALGPNGPGLLAI GVPNSSVLRRALLPLARKLALLNPDHRK+ILKDHN+GSDVP
Subjt:  MGENPKILDIHELSYSDLLLLSTPYHSPSSLQESERMESITKSILEALGPNGPGLLAIIGVPNSSVLRRALLPLARKLALLNPDHRKRILKDHNVGSDVP

Query:  LRNPERSVSSFAMQLKYTESKEFMQNNQSQIEDKQSSVSEADSFCHSIENKVIDNEFKHLGKSFKELGSCMMELGLRIARICDREIGGQELEESLLESCT
        LRNPERSVSSFAMQLKYTESKEFMQNNQSQIEDKQSS SE DSFCHSIENK+ DNEF+HLG SFKELGSCMMELGLRIARICDREIGG+ELEESLLESCT
Subjt:  LRNPERSVSSFAMQLKYTESKEFMQNNQSQIEDKQSSVSEADSFCHSIENKVIDNEFKHLGKSFKELGSCMMELGLRIARICDREIGGQELEESLLESCT

Query:  AKGRLIHYHSALDAQLLSKPANSKGTARNQASSRRNREQSIQSRHDTSNRKGLSQSSTNLWQQWHYDYGIFTVLTTPMFLSPSNTLETGLQDLWCCSERT
        AKGRLIHYHSALDAQLL KPANSKGTARNQASSRRNREQSIQSRHD S+RKGL QSSTNLWQQWHYDYGIFTVLTTPMFLSPSNTLE+GLQDLWCCSERT
Subjt:  AKGRLIHYHSALDAQLLSKPANSKGTARNQASSRRNREQSIQSRHDTSNRKGLSQSSTNLWQQWHYDYGIFTVLTTPMFLSPSNTLETGLQDLWCCSERT

Query:  SPSGHLYLQIFDPCKNDVFMVNSPPESFIIQVGESADIISRGKLRSTLHSVSRPSKQEDLCREMFVVFLQPAWNKTFSMSGHLTESSMLPEDRKDLVEEE
        SPSGHLYLQIFDPCKNDVFMVNSPPESFIIQVGESADIISRGKLRSTLHSVSRPSKQEDLCREMFVVFLQPAWNKTFSMSGHLTESSMLPEDRKDLVEEE
Subjt:  SPSGHLYLQIFDPCKNDVFMVNSPPESFIIQVGESADIISRGKLRSTLHSVSRPSKQEDLCREMFVVFLQPAWNKTFSMSGHLTESSMLPEDRKDLVEEE

Query:  GTLITREIQKIVPPLASRLKEGMTFAQFSRETTKQYYGGSGLQSNR
        GTLITREIQKIVPPL SRLKEGMTFA+FSRETTKQYYGGSGLQSNR
Subjt:  GTLITREIQKIVPPLASRLKEGMTFAQFSRETTKQYYGGSGLQSNR

A0A1S3BQK9 uncharacterized protein LOC103492644 | 1.1e-251 | 100
Query:  MGENPKILDIHELSYSDLLLLSTPYHSPSSLQESERMESITKSILEALGPNGPGLLAIIGVPNSSVLRRALLPLARKLALLNPDHRKRILKDHNVGSDVP
        MGENPKILDIHELSYSDLLLLSTPYHSPSSLQESERMESITKSILEALGPNGPGLLAIIGVPNSSVLRRALLPLARKLALLNPDHRKRILKDHNVGSDVP
Subjt:  MGENPKILDIHELSYSDLLLLSTPYHSPSSLQESERMESITKSILEALGPNGPGLLAIIGVPNSSVLRRALLPLARKLALLNPDHRKRILKDHNVGSDVP

Query:  LRNPERSVSSFAMQLKYTESKEFMQNNQSQIEDKQSSVSEADSFCHSIENKVIDNEFKHLGKSFKELGSCMMELGLRIARICDREIGGQELEESLLESCT
        LRNPERSVSSFAMQLKYTESKEFMQNNQSQIEDKQSSVSEADSFCHSIENKVIDNEFKHLGKSFKELGSCMMELGLRIARICDREIGGQELEESLLESCT
Subjt:  LRNPERSVSSFAMQLKYTESKEFMQNNQSQIEDKQSSVSEADSFCHSIENKVIDNEFKHLGKSFKELGSCMMELGLRIARICDREIGGQELEESLLESCT

Query:  AKGRLIHYHSALDAQLLSKPANSKGTARNQASSRRNREQSIQSRHDTSNRKGLSQSSTNLWQQWHYDYGIFTVLTTPMFLSPSNTLETGLQDLWCCSERT
        AKGRLIHYHSALDAQLLSKPANSKGTARNQASSRRNREQSIQSRHDTSNRKGLSQSSTNLWQQWHYDYGIFTVLTTPMFLSPSNTLETGLQDLWCCSERT
Subjt:  AKGRLIHYHSALDAQLLSKPANSKGTARNQASSRRNREQSIQSRHDTSNRKGLSQSSTNLWQQWHYDYGIFTVLTTPMFLSPSNTLETGLQDLWCCSERT

Query:  SPSGHLYLQIFDPCKNDVFMVNSPPESFIIQVGESADIISRGKLRSTLHSVSRPSKQEDLCREMFVVFLQPAWNKTFSMSGHLTESSMLPEDRKDLVEEE
        SPSGHLYLQIFDPCKNDVFMVNSPPESFIIQVGESADIISRGKLRSTLHSVSRPSKQEDLCREMFVVFLQPAWNKTFSMSGHLTESSMLPEDRKDLVEEE
Subjt:  SPSGHLYLQIFDPCKNDVFMVNSPPESFIIQVGESADIISRGKLRSTLHSVSRPSKQEDLCREMFVVFLQPAWNKTFSMSGHLTESSMLPEDRKDLVEEE

Query:  GTLITREIQKIVPPLASRLKEGMTFAQFSRETTKQYYGGSGLQSNR
        GTLITREIQKIVPPLASRLKEGMTFAQFSRETTKQYYGGSGLQSNR
Subjt:  GTLITREIQKIVPPLASRLKEGMTFAQFSRETTKQYYGGSGLQSNR

A0A5A7V7E3 Uncharacterized protein | 3.5e-250 | 99.33
Query:  MGENPKILDIHELSYSDLLLLSTPYHSPSSLQESERMESITKSILEALGPNGPGLLAIIGVPNSSVLRRALLPLARKLALLNPDHRKRILKDHNVGSDVP
        MGENPKILDIHELSYSDLLLLSTPYHSPSSLQESERMESITKSILEALGPNGPGLLAIIGVPNSSVLRRALLPLARKLALLNPDHRKRILKDHN+GSDVP
Subjt:  MGENPKILDIHELSYSDLLLLSTPYHSPSSLQESERMESITKSILEALGPNGPGLLAIIGVPNSSVLRRALLPLARKLALLNPDHRKRILKDHNVGSDVP

Query:  LRNPERSVSSFAMQLKYTESKEFMQNNQSQIEDKQSSVSEADSFCHSIENKVIDNEFKHLGKSFKELGSCMMELGLRIARICDREIGGQELEESLLESCT
        LRNPERSVSSFAMQLKYTESKEFMQNNQSQIEDKQSSVSEADSFCHSIENKVID+EFKHLG SFKELGSCMMELGLRIARICDREIGGQELEESLLESCT
Subjt:  LRNPERSVSSFAMQLKYTESKEFMQNNQSQIEDKQSSVSEADSFCHSIENKVIDNEFKHLGKSFKELGSCMMELGLRIARICDREIGGQELEESLLESCT

Query:  AKGRLIHYHSALDAQLLSKPANSKGTARNQASSRRNREQSIQSRHDTSNRKGLSQSSTNLWQQWHYDYGIFTVLTTPMFLSPSNTLETGLQDLWCCSERT
        AKGRLIHYHSALDAQLLSKPANSKGTARNQASSRRNREQSIQSRHDTSNRKGLSQSSTNLWQQWHYDYGIFTVLTTPMFLSPSNTLETGLQDLWCCSERT
Subjt:  AKGRLIHYHSALDAQLLSKPANSKGTARNQASSRRNREQSIQSRHDTSNRKGLSQSSTNLWQQWHYDYGIFTVLTTPMFLSPSNTLETGLQDLWCCSERT

Query:  SPSGHLYLQIFDPCKNDVFMVNSPPESFIIQVGESADIISRGKLRSTLHSVSRPSKQEDLCREMFVVFLQPAWNKTFSMSGHLTESSMLPEDRKDLVEEE
        SPSGHLYLQIFDPCKNDVFMVNSPPESFIIQVGESADIISRGKLRSTLHSVSRPSKQEDLCREMFVVFLQPAWNKTFSMSGHLTESSMLPEDRKDLVEEE
Subjt:  SPSGHLYLQIFDPCKNDVFMVNSPPESFIIQVGESADIISRGKLRSTLHSVSRPSKQEDLCREMFVVFLQPAWNKTFSMSGHLTESSMLPEDRKDLVEEE

Query:  GTLITREIQKIVPPLASRLKEGMTFAQFSRETTKQYYGGSGLQSNR
        GTLITREIQKIVPPLASRLKEGMTFAQFSRETTKQYYGGSGLQSNR
Subjt:  GTLITREIQKIVPPLASRLKEGMTFAQFSRETTKQYYGGSGLQSNR

A0A6J1H636 uncharacterized protein LOC111460876 | 2.1e-202 | 82.74
Query:  MGENPKILDIHELSYSDLLLLSTPYHSPSSLQESERMESITKSILEALGPNGPGLLAIIGVPNSSVLRRALLPLARKLALLNPDHRKRILKDHNVGSDVP
        M ENP  L+I+EL YSDL LLST  HS SSLQ +ER+ESITKSI EALGP+GPGLLAI GVPNSSVLRR LLPLARKLALLNPD RKRILKDHN+GSDVP
Subjt:  MGENPKILDIHELSYSDLLLLSTPYHSPSSLQESERMESITKSILEALGPNGPGLLAIIGVPNSSVLRRALLPLARKLALLNPDHRKRILKDHNVGSDVP

Query:  LRNPERSVSSFAMQLKYTESKEFMQNNQSQIEDKQSSVSEADSFCHSIENKVIDNEFKHLGKSFKELGSCMMELGLRIARICDREIGGQELEESLLESCT
        LRNPERSVSSFAMQLKYTESKEFMQNNQSQ  DKQS  SE D FC SIE +V DNEFKHLG SFKELGSCM+ELGL IARICD +IGGQELE+SLLESCT
Subjt:  LRNPERSVSSFAMQLKYTESKEFMQNNQSQIEDKQSSVSEADSFCHSIENKVIDNEFKHLGKSFKELGSCMMELGLRIARICDREIGGQELEESLLESCT

Query:  AKGRLIHYHSALDAQLLSKPANSKGTARNQASSRRNREQSIQSRHDTSNRKGLSQSSTNLWQQWHYDYGIFTVLTTPMFLSPSNTLETGLQDLWCCSERT
        AKGRLIHYHSALDAQLL K  N KGTARNQA+SRRN+EQSI SR + S+  GL QS TNLWQQWHYDYGIFTVLTTPMFLSPSNT     QDL C SE  
Subjt:  AKGRLIHYHSALDAQLLSKPANSKGTARNQASSRRNREQSIQSRHDTSNRKGLSQSSTNLWQQWHYDYGIFTVLTTPMFLSPSNTLETGLQDLWCCSERT

Query:  SPSGHLYLQIFDPCKNDVFMVNSPPESFIIQVGESADIISRGKLRSTLHSVSRPSKQEDLCREMFVVFLQPAWNKTFSMSGHLTESSMLPEDRKDLVEEE
        SPSGHLYLQIFDPCKND+FMVN+PPESFIIQVGES+DIISRGKLRSTLHSV RPSK EDLCREMFVVFLQPAWNKTFSMS +  ESS L ED++DLVE+E
Subjt:  SPSGHLYLQIFDPCKNDVFMVNSPPESFIIQVGESADIISRGKLRSTLHSVSRPSKQEDLCREMFVVFLQPAWNKTFSMSGHLTESSMLPEDRKDLVEEE

Query:  GTLITREIQKIVPPLASRLKEGMTFAQFSRETTKQYYGGSGLQSNR
         T+ITREIQKIVPPLASRLKEGMTFA+FSRETTKQYYGG+GLQSNR
Subjt:  GTLITREIQKIVPPLASRLKEGMTFAQFSRETTKQYYGGSGLQSNR

A0A6J1KXC5 uncharacterized protein LOC111497992 isoform X1 | 5.1e-201 | 82.06
Query:  MGENPKILDIHELSYSDLLLLSTPYHSPSSLQESERMESITKSILEALGPNGPGLLAIIGVPNSSVLRRALLPLARKLALLNPDHRKRILKDHNVGSDVP
        M ENP  L+I+EL YS+L LLST  HS SSLQ++ER+ESITKSI EALGP+GPGLLAI GVPNSSVLRR LLPLARKLALLNPD RKRILKDH +GSDVP
Subjt:  MGENPKILDIHELSYSDLLLLSTPYHSPSSLQESERMESITKSILEALGPNGPGLLAIIGVPNSSVLRRALLPLARKLALLNPDHRKRILKDHNVGSDVP

Query:  LRNPERSVSSFAMQLKYTESKEFMQNNQSQIEDKQSSVSEADSFCHSIENKVIDNEFKHLGKSFKELGSCMMELGLRIARICDREIGGQELEESLLESCT
        LRNPERSVSSFAMQLK+TESKEFMQN+QSQ  DKQS  SE D FC SIE +V DNEFKHLG SFKELGSCM+ELGLRIARICD +IGGQELE+SLLESCT
Subjt:  LRNPERSVSSFAMQLKYTESKEFMQNNQSQIEDKQSSVSEADSFCHSIENKVIDNEFKHLGKSFKELGSCMMELGLRIARICDREIGGQELEESLLESCT

Query:  AKGRLIHYHSALDAQLLSKPANSKGTARNQASSRRNREQSIQSRHDTSNRKGLSQSSTNLWQQWHYDYGIFTVLTTPMFLSPSNTLETGLQDLWCCSERT
        AKGRLIHYHSALDAQLL K  N KGTARNQA+SRRN+EQSI SR + S+  GL QSSTNLWQQWHYDYGIFTVLTTPMFLSPSNT     QDL C  E  
Subjt:  AKGRLIHYHSALDAQLLSKPANSKGTARNQASSRRNREQSIQSRHDTSNRKGLSQSSTNLWQQWHYDYGIFTVLTTPMFLSPSNTLETGLQDLWCCSERT

Query:  SPSGHLYLQIFDPCKNDVFMVNSPPESFIIQVGESADIISRGKLRSTLHSVSRPSKQEDLCREMFVVFLQPAWNKTFSMSGHLTESSMLPEDRKDLVEEE
        SPSGHLYLQIFDPCKND+FMVN+PPESFIIQVGESADIISRGKLRSTLHSV RPSK EDLCREMFVVFLQPAWNKTFSMS +  ESS L +D++DLVE+E
Subjt:  SPSGHLYLQIFDPCKNDVFMVNSPPESFIIQVGESADIISRGKLRSTLHSVSRPSKQEDLCREMFVVFLQPAWNKTFSMSGHLTESSMLPEDRKDLVEEE

Query:  GTLITREIQKIVPPLASRLKEGMTFAQFSRETTKQYYGGSGLQSNR
         T+ITREIQKIVPPLASRLKEGMTFA+FSRETTKQYYGG+GLQSNR
Subjt:  GTLITREIQKIVPPLASRLKEGMTFAQFSRETTKQYYGGSGLQSNR

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
AT3G63290.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein | 2.6e-112 | 51.25
Query:  KILDIHELSYSDLLLLSTPYHSPSSLQESERMESITKSILEALGPNGPGLLAIIGVPNSSVLRRALLPLARKLALLNPDHRKRILKDHNVGSDVPLRNPE
        +IL+ ++LS+SDLLL S             R + I+K++++ALGP GPGLL I GV  S+ LRR LLP+ARKLALL+PD RK IL +H++GSDVPL+NPE
Subjt:  KILDIHELSYSDLLLLSTPYHSPSSLQESERMESITKSILEALGPNGPGLLAIIGVPNSSVLRRALLPLARKLALLNPDHRKRILKDHNVGSDVPLRNPE

Query:  RSVSSFAMQLKYTESKEFMQNNQSQIEDKQSSVSEADSFCHSIENKVIDNEFKHLGKSFKELGSCMMELGLRIARICDREIGGQELEESLLESCTAKGRL
        R VSSFAMQL Y  +       +   ++  S +   +           D+ F +LG +FKELG CM ELGL IAR+CDREIGG  LEESLL+SCTAKGRL
Subjt:  RSVSSFAMQLKYTESKEFMQNNQSQIEDKQSSVSEADSFCHSIENKVIDNEFKHLGKSFKELGSCMMELGLRIARICDREIGGQELEESLLESCTAKGRL

Query:  IHYHSALDAQLLSKPANSKGTARNQASSRRNREQSIQSRHDTSNRKGLSQSSTNLWQQWHYDYGIFTVLTTPMFLSPSNTLETGLQDLWCCSERTSPSGH
        IHYHSA D   L + +  +  + N+ SS+R  + + +   +  N  GLS S  NLWQQWHYDYGIFTVLT PMFLSP +  E  L            S H
Subjt:  IHYHSALDAQLLSKPANSKGTARNQASSRRNREQSIQSRHDTSNRKGLSQSSTNLWQQWHYDYGIFTVLTTPMFLSPSNTLETGLQDLWCCSERTSPSGH

Query:  LYLQIFDPCKNDVFMVNSPPESFIIQVGESADIISRGKLRSTLHSVSRPSKQEDLCREMFVVFLQPAWNKTFSMSGHLTESSMLPEDRKDLVEEEGTLIT
         YLQI+ P KN  +MV +P +SF++Q+GESADI+S+GKLRSTLH V +P K + + RE FVVFL P W++TFS+S +  E           +  +  +  
Subjt:  LYLQIFDPCKNDVFMVNSPPESFIIQVGESADIISRGKLRSTLHSVSRPSKQEDLCREMFVVFLQPAWNKTFSMSGHLTESSMLPEDRKDLVEEEGTLIT

Query:  REIQKIVPPLASRLKEGMTFAQFSRETTKQYYGGSGLQSNR
         ++Q IVPPL+SRL++GMTFA+FSRETTKQYYGG+GLQSNR
Subjt:  REIQKIVPPLASRLKEGMTFAQFSRETTKQYYGGSGLQSNR

AT3G63290.2 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein | 2.8e-82 | 54.61
Query:  DNEFKHLGKSFKELGSCMMELGLRIARICDREIGGQELEESLLESCTAKGRLIHYHSALDAQLLSKPANSKGTARNQASSRRNREQSIQSRHDTSNRKGL
        D+ F +LG +FKELG CM ELGL IAR+CDREIGG  LEESLL+SCTAKGRLIHYHSA D   L + +  +  + N+ SS+R  + + +   +  N  GL
Subjt:  DNEFKHLGKSFKELGSCMMELGLRIARICDREIGGQELEESLLESCTAKGRLIHYHSALDAQLLSKPANSKGTARNQASSRRNREQSIQSRHDTSNRKGL

Query:  SQSSTNLWQQWHYDYGIFTVLTTPMFLSPSNTLETGLQDLWCCSERTSPSGHLYLQIFDPCKNDVFMVNSPPESFIIQVGESADIISRGKLRSTLHSVSR
        S S  NLWQQWHYDYGIFTVLT PMFLSP +  E  L            S H YLQI+ P KN  +MV +P +SF++Q+GESADI+S+GKLRSTLH V +
Subjt:  SQSSTNLWQQWHYDYGIFTVLTTPMFLSPSNTLETGLQDLWCCSERTSPSGHLYLQIFDPCKNDVFMVNSPPESFIIQVGESADIISRGKLRSTLHSVSR

Query:  PSKQEDLCREMFVVFLQPAWNKTFSMSGHLTESSMLPEDRKDLVEEEGTLITREIQKIVPPLASRLKEGMTFAQFSRETTKQYYGGSGLQSNR
        P K + + RE FVVFL P W++TFS+S +  E           +  +  +   ++Q IVPPL+SRL++GMTFA+FSRETTKQYYGG+GLQSNR
Subjt:  PSKQEDLCREMFVVFLQPAWNKTFSMSGHLTESSMLPEDRKDLVEEEGTLITREIQKIVPPLASRLKEGMTFAQFSRETTKQYYGGSGLQSNR

AT4G13400.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein | 8.7e-20 | 27.89
Query:  SYSDLLLLSTPYHSPSSLQESERMESITKSILEALGPNGPGLLAIIGVPNSSVLRRALLPLARKLALLNPDHRKRILKDHNVGSDVPLRNPERSVSSFAM
        S S +  +ST   S S L+ES     ++  I E  GPNG G+L++  VP  S LR+ LL LA +LA L P+  KR L+D +   +    + +  + S   
Subjt:  SYSDLLLLSTPYHSPSSLQESERMESITKSILEALGPNGPGLLAIIGVPNSSVLRRALLPLARKLALLNPDHRKRILKDHNVGSDVPLRNPERSVSSFAM

Query:  QLKYTESKEFMQNNQSQIEDKQSSVSEADSFCHSIENKVIDNEFKHLGKSFKELGSCMMELGLRIARICDREIG-------GQELEESLLESCTAKGRLI
        +L   +   +    Q         +    S+C S  N    N    L  +FK LG  M E+GL +A  CD+ +         Q LE+ LL S   KGRL+
Subjt:  QLKYTESKEFMQNNQSQIEDKQSSVSEADSFCHSIENKVIDNEFKHLGKSFKELGSCMMELGLRIARICDREIG-------GQELEESLLESCTAKGRLI

Query:  HYHSALDAQLLSKPANSKGTARNQASSRRNREQSIQSRHDTSNRKGLSQSSTNLWQQWHYDYGIFTVLTTPMFLSPSNTLETGLQDLWCCSERTSPSGHL
        +Y  A ++                            S HD          S + W  WH D+G  T LT  +F    +++E    D         P+  L
Subjt:  HYHSALDAQLLSKPANSKGTARNQASSRRNREQSIQSRHDTSNRKGLSQSSTNLWQQWHYDYGIFTVLTTPMFLSPSNTLETGLQDLWCCSERTSPSGHL

Query:  YLQIFDPCKNDVFMVNSPPESFIIQVGESADIISRGKLRSTLHSVSRPSKQE--DLCREMFVVFLQPAWNKTFSMSGHLT
        Y+Q        +  V    +    Q+GE+  I+S G L +T H V  P  +E   L R  F +F+QP W++  +    +T
Subjt:  YLQIFDPCKNDVFMVNSPPESFIIQVGESADIISRGKLRSTLHSVSRPSKQE--DLCREMFVVFLQPAWNKTFSMSGHLT
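In the alignments above, the middle line of each block marks identical residues with the residue letter, conservative substitutions with `+`, and mismatches or gaps with a space. As a rough sketch of how the % identity column relates to those lines, the helper below counts identities and positives in a single midline. The function names are hypothetical, and the example covers only one block of the Cucumis sativus hit, so it does not reproduce the whole-alignment figure reported in the table.

```python
def midline_stats(midline: str) -> tuple[int, int]:
    """Return (identities, positives) for one BLAST-style midline.

    Letters mark identical residues; '+' marks conservative substitutions.
    Positives are counted as identities plus '+' positions.
    """
    identities = sum(ch.isalpha() for ch in midline)
    conservative = midline.count("+")
    return identities, identities + conservative


def percent_identity(midline: str, aligned_length: int) -> float:
    """Percent identity over the aligned columns of this block."""
    identities, _ = midline_stats(midline)
    return 100.0 * identities / aligned_length


# Last block of the XP_011659287.1 (Cucumis sativus) alignment.
query = "GTLITREIQKIVPPLASRLKEGMTFAQFSRETTKQYYGGSGLQSNR"
midline = "GTLITREIQKIVPPL SRLKEGMTFA+FSRETTKQYYGGSGLQSNR"

identities, positives = midline_stats(midline)
print(identities, positives, len(query))
print(round(percent_identity(midline, len(query)), 2))
```

The aligned length is taken from the query line rather than from the midline because leading or trailing spaces in the midline are easily lost when the page is copied as text.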


Sequences
CDS sequence
ATGGGGGAAAATCCCAAAATACTCGATATCCATGAGCTTTCATATTCCGACCTGCTGCTCTTGTCTACTCCTTACCATTCTCCATCTTCACTGCAAGAGAGCGAA
CGGATGGAATCCATAACCAAATCCATACTTGAAGCCCTAGGTCCCAATGGACCTGGCCTTCTTGCAATCATCGGCGTCCCCAATTCTTCTGTTCTCCGGCGAGCA
TTACTTCCTCTCGCTCGCAAGCTCGCCTTGCTCAATCCCGATCATCGCAAACGGATTCTTAAGGATCATAACGTAGGGAGTGATGTTCCCTTAAGGAATCCAGAA
AGAAGCGTCTCCTCCTTTGCAATGCAACTCAAATATACAGAAAGTAAAGAATTCATGCAGAATAACCAAAGCCAAATAGAGGATAAACAATCATCTGTTTCAGAA
GCTGATTCATTTTGTCATTCAATTGAGAACAAAGTCATTGACAACGAGTTTAAACATCTTGGTAAATCATTTAAAGAGTTAGGAAGTTGCATGATGGAATTGGGG
CTTCGCATTGCACGTATATGTGATCGGGAAATCGGTGGTCAAGAGTTAGAAGAGAGCTTGTTGGAGTCATGCACTGCAAAAGGCCGTCTCATACACTACCATTCA
GCTCTGGATGCTCAGCTTTTAAGTAAACCAGCAAACAGCAAGGGAACTGCAAGAAACCAAGCTAGTTCTCGAAGAAATAGAGAGCAAAGCATACAAAGTAGACAT
GATACATCAAATAGAAAGGGACTAAGTCAATCAAGCACAAATCTATGGCAGCAATGGCATTATGACTATGGTATCTTCACTGTTCTAACAACGCCCATGTTTCTT
TCGCCATCAAATACACTTGAAACTGGACTGCAAGATCTATGGTGCTGTAGTGAACGTACTTCTCCCAGTGGACACTTGTATTTGCAAATTTTTGATCCGTGTAAG
AATGACGTTTTCATGGTTAACTCTCCACCAGAAAGCTTTATCATCCAGGTGGGCGAATCGGCTGATATAATATCGCGAGGGAAGCTTCGATCCACTCTTCACTCT
GTGAGCAGACCTTCTAAGCAAGAGGATTTGTGCAGAGAAATGTTTGTTGTATTCTTGCAGCCAGCTTGGAACAAAACGTTTTCCATGTCTGGCCATCTCACTGAA
AGCTCAATGTTACCTGAGGACAGAAAAGATCTTGTTGAAGAGGAGGGAACCTTAATAACTCGAGAAATCCAGAAAATAGTTCCACCATTAGCGTCTAGATTGAAG
GAAGGGATGACATTTGCGCAGTTCTCACGTGAAACCACCAAGCAATATTACGGGGGAAGTGGTTTGCAATCCAATAGATGA
mRNA sequence
TAAATGTTGTCTTCTTCCTCTTGATGTAAATGGAAGCCCCAGATGGAGGATAGAGAAGTGGAAGACTCAGAGAACAAAAATGGGGGAAAATCCCAAAATACTCGA
TATCCATGAGCTTTCATATTCCGACCTGCTGCTCTTGTCTACTCCTTACCATTCTCCATCTTCACTGCAAGAGAGCGAACGGATGGAATCCATAACCAAATCCAT
ACTTGAAGCCCTAGGTCCCAATGGACCTGGCCTTCTTGCAATCATCGGCGTCCCCAATTCTTCTGTTCTCCGGCGAGCATTACTTCCTCTCGCTCGCAAGCTCGC
CTTGCTCAATCCCGATCATCGCAAACGGATTCTTAAGGATCATAACGTAGGGAGTGATGTTCCCTTAAGGAATCCAGAAAGAAGCGTCTCCTCCTTTGCAATGCA
ACTCAAATATACAGAAAGTAAAGAATTCATGCAGAATAACCAAAGCCAAATAGAGGATAAACAATCATCTGTTTCAGAAGCTGATTCATTTTGTCATTCAATTGA
GAACAAAGTCATTGACAACGAGTTTAAACATCTTGGTAAATCATTTAAAGAGTTAGGAAGTTGCATGATGGAATTGGGGCTTCGCATTGCACGTATATGTGATCG
GGAAATCGGTGGTCAAGAGTTAGAAGAGAGCTTGTTGGAGTCATGCACTGCAAAAGGCCGTCTCATACACTACCATTCAGCTCTGGATGCTCAGCTTTTAAGTAA
ACCAGCAAACAGCAAGGGAACTGCAAGAAACCAAGCTAGTTCTCGAAGAAATAGAGAGCAAAGCATACAAAGTAGACATGATACATCAAATAGAAAGGGACTAAG
TCAATCAAGCACAAATCTATGGCAGCAATGGCATTATGACTATGGTATCTTCACTGTTCTAACAACGCCCATGTTTCTTTCGCCATCAAATACACTTGAAACTGG
ACTGCAAGATCTATGGTGCTGTAGTGAACGTACTTCTCCCAGTGGACACTTGTATTTGCAAATTTTTGATCCGTGTAAGAATGACGTTTTCATGGTTAACTCTCC
ACCAGAAAGCTTTATCATCCAGGTGGGCGAATCGGCTGATATAATATCGCGAGGGAAGCTTCGATCCACTCTTCACTCTGTGAGCAGACCTTCTAAGCAAGAGGA
TTTGTGCAGAGAAATGTTTGTTGTATTCTTGCAGCCAGCTTGGAACAAAACGTTTTCCATGTCTGGCCATCTCACTGAAAGCTCAATGTTACCTGAGGACAGAAA
AGATCTTGTTGAAGAGGAGGGAACCTTAATAACTCGAGAAATCCAGAAAATAGTTCCACCATTAGCGTCTAGATTGAAGGAAGGGATGACATTTGCGCAGTTCTC
ACGTGAAACCACCAAGCAATATTACGGGGGAAGTGGTTTGCAATCCAATAGATGATTTGGTTCTTCAGTTGAGAGCTGTAGGTTTTCTCCTCTAAAGTTACTTTT
ATTATTATTATTCTAATCTTACTACCCAAACCACCCTCCACCGGTTAACTTTAGTTGATTGATTAAATTAATGCATAGCTATCCAAAGTTCTACATTTAACTGCA
TATAGTTTAGAATTTAATCTAATATGATAAATCCTGAAATTTGCTTCTAAAAGATGTAAACAAATCCATAGTTTTTCAGTTATACTCTTTTGATCTAACACTTTT
TCTTTATATGAGATGAGAGTTCAAAATCGGAGCAAAACAGAACTGTTAAGAAAATACCAAATGGTCAAATTGGAAGAGAGAAGCGTTTGTAGGAAGAGCTTTGCC
CTGGCAAGTACACATGAGAAAGCCATGGAAAAGATCAAAATGGAAGTGATCTTTACTTTCGATATAAGCATCAAAATAAACAGAACTGACTTGAAGAAAAGAGGA
CAAAGCAGTAGCAGGAGCAGCCCTGAAAATAGAGCATTGATTTGTAGACGCAAAATCCTCCAGCATTAGGGTGTGAAATATAGGCCATGTACATACATCTCTAAA
TCCCTGAAGTACATACAAATTATAGTGTCGTATATTTCTCAGATTCACACAGGGACAGAAAGGGAGTAGGAATATCTTTTTGATCAAGGTTTGTGATTGACTAAT
GACCAGTGTTCATGACTCTGCTACCCTAAAAAAGAGTCGACAGAGAGAGAAAGAGTTTGATCGAGGCCAAACAGCCATAGAACATCATAGAGACTTTACCCAGAG
GAAAGAAAAGCTGCAGCAGCCATGAAAATGGAACCAGAAGTTAGAATTTTTAAACCCTCAGATGGAATTAACGTGGCATTTGTTTTAGAGCTTTGTAAATGGTGT
TTGATGGAAATGTCTTTACTTAACATGAATCATTCCAAACACAAGAAGAATCAACAGAGCTCCAAACAAGAATTGAATGGACCAAGGAAATCTATCTTCTAGCCG
TCCAAAATGTGTACAAACTTATAGTGTATTTAAAAGTAGTTGGAGTAGTGATTTTGAAAAAGAAAAATTAGGTGTAAATTAATTTCTAAGCAATAAAAAAAATCT
TCGCTCTAAATTTTAGTTT
Protein sequence
MGENPKILDIHELSYSDLLLLSTPYHSPSSLQESERMESITKSILEALGPNGPGLLAIIGVPNSSVLRRALLPLARKLALLNPDHRKRILKDHNVGSDVPLRNPE
RSVSSFAMQLKYTESKEFMQNNQSQIEDKQSSVSEADSFCHSIENKVIDNEFKHLGKSFKELGSCMMELGLRIARICDREIGGQELEESLLESCTAKGRLIHYHS
ALDAQLLSKPANSKGTARNQASSRRNREQSIQSRHDTSNRKGLSQSSTNLWQQWHYDYGIFTVLTTPMFLSPSNTLETGLQDLWCCSERTSPSGHLYLQIFDPCK
NDVFMVNSPPESFIIQVGESADIISRGKLRSTLHSVSRPSKQEDLCREMFVVFLQPAWNKTFSMSGHLTESSMLPEDRKDLVEEEGTLITREIQKIVPPLASRLK
EGMTFAQFSRETTKQYYGGSGLQSNR
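The protein sequence above appears to be the conceptual translation of the CDS. A minimal sketch for checking this, assuming the standard genetic code (NCBI translation table 1) and using only the first 30 and last 15 nucleotides of the CDS as test input; the `translate` helper is hypothetical and not part of CuGenDB.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order
# (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = cds.upper().replace("U", "T")
    protein = []
    # Ignore any trailing partial codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGGGGGAAAATCCCAAAATACTCGATATC"  # first 30 nt of the CDS
cds_end = "CAATCCAATAGATGA"                   # last 15 nt, ending in the TGA stop

print(translate(cds_start))  # MGENPKILDI
print(translate(cds_end))    # QSNR
```

Both fragments agree with the start (`MGENPKILDI`) and end (`QSNR`) of the listed protein sequence; running the full CDS through the same function would extend the check to the whole protein.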