CuGenDBv2

MC09g0742 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC09g0742
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: Fe2OG dioxygenase domain-containing protein
Genome location: MC09:7051054..7057731
RNA-Seq Expression: MC09g0742
Synteny: MC09g0742
Gene Ontology terms: NA
InterPro domains:
IPR027443 - Isopenicillin N synthase-like superfamily
IPR044861 - Isopenicillin N synthase-like, Fe(2+) 2OG dioxygenase domain
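The genome location field uses a `chromosome:start..end` layout. The following is a minimal sketch of how such a string could be split and the locus length derived. It assumes the coordinates are 1-based and inclusive, which this page does not state; the function names `parse_location` and `span_length` are hypothetical helpers, not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'CHROM:START..END' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return abs(end - start) + 1


chrom, start, end = parse_location("MC09:7051054..7057731")
locus_bp = span_length(start, end)
print(chrom, start, end, locus_bp)
```

Under that assumption the locus spans 6678 bp, noticeably longer than the CDS listed further down, which would be consistent with introns and untranslated regions being included in the genomic interval.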


Homology
GenBank top hits | e value | %identity | Alignment
KAG6593098.1 hypothetical protein SDJN03_12574, partial [Cucurbita argyrosperma subsp. sororia] | 4.13e-253 | 80.58
Query:  MEEISKILQIYELQYSDLLLLSSPYHSSSSMPHNERIKSINESILQALGPSGPGLLAIVGVPNSSVPRRALLPLARKLALLNPEDRKRILKDHNLGSDVP
        MEE    L+IYEL+YSDL LLS+  HSSSS+  NERI+SI +SI +ALGPSGPGLLAI GVPNSSV RR LLPLARKLALLNP+DRKRILKDHNLGSDVP
Subjt:  MEEISKILQIYELQYSDLLLLSSPYHSSSSMPHNERIKSINESILQALGPSGPGLLAIVGVPNSSVPRRALLPLARKLALLNPEDRKRILKDHNLGSDVP

Query:  LRNPERRVSSFAMQLKYTDSKGFLQNNQSQTQREDKQSPCS-VDHYSDRIGNEFQDHEFKHLGNSFKELGSCMMELGLRIARICDQEIGGQELEQSLLES
        LRNPER VSSFAMQLKYT+SK F+QNNQSQ  R DKQSP S +DH+ D I  E QD+EFKHLG+SFKELGSCM+ELGLRIARICD +IGGQELEQSLLES
Subjt:  LRNPERRVSSFAMQLKYTDSKGFLQNNQSQTQREDKQSPCS-VDHYSDRIGNEFQDHEFKHLGNSFKELGSCMMELGLRIARICDQEIGGQELEQSLLES

Query:  CTAKGRLIHYHSALDARLLRKPANSKGTARDRASSRRNKEQSIHCKREPADSKGLRQSSSNLWQQWHYDYGIFTVLTSPMFLSPSNTLEIEAQDLCCYNE
        CTAKGRLIHYHSALDA+LLRK  N KGTAR++A+SRRNKEQSIH ++EP+DS GL QSS+NLWQQWHYDYGIFTVLT+PMFLSPSNT   EAQDLCCY+E
Subjt:  CTAKGRLIHYHSALDARLLRKPANSKGTARDRASSRRNKEQSIHCKREPADSKGLRQSSSNLWQQWHYDYGIFTVLTSPMFLSPSNTLEIEAQDLCCYNE

Query:  CTSPDRHSYLQIFDPCKNDIFMVSTPPESFIIQVGESADIISQGKLRSTLHSVCRPSKQENLCRETFVVFLQPAWNKTFSLSGYSIESSILSKEREDLVE
        C SP  H YLQIFDPCKNDIFMV+ PPESFIIQVGESADIIS+GKLRSTLHSVCRPSK E+LCRE FVVFLQPAWNKTFS+S YSIESS LS+++ DLVE
Subjt:  CTSPDRHSYLQIFDPCKNDIFMVSTPPESFIIQVGESADIISQGKLRSTLHSVCRPSKQENLCRETFVVFLQPAWNKTFSLSGYSIESSILSKEREDLVE

Query:  KEGSMITREIQKIVPPLVSRLKEGMKFAEFSRETTKQYYGGSGLQSNR
        +E ++IT+EIQKIVPPL SRLKEGM FAEFSRETTKQYYGG+GLQSNR
Subjt:  KEGSMITREIQKIVPPLVSRLKEGMKFAEFSRETTKQYYGGSGLQSNR

XP_022155829.1 uncharacterized protein LOC111022858 [Momordica charantia] | 0.0 | 99.78
Query:  MQEQRSKMEEISKILQIYELQYSDLLLLSSPYHSSSSMPHNERIKSINESILQALGPSGPGLLAIVGVPNSSVPRRALLPLARKLALLNPEDRKRILKDH
        MQEQRSKMEEISKILQIYELQYSDLLLLSSPYHSSSSMPHNERIKSINESILQALGPSGPGLLAIVGVPNSSVPRRALLPLAR LALLNPEDRKRILKDH
Subjt:  MQEQRSKMEEISKILQIYELQYSDLLLLSSPYHSSSSMPHNERIKSINESILQALGPSGPGLLAIVGVPNSSVPRRALLPLARKLALLNPEDRKRILKDH

Query:  NLGSDVPLRNPERRVSSFAMQLKYTDSKGFLQNNQSQTQREDKQSPCSVDHYSDRIGNEFQDHEFKHLGNSFKELGSCMMELGLRIARICDQEIGGQELE
        NLGSDVPLRNPERRVSSFAMQLKYTDSKGFLQNNQSQTQREDKQSPCSVDHYSDRIGNEFQDHEFKHLGNSFKELGSCMMELGLRIARICDQEIGGQELE
Subjt:  NLGSDVPLRNPERRVSSFAMQLKYTDSKGFLQNNQSQTQREDKQSPCSVDHYSDRIGNEFQDHEFKHLGNSFKELGSCMMELGLRIARICDQEIGGQELE

Query:  QSLLESCTAKGRLIHYHSALDARLLRKPANSKGTARDRASSRRNKEQSIHCKREPADSKGLRQSSSNLWQQWHYDYGIFTVLTSPMFLSPSNTLEIEAQD
        QSLLESCTAKGRLIHYHSALDARLLRKPANSKGTARDRASSRRNKEQSIHCKREPADSKGLRQSSSNLWQQWHYDYGIFTVLTSPMFLSPSNTLEIEAQD
Subjt:  QSLLESCTAKGRLIHYHSALDARLLRKPANSKGTARDRASSRRNKEQSIHCKREPADSKGLRQSSSNLWQQWHYDYGIFTVLTSPMFLSPSNTLEIEAQD

Query:  LCCYNECTSPDRHSYLQIFDPCKNDIFMVSTPPESFIIQVGESADIISQGKLRSTLHSVCRPSKQENLCRETFVVFLQPAWNKTFSLSGYSIESSILSKE
        LCCYNECTSPDRHSYLQIFDPCKNDIFMVSTPPESFIIQVGESADIISQGKLRSTLHSVCRPSKQENLCRETFVVFLQPAWNKTFSLSGYSIESSILSKE
Subjt:  LCCYNECTSPDRHSYLQIFDPCKNDIFMVSTPPESFIIQVGESADIISQGKLRSTLHSVCRPSKQENLCRETFVVFLQPAWNKTFSLSGYSIESSILSKE

Query:  REDLVEKEGSMITREIQKIVPPLVSRLKEGMKFAEFSRETTKQYYGGSGLQSNR
        REDLVEKEGSMITREIQKIVPPLVSRLKEGMKFAEFSRETTKQYYGGSGLQSNR
Subjt:  REDLVEKEGSMITREIQKIVPPLVSRLKEGMKFAEFSRETTKQYYGGSGLQSNR

XP_022959987.1 uncharacterized protein LOC111460876 [Cucurbita moschata] | 1.38e-251 | 80.13
Query:  MEEISKILQIYELQYSDLLLLSSPYHSSSSMPHNERIKSINESILQALGPSGPGLLAIVGVPNSSVPRRALLPLARKLALLNPEDRKRILKDHNLGSDVP
        MEE    L+IYEL+YSDL LLS+  HSSSS+  NERI+SI +SI +ALGPSGPGLLAI GVPNSSV RR LLPLARKLALLNP+DRKRILKDHNLGSDVP
Subjt:  MEEISKILQIYELQYSDLLLLSSPYHSSSSMPHNERIKSINESILQALGPSGPGLLAIVGVPNSSVPRRALLPLARKLALLNPEDRKRILKDHNLGSDVP

Query:  LRNPERRVSSFAMQLKYTDSKGFLQNNQSQTQREDKQSPCS-VDHYSDRIGNEFQDHEFKHLGNSFKELGSCMMELGLRIARICDQEIGGQELEQSLLES
        LRNPER VSSFAMQLKYT+SK F+QNNQSQ  R DKQSP S +DH+ D I  E QD+EFKHLG+SFKELGSCM+ELGL IARICD +IGGQELEQSLLES
Subjt:  LRNPERRVSSFAMQLKYTDSKGFLQNNQSQTQREDKQSPCS-VDHYSDRIGNEFQDHEFKHLGNSFKELGSCMMELGLRIARICDQEIGGQELEQSLLES

Query:  CTAKGRLIHYHSALDARLLRKPANSKGTARDRASSRRNKEQSIHCKREPADSKGLRQSSSNLWQQWHYDYGIFTVLTSPMFLSPSNTLEIEAQDLCCYNE
        CTAKGRLIHYHSALDA+LLRK  N KGTAR++A+SRRNKEQSIH ++EP+DS GL QS +NLWQQWHYDYGIFTVLT+PMFLSPSNT   EAQDLCCY+E
Subjt:  CTAKGRLIHYHSALDARLLRKPANSKGTARDRASSRRNKEQSIHCKREPADSKGLRQSSSNLWQQWHYDYGIFTVLTSPMFLSPSNTLEIEAQDLCCYNE

Query:  CTSPDRHSYLQIFDPCKNDIFMVSTPPESFIIQVGESADIISQGKLRSTLHSVCRPSKQENLCRETFVVFLQPAWNKTFSLSGYSIESSILSKEREDLVE
        C SP  H YLQIFDPCKNDIFMV+ PPESFIIQVGES+DIIS+GKLRSTLHSVCRPSK E+LCRE FVVFLQPAWNKTFS+S YSIESS LS+++ DLVE
Subjt:  CTSPDRHSYLQIFDPCKNDIFMVSTPPESFIIQVGESADIISQGKLRSTLHSVCRPSKQENLCRETFVVFLQPAWNKTFSLSGYSIESSILSKEREDLVE

Query:  KEGSMITREIQKIVPPLVSRLKEGMKFAEFSRETTKQYYGGSGLQSNR
        +E ++ITREIQKIVPPL SRLKEGM FAEFSRETTKQYYGG+GLQSNR
Subjt:  KEGSMITREIQKIVPPLVSRLKEGMKFAEFSRETTKQYYGGSGLQSNR

XP_023004799.1 uncharacterized protein LOC111497992 isoform X1 [Cucurbita maxima] | 4.58e-250 | 79.91
Query:  MEEISKILQIYELQYSDLLLLSSPYHSSSSMPHNERIKSINESILQALGPSGPGLLAIVGVPNSSVPRRALLPLARKLALLNPEDRKRILKDHNLGSDVP
        MEE    L+IYEL+YS+L LLS+  HSSSS+  NERI+SI +SI +ALGPSGPGLLAI GVPNSSV RR LLPLARKLALLNP+DRKRILKDH LGSDVP
Subjt:  MEEISKILQIYELQYSDLLLLSSPYHSSSSMPHNERIKSINESILQALGPSGPGLLAIVGVPNSSVPRRALLPLARKLALLNPEDRKRILKDHNLGSDVP

Query:  LRNPERRVSSFAMQLKYTDSKGFLQNNQSQTQREDKQSPCS-VDHYSDRIGNEFQDHEFKHLGNSFKELGSCMMELGLRIARICDQEIGGQELEQSLLES
        LRNPER VSSFAMQLK+T+SK F+QN+QSQ  R DKQSP S +DH+ D I  E QD+EFKHLG+SFKELGSCM+ELGLRIARICD +IGGQELEQSLLES
Subjt:  LRNPERRVSSFAMQLKYTDSKGFLQNNQSQTQREDKQSPCS-VDHYSDRIGNEFQDHEFKHLGNSFKELGSCMMELGLRIARICDQEIGGQELEQSLLES

Query:  CTAKGRLIHYHSALDARLLRKPANSKGTARDRASSRRNKEQSIHCKREPADSKGLRQSSSNLWQQWHYDYGIFTVLTSPMFLSPSNTLEIEAQDLCCYNE
        CTAKGRLIHYHSALDA+LLRK  N KGTAR++A+SRRNKEQSIH ++EP+DS GL QSS+NLWQQWHYDYGIFTVLT+PMFLSPSNT   EAQDLCCY E
Subjt:  CTAKGRLIHYHSALDARLLRKPANSKGTARDRASSRRNKEQSIHCKREPADSKGLRQSSSNLWQQWHYDYGIFTVLTSPMFLSPSNTLEIEAQDLCCYNE

Query:  CTSPDRHSYLQIFDPCKNDIFMVSTPPESFIIQVGESADIISQGKLRSTLHSVCRPSKQENLCRETFVVFLQPAWNKTFSLSGYSIESSILSKEREDLVE
        C SP  H YLQIFDPCKNDIFMV+ PPESFIIQVGESADIIS+GKLRSTLHSVCRPSK E+LCRE FVVFLQPAWNKTFS+S YSIESS LS ++ DLVE
Subjt:  CTSPDRHSYLQIFDPCKNDIFMVSTPPESFIIQVGESADIISQGKLRSTLHSVCRPSKQENLCRETFVVFLQPAWNKTFSLSGYSIESSILSKEREDLVE

Query:  KEGSMITREIQKIVPPLVSRLKEGMKFAEFSRETTKQYYGGSGLQSNR
        +E ++ITREIQKIVPPL SRLKEGM FAEFSRETTKQYYGG+GLQSNR
Subjt:  KEGSMITREIQKIVPPLVSRLKEGMKFAEFSRETTKQYYGGSGLQSNR

XP_038897661.1 uncharacterized protein LOC120085635 [Benincasa hispida] | 3.18e-260 | 81.72
Query:  QEQRSKMEEISKILQIYELQYSDLLLLSSPYHSSSSMPHNERIKSINESILQALGPSGPGLLAIVGVPNSSVPRRALLPLARKLALLNPEDRKRILKDHN
        +EQR+KM E +KILQIYELQYSDLLLLSSPYHSSSS+  +ERI+SI +SIL+ALGP+GPGLLA+ GVPNSSV RRALLPLARKLALLNP+ RKRILKDHN
Subjt:  QEQRSKMEEISKILQIYELQYSDLLLLSSPYHSSSSMPHNERIKSINESILQALGPSGPGLLAIVGVPNSSVPRRALLPLARKLALLNPEDRKRILKDHN

Query:  LGSDVPLRNPERRVSSFAMQLKYTDSKGFLQNNQSQTQREDKQSPCS-VDHYSDRIGNEFQDHEFKHLGNSFKELGSCMMELGLRIARICDQEIGGQELE
        LGSDVPLRNPER VSSFAMQLKYT+SK F+QNNQSQ  R D+QS  S +D + D I  EFQD+EFKHLG+SFKELGSCMMELGLRIARICDQEIGG+ELE
Subjt:  LGSDVPLRNPERRVSSFAMQLKYTDSKGFLQNNQSQTQREDKQSPCS-VDHYSDRIGNEFQDHEFKHLGNSFKELGSCMMELGLRIARICDQEIGGQELE

Query:  QSLLESCTAKGRLIHYHSALDARLLRKPANSKGTARDRASSRRNKEQSIHCKREPADSKGLRQSSSNLWQQWHYDYGIFTVLTSPMFLSPSNTLEIEAQD
        QSLLESCTAKGRLIHYHSALDA+LLRKPANSKGTAR++ASSRRN+EQ I  + E +DS GL QS++NLWQQWHYDYGIFTVLT+PMFL PSNTLE  AQD
Subjt:  QSLLESCTAKGRLIHYHSALDARLLRKPANSKGTARDRASSRRNKEQSIHCKREPADSKGLRQSSSNLWQQWHYDYGIFTVLTSPMFLSPSNTLEIEAQD

Query:  LCCYNECTSPDRHSYLQIFDPCKNDIFMVSTPPESFIIQVGESADIISQGKLRSTLHSVCRPSKQENLCRETFVVFLQPAWNKTFSLSGYSIESSILSKE
        LCCY+ECTSP  H YLQIFDPCKND+FMV++PPESFIIQVGESADIISQGKLRSTLHSVCRPSKQE+LCRE +VVFLQPAWNKTFS+SG+  ESS+LS++
Subjt:  LCCYNECTSPDRHSYLQIFDPCKNDIFMVSTPPESFIIQVGESADIISQGKLRSTLHSVCRPSKQENLCRETFVVFLQPAWNKTFSLSGYSIESSILSKE

Query:  REDLVEKEGSMITREIQKIVPPLVSRLKEGMKFAEFSRETTKQYYGGSGLQSNR
        R+ LVEKE  +ITREIQKIVPPL SRLKEGM FAEFSRETTKQYYGGSGLQSNR
Subjt:  REDLVEKEGSMITREIQKIVPPLVSRLKEGMKFAEFSRETTKQYYGGSGLQSNR

TrEMBL top hits | e value | %identity | Alignment
A0A0A0K8T7 Uncharacterized protein | 1.05e-248 | 80.14
Query:  KILQIYELQYSDLLLLSSPYHSSSSMPHNERIKSINESILQALGPSGPGLLAIVGVPNSSVPRRALLPLARKLALLNPEDRKRILKDHNLGSDVPLRNPE
        K+L+IYEL YSDLLLLS+ YHSSSS+  N+RI+SI +SIL+ALGP+GPGLLAI GVPNSSV RRALLPLARKLALLNP+ RK+ILKDHNLGSDVPLRNPE
Subjt:  KILQIYELQYSDLLLLSSPYHSSSSMPHNERIKSINESILQALGPSGPGLLAIVGVPNSSVPRRALLPLARKLALLNPEDRKRILKDHNLGSDVPLRNPE

Query:  RRVSSFAMQLKYTDSKGFLQNNQSQTQREDKQSPCS-VDHYSDRIGNEFQDHEFKHLGNSFKELGSCMMELGLRIARICDQEIGGQELEQSLLESCTAKG
        R VSSFAMQLKYT+SK F+QNNQSQ   EDKQS  S +D +   I N+ +D+EF+HLGNSFKELGSCMMELGLRIARICD+EIGG+ELE+SLLESCTAKG
Subjt:  RRVSSFAMQLKYTDSKGFLQNNQSQTQREDKQSPCS-VDHYSDRIGNEFQDHEFKHLGNSFKELGSCMMELGLRIARICDQEIGGQELEQSLLESCTAKG

Query:  RLIHYHSALDARLLRKPANSKGTARDRASSRRNKEQSIHCKREPADSKGLRQSSSNLWQQWHYDYGIFTVLTSPMFLSPSNTLEIEAQDLCCYNECTSPD
        RLIHYHSALDA+LLRKPANSKGTAR++ASSRRN+EQSI  + +P+D KGL QSS+NLWQQWHYDYGIFTVLT+PMFLSPSNTLE   QDL C +E TSP 
Subjt:  RLIHYHSALDARLLRKPANSKGTARDRASSRRNKEQSIHCKREPADSKGLRQSSSNLWQQWHYDYGIFTVLTSPMFLSPSNTLEIEAQDLCCYNECTSPD

Query:  RHSYLQIFDPCKNDIFMVSTPPESFIIQVGESADIISQGKLRSTLHSVCRPSKQENLCRETFVVFLQPAWNKTFSLSGYSIESSILSKEREDLVEKEGSM
         H YLQIFDPCKND+FMV++PPESFIIQVGESADIIS+GKLRSTLHSV RPSKQE+LCRE FVVFLQPAWNKTFS+SG+  ESS+L ++R+DLVE+EG++
Subjt:  RHSYLQIFDPCKNDIFMVSTPPESFIIQVGESADIISQGKLRSTLHSVCRPSKQENLCRETFVVFLQPAWNKTFSLSGYSIESSILSKEREDLVEKEGSM

Query:  ITREIQKIVPPLVSRLKEGMKFAEFSRETTKQYYGGSGLQSNR
        ITREIQKIVPPLVSRLKEGM FAEFSRETTKQYYGGSGLQSNR
Subjt:  ITREIQKIVPPLVSRLKEGMKFAEFSRETTKQYYGGSGLQSNR

A0A5A7V7E3 Uncharacterized protein | 5.75e-246 | 79.02
Query:  MEEISKILQIYELQYSDLLLLSSPYHSSSSMPHNERIKSINESILQALGPSGPGLLAIVGVPNSSVPRRALLPLARKLALLNPEDRKRILKDHNLGSDVP
        M E  KIL I+EL YSDLLLLS+PYHS SS+  +ER++SI +SIL+ALGP+GPGLLAI+GVPNSSV RRALLPLARKLALLNP+ RKRILKDHNLGSDVP
Subjt:  MEEISKILQIYELQYSDLLLLSSPYHSSSSMPHNERIKSINESILQALGPSGPGLLAIVGVPNSSVPRRALLPLARKLALLNPEDRKRILKDHNLGSDVP

Query:  LRNPERRVSSFAMQLKYTDSKGFLQNNQSQTQREDKQSPCS-VDHYSDRIGNEFQDHEFKHLGNSFKELGSCMMELGLRIARICDQEIGGQELEQSLLES
        LRNPER VSSFAMQLKYT+SK F+QNNQSQ   EDKQS  S  D +   I N+  D EFKHLGNSFKELGSCMMELGLRIARICD+EIGGQELE+SLLES
Subjt:  LRNPERRVSSFAMQLKYTDSKGFLQNNQSQTQREDKQSPCS-VDHYSDRIGNEFQDHEFKHLGNSFKELGSCMMELGLRIARICDQEIGGQELEQSLLES

Query:  CTAKGRLIHYHSALDARLLRKPANSKGTARDRASSRRNKEQSIHCKREPADSKGLRQSSSNLWQQWHYDYGIFTVLTSPMFLSPSNTLEIEAQDLCCYNE
        CTAKGRLIHYHSALDA+LL KPANSKGTAR++ASSRRN+EQSI  + + ++ KGL QSS+NLWQQWHYDYGIFTVLT+PMFLSPSNTLE   QDL C +E
Subjt:  CTAKGRLIHYHSALDARLLRKPANSKGTARDRASSRRNKEQSIHCKREPADSKGLRQSSSNLWQQWHYDYGIFTVLTSPMFLSPSNTLEIEAQDLCCYNE

Query:  CTSPDRHSYLQIFDPCKNDIFMVSTPPESFIIQVGESADIISQGKLRSTLHSVCRPSKQENLCRETFVVFLQPAWNKTFSLSGYSIESSILSKEREDLVE
         TSP  H YLQIFDPCKND+FMV++PPESFIIQVGESADIIS+GKLRSTLHSV RPSKQE+LCRE FVVFLQPAWNKTFS+SG+  ESS+L ++R+DLVE
Subjt:  CTSPDRHSYLQIFDPCKNDIFMVSTPPESFIIQVGESADIISQGKLRSTLHSVCRPSKQENLCRETFVVFLQPAWNKTFSLSGYSIESSILSKEREDLVE

Query:  KEGSMITREIQKIVPPLVSRLKEGMKFAEFSRETTKQYYGGSGLQSNR
        +EG++ITREIQKIVPPL SRLKEGM FA+FSRETTKQYYGGSGLQSNR
Subjt:  KEGSMITREIQKIVPPLVSRLKEGMKFAEFSRETTKQYYGGSGLQSNR

A0A6J1DRE9 uncharacterized protein LOC111022858 | 0.0 | 99.78
Query:  MQEQRSKMEEISKILQIYELQYSDLLLLSSPYHSSSSMPHNERIKSINESILQALGPSGPGLLAIVGVPNSSVPRRALLPLARKLALLNPEDRKRILKDH
        MQEQRSKMEEISKILQIYELQYSDLLLLSSPYHSSSSMPHNERIKSINESILQALGPSGPGLLAIVGVPNSSVPRRALLPLAR LALLNPEDRKRILKDH
Subjt:  MQEQRSKMEEISKILQIYELQYSDLLLLSSPYHSSSSMPHNERIKSINESILQALGPSGPGLLAIVGVPNSSVPRRALLPLARKLALLNPEDRKRILKDH

Query:  NLGSDVPLRNPERRVSSFAMQLKYTDSKGFLQNNQSQTQREDKQSPCSVDHYSDRIGNEFQDHEFKHLGNSFKELGSCMMELGLRIARICDQEIGGQELE
        NLGSDVPLRNPERRVSSFAMQLKYTDSKGFLQNNQSQTQREDKQSPCSVDHYSDRIGNEFQDHEFKHLGNSFKELGSCMMELGLRIARICDQEIGGQELE
Subjt:  NLGSDVPLRNPERRVSSFAMQLKYTDSKGFLQNNQSQTQREDKQSPCSVDHYSDRIGNEFQDHEFKHLGNSFKELGSCMMELGLRIARICDQEIGGQELE

Query:  QSLLESCTAKGRLIHYHSALDARLLRKPANSKGTARDRASSRRNKEQSIHCKREPADSKGLRQSSSNLWQQWHYDYGIFTVLTSPMFLSPSNTLEIEAQD
        QSLLESCTAKGRLIHYHSALDARLLRKPANSKGTARDRASSRRNKEQSIHCKREPADSKGLRQSSSNLWQQWHYDYGIFTVLTSPMFLSPSNTLEIEAQD
Subjt:  QSLLESCTAKGRLIHYHSALDARLLRKPANSKGTARDRASSRRNKEQSIHCKREPADSKGLRQSSSNLWQQWHYDYGIFTVLTSPMFLSPSNTLEIEAQD

Query:  LCCYNECTSPDRHSYLQIFDPCKNDIFMVSTPPESFIIQVGESADIISQGKLRSTLHSVCRPSKQENLCRETFVVFLQPAWNKTFSLSGYSIESSILSKE
        LCCYNECTSPDRHSYLQIFDPCKNDIFMVSTPPESFIIQVGESADIISQGKLRSTLHSVCRPSKQENLCRETFVVFLQPAWNKTFSLSGYSIESSILSKE
Subjt:  LCCYNECTSPDRHSYLQIFDPCKNDIFMVSTPPESFIIQVGESADIISQGKLRSTLHSVCRPSKQENLCRETFVVFLQPAWNKTFSLSGYSIESSILSKE

Query:  REDLVEKEGSMITREIQKIVPPLVSRLKEGMKFAEFSRETTKQYYGGSGLQSNR
        REDLVEKEGSMITREIQKIVPPLVSRLKEGMKFAEFSRETTKQYYGGSGLQSNR
Subjt:  REDLVEKEGSMITREIQKIVPPLVSRLKEGMKFAEFSRETTKQYYGGSGLQSNR

A0A6J1H636 uncharacterized protein LOC111460876 | 6.66e-252 | 80.13
Query:  MEEISKILQIYELQYSDLLLLSSPYHSSSSMPHNERIKSINESILQALGPSGPGLLAIVGVPNSSVPRRALLPLARKLALLNPEDRKRILKDHNLGSDVP
        MEE    L+IYEL+YSDL LLS+  HSSSS+  NERI+SI +SI +ALGPSGPGLLAI GVPNSSV RR LLPLARKLALLNP+DRKRILKDHNLGSDVP
Subjt:  MEEISKILQIYELQYSDLLLLSSPYHSSSSMPHNERIKSINESILQALGPSGPGLLAIVGVPNSSVPRRALLPLARKLALLNPEDRKRILKDHNLGSDVP

Query:  LRNPERRVSSFAMQLKYTDSKGFLQNNQSQTQREDKQSPCS-VDHYSDRIGNEFQDHEFKHLGNSFKELGSCMMELGLRIARICDQEIGGQELEQSLLES
        LRNPER VSSFAMQLKYT+SK F+QNNQSQ  R DKQSP S +DH+ D I  E QD+EFKHLG+SFKELGSCM+ELGL IARICD +IGGQELEQSLLES
Subjt:  LRNPERRVSSFAMQLKYTDSKGFLQNNQSQTQREDKQSPCS-VDHYSDRIGNEFQDHEFKHLGNSFKELGSCMMELGLRIARICDQEIGGQELEQSLLES

Query:  CTAKGRLIHYHSALDARLLRKPANSKGTARDRASSRRNKEQSIHCKREPADSKGLRQSSSNLWQQWHYDYGIFTVLTSPMFLSPSNTLEIEAQDLCCYNE
        CTAKGRLIHYHSALDA+LLRK  N KGTAR++A+SRRNKEQSIH ++EP+DS GL QS +NLWQQWHYDYGIFTVLT+PMFLSPSNT   EAQDLCCY+E
Subjt:  CTAKGRLIHYHSALDARLLRKPANSKGTARDRASSRRNKEQSIHCKREPADSKGLRQSSSNLWQQWHYDYGIFTVLTSPMFLSPSNTLEIEAQDLCCYNE

Query:  CTSPDRHSYLQIFDPCKNDIFMVSTPPESFIIQVGESADIISQGKLRSTLHSVCRPSKQENLCRETFVVFLQPAWNKTFSLSGYSIESSILSKEREDLVE
        C SP  H YLQIFDPCKNDIFMV+ PPESFIIQVGES+DIIS+GKLRSTLHSVCRPSK E+LCRE FVVFLQPAWNKTFS+S YSIESS LS+++ DLVE
Subjt:  CTSPDRHSYLQIFDPCKNDIFMVSTPPESFIIQVGESADIISQGKLRSTLHSVCRPSKQENLCRETFVVFLQPAWNKTFSLSGYSIESSILSKEREDLVE

Query:  KEGSMITREIQKIVPPLVSRLKEGMKFAEFSRETTKQYYGGSGLQSNR
        +E ++ITREIQKIVPPL SRLKEGM FAEFSRETTKQYYGG+GLQSNR
Subjt:  KEGSMITREIQKIVPPLVSRLKEGMKFAEFSRETTKQYYGGSGLQSNR

A0A6J1KXC5 uncharacterized protein LOC111497992 isoform X1 | 2.22e-250 | 79.91
Query:  MEEISKILQIYELQYSDLLLLSSPYHSSSSMPHNERIKSINESILQALGPSGPGLLAIVGVPNSSVPRRALLPLARKLALLNPEDRKRILKDHNLGSDVP
        MEE    L+IYEL+YS+L LLS+  HSSSS+  NERI+SI +SI +ALGPSGPGLLAI GVPNSSV RR LLPLARKLALLNP+DRKRILKDH LGSDVP
Subjt:  MEEISKILQIYELQYSDLLLLSSPYHSSSSMPHNERIKSINESILQALGPSGPGLLAIVGVPNSSVPRRALLPLARKLALLNPEDRKRILKDHNLGSDVP

Query:  LRNPERRVSSFAMQLKYTDSKGFLQNNQSQTQREDKQSPCS-VDHYSDRIGNEFQDHEFKHLGNSFKELGSCMMELGLRIARICDQEIGGQELEQSLLES
        LRNPER VSSFAMQLK+T+SK F+QN+QSQ  R DKQSP S +DH+ D I  E QD+EFKHLG+SFKELGSCM+ELGLRIARICD +IGGQELEQSLLES
Subjt:  LRNPERRVSSFAMQLKYTDSKGFLQNNQSQTQREDKQSPCS-VDHYSDRIGNEFQDHEFKHLGNSFKELGSCMMELGLRIARICDQEIGGQELEQSLLES

Query:  CTAKGRLIHYHSALDARLLRKPANSKGTARDRASSRRNKEQSIHCKREPADSKGLRQSSSNLWQQWHYDYGIFTVLTSPMFLSPSNTLEIEAQDLCCYNE
        CTAKGRLIHYHSALDA+LLRK  N KGTAR++A+SRRNKEQSIH ++EP+DS GL QSS+NLWQQWHYDYGIFTVLT+PMFLSPSNT   EAQDLCCY E
Subjt:  CTAKGRLIHYHSALDARLLRKPANSKGTARDRASSRRNKEQSIHCKREPADSKGLRQSSSNLWQQWHYDYGIFTVLTSPMFLSPSNTLEIEAQDLCCYNE

Query:  CTSPDRHSYLQIFDPCKNDIFMVSTPPESFIIQVGESADIISQGKLRSTLHSVCRPSKQENLCRETFVVFLQPAWNKTFSLSGYSIESSILSKEREDLVE
        C SP  H YLQIFDPCKNDIFMV+ PPESFIIQVGESADIIS+GKLRSTLHSVCRPSK E+LCRE FVVFLQPAWNKTFS+S YSIESS LS ++ DLVE
Subjt:  CTSPDRHSYLQIFDPCKNDIFMVSTPPESFIIQVGESADIISQGKLRSTLHSVCRPSKQENLCRETFVVFLQPAWNKTFSLSGYSIESSILSKEREDLVE

Query:  KEGSMITREIQKIVPPLVSRLKEGMKFAEFSRETTKQYYGGSGLQSNR
        +E ++ITREIQKIVPPL SRLKEGM FAEFSRETTKQYYGG+GLQSNR
Subjt:  KEGSMITREIQKIVPPLVSRLKEGMKFAEFSRETTKQYYGGSGLQSNR

SwissProt top hits | e value | %identity | Alignment
No hits found

Arabidopsis top hits | e value | %identity | Alignment
AT3G63290.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein | 6.9e-113 | 51.35
Query:  SKILQIYELQYSDLLLLSSPYHSSSSMPHNERIKSINESILQALGPSGPGLLAIVGVPNSSVPRRALLPLARKLALLNPEDRKRILKDHNLGSDVPLRNP
        ++IL+ Y+L +SDLLL S             R + I+++++ ALGP+GPGLL I GV  S+  RR LLP+ARKLALL+P+ RK IL +H+LGSDVPL+NP
Subjt:  SKILQIYELQYSDLLLLSSPYHSSSSMPHNERIKSINESILQALGPSGPGLLAIVGVPNSSVPRRALLPLARKLALLNPEDRKRILKDHNLGSDVPLRNP

Query:  ERRVSSFAMQLKYTDSKGFLQNNQSQTQREDKQSPCSVDHYSDRIG-NEFQDHEFKHLGNSFKELGSCMMELGLRIARICDQEIGGQELEQSLLESCTAK
        ER VSSFAMQL Y            +T  +        D    ++   E  D  F +LG +FKELG CM ELGL IAR+CD+EIGG  LE+SLL+SCTAK
Subjt:  ERRVSSFAMQLKYTDSKGFLQNNQSQTQREDKQSPCSVDHYSDRIG-NEFQDHEFKHLGNSFKELGSCMMELGLRIARICDQEIGGQELEQSLLESCTAK

Query:  GRLIHYHSALDARLLRKPANSKGTARDRASSRRNKEQSIHCKREPADSKGLRQSSSNLWQQWHYDYGIFTVLTSPMFLSPSNTLEIEAQDLCCYNECTSP
        GRLIHYHSA D   LR+ +  +  + +R SS+R  + +   +    +  GL  S  NLWQQWHYDYGIFTVLT PMFLSP +           Y E +  
Subjt:  GRLIHYHSALDARLLRKPANSKGTARDRASSRRNKEQSIHCKREPADSKGLRQSSSNLWQQWHYDYGIFTVLTSPMFLSPSNTLEIEAQDLCCYNECTSP

Query:  DRHSYLQIFDPCKNDIFMVSTPPESFIIQVGESADIISQGKLRSTLHSVCRPSKQENLCRETFVVFLQPAWNKTFSLSGYSIESSILSKEREDLVEKEGS
          HSYLQI+ P KN  +MV TP +SF++Q+GESADI+S+GKLRSTLH VC+P K +++ RETFVVFL P W++TFS+S Y++E        +++V +   
Subjt:  DRHSYLQIFDPCKNDIFMVSTPPESFIIQVGESADIISQGKLRSTLHSVCRPSKQENLCRETFVVFLQPAWNKTFSLSGYSIESSILSKEREDLVEKEGS

Query:  MITREIQKIVPPLVSRLKEGMKFAEFSRETTKQYYGGSGLQSNR
            ++Q IVPPL SRL++GM FAEFSRETTKQYYGG+GLQSNR
Subjt:  MITREIQKIVPPLVSRLKEGMKFAEFSRETTKQYYGGSGLQSNR

AT3G63290.2 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein | 3.9e-84 | 55.07
Query:  EFQDHEFKHLGNSFKELGSCMMELGLRIARICDQEIGGQELEQSLLESCTAKGRLIHYHSALDARLLRKPANSKGTARDRASSRRNKEQSIHCKREPADS
        E  D  F +LG +FKELG CM ELGL IAR+CD+EIGG  LE+SLL+SCTAKGRLIHYHSA D   LR+ +  +  + +R SS+R  + +   +    + 
Subjt:  EFQDHEFKHLGNSFKELGSCMMELGLRIARICDQEIGGQELEQSLLESCTAKGRLIHYHSALDARLLRKPANSKGTARDRASSRRNKEQSIHCKREPADS

Query:  KGLRQSSSNLWQQWHYDYGIFTVLTSPMFLSPSNTLEIEAQDLCCYNECTSPDRHSYLQIFDPCKNDIFMVSTPPESFIIQVGESADIISQGKLRSTLHS
         GL  S  NLWQQWHYDYGIFTVLT PMFLSP +           Y E +    HSYLQI+ P KN  +MV TP +SF++Q+GESADI+S+GKLRSTLH 
Subjt:  KGLRQSSSNLWQQWHYDYGIFTVLTSPMFLSPSNTLEIEAQDLCCYNECTSPDRHSYLQIFDPCKNDIFMVSTPPESFIIQVGESADIISQGKLRSTLHS

Query:  VCRPSKQENLCRETFVVFLQPAWNKTFSLSGYSIESSILSKEREDLVEKEGSMITREIQKIVPPLVSRLKEGMKFAEFSRETTKQYYGGSGLQSNR
        VC+P K +++ RETFVVFL P W++TFS+S Y++E        +++V +       ++Q IVPPL SRL++GM FAEFSRETTKQYYGG+GLQSNR
Subjt:  VCRPSKQENLCRETFVVFLQPAWNKTFSLSGYSIESSILSKEREDLVEKEGSMITREIQKIVPPLVSRLKEGMKFAEFSRETTKQYYGGSGLQSNR

AT4G13400.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein | 1.3e-18 | 27.67
Query:  INESILQALGPSGPGLLAIVGVPNSSVPRRALLPLARKLALLNPEDRKRILKDHNLGSDVPLRNPERRVSSFAMQLKYTDSKGFLQNNQSQTQREDKQSP
        ++  I +  GP+G G+L++  VP  S  R+ LL LA +LA L PE+ KR L+D +   +    + + ++ S  + +     KG    N    Q     + 
Subjt:  INESILQALGPSGPGLLAIVGVPNSSVPRRALLPLARKLALLNPEDRKRILKDHNLGSDVPLRNPERRVSSFAMQLKYTDSKGFLQNNQSQTQREDKQSP

Query:  CSVDHYSDRIG-NEFQDHEFKHLGNSFKELGSCMMELGLRIARICDQEIG-------GQELEQSLLESCTAKGRLIHYHSALDARLLRKPANSKGTARDR
          +  Y    G N +  +    L  +FK LG  M E+GL +A  CDQ +         Q LE+ LL S   KGRL++Y  A                   
Subjt:  CSVDHYSDRIG-NEFQDHEFKHLGNSFKELGSCMMELGLRIARICDQEIG-------GQELEQSLLESCTAKGRLIHYHSALDARLLRKPANSKGTARDR

Query:  ASSRRNKEQSIHCKREPADSKGLRQSSSNLWQQWHYDYGIFTVLTSPMFLSPSNTLEIEAQDLCCYNECTSPDRHSYLQIFDPCKNDIFMVSTPPESFII
              +E S H              S + W  WH D+G  T LT  +F    +++E+          C  P    Y+Q        I  V    +    
Subjt:  ASSRRNKEQSIHCKREPADSKGLRQSSSNLWQQWHYDYGIFTVLTSPMFLSPSNTLEIEAQDLCCYNECTSPDRHSYLQIFDPCKNDIFMVSTPPESFII

Query:  QVGESADIISQGKLRSTLHSVCRPSKQE--NLCRETFVVFLQPAWNK
        Q+GE+  I+S G L +T H V  P  +E   L R TF +F+QP W++
Subjt:  QVGESADIISQGKLRSTLHSVCRPSKQE--NLCRETFVVFLQPAWNK
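In the alignments above, the unlabelled middle row of each block is the match line: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As captured here, the Subjt rows repeat the Query rows, so the match line is the only row that shows where a hit differs. The sketch below counts identities and positives from one match line, using the final block of the first GenBank hit as input. The helper name `midline_stats` is hypothetical, and the figure it yields covers that one segment only, not the whole-hit %identity reported next to each hit.

```python
def midline_stats(midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, aligned_columns) for a BLAST match line.

    Letters are identical residues, '+' is a conservative substitution,
    and spaces are mismatches or gap columns.
    """
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives, len(midline)


# Match line of the last block of the KAG6593098.1 alignment (leading pad removed).
midline = "+E ++IT+EIQKIVPPL SRLKEGM FAEFSRETTKQYYGG+GLQSNR"

identities, positives, columns = midline_stats(midline)
percent_identity = 100.0 * identities / columns
print(f"{identities}/{columns} identical ({percent_identity:.2f}%), {positives} positives")
```

Summing identities and columns across every block of a hit before dividing would give a whole-alignment figure comparable to the %identity column.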


Sequences
CDS sequence
ATGCAAGAGCAGAGATCAAAAATGGAGGAAATTTCAAAAATACTCCAAATCTATGAGCTCCAATATTCCGACCTCTTGCTCTTGTCTTCGCCTTACCATTCTTCATCTTC
AATGCCGCACAACGAACGGATCAAATCGATAAACGAATCGATTCTTCAAGCTCTAGGTCCCAGTGGACCTGGCCTTCTCGCAATCGTCGGCGTCCCTAATTCTTCTGTTC
CCCGCCGAGCATTATTGCCTCTCGCTCGCAAACTCGCGCTGCTCAATCCTGAAGATCGGAAACGGATTCTTAAGGATCATAACTTAGGAAGTGATGTTCCCTTGAGGAAT
CCAGAAAGAAGAGTATCCTCTTTTGCAATGCAACTCAAATATACAGACAGTAAGGGATTCTTGCAGAATAATCAAAGTCAGACTCAGAGAGAGGACAAACAGTCACCATG
TTCAGTTGATCATTACAGTGATCGGATCGGGAACGAATTTCAGGACCATGAATTTAAACATCTTGGCAATTCATTTAAAGAGCTAGGAAGTTGCATGATGGAACTGGGGC
TTCGCATTGCACGTATATGCGACCAGGAAATCGGAGGTCAAGAGTTAGAACAAAGCTTGTTGGAGTCCTGCACTGCAAAAGGCCGTCTCATACACTACCATTCGGCTCTG
GATGCCCGGCTCTTAAGAAAACCAGCAAACAGCAAAGGAACTGCAAGAGACCGAGCTAGTTCTAGAAGAAATAAAGAACAGAGCATACACTGTAAACGAGAGCCAGCAGA
CAGTAAAGGTCTACGTCAATCAAGCTCTAATCTATGGCAGCAATGGCACTATGACTATGGTATCTTCACAGTTCTAACATCTCCCATGTTTCTATCGCCATCAAATACAC
TTGAGATTGAAGCACAAGATCTGTGTTGCTATAACGAGTGTACTTCTCCCGACAGACATTCGTATTTGCAAATTTTTGATCCTTGCAAGAATGATATTTTCATGGTTAGC
ACTCCACCAGAAAGTTTTATCATCCAGGTGGGCGAATCGGCTGATATTATATCGCAAGGGAAGCTTCGTTCCACTCTTCACTCTGTGTGCAGACCTTCCAAGCAAGAGAA
TTTGTGCAGAGAAACATTTGTTGTATTCCTGCAGCCAGCTTGGAACAAAACATTTTCCCTGTCCGGCTATTCCATTGAAAGCTCAATTTTGTCCAAGGAGAGAGAAGATC
TTGTTGAAAAGGAGGGATCGATGATAACTCGAGAAATCCAGAAGATTGTTCCACCATTAGTGTCGAGGTTGAAGGAAGGGATGAAATTTGCAGAGTTCTCACGGGAGACT
ACCAAGCAATATTACGGAGGAAGTGGCTTGCAATCTAACAGATGA
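The CDS above should translate to the protein sequence listed at the end of this record. A minimal sketch of that check follows, assuming the standard nuclear genetic code (NCBI translation table 1). The `translate` helper and the two short excerpts are illustrative only; the excerpts are copied from the first 30 and last 15 bases of the CDS.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line wraps and whitespace
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]  # KeyError on non-ACGT bases
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


cds_start = "ATGCAAGAGCAGAGATCAAAAATGGAGGAA"  # first 10 codons of the CDS
cds_end = "CAATCTAACAGATGA"                  # last 5 codons, including the stop
print(translate(cds_start), translate(cds_end))
```

The two excerpts give `MQEQRSKMEE` and `QSNR`, matching the start and end of the protein sequence. Passing the full CDS, with its line wraps joined, would allow the whole protein to be compared the same way.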
mRNA sequence
GTTCAAATCAGTCAATATCCCTTAGAAATTAGAATAATGAAGTTTTTTAAAAAATGTCGCGCTCGCTCAAATTTAGGTTATTCTGGCAAATACCACATTCATCGTCTTCC
TGACGCTGTTACAAAGCCGTAGAGGGCGGAGTTCAGCGAATGCAAGAGCAGAGATCAAAAATGGAGGAAATTTCAAAAATACTCCAAATCTATGAGCTCCAATATTCCGA
CCTCTTGCTCTTGTCTTCGCCTTACCATTCTTCATCTTCAATGCCGCACAACGAACGGATCAAATCGATAAACGAATCGATTCTTCAAGCTCTAGGTCCCAGTGGACCTG
GCCTTCTCGCAATCGTCGGCGTCCCTAATTCTTCTGTTCCCCGCCGAGCATTATTGCCTCTCGCTCGCAAACTCGCGCTGCTCAATCCTGAAGATCGGAAACGGATTCTT
AAGGATCATAACTTAGGAAGTGATGTTCCCTTGAGGAATCCAGAAAGAAGAGTATCCTCTTTTGCAATGCAACTCAAATATACAGACAGTAAGGGATTCTTGCAGAATAA
TCAAAGTCAGACTCAGAGAGAGGACAAACAGTCACCATGTTCAGTTGATCATTACAGTGATCGGATCGGGAACGAATTTCAGGACCATGAATTTAAACATCTTGGCAATT
CATTTAAAGAGCTAGGAAGTTGCATGATGGAACTGGGGCTTCGCATTGCACGTATATGCGACCAGGAAATCGGAGGTCAAGAGTTAGAACAAAGCTTGTTGGAGTCCTGC
ACTGCAAAAGGCCGTCTCATACACTACCATTCGGCTCTGGATGCCCGGCTCTTAAGAAAACCAGCAAACAGCAAAGGAACTGCAAGAGACCGAGCTAGTTCTAGAAGAAA
TAAAGAACAGAGCATACACTGTAAACGAGAGCCAGCAGACAGTAAAGGTCTACGTCAATCAAGCTCTAATCTATGGCAGCAATGGCACTATGACTATGGTATCTTCACAG
TTCTAACATCTCCCATGTTTCTATCGCCATCAAATACACTTGAGATTGAAGCACAAGATCTGTGTTGCTATAACGAGTGTACTTCTCCCGACAGACATTCGTATTTGCAA
ATTTTTGATCCTTGCAAGAATGATATTTTCATGGTTAGCACTCCACCAGAAAGTTTTATCATCCAGGTGGGCGAATCGGCTGATATTATATCGCAAGGGAAGCTTCGTTC
CACTCTTCACTCTGTGTGCAGACCTTCCAAGCAAGAGAATTTGTGCAGAGAAACATTTGTTGTATTCCTGCAGCCAGCTTGGAACAAAACATTTTCCCTGTCCGGCTATT
CCATTGAAAGCTCAATTTTGTCCAAGGAGAGAGAAGATCTTGTTGAAAAGGAGGGATCGATGATAACTCGAGAAATCCAGAAGATTGTTCCACCATTAGTGTCGAGGTTG
AAGGAAGGGATGAAATTTGCAGAGTTCTCACGGGAGACTACCAAGCAATATTACGGAGGAAGTGGCTTGCAATCTAACAGATGATTCAGTTATTTGAACAATAGGGGGGC
CCAAGGATTGAGGGGCGCTCGCCTGAAGCGAAACGATGGTGAAGAAAAGAAAGCCTTTCCCGTTCATGTCTCTGATACCCTAAAAATGGTCAACAGAGAGAAAAAGAGGT
TGATGACATCAAGATAGATAGCCACAGCAGCCCAAATGTAATCATCATATGAATAAGCAAGGAGGTTCCCTGTGTCGTAAATGATATATCCACAGAAAATGATCGATGCC
AAACAACCGTAGACCATTACAGAGATCCTACCTAATGGGAAGAAGAGCTGCAGCCATGAAAGTGGAACCAGAGGTAAGAATATCAAAAGAAAAACCTCAGATGGAACAAT
ATGGACAGAATTGAGAGATGTTAATGGTGTTTGATGAAATGTCTAAGAGAAGCAGAGAACTACCTGAATCATTCCAAACACTAGAAGAATCATCAGAGCTCCAAACAGGA
ATGGACCAAGGAAGCTGAATTCGATCCCTCTTCTAGCAGCCCAAAATGTGTACACAGTGAGACTGATAACAGCCACGGTGGTCAGAATTACAGATTCCAGAATCACTTTC
CCTGTGAAGCATCGAGGAAATTGGGAAAAATGTCAAAGTATGAGATTAGAGAGACGAACAAATAATGTGAAGGCAGATGCTCGATCTTGGGTCTGCAAATTTCCTAGAGT
CCGATGCCACGTTCATCATCAATGAAGCAACCATGGATCAGAAATTTCATCAGTTCCAGAAGGGAAAAAAAAGACAGGAAATTTACCGCTGGTAAAAGCACAAGTCAATC
CAACAGCAAATGCCAGAGAAACAGTGAAGATTCCGAGAAGAATATAATTCACCGGGTGATACTGATGGTAATAGAACAACGGGCACAGCACTGCCAAATTTTCCGATCAC
ACCCACAACCACATTTCCCGTACATACAAGGCCAGCCCTGTGCTGTCTCGGACAAAGAATGTCGATATTGGACGAACCGAAACGACGGTGGCGGCGATGGCAATGGTGGC
GAGCAACTGAACAGAAATGATGGAGTAAACCTTGCGGATGAAGGCCCACCGGAGATGCGGCGGCTCCAGCATAACCAGGAACCGGGGCCCCGCTCCGGCCTCGGCGTCCA
CCTTACGGTACGGGTGGCTCCACATCATATTTGCTCCGGCGATCGGCGCGTGGTTTGTACGCTGTTTGGCTGCCGAGAAAGTTTCAGAGATGGGCGGGAAAAGTCGGTTG
TGAGATTTCTTCTTCTTCTTTTAATTTATTTTTTTTTGGCGGTGAAATGTGAAGAATAAAGTGACATGTATCGAATGTTGGCGGTGAGTGGAAGGTGGAAATCGGTTGGA
GAATGTTAGGCGAAGCATCCACGTGTGTGACAGGTCGAATGAAATGATAACACGTGGGCTGATTCTTATTACTGTGCCTGCCACGTCAATTGCTTATTAAAAATGATACG
AGATTGTTATAAAAATAAAATAATAGTGTGAAAAAAAATATATGGTATTTTTATAATATTGAAACTTTTTCTGGCATTCGTATTAGAAATTTGCTTTTCAT
Protein sequence
MQEQRSKMEEISKILQIYELQYSDLLLLSSPYHSSSSMPHNERIKSINESILQALGPSGPGLLAIVGVPNSSVPRRALLPLARKLALLNPEDRKRILKDHNLGSDVPLRN
PERRVSSFAMQLKYTDSKGFLQNNQSQTQREDKQSPCSVDHYSDRIGNEFQDHEFKHLGNSFKELGSCMMELGLRIARICDQEIGGQELEQSLLESCTAKGRLIHYHSAL
DARLLRKPANSKGTARDRASSRRNKEQSIHCKREPADSKGLRQSSSNLWQQWHYDYGIFTVLTSPMFLSPSNTLEIEAQDLCCYNECTSPDRHSYLQIFDPCKNDIFMVS
TPPESFIIQVGESADIISQGKLRSTLHSVCRPSKQENLCRETFVVFLQPAWNKTFSLSGYSIESSILSKEREDLVEKEGSMITREIQKIVPPLVSRLKEGMKFAEFSRET
TKQYYGGSGLQSNR