; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr012032 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr012032
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Descriptionprotein SIEVE ELEMENT OCCLUSION B-like
Genome locationtig00153201:40580..45774
RNA-Seq ExpressionSgr012032
SyntenySgr012032
Gene Ontology termsGO:0010088 - phloem development (biological process)
InterPro domainsIPR027443 - Isopenicillin N synthase-like superfamily
IPR027942 - Sieve element occlusion, N-terminal
IPR027944 - Sieve element occlusion, C-terminal
IPR039299 - Protein SIEVE ELEMENT OCCLUSION
IPR044861 - Isopenicillin N synthase-like, Fe(2+) 2OG dioxygenase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022138360.1 protein SIEVE ELEMENT OCCLUSION B-like [Momordica charantia]1.3e-22770.32Show/hide
Query:  KAHQTTMNILDILISYPWEAKALMTLTAFTAEYGDIWHLNHYSHLDPLAKSLAMVKGISSLKKHQDSLRYRQLLSSPNSLIYNCLQAIKYMNQLRNFSKY
        KAHQTT+ ILDILISYPWEAKA++ LTAF  EYGDIWHLNHYSH DPLAK+LA++K  +SLKKH DSL+YRQ+L SP SLIY+CL+AIK+MN++R FSKY
Subjt:  KAHQTTMNILDILISYPWEAKALMTLTAFTAEYGDIWHLNHYSHLDPLAKSLAMVKGISSLKKHQDSLRYRQLLSSPNSLIYNCLQAIKYMNQLRNFSKY

Query:  DIRELSELSSALRQIPLVTYWVIHIIVASRTEISSYLNNTEGHPQKYLNELSEKISSILNILENHLNIVREQQVD-------------------------
        DI+EL+ELSSA+RQIPL+TYW+IHIIVASRTE+SSYLN+TEG PQ+YL EL+EKI+SI+NILEN L+++R QQ +                         
Subjt:  DIRELSELSSALRQIPLVTYWVIHIIVASRTEISSYLNNTEGHPQKYLNELSEKISSILNILENHLNIVREQQVD-------------------------

Query:  -RRKIDAKPFIDGSTLTQVNIEDGLKDKNVILVISGLNISDQDIKALHFVYNEC------------------------------------AIYY--KIAG
           KI +KPF DGSTLT+V++E  L DKNVILVISGL+IS++DIKALH V++E                                      I Y  K+AG
Subjt:  -RRKIDAKPFIDGSTLTQVNIEDGLKDKNVILVISGLNISDQDIKALHFVYNEC------------------------------------AIYY--KIAG

Query:  LRFLEEKWQLREDPLVVVLNSKSKVEFTNAIHLIRVWGTDAIPFTYDKTDNLLRKNWPESTILKFTDHPRLQNWINQEKSILFYGGKDPTWIQQFEEKVI
        LRFLEE+WQLREDPLVVVLNSKSKVEFTNAIHLIRVWGTDAIPFT ++ D LLRKNWPESTI+KFTDHPRL++WINQE+SILFYGGKDP WIQ FEEKV+
Subjt:  LRFLEEKWQLREDPLVVVLNSKSKVEFTNAIHLIRVWGTDAIPFTYDKTDNLLRKNWPESTILKFTDHPRLQNWINQEKSILFYGGKDPTWIQQFEEKVI

Query:  DIKNDPLMSEKGITFETVRIGKNIKGEDDPTLMSRFWITQWGYFVMKSQLKGSSASETTEDILRLISYENENGWVVLAVGSTPLVVGRGNLILAVFEEFN
        DIK+DPLM +KGITFE VRIGKN +GEDDP LMSRFW+TQWGYF++KSQ+KGSSASETTEDILRLISY+NENGW VLAVGS PL+VGRGNL+LAV EEFN
Subjt:  DIKNDPLMSEKGITFETVRIGKNIKGEDDPTLMSRFWITQWGYFVMKSQLKGSSASETTEDILRLISYENENGWVVLAVGSTPLVVGRGNLILAVFEEFN

Query:  KWKKNLSIKGFPDSFKDYFNDVALKTHQCERLTLPGFSGWIPMVVNCPECPRFMET
        KWK NL+IKGFPDSF DYFND+ALKTHQCER+TLPGFSGWIPMVVNCPECPRFMET
Subjt:  KWKKNLSIKGFPDSFKDYFNDVALKTHQCERLTLPGFSGWIPMVVNCPECPRFMET

XP_022138379.1 protein SIEVE ELEMENT OCCLUSION B-like [Momordica charantia]1.2e-20965.35Show/hide
Query:  KAHQTTMNILDILISYPWEAKALMTLTAFTAEYGDIWHLNHYSHLDPLAKSLAMVKGISSLKKHQDSLRYRQLLSSPNSLIYNCLQAIKYMNQLRNFSKY
        KAH+TTM ILDILISYPWEAKA++ L AF  +YGD+WHLN+Y   DPLA++LA++K +  LKKH  + +YRQ+  SP  LI+ CLQAIKYM +++NFSKY
Subjt:  KAHQTTMNILDILISYPWEAKALMTLTAFTAEYGDIWHLNHYSHLDPLAKSLAMVKGISSLKKHQDSLRYRQLLSSPNSLIYNCLQAIKYMNQLRNFSKY

Query:  DIRELSELSSALRQIPLVTYWVIHIIVASRTEISSYLNNTEGHPQKYLNELSEKISSILNILENHLNIVREQQVD-------------------------
        DI+EL+ELSSA+RQIPL+TYW+IHIIVASRTEIS YL  T+G  Q YLNEL+EKI SIL  LENHLNI+REQQ +                         
Subjt:  DIRELSELSSALRQIPLVTYWVIHIIVASRTEISSYLNNTEGHPQKYLNELSEKISSILNILENHLNIVREQQVD-------------------------

Query:  -RRKIDAKPFIDGSTLTQVNIEDGLKDKNVILVISGLNISDQDIKALHFVYNEC-------------------------------------AIYY--KIA
           K++AKPFIDGST  +V++E+ L+ K VILVISGLNIS++DIKALH VYNE                                       + Y  KIA
Subjt:  -RRKIDAKPFIDGSTLTQVNIEDGLKDKNVILVISGLNISDQDIKALHFVYNEC-------------------------------------AIYY--KIA

Query:  GLRFLEEKWQLREDPLVVVLNSKSKVEFTNAIHLIRVWGTDAIPFTYDKTDNLLRKNWPESTILKFTDHPRLQNWINQEKSILFYGGKDPTWIQQFEEKV
        G RFLEE WQLR+DPLVVVLNS+SKVEFTNAIHLIRVWG++AIPFT  K D LLRKNWPESTILKFT HPRL +WINQ+KSI+FYGGKDP WIQQFE+KV
Subjt:  GLRFLEEKWQLREDPLVVVLNSKSKVEFTNAIHLIRVWGTDAIPFTYDKTDNLLRKNWPESTILKFTDHPRLQNWINQEKSILFYGGKDPTWIQQFEEKV

Query:  IDIKNDPLMSEKGITFETVRIGKNIKGEDDPTLMSRFWITQWGYFVMKSQLKGSSASETTEDILRLISYENENGWVVLAVGSTPLVVGRGNLILAVFEEF
        IDIKND L+  KGITFE VRIGKNI GEDDP LMSRFW+TQWG+F++KSQ++GSSASETTEDILRLISYENENGW V+ VGS PL+VGRG+LILAV E+F
Subjt:  IDIKNDPLMSEKGITFETVRIGKNIKGEDDPTLMSRFWITQWGYFVMKSQLKGSSASETTEDILRLISYENENGWVVLAVGSTPLVVGRGNLILAVFEEF

Query:  NKWKKNLSIKGFPDSFKDYFNDVALKTHQCERLTLPGFSGWIPMVVNCPECPRFMET
         KWK+ L++KGF DSFKDYFN++A+ THQC+R+TLPGFSGWIPMVVNCPECPRFMET
Subjt:  NKWKKNLSIKGFPDSFKDYFNDVALKTHQCERLTLPGFSGWIPMVVNCPECPRFMET

XP_022138387.1 protein SIEVE ELEMENT OCCLUSION B-like [Momordica charantia]1.5e-21566.67Show/hide
Query:  KAHQTTMNILDILISYPWEAKALMTLTAFTAEYGDIWHLNHYSHLDPLAKSLAMVKGISSLKKHQDSLRYRQLLSSPNSLIYNCLQAIKYMNQLRNFSKY
        KAHQTT+ ILDIL+SYPWEAKA++TLTAF  EYGDIWHLNHYS LDPLAKSLAM+K +  LKK  DS++YRQ+L SPNSLIY+CL+A+ Y+N+L+NFSKY
Subjt:  KAHQTTMNILDILISYPWEAKALMTLTAFTAEYGDIWHLNHYSHLDPLAKSLAMVKGISSLKKHQDSLRYRQLLSSPNSLIYNCLQAIKYMNQLRNFSKY

Query:  DIRELSELSSALRQIPLVTYWVIHIIVASRTEISSYLNNTEGHPQKYLNELSEKISSILNILENHLNIVREQQ---------VD----------------
        DI+EL+ELSS LRQIPLV+YW+IHIIVASRTEISSYLN TEG  QKYLNEL++KI SILN LENHLNI+  QQ         VD                
Subjt:  DIRELSELSSALRQIPLVTYWVIHIIVASRTEISSYLNNTEGHPQKYLNELSEKISSILNILENHLNIVREQQ---------VD----------------

Query:  -RRKIDAKPFIDGSTLTQVNIEDGLKDKNVILVISGLNISDQDIKALHFVYNEC----------------------------------------AIYY--
           K+DAKPFIDGST +QV+I+D L++KNVILVISGL+ISD DI+ALH VYNE                                          + Y  
Subjt:  -RRKIDAKPFIDGSTLTQVNIEDGLKDKNVILVISGLNISDQDIKALHFVYNEC----------------------------------------AIYY--

Query:  KIAGLRFLEEKWQLREDPLVVVLNSKSKVEFTNAIHLIRVWGTDAIPFTYDKTDNLLRKNWPESTILKFTDHPRLQNWINQEKSILFYGGKDPTWIQQFE
        KIAG R+LEE WQLR+DPLVVVL+S+S++EFTNAIHLIRVWGT+AIPFT  +T+ LL KNWPEST+ KF D PRLQ+W+NQE+SI+FYGGKDP WIQQFE
Subjt:  KIAGLRFLEEKWQLREDPLVVVLNSKSKVEFTNAIHLIRVWGTDAIPFTYDKTDNLLRKNWPESTILKFTDHPRLQNWINQEKSILFYGGKDPTWIQQFE

Query:  EKVIDIKNDPLMSEKGITFETVRIGKNIKGEDDPTLMSRFWITQWGYFVMKSQLKGSSASETTEDILRLISYENENGWVVLAVGSTPLVVGRGNLILAVF
        EKV++IKNDP + EKGITFE VR+GKNIKG++D TL  RFWITQWGYFV+KSQL+GSSA+ETTEDILRLISYEN+NGW VLAVGS PL+V RGNL+L VF
Subjt:  EKVIDIKNDPLMSEKGITFETVRIGKNIKGEDDPTLMSRFWITQWGYFVMKSQLKGSSASETTEDILRLISYENENGWVVLAVGSTPLVVGRGNLILAVF

Query:  EEFNKWKKNLSIKGFPDSFKDYF-NDVALKTHQCERLTLPGFSGWIPMVVNCPECPRFMET
        E+FNKWK+NL+IK FPD+F+DYF N++ LK H CER+TLPGFSGWIPM+VNCPECPRFMET
Subjt:  EEFNKWKKNLSIKGFPDSFKDYF-NDVALKTHQCERLTLPGFSGWIPMVVNCPECPRFMET

XP_038897661.1 uncharacterized protein LOC120085635 [Benincasa hispida]3.4e-21282.46Show/hide
Query:  ESGMQDQRSNMEENSQILEIYELPYSDLLLLSTSYHS----EENERTESLTKSILEALGPKGPGLLAITGVPNSSILRRALLPLARNLALLNPEDRKRIL
        +S +++QR+ M EN++IL+IYEL YSDLLLLS+ YHS    +E+ER ES+TKSILEALGP GPGLLA+TGVPNSS+LRRALLPLAR LALLNP+ RKRIL
Subjt:  ESGMQDQRSNMEENSQILEIYELPYSDLLLLSTSYHS----EENERTESLTKSILEALGPKGPGLLAITGVPNSSILRRALLPLARNLALLNPEDRKRIL

Query:  KDHNIGSDVPLRNPERSVSSFAMQLKYTESKVFMQNNQCLRDDKQSPGSEIDHYSNWVGKEFQYNEFKHLGDSFKELGSCMMELGLRIACICDRKIGGQE
        KDHN+GSDVPLRNPERSVSSFAMQLKYTESK FMQNNQ  R D+QS GS++D + + + KEFQ NEFKHLGDSFKELGSCMMELGLRIA ICD++IGG+E
Subjt:  KDHNIGSDVPLRNPERSVSSFAMQLKYTESKVFMQNNQCLRDDKQSPGSEIDHYSNWVGKEFQYNEFKHLGDSFKELGSCMMELGLRIACICDRKIGGQE

Query:  LEQSLLESCTAKGRLIHYHSTLDAQHLRKPATSKGSARNRANSTRNRERSIHSRQELSESNGLCQSSTNLWQQWHYDYGIFTVLTTPMFLSPSNTLETEA
        LEQSLLESCTAKGRLIHYHS LDAQ LRKPA SKG+ARN+A+S RNRE+ I SR E S+SNGLCQS+TNLWQQWHYDYGIFTVLTTPMFL PSNTLET A
Subjt:  LEQSLLESCTAKGRLIHYHSTLDAQHLRKPATSKGSARNRANSTRNRERSIHSRQELSESNGLCQSSTNLWQQWHYDYGIFTVLTTPMFLSPSNTLETEA

Query:  QDQCCYSECTSPSGHLYLQIFDPCKNDIFMVNAPPESFIIQVGESADIISQGRLRSTLHSVCRPSKQENLCREMYVVFLQPAWNKTFSISGYPIESSMLS
        QD CCYSECTSPSGHLYLQIFDPCKND+FMVN+PPESFIIQVGESADIISQG+LRSTLHSVCRPSKQE+LCREMYVVFLQPAWNKTFS+SG+P ESSMLS
Subjt:  QDQCCYSECTSPSGHLYLQIFDPCKNDIFMVNAPPESFIIQVGESADIISQGRLRSTLHSVCRPSKQENLCREMYVVFLQPAWNKTFSISGYPIESSMLS

Query:  EDRKDLVETERTIITREIQKIVPPIASRLKEGMTFAEFSRETTKQYYGGSGLQLNR
        EDRK LVE E  +ITREIQKIVPP+ASRLKEGMTFAEFSRETTKQYYGGSGLQ NR
Subjt:  EDRKDLVETERTIITREIQKIVPPIASRLKEGMTFAEFSRETTKQYYGGSGLQLNR

XP_038906603.1 protein SIEVE ELEMENT OCCLUSION B-like [Benincasa hispida]3.7e-21164.64Show/hide
Query:  KAHQTTMNILDILISYPWEAKALMTLTAFTAEYGDIWHLNHYSHLDPLAKSLAMVKGISSLKKHQDSLRYRQLLSSPNSLIYNCLQAIKYMNQLRNFSKY
        KAH+TT+ ILDIL+SYPWEAKA++TLTAF  EYGDIWHLNHYS LDPLAKSLAM+K +  LKK  DS++YRQLL SPNSLI++CL+A+KY++QL+NF+KY
Subjt:  KAHQTTMNILDILISYPWEAKALMTLTAFTAEYGDIWHLNHYSHLDPLAKSLAMVKGISSLKKHQDSLRYRQLLSSPNSLIYNCLQAIKYMNQLRNFSKY

Query:  DIRELSELSSALRQIPLVTYWVIHIIVASRTEISSYLNNTEGHPQKYLNELSEKISSILNILENHLNIVREQQVD-------------------------
        DI+ELSELSS LRQIPLV+YW+IHIIVASR EISSYLN TEG  QKYLNEL+EKI+SIL  LENHLNI+R QQ +                         
Subjt:  DIRELSELSSALRQIPLVTYWVIHIIVASRTEISSYLNNTEGHPQKYLNELSEKISSILNILENHLNIVREQQVD-------------------------

Query:  -RRKIDAKPFIDGSTLTQVNIEDGLKDKNVILVISGLNISDQDIKALHFVYNEC---------------------------------------AIYY--K
           K+DAKPFIDGST  QV++ED L+DKNVIL+ISGL+IS+ DI+ALH +Y+E                                         + Y  K
Subjt:  -RRKIDAKPFIDGSTLTQVNIEDGLKDKNVILVISGLNISDQDIKALHFVYNEC---------------------------------------AIYY--K

Query:  IAGLRFLEEKWQLREDPLVVVLNSKSKVEFTNAIHLIRVWGTDAIPFTYDKTDNLLRKNWPESTILKFTDHPRLQNWINQEKSILFYGGKDPTWIQQFEE
        IAG R+LEE WQLR+DPLVVV+NSKS+VEFTNAIHLIRVWGT+A+PFT  +T  LL K+WPEST+ KF + PRLQ W+NQE+SI+FYGGKDP WIQ+FEE
Subjt:  IAGLRFLEEKWQLREDPLVVVLNSKSKVEFTNAIHLIRVWGTDAIPFTYDKTDNLLRKNWPESTILKFTDHPRLQNWINQEKSILFYGGKDPTWIQQFEE

Query:  KVIDIKNDPLMSEKGITFETVRIGKNIKGED-DPTLMSRFWITQWGYFVMKSQLKGSSASETTEDILRLISYENENGWVVLAVGSTPLVVGRGNLILAVF
        KV++IKNDP + EKG TFE +R+G+NIKGE+ D TL  RFW+TQWGYFV+KSQLKGSSA+ETTEDILRLISYENENGW +LA+GS PL+VGRGNLIL V 
Subjt:  KVIDIKNDPLMSEKGITFETVRIGKNIKGED-DPTLMSRFWITQWGYFVMKSQLKGSSASETTEDILRLISYENENGWVVLAVGSTPLVVGRGNLILAVF

Query:  EEFNKWKKNLSIKGFPDSFKDYFNDVALKTHQCERLTLPGFSGWIPMVVNCPECPRFMET
        ++FNKWK+N++I+ FPD+F+DYFN++ LK H CER+TLPGFSGWIPM+VNCPECPRFMET
Subjt:  EEFNKWKKNLSIKGFPDSFKDYFNDVALKTHQCERLTLPGFSGWIPMVVNCPECPRFMET

TrEMBL top hitse value%identityAlignment
A0A0A0LNQ7 Uncharacterized protein7.6e-21064.4Show/hide
Query:  AHQTTMNILDILISYPWEAKALMTLTAFTAEYGDIWHLNHYSHLDPLAKSLAMVKGISSLKKHQDSLRYRQLLSSPNSLIYNCLQAIKYMNQLRNFSKYD
        AH+TT+ ILDIL+SYPWEAKA++TLTAF  EYGDIWHLNHYS LDPLAKSLAM+K +  LKK  DS++YRQLL +PNSLIY+CL+A+KY++ L+NFSKYD
Subjt:  AHQTTMNILDILISYPWEAKALMTLTAFTAEYGDIWHLNHYSHLDPLAKSLAMVKGISSLKKHQDSLRYRQLLSSPNSLIYNCLQAIKYMNQLRNFSKYD

Query:  IRELSELSSALRQIPLVTYWVIHIIVASRTEISSYLNNTEGHPQKYLNELSEKISSILNILENHLNIVREQQ---------VD-----------------
        I+ELSELSS LRQIPLV YW+IHIIVASR EISSYLN TEG  QKY+NELSEKI+SIL  LENHL I++EQQ         VD                 
Subjt:  IRELSELSSALRQIPLVTYWVIHIIVASRTEISSYLNNTEGHPQKYLNELSEKISSILNILENHLNIVREQQ---------VD-----------------

Query:  RRKIDAKPFIDGSTLTQVNIEDGLKDKNVILVISGLNISDQDIKALHFVYNEC---------------------------------------AIYY--KI
          K DAKPFIDGST  QV++EDGL+DKNVILVISGL+IS+ DI+ALH +YNE                                         + Y  KI
Subjt:  RRKIDAKPFIDGSTLTQVNIEDGLKDKNVILVISGLNISDQDIKALHFVYNEC---------------------------------------AIYY--KI

Query:  AGLRFLEEKWQLREDPLVVVLNSKSKVEFTNAIHLIRVWGTDAIPFTYDKTDNLLRKNWPESTILKFTDHPRLQNWINQEKSILFYGGKDPTWIQQFEEK
        AG R+LEE WQLR+DPL+VV+NSKS+VEF NAIHLIRVWG DAIPFT  +T+ LL KNWPEST+ KF D PRL NW+NQE++I+FYGGK+P WIQQFE++
Subjt:  AGLRFLEEKWQLREDPLVVVLNSKSKVEFTNAIHLIRVWGTDAIPFTYDKTDNLLRKNWPESTILKFTDHPRLQNWINQEKSILFYGGKDPTWIQQFEEK

Query:  VIDIKNDPLMSEKGITFETVRIGKNIKGE-DDPTLMSRFWITQWGYFVMKSQLKGSSASETTEDILRLISYENENGWVVLAVGSTPLVVGRGNLILAVFE
        +++IKNDP + EKG TFE +R+G+NIKG+ +D TL  +FW+TQWGYFV+KSQLKGSSA+ETTEDILRLISYENENGW ++AVGSTPL+VGRGNLI+ V +
Subjt:  VIDIKNDPLMSEKGITFETVRIGKNIKGE-DDPTLMSRFWITQWGYFVMKSQLKGSSASETTEDILRLISYENENGWVVLAVGSTPLVVGRGNLILAVFE

Query:  EFNKWKKNLSIKGFPDSFKDYFNDVALKTHQCERLTLPGFSGWIPMVVNCPECPRFMET
        +FNKWK+N++IK FPD+F+DYFN++ L  H CER+TLPGFSGWIPM+VNCPECPRFMET
Subjt:  EFNKWKKNLSIKGFPDSFKDYFNDVALKTHQCERLTLPGFSGWIPMVVNCPECPRFMET

A0A6J1C993 protein SIEVE ELEMENT OCCLUSION B-like6.2e-22870.32Show/hide
Query:  KAHQTTMNILDILISYPWEAKALMTLTAFTAEYGDIWHLNHYSHLDPLAKSLAMVKGISSLKKHQDSLRYRQLLSSPNSLIYNCLQAIKYMNQLRNFSKY
        KAHQTT+ ILDILISYPWEAKA++ LTAF  EYGDIWHLNHYSH DPLAK+LA++K  +SLKKH DSL+YRQ+L SP SLIY+CL+AIK+MN++R FSKY
Subjt:  KAHQTTMNILDILISYPWEAKALMTLTAFTAEYGDIWHLNHYSHLDPLAKSLAMVKGISSLKKHQDSLRYRQLLSSPNSLIYNCLQAIKYMNQLRNFSKY

Query:  DIRELSELSSALRQIPLVTYWVIHIIVASRTEISSYLNNTEGHPQKYLNELSEKISSILNILENHLNIVREQQVD-------------------------
        DI+EL+ELSSA+RQIPL+TYW+IHIIVASRTE+SSYLN+TEG PQ+YL EL+EKI+SI+NILEN L+++R QQ +                         
Subjt:  DIRELSELSSALRQIPLVTYWVIHIIVASRTEISSYLNNTEGHPQKYLNELSEKISSILNILENHLNIVREQQVD-------------------------

Query:  -RRKIDAKPFIDGSTLTQVNIEDGLKDKNVILVISGLNISDQDIKALHFVYNEC------------------------------------AIYY--KIAG
           KI +KPF DGSTLT+V++E  L DKNVILVISGL+IS++DIKALH V++E                                      I Y  K+AG
Subjt:  -RRKIDAKPFIDGSTLTQVNIEDGLKDKNVILVISGLNISDQDIKALHFVYNEC------------------------------------AIYY--KIAG

Query:  LRFLEEKWQLREDPLVVVLNSKSKVEFTNAIHLIRVWGTDAIPFTYDKTDNLLRKNWPESTILKFTDHPRLQNWINQEKSILFYGGKDPTWIQQFEEKVI
        LRFLEE+WQLREDPLVVVLNSKSKVEFTNAIHLIRVWGTDAIPFT ++ D LLRKNWPESTI+KFTDHPRL++WINQE+SILFYGGKDP WIQ FEEKV+
Subjt:  LRFLEEKWQLREDPLVVVLNSKSKVEFTNAIHLIRVWGTDAIPFTYDKTDNLLRKNWPESTILKFTDHPRLQNWINQEKSILFYGGKDPTWIQQFEEKVI

Query:  DIKNDPLMSEKGITFETVRIGKNIKGEDDPTLMSRFWITQWGYFVMKSQLKGSSASETTEDILRLISYENENGWVVLAVGSTPLVVGRGNLILAVFEEFN
        DIK+DPLM +KGITFE VRIGKN +GEDDP LMSRFW+TQWGYF++KSQ+KGSSASETTEDILRLISY+NENGW VLAVGS PL+VGRGNL+LAV EEFN
Subjt:  DIKNDPLMSEKGITFETVRIGKNIKGEDDPTLMSRFWITQWGYFVMKSQLKGSSASETTEDILRLISYENENGWVVLAVGSTPLVVGRGNLILAVFEEFN

Query:  KWKKNLSIKGFPDSFKDYFNDVALKTHQCERLTLPGFSGWIPMVVNCPECPRFMET
        KWK NL+IKGFPDSF DYFND+ALKTHQCER+TLPGFSGWIPMVVNCPECPRFMET
Subjt:  KWKKNLSIKGFPDSFKDYFNDVALKTHQCERLTLPGFSGWIPMVVNCPECPRFMET

A0A6J1C9Z3 protein SIEVE ELEMENT OCCLUSION B-like5.8e-21065.35Show/hide
Query:  KAHQTTMNILDILISYPWEAKALMTLTAFTAEYGDIWHLNHYSHLDPLAKSLAMVKGISSLKKHQDSLRYRQLLSSPNSLIYNCLQAIKYMNQLRNFSKY
        KAH+TTM ILDILISYPWEAKA++ L AF  +YGD+WHLN+Y   DPLA++LA++K +  LKKH  + +YRQ+  SP  LI+ CLQAIKYM +++NFSKY
Subjt:  KAHQTTMNILDILISYPWEAKALMTLTAFTAEYGDIWHLNHYSHLDPLAKSLAMVKGISSLKKHQDSLRYRQLLSSPNSLIYNCLQAIKYMNQLRNFSKY

Query:  DIRELSELSSALRQIPLVTYWVIHIIVASRTEISSYLNNTEGHPQKYLNELSEKISSILNILENHLNIVREQQVD-------------------------
        DI+EL+ELSSA+RQIPL+TYW+IHIIVASRTEIS YL  T+G  Q YLNEL+EKI SIL  LENHLNI+REQQ +                         
Subjt:  DIRELSELSSALRQIPLVTYWVIHIIVASRTEISSYLNNTEGHPQKYLNELSEKISSILNILENHLNIVREQQVD-------------------------

Query:  -RRKIDAKPFIDGSTLTQVNIEDGLKDKNVILVISGLNISDQDIKALHFVYNEC-------------------------------------AIYY--KIA
           K++AKPFIDGST  +V++E+ L+ K VILVISGLNIS++DIKALH VYNE                                       + Y  KIA
Subjt:  -RRKIDAKPFIDGSTLTQVNIEDGLKDKNVILVISGLNISDQDIKALHFVYNEC-------------------------------------AIYY--KIA

Query:  GLRFLEEKWQLREDPLVVVLNSKSKVEFTNAIHLIRVWGTDAIPFTYDKTDNLLRKNWPESTILKFTDHPRLQNWINQEKSILFYGGKDPTWIQQFEEKV
        G RFLEE WQLR+DPLVVVLNS+SKVEFTNAIHLIRVWG++AIPFT  K D LLRKNWPESTILKFT HPRL +WINQ+KSI+FYGGKDP WIQQFE+KV
Subjt:  GLRFLEEKWQLREDPLVVVLNSKSKVEFTNAIHLIRVWGTDAIPFTYDKTDNLLRKNWPESTILKFTDHPRLQNWINQEKSILFYGGKDPTWIQQFEEKV

Query:  IDIKNDPLMSEKGITFETVRIGKNIKGEDDPTLMSRFWITQWGYFVMKSQLKGSSASETTEDILRLISYENENGWVVLAVGSTPLVVGRGNLILAVFEEF
        IDIKND L+  KGITFE VRIGKNI GEDDP LMSRFW+TQWG+F++KSQ++GSSASETTEDILRLISYENENGW V+ VGS PL+VGRG+LILAV E+F
Subjt:  IDIKNDPLMSEKGITFETVRIGKNIKGEDDPTLMSRFWITQWGYFVMKSQLKGSSASETTEDILRLISYENENGWVVLAVGSTPLVVGRGNLILAVFEEF

Query:  NKWKKNLSIKGFPDSFKDYFNDVALKTHQCERLTLPGFSGWIPMVVNCPECPRFMET
         KWK+ L++KGF DSFKDYFN++A+ THQC+R+TLPGFSGWIPMVVNCPECPRFMET
Subjt:  NKWKKNLSIKGFPDSFKDYFNDVALKTHQCERLTLPGFSGWIPMVVNCPECPRFMET

A0A6J1CAZ1 protein SIEVE ELEMENT OCCLUSION B-like7.1e-21666.67Show/hide
Query:  KAHQTTMNILDILISYPWEAKALMTLTAFTAEYGDIWHLNHYSHLDPLAKSLAMVKGISSLKKHQDSLRYRQLLSSPNSLIYNCLQAIKYMNQLRNFSKY
        KAHQTT+ ILDIL+SYPWEAKA++TLTAF  EYGDIWHLNHYS LDPLAKSLAM+K +  LKK  DS++YRQ+L SPNSLIY+CL+A+ Y+N+L+NFSKY
Subjt:  KAHQTTMNILDILISYPWEAKALMTLTAFTAEYGDIWHLNHYSHLDPLAKSLAMVKGISSLKKHQDSLRYRQLLSSPNSLIYNCLQAIKYMNQLRNFSKY

Query:  DIRELSELSSALRQIPLVTYWVIHIIVASRTEISSYLNNTEGHPQKYLNELSEKISSILNILENHLNIVREQQ---------VD----------------
        DI+EL+ELSS LRQIPLV+YW+IHIIVASRTEISSYLN TEG  QKYLNEL++KI SILN LENHLNI+  QQ         VD                
Subjt:  DIRELSELSSALRQIPLVTYWVIHIIVASRTEISSYLNNTEGHPQKYLNELSEKISSILNILENHLNIVREQQ---------VD----------------

Query:  -RRKIDAKPFIDGSTLTQVNIEDGLKDKNVILVISGLNISDQDIKALHFVYNEC----------------------------------------AIYY--
           K+DAKPFIDGST +QV+I+D L++KNVILVISGL+ISD DI+ALH VYNE                                          + Y  
Subjt:  -RRKIDAKPFIDGSTLTQVNIEDGLKDKNVILVISGLNISDQDIKALHFVYNEC----------------------------------------AIYY--

Query:  KIAGLRFLEEKWQLREDPLVVVLNSKSKVEFTNAIHLIRVWGTDAIPFTYDKTDNLLRKNWPESTILKFTDHPRLQNWINQEKSILFYGGKDPTWIQQFE
        KIAG R+LEE WQLR+DPLVVVL+S+S++EFTNAIHLIRVWGT+AIPFT  +T+ LL KNWPEST+ KF D PRLQ+W+NQE+SI+FYGGKDP WIQQFE
Subjt:  KIAGLRFLEEKWQLREDPLVVVLNSKSKVEFTNAIHLIRVWGTDAIPFTYDKTDNLLRKNWPESTILKFTDHPRLQNWINQEKSILFYGGKDPTWIQQFE

Query:  EKVIDIKNDPLMSEKGITFETVRIGKNIKGEDDPTLMSRFWITQWGYFVMKSQLKGSSASETTEDILRLISYENENGWVVLAVGSTPLVVGRGNLILAVF
        EKV++IKNDP + EKGITFE VR+GKNIKG++D TL  RFWITQWGYFV+KSQL+GSSA+ETTEDILRLISYEN+NGW VLAVGS PL+V RGNL+L VF
Subjt:  EKVIDIKNDPLMSEKGITFETVRIGKNIKGEDDPTLMSRFWITQWGYFVMKSQLKGSSASETTEDILRLISYENENGWVVLAVGSTPLVVGRGNLILAVF

Query:  EEFNKWKKNLSIKGFPDSFKDYF-NDVALKTHQCERLTLPGFSGWIPMVVNCPECPRFMET
        E+FNKWK+NL+IK FPD+F+DYF N++ LK H CER+TLPGFSGWIPM+VNCPECPRFMET
Subjt:  EEFNKWKKNLSIKGFPDSFKDYF-NDVALKTHQCERLTLPGFSGWIPMVVNCPECPRFMET

A0A6J1KK25 protein SIEVE ELEMENT OCCLUSION B-like5.5e-20863.77Show/hide
Query:  KAHQTTMNILDILISYPWEAKALMTLTAFTAEYGDIWHLNHYSHLDPLAKSLAMVKGISSLKKHQDSLRYRQLLSSPNSLIYNCLQAIKYMNQLRNFSKY
        +AH+TT++ILDIL+SY WEAKA++TLTAF AEYGDIWHLNHYS LDPLAKSL+M+K +  LKK  + ++YRQ+L SPNSLIY+CL+A+KY+ QL+NFSKY
Subjt:  KAHQTTMNILDILISYPWEAKALMTLTAFTAEYGDIWHLNHYSHLDPLAKSLAMVKGISSLKKHQDSLRYRQLLSSPNSLIYNCLQAIKYMNQLRNFSKY

Query:  DIRELSELSSALRQIPLVTYWVIHIIVASRTEISSYLNNTEGHPQKYLNELSEKISSILNILENHLNIVREQQVD-------------------------
        D +ELSELSS LRQIPLV+YW+IHIIVA+R EISSYLN TEG  QKYLNEL+EKI+SILN+LE HLN +R QQ +                         
Subjt:  DIRELSELSSALRQIPLVTYWVIHIIVASRTEISSYLNNTEGHPQKYLNELSEKISSILNILENHLNIVREQQVD-------------------------

Query:  -RRKIDAKPFIDGSTLTQVNIEDGLKDKNVILVISGLNISDQDIKALHFVYNEC-------------------------------------------AIY
           K++AKPFIDGST  QV++EDGL+DKNVIL+ISGL+IS+ DI+ALH VYNE                                             + 
Subjt:  -RRKIDAKPFIDGSTLTQVNIEDGLKDKNVILVISGLNISDQDIKALHFVYNEC-------------------------------------------AIY

Query:  Y--KIAGLRFLEEKWQLREDPLVVVLNSKSKVEFTNAIHLIRVWGTDAIPFTYDKTDNLLRKNWPESTILKFTDHPRLQNWINQEKSILFYGGKDPTWIQ
        Y  KIAG R+LEE WQLR+DPLVVV+NS+S+VEFTNAIHLIRVWGT+AIPFT  +T  LL KNWPEST+LKF + PRL++W+NQ+++I+FYGGKDP WIQ
Subjt:  Y--KIAGLRFLEEKWQLREDPLVVVLNSKSKVEFTNAIHLIRVWGTDAIPFTYDKTDNLLRKNWPESTILKFTDHPRLQNWINQEKSILFYGGKDPTWIQ

Query:  QFEEKVIDIKNDPLMSEKGITFETVRIGKNIKGEDDPTLMSRFWITQWGYFVMKSQLKGSSASETTEDILRLISYENENGWVVLAVGSTPLVVGRGNLIL
        QFEEKV++IKNDP + +KG TFE VR+GK I   +D  L   FWITQWGYFV+KSQLKGSSA+ETTEDILRLISYENENGW VLAVGS PL+VGRGNLIL
Subjt:  QFEEKVIDIKNDPLMSEKGITFETVRIGKNIKGEDDPTLMSRFWITQWGYFVMKSQLKGSSASETTEDILRLISYENENGWVVLAVGSTPLVVGRGNLIL

Query:  AVFEEFNKWKKNLSIKGFPDSFKDYFNDVALKTHQCERLTLPGFSGWIPMVVNCPECPRFMET
         V E+FNKWK+NL+I+ FPD+FKDYFN++ LK H CER+TLPGFSGWIPM+VNCPECPRFMET
Subjt:  AVFEEFNKWKKNLSIKGFPDSFKDYFNDVALKTHQCERLTLPGFSGWIPMVVNCPECPRFMET

SwissProt top hitse value%identityAlignment
Q93XX2 Protein SIEVE ELEMENT OCCLUSION A1.0e-3024.11Show/hide
Query:  TTMNILDILISYPWEAKALMTLTAFTAEYGDIWHLNHYSHLDPLAKSLAMVKGISSLKKHQDSLRYRQLLSSPNSLIYNCLQAIKYMNQLRNFSKYDIRE
        TT ++L ++  Y W+AK ++ L+A   +YG    L      + L KSLA++K + S+   Q++L  R  L     L+ + +     +  +     Y +  
Subjt:  TTMNILDILISYPWEAKALMTLTAFTAEYGDIWHLNHYSHLDPLAKSLAMVKGISSLKKHQDSLRYRQLLSSPNSLIYNCLQAIKYMNQLRNFSKYDIRE

Query:  LSELSSALRQIPLVTYWVI--------HIIVAS---RTEISSYLNNTEGHPQK--------YL-----------------NELSEKISSILNILENHLNI
            ++    IP   YW++        HI  AS   + +I S++  +E H           YL                  E  E I +   I+  H+++
Subjt:  LSELSSALRQIPLVTYWVI--------HIIVAS---RTEISSYLNNTEGHPQK--------YL-----------------NELSEKISSILNILENHLNI

Query:  VREQQVDRRKIDAKPFIDGSTLTQVNIEDGLKDKNVILVISGLNISDQDIKALHFVYNEC-------------------------AIYYKI---------
        V       R ID      G +  +V I + L  K+V+L+IS L   ++++  L  +Y E                          A++  +         
Subjt:  VREQQVDRRKIDAKPFIDGSTLTQVNIEDGLKDKNVILVISGLNISDQDIKALHFVYNEC-------------------------AIYYKI---------

Query:  ----AGLRFLEEKWQLREDPLVVVLNSKSKVEFTNAIHLIRVWGTDAIPFTYDKTDNL-LRKNWPESTILKFTDHPRLQNWINQEKSILFYGGKDPTWIQ
            A +RF+ E W  +  P++V L+ K +V  TNA  ++ +W   A PFT  +  +L   + W    ++  TD P   N +   K I  YGG+D  WI+
Subjt:  ----AGLRFLEEKWQLREDPLVVVLNSKSKVEFTNAIHLIRVWGTDAIPFTYDKTDNL-LRKNWPESTILKFTDHPRLQNWINQEKSILFYGGKDPTWIQ

Query:  QFEEKVIDIKNDPLMSEKGITFETVRIGK-NIKGEDDPTLMS-----------------RFWITQWGYFVMKSQ------LKGSSASETTE------DIL
         F     ++          I  E V +GK N K    P + +                  FW      +  K +      +KG    +  E      +++
Subjt:  QFEEKVIDIKNDPLMSEKGITFETVRIGK-NIKGEDDPTLMS-----------------RFWITQWGYFVMKSQ------LKGSSASETTE------DIL

Query:  RLISYENE-NGWVVLAVGSTPLVVGRGNLILAVFEEFNKWKKNLSIKGFPDSFKDYFNDVALKTHQCERLTLPGFSGWIPMVVNCPECPRFME
         ++ Y  E +GW +++  S  +V  +GNL      EFN+W+ N+  KGF  +  D+   + L  H C R  LP  +G IP  V C EC R ME
Subjt:  RLISYENE-NGWVVLAVGSTPLVVGRGNLILAVFEEFNKWKKNLSIKGFPDSFKDYFNDVALKTHQCERLTLPGFSGWIPMVVNCPECPRFME

Q9FXE2 Protein SIEVE ELEMENT OCCLUSION C9.8e-1320.1Show/hide
Query:  DRKAHQTTMNILDILISYPWEAKALMTLTAFTAEYGDIWHLNHYSHLDPLAKSLA---------------------MVKGISSLKK---HQDSLRYRQ--
        + +  + TM + D+L  Y W+AKA++ L    A YG +    H +  DP+A S+A                     ++K +  + K     + + ++Q  
Subjt:  DRKAHQTTMNILDILISYPWEAKALMTLTAFTAEYGDIWHLNHYSHLDPLAKSLA---------------------MVKGISSLKK---HQDSLRYRQ--

Query:  ----LLSSPNSLIY-----------NCLQAIKYMNQLRNF--SKYDIRELS-ELSSALRQIPLVTYWVIHI-------IVASRTEISSYLN------NTE
            +L    S IY            C+Q I Y  Q +    S+    ELS E   A  ++  + Y +++I       +    T+I   +N      N E
Subjt:  ----LLSSPNSLIY-----------NCLQAIKYMNQLRNF--SKYDIRELS-ELSSALRQIPLVTYWVIHI-------IVASRTEISSYLN------NTE

Query:  GHP--QKYLNELSEKISSI-LNILENHLNIVREQQVDRRKIDAKPFIDGSTLTQVNIEDGLKDKNV--------ILVISGLNISDQDIKALHFVYNECAI
         H   Q  L+ L      + L      ++I   Q      + +KP ++        + D   + N         + + S    +D++ +   F Y+    
Subjt:  GHP--QKYLNELSEKISSI-LNILENHLNIVREQQVDRRKIDAKPFIDGSTLTQVNIEDGLKDKNV--------ILVISGLNISDQDIKALHFVYNECAI

Query:  YYKIAG--------LRFLEEKWQLRE-DPLVVVLNSKSKVEFTNAIHLIRVWGTDAIPFTYDKTDNLLRKNWPESTILKFTDHPRLQNWINQEKSILFYG
        +  +          L F +++W  ++ + ++VV++S  +    NA+ ++ +WG  A PF+  + D L +++     +L    HP  +      + I  +G
Subjt:  YYKIAG--------LRFLEEKWQLRE-DPLVVVLNSKSKVEFTNAIHLIRVWGTDAIPFTYDKTDNLLRKNWPESTILKFTDHPRLQNWINQEKSILFYG

Query:  GKDPTWIQQFEEKVIDIKNDPLMSEKGITFETVRIGKNIKGED---------DPTLMSRFWITQWGYFVMKSQLK-----GSSASETTEDILRLI--SYE
         ++  WI +F      I+N       G   E + +    + E           PTL   FW+      + +S+LK      S      E++  L+   Y 
Subjt:  GKDPTWIQQFEEKVIDIKNDPLMSEKGITFETVRIGKNIKGED---------DPTLMSRFWITQWGYFVMKSQLK-----GSSASETTEDILRLI--SYE

Query:  NENGWVVLAVGSTPLVVGRGNLILAVFEEFNKWKKNLSIKGFPDSFKDYFNDVALKTHQCERLTLPGFSGWIPM-VVNCPEC
           GW ++  GST   V  G  +     +  +W +     GF ++ +      A K  +     +  F   + M VV C +C
Subjt:  NENGWVVLAVGSTPLVVGRGNLILAVFEEFNKWKKNLSIKGFPDSFKDYFNDVALKTHQCERLTLPGFSGWIPM-VVNCPEC

Q9SS87 Protein SIEVE ELEMENT OCCLUSION B2.7e-3923.28Show/hide
Query:  AHQTTMNILDILISYPWEAKALMTLTAFTAEYGDIWHLNHYSHLDPLAKSLAMVKGISSLKKHQDSLRYRQLLSSPNSLIYNCLQAIKYMNQLRNF-SKY
        +H+ TM++ + L S+ W+ K ++TL AF   YG+ W L  +   + LAKSLAM+K    L   Q+ +    +    N LI         + +L     +Y
Subjt:  AHQTTMNILDILISYPWEAKALMTLTAFTAEYGDIWHLNHYSHLDPLAKSLAMVKGISSLKKHQDSLRYRQLLSSPNSLIYNCLQAIKYMNQLRNF-SKY

Query:  DIRELSELSSALRQIPLVTYWVIHIIVASRTEISSYLNNTEGHPQ--------------KYLNELSEKISSILNILENHLNIVREQQ-------------
           ++ +LS  L  IP+  YW I  ++A  ++I+  +    GH                  L  + + ++  L +   H+   R  +             
Subjt:  DIRELSELSSALRQIPLVTYWVIHIIVASRTEISSYLNNTEGHPQ--------------KYLNELSEKISSILNILENHLNIVREQQ-------------

Query:  VDRRKI-----DAKPFI----DGSTLTQVNIEDGLKDKNVILVISGLNISDQDIKALHFVYNEC---------------------------------AIY
        +D  KI       KP I    DG T  +V++ D L+ K V+L+IS LNI   ++     +Y E                                   + 
Subjt:  VDRRKI-----DAKPFI----DGSTLTQVNIEDGLKDKNVILVISGLNISDQDIKALHFVYNEC---------------------------------AIY

Query:  YKIAGLR--------------------FLEEKWQLREDPLVVVLNSKSKVEFTNAIHLIRVWGTDAIPFTYDKTDNLLRKNWPESTILKFTDHPRLQNWI
         K   LR                    F+  +W     P++VV++ +      NA+H+I +WGT+A PFT  + + L R+      ++       + NWI
Subjt:  YKIAGLR--------------------FLEEKWQLREDPLVVVLNSKSKVEFTNAIHLIRVWGTDAIPFTYDKTDNLLRKNWPESTILKFTDHPRLQNWI

Query:  NQEKSILFYGGKDPTWIQQFEEKVIDIKNDPLMSEKGITFETVRIGKN--------------IKGED------DPTLMSRFWITQWGYFVMKSQL-KGSS
          +  I  YGG D  WI++F      +       +  +  E   +GK               I+ E+      +P LM  FW         K QL K   
Subjt:  NQEKSILFYGGKDPTWIQQFEEKVIDIKNDPLMSEKGITFETVRIGKN--------------IKGED------DPTLMSRFWITQWGYFVMKSQL-KGSS

Query:  ASETTEDILRLISYENENGWVVLAVGSTPLVVGRGNLILAVFEEFNKWKKNLSIKGFPDSFKDYFNDVALKT--HQCERLT--LPGFSGWIPMVVNCPEC
          +  + I +++SY+   GW +L+ G   +++  G +   +      WK ++  KG+  +  D+ +D  L+     C      +   SG IP  +NC EC
Subjt:  ASETTEDILRLISYENENGWVVLAVGSTPLVVGRGNLILAVFEEFNKWKKNLSIKGFPDSFKDYFNDVALKT--HQCERLT--LPGFSGWIPMVVNCPEC

Query:  PRFMETVKVF
         R ME    F
Subjt:  PRFMETVKVF

Arabidopsis top hitse value%identityAlignment
AT1G67790.1 unknown protein8.0e-1821.28Show/hide
Query:  DRKAHQTTMNILDILISYPWEAKALMTLTAFTAEYGDIWHLNHYSHLDPLAKSLAMVKGISSLKKHQDSLRYRQLLSSPNSLIYNCLQAIKYMNQLRNFS
        + +  + TM + D+L  Y W+AKA++ L    A YG +    H +  DP+A S+A +  +       +  ++R  L S N LI   +   K + +     
Subjt:  DRKAHQTTMNILDILISYPWEAKALMTLTAFTAEYGDIWHLNHYSHLDPLAKSLAMVKGISSLKKHQDSLRYRQLLSSPNSLIYNCLQAIKYMNQLRNFS

Query:  -KYDIRELSELSSALRQIPLVTYWVIHIIVASRTEISSYLNNTEGHPQKYLNELSEKISSIL--NILENHLNIVREQQVDRRKIDAKPFIDGSTLTQVNI
         K    + + L   L  I L TY V+   +    +I  Y   T+   Q  + E+ +K++ +L        L  + +Q  D            +T T+ N 
Subjt:  -KYDIRELSELSSALRQIPLVTYWVIHIIVASRTEISSYLNNTEGHPQKYLNELSEKISSIL--NILENHLNIVREQQVDRRKIDAKPFIDGSTLTQVNI

Query:  EDGLKDKNVILVISGLNISDQDIKALHFVYNECAIYYKIAG--------LRFLEEKWQLRE-DPLVVVLNSKSKVEFTNAIHLIRVWGTDAIPFTYDKTD
        E        + + S    +D++ +   F Y+    +  +          L F +++W  ++ + ++VV++S  +    NA+ ++ +WG  A PF+  + D
Subjt:  EDGLKDKNVILVISGLNISDQDIKALHFVYNECAIYYKIAG--------LRFLEEKWQLRE-DPLVVVLNSKSKVEFTNAIHLIRVWGTDAIPFTYDKTD

Query:  NLLRKNWPESTILKFTDHPRLQNWINQEKSILFYGGKDPTWIQQFEEKVIDIKNDPLMSEKGITFETVRIGKNIKGED---------DPTLMSRFWITQW
         L +++     +L    HP  +      + I  +G ++  WI +F      I+N       G   E + +    + E           PTL   FW+   
Subjt:  NLLRKNWPESTILKFTDHPRLQNWINQEKSILFYGGKDPTWIQQFEEKVIDIKNDPLMSEKGITFETVRIGKNIKGED---------DPTLMSRFWITQW

Query:  GYFVMKSQLK-----GSSASETTEDILRLI--SYENENGWVVLAVGSTPLVVGRGNLILAVFEEFNKWKKNLSIKGFPDSFKDYFNDVALKTHQCERLTL
           + +S+LK      S      E++  L+   Y    GW ++  GST   V  G  +     +  +W +     GF ++ +      A K  +     +
Subjt:  GYFVMKSQLK-----GSSASETTEDILRLI--SYENENGWVVLAVGSTPLVVGRGNLILAVFEEFNKWKKNLSIKGFPDSFKDYFNDVALKTHQCERLTL

Query:  PGFSGWIPM-VVNCPEC
          F   + M VV C +C
Subjt:  PGFSGWIPM-VVNCPEC

AT3G01670.1 unknown protein7.4e-3224.11Show/hide
Query:  TTMNILDILISYPWEAKALMTLTAFTAEYGDIWHLNHYSHLDPLAKSLAMVKGISSLKKHQDSLRYRQLLSSPNSLIYNCLQAIKYMNQLRNFSKYDIRE
        TT ++L ++  Y W+AK ++ L+A   +YG    L      + L KSLA++K + S+   Q++L  R  L     L+ + +     +  +     Y +  
Subjt:  TTMNILDILISYPWEAKALMTLTAFTAEYGDIWHLNHYSHLDPLAKSLAMVKGISSLKKHQDSLRYRQLLSSPNSLIYNCLQAIKYMNQLRNFSKYDIRE

Query:  LSELSSALRQIPLVTYWVI--------HIIVAS---RTEISSYLNNTEGHPQK--------YL-----------------NELSEKISSILNILENHLNI
            ++    IP   YW++        HI  AS   + +I S++  +E H           YL                  E  E I +   I+  H+++
Subjt:  LSELSSALRQIPLVTYWVI--------HIIVAS---RTEISSYLNNTEGHPQK--------YL-----------------NELSEKISSILNILENHLNI

Query:  VREQQVDRRKIDAKPFIDGSTLTQVNIEDGLKDKNVILVISGLNISDQDIKALHFVYNEC-------------------------AIYYKI---------
        V       R ID      G +  +V I + L  K+V+L+IS L   ++++  L  +Y E                          A++  +         
Subjt:  VREQQVDRRKIDAKPFIDGSTLTQVNIEDGLKDKNVILVISGLNISDQDIKALHFVYNEC-------------------------AIYYKI---------

Query:  ----AGLRFLEEKWQLREDPLVVVLNSKSKVEFTNAIHLIRVWGTDAIPFTYDKTDNL-LRKNWPESTILKFTDHPRLQNWINQEKSILFYGGKDPTWIQ
            A +RF+ E W  +  P++V L+ K +V  TNA  ++ +W   A PFT  +  +L   + W    ++  TD P   N +   K I  YGG+D  WI+
Subjt:  ----AGLRFLEEKWQLREDPLVVVLNSKSKVEFTNAIHLIRVWGTDAIPFTYDKTDNL-LRKNWPESTILKFTDHPRLQNWINQEKSILFYGGKDPTWIQ

Query:  QFEEKVIDIKNDPLMSEKGITFETVRIGK-NIKGEDDPTLMS-----------------RFWITQWGYFVMKSQ------LKGSSASETTE------DIL
         F     ++          I  E V +GK N K    P + +                  FW      +  K +      +KG    +  E      +++
Subjt:  QFEEKVIDIKNDPLMSEKGITFETVRIGK-NIKGEDDPTLMS-----------------RFWITQWGYFVMKSQ------LKGSSASETTE------DIL

Query:  RLISYENE-NGWVVLAVGSTPLVVGRGNLILAVFEEFNKWKKNLSIKGFPDSFKDYFNDVALKTHQCERLTLPGFSGWIPMVVNCPECPRFME
         ++ Y  E +GW +++  S  +V  +GNL      EFN+W+ N+  KGF  +  D+   + L  H C R  LP  +G IP  V C EC R ME
Subjt:  RLISYENE-NGWVVLAVGSTPLVVGRGNLILAVFEEFNKWKKNLSIKGFPDSFKDYFNDVALKTHQCERLTLPGFSGWIPMVVNCPECPRFME

AT3G01680.1 CONTAINS InterPro DOMAIN/s: Mediator complex subunit Med28 (InterPro:IPR021640)1.9e-4023.28Show/hide
Query:  AHQTTMNILDILISYPWEAKALMTLTAFTAEYGDIWHLNHYSHLDPLAKSLAMVKGISSLKKHQDSLRYRQLLSSPNSLIYNCLQAIKYMNQLRNF-SKY
        +H+ TM++ + L S+ W+ K ++TL AF   YG+ W L  +   + LAKSLAM+K    L   Q+ +    +    N LI         + +L     +Y
Subjt:  AHQTTMNILDILISYPWEAKALMTLTAFTAEYGDIWHLNHYSHLDPLAKSLAMVKGISSLKKHQDSLRYRQLLSSPNSLIYNCLQAIKYMNQLRNF-SKY

Query:  DIRELSELSSALRQIPLVTYWVIHIIVASRTEISSYLNNTEGHPQ--------------KYLNELSEKISSILNILENHLNIVREQQ-------------
           ++ +LS  L  IP+  YW I  ++A  ++I+  +    GH                  L  + + ++  L +   H+   R  +             
Subjt:  DIRELSELSSALRQIPLVTYWVIHIIVASRTEISSYLNNTEGHPQ--------------KYLNELSEKISSILNILENHLNIVREQQ-------------

Query:  VDRRKI-----DAKPFI----DGSTLTQVNIEDGLKDKNVILVISGLNISDQDIKALHFVYNEC---------------------------------AIY
        +D  KI       KP I    DG T  +V++ D L+ K V+L+IS LNI   ++     +Y E                                   + 
Subjt:  VDRRKI-----DAKPFI----DGSTLTQVNIEDGLKDKNVILVISGLNISDQDIKALHFVYNEC---------------------------------AIY

Query:  YKIAGLR--------------------FLEEKWQLREDPLVVVLNSKSKVEFTNAIHLIRVWGTDAIPFTYDKTDNLLRKNWPESTILKFTDHPRLQNWI
         K   LR                    F+  +W     P++VV++ +      NA+H+I +WGT+A PFT  + + L R+      ++       + NWI
Subjt:  YKIAGLR--------------------FLEEKWQLREDPLVVVLNSKSKVEFTNAIHLIRVWGTDAIPFTYDKTDNLLRKNWPESTILKFTDHPRLQNWI

Query:  NQEKSILFYGGKDPTWIQQFEEKVIDIKNDPLMSEKGITFETVRIGKN--------------IKGED------DPTLMSRFWITQWGYFVMKSQL-KGSS
          +  I  YGG D  WI++F      +       +  +  E   +GK               I+ E+      +P LM  FW         K QL K   
Subjt:  NQEKSILFYGGKDPTWIQQFEEKVIDIKNDPLMSEKGITFETVRIGKN--------------IKGED------DPTLMSRFWITQWGYFVMKSQL-KGSS

Query:  ASETTEDILRLISYENENGWVVLAVGSTPLVVGRGNLILAVFEEFNKWKKNLSIKGFPDSFKDYFNDVALKT--HQCERLT--LPGFSGWIPMVVNCPEC
          +  + I +++SY+   GW +L+ G   +++  G +   +      WK ++  KG+  +  D+ +D  L+     C      +   SG IP  +NC EC
Subjt:  ASETTEDILRLISYENENGWVVLAVGSTPLVVGRGNLILAVFEEFNKWKKNLSIKGFPDSFKDYFNDVALKT--HQCERLT--LPGFSGWIPMVVNCPEC

Query:  PRFMETVKVF
         R ME    F
Subjt:  PRFMETVKVF

AT3G63290.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein1.3e-11351.36Show/hide
Query:  SQILEIYELPYSDLLLLSTSYHSEENERTESLTKSILEALGPKGPGLLAITGVPNSSILRRALLPLARNLALLNPEDRKRILKDHNIGSDVPLRNPERSV
        ++ILE Y+L +SDLLL S         R++ ++K++++ALGP GPGLL ITGV  S+ LRR LLP+AR LALL+P+ RK IL +H++GSDVPL+NPER V
Subjt:  SQILEIYELPYSDLLLLSTSYHSEENERTESLTKSILEALGPKGPGLLAITGVPNSSILRRALLPLARNLALLNPEDRKRILKDHNIGSDVPLRNPERSV

Query:  SSFAMQLKYTESKVFMQNNQCLRDDKQSPGSEIDHYSNWVGKEFQYNEFKHLGDSFKELGSCMMELGLRIACICDRKIGGQELEQSLLESCTAKGRLIHY
        SSFAMQL Y  +       +   D+    GS++D       +E   + F +LG +FKELG CM ELGL IA +CDR+IGG  LE+SLL+SCTAKGRLIHY
Subjt:  SSFAMQLKYTESKVFMQNNQCLRDDKQSPGSEIDHYSNWVGKEFQYNEFKHLGDSFKELGSCMMELGLRIACICDRKIGGQELEQSLLESCTAKGRLIHY

Query:  HSTLDAQHLRKPATSKGSARNRANSTRNRERSIHSRQELSESN--GLCQSSTNLWQQWHYDYGIFTVLTTPMFLSPSNTLETEAQDQCCYSECTSPSGHL
        HS  D   LR+ +  +  + NR +S R  + +  + QEL+  N  GL  S  NLWQQWHYDYGIFTVLT PMFLSP +           Y E +  S H 
Subjt:  HSTLDAQHLRKPATSKGSARNRANSTRNRERSIHSRQELSESN--GLCQSSTNLWQQWHYDYGIFTVLTTPMFLSPSNTLETEAQDQCCYSECTSPSGHL

Query:  YLQIFDPCKNDIFMVNAPPESFIIQVGESADIISQGRLRSTLHSVCRPSKQENLCREMYVVFLQPAWNKTFSISGYPIESSMLSEDRKDLVETERTIITR
        YLQI+ P KN  +MV  P +SF++Q+GESADI+S+G+LRSTLH VC+P K +++ RE +VVFL P W++TFS+S Y +E           + ++  +   
Subjt:  YLQIFDPCKNDIFMVNAPPESFIIQVGESADIISQGRLRSTLHSVCRPSKQENLCREMYVVFLQPAWNKTFSISGYPIESSMLSEDRKDLVETERTIITR

Query:  EIQKIVPPIASRLKEGMTFAEFSRETTKQYYGGSGLQLNR
        ++Q IVPP++SRL++GMTFAEFSRETTKQYYGG+GLQ NR
Subjt:  EIQKIVPPIASRLKEGMTFAEFSRETTKQYYGGSGLQLNR

AT3G63290.2 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein1.1e-8053.77Show/hide
Query:  FKHLGDSFKELGSCMMELGLRIACICDRKIGGQELEQSLLESCTAKGRLIHYHSTLDAQHLRKPATSKGSARNRANSTRNRERSIHSRQELSESN--GLC
        F +LG +FKELG CM ELGL IA +CDR+IGG  LE+SLL+SCTAKGRLIHYHS  D   LR+ +  +  + NR +S R  + +  + QEL+  N  GL 
Subjt:  FKHLGDSFKELGSCMMELGLRIACICDRKIGGQELEQSLLESCTAKGRLIHYHSTLDAQHLRKPATSKGSARNRANSTRNRERSIHSRQELSESN--GLC

Query:  QSSTNLWQQWHYDYGIFTVLTTPMFLSPSNTLETEAQDQCCYSECTSPSGHLYLQIFDPCKNDIFMVNAPPESFIIQVGESADIISQGRLRSTLHSVCRP
         S  NLWQQWHYDYGIFTVLT PMFLSP +           Y E +  S H YLQI+ P KN  +MV  P +SF++Q+GESADI+S+G+LRSTLH VC+P
Subjt:  QSSTNLWQQWHYDYGIFTVLTTPMFLSPSNTLETEAQDQCCYSECTSPSGHLYLQIFDPCKNDIFMVNAPPESFIIQVGESADIISQGRLRSTLHSVCRP

Query:  SKQENLCREMYVVFLQPAWNKTFSISGYPIESSMLSEDRKDLVETERTIITREIQKIVPPIASRLKEGMTFAEFSRETTKQYYGGSGLQLNR
         K +++ RE +VVFL P W++TFS+S Y +E           + ++  +   ++Q IVPP++SRL++GMTFAEFSRETTKQYYGG+GLQ NR
Subjt:  SKQENLCREMYVVFLQPAWNKTFSISGYPIESSMLSEDRKDLVETERTIITREIQKIVPPIASRLKEGMTFAEFSRETTKQYYGGSGLQLNR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAAGGCTCCAGGGATAGAAAAGCTCATCAGACTACAATGAACATCCTTGATATACTGATTAGCTATCCATGGGAAGCTAAGGCATTGATGACTTTGACTGCTTTTAC
TGCTGAATATGGAGATATATGGCACCTCAACCATTATTCCCACTTGGACCCACTTGCAAAATCATTGGCAATGGTCAAGGGAATATCTTCACTAAAAAAGCACCAAGACT
CCCTCAGGTATCGACAGCTGCTTTCTAGCCCTAACAGTTTGATCTACAATTGCTTGCAAGCCATCAAATACATGAATCAACTCAGGAATTTCTCCAAATATGATATAAGA
GAACTTTCTGAGCTCTCTTCTGCCCTTCGCCAAATCCCCTTAGTTACGTATTGGGTTATACACATTATTGTTGCTTCCAGAACTGAGATCTCAAGTTACCTCAACAACAC
CGAGGGTCACCCACAAAAATATTTGAATGAGCTGTCTGAGAAGATTAGCTCCATACTCAACATACTTGAAAACCATCTAAACATTGTTCGGGAACAGCAAGTTGATCGAA
GGAAAATTGATGCTAAGCCTTTCATCGATGGTTCTACTCTAACGCAGGTCAATATTGAAGATGGCTTGAAGGACAAAAATGTAATATTGGTAATTTCAGGGCTAAACATA
TCCGATCAAGACATTAAAGCTCTTCATTTTGTTTACAATGAATGTGCAATATACTACAAAATTGCAGGGTTAAGGTTTCTTGAAGAGAAGTGGCAACTCAGAGAGGACCC
TTTAGTCGTTGTGCTTAACTCAAAATCAAAGGTGGAATTCACAAATGCAATTCATTTGATTCGAGTTTGGGGAACTGATGCCATTCCATTCACTTATGATAAAACAGACA
ATCTATTGAGAAAGAATTGGCCAGAGTCCACCATTTTGAAATTTACTGATCACCCCAGACTACAAAATTGGATCAATCAAGAAAAGAGTATCTTATTCTATGGAGGAAAA
GATCCCACATGGATCCAACAATTTGAAGAAAAAGTTATTGATATAAAAAATGATCCATTGATGAGTGAGAAAGGAATTACATTTGAGACTGTACGAATAGGAAAAAATAT
CAAAGGAGAAGACGATCCTACCCTTATGTCTCGTTTTTGGATCACACAATGGGGCTATTTTGTGATGAAGAGCCAATTAAAAGGTTCTAGTGCAAGTGAAACAACTGAAG
ATATTTTGCGATTGATATCCTATGAGAATGAAAATGGTTGGGTTGTTCTAGCAGTGGGCTCAACACCTCTAGTTGTAGGTCGTGGCAATTTGATTCTGGCAGTATTTGAA
GAGTTTAACAAATGGAAAAAGAATTTGAGTATAAAAGGCTTCCCCGACTCTTTTAAAGATTATTTCAATGACGTGGCTCTAAAAACTCATCAATGTGAGCGTTTGACTCT
TCCTGGATTTAGTGGATGGATTCCAATGGTGGTAAATTGTCCAGAGTGTCCTCGCTTCATGGAAACCGTTAAAGTCTTCTTCTTCCTACAGCTGTTGCAGAGTCCCCCAG
AGGGGGAGTCGGGAATGCAAGACCAGAGATCAAACATGGAGGAAAATTCTCAAATACTTGAAATCTATGAGCTCCCATATTCGGACCTCTTGCTCTTGTCTACATCTTAC
CATTCGGAAGAGAACGAAAGGACGGAATCGTTAACCAAATCCATTCTTGAAGCCCTAGGGCCTAAAGGGCCTGGCCTTCTCGCAATCACCGGCGTCCCCAATTCTTCTAT
TCTCCGGCGAGCGCTGCTTCCTCTCGCTCGCAACCTCGCGTTGCTCAATCCTGAGGATCGGAAACGGATTCTCAAGGATCATAACATAGGGAGTGACGTTCCCCTGAGGA
ATCCAGAAAGAAGTGTCTCCTCTTTTGCTATGCAACTCAAATATACAGAGAGTAAAGTATTCATGCAGAATAATCAATGTCTGAGAGATGACAAACAGTCACCTGGTTCA
GAAATAGATCATTACAGCAATTGGGTTGGGAAGGAATTTCAGTACAATGAATTTAAACATCTTGGCGATTCATTTAAAGAGCTAGGAAGTTGCATGATGGAACTGGGGCT
TCGCATTGCATGCATATGCGATCGCAAAATCGGAGGCCAAGAGTTAGAACAAAGCTTGTTAGAGTCATGCACTGCAAAAGGCCGCCTCATACACTATCATTCGACTCTGG
ATGCCCAGCATTTAAGAAAACCAGCAACCAGCAAAGGATCTGCAAGAAACCGAGCCAATTCTACAAGAAACAGAGAACGAAGCATACACAGTAGGCAAGAGTTATCAGAA
AGTAATGGACTATGTCAATCTAGTACCAATCTATGGCAGCAATGGCATTATGACTATGGTATCTTCACAGTCCTAACAACTCCCATGTTTCTTTCGCCATCAAATACGCT
TGAAACCGAAGCACAAGATCAATGTTGCTATAGCGAGTGTACTTCTCCCAGTGGGCACTTGTATTTGCAAATTTTTGATCCTTGTAAGAATGACATTTTCATGGTTAATG
CTCCTCCAGAAAGTTTTATCATTCAGGTGGGCGAATCGGCTGACATTATATCACAAGGAAGGCTTCGCTCCACTCTGCACTCTGTGTGCAGACCTTCCAAACAAGAGAAC
TTGTGCAGAGAAATGTATGTTGTATTCTTGCAGCCAGCTTGGAACAAAACGTTTTCCATATCTGGCTATCCCATTGAAAGCTCAATGTTGTCTGAGGACAGAAAAGATCT
TGTTGAAACCGAGAGAACGATAATAACTCGAGAAATCCAGAAGATAGTTCCACCAATAGCGTCAAGATTGAAGGAAGGGATGACATTTGCAGAGTTCTCACGTGAAACCA
CCAAGCAATATTATGGGGGAAGCGGTTTGCAATTGAACAGATGA
mRNA sequenceShow/hide mRNA sequence
ATGCAAGGCTCCAGGGATAGAAAAGCTCATCAGACTACAATGAACATCCTTGATATACTGATTAGCTATCCATGGGAAGCTAAGGCATTGATGACTTTGACTGCTTTTAC
TGCTGAATATGGAGATATATGGCACCTCAACCATTATTCCCACTTGGACCCACTTGCAAAATCATTGGCAATGGTCAAGGGAATATCTTCACTAAAAAAGCACCAAGACT
CCCTCAGGTATCGACAGCTGCTTTCTAGCCCTAACAGTTTGATCTACAATTGCTTGCAAGCCATCAAATACATGAATCAACTCAGGAATTTCTCCAAATATGATATAAGA
GAACTTTCTGAGCTCTCTTCTGCCCTTCGCCAAATCCCCTTAGTTACGTATTGGGTTATACACATTATTGTTGCTTCCAGAACTGAGATCTCAAGTTACCTCAACAACAC
CGAGGGTCACCCACAAAAATATTTGAATGAGCTGTCTGAGAAGATTAGCTCCATACTCAACATACTTGAAAACCATCTAAACATTGTTCGGGAACAGCAAGTTGATCGAA
GGAAAATTGATGCTAAGCCTTTCATCGATGGTTCTACTCTAACGCAGGTCAATATTGAAGATGGCTTGAAGGACAAAAATGTAATATTGGTAATTTCAGGGCTAAACATA
TCCGATCAAGACATTAAAGCTCTTCATTTTGTTTACAATGAATGTGCAATATACTACAAAATTGCAGGGTTAAGGTTTCTTGAAGAGAAGTGGCAACTCAGAGAGGACCC
TTTAGTCGTTGTGCTTAACTCAAAATCAAAGGTGGAATTCACAAATGCAATTCATTTGATTCGAGTTTGGGGAACTGATGCCATTCCATTCACTTATGATAAAACAGACA
ATCTATTGAGAAAGAATTGGCCAGAGTCCACCATTTTGAAATTTACTGATCACCCCAGACTACAAAATTGGATCAATCAAGAAAAGAGTATCTTATTCTATGGAGGAAAA
GATCCCACATGGATCCAACAATTTGAAGAAAAAGTTATTGATATAAAAAATGATCCATTGATGAGTGAGAAAGGAATTACATTTGAGACTGTACGAATAGGAAAAAATAT
CAAAGGAGAAGACGATCCTACCCTTATGTCTCGTTTTTGGATCACACAATGGGGCTATTTTGTGATGAAGAGCCAATTAAAAGGTTCTAGTGCAAGTGAAACAACTGAAG
ATATTTTGCGATTGATATCCTATGAGAATGAAAATGGTTGGGTTGTTCTAGCAGTGGGCTCAACACCTCTAGTTGTAGGTCGTGGCAATTTGATTCTGGCAGTATTTGAA
GAGTTTAACAAATGGAAAAAGAATTTGAGTATAAAAGGCTTCCCCGACTCTTTTAAAGATTATTTCAATGACGTGGCTCTAAAAACTCATCAATGTGAGCGTTTGACTCT
TCCTGGATTTAGTGGATGGATTCCAATGGTGGTAAATTGTCCAGAGTGTCCTCGCTTCATGGAAACCGTTAAAGTCTTCTTCTTCCTACAGCTGTTGCAGAGTCCCCCAG
AGGGGGAGTCGGGAATGCAAGACCAGAGATCAAACATGGAGGAAAATTCTCAAATACTTGAAATCTATGAGCTCCCATATTCGGACCTCTTGCTCTTGTCTACATCTTAC
CATTCGGAAGAGAACGAAAGGACGGAATCGTTAACCAAATCCATTCTTGAAGCCCTAGGGCCTAAAGGGCCTGGCCTTCTCGCAATCACCGGCGTCCCCAATTCTTCTAT
TCTCCGGCGAGCGCTGCTTCCTCTCGCTCGCAACCTCGCGTTGCTCAATCCTGAGGATCGGAAACGGATTCTCAAGGATCATAACATAGGGAGTGACGTTCCCCTGAGGA
ATCCAGAAAGAAGTGTCTCCTCTTTTGCTATGCAACTCAAATATACAGAGAGTAAAGTATTCATGCAGAATAATCAATGTCTGAGAGATGACAAACAGTCACCTGGTTCA
GAAATAGATCATTACAGCAATTGGGTTGGGAAGGAATTTCAGTACAATGAATTTAAACATCTTGGCGATTCATTTAAAGAGCTAGGAAGTTGCATGATGGAACTGGGGCT
TCGCATTGCATGCATATGCGATCGCAAAATCGGAGGCCAAGAGTTAGAACAAAGCTTGTTAGAGTCATGCACTGCAAAAGGCCGCCTCATACACTATCATTCGACTCTGG
ATGCCCAGCATTTAAGAAAACCAGCAACCAGCAAAGGATCTGCAAGAAACCGAGCCAATTCTACAAGAAACAGAGAACGAAGCATACACAGTAGGCAAGAGTTATCAGAA
AGTAATGGACTATGTCAATCTAGTACCAATCTATGGCAGCAATGGCATTATGACTATGGTATCTTCACAGTCCTAACAACTCCCATGTTTCTTTCGCCATCAAATACGCT
TGAAACCGAAGCACAAGATCAATGTTGCTATAGCGAGTGTACTTCTCCCAGTGGGCACTTGTATTTGCAAATTTTTGATCCTTGTAAGAATGACATTTTCATGGTTAATG
CTCCTCCAGAAAGTTTTATCATTCAGGTGGGCGAATCGGCTGACATTATATCACAAGGAAGGCTTCGCTCCACTCTGCACTCTGTGTGCAGACCTTCCAAACAAGAGAAC
TTGTGCAGAGAAATGTATGTTGTATTCTTGCAGCCAGCTTGGAACAAAACGTTTTCCATATCTGGCTATCCCATTGAAAGCTCAATGTTGTCTGAGGACAGAAAAGATCT
TGTTGAAACCGAGAGAACGATAATAACTCGAGAAATCCAGAAGATAGTTCCACCAATAGCGTCAAGATTGAAGGAAGGGATGACATTTGCAGAGTTCTCACGTGAAACCA
CCAAGCAATATTATGGGGGAAGCGGTTTGCAATTGAACAGATGA
Protein sequenceShow/hide protein sequence
MQGSRDRKAHQTTMNILDILISYPWEAKALMTLTAFTAEYGDIWHLNHYSHLDPLAKSLAMVKGISSLKKHQDSLRYRQLLSSPNSLIYNCLQAIKYMNQLRNFSKYDIR
ELSELSSALRQIPLVTYWVIHIIVASRTEISSYLNNTEGHPQKYLNELSEKISSILNILENHLNIVREQQVDRRKIDAKPFIDGSTLTQVNIEDGLKDKNVILVISGLNI
SDQDIKALHFVYNECAIYYKIAGLRFLEEKWQLREDPLVVVLNSKSKVEFTNAIHLIRVWGTDAIPFTYDKTDNLLRKNWPESTILKFTDHPRLQNWINQEKSILFYGGK
DPTWIQQFEEKVIDIKNDPLMSEKGITFETVRIGKNIKGEDDPTLMSRFWITQWGYFVMKSQLKGSSASETTEDILRLISYENENGWVVLAVGSTPLVVGRGNLILAVFE
EFNKWKKNLSIKGFPDSFKDYFNDVALKTHQCERLTLPGFSGWIPMVVNCPECPRFMETVKVFFFLQLLQSPPEGESGMQDQRSNMEENSQILEIYELPYSDLLLLSTSY
HSEENERTESLTKSILEALGPKGPGLLAITGVPNSSILRRALLPLARNLALLNPEDRKRILKDHNIGSDVPLRNPERSVSSFAMQLKYTESKVFMQNNQCLRDDKQSPGS
EIDHYSNWVGKEFQYNEFKHLGDSFKELGSCMMELGLRIACICDRKIGGQELEQSLLESCTAKGRLIHYHSTLDAQHLRKPATSKGSARNRANSTRNRERSIHSRQELSE
SNGLCQSSTNLWQQWHYDYGIFTVLTTPMFLSPSNTLETEAQDQCCYSECTSPSGHLYLQIFDPCKNDIFMVNAPPESFIIQVGESADIISQGRLRSTLHSVCRPSKQEN
LCREMYVVFLQPAWNKTFSISGYPIESSMLSEDRKDLVETERTIITREIQKIVPPIASRLKEGMTFAEFSRETTKQYYGGSGLQLNR