; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0003391 (gene) of Snake gourd v1 genome

Gene IDTan0003391
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionHistone-lysine N-methyltransferase SETD1B-like protein
Genome locationLG08:69553930..69557668
RNA-Seq ExpressionTan0003391
SyntenyTan0003391
Gene Ontology termsGO:0032259 - methylation (biological process)
GO:0008168 - methyltransferase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0043909.1 histone-lysine N-methyltransferase SETD1B-like isoform X2 [Cucumis melo var. makuwa]6.6e-18575.47Show/hide
Query:  MAQK-HLHELLKEDQEPFLLTNFIADRRSLLKRPSPKSHFLHLNKRKPISHSSDFPRKFCKTACFFSFTHSPDLRNPSPLFEFHSPVKSPSRNPNPIFLH
        MA+K HLHELLK+DQEPFLL+NFI DRRSLLKR S KSHF HL   KPISHS DF  KFC++ CFFSF HSPDL N SPLF F SPVK+P R+PNP+F H
Subjt:  MAQK-HLHELLKEDQEPFLLTNFIADRRSLLKRPSPKSHFLHLNKRKPISHSSDFPRKFCKTACFFSFTHSPDLRNPSPLFEFHSPVKSPSRNPNPIFLH

Query:  VPARTAGLLLEAALRIQKQSTPARSKSLPKSNGLGLFGSFLKRFTHRGRSRKREINGDCRRNDPRGSPPLPPKMAI--NKNENDSVSPQSNVTSFDFCES
        VPARTAGLLLEAALRIQKQST ARSKS  KSNGLGL GSFLKR THR RSRKREI+GD R NDPR  PPLP KMAI  N+ ENDSV   SNVT FDFCES
Subjt:  VPARTAGLLLEAALRIQKQSTPARSKSLPKSNGLGLFGSFLKRFTHRGRSRKREINGDCRRNDPRGSPPLPPKMAI--NKNENDSVSPQSNVTSFDFCES

Query:  NFCDSPFRFVLQSSPSAGHRTPEFSSPATSPARNDHQVNDVESLKKLPVEDEEEEKEQSSPVSVLDPPFEDDNEGHYEDGEDEDDYDLERSYAIVQKAKH
        N CDSPFRFVLQSS S GHRTPE SSP +SPAR DHQ NDVESL+KLP EDEEEEKEQSSPVSVLDPPFEDD+EG++EDGEDEDDY+LERS+AIVQKAKH
Subjt:  NFCDSPFRFVLQSSPSAGHRTPEFSSPATSPARNDHQVNDVESLKKLPVEDEEEEKEQSSPVSVLDPPFEDDNEGHYEDGEDEDDYDLERSYAIVQKAKH

Query:  QLLKKLRRFERLAELDPVELETFLLKDEEDELDD--DDDDIDHLKEE-EEYTSHNFDPSNNEKHIKQHNVEANGSSSFQIPHRPAKDTKRLVCNLIAGEK
        QLLKKLRRFERLAELDP+ELETFLL DE+ + D+  D DDIDHLKEE EEY          EK IKQHN E N SS FQ  +RP++DTK LVCNLI  E+
Subjt:  QLLKKLRRFERLAELDPVELETFLLKDEEDELDD--DDDDIDHLKEE-EEYTSHNFDPSNNEKHIKQHNVEANGSSSFQIPHRPAKDTKRLVCNLIAGEK

Query:  RDPVVIDEREEMRKRVYVRSDLWKRVDSRAIDVMAGQDLKAELDGWIRNGEQRGEIAIEIELAIFSLLVEEMQTELHCLTH
        R+ V I++REE  KRVY+R DLWKRVDS AIDVM G+DLK E+DGW RN E RGEI IEIE+AIFSLLVEEMQ+ELHCL H
Subjt:  RDPVVIDEREEMRKRVYVRSDLWKRVDSRAIDVMAGQDLKAELDGWIRNGEQRGEIAIEIELAIFSLLVEEMQTELHCLTH

XP_011651995.1 uncharacterized protein LOC105434967 [Cucumis sativus]6.8e-19076.35Show/hide
Query:  MAQK-HLHELLKEDQEPFLLTNFIADRRSLLKRPSPKSHFLHLNKRKPISHSSDFPRKFCKTACFFSFTHSPDLRNPSPLFEFHSPVKSPSRNPNPIFLH
        MA+K HLHELLK+DQEPFLL+NFI DRRSLLKR S KSHF HL   KPI HSSDF  KFC++ CFFSF HSPDL N SP F F SPVK+P RNPNP+F H
Subjt:  MAQK-HLHELLKEDQEPFLLTNFIADRRSLLKRPSPKSHFLHLNKRKPISHSSDFPRKFCKTACFFSFTHSPDLRNPSPLFEFHSPVKSPSRNPNPIFLH

Query:  VPARTAGLLLEAALRIQKQSTPARSKSLPKSNGLGLFGSFLKRFTHRGRSRKREINGDCRRNDPRGSPPLPPKMAI--NKNENDSVSPQSNVTSFDFCES
        VPARTAGLLLEAALRIQKQST ARSKS  KSNGLGL GSFLKR THR R+RKREI+GD R NDPR  PPLP KMAI  N+ ENDSV   SNVT FDFCES
Subjt:  VPARTAGLLLEAALRIQKQSTPARSKSLPKSNGLGLFGSFLKRFTHRGRSRKREINGDCRRNDPRGSPPLPPKMAI--NKNENDSVSPQSNVTSFDFCES

Query:  NFCDSPFRFVLQSSPSAGHRTPEFSSPATSPARNDHQVNDVESLKKLPVEDEEEEKEQSSPVSVLDPPFEDDNEGHYEDGEDEDDYDLERSYAIVQKAKH
        N CDSPFRFVLQSSPS GHRTPE SSPA+SPAR DHQ NDVESL+KLP EDEEEEKEQSSPVSVLDPPFEDD+EGH+EDGEDEDDY+LERS+AIVQKAKH
Subjt:  NFCDSPFRFVLQSSPSAGHRTPEFSSPATSPARNDHQVNDVESLKKLPVEDEEEEKEQSSPVSVLDPPFEDDNEGHYEDGEDEDDYDLERSYAIVQKAKH

Query:  QLLKKLRRFERLAELDPVELETFLLKDE---EDELDD-DDDDIDHLKEEEEYTSHNFDPSNNEKHIKQHNVEANGSSSFQIPHRPAKDTKRLVCNLIAGE
        QLLKKLRRFERLAELDP+ELETFLL DE   EDEL D D DDIDHLKEE E           EK IKQHN E N SS FQIP+RP++DTK LVCNLI  E
Subjt:  QLLKKLRRFERLAELDPVELETFLLKDE---EDELDD-DDDDIDHLKEEEEYTSHNFDPSNNEKHIKQHNVEANGSSSFQIPHRPAKDTKRLVCNLIAGE

Query:  KRDPVVIDEREEMRKRVYVRSDLWKRVDSRAIDVMAGQDLKAELDGWIRNGEQRGEIAIEIELAIFSLLVEEMQTELHCLTH
        +R+ VVI++ EE  KRVY+R DLWKRVDS AID+M G+DLK E+DGW  N E RGEIA+EIE+AIFSLLVEEMQ+ELHCLTH
Subjt:  KRDPVVIDEREEMRKRVYVRSDLWKRVDSRAIDVMAGQDLKAELDGWIRNGEQRGEIAIEIELAIFSLLVEEMQTELHCLTH

XP_022144766.1 uncharacterized protein LOC111014376 [Momordica charantia]8.0e-19178.59Show/hide
Query:  MAQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSPKSHFLHLNKRKPISHSSDFPRKFCKTACFFSFTHSPDLRNPSPLFEFHSPVKSPSRNPNPIFLHV
        M QKHLHELLKEDQEPF+LTNFIADRRSLLKRPSPKS+ LHL +RKPIS + DFP KFCK+ACFFSF  SPDLR  SPLFEF SPV    RNPN IFLHV
Subjt:  MAQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSPKSHFLHLNKRKPISHSSDFPRKFCKTACFFSFTHSPDLRNPSPLFEFHSPVKSPSRNPNPIFLHV

Query:  PARTAGLLLEAALRIQKQSTPARSKSLPKSNGLGLFGSFLKRFTHRGRSRKREINGDCRRNDPRGSPPLPPKMAI------NKNENDSVSPQSNVTSFDF
        PARTAG+LLEAALRIQKQST ARSK   K+NGLGL GSFLKR THRGR+RKREI+GD RRND  G  PLP KMAI      N NEN SVS Q+N+TSF F
Subjt:  PARTAGLLLEAALRIQKQSTPARSKSLPKSNGLGLFGSFLKRFTHRGRSRKREINGDCRRNDPRGSPPLPPKMAI------NKNENDSVSPQSNVTSFDF

Query:  CESNFCDSPFRFVLQSSPSAGHRTPEFSSPATSPARNDHQVNDVESLKKLPVEDEEEEKEQSSPVSVLDPPFEDDNEGHYEDGEDEDDYDLERSYAIVQK
        CESNFCDSPFRFVLQSSPS+GHRTPEFSSPA SP R DHQ NDVESLKKLPVEDEEEEKEQSSPVS+LDPPFEDD+EGHYEDGEDED YDLERSY IVQK
Subjt:  CESNFCDSPFRFVLQSSPSAGHRTPEFSSPATSPARNDHQVNDVESLKKLPVEDEEEEKEQSSPVSVLDPPFEDDNEGHYEDGEDEDDYDLERSYAIVQK

Query:  AKHQLLKKLRRFERLAELDPVELETFLLKDEEDELDDDDDDIDHLKEEEEYTSHNFDPSNNEKHIKQHNVEANGSSSFQIPHRPAKDTKRLVCNLIAGEK
        AKHQLLKKLRRFE+LAELDPVELE+FLLK EEDEL DDDDDIDHLK EEEY SHNF+         QH+VEANGSSSFQIPH       RLV N I GE+
Subjt:  AKHQLLKKLRRFERLAELDPVELETFLLKDEEDELDDDDDDIDHLKEEEEYTSHNFDPSNNEKHIKQHNVEANGSSSFQIPHRPAKDTKRLVCNLIAGEK

Query:  RDPVVIDEREEMRKRVYVRSDLWKRVDSRAIDVMAGQDLKAELDGWIRNGEQRGEIAIEIELAIFSLLVEEMQTELHCLTH
        RD  V D REEM K VYVRSDLWKRVDS AID   GQDLK ELDGW RN +QRGE+AIEIELAIFSLLV EMQTEL CLTH
Subjt:  RDPVVIDEREEMRKRVYVRSDLWKRVDSRAIDVMAGQDLKAELDGWIRNGEQRGEIAIEIELAIFSLLVEEMQTELHCLTH

XP_022945267.1 uncharacterized protein LOC111449564 [Cucurbita moschata]2.1e-17073.15Show/hide
Query:  MAQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSPKSHFLHLNKRKPISHSSDFPRKFCKTACFFSFTHSPDLRNPSPLFEFHSPVKSPSRNPNPIFLHV
        MAQKHLHELLKEDQEPFLLTNFIADRR +LKRPSPKSH LHLNKRKPISH SDFP  FCK ACF SF  SPDLRNPSPLF+F SPVKSP RN N +FLHV
Subjt:  MAQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSPKSHFLHLNKRKPISHSSDFPRKFCKTACFFSFTHSPDLRNPSPLFEFHSPVKSPSRNPNPIFLHV

Query:  PARTAGLLLEAALRIQKQSTPARSKSLPKSNGLGLFGSFLKRFTHRGRSRKREINGDCRRNDPRGSPPLPPKMAINKNENDSVSPQSNVTSFDFCESNFC
        PA TAGLLLEAALRIQKQST AR      SNG GL GSFLKRFTHRGRSRKREI+G CRRNDPR    LPP      NE DSVS QSNVTS DFCE    
Subjt:  PARTAGLLLEAALRIQKQSTPARSKSLPKSNGLGLFGSFLKRFTHRGRSRKREINGDCRRNDPRGSPPLPPKMAINKNENDSVSPQSNVTSFDFCESNFC

Query:  DSPFRFVLQSSPSAGHRTPEFSSPATSPARNDHQVNDVESLKKLPVEDEEEEKEQSSPVSVLDPPFEDDNEGHYEDGEDEDDYDLERSYAIVQKAKHQLL
         SPFRFVLQSSPSAGHRTPEFSSP +SPAR+DHQVNDVESLKKLPV+DEEEEKEQSSPVSVLDPPFEDD EG YEDGED+DDY++ERSYAIV+KAKHQLL
Subjt:  DSPFRFVLQSSPSAGHRTPEFSSPATSPARNDHQVNDVESLKKLPVEDEEEEKEQSSPVSVLDPPFEDDNEGHYEDGEDEDDYDLERSYAIVQKAKHQLL

Query:  KKLRRFERLAELDPVELETFLLKDEEDELDDDDDDIDHLKEEEEYTSHNFDPSNNEKHIKQHNVEANGSSSFQIPHRPAKDTKRLVCNLIAGEKRDPVVI
        KKLRRFERLAELDPVELETFLLKDEE EL  DDDDIDHLK EEE  SHNFD SNNEK +KQH ++ N                                 
Subjt:  KKLRRFERLAELDPVELETFLLKDEEDELDDDDDDIDHLKEEEEYTSHNFDPSNNEKHIKQHNVEANGSSSFQIPHRPAKDTKRLVCNLIAGEKRDPVVI

Query:  DEREEMRKRVYVRSDLWKRVDSRAIDVMAGQDLKAEL-DGWIRNGEQRGEIAIEIELAIFSLLVEEMQTELHC
               +RVY+R DLWK V+S AIDVMAG+DL+AE+ DGW RNGE RG+IAIEIE+ IF LLVEEMQTE+ C
Subjt:  DEREEMRKRVYVRSDLWKRVDSRAIDVMAGQDLKAEL-DGWIRNGEQRGEIAIEIELAIFSLLVEEMQTELHC

XP_038903007.1 uncharacterized protein LOC120089713 [Benincasa hispida]9.1e-20380.34Show/hide
Query:  MAQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSPKSHFLHLNKRKPISHSSDFPRKFCKTACFFSFTHSPDLRNPSPLFEFHSPVKSPSRNPNPIFLHV
        MAQKHLHELLKEDQEPFLLTNFIADRRSLLKRPS KSHF HLN  KPISHSSDFP KFC++ACFFSF HSPDL N SPLF F SPVK+P RNPNPIFLHV
Subjt:  MAQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSPKSHFLHLNKRKPISHSSDFPRKFCKTACFFSFTHSPDLRNPSPLFEFHSPVKSPSRNPNPIFLHV

Query:  PARTAGLLLEAALRIQKQSTPARSKSLPKSNGLGLFGSFLKRFTHRGRSRKREINGDCRRNDPRGSPPLPPKMAI--NKNENDSVSPQSNVTSFDFCESN
        PARTAGLLLEAALRIQKQST ARSKSL KSNGLG+ GSFLKR THRGR+RKREI+GD R+NDPR  PPLP KMAI  N+NENDSVS  SNVT FDFC+SN
Subjt:  PARTAGLLLEAALRIQKQSTPARSKSLPKSNGLGLFGSFLKRFTHRGRSRKREINGDCRRNDPRGSPPLPPKMAI--NKNENDSVSPQSNVTSFDFCESN

Query:  FCDSPFRFVLQSSPSAGHRTPEFSSPATSPARNDHQVNDVESLKKLPVEDEEEEKEQSSPVSVLDPPFEDDNEGHYEDGEDEDDYDLERSYAIVQKAKHQ
         CDSPFRFVLQSSPS GH+TPE +SPA+SPAR DHQ NDVE LKKLPVEDEEEEKEQSSPVSVLDPPFEDD+EGHYEDGEDEDDY+LERS+AIVQ+AKHQ
Subjt:  FCDSPFRFVLQSSPSAGHRTPEFSSPATSPARNDHQVNDVESLKKLPVEDEEEEKEQSSPVSVLDPPFEDDNEGHYEDGEDEDDYDLERSYAIVQKAKHQ

Query:  LLKKLRRFERLAELDPVELETFLLKDEEDELDDDDDDIDHLKEEEEYTSHNFDPSNNEKHIKQHNVEANGSSSFQIPHRPAKDTKRLVCNLIAGEKRDPV
        LLKKLRRFERLAELDPVELETFLLKDE+++ D+DDDDIDHLKEEE+Y          +K IK+H++EAN SS FQIPHRPA+D   LVCNL+  E+RD V
Subjt:  LLKKLRRFERLAELDPVELETFLLKDEEDELDDDDDDIDHLKEEEEYTSHNFDPSNNEKHIKQHNVEANGSSSFQIPHRPAKDTKRLVCNLIAGEKRDPV

Query:  VIDEREEMRKRVYVRSDLWKRVDSRAIDVMAGQDLKAELDGWIRNGEQRGEIAIEIELAIFSLLVEEMQTELH
        VI++REEM K +YVRSDLWKRVDS AI+VM GQDLK E+DGW RN EQR EIAIEIE+AIFSLLVEEMQ ELH
Subjt:  VIDEREEMRKRVYVRSDLWKRVDSRAIDVMAGQDLKAELDGWIRNGEQRGEIAIEIELAIFSLLVEEMQTELH

TrEMBL top hitse value%identityAlignment
A0A0A0LAR8 Uncharacterized protein3.3e-19076.35Show/hide
Query:  MAQK-HLHELLKEDQEPFLLTNFIADRRSLLKRPSPKSHFLHLNKRKPISHSSDFPRKFCKTACFFSFTHSPDLRNPSPLFEFHSPVKSPSRNPNPIFLH
        MA+K HLHELLK+DQEPFLL+NFI DRRSLLKR S KSHF HL   KPI HSSDF  KFC++ CFFSF HSPDL N SP F F SPVK+P RNPNP+F H
Subjt:  MAQK-HLHELLKEDQEPFLLTNFIADRRSLLKRPSPKSHFLHLNKRKPISHSSDFPRKFCKTACFFSFTHSPDLRNPSPLFEFHSPVKSPSRNPNPIFLH

Query:  VPARTAGLLLEAALRIQKQSTPARSKSLPKSNGLGLFGSFLKRFTHRGRSRKREINGDCRRNDPRGSPPLPPKMAI--NKNENDSVSPQSNVTSFDFCES
        VPARTAGLLLEAALRIQKQST ARSKS  KSNGLGL GSFLKR THR R+RKREI+GD R NDPR  PPLP KMAI  N+ ENDSV   SNVT FDFCES
Subjt:  VPARTAGLLLEAALRIQKQSTPARSKSLPKSNGLGLFGSFLKRFTHRGRSRKREINGDCRRNDPRGSPPLPPKMAI--NKNENDSVSPQSNVTSFDFCES

Query:  NFCDSPFRFVLQSSPSAGHRTPEFSSPATSPARNDHQVNDVESLKKLPVEDEEEEKEQSSPVSVLDPPFEDDNEGHYEDGEDEDDYDLERSYAIVQKAKH
        N CDSPFRFVLQSSPS GHRTPE SSPA+SPAR DHQ NDVESL+KLP EDEEEEKEQSSPVSVLDPPFEDD+EGH+EDGEDEDDY+LERS+AIVQKAKH
Subjt:  NFCDSPFRFVLQSSPSAGHRTPEFSSPATSPARNDHQVNDVESLKKLPVEDEEEEKEQSSPVSVLDPPFEDDNEGHYEDGEDEDDYDLERSYAIVQKAKH

Query:  QLLKKLRRFERLAELDPVELETFLLKDE---EDELDD-DDDDIDHLKEEEEYTSHNFDPSNNEKHIKQHNVEANGSSSFQIPHRPAKDTKRLVCNLIAGE
        QLLKKLRRFERLAELDP+ELETFLL DE   EDEL D D DDIDHLKEE E           EK IKQHN E N SS FQIP+RP++DTK LVCNLI  E
Subjt:  QLLKKLRRFERLAELDPVELETFLLKDE---EDELDD-DDDDIDHLKEEEEYTSHNFDPSNNEKHIKQHNVEANGSSSFQIPHRPAKDTKRLVCNLIAGE

Query:  KRDPVVIDEREEMRKRVYVRSDLWKRVDSRAIDVMAGQDLKAELDGWIRNGEQRGEIAIEIELAIFSLLVEEMQTELHCLTH
        +R+ VVI++ EE  KRVY+R DLWKRVDS AID+M G+DLK E+DGW  N E RGEIA+EIE+AIFSLLVEEMQ+ELHCLTH
Subjt:  KRDPVVIDEREEMRKRVYVRSDLWKRVDSRAIDVMAGQDLKAELDGWIRNGEQRGEIAIEIELAIFSLLVEEMQTELHCLTH

A0A5D3DNQ5 Histone-lysine N-methyltransferase SETD1B-like isoform X23.2e-18575.47Show/hide
Query:  MAQK-HLHELLKEDQEPFLLTNFIADRRSLLKRPSPKSHFLHLNKRKPISHSSDFPRKFCKTACFFSFTHSPDLRNPSPLFEFHSPVKSPSRNPNPIFLH
        MA+K HLHELLK+DQEPFLL+NFI DRRSLLKR S KSHF HL   KPISHS DF  KFC++ CFFSF HSPDL N SPLF F SPVK+P R+PNP+F H
Subjt:  MAQK-HLHELLKEDQEPFLLTNFIADRRSLLKRPSPKSHFLHLNKRKPISHSSDFPRKFCKTACFFSFTHSPDLRNPSPLFEFHSPVKSPSRNPNPIFLH

Query:  VPARTAGLLLEAALRIQKQSTPARSKSLPKSNGLGLFGSFLKRFTHRGRSRKREINGDCRRNDPRGSPPLPPKMAI--NKNENDSVSPQSNVTSFDFCES
        VPARTAGLLLEAALRIQKQST ARSKS  KSNGLGL GSFLKR THR RSRKREI+GD R NDPR  PPLP KMAI  N+ ENDSV   SNVT FDFCES
Subjt:  VPARTAGLLLEAALRIQKQSTPARSKSLPKSNGLGLFGSFLKRFTHRGRSRKREINGDCRRNDPRGSPPLPPKMAI--NKNENDSVSPQSNVTSFDFCES

Query:  NFCDSPFRFVLQSSPSAGHRTPEFSSPATSPARNDHQVNDVESLKKLPVEDEEEEKEQSSPVSVLDPPFEDDNEGHYEDGEDEDDYDLERSYAIVQKAKH
        N CDSPFRFVLQSS S GHRTPE SSP +SPAR DHQ NDVESL+KLP EDEEEEKEQSSPVSVLDPPFEDD+EG++EDGEDEDDY+LERS+AIVQKAKH
Subjt:  NFCDSPFRFVLQSSPSAGHRTPEFSSPATSPARNDHQVNDVESLKKLPVEDEEEEKEQSSPVSVLDPPFEDDNEGHYEDGEDEDDYDLERSYAIVQKAKH

Query:  QLLKKLRRFERLAELDPVELETFLLKDEEDELDD--DDDDIDHLKEE-EEYTSHNFDPSNNEKHIKQHNVEANGSSSFQIPHRPAKDTKRLVCNLIAGEK
        QLLKKLRRFERLAELDP+ELETFLL DE+ + D+  D DDIDHLKEE EEY          EK IKQHN E N SS FQ  +RP++DTK LVCNLI  E+
Subjt:  QLLKKLRRFERLAELDPVELETFLLKDEEDELDD--DDDDIDHLKEE-EEYTSHNFDPSNNEKHIKQHNVEANGSSSFQIPHRPAKDTKRLVCNLIAGEK

Query:  RDPVVIDEREEMRKRVYVRSDLWKRVDSRAIDVMAGQDLKAELDGWIRNGEQRGEIAIEIELAIFSLLVEEMQTELHCLTH
        R+ V I++REE  KRVY+R DLWKRVDS AIDVM G+DLK E+DGW RN E RGEI IEIE+AIFSLLVEEMQ+ELHCL H
Subjt:  RDPVVIDEREEMRKRVYVRSDLWKRVDSRAIDVMAGQDLKAELDGWIRNGEQRGEIAIEIELAIFSLLVEEMQTELHCLTH

A0A6J1CUE0 uncharacterized protein LOC1110143763.9e-19178.59Show/hide
Query:  MAQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSPKSHFLHLNKRKPISHSSDFPRKFCKTACFFSFTHSPDLRNPSPLFEFHSPVKSPSRNPNPIFLHV
        M QKHLHELLKEDQEPF+LTNFIADRRSLLKRPSPKS+ LHL +RKPIS + DFP KFCK+ACFFSF  SPDLR  SPLFEF SPV    RNPN IFLHV
Subjt:  MAQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSPKSHFLHLNKRKPISHSSDFPRKFCKTACFFSFTHSPDLRNPSPLFEFHSPVKSPSRNPNPIFLHV

Query:  PARTAGLLLEAALRIQKQSTPARSKSLPKSNGLGLFGSFLKRFTHRGRSRKREINGDCRRNDPRGSPPLPPKMAI------NKNENDSVSPQSNVTSFDF
        PARTAG+LLEAALRIQKQST ARSK   K+NGLGL GSFLKR THRGR+RKREI+GD RRND  G  PLP KMAI      N NEN SVS Q+N+TSF F
Subjt:  PARTAGLLLEAALRIQKQSTPARSKSLPKSNGLGLFGSFLKRFTHRGRSRKREINGDCRRNDPRGSPPLPPKMAI------NKNENDSVSPQSNVTSFDF

Query:  CESNFCDSPFRFVLQSSPSAGHRTPEFSSPATSPARNDHQVNDVESLKKLPVEDEEEEKEQSSPVSVLDPPFEDDNEGHYEDGEDEDDYDLERSYAIVQK
        CESNFCDSPFRFVLQSSPS+GHRTPEFSSPA SP R DHQ NDVESLKKLPVEDEEEEKEQSSPVS+LDPPFEDD+EGHYEDGEDED YDLERSY IVQK
Subjt:  CESNFCDSPFRFVLQSSPSAGHRTPEFSSPATSPARNDHQVNDVESLKKLPVEDEEEEKEQSSPVSVLDPPFEDDNEGHYEDGEDEDDYDLERSYAIVQK

Query:  AKHQLLKKLRRFERLAELDPVELETFLLKDEEDELDDDDDDIDHLKEEEEYTSHNFDPSNNEKHIKQHNVEANGSSSFQIPHRPAKDTKRLVCNLIAGEK
        AKHQLLKKLRRFE+LAELDPVELE+FLLK EEDEL DDDDDIDHLK EEEY SHNF+         QH+VEANGSSSFQIPH       RLV N I GE+
Subjt:  AKHQLLKKLRRFERLAELDPVELETFLLKDEEDELDDDDDDIDHLKEEEEYTSHNFDPSNNEKHIKQHNVEANGSSSFQIPHRPAKDTKRLVCNLIAGEK

Query:  RDPVVIDEREEMRKRVYVRSDLWKRVDSRAIDVMAGQDLKAELDGWIRNGEQRGEIAIEIELAIFSLLVEEMQTELHCLTH
        RD  V D REEM K VYVRSDLWKRVDS AID   GQDLK ELDGW RN +QRGE+AIEIELAIFSLLV EMQTEL CLTH
Subjt:  RDPVVIDEREEMRKRVYVRSDLWKRVDSRAIDVMAGQDLKAELDGWIRNGEQRGEIAIEIELAIFSLLVEEMQTELHCLTH

A0A6J1G0G0 uncharacterized protein LOC1114495641.0e-17073.15Show/hide
Query:  MAQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSPKSHFLHLNKRKPISHSSDFPRKFCKTACFFSFTHSPDLRNPSPLFEFHSPVKSPSRNPNPIFLHV
        MAQKHLHELLKEDQEPFLLTNFIADRR +LKRPSPKSH LHLNKRKPISH SDFP  FCK ACF SF  SPDLRNPSPLF+F SPVKSP RN N +FLHV
Subjt:  MAQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSPKSHFLHLNKRKPISHSSDFPRKFCKTACFFSFTHSPDLRNPSPLFEFHSPVKSPSRNPNPIFLHV

Query:  PARTAGLLLEAALRIQKQSTPARSKSLPKSNGLGLFGSFLKRFTHRGRSRKREINGDCRRNDPRGSPPLPPKMAINKNENDSVSPQSNVTSFDFCESNFC
        PA TAGLLLEAALRIQKQST AR      SNG GL GSFLKRFTHRGRSRKREI+G CRRNDPR    LPP      NE DSVS QSNVTS DFCE    
Subjt:  PARTAGLLLEAALRIQKQSTPARSKSLPKSNGLGLFGSFLKRFTHRGRSRKREINGDCRRNDPRGSPPLPPKMAINKNENDSVSPQSNVTSFDFCESNFC

Query:  DSPFRFVLQSSPSAGHRTPEFSSPATSPARNDHQVNDVESLKKLPVEDEEEEKEQSSPVSVLDPPFEDDNEGHYEDGEDEDDYDLERSYAIVQKAKHQLL
         SPFRFVLQSSPSAGHRTPEFSSP +SPAR+DHQVNDVESLKKLPV+DEEEEKEQSSPVSVLDPPFEDD EG YEDGED+DDY++ERSYAIV+KAKHQLL
Subjt:  DSPFRFVLQSSPSAGHRTPEFSSPATSPARNDHQVNDVESLKKLPVEDEEEEKEQSSPVSVLDPPFEDDNEGHYEDGEDEDDYDLERSYAIVQKAKHQLL

Query:  KKLRRFERLAELDPVELETFLLKDEEDELDDDDDDIDHLKEEEEYTSHNFDPSNNEKHIKQHNVEANGSSSFQIPHRPAKDTKRLVCNLIAGEKRDPVVI
        KKLRRFERLAELDPVELETFLLKDEE EL  DDDDIDHLK EEE  SHNFD SNNEK +KQH ++ N                                 
Subjt:  KKLRRFERLAELDPVELETFLLKDEEDELDDDDDDIDHLKEEEEYTSHNFDPSNNEKHIKQHNVEANGSSSFQIPHRPAKDTKRLVCNLIAGEKRDPVVI

Query:  DEREEMRKRVYVRSDLWKRVDSRAIDVMAGQDLKAEL-DGWIRNGEQRGEIAIEIELAIFSLLVEEMQTELHC
               +RVY+R DLWK V+S AIDVMAG+DL+AE+ DGW RNGE RG+IAIEIE+ IF LLVEEMQTE+ C
Subjt:  DEREEMRKRVYVRSDLWKRVDSRAIDVMAGQDLKAEL-DGWIRNGEQRGEIAIEIELAIFSLLVEEMQTELHC

A0A6J1L3C1 uncharacterized protein LOC1114987351.3e-16772.21Show/hide
Query:  MAQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSPKSHFLHLNKRKPISHSSDFPRKFCKTACFFSFTHSPDLRNPSPLFEFHSPVKSPSRNPNPIFLHV
        MAQKHLHELLKEDQEPFLLTNFIA+RR +LKRPSPKSH LHLNK KPISH +DFP  FCK ACF SF HSPDLRNPSPLF+F SPVKSP RN N +FLHV
Subjt:  MAQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSPKSHFLHLNKRKPISHSSDFPRKFCKTACFFSFTHSPDLRNPSPLFEFHSPVKSPSRNPNPIFLHV

Query:  PARTAGLLLEAALRIQKQSTPARSKSLPKSNGLGLFGSFLKRFTHRGRSRKREINGDCRRNDPRGSPPLPPKMAINKNE--NDSVSPQSNVTSFDFCESN
        PA TA LLLEAALRIQKQSTPAR      SNG GL GSFLKRFT+RGRSRKREI+G CRRNDP  +     KMAIN+NE  NDSVS QSNVTS     S+
Subjt:  PARTAGLLLEAALRIQKQSTPARSKSLPKSNGLGLFGSFLKRFTHRGRSRKREINGDCRRNDPRGSPPLPPKMAINKNE--NDSVSPQSNVTSFDFCESN

Query:  FCDSPFRFVLQSSPSAGHRTPEFSSPATSPARNDHQVNDVESLKKLPVEDEEEEKEQSSPVSVLDPPFEDDNEGHYEDGEDEDDYDLERSYAIVQKAKHQ
        FCDSPFRFVLQSSPSAGHRTPEFSSP +SPAR+DHQVNDVESLKKLPV+DEEEEKEQSSPVSVLDPPFEDD EG YEDGED+DDY +ERSYAIVQKAKHQ
Subjt:  FCDSPFRFVLQSSPSAGHRTPEFSSPATSPARNDHQVNDVESLKKLPVEDEEEEKEQSSPVSVLDPPFEDDNEGHYEDGEDEDDYDLERSYAIVQKAKHQ

Query:  LLKKLRRFERLAELDPVELETFLLKDEEDELDDDDDDIDHLKEEEEYTSHNFDPSNNEKHIKQHNVEANGSSSFQIPHRPAKDTKRLVCNLIAGEKRDPV
        LLKKLRRFERLAELDPVELETFLLKDEE +LDDD    DHL EEEE  SHNFD SNNEK +KQH +E+N                               
Subjt:  LLKKLRRFERLAELDPVELETFLLKDEEDELDDDDDDIDHLKEEEEYTSHNFDPSNNEKHIKQHNVEANGSSSFQIPHRPAKDTKRLVCNLIAGEKRDPV

Query:  VIDEREEMRKRVYVRSDLWKRVDSRAIDVMAGQDLKAELD-GWIRNGEQRGEIAIEIELAIFSLLVEEMQTELHC
                 +RVY+R DLWK V+S AIDVMA +DL+AE+D GW RNGE+RG+IAIEIE+ IF LLVEEMQTE+ C
Subjt:  VIDEREEMRKRVYVRSDLWKRVDSRAIDVMAGQDLKAELD-GWIRNGEQRGEIAIEIELAIFSLLVEEMQTELHC

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G36420.1 unknown protein9.2e-5236.89Show/hide
Query:  QKHLHELLKEDQEPFLLTNFIADRRSLLKRPSPKSHFLHLNKRKPISHSSDFPRKF-CKTACFFSFTHSPDLRNPSPLFEFHSPVKSPSRNPNPIFLHVP
        +KHLHE L++DQEPF L ++I + RS +         + + KRK  + ++  P  F C+ +CFF+   SPD R  SPLFE  SP K   R+   +FL +P
Subjt:  QKHLHELLKEDQEPFLLTNFIADRRSLLKRPSPKSHFLHLNKRKPISHSSDFPRKF-CKTACFFSFTHSPDLRNPSPLFEFHSPVKSPSRNPNPIFLHVP

Query:  ARTAGLLLEAALRIQKQST--PARSKSLPKSNGLGLFGSFLKRFTHRGRSRKREINGDCRR-NDPRGSPPLPPKMAINKNENDSVSPQSNVTSFDFCESN
        ARTA +LL+AA RIQKQ +     +K+  + NG G+FGS LK  T+R  ++ R  N D    +  RGS P             S   +  V   D C   
Subjt:  ARTAGLLLEAALRIQKQST--PARSKSLPKSNGLGLFGSFLKRFTHRGRSRKREINGDCRR-NDPRGSPPLPPKMAINKNENDSVSPQSNVTSFDFCESN

Query:  FCDSPFRFVLQSSP-SAGHRTPEFSSPATSPAR---NDHQVNDVESLKKLPVED----EEEEKEQSSPVSVLDPPFEDDNEGHYEDGEDEDDYDLERSYA
        FC+SPF FVLQ++P S+GH+TP F+S ATSPAR    D   ++ ESL+K+  ++    EEE+KEQ SPVSVLDP  E++ +  +   E +   +L  S+ 
Subjt:  FCDSPFRFVLQSSP-SAGHRTPEFSSPATSPAR---NDHQVNDVESLKKLPVED----EEEEKEQSSPVSVLDPPFEDDNEGHYEDGEDEDDYDLERSYA

Query:  IVQKAKHQLLKKLRRFERLAELDPVELETFLLKDEEDELD-----DDDDDIDHLKEEEEYTSHNFDPSNNEKHIKQHNVEANGSSSFQIPHRPAKDTKRL
        IVQ+AK +LLKKLRRFE+LA LDPVELE  + ++E++E +     ++DD+I     +EEY   +               EA    S     R A+D KR 
Subjt:  IVQKAKHQLLKKLRRFERLAELDPVELETFLLKDEEDELD-----DDDDDIDHLKEEEEYTSHNFDPSNNEKHIKQHNVEANGSSSFQIPHRPAKDTKRL

Query:  VCNLIAGEKRDPVVIDEREEMRKRVYVRSDLWKRVDSRA---IDVMAGQDLKAELDGWIRNGEQRGEIAIEIELAIFSLLVEEMQTEL
                        ++ + R++ +   + W RV   A   +D +  +DL+ E   W R+G +  E   ++E +IF +L++E   EL
Subjt:  VCNLIAGEKRDPVVIDEREEMRKRVYVRSDLWKRVDSRA---IDVMAGQDLKAELDGWIRNGEQRGEIAIEIELAIFSLLVEEMQTEL

AT5G03670.1 unknown protein2.1e-6437.84Show/hide
Query:  AQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSPKSHFLHL--NKRKPISHSSDFPRKFCKTACFFSFTHSPDLRNPSPLFEFHSPVKSPSRNPNPIFLH
        +Q+HL +LL+EDQEPF L ++I+DRR  +      +H  HL   KR+PIS ++  P +FC+ ACFFS   SPD +  SPLFE    +KSP+R+ N IF++
Subjt:  AQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSPKSHFLHL--NKRKPISHSSDFPRKFCKTACFFSFTHSPDLRNPSPLFEFHSPVKSPSRNPNPIFLH

Query:  VPARTAGLLLEAALRIQKQSTP-ARSKSLPKSNGLGLFGSFLKRFTHRGRSRKREING---------DCRRNDPRGSPPLPPKMAINK---NENDSVSPQ
        +PARTA +LLEAA+RIQKQS+  +++++    N  G+FGS LK+ T+R   +KREI+G            ++  R   P+  K+   K   NE ++ S Q
Subjt:  VPARTAGLLLEAALRIQKQSTP-ARSKSLPKSNGLGLFGSFLKRFTHRGRSRKREING---------DCRRNDPRGSPPLPPKMAINK---NENDSVSPQ

Query:  S---------------------NVT----------------------SFDFC----------ESNFCDSPFRFVLQSSPS-AGHRTPEFSSPATSPARND
        +                     +VT                      S +F           +  FC+SPF FVLQ+ PS  G RTP FSSPA SP  + 
Subjt:  S---------------------NVT----------------------SFDFC----------ESNFCDSPFRFVLQSSPS-AGHRTPEFSSPATSPARND

Query:  HQVN----DVESLKKLPVEDEEEEKEQSSPVSVLDPPFEDDNEGHYEDGEDEDDYDLERSYAIVQKAKHQLLKKLRRFERLAELDPVELETFLLKDEEDE
        H++     +VE LKKL +E+EEEEKEQSSPVSVLDPPF+DD+E  +      DD ++  S+  VQKAKH LL+KL RFE+LA LDP+ELE   + D+E E
Subjt:  HQVN----DVESLKKLPVEDEEEEKEQSSPVSVLDPPFEDDNEGHYEDGEDEDDYDLERSYAIVQKAKHQLLKKLRRFERLAELDPVELETFLLKDEEDE

Query:  LDDDDDDIDHLKEEEEYTSHNFDPSNNEKHIKQHNVEANGSSSFQIPHRPAKDTKRLVCNLIAGEKRDPVVIDEREE---MRKRVYVRSDLWKRVDSRAI
         ++++       EEEE  S        ++ +K +  E       ++P    +  + L+ +L A E   P  ID   E   + KRV  R   W+ V+S  I
Subjt:  LDDDDDDIDHLKEEEEYTSHNFDPSNNEKHIKQHNVEANGSSSFQIPHRPAKDTKRLVCNLIAGEKRDPVVIDEREE---MRKRVYVRSDLWKRVDSRAI

Query:  DVMAGQDLKAELDG-W-IRNGEQRGEIAIEIELAIFSLLVEEMQTEL
        D+M   D + E  G W  +N     E  ++IE  IF  LVEE+  ++
Subjt:  DVMAGQDLKAELDG-W-IRNGEQRGEIAIEIELAIFSLLVEEMQTEL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTCAAAAGCACTTACACGAGCTTCTGAAAGAGGATCAAGAGCCCTTTCTTCTCACCAATTTCATCGCCGATAGACGCTCCCTTCTCAAACGCCCTTCCCCCAAATC
CCATTTTCTTCACCTCAACAAACGAAAACCCATTTCCCATTCCTCTGATTTTCCCCGAAAATTTTGCAAGACCGCCTGTTTTTTCTCCTTCACTCATTCCCCTGATCTCA
GAAACCCTTCGCCGCTCTTTGAATTTCACTCTCCGGTCAAGAGCCCTTCCCGGAACCCCAATCCCATTTTCCTCCATGTTCCGGCTAGAACGGCGGGGCTTCTCTTGGAA
GCTGCTTTGAGGATTCAGAAACAGTCAACGCCCGCCAGATCCAAATCCCTACCGAAATCGAATGGTTTAGGGCTTTTCGGTTCTTTTCTTAAGCGCTTTACTCATCGCGG
CCGTTCTCGGAAGCGAGAGATCAACGGCGACTGCCGGAGAAATGACCCCCGCGGCAGCCCGCCACTGCCGCCGAAAATGGCGATTAACAAGAATGAGAACGACTCTGTTT
CTCCGCAGAGTAATGTAACGAGCTTTGATTTCTGCGAGAGTAATTTTTGCGATAGCCCTTTTCGGTTCGTGCTTCAATCGAGCCCCTCCGCCGGTCACCGGACGCCGGAG
TTCTCTTCTCCGGCAACTTCTCCGGCTCGAAACGACCATCAGGTCAATGATGTAGAGAGCTTGAAGAAATTGCCAGTTGAGGATGAGGAGGAAGAGAAAGAACAGAGCAG
TCCCGTGTCTGTGTTGGATCCTCCATTTGAGGATGATAACGAAGGACATTATGAGGATGGTGAGGATGAGGACGATTACGATTTGGAACGCAGCTACGCCATTGTACAAA
AGGCGAAGCATCAGCTACTGAAAAAACTTCGGAGATTCGAGAGACTAGCAGAACTAGATCCAGTAGAACTCGAGACGTTTCTACTAAAAGATGAGGAAGACGAACTCGAC
GATGACGATGATGACATTGATCATCTCAAGGAAGAAGAAGAGTACACAAGCCATAACTTTGATCCATCTAATAACGAAAAACACATCAAACAACACAACGTAGAGGCGAA
TGGCAGTTCAAGCTTCCAAATTCCTCACCGACCCGCAAAAGATACGAAGAGACTCGTCTGCAATCTCATAGCCGGGGAAAAGAGAGATCCGGTTGTGATCGACGAGAGAG
AAGAGATGAGAAAGAGAGTCTACGTGAGATCAGATTTGTGGAAACGGGTGGACTCGAGGGCCATCGACGTGATGGCGGGGCAAGATTTGAAAGCAGAGCTTGATGGGTGG
ATCAGAAATGGGGAGCAAAGAGGAGAAATAGCCATAGAAATAGAGCTTGCAATCTTCAGCTTGCTAGTGGAGGAAATGCAAACTGAGCTACATTGCTTAACTCATTAA
mRNA sequenceShow/hide mRNA sequence
GTCCTTCTCTCCACATTCCACTCTCTCCCCTCACACAGTTCTTTTGCTCTGCTTTATTTATTTCTTAATCTTCTGTCAACTCATCTTTCTCGACCTTTTTTTCCCACTGA
AAGTTGAAATTTCTTCAACTTGGCTTTACTGGAATCAAAATTCTCGAATTCCCCCATTTTAGCTTTCGTCTTCTCTGAATATTCACTCCCCTTTTGCAGTTGCAGAGAAC
ACATACACACTCCCCATTGATGGCTCAAAAGCACTTACACGAGCTTCTGAAAGAGGATCAAGAGCCCTTTCTTCTCACCAATTTCATCGCCGATAGACGCTCCCTTCTCA
AACGCCCTTCCCCCAAATCCCATTTTCTTCACCTCAACAAACGAAAACCCATTTCCCATTCCTCTGATTTTCCCCGAAAATTTTGCAAGACCGCCTGTTTTTTCTCCTTC
ACTCATTCCCCTGATCTCAGAAACCCTTCGCCGCTCTTTGAATTTCACTCTCCGGTCAAGAGCCCTTCCCGGAACCCCAATCCCATTTTCCTCCATGTTCCGGCTAGAAC
GGCGGGGCTTCTCTTGGAAGCTGCTTTGAGGATTCAGAAACAGTCAACGCCCGCCAGATCCAAATCCCTACCGAAATCGAATGGTTTAGGGCTTTTCGGTTCTTTTCTTA
AGCGCTTTACTCATCGCGGCCGTTCTCGGAAGCGAGAGATCAACGGCGACTGCCGGAGAAATGACCCCCGCGGCAGCCCGCCACTGCCGCCGAAAATGGCGATTAACAAG
AATGAGAACGACTCTGTTTCTCCGCAGAGTAATGTAACGAGCTTTGATTTCTGCGAGAGTAATTTTTGCGATAGCCCTTTTCGGTTCGTGCTTCAATCGAGCCCCTCCGC
CGGTCACCGGACGCCGGAGTTCTCTTCTCCGGCAACTTCTCCGGCTCGAAACGACCATCAGGTCAATGATGTAGAGAGCTTGAAGAAATTGCCAGTTGAGGATGAGGAGG
AAGAGAAAGAACAGAGCAGTCCCGTGTCTGTGTTGGATCCTCCATTTGAGGATGATAACGAAGGACATTATGAGGATGGTGAGGATGAGGACGATTACGATTTGGAACGC
AGCTACGCCATTGTACAAAAGGCGAAGCATCAGCTACTGAAAAAACTTCGGAGATTCGAGAGACTAGCAGAACTAGATCCAGTAGAACTCGAGACGTTTCTACTAAAAGA
TGAGGAAGACGAACTCGACGATGACGATGATGACATTGATCATCTCAAGGAAGAAGAAGAGTACACAAGCCATAACTTTGATCCATCTAATAACGAAAAACACATCAAAC
AACACAACGTAGAGGCGAATGGCAGTTCAAGCTTCCAAATTCCTCACCGACCCGCAAAAGATACGAAGAGACTCGTCTGCAATCTCATAGCCGGGGAAAAGAGAGATCCG
GTTGTGATCGACGAGAGAGAAGAGATGAGAAAGAGAGTCTACGTGAGATCAGATTTGTGGAAACGGGTGGACTCGAGGGCCATCGACGTGATGGCGGGGCAAGATTTGAA
AGCAGAGCTTGATGGGTGGATCAGAAATGGGGAGCAAAGAGGAGAAATAGCCATAGAAATAGAGCTTGCAATCTTCAGCTTGCTAGTGGAGGAAATGCAAACTGAGCTAC
ATTGCTTAACTCATTAACTGATGGAAATTATTCCACTCCACAGAAATAATATTTAAATTTCAACAATAATCTCTAGATTTTAAATTACTTTTAGGAATATAATCTGACTT
TAAGAGCCATAGGTTAAGTTTAACACTACCTCTTAAGTAGAAAGATTAGCATGAAAGGGACATCACTGTGATTTGTAAATTTA
Protein sequenceShow/hide protein sequence
MAQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSPKSHFLHLNKRKPISHSSDFPRKFCKTACFFSFTHSPDLRNPSPLFEFHSPVKSPSRNPNPIFLHVPARTAGLLLE
AALRIQKQSTPARSKSLPKSNGLGLFGSFLKRFTHRGRSRKREINGDCRRNDPRGSPPLPPKMAINKNENDSVSPQSNVTSFDFCESNFCDSPFRFVLQSSPSAGHRTPE
FSSPATSPARNDHQVNDVESLKKLPVEDEEEEKEQSSPVSVLDPPFEDDNEGHYEDGEDEDDYDLERSYAIVQKAKHQLLKKLRRFERLAELDPVELETFLLKDEEDELD
DDDDDIDHLKEEEEYTSHNFDPSNNEKHIKQHNVEANGSSSFQIPHRPAKDTKRLVCNLIAGEKRDPVVIDEREEMRKRVYVRSDLWKRVDSRAIDVMAGQDLKAELDGW
IRNGEQRGEIAIEIELAIFSLLVEEMQTELHCLTH