; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0014409 (gene) of Snake gourd v1 genome

Gene IDTan0014409
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionIENR2 domain-containing protein
Genome locationLG01:8337341..8342500
RNA-Seq ExpressionTan0014409
SyntenyTan0014409
Gene Ontology termsGO:0003677 - DNA binding (molecular function)
GO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR003611 - Nuclease associated modular domain 3


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6600400.1 hypothetical protein SDJN03_05633, partial [Cucurbita argyrosperma subsp. sororia]2.0e-16672.41Show/hide
Query:  LDCKEVAMKVMAQRLSGATLSVNLAPNYAIWKIFYYPVANINLPSNALPVNQQISTVRNTSLFSPFNIFNKTNSSQSLLLMVDEGRNSNSGECYKSKCSS
        +D     M  +  +L GATLSVNLAPN AIWK FYYPVAN+NLPSN +P+NQQIS  RN SL SP N+FN+TNSSQSLL +V EGR SNSGECYKSKCSS
Subjt:  LDCKEVAMKVMAQRLSGATLSVNLAPNYAIWKIFYYPVANINLPSNALPVNQQISTVRNTSLFSPFNIFNKTNSSQSLLLMVDEGRNSNSGECYKSKCSS

Query:  GSFEKQVLSRDAEDDDCPENLETGNYKEWQRRRKIGLANKGRVPWNKGKKHNLETRKRIKQRTIEALRDPEVRRKMSEYSR-THSDQVKVKISSSLRRVW
        GSFEKQV SR+  DDDCPEN ET N KEWQRRRKIG+ANKG+VPWNKGKKH+LETRKRIKQRTIEAL++P+VRRKMSEY R THSDQVK KISSSLRRVW
Subjt:  GSFEKQVLSRDAEDDDCPENLETGNYKEWQRRRKIGLANKGRVPWNKGKKHNLETRKRIKQRTIEALRDPEVRRKMSEYSR-THSDQVKVKISSSLRRVW

Query:  GKRLMKKRLNETFFFSWMESIAVAAKKGGKEEQELDWDSYDKIKQEMLHQKLQWAAEKAKLKVVRAENAKKRKVEGRVRKKEKGDDNAKTKKMKMCSRRR
        GKRL+KKRLNE FF SW ESIAVAAKKGGKEEQELDWDS+DKI QEMLHQKL+   EK KLK++RAENAKKRK++GR          AK KK KMCSRRR
Subjt:  GKRLMKKRLNETFFFSWMESIAVAAKKGGKEEQELDWDSYDKIKQEMLHQKLQWAAEKAKLKVVRAENAKKRKVEGRVRKKEKGDDNAKTKKMKMCSRRR

Query:  NGGKRKAKEGEDIQTKMKKLTATERSRLKQRLKKIRKKITINGAVAAQGSIASVIPQNTSWEKLDLDLIKKGQMRKEVSLADQIQVAKNRKAESIACKVL
        NGGKR+ KEGEDIQ  MK+LTA ERS LKQRLKKIRKKI IN  VAAQGS+ASV+P+ T+WEK+DLD IKKG++R+EVSLADQIQ AKNRKAESIACK+L
Subjt:  NGGKRKAKEGEDIQTKMKKLTATERSRLKQRLKKIRKKITINGAVAAQGSIASVIPQNTSWEKLDLDLIKKGQMRKEVSLADQIQVAKNRKAESIACKVL

Query:  VASTLSYGCTREAERFLEHLDPLESQKLCICRLDSDKLLSNSCMSMASDRWLA
        VASTLSYGC   A                   +DS+K  S S  SMA+DRWLA
Subjt:  VASTLSYGCTREAERFLEHLDPLESQKLCICRLDSDKLLSNSCMSMASDRWLA

KAG7031061.1 hypothetical protein SDJN02_05100 [Cucurbita argyrosperma subsp. argyrosperma]2.5e-16670.11Show/hide
Query:  LDCKEVAMKVMAQRLSGATLSVNLAPNYAIWKIFYYPVANINLPSNALPVNQQISTVRNTSLFSPFNIFNKTNSSQSLLLMVDEGRNSNSGECYKSKCSS
        +D     M  +  +L GATLSVNLAPN AIWK FYYPVAN+NLPSN +P+NQQIS  RN SL SP N+FN+TNSSQSLL +V EGR SNSGECYKSKCSS
Subjt:  LDCKEVAMKVMAQRLSGATLSVNLAPNYAIWKIFYYPVANINLPSNALPVNQQISTVRNTSLFSPFNIFNKTNSSQSLLLMVDEGRNSNSGECYKSKCSS

Query:  GSFEKQVLSRDAEDDDCPENLETGNYKEWQRRRKIGLANKGRVPWNKGKKHNLETRKRIKQRTIEALRDPEVRRKMSEYSR-THSDQVKVKISSSLRRVW
        GSFEKQV SR+  DDDCPEN ET N KEWQRRRKIG+ANKG+VPWNKGKKH+LETRKRIKQRTIEAL++P+VRRKMSEY R THSDQVK KISSSLRRVW
Subjt:  GSFEKQVLSRDAEDDDCPENLETGNYKEWQRRRKIGLANKGRVPWNKGKKHNLETRKRIKQRTIEALRDPEVRRKMSEYSR-THSDQVKVKISSSLRRVW

Query:  GKRLMKKRLNETFFFSWMESIAVAAKKGGKEEQELDWDSYDKIKQEMLHQKLQWAAEKAKLKVVRAENAKKRKVEGRVRKKEKGDDNAKTKKMKMCSRRR
        GKRL+KKRLNE FF SW ESIAVAAKKGGKEEQELDWDS+DKI QEMLHQKL+   EK KLK++RAENAKKRK++GR          AK KK KMCSRRR
Subjt:  GKRLMKKRLNETFFFSWMESIAVAAKKGGKEEQELDWDSYDKIKQEMLHQKLQWAAEKAKLKVVRAENAKKRKVEGRVRKKEKGDDNAKTKKMKMCSRRR

Query:  NGGKRKAKEGEDIQTKMKKLTATERSRLKQRLKKIRKKITINGAVAAQGSIASVIPQNTSWEKLDLDLIKKGQMRKEVSLADQIQVAKNRKAESIACKVL
        NGGKR+ KEGEDIQ  MK+LTA ERS LKQRLKKIRKKI IN  VAAQGS+ASV+P+ T+WEK+DLD IKKG++R+EVSLADQIQ AKNRKAESIACK+L
Subjt:  NGGKRKAKEGEDIQTKMKKLTATERSRLKQRLKKIRKKITINGAVAAQGSIASVIPQNTSWEKLDLDLIKKGQMRKEVSLADQIQVAKNRKAESIACKVL

Query:  VASTLSYGCTREAERFLEHLDPLESQKLCICRLDSDKLLSNSCMSMASDRWLACLLLNLFHNCVGYLKILNASLK
        VASTLSYGC   A                   +DS+K  S S  SMA+DRWLA         C  Y+KIL++ ++
Subjt:  VASTLSYGCTREAERFLEHLDPLESQKLCICRLDSDKLLSNSCMSMASDRWLACLLLNLFHNCVGYLKILNASLK

XP_008451721.1 PREDICTED: uncharacterized protein LOC103492929 isoform X1 [Cucumis melo]6.1e-16878.55Show/hide
Query:  LDCKEVAMKVMAQRLSGATLSVNLAPNYAIWKIFYYPVANINLPSNALPVNQQISTVRNTSLFSPFNIFNKTNSSQSLLLMVDEGRNSNSGECYKSKCSS
        +DC    M  +  RL G T +V LAPN A+WKI YYPVANIN PSNA P+N Q+S +R+ SLFSPFN+FN+T+SSQ+ L MVDEGRNS+ GECYKSKCSS
Subjt:  LDCKEVAMKVMAQRLSGATLSVNLAPNYAIWKIFYYPVANINLPSNALPVNQQISTVRNTSLFSPFNIFNKTNSSQSLLLMVDEGRNSNSGECYKSKCSS

Query:  GSFEKQVLSRDAEDDDCPENLETGNYKEWQRRRKIGLANKGRVPWNKGKKHNLETRKRIKQRTIEALRDPEVRRKMSEYSRTHSDQVKVKISSSLRRVWG
         S EKQVLS     DD PENLET N  EWQRR+KIGLANKGRVPWNKGKKHNLETRKRIKQRTIEALRDPEVRRKMSEY RTHSDQVKVKISSSLRRVWG
Subjt:  GSFEKQVLSRDAEDDDCPENLETGNYKEWQRRRKIGLANKGRVPWNKGKKHNLETRKRIKQRTIEALRDPEVRRKMSEYSRTHSDQVKVKISSSLRRVWG

Query:  KRLMKKRLNETFFFSWMESIAVAAKKGGKEEQELDWDSYDKIKQEMLHQKLQWAAEKAKLKVVRAENAKKRKVEGRVRKKEKGDDNAKTKKMKMCSRRRN
        KRL+KKRLNETFF SWMESIAVAAKKGGKEEQELDWDSYDKIKQE LHQ+LQ  AEK KLK +RAENAK R+V+ RVRKKEKGDD AKTKK+KMCSRRR+
Subjt:  KRLMKKRLNETFFFSWMESIAVAAKKGGKEEQELDWDSYDKIKQEMLHQKLQWAAEKAKLKVVRAENAKKRKVEGRVRKKEKGDDNAKTKKMKMCSRRRN

Query:  GGKRKAKEGEDIQTKMKKLTATERSRLKQRLKKIRKKITINGAVAAQGSIASVIPQNTSWEKLDLDLIKKGQMRKEVSLADQIQVAKNRKAESIACKVLV
         GKRK KE +D   KMKK T  ERS+LKQRLKKIRKKI+INGAV  QGSIASV PQNTSWE LDLDLIKKGQMRKE SLADQIQVAKNRKAES ACKVL+
Subjt:  GGKRKAKEGEDIQTKMKKLTATERSRLKQRLKKIRKKITINGAVAAQGSIASVIPQNTSWEKLDLDLIKKGQMRKEVSLADQIQVAKNRKAESIACKVLV

Query:  -ASTLSYGCTREAER
         ASTL++ CT  AER
Subjt:  -ASTLSYGCTREAER

XP_011653304.1 uncharacterized protein LOC101207813 isoform X1 [Cucumis sativus]1.5e-16377.78Show/hide
Query:  LDCKEVAMKVMAQRLSGATLSVNLAPNYAIWKIFYYPVANINLPSNALPVNQQISTVRNTSLFSPFNIFNKTNSSQSLLLMVDEGRNSNSGECYKSKCSS
        +DC    M  +  RL G T +V LAPN A+WKI YYPVANIN PSNA P+N Q+S VRN S+FSPFNIFN+T+ SQ+ L MVDEGRNSN GECYKSKCSS
Subjt:  LDCKEVAMKVMAQRLSGATLSVNLAPNYAIWKIFYYPVANINLPSNALPVNQQISTVRNTSLFSPFNIFNKTNSSQSLLLMVDEGRNSNSGECYKSKCSS

Query:  GSFEKQVLSRDAEDDDCPENLETGNYKEWQRRRKIGLANKGRVPWNKGKKHNLETRKRIKQRTIEALRDPEVRRKMSEYSRTHSDQVKVKISSSLRRVWG
         S EKQVLS     DD PENLET N KEWQRR+KIGLANKGRVPWNKGKKHNLETR RIKQRTIEALRDPEVRRKMSEY R HSDQVKVKISSSLRRVWG
Subjt:  GSFEKQVLSRDAEDDDCPENLETGNYKEWQRRRKIGLANKGRVPWNKGKKHNLETRKRIKQRTIEALRDPEVRRKMSEYSRTHSDQVKVKISSSLRRVWG

Query:  KRLMKKRLNETFFFSWMESIAVAAKKGGKEEQELDWDSYDKIKQEMLHQKLQWAAEKAKLKVVRAENAKKRKVEGRVRKKEKGDDNAKTKKMKMCSRRRN
        KRLMKKRLNETFF SWMESIAVAAKKGGKEEQELDWDSYDKIKQE LHQ+L+  AEK KLK +R ENAK +KV+ RV KKEKGDDNAKTKK+KMCSRRR+
Subjt:  KRLMKKRLNETFFFSWMESIAVAAKKGGKEEQELDWDSYDKIKQEMLHQKLQWAAEKAKLKVVRAENAKKRKVEGRVRKKEKGDDNAKTKKMKMCSRRRN

Query:  GGKRKAKEGEDIQTKMKKLTATERSRLKQRLKKIRKKITINGAVAAQGSIASVIPQNTSWEKLDLDLIKKGQMRKEVSLADQIQVAKNRKAESIACKVLV
         GKRK KE ++++ K KK T  ERS+LKQRLKKIRKKI+INGAV AQGSIASV PQN  WEKLDLDLIKKGQ  KE SLADQIQVAKNRKAES ACKVL+
Subjt:  GGKRKAKEGEDIQTKMKKLTATERSRLKQRLKKIRKKITINGAVAAQGSIASVIPQNTSWEKLDLDLIKKGQMRKEVSLADQIQVAKNRKAESIACKVLV

Query:  ASTLSYGCTREAER
        ASTL++ CT  AER
Subjt:  ASTLSYGCTREAER

XP_038897968.1 uncharacterized protein LOC120085828 [Benincasa hispida]2.1e-16877.51Show/hide
Query:  INIFVLDCKEVAMKVMAQRLSGATLSVNLAPNYAIWKIFYYPVANINLPSNALPVNQQISTVRNTSLFSPFNIFNKTNSSQSLLLMVDEGRNSNSGECYK
        + + ++DC    M     RL G TLSVNLAPN A+WKI YYP+ANINLP NA P+NQQ++ +R+ S+FSP +IFN+ +SSQ++L MVDEGRNSN  ECYK
Subjt:  INIFVLDCKEVAMKVMAQRLSGATLSVNLAPNYAIWKIFYYPVANINLPSNALPVNQQISTVRNTSLFSPFNIFNKTNSSQSLLLMVDEGRNSNSGECYK

Query:  SKCSSGSFEKQVLSRDAEDDDCPENLETGNYKEWQRRRKIGLANKGRVPWNKGKKHNLETRKRIKQRTIEALRDPEVRRKMSEYSRTHSDQVKVKISSSL
        SKCSSG  EK V+S     +D PENLET N KE QRR+KIGLANKGRVPWNKGKKHNLETRKRIKQRTIEALRDP+VRRKMSEY RTHSDQVKVKISSSL
Subjt:  SKCSSGSFEKQVLSRDAEDDDCPENLETGNYKEWQRRRKIGLANKGRVPWNKGKKHNLETRKRIKQRTIEALRDPEVRRKMSEYSRTHSDQVKVKISSSL

Query:  RRVWGKRLMKKRLNETFFFSWMESIAVAAKKGGKEEQELDWDSYDKIKQEMLHQKLQWAAEKAKLKVVRAENAKKRKVEGRVRKKEKGDDNAKTKKMKMC
        R VWGKRLMKKRLNETFF SWMESIAVAAKKGGKEEQELDWDSYDKIKQE+LHQ LQ  AEK KLKV RAEN KK+KV+G V KKEKG+DN+KTKK+KMC
Subjt:  RRVWGKRLMKKRLNETFFFSWMESIAVAAKKGGKEEQELDWDSYDKIKQEMLHQKLQWAAEKAKLKVVRAENAKKRKVEGRVRKKEKGDDNAKTKKMKMC

Query:  SRRRNGGKRKAKEGEDIQTKMKKLTATERSRLKQRLKKIRKKITINGAVAAQGSIASVIPQNTSWEKLDLDLIKKGQMRKEVSLADQIQVAKNRKAESIA
        SRRRNGGKRK KEG+D   KMKK T  ERS+LKQRLKKIRKKI+ NGAV AQGSIASV P+NTSWEKLDLDLIKKGQMRKEVSLADQIQVAKNRKAES A
Subjt:  SRRRNGGKRKAKEGEDIQTKMKKLTATERSRLKQRLKKIRKKITINGAVAAQGSIASVIPQNTSWEKLDLDLIKKGQMRKEVSLADQIQVAKNRKAESIA

Query:  CKVLVASTLSYGCTREAE
        CKVL+ASTL+Y CT  AE
Subjt:  CKVLVASTLSYGCTREAE

TrEMBL top hitse value%identityAlignment
A0A0A0L0E1 IENR2 domain-containing protein7.5e-16477.78Show/hide
Query:  LDCKEVAMKVMAQRLSGATLSVNLAPNYAIWKIFYYPVANINLPSNALPVNQQISTVRNTSLFSPFNIFNKTNSSQSLLLMVDEGRNSNSGECYKSKCSS
        +DC    M  +  RL G T +V LAPN A+WKI YYPVANIN PSNA P+N Q+S VRN S+FSPFNIFN+T+ SQ+ L MVDEGRNSN GECYKSKCSS
Subjt:  LDCKEVAMKVMAQRLSGATLSVNLAPNYAIWKIFYYPVANINLPSNALPVNQQISTVRNTSLFSPFNIFNKTNSSQSLLLMVDEGRNSNSGECYKSKCSS

Query:  GSFEKQVLSRDAEDDDCPENLETGNYKEWQRRRKIGLANKGRVPWNKGKKHNLETRKRIKQRTIEALRDPEVRRKMSEYSRTHSDQVKVKISSSLRRVWG
         S EKQVLS     DD PENLET N KEWQRR+KIGLANKGRVPWNKGKKHNLETR RIKQRTIEALRDPEVRRKMSEY R HSDQVKVKISSSLRRVWG
Subjt:  GSFEKQVLSRDAEDDDCPENLETGNYKEWQRRRKIGLANKGRVPWNKGKKHNLETRKRIKQRTIEALRDPEVRRKMSEYSRTHSDQVKVKISSSLRRVWG

Query:  KRLMKKRLNETFFFSWMESIAVAAKKGGKEEQELDWDSYDKIKQEMLHQKLQWAAEKAKLKVVRAENAKKRKVEGRVRKKEKGDDNAKTKKMKMCSRRRN
        KRLMKKRLNETFF SWMESIAVAAKKGGKEEQELDWDSYDKIKQE LHQ+L+  AEK KLK +R ENAK +KV+ RV KKEKGDDNAKTKK+KMCSRRR+
Subjt:  KRLMKKRLNETFFFSWMESIAVAAKKGGKEEQELDWDSYDKIKQEMLHQKLQWAAEKAKLKVVRAENAKKRKVEGRVRKKEKGDDNAKTKKMKMCSRRRN

Query:  GGKRKAKEGEDIQTKMKKLTATERSRLKQRLKKIRKKITINGAVAAQGSIASVIPQNTSWEKLDLDLIKKGQMRKEVSLADQIQVAKNRKAESIACKVLV
         GKRK KE ++++ K KK T  ERS+LKQRLKKIRKKI+INGAV AQGSIASV PQN  WEKLDLDLIKKGQ  KE SLADQIQVAKNRKAES ACKVL+
Subjt:  GGKRKAKEGEDIQTKMKKLTATERSRLKQRLKKIRKKITINGAVAAQGSIASVIPQNTSWEKLDLDLIKKGQMRKEVSLADQIQVAKNRKAESIACKVLV

Query:  ASTLSYGCTREAER
        ASTL++ CT  AER
Subjt:  ASTLSYGCTREAER

A0A1S3BS78 uncharacterized protein LOC103492929 isoform X12.9e-16878.55Show/hide
Query:  LDCKEVAMKVMAQRLSGATLSVNLAPNYAIWKIFYYPVANINLPSNALPVNQQISTVRNTSLFSPFNIFNKTNSSQSLLLMVDEGRNSNSGECYKSKCSS
        +DC    M  +  RL G T +V LAPN A+WKI YYPVANIN PSNA P+N Q+S +R+ SLFSPFN+FN+T+SSQ+ L MVDEGRNS+ GECYKSKCSS
Subjt:  LDCKEVAMKVMAQRLSGATLSVNLAPNYAIWKIFYYPVANINLPSNALPVNQQISTVRNTSLFSPFNIFNKTNSSQSLLLMVDEGRNSNSGECYKSKCSS

Query:  GSFEKQVLSRDAEDDDCPENLETGNYKEWQRRRKIGLANKGRVPWNKGKKHNLETRKRIKQRTIEALRDPEVRRKMSEYSRTHSDQVKVKISSSLRRVWG
         S EKQVLS     DD PENLET N  EWQRR+KIGLANKGRVPWNKGKKHNLETRKRIKQRTIEALRDPEVRRKMSEY RTHSDQVKVKISSSLRRVWG
Subjt:  GSFEKQVLSRDAEDDDCPENLETGNYKEWQRRRKIGLANKGRVPWNKGKKHNLETRKRIKQRTIEALRDPEVRRKMSEYSRTHSDQVKVKISSSLRRVWG

Query:  KRLMKKRLNETFFFSWMESIAVAAKKGGKEEQELDWDSYDKIKQEMLHQKLQWAAEKAKLKVVRAENAKKRKVEGRVRKKEKGDDNAKTKKMKMCSRRRN
        KRL+KKRLNETFF SWMESIAVAAKKGGKEEQELDWDSYDKIKQE LHQ+LQ  AEK KLK +RAENAK R+V+ RVRKKEKGDD AKTKK+KMCSRRR+
Subjt:  KRLMKKRLNETFFFSWMESIAVAAKKGGKEEQELDWDSYDKIKQEMLHQKLQWAAEKAKLKVVRAENAKKRKVEGRVRKKEKGDDNAKTKKMKMCSRRRN

Query:  GGKRKAKEGEDIQTKMKKLTATERSRLKQRLKKIRKKITINGAVAAQGSIASVIPQNTSWEKLDLDLIKKGQMRKEVSLADQIQVAKNRKAESIACKVLV
         GKRK KE +D   KMKK T  ERS+LKQRLKKIRKKI+INGAV  QGSIASV PQNTSWE LDLDLIKKGQMRKE SLADQIQVAKNRKAES ACKVL+
Subjt:  GGKRKAKEGEDIQTKMKKLTATERSRLKQRLKKIRKKITINGAVAAQGSIASVIPQNTSWEKLDLDLIKKGQMRKEVSLADQIQVAKNRKAESIACKVLV

Query:  -ASTLSYGCTREAER
         ASTL++ CT  AER
Subjt:  -ASTLSYGCTREAER

A0A6J1CMC2 uncharacterized protein LOC111012783 isoform X16.6e-16078.25Show/hide
Query:  LSGATLSVNLAPNYAIWKIFYYPVANINLPSNALPVNQQISTVRNTSLFSPFNIFNKTNSSQSLLLMVDEGRNSNSGECYKSKCSSGSFEKQVLSRDAED
        LSGAT S+NLA N  +WKIF YPVA INLPSN +PVN QIS +++ S  SP +I N+T+ S  LL M DEGRNSN G CYKSKCS  S EK+V  R+  D
Subjt:  LSGATLSVNLAPNYAIWKIFYYPVANINLPSNALPVNQQISTVRNTSLFSPFNIFNKTNSSQSLLLMVDEGRNSNSGECYKSKCSSGSFEKQVLSRDAED

Query:  DDCPENLETGNYKEWQRRRKIGLANKGRVPWNKGKKHNLETRKRIKQRTIEALRDPEVRRKMSEYSRTHSDQVKVKISSSLRRVWGKRLMKKRLNETFFF
        DDCP+NL   N KE QRRR+IGLANKG VPWNKGKKHN+ETR+RIKQRTIEALRDP+VRRKMSEY RTHSDQVKVKISSSLRRVWGKRLMKKRLNETFF 
Subjt:  DDCPENLETGNYKEWQRRRKIGLANKGRVPWNKGKKHNLETRKRIKQRTIEALRDPEVRRKMSEYSRTHSDQVKVKISSSLRRVWGKRLMKKRLNETFFF

Query:  SWMESIAVAAKKGGKEEQELDWDSYDKIKQEMLHQKLQWAAEKAKLKVVRAENAKKRKVEGRVRKKEKGDDNAKTKKMKMCSRRRNGGKRKAKEGEDIQT
        SW ESIAVAAKKGGKE +ELDWDSY KIKQEML QKLQ AAEKA LK  RAENAKKRKVE R+RK+EKGD N K K+MKMCS+ RNG KRKAKEGEDIQ 
Subjt:  SWMESIAVAAKKGGKEEQELDWDSYDKIKQEMLHQKLQWAAEKAKLKVVRAENAKKRKVEGRVRKKEKGDDNAKTKKMKMCSRRRNGGKRKAKEGEDIQT

Query:  KMKKLTATERSRLKQRLKKIRKKITINGAVAAQGSIASVIPQNTSWEKLDLDLIKKGQMRKEVSLADQIQVAKNRKAESIACKVLVASTLSYGCTREAER
        +MKKLTA ERSRLKQRLK+IRKKI+INGAVAA+GSIASVIPQNTSWEKLDLDLIKKGQMRK VSLA+QIQVAK+RKAESIACKVL+AST +Y CT  AE+
Subjt:  KMKKLTATERSRLKQRLKKIRKKITINGAVAAQGSIASVIPQNTSWEKLDLDLIKKGQMRKEVSLADQIQVAKNRKAESIACKVLVASTLSYGCTREAER

A0A6J1FPR6 uncharacterized protein LOC1114471167.8e-15376.21Show/hide
Query:  LDCKEVAMKVMAQRLSGATLSVNLAPNYAIWKIFYYPVANINLPSNALPVNQQISTVRNTSLFSPFNIFNKTNSSQSLLLMVDEGRNSNSGECYKSKCSS
        +D     M  +  +L GATLSVNLAPN AIWK FYYPVAN+NLPSN +P+NQQIS  RN SL SP N+FN+TNSSQSLL +V EGR SNSGECYKSKCSS
Subjt:  LDCKEVAMKVMAQRLSGATLSVNLAPNYAIWKIFYYPVANINLPSNALPVNQQISTVRNTSLFSPFNIFNKTNSSQSLLLMVDEGRNSNSGECYKSKCSS

Query:  GSFEKQVLSRDAEDDDCPENLETGNYKEWQRRRKIGLANKGRVPWNKGKKHNLETRKRIKQRTIEALRDPEVRRKMSEYSR-THSDQVKVKISSSLRRVW
        GSFEKQV SR+  DDDCPEN ET N KEWQRRRKIG+ANKG+VPWNKGKKH+LETRKRIKQRTIEALR+P+VRRKMSEY R THSDQVK KISSSLRRVW
Subjt:  GSFEKQVLSRDAEDDDCPENLETGNYKEWQRRRKIGLANKGRVPWNKGKKHNLETRKRIKQRTIEALRDPEVRRKMSEYSR-THSDQVKVKISSSLRRVW

Query:  GKRLMKKRLNETFFFSWMESIAVAAKKGGKEEQELDWDSYDKIKQEMLHQKLQWAAEKAKLKVVRAENAKKRKVEGRVRKKEKGDDNAKTKKMKMCSRRR
        GKRL+KKRLNE FF SW ESIAVAAKKGGKEEQELDWDS+DKI QEMLHQKL+   EK KLK++RAENAKKRK++GR          AK KK KM SRRR
Subjt:  GKRLMKKRLNETFFFSWMESIAVAAKKGGKEEQELDWDSYDKIKQEMLHQKLQWAAEKAKLKVVRAENAKKRKVEGRVRKKEKGDDNAKTKKMKMCSRRR

Query:  NGGKRKAKEGEDIQTKMKKLTATERSRLKQRLKKIRKKITINGAVAAQGSIASVIPQNTSWEKLDLDLIKKGQMRKEVSLADQIQVAKNRK
        NGGKR+ KEGED+Q   K+LTA ERSRLKQRLKKIRKKI ING VAAQGS+ASV+P+ T+WEK+DLDLIKKG++R+EVSLADQIQ AKNRK
Subjt:  NGGKRKAKEGEDIQTKMKKLTATERSRLKQRLKKIRKKITINGAVAAQGSIASVIPQNTSWEKLDLDLIKKGQMRKEVSLADQIQVAKNRK

A0A6J1ITB7 uncharacterized protein LOC1114804101.1e-16277.53Show/hide
Query:  MKVMAQRLSGATLSVNLAPNYAIWKIFYYPVANINLPSNALPVNQQISTVRNTSLFSPFNIFNKTNSSQSLLLMVDEGRNSNSGECYKSKCSSGSFEKQV
        M  +  +L GAT+SVNLAPN AIWK FYYPVAN+NLPSN +P+NQQIS  RN SL SPFN+FN+TNSSQSLL +V EGR SNSGECYKSKCSSGSFEKQV
Subjt:  MKVMAQRLSGATLSVNLAPNYAIWKIFYYPVANINLPSNALPVNQQISTVRNTSLFSPFNIFNKTNSSQSLLLMVDEGRNSNSGECYKSKCSSGSFEKQV

Query:  LSRDAEDDDCPENLETGNYKEWQRRRKIGLANKGRVPWNKGKKHNLETRKRIKQRTIEALRDPEVRRKMSEYSR-THSDQVKVKISSSLRRVWGKRLMKK
         SR+  DDDCPEN ET N KEWQRRRKIG+ANKG+VPWNKGKKH+LETRKRIKQRTIEAL++P+VRRKMSEY R THSDQVK KISSSLRRVWGKRL+KK
Subjt:  LSRDAEDDDCPENLETGNYKEWQRRRKIGLANKGRVPWNKGKKHNLETRKRIKQRTIEALRDPEVRRKMSEYSR-THSDQVKVKISSSLRRVWGKRLMKK

Query:  RLNETFFFSWMESIAVAAKKGGKEEQELDWDSYDKIKQEMLHQKLQWAAEKAKLKVVRAENAKKRKVEGRVRKKEKGDDNAKTKKMKMCSRRRNGGKRKA
        RLNE FF SW ESIAVAAKKGGKEEQELDWDS+DKI QEMLHQKL+   EK KLK++RAENAKKRK++GR          AK KK KMCSRRRNGGKRK 
Subjt:  RLNETFFFSWMESIAVAAKKGGKEEQELDWDSYDKIKQEMLHQKLQWAAEKAKLKVVRAENAKKRKVEGRVRKKEKGDDNAKTKKMKMCSRRRNGGKRKA

Query:  KEGEDIQTKMKKLTATERSRLKQRLKKIRKKITINGAVAAQGSIASVIPQNTSWEKLDLDLIKKGQMRKEVSLADQIQVAKNRKAESIACKVLVASTLSY
        KE EDIQ  +K+LTA ERSRLKQRLKKIRKKI IN  VAAQGS+ASV+P+ T+WEKLDLDLIKKG++R+ VSLADQIQ AK RKAESIACK+LVASTLSY
Subjt:  KEGEDIQTKMKKLTATERSRLKQRLKKIRKKITINGAVAAQGSIASVIPQNTSWEKLDLDLIKKGQMRKEVSLADQIQVAKNRKAESIACKVLVASTLSY

Query:  GCTRE
        GC  E
Subjt:  GCTRE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G53250.1 unknown protein3.1e-5345.1Show/hide
Query:  NSGECYKSKCSSGSFEKQVLSRDAEDDDCPENLETGNYKEWQRRRKIGLANKGRVPWNKGKKHNLETRKRIKQRTIEALRDPEVRRKMSEYSRTHSDQVK
        N  E ++ + +S   E + +++D E D   +       KE +RRRKIGLANKG+VPWNKG+KH+ +TR+RIKQRTIEAL +P+VR+KMS++ + HS++ K
Subjt:  NSGECYKSKCSSGSFEKQVLSRDAEDDDCPENLETGNYKEWQRRRKIGLANKGRVPWNKGKKHNLETRKRIKQRTIEALRDPEVRRKMSEYSRTHSDQVK

Query:  VKISSSLRRVWGKRLMKKRLNETFFFSWMESIAVAAKKGGKEEQELDWDSYDKIKQEMLHQKLQWAAEKAKLKVVRAENAKKRKVEGRVRKKEKGDDNAK
         KI +S+++VW +R   KRL E F  SW E+IA AA+KGG  E ELDWDSY+KIKQ+   ++LQ A EKA+ K    E  K    E    + EK    A+
Subjt:  VKISSSLRRVWGKRLMKKRLNETFFFSWMESIAVAAKKGGKEEQELDWDSYDKIKQEMLHQKLQWAAEKAKLKVVRAENAKKRKVEGRVRKKEKGDDNAK

Query:  TKKMKMCSRRRNGGKRKAKEGEDIQTKMKKLTATERSRLKQRLKKIRKKITINGAVAAQGSIASVIPQNTSWEKLDLDLIKKGQMRKEVSLADQIQVAKN
         KK +    RR G  RK K+      + +  T   RS+LK+RL KI KK T  G +A       V+      EKLDLDLI+K + R ++SLADQIQ AKN
Subjt:  TKKMKMCSRRRNGGKRKAKEGEDIQTKMKKLTATERSRLKQRLKKIRKKITINGAVAAQGSIASVIPQNTSWEKLDLDLIKKGQMRKEVSLADQIQVAKN

Query:  RKAESI
        ++   +
Subjt:  RKAESI

AT1G53800.1 unknown protein1.8e-1626.64Show/hide
Query:  EGRNSNSGECYKSKCSSGSFEKQVLSRDAEDDDCPENLETGNYKEWQRRRKIGLANKGRVPWNKGKKHNLETRKRIKQRTIEALRDPEVRRKMSEYSRTH
        E   S+S     SK S+GS            DD  E ++    +E  RR +I  AN+G  PWNKG+KH+ ET ++I++RT  A++DP+++ K++      
Subjt:  EGRNSNSGECYKSKCSSGSFEKQVLSRDAEDDDCPENLETGNYKEWQRRRKIGLANKGRVPWNKGKKHNLETRKRIKQRTIEALRDPEVRRKMSEYSRTH

Query:  SDQVKVKISSSLRRVWGKRLMKKRLNETFFFSWMESIAVAAKKGGKEEQELDWDSYDKIKQEMLHQKLQWAAEKAKLKVVRAENAKKRKVEGRVRKKEKG
        + + ++KI   +R  W +R  ++++ ET  F W   +A AAK+G  +E+EL WDSY+ + Q+    +L+W     + K ++   + +R  +   +++   
Subjt:  SDQVKVKISSSLRRVWGKRLMKKRLNETFFFSWMESIAVAAKKGGKEEQELDWDSYDKIKQEMLHQKLQWAAEKAKLKVVRAENAKKRKVEGRVRKKEKG

Query:  DDNA-----KTKKMKMCS-------------RRRNGGKRKAKEGEDIQTKMKKLTATERSRLKQRLKKIRKKIT
        +  A      + + ++CS             RRR   +  A+  +   TK     +    + + ++ K+RK+ T
Subjt:  DDNA-----KTKKMKMCS-------------RRRNGGKRKAKEGEDIQTKMKKLTATERSRLKQRLKKIRKKIT

AT1G53800.2 unknown protein1.8e-1626.64Show/hide
Query:  EGRNSNSGECYKSKCSSGSFEKQVLSRDAEDDDCPENLETGNYKEWQRRRKIGLANKGRVPWNKGKKHNLETRKRIKQRTIEALRDPEVRRKMSEYSRTH
        E   S+S     SK S+GS            DD  E ++    +E  RR +I  AN+G  PWNKG+KH+ ET ++I++RT  A++DP+++ K++      
Subjt:  EGRNSNSGECYKSKCSSGSFEKQVLSRDAEDDDCPENLETGNYKEWQRRRKIGLANKGRVPWNKGKKHNLETRKRIKQRTIEALRDPEVRRKMSEYSRTH

Query:  SDQVKVKISSSLRRVWGKRLMKKRLNETFFFSWMESIAVAAKKGGKEEQELDWDSYDKIKQEMLHQKLQWAAEKAKLKVVRAENAKKRKVEGRVRKKEKG
        + + ++KI   +R  W +R  ++++ ET  F W   +A AAK+G  +E+EL WDSY+ + Q+    +L+W     + K ++   + +R  +   +++   
Subjt:  SDQVKVKISSSLRRVWGKRLMKKRLNETFFFSWMESIAVAAKKGGKEEQELDWDSYDKIKQEMLHQKLQWAAEKAKLKVVRAENAKKRKVEGRVRKKEKG

Query:  DDNA-----KTKKMKMCS-------------RRRNGGKRKAKEGEDIQTKMKKLTATERSRLKQRLKKIRKKIT
        +  A      + + ++CS             RRR   +  A+  +   TK     +    + + ++ K+RK+ T
Subjt:  DDNA-----KTKKMKMCS-------------RRRNGGKRKAKEGEDIQTKMKKLTATERSRLKQRLKKIRKKIT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCATACATTCTCATTGTCGCCCAGATTTCCCCCATTAACATCTTTGTTTTAGATTGTAAGGAAGTGGCCATGAAAGTTATGGCACAGAGATTATCAGGTGCAACATT
ATCTGTAAATCTTGCTCCAAATTATGCTATCTGGAAGATTTTCTATTACCCAGTAGCCAACATCAATCTTCCATCAAATGCGCTGCCAGTGAATCAGCAGATATCTACAG
TCAGGAACACTTCATTATTTTCTCCTTTCAATATCTTCAACAAAACAAATTCTTCTCAATCCTTGCTGCTCATGGTTGATGAGGGTAGAAATTCCAACTCTGGTGAGTGC
TACAAGTCCAAGTGTTCCTCAGGTTCATTTGAGAAGCAGGTTTTGAGCAGAGATGCCGAGGATGATGATTGTCCAGAAAATCTTGAGACTGGAAATTACAAGGAATGGCA
AAGACGAAGAAAAATAGGACTGGCAAATAAAGGCAGAGTACCATGGAACAAGGGCAAGAAACACAACTTGGAAACTCGTAAGCGAATCAAGCAGAGAACAATTGAAGCCT
TGAGAGACCCTGAGGTGAGGAGGAAGATGTCCGAATATTCCCGCACTCATAGTGACCAGGTCAAGGTTAAAATTAGCTCTTCACTCAGACGTGTATGGGGAAAGAGATTA
ATGAAGAAGAGATTAAATGAGACATTCTTTTTTTCCTGGATGGAAAGCATAGCTGTTGCTGCGAAGAAAGGAGGCAAGGAAGAACAAGAACTCGACTGGGATAGCTATGA
CAAGATAAAACAAGAAATGCTCCACCAAAAGCTTCAATGGGCTGCAGAGAAGGCAAAGTTAAAAGTGGTGAGAGCAGAGAATGCAAAAAAGAGAAAAGTCGAAGGGAGGG
TTCGTAAGAAAGAAAAGGGGGATGATAATGCCAAAACAAAGAAAATGAAAATGTGTTCCAGAAGAAGAAATGGGGGGAAAAGGAAGGCAAAAGAGGGAGAAGACATCCAG
ACAAAAATGAAGAAACTGACTGCCACTGAAAGATCAAGGCTTAAGCAGAGACTAAAAAAGATTCGCAAAAAGATTACAATAAATGGTGCAGTTGCTGCTCAAGGAAGCAT
AGCATCAGTTATACCCCAAAACACATCCTGGGAAAAACTGGATCTAGATCTTATAAAGAAAGGACAAATGAGGAAAGAAGTATCGCTGGCAGATCAAATTCAAGTTGCCA
AGAATAGGAAAGCAGAATCTATAGCCTGCAAAGTTCTTGTAGCCTCTACTTTGTCGTACGGATGCACCAGGGAAGCGGAAAGATTCTTGGAACATTTAGATCCATTGGAG
TCACAGAAGCTTTGTATTTGCAGGCTGGATTCTGATAAGCTATTGAGCAACAGCTGCATGTCCATGGCATCAGATAGATGGCTTGCTTGCCTTTTGCTCAATCTGTTCCA
TAATTGTGTGGGGTATCTAAAGATACTAAATGCCTCTCTGAAGTCAAAGTTGGTATATTTACAAACAAAAAGTTGGATTTTTTTTTTTTCAAAGAAAGATATGGTAGAAA
AATAA
mRNA sequenceShow/hide mRNA sequence
ATGCCATACATTCTCATTGTCGCCCAGATTTCCCCCATTAACATCTTTGTTTTAGATTGTAAGGAAGTGGCCATGAAAGTTATGGCACAGAGATTATCAGGTGCAACATT
ATCTGTAAATCTTGCTCCAAATTATGCTATCTGGAAGATTTTCTATTACCCAGTAGCCAACATCAATCTTCCATCAAATGCGCTGCCAGTGAATCAGCAGATATCTACAG
TCAGGAACACTTCATTATTTTCTCCTTTCAATATCTTCAACAAAACAAATTCTTCTCAATCCTTGCTGCTCATGGTTGATGAGGGTAGAAATTCCAACTCTGGTGAGTGC
TACAAGTCCAAGTGTTCCTCAGGTTCATTTGAGAAGCAGGTTTTGAGCAGAGATGCCGAGGATGATGATTGTCCAGAAAATCTTGAGACTGGAAATTACAAGGAATGGCA
AAGACGAAGAAAAATAGGACTGGCAAATAAAGGCAGAGTACCATGGAACAAGGGCAAGAAACACAACTTGGAAACTCGTAAGCGAATCAAGCAGAGAACAATTGAAGCCT
TGAGAGACCCTGAGGTGAGGAGGAAGATGTCCGAATATTCCCGCACTCATAGTGACCAGGTCAAGGTTAAAATTAGCTCTTCACTCAGACGTGTATGGGGAAAGAGATTA
ATGAAGAAGAGATTAAATGAGACATTCTTTTTTTCCTGGATGGAAAGCATAGCTGTTGCTGCGAAGAAAGGAGGCAAGGAAGAACAAGAACTCGACTGGGATAGCTATGA
CAAGATAAAACAAGAAATGCTCCACCAAAAGCTTCAATGGGCTGCAGAGAAGGCAAAGTTAAAAGTGGTGAGAGCAGAGAATGCAAAAAAGAGAAAAGTCGAAGGGAGGG
TTCGTAAGAAAGAAAAGGGGGATGATAATGCCAAAACAAAGAAAATGAAAATGTGTTCCAGAAGAAGAAATGGGGGGAAAAGGAAGGCAAAAGAGGGAGAAGACATCCAG
ACAAAAATGAAGAAACTGACTGCCACTGAAAGATCAAGGCTTAAGCAGAGACTAAAAAAGATTCGCAAAAAGATTACAATAAATGGTGCAGTTGCTGCTCAAGGAAGCAT
AGCATCAGTTATACCCCAAAACACATCCTGGGAAAAACTGGATCTAGATCTTATAAAGAAAGGACAAATGAGGAAAGAAGTATCGCTGGCAGATCAAATTCAAGTTGCCA
AGAATAGGAAAGCAGAATCTATAGCCTGCAAAGTTCTTGTAGCCTCTACTTTGTCGTACGGATGCACCAGGGAAGCGGAAAGATTCTTGGAACATTTAGATCCATTGGAG
TCACAGAAGCTTTGTATTTGCAGGCTGGATTCTGATAAGCTATTGAGCAACAGCTGCATGTCCATGGCATCAGATAGATGGCTTGCTTGCCTTTTGCTCAATCTGTTCCA
TAATTGTGTGGGGTATCTAAAGATACTAAATGCCTCTCTGAAGTCAAAGTTGGTATATTTACAAACAAAAAGTTGGATTTTTTTTTTTTCAAAGAAAGATATGGTAGAAA
AATAA
Protein sequenceShow/hide protein sequence
MPYILIVAQISPINIFVLDCKEVAMKVMAQRLSGATLSVNLAPNYAIWKIFYYPVANINLPSNALPVNQQISTVRNTSLFSPFNIFNKTNSSQSLLLMVDEGRNSNSGEC
YKSKCSSGSFEKQVLSRDAEDDDCPENLETGNYKEWQRRRKIGLANKGRVPWNKGKKHNLETRKRIKQRTIEALRDPEVRRKMSEYSRTHSDQVKVKISSSLRRVWGKRL
MKKRLNETFFFSWMESIAVAAKKGGKEEQELDWDSYDKIKQEMLHQKLQWAAEKAKLKVVRAENAKKRKVEGRVRKKEKGDDNAKTKKMKMCSRRRNGGKRKAKEGEDIQ
TKMKKLTATERSRLKQRLKKIRKKITINGAVAAQGSIASVIPQNTSWEKLDLDLIKKGQMRKEVSLADQIQVAKNRKAESIACKVLVASTLSYGCTREAERFLEHLDPLE
SQKLCICRLDSDKLLSNSCMSMASDRWLACLLLNLFHNCVGYLKILNASLKSKLVYLQTKSWIFFFSKKDMVEK