; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g26100 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g26100
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionCCHC-type domain-containing protein
Genome locationchr4:18991738..18996316
RNA-Seq ExpressionMoc04g26100
SyntenyMoc04g26100
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR001969 - Aspartic peptidase, active site
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GFS34365.1 hypothetical protein Acr_00g0033580 [Actinidia rufa]4.4e-5236.54Show/hide
Query:  RSIADYTKEFHRLGARTNLVESQHYLVVRYIGGLHANIKEQIALQPIGYLNEAISTATTIEEQIGNCFKKQYSRRTTWEQGGTSKKMAASGDNLSSPLQT
        R+   Y +EF+RL AR NL ES+   + +++ GL   I++Q+ LQ +  LNEA++ A  +E Q       Q    T  +     K++  S    S P+  
Subjt:  RSIADYTKEFHRLGARTNLVESQHYLVVRYIGGLHANIKEQIALQPIGYLNEAISTATTIEEQIGNCFKKQYSRRTTWEQGGTSKKMAASGDNLSSPLQT

Query:  SSAMKGKQSELDLDKGKSTDNVAGKKNSNRYNRPTLGKCFRCGQTSHLSNECLQRKVITLVEEENGQEDSVNDLEEEIEYT-EPN------DGELV--SC
        S       S+       ST    G  N N Y + +  KC+RCG+  H SN C +R  + LVE    +ED  ++  E   Y+ +PN      +GE +  S 
Subjt:  SSAMKGKQSELDLDKGKSTDNVAGKKNSNRYNRPTLGKCFRCGQTSHLSNECLQRKVITLVEEENGQEDSVNDLEEEIEYT-EPN------DGELV--SC

Query:  VLERVILTPKSELPHQRHALFKTRCTINGKICNIIVDSGSTENIMASKLIMALHLPL-----SPHPAPYKVSWINKGD----LTNLPSSVNLSLEAEEMA
        V+++++LTPK     QRH +F+ RCTIN ++C++I+DSGS ENI++  L+  L  P      + H   Y V    + D    L  L  S    +  EE  
Subjt:  VLERVILTPKSELPHQRHALFKTRCTINGKICNIIVDSGSTENIMASKLIMALHLPL-----SPHPAPYKVSWINKGD----LTNLPSSVNLSLEAEEMA

Query:  NRIQELH--QEVHDHIAKSNEKYKATADKGRRSKEFQVGDLVMIHLRKSRFPTGTYSKLKKKKLAPFPILERYRSNSYKLQLPATYNISPVFNIADLYNY
        + +  +H   EV   +  SN KYKA AD  RR K F  GDLVM++LRK RF  GTY+KLK KK  PF I+++   N+Y + LPA   IS  FN+A+LY Y
Subjt:  NRIQELH--QEVHDHIAKSNEKYKATADKGRRSKEFQVGDLVMIHLRKSRFPTGTYSKLKKKKLAPFPILERYRSNSYKLQLPATYNISPVFNIADLYNY

Query:  HPSDD
        +P DD
Subjt:  HPSDD

KAA0047078.1 reverse transcriptase [Cucumis melo var. makuwa]1.4e-5833.92Show/hide
Query:  NKEMEQHFILSSRSSTARLLSVEGEVKTIQKDVCEIKHILETINEKLETLSVQQTPVRTSPHPQTR-----------MNQEVGVEDRRNTYLEDRQAALP
        N+E E++ +LS ++++ RLLS+E  V+ I+  +  +   LE +          Q  VR       R                 V++RR  + +D Q   P
Subjt:  NKEMEQHFILSSRSSTARLLSVEGEVKTIQKDVCEIKHILETINEKLETLSVQQTPVRTSPHPQTR-----------MNQEVGVEDRRNTYLEDRQAALP

Query:  RRLQEVHLGKETFKNHSKCAK----VNRN----PLFQQRHDQMFDS-------SSDEEEQP----------------SEFEGERFG-RSIADYTKEFHRL
        R  QE++   + + +     +     N+N    P F +R  ++ +S       S D +E P                S+++  R G R++ADY KEFH L
Subjt:  RRLQEVHLGKETFKNHSKCAK----VNRN----PLFQQRHDQMFDS-------SSDEEEQP----------------SEFEGERFG-RSIADYTKEFHRL

Query:  GARTNLVESQHYLVVRYIGGLHANIKEQIALQPIGYLNEAISTATTIEEQIGNCFKKQYSRRTTWEQGGTSKKMAASGDNLSSPLQTSSAMKGKQSEL-D
        GAR NL E++ + + R+IGGL  +IKE+I LQP  +L+EAIS A T+EE                           +     +P  TS+  KGK+ E  D
Subjt:  GARTNLVESQHYLVVRYIGGLHANIKEQIALQPIGYLNEAISTATTIEEQIGNCFKKQYSRRTTWEQGGTSKKMAASGDNLSSPLQTSSAMKGKQSEL-D

Query:  LDKGKSTDNVAGKKNSNRYNRPTLGKCFRCGQTSHLSNECLQRKVITLVE-EENGQEDSVNDLEEEIEYTEPNDGELVSCVLERVILTPKSELPHQRHAL
        L   K  + V   K  N+YNRP+LGKCFRCGQ  H SN C QRK I L + EE+   +S  +LEEE +  E +DG  VSCV++RV+L PK E   Q H+L
Subjt:  LDKGKSTDNVAGKKNSNRYNRPTLGKCFRCGQTSHLSNECLQRKVITLVE-EENGQEDSVNDLEEEIEYTEPNDGELVSCVLERVILTPKSELPHQRHAL

Query:  FKTRCTINGKICNIIVDSGSTENIMASKLIMALHLPLSPHPAPYKVSWINKG------DLTNLPSSV------NLSLEAEEMANRIQELHQEVHDHIAKS
        FKTRCTINGK+C++I+D+GS+EN +A KL+ AL+L   PHP PYK+ W+ KG      ++  +P S+       +  +  EM  R Q+L   +   + KS
Subjt:  FKTRCTINGKICNIIVDSGSTENIMASKLIMALHLPLSPHPAPYKVSWINKG------DLTNLPSSV------NLSLEAEEMANRIQELHQEVHDHIAKS

Query:  NEKYKATADKGRRSKEFQVGDLVMIHLRKSRFPTG------TYSKLKKKKLAPFPILERYRSNSYKLQL
        NE+   T D  R  + F+       HL+K   P G         ++     A  PIL  YR + ++ Q+
Subjt:  NEKYKATADKGRRSKEFQVGDLVMIHLRKSRFPTG------TYSKLKKKKLAPFPILERYRSNSYKLQL

KAA0054966.1 transposon Ty3-I Gag-Pol polyprotein isoform X1 [Cucumis melo var. makuwa]2.1e-5442.41Show/hide
Query:  QMFDSSSDEEEQPSEFEGERFG-RSIADYTKEFHRLGARTNLVESQHYLVVRYIGGLHANIKEQIALQPIGYLNEAISTATTIEEQIGNCFKKQYSRRTT
        Q F   + E+   ++++  R G R  A+Y +EFHRLG RTNL+E + +L+  ++GGL  ++KE++ LQP  +L+EAI+ A T+EE I N  + + +R+  
Subjt:  QMFDSSSDEEEQPSEFEGERFG-RSIADYTKEFHRLGARTNLVESQHYLVVRYIGGLHANIKEQIALQPIGYLNEAISTATTIEEQIGNCFKKQYSRRTT

Query:  WEQGGTSKKMAASGDNLSSPLQTSSAMKGKQSELDLDKGKSTDNVAGKKNSNRYNRPTLGKCFRCGQTSHLSNECLQRKVITLVEE-ENGQEDSVNDLEE
        WE    SKK  A    L       +A   K  E +   GK       KK  N Y RP  G C+RCGQ  H SN+C QRK I + ++ ++G   S+ + +E
Subjt:  WEQGGTSKKMAASGDNLSSPLQTSSAMKGKQSELDLDKGKSTDNVAGKKNSNRYNRPTLGKCFRCGQTSHLSNECLQRKVITLVEE-ENGQEDSVNDLEE

Query:  EIEYTEPNDGELVSCVLERVILTPKSELPHQRHALFKTRCTINGKICNIIVDSGSTENIMASKLIMALHLPLSPHPAPYKVSWINKGDLT
        E E  E ++G+ +SC+L+RV+++PK E   QRH+LFKTRCTI GK+CN+I+DSGS+EN ++ KL+ AL+L   PH  PYK+ WI KG  T
Subjt:  EIEYTEPNDGELVSCVLERVILTPKSELPHQRHALFKTRCTINGKICNIIVDSGSTENIMASKLIMALHLPLSPHPAPYKVSWINKGDLT

XP_031741035.1 uncharacterized protein LOC116403692 [Cucumis sativus]1.9e-5847.7Show/hide
Query:  EEEQPSEFEGERFG-RSIADYTKEFHRLGARTNLVESQHYLVVRYIGGLHANIKEQIALQPIGYLNEAISTATTIEEQIGNCFKKQYSRRTTWEQGGTSK
        E+   ++++  R G RS+A+Y +EFHRL ARTNL E++ + V R++GGL  +IKE++ LQP  +L+EAIS A T+EE I     K  +RR+ WE   T  
Subjt:  EEEQPSEFEGERFG-RSIADYTKEFHRLGARTNLVESQHYLVVRYIGGLHANIKEQIALQPIGYLNEAISTATTIEEQIGNCFKKQYSRRTTWEQGGTSK

Query:  KMAASGDNLSSPLQTSSAMKGKQ---SELDLDKGKSTDNVAGKKNSNRYNRPTLGKCFRCGQTSHLSNECLQRKVITLVEEENGQ--EDSVNDLEEEIEY
        K   + D  S    TS+  KGK+    E+ +++ K  +        N Y+RP+LGKCFRCGQT HLS+ C QRK I  + EE GQ  EDS+ + EEE E 
Subjt:  KMAASGDNLSSPLQTSSAMKGKQ---SELDLDKGKSTDNVAGKKNSNRYNRPTLGKCFRCGQTSHLSNECLQRKVITLVEEENGQ--EDSVNDLEEEIEY

Query:  TEPNDGELVSCVLERVILTPKSELPHQRHALFKTRCTINGKICNIIVDSGSTENIMASKLIMALHLPLSPHPAPYKVSWINKG
         E +DGE VSCV++R+++TPK E   QRH LFKTRCTING++C++I+DSGS+EN +A KL+  L+L    HP PYK+ W+ KG
Subjt:  TEPNDGELVSCVLERVILTPKSELPHQRHALFKTRCTINGKICNIIVDSGSTENIMASKLIMALHLPLSPHPAPYKVSWINKG

XP_031743026.1 uncharacterized protein LOC116404533 [Cucumis sativus]8.9e-5345.72Show/hide
Query:  EEEQPSEFEGERFG-RSIADYTKEFHRLGARTNLVESQHYLVVRYIGGLHANIKEQIALQPIGYLNEAISTATTIEEQIGNCFKKQYSRRTTWEQGGTSK
        E+   ++++  R G R++A+Y +EFHRL ARTNL E++ + V R++GGL  +IKE++ LQP  +L+EAIS A T+EE I     K  +RR+ WE   T  
Subjt:  EEEQPSEFEGERFG-RSIADYTKEFHRLGARTNLVESQHYLVVRYIGGLHANIKEQIALQPIGYLNEAISTATTIEEQIGNCFKKQYSRRTTWEQGGTSK

Query:  KMAASGDNLSSPLQTSSAMKGKQ---SELDLDKGKSTDNVAGKKNSNRYNRPTLGKCFRCGQTSHLSNECLQRKVITLVEEENGQEDSVNDLEEEIEYTE
        K   + D  S    TS+  KGK+    E+ +++ K  +        N Y+RP+LGKCFRCGQT HLSN C QRK I + EE     +   + EEE E  E
Subjt:  KMAASGDNLSSPLQTSSAMKGKQ---SELDLDKGKSTDNVAGKKNSNRYNRPTLGKCFRCGQTSHLSNECLQRKVITLVEEENGQEDSVNDLEEEIEYTE

Query:  PNDGELVSCVLERVILTPKSELPHQRHALFKTRCTINGKICNIIVDSGSTENIMASKLIMALHLPLSPH
         +DGE VSCV++R+++TPK E   QRH LFKTRCTING++C++I+DSGS+EN +A KL+  L+L    H
Subjt:  PNDGELVSCVLERVILTPKSELPHQRHALFKTRCTINGKICNIIVDSGSTENIMASKLIMALHLPLSPH

TrEMBL top hitse value%identityAlignment
A0A5A7SQX1 Transposon Ty3-G Gag-Pol polyprotein5.8e-5034.23Show/hide
Query:  DYTKEFHRLGARTNLVESQHYLVVRYIGGLHANIKEQIALQPIGYLNEAISTATTIEEQIGNCFKKQYSRRTTWEQGGTSKK-----------MAASGDN
        DYT+EF+RLGAR NL E++H  + R +  LH  IK+ + L P+ +L+ AIS A+ IE+       K Y R+     G  S K              +  +
Subjt:  DYTKEFHRLGARTNLVESQHYLVVRYIGGLHANIKEQIALQPIGYLNEAISTATTIEEQIGNCFKKQYSRRTTWEQGGTSKK-----------MAASGDN

Query:  LSSPLQTSSAMKGKQSELDLDKGKSTDNVAGKKNSNRYNRPTLGKCFRCGQTSHLSNECLQRKVITLVEEENGQEDSVNDLEEEIEYTEPNDGELVSCVL
        L  P +  +     QS   +  G+S+      K  N YNRPTL KCF+CGQ  HLSNEC QR+ +T+ EE   Q+D  +D +   + + P++ + + CV+
Subjt:  LSSPLQTSSAMKGKQSELDLDKGKSTDNVAGKKNSNRYNRPTLGKCFRCGQTSHLSNECLQRKVITLVEEENGQEDSVNDLEEEIEYTEPNDGELVSCVL

Query:  ERVILTPKSELPHQRHALFKTRCTINGKICNIIVD---SGSTENI----MASKLIMALHLPLSPHPAPYKVSWINKG-----------------------
        +R++LTP+++   QR++L +TRCTING+      D    G    I    M+ ++I+   LP+   P   K+S  NKG                       
Subjt:  ERVILTPKSELPHQRHALFKTRCTINGKICNIIVD---SGSTENI----MASKLIMALHLPLSPHPAPYKVSWINKG-----------------------

Query:  -DLTNLPSSVNLSL--------------------EAEEMANRIQELHQEVHDHIAKSNEKYKATADKGRRSKEFQVGDLVMIHLRKSRFPTGTYSKLKKK
          L + P   +L+L                    E E MA+R  +LHQEV DH+  +N+ YK  A+  +RS+  +   L+  +L KS FP G +SK+  K
Subjt:  -DLTNLPSSVNLSL--------------------EAEEMANRIQELHQEVHDHIAKSNEKYKATADKGRRSKEFQVGDLVMIHLRKSRFPTGTYSKLKKK

Query:  KLAPFPILERYRSNSYKLQLPATYNISPVFNIADLYNYHPSDDFTIS
        ++ PF +LER   NSY+L LPAT  I+P FN++DL  YH  D F+++
Subjt:  KLAPFPILERYRSNSYKLQLPATYNISPVFNIADLYNYHPSDDFTIS

A0A5A7UXS4 CCHC-type domain-containing protein2.7e-4739.93Show/hide
Query:  RSIADYTKEFHRLGARTNLVESQHYLVVRYIGGLHANIKEQIALQPIGYLNEAISTATTIEEQIGNCFKKQYSRRTTWEQGGTSKKMAASGDNLSSPLQT
        RSIA+Y +EFHRL ARTNL E++ + + R+IG       E++ + P+                      K  +R+TTW+   + K+  +S  N     Q 
Subjt:  RSIADYTKEFHRLGARTNLVESQHYLVVRYIGGLHANIKEQIALQPIGYLNEAISTATTIEEQIGNCFKKQYSRRTTWEQGGTSKKMAASGDNLSSPLQT

Query:  SSAMKGKQSELDL-DKGKSTDNVAGKKNSNRYNRPTLGKCFRCGQTSHLSNECLQRKVITLVEEE-NGQEDSVNDLEEEIEYTEPNDGELVSCVLERVIL
        S+++ GK  ++D  D  K  DN    K+ N Y RP+L KCFRCGQ+ HLSN C QR+ I+L ++E N   +   + EEE E+ E +DG+ +S V++RV++
Subjt:  SSAMKGKQSELDL-DKGKSTDNVAGKKNSNRYNRPTLGKCFRCGQTSHLSNECLQRKVITLVEEE-NGQEDSVNDLEEEIEYTEPNDGELVSCVLERVIL

Query:  TPKSELPHQRHALFKTRCTINGKICNIIVDSGSTENIMASKLIMALHLPLSPHPAPYKVSWINKGDLTNLPSSVNLSL
         PK E   QRH+LFKTRCTIN ++C++I+DSGS+EN +A KL+  L+L  +P+P PYK+ W+ KG   ++     +SL
Subjt:  TPKSELPHQRHALFKTRCTINGKICNIIVDSGSTENIMASKLIMALHLPLSPHPAPYKVSWINKGDLTNLPSSVNLSL

A0A5D3C3X9 Reverse transcriptase6.9e-5933.92Show/hide
Query:  NKEMEQHFILSSRSSTARLLSVEGEVKTIQKDVCEIKHILETINEKLETLSVQQTPVRTSPHPQTR-----------MNQEVGVEDRRNTYLEDRQAALP
        N+E E++ +LS ++++ RLLS+E  V+ I+  +  +   LE +          Q  VR       R                 V++RR  + +D Q   P
Subjt:  NKEMEQHFILSSRSSTARLLSVEGEVKTIQKDVCEIKHILETINEKLETLSVQQTPVRTSPHPQTR-----------MNQEVGVEDRRNTYLEDRQAALP

Query:  RRLQEVHLGKETFKNHSKCAK----VNRN----PLFQQRHDQMFDS-------SSDEEEQP----------------SEFEGERFG-RSIADYTKEFHRL
        R  QE++   + + +     +     N+N    P F +R  ++ +S       S D +E P                S+++  R G R++ADY KEFH L
Subjt:  RRLQEVHLGKETFKNHSKCAK----VNRN----PLFQQRHDQMFDS-------SSDEEEQP----------------SEFEGERFG-RSIADYTKEFHRL

Query:  GARTNLVESQHYLVVRYIGGLHANIKEQIALQPIGYLNEAISTATTIEEQIGNCFKKQYSRRTTWEQGGTSKKMAASGDNLSSPLQTSSAMKGKQSEL-D
        GAR NL E++ + + R+IGGL  +IKE+I LQP  +L+EAIS A T+EE                           +     +P  TS+  KGK+ E  D
Subjt:  GARTNLVESQHYLVVRYIGGLHANIKEQIALQPIGYLNEAISTATTIEEQIGNCFKKQYSRRTTWEQGGTSKKMAASGDNLSSPLQTSSAMKGKQSEL-D

Query:  LDKGKSTDNVAGKKNSNRYNRPTLGKCFRCGQTSHLSNECLQRKVITLVE-EENGQEDSVNDLEEEIEYTEPNDGELVSCVLERVILTPKSELPHQRHAL
        L   K  + V   K  N+YNRP+LGKCFRCGQ  H SN C QRK I L + EE+   +S  +LEEE +  E +DG  VSCV++RV+L PK E   Q H+L
Subjt:  LDKGKSTDNVAGKKNSNRYNRPTLGKCFRCGQTSHLSNECLQRKVITLVE-EENGQEDSVNDLEEEIEYTEPNDGELVSCVLERVILTPKSELPHQRHAL

Query:  FKTRCTINGKICNIIVDSGSTENIMASKLIMALHLPLSPHPAPYKVSWINKG------DLTNLPSSV------NLSLEAEEMANRIQELHQEVHDHIAKS
        FKTRCTINGK+C++I+D+GS+EN +A KL+ AL+L   PHP PYK+ W+ KG      ++  +P S+       +  +  EM  R Q+L   +   + KS
Subjt:  FKTRCTINGKICNIIVDSGSTENIMASKLIMALHLPLSPHPAPYKVSWINKG------DLTNLPSSV------NLSLEAEEMANRIQELHQEVHDHIAKS

Query:  NEKYKATADKGRRSKEFQVGDLVMIHLRKSRFPTG------TYSKLKKKKLAPFPILERYRSNSYKLQL
        NE+   T D  R  + F+       HL+K   P G         ++     A  PIL  YR + ++ Q+
Subjt:  NEKYKATADKGRRSKEFQVGDLVMIHLRKSRFPTG------TYSKLKKKKLAPFPILERYRSNSYKLQL

A0A5D3DGR0 Reverse transcriptase1.0e-5442.41Show/hide
Query:  QMFDSSSDEEEQPSEFEGERFG-RSIADYTKEFHRLGARTNLVESQHYLVVRYIGGLHANIKEQIALQPIGYLNEAISTATTIEEQIGNCFKKQYSRRTT
        Q F   + E+   ++++  R G R  A+Y +EFHRLG RTNL+E + +L+  ++GGL  ++KE++ LQP  +L+EAI+ A T+EE I N  + + +R+  
Subjt:  QMFDSSSDEEEQPSEFEGERFG-RSIADYTKEFHRLGARTNLVESQHYLVVRYIGGLHANIKEQIALQPIGYLNEAISTATTIEEQIGNCFKKQYSRRTT

Query:  WEQGGTSKKMAASGDNLSSPLQTSSAMKGKQSELDLDKGKSTDNVAGKKNSNRYNRPTLGKCFRCGQTSHLSNECLQRKVITLVEE-ENGQEDSVNDLEE
        WE    SKK  A    L       +A   K  E +   GK       KK  N Y RP  G C+RCGQ  H SN+C QRK I + ++ ++G   S+ + +E
Subjt:  WEQGGTSKKMAASGDNLSSPLQTSSAMKGKQSELDLDKGKSTDNVAGKKNSNRYNRPTLGKCFRCGQTSHLSNECLQRKVITLVEE-ENGQEDSVNDLEE

Query:  EIEYTEPNDGELVSCVLERVILTPKSELPHQRHALFKTRCTINGKICNIIVDSGSTENIMASKLIMALHLPLSPHPAPYKVSWINKGDLT
        E E  E ++G+ +SC+L+RV+++PK E   QRH+LFKTRCTI GK+CN+I+DSGS+EN ++ KL+ AL+L   PH  PYK+ WI KG  T
Subjt:  EIEYTEPNDGELVSCVLERVILTPKSELPHQRHALFKTRCTINGKICNIIVDSGSTENIMASKLIMALHLPLSPHPAPYKVSWINKGDLT

A0A7J0DG77 Uncharacterized protein2.1e-5236.54Show/hide
Query:  RSIADYTKEFHRLGARTNLVESQHYLVVRYIGGLHANIKEQIALQPIGYLNEAISTATTIEEQIGNCFKKQYSRRTTWEQGGTSKKMAASGDNLSSPLQT
        R+   Y +EF+RL AR NL ES+   + +++ GL   I++Q+ LQ +  LNEA++ A  +E Q       Q    T  +     K++  S    S P+  
Subjt:  RSIADYTKEFHRLGARTNLVESQHYLVVRYIGGLHANIKEQIALQPIGYLNEAISTATTIEEQIGNCFKKQYSRRTTWEQGGTSKKMAASGDNLSSPLQT

Query:  SSAMKGKQSELDLDKGKSTDNVAGKKNSNRYNRPTLGKCFRCGQTSHLSNECLQRKVITLVEEENGQEDSVNDLEEEIEYT-EPN------DGELV--SC
        S       S+       ST    G  N N Y + +  KC+RCG+  H SN C +R  + LVE    +ED  ++  E   Y+ +PN      +GE +  S 
Subjt:  SSAMKGKQSELDLDKGKSTDNVAGKKNSNRYNRPTLGKCFRCGQTSHLSNECLQRKVITLVEEENGQEDSVNDLEEEIEYT-EPN------DGELV--SC

Query:  VLERVILTPKSELPHQRHALFKTRCTINGKICNIIVDSGSTENIMASKLIMALHLPL-----SPHPAPYKVSWINKGD----LTNLPSSVNLSLEAEEMA
        V+++++LTPK     QRH +F+ RCTIN ++C++I+DSGS ENI++  L+  L  P      + H   Y V    + D    L  L  S    +  EE  
Subjt:  VLERVILTPKSELPHQRHALFKTRCTINGKICNIIVDSGSTENIMASKLIMALHLPL-----SPHPAPYKVSWINKGD----LTNLPSSVNLSLEAEEMA

Query:  NRIQELH--QEVHDHIAKSNEKYKATADKGRRSKEFQVGDLVMIHLRKSRFPTGTYSKLKKKKLAPFPILERYRSNSYKLQLPATYNISPVFNIADLYNY
        + +  +H   EV   +  SN KYKA AD  RR K F  GDLVM++LRK RF  GTY+KLK KK  PF I+++   N+Y + LPA   IS  FN+A+LY Y
Subjt:  NRIQELH--QEVHDHIAKSNEKYKATADKGRRSKEFQVGDLVMIHLRKSRFPTGTYSKLKKKKLAPFPILERYRSNSYKLQLPATYNISPVFNIADLYNY

Query:  HPSDD
        +P DD
Subjt:  HPSDD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACGATCCTTCTCTTGTCAACAAAGAAATGGAACAACATTTCATTCTCTCATCACGATCATCAACCGCACGCTTGTTATCGGTTGAAGGAGAAGTGAAAACTATCCA
AAAGGATGTATGTGAGATTAAACACATTCTGGAAACCATCAATGAAAAACTTGAGACGTTGAGTGTGCAACAAACTCCGGTGAGAACTTCTCCTCATCCCCAAACAAGAA
TGAATCAAGAAGTTGGAGTTGAGGACCGAAGAAACACCTATCTGGAAGATAGGCAAGCAGCCCTACCAAGAAGGCTGCAAGAAGTTCATCTAGGCAAAGAAACTTTCAAG
AACCACAGCAAATGCGCCAAAGTCAATCGGAATCCATTATTCCAGCAGCGACATGACCAAATGTTTGACTCCTCAAGTGATGAAGAGGAACAACCGTCGGAATTCGAAGG
TGAAAGGTTTGGTAGGTCTATAGCAGATTATACGAAAGAATTTCATCGATTAGGAGCAAGAACCAATTTGGTGGAAAGTCAACATTACTTAGTTGTAAGATACATTGGCG
GCTTGCACGCTAACATTAAAGAACAGATAGCCTTGCAACCAATAGGATACTTAAATGAAGCTATTTCCACGGCAACCACTATCGAAGAACAGATTGGTAATTGTTTCAAG
AAGCAATATTCAAGAAGAACCACGTGGGAACAAGGAGGAACATCCAAAAAGATGGCTGCTTCCGGAGACAATCTCTCTTCTCCTCTCCAAACGTCAAGCGCAATGAAAGG
TAAACAATCTGAACTTGATCTTGATAAAGGTAAATCAACTGATAATGTGGCAGGAAAGAAGAATAGCAACAGATACAACCGCCCAACATTAGGTAAGTGTTTCCGTTGTG
GGCAAACCAGCCACTTATCTAATGAATGTCTTCAAAGGAAAGTCATTACATTGGTAGAAGAAGAAAATGGTCAAGAAGACAGTGTTAATGATCTTGAAGAAGAGATCGAG
TATACCGAACCAAACGACGGGGAACTAGTTTCTTGTGTTCTTGAGAGAGTTATTCTAACACCTAAATCAGAATTACCCCACCAACGTCATGCTCTTTTCAAGACAAGATG
CACGATCAATGGTAAGATTTGCAACATCATAGTCGATAGTGGAAGTACAGAAAACATTATGGCAAGTAAGTTGATCATGGCTTTGCATTTACCCTTATCTCCTCACCCTG
CACCATATAAGGTGTCTTGGATCAATAAGGGAGATCTAACTAATCTACCTTCCTCTGTTAATCTCAGTCTTGAAGCAGAGGAAATGGCCAACAGAATTCAAGAGCTCCAT
CAGGAAGTTCATGATCACATAGCCAAGTCAAATGAGAAATACAAAGCTACAGCTGACAAAGGACGTCGTTCGAAAGAATTTCAAGTGGGAGATTTGGTCATGATTCATTT
GAGAAAAAGCAGATTCCCTACAGGGACATACTCTAAGCTAAAGAAAAAGAAGCTAGCCCCCTTTCCAATACTTGAACGTTATAGATCCAATTCTTACAAGCTACAACTTC
CAGCAACGTATAACATAAGCCCTGTCTTCAACATTGCTGATTTGTATAATTATCACCCTTCGGATGACTTTACAATATCTACCTAA
mRNA sequenceShow/hide mRNA sequence
ATGAACGATCCTTCTCTTGTCAACAAAGAAATGGAACAACATTTCATTCTCTCATCACGATCATCAACCGCACGCTTGTTATCGGTTGAAGGAGAAGTGAAAACTATCCA
AAAGGATGTATGTGAGATTAAACACATTCTGGAAACCATCAATGAAAAACTTGAGACGTTGAGTGTGCAACAAACTCCGGTGAGAACTTCTCCTCATCCCCAAACAAGAA
TGAATCAAGAAGTTGGAGTTGAGGACCGAAGAAACACCTATCTGGAAGATAGGCAAGCAGCCCTACCAAGAAGGCTGCAAGAAGTTCATCTAGGCAAAGAAACTTTCAAG
AACCACAGCAAATGCGCCAAAGTCAATCGGAATCCATTATTCCAGCAGCGACATGACCAAATGTTTGACTCCTCAAGTGATGAAGAGGAACAACCGTCGGAATTCGAAGG
TGAAAGGTTTGGTAGGTCTATAGCAGATTATACGAAAGAATTTCATCGATTAGGAGCAAGAACCAATTTGGTGGAAAGTCAACATTACTTAGTTGTAAGATACATTGGCG
GCTTGCACGCTAACATTAAAGAACAGATAGCCTTGCAACCAATAGGATACTTAAATGAAGCTATTTCCACGGCAACCACTATCGAAGAACAGATTGGTAATTGTTTCAAG
AAGCAATATTCAAGAAGAACCACGTGGGAACAAGGAGGAACATCCAAAAAGATGGCTGCTTCCGGAGACAATCTCTCTTCTCCTCTCCAAACGTCAAGCGCAATGAAAGG
TAAACAATCTGAACTTGATCTTGATAAAGGTAAATCAACTGATAATGTGGCAGGAAAGAAGAATAGCAACAGATACAACCGCCCAACATTAGGTAAGTGTTTCCGTTGTG
GGCAAACCAGCCACTTATCTAATGAATGTCTTCAAAGGAAAGTCATTACATTGGTAGAAGAAGAAAATGGTCAAGAAGACAGTGTTAATGATCTTGAAGAAGAGATCGAG
TATACCGAACCAAACGACGGGGAACTAGTTTCTTGTGTTCTTGAGAGAGTTATTCTAACACCTAAATCAGAATTACCCCACCAACGTCATGCTCTTTTCAAGACAAGATG
CACGATCAATGGTAAGATTTGCAACATCATAGTCGATAGTGGAAGTACAGAAAACATTATGGCAAGTAAGTTGATCATGGCTTTGCATTTACCCTTATCTCCTCACCCTG
CACCATATAAGGTGTCTTGGATCAATAAGGGAGATCTAACTAATCTACCTTCCTCTGTTAATCTCAGTCTTGAAGCAGAGGAAATGGCCAACAGAATTCAAGAGCTCCAT
CAGGAAGTTCATGATCACATAGCCAAGTCAAATGAGAAATACAAAGCTACAGCTGACAAAGGACGTCGTTCGAAAGAATTTCAAGTGGGAGATTTGGTCATGATTCATTT
GAGAAAAAGCAGATTCCCTACAGGGACATACTCTAAGCTAAAGAAAAAGAAGCTAGCCCCCTTTCCAATACTTGAACGTTATAGATCCAATTCTTACAAGCTACAACTTC
CAGCAACGTATAACATAAGCCCTGTCTTCAACATTGCTGATTTGTATAATTATCACCCTTCGGATGACTTTACAATATCTACCTAA
Protein sequenceShow/hide protein sequence
MNDPSLVNKEMEQHFILSSRSSTARLLSVEGEVKTIQKDVCEIKHILETINEKLETLSVQQTPVRTSPHPQTRMNQEVGVEDRRNTYLEDRQAALPRRLQEVHLGKETFK
NHSKCAKVNRNPLFQQRHDQMFDSSSDEEEQPSEFEGERFGRSIADYTKEFHRLGARTNLVESQHYLVVRYIGGLHANIKEQIALQPIGYLNEAISTATTIEEQIGNCFK
KQYSRRTTWEQGGTSKKMAASGDNLSSPLQTSSAMKGKQSELDLDKGKSTDNVAGKKNSNRYNRPTLGKCFRCGQTSHLSNECLQRKVITLVEEENGQEDSVNDLEEEIE
YTEPNDGELVSCVLERVILTPKSELPHQRHALFKTRCTINGKICNIIVDSGSTENIMASKLIMALHLPLSPHPAPYKVSWINKGDLTNLPSSVNLSLEAEEMANRIQELH
QEVHDHIAKSNEKYKATADKGRRSKEFQVGDLVMIHLRKSRFPTGTYSKLKKKKLAPFPILERYRSNSYKLQLPATYNISPVFNIADLYNYHPSDDFTIST