; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc11g29340 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc11g29340
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr11:21384682..21393640
RNA-Seq ExpressionMoc11g29340
SyntenyMoc11g29340
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022142953.1 uncharacterized protein LOC111012947 [Momordica charantia]5.6e-8137.21Show/hide
Query:  AMRNYVTHAFHNLNSGINNPLPQAAQFELKPVMFQMLQTMGHFGGLTNEDPYSHLKSFIEIANAFQLPGVSEDALRLKMFPFSLRDGARTWLNALEPNSI
        AMR+Y      +LNS + N  P  A FE KP+M QML  +G F GL +EDP SHLKSFI++AN F+LPG+S+DALRL +FPFSL   A  WLNA   ++I
Subjt:  AMRNYVTHAFHNLNSGINNPLPQAAQFELKPVMFQMLQTMGHFGGLTNEDPYSHLKSFIEIANAFQLPGVSEDALRLKMFPFSLRDGARTWLNALEPNSI

Query:  NTWAELTEKFLAKYHTLTRNADLREDIVSFRQKENEAVQEAWERFKELLRRCPSHGLPTCVQIEQFYRGLDRSSRMMLNTAANGSLLEKSVNEIVDILNK
         T +++ +KFL KY   TRNAD+RE+I+SFRQKENEAV  AWERFK+L+R CP+ G+P CVQIE F+R  D  + MMLN AANG    KS NEIV+IL++
Subjt:  NTWAELTEKFLAKYHTLTRNADLREDIVSFRQKENEAVQEAWERFKELLRRCPSHGLPTCVQIEQFYRGLDRSSRMMLNTAANGSLLEKSVNEIVDILNK

Query:  MTDINDQ----------------------------------GEIGRSLPRNK---------------YQL-ESLSQGAQRNFNPYSNSYNPGWRHHPNFS
        +++ NDQ                                   ++ +++ +N                YQ+ ES  Q  Q+ FNPYSN YNPGW+ HPNFS
Subjt:  MTDINDQ----------------------------------GEIGRSLPRNK---------------YQL-ESLSQGAQRNFNPYSNSYNPGWRHHPNFS

Query:  WSNQGVASSSAQASAQQYKQNYTPSGFPTQLA--SQPQQYNQQIKN--------------------PKRDREGKE-------------HCKAVITRSGLS
        WS QG  SSS     QQYKQ YTP  FP   A    PQQYNQQ KN                     K D   KE               K  + R+ ++
Subjt:  WSNQGVASSSAQASAQQYKQNYTPSGFPTQLA--SQPQQYNQQIKN--------------------PKRDREGKE-------------HCKAVITRSGLS

Query:  YEG--------------------PSLPDEGTHVVTPVP------------------------------APPPIHNKKRKQN-------------------
                               PS  +E   +V+P P                              +PPP   +  ++N                   
Subjt:  YEG--------------------PSLPDEGTHVVTPVP------------------------------APPPIHNKKRKQN-------------------

Query:  ---------------------LRKKLGEHETVALTKCSSDALGNPLPVKCKDPGSFTIPYSIGAFNV
                              +KKLGE+ETVALT+CSS+   +  P K KDPGSFTI   IG  +V
Subjt:  ---------------------LRKKLGEHETVALTKCSSDALGNPLPVKCKDPGSFTIPYSIGAFNV

XP_022158314.1 uncharacterized protein LOC111024824 [Momordica charantia]3.0e-8252.6Show/hide
Query:  VAMRNYVTHAFHNLNSGI--NNPLPQAAQFELKPVMFQMLQTMGHFGGLTNEDPYSHLKSFIEIANAFQLPGVSEDALRLKMFPFSLRDGARTWLNALEP
        VA+RNYVTHAFHNLNS +  + P+ +A                       NEDPYSHLKSFIEIANAFQL GVSEDALRLKM                  
Subjt:  VAMRNYVTHAFHNLNSGI--NNPLPQAAQFELKPVMFQMLQTMGHFGGLTNEDPYSHLKSFIEIANAFQLPGVSEDALRLKMFPFSLRDGARTWLNALEP

Query:  NSINTWAELTEKFLAKYHTLTRNADLREDIVSFRQKENEAVQEAWERFKELLRRCPSHGLPTCVQIEQFYRGLDRSSRMMLNTAANGSLLEKSVNEIVDI
                              NADLREDIVSFRQKENEAVQE WERFKELLRRC SHGLPTCVQIEQFYRGLDR SRMMLNTAAN SL EKS++EI+DI
Subjt:  NSINTWAELTEKFLAKYHTLTRNADLREDIVSFRQKENEAVQEAWERFKELLRRCPSHGLPTCVQIEQFYRGLDRSSRMMLNTAANGSLLEKSVNEIVDI

Query:  LNKMTDINDQGEIGRSLPRNK-----YQLESLS------------------------------------------------------------------Q
        LNKMTD NDQGEIGRSLP+ +     ++L++++                                                                  Q
Subjt:  LNKMTDINDQGEIGRSLPRNK-----YQLESLS------------------------------------------------------------------Q

Query:  GAQRNFNPYSNSYNPGWRHHPNFSWSNQGVASSSAQASAQQYKQNYTPSGFPTQLASQPQQYNQQ
         AQRNFNPYSN+Y+P WR+HPNFSWSNQGVASSSAQ  AQQYKQNYTP  FPTQ ASQPQQYNQQ
Subjt:  GAQRNFNPYSNSYNPGWRHHPNFSWSNQGVASSSAQASAQQYKQNYTPSGFPTQLASQPQQYNQQ

XP_022158836.1 uncharacterized protein LOC111025302 [Momordica charantia]5.7e-13477.02Show/hide
Query:  VAMRNYVTHAFHNLNSGINNPLPQAAQFELKPVMFQMLQTMGHFGGLTNEDPYSHLKSFIEIANAFQLPGVSEDALRLKMFPFSLRDGARTWLNALEPNS
        VAMRNYVTHAFHNLNSGINNPLPQAAQFELKPVMFQ+LQTMG FGGLTNEDPYSHLKSFIEIANAFQLPG SEDALRLKMFPFSLRDGARTW+NALEPNS
Subjt:  VAMRNYVTHAFHNLNSGINNPLPQAAQFELKPVMFQMLQTMGHFGGLTNEDPYSHLKSFIEIANAFQLPGVSEDALRLKMFPFSLRDGARTWLNALEPNS

Query:  INTWAELTEKFLAKYHTLTRNADLREDIVSFRQKENEAVQEAWERFKELLRRCPSHGLPTCVQIEQFYRGLDRSSRMMLNTAANGSLLEKSVNEIVDILN
        INTWAELT+KFLAKYHTLT+NADLREDIVSFRQKENEAVQEAWERFKELLRRCPSHGLP+CVQIEQFYRGLDRSS+MMLNT ANGSLLEKSVNEIVD+LN
Subjt:  INTWAELTEKFLAKYHTLTRNADLREDIVSFRQKENEAVQEAWERFKELLRRCPSHGLPTCVQIEQFYRGLDRSSRMMLNTAANGSLLEKSVNEIVDILN

Query:  KMTDINDQGEIGRSLPRNK-----YQLESLS-----------------------------------------------QGAQRNFNPYSNSYNPGWRHHP
        KMTDINDQGE+GRSLP+ +     ++L++++                                               QGAQRNFNPYSN+YNPGWRHHP
Subjt:  KMTDINDQGEIGRSLPRNK-----YQLESLS-----------------------------------------------QGAQRNFNPYSNSYNPGWRHHP

Query:  NFSWSNQGVASSSAQASAQQYK
        NFSWSNQGVASSSAQA AQQYK
Subjt:  NFSWSNQGVASSSAQASAQQYK

XP_022159127.1 uncharacterized protein LOC111025557 [Momordica charantia]9.9e-8660.84Show/hide
Query:  MGHFGGLTNEDPYSHLKSFIEIANAFQLPGVSEDALRLKMFPFSLRDGARTWLNALEPNSINTWAELTEKFLAKYHTLTRNADLREDIVSFRQKENEAVQ
        M  FGG TNEDPYSHLKSFI+IANAFQLPGVSEDALRLKMFPFSLRDGA TW+N LE N I TWAELT+KFLAKYHTLTRNADL+EDIVSFRQ+E+EAVQ
Subjt:  MGHFGGLTNEDPYSHLKSFIEIANAFQLPGVSEDALRLKMFPFSLRDGARTWLNALEPNSINTWAELTEKFLAKYHTLTRNADLREDIVSFRQKENEAVQ

Query:  EAWERFKELLRRCPSHGLPTCVQIEQFYRGLDRSSRMMLNTAANGSLLEKSVNEIVDILNKMTDINDQGEIGRSLPRNK-----YQLESLS---------
        EAWERFKELL+RC SHGLPTCVQI+QFYRGLD   RMM +TAAN SLLEKSVNEI+DILNKM DINDQ E+GRSLP+ +     ++L++++         
Subjt:  EAWERFKELLRRCPSHGLPTCVQIEQFYRGLDRSSRMMLNTAANGSLLEKSVNEIVDILNKMTDINDQGEIGRSLPRNK-----YQLESLS---------

Query:  ----------------------------------------------------------QGAQRNFNPYSNSYNPGWRHHPNFSWSN
                                                                  QG QRNFNPYSN+YNPGWR HPNFS SN
Subjt:  ----------------------------------------------------------QGAQRNFNPYSNSYNPGWRHHPNFSWSN

XP_022159235.1 uncharacterized protein LOC111025653 [Momordica charantia]8.1e-8835.89Show/hide
Query:  AMRNYVTHAFHNLNSGINNPLPQAAQFELKPVMFQMLQTMGHFGGLTNEDPYSHLKSFIEIANAFQLPGVSEDALRLKMFPFSLRDGARTWLNALEPNSI
        AMR+Y      +LNS + N  P  A+FE KP+M QML  +G FGGL +EDP SHLKSFI++AN F+LPG+S+DALRL +FPFS+   A  WLNA   ++I
Subjt:  AMRNYVTHAFHNLNSGINNPLPQAAQFELKPVMFQMLQTMGHFGGLTNEDPYSHLKSFIEIANAFQLPGVSEDALRLKMFPFSLRDGARTWLNALEPNSI

Query:  NTWAELTEKFLAKYHTLTRNADLREDIVSFRQKENEAVQEAWERFKELLRRCPSHGLPTCVQIEQFYRGLDRSSRMMLNTAANGSLLEKSVNEIVDILNK
         TW+++ +KFL KY   TRNAD+RE+I+SFRQKENEAV  AWERFK+L+  CP+ G+P CVQIE F+RG D  ++MMLN AANG    KS NEIV+IL++
Subjt:  NTWAELTEKFLAKYHTLTRNADLREDIVSFRQKENEAVQEAWERFKELLRRCPSHGLPTCVQIEQFYRGLDRSSRMMLNTAANGSLLEKSVNEIVDILNK

Query:  MTDINDQ-----------------------------------------------------------------------GEI--GRSLPRNKYQLESLSQG
        +++ N Q                                                                       G++    + P N   +  + Q 
Subjt:  MTDINDQ-----------------------------------------------------------------------GEI--GRSLPRNKYQLESLSQG

Query:  AQRNFNPYSNSYNPGWRHHPNFSWSNQGVASSSAQASAQQYKQNYTPSGFPTQLA--SQPQQYNQ-----------------------------------
         Q+ FNPYSN+YNPGW+ HPNFSWS QG  SS+     QQYK+ YTP GFP   A    P QYNQ                                   
Subjt:  AQRNFNPYSNSYNPGWRHHPNFSWSNQGVASSSAQASAQQYKQNYTPSGFPTQLA--SQPQQYNQ-----------------------------------

Query:  -------------------------------QIKNPKRDRE--------------GKEHCKAVITRSGLSYEGPSLPDEGTH--------------VVTP
                                       Q+ N  R R               GKEHC ++ TRSGL YEGP +PDE +H              +V P
Subjt:  -------------------------------QIKNPKRDRE--------------GKEHCKAVITRSGLSYEGPSLPDEGTH--------------VVTP

Query:  ---VPAPPPIHN-------------KKRKQNLR------------------------------------KKLGEHETVALTKCSSDALGNPLPVKCKDPG
           VP  P + N             K +  N R                                    KKLGE+ETVALT+CSS+   + +P K KDPG
Subjt:  ---VPAPPPIHN-------------KKRKQNLR------------------------------------KKLGEHETVALTKCSSDALGNPLPVKCKDPG

Query:  SFTIPYSIGAFNV
        SFTIP  IG  +V
Subjt:  SFTIPYSIGAFNV

TrEMBL top hitse value%identityAlignment
A0A6J1CPJ3 uncharacterized protein LOC1110129471.4e-8037.21Show/hide
Query:  AMRNYVTHAFHNLNSGINNPLPQAAQFELKPVMFQMLQTMGHFGGLTNEDPYSHLKSFIEIANAFQLPGVSEDALRLKMFPFSLRDGARTWLNALEPNSI
        AMR+Y      +LNS + N  P  A FE KP+M QML  +G F GL +EDP SHLKSFI++AN F+LPG+S+DALRL +FPFSL   A  WLNA   ++I
Subjt:  AMRNYVTHAFHNLNSGINNPLPQAAQFELKPVMFQMLQTMGHFGGLTNEDPYSHLKSFIEIANAFQLPGVSEDALRLKMFPFSLRDGARTWLNALEPNSI

Query:  NTWAELTEKFLAKYHTLTRNADLREDIVSFRQKENEAVQEAWERFKELLRRCPSHGLPTCVQIEQFYRGLDRSSRMMLNTAANGSLLEKSVNEIVDILNK
         T +++ +KFL KY   TRNAD+RE+I+SFRQKENEAV  AWERFK+L+R CP+ G+P CVQIE F+R  D  + MMLN AANG    KS NEIV+IL++
Subjt:  NTWAELTEKFLAKYHTLTRNADLREDIVSFRQKENEAVQEAWERFKELLRRCPSHGLPTCVQIEQFYRGLDRSSRMMLNTAANGSLLEKSVNEIVDILNK

Query:  MTDINDQ----------------------------------GEIGRSLPRNK---------------YQL-ESLSQGAQRNFNPYSNSYNPGWRHHPNFS
        +++ NDQ                                   ++ +++ +N                YQ+ ES  Q  Q+ FNPYSN YNPGW+ HPNFS
Subjt:  MTDINDQ----------------------------------GEIGRSLPRNK---------------YQL-ESLSQGAQRNFNPYSNSYNPGWRHHPNFS

Query:  WSNQGVASSSAQASAQQYKQNYTPSGFPTQLA--SQPQQYNQQIKN--------------------PKRDREGKE-------------HCKAVITRSGLS
        WS QG  SSS     QQYKQ YTP  FP   A    PQQYNQQ KN                     K D   KE               K  + R+ ++
Subjt:  WSNQGVASSSAQASAQQYKQNYTPSGFPTQLA--SQPQQYNQQIKN--------------------PKRDREGKE-------------HCKAVITRSGLS

Query:  YEG--------------------PSLPDEGTHVVTPVP------------------------------APPPIHNKKRKQN-------------------
                               PS  +E   +V+P P                              +PPP   +  ++N                   
Subjt:  YEG--------------------PSLPDEGTHVVTPVP------------------------------APPPIHNKKRKQN-------------------

Query:  ---------------------LRKKLGEHETVALTKCSSDALGNPLPVKCKDPGSFTIPYSIGAFNV
                              +KKLGE+ETVALT+CSS+   +  P K KDPGSFTI   IG  +V
Subjt:  ---------------------LRKKLGEHETVALTKCSSDALGNPLPVKCKDPGSFTIPYSIGAFNV

A0A6J1DY39 uncharacterized protein LOC1110256533.9e-8835.89Show/hide
Query:  AMRNYVTHAFHNLNSGINNPLPQAAQFELKPVMFQMLQTMGHFGGLTNEDPYSHLKSFIEIANAFQLPGVSEDALRLKMFPFSLRDGARTWLNALEPNSI
        AMR+Y      +LNS + N  P  A+FE KP+M QML  +G FGGL +EDP SHLKSFI++AN F+LPG+S+DALRL +FPFS+   A  WLNA   ++I
Subjt:  AMRNYVTHAFHNLNSGINNPLPQAAQFELKPVMFQMLQTMGHFGGLTNEDPYSHLKSFIEIANAFQLPGVSEDALRLKMFPFSLRDGARTWLNALEPNSI

Query:  NTWAELTEKFLAKYHTLTRNADLREDIVSFRQKENEAVQEAWERFKELLRRCPSHGLPTCVQIEQFYRGLDRSSRMMLNTAANGSLLEKSVNEIVDILNK
         TW+++ +KFL KY   TRNAD+RE+I+SFRQKENEAV  AWERFK+L+  CP+ G+P CVQIE F+RG D  ++MMLN AANG    KS NEIV+IL++
Subjt:  NTWAELTEKFLAKYHTLTRNADLREDIVSFRQKENEAVQEAWERFKELLRRCPSHGLPTCVQIEQFYRGLDRSSRMMLNTAANGSLLEKSVNEIVDILNK

Query:  MTDINDQ-----------------------------------------------------------------------GEI--GRSLPRNKYQLESLSQG
        +++ N Q                                                                       G++    + P N   +  + Q 
Subjt:  MTDINDQ-----------------------------------------------------------------------GEI--GRSLPRNKYQLESLSQG

Query:  AQRNFNPYSNSYNPGWRHHPNFSWSNQGVASSSAQASAQQYKQNYTPSGFPTQLA--SQPQQYNQ-----------------------------------
         Q+ FNPYSN+YNPGW+ HPNFSWS QG  SS+     QQYK+ YTP GFP   A    P QYNQ                                   
Subjt:  AQRNFNPYSNSYNPGWRHHPNFSWSNQGVASSSAQASAQQYKQNYTPSGFPTQLA--SQPQQYNQ-----------------------------------

Query:  -------------------------------QIKNPKRDRE--------------GKEHCKAVITRSGLSYEGPSLPDEGTH--------------VVTP
                                       Q+ N  R R               GKEHC ++ TRSGL YEGP +PDE +H              +V P
Subjt:  -------------------------------QIKNPKRDRE--------------GKEHCKAVITRSGLSYEGPSLPDEGTH--------------VVTP

Query:  ---VPAPPPIHN-------------KKRKQNLR------------------------------------KKLGEHETVALTKCSSDALGNPLPVKCKDPG
           VP  P + N             K +  N R                                    KKLGE+ETVALT+CSS+   + +P K KDPG
Subjt:  ---VPAPPPIHN-------------KKRKQNLR------------------------------------KKLGEHETVALTKCSSDALGNPLPVKCKDPG

Query:  SFTIPYSIGAFNV
        SFTIP  IG  +V
Subjt:  SFTIPYSIGAFNV

A0A6J1DYY9 uncharacterized protein LOC1110255573.7e-8661.19Show/hide
Query:  MGHFGGLTNEDPYSHLKSFIEIANAFQLPGVSEDALRLKMFPFSLRDGARTWLNALEPNSINTWAELTEKFLAKYHTLTRNADLREDIVSFRQKENEAVQ
        M  FGG TNEDPYSHLKSFI+IANAFQLPGVSEDALRLKMFPFSLRDGA TWLN LE N I TWAELT+KFLAKYHTLTRNADL+EDIVSFRQ+E+EAVQ
Subjt:  MGHFGGLTNEDPYSHLKSFIEIANAFQLPGVSEDALRLKMFPFSLRDGARTWLNALEPNSINTWAELTEKFLAKYHTLTRNADLREDIVSFRQKENEAVQ

Query:  EAWERFKELLRRCPSHGLPTCVQIEQFYRGLDRSSRMMLNTAANGSLLEKSVNEIVDILNKMTDINDQGEIGRSLPRNK-----YQLESLS---------
        EAWERFKELL+RC SHGLPTCVQI+QFYRGLD   RMM +TAAN SLLEKSVNEI+DILNKM DINDQ E+GRSLP+ +     ++L++++         
Subjt:  EAWERFKELLRRCPSHGLPTCVQIEQFYRGLDRSSRMMLNTAANGSLLEKSVNEIVDILNKMTDINDQGEIGRSLPRNK-----YQLESLS---------

Query:  ----------------------------------------------------------QGAQRNFNPYSNSYNPGWRHHPNFSWSN
                                                                  QG QRNFNPYSN+YNPGWR HPNFS SN
Subjt:  ----------------------------------------------------------QGAQRNFNPYSNSYNPGWRHHPNFSWSN

A0A6J1DZ19 uncharacterized protein LOC1110248241.4e-8252.6Show/hide
Query:  VAMRNYVTHAFHNLNSGI--NNPLPQAAQFELKPVMFQMLQTMGHFGGLTNEDPYSHLKSFIEIANAFQLPGVSEDALRLKMFPFSLRDGARTWLNALEP
        VA+RNYVTHAFHNLNS +  + P+ +A                       NEDPYSHLKSFIEIANAFQL GVSEDALRLKM                  
Subjt:  VAMRNYVTHAFHNLNSGI--NNPLPQAAQFELKPVMFQMLQTMGHFGGLTNEDPYSHLKSFIEIANAFQLPGVSEDALRLKMFPFSLRDGARTWLNALEP

Query:  NSINTWAELTEKFLAKYHTLTRNADLREDIVSFRQKENEAVQEAWERFKELLRRCPSHGLPTCVQIEQFYRGLDRSSRMMLNTAANGSLLEKSVNEIVDI
                              NADLREDIVSFRQKENEAVQE WERFKELLRRC SHGLPTCVQIEQFYRGLDR SRMMLNTAAN SL EKS++EI+DI
Subjt:  NSINTWAELTEKFLAKYHTLTRNADLREDIVSFRQKENEAVQEAWERFKELLRRCPSHGLPTCVQIEQFYRGLDRSSRMMLNTAANGSLLEKSVNEIVDI

Query:  LNKMTDINDQGEIGRSLPRNK-----YQLESLS------------------------------------------------------------------Q
        LNKMTD NDQGEIGRSLP+ +     ++L++++                                                                  Q
Subjt:  LNKMTDINDQGEIGRSLPRNK-----YQLESLS------------------------------------------------------------------Q

Query:  GAQRNFNPYSNSYNPGWRHHPNFSWSNQGVASSSAQASAQQYKQNYTPSGFPTQLASQPQQYNQQ
         AQRNFNPYSN+Y+P WR+HPNFSWSNQGVASSSAQ  AQQYKQNYTP  FPTQ ASQPQQYNQQ
Subjt:  GAQRNFNPYSNSYNPGWRHHPNFSWSNQGVASSSAQASAQQYKQNYTPSGFPTQLASQPQQYNQQ

A0A6J1E251 uncharacterized protein LOC1110253022.8e-13477.02Show/hide
Query:  VAMRNYVTHAFHNLNSGINNPLPQAAQFELKPVMFQMLQTMGHFGGLTNEDPYSHLKSFIEIANAFQLPGVSEDALRLKMFPFSLRDGARTWLNALEPNS
        VAMRNYVTHAFHNLNSGINNPLPQAAQFELKPVMFQ+LQTMG FGGLTNEDPYSHLKSFIEIANAFQLPG SEDALRLKMFPFSLRDGARTW+NALEPNS
Subjt:  VAMRNYVTHAFHNLNSGINNPLPQAAQFELKPVMFQMLQTMGHFGGLTNEDPYSHLKSFIEIANAFQLPGVSEDALRLKMFPFSLRDGARTWLNALEPNS

Query:  INTWAELTEKFLAKYHTLTRNADLREDIVSFRQKENEAVQEAWERFKELLRRCPSHGLPTCVQIEQFYRGLDRSSRMMLNTAANGSLLEKSVNEIVDILN
        INTWAELT+KFLAKYHTLT+NADLREDIVSFRQKENEAVQEAWERFKELLRRCPSHGLP+CVQIEQFYRGLDRSS+MMLNT ANGSLLEKSVNEIVD+LN
Subjt:  INTWAELTEKFLAKYHTLTRNADLREDIVSFRQKENEAVQEAWERFKELLRRCPSHGLPTCVQIEQFYRGLDRSSRMMLNTAANGSLLEKSVNEIVDILN

Query:  KMTDINDQGEIGRSLPRNK-----YQLESLS-----------------------------------------------QGAQRNFNPYSNSYNPGWRHHP
        KMTDINDQGE+GRSLP+ +     ++L++++                                               QGAQRNFNPYSN+YNPGWRHHP
Subjt:  KMTDINDQGEIGRSLPRNK-----YQLESLS-----------------------------------------------QGAQRNFNPYSNSYNPGWRHHP

Query:  NFSWSNQGVASSSAQASAQQYK
        NFSWSNQGVASSSAQA AQQYK
Subjt:  NFSWSNQGVASSSAQASAQQYK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCATGGGAGTTCGGTAGCCGTTAGCCCTTTAAACCCTTGTTATGAAGAATTTACGTTTGACTCTTTAGAATTCTTTGACCCAGTTGGGCTTCCTTCTCCCGTACTTTC
ATTAGACGTAGGAACCTCGAGGGTTGGGGGACATACCGATGAGTGTTCCTTTGCTCAGGCGGTGGTTGAGAGAGCGGCTGCTGGGGTAGCGGACAGTGAGTTTTCAGGTA
GAACTTCAGACGAACCTGAGAGTGTATATTTGAGTTGTGTCGATCAGTACTCATTTGCGGGGCCCGCCAGCATTCGCTCCACCATTACTCAGAGTACTCTGGACTTTTAT
CGTTTTACTTATGAAATTCCAGAGAACATTGTTATGCGCGTGTCGGGTCCTCATGAGCGGATTTGCAGCCCACCTCCAGGTGAAACAGTTTTGTATACTGCCATGTTTAG
TCATGGTTTGCGGATTCCTCTTCATCCATGTGTGCAGCACTTTTTGCGTGATGTTGGCTTAGCTCCCACCCAGCTGGTCCCTAATGGGTGGAATCTATTACTGGCATGTT
TCATAGCTTGGCAATACGGCTGCGGAGGAGGCTTGACCGTAGAGACTTCTGCTATTCTGCACACCTTTGTTTTGAAGGTTGTTCCTCATCGGGAAAACCAATATTTTTGT
TTATCAAAGAGGAAGGAGAAGAAGCCCTTAGTGGAGTCAGAAAGGGCTGGAAAGCAGAAGACCTCTAATATGGCTTCTAGTCGTCGTGATAGACAGAAAGTTACTATGTC
TAGCTCTAATAAAAACTGGAAAAGTGAGTGGTTTTCTATTAGTGGGGATTGGCTTTCCACTAGGGGAGGTATTTATGATATCCCGACTGATTTCAGGGCTCCTGAAGGCA
GTTTGCGTCACCACGGCCTATCAGATTTGACCCGACCTCTCTCCGAGGCTCAGAGTCGACTTGTTAGTGAGCCTCTATCTCCTAGTCATATTAGCGATTCCACTTTAGGA
CCTTACCAAGCACCATTGCTACCGGAAGAGATTGATAGGGTTGGGAGACATATTCAGGAGAAGGTGCAGAGGGGGATTCTTCCTGGCTCATCCTCCGATGGCGACCCTAA
TCGGGACATTTGTCATGCTTTGCGTGAGATACACAAGCATTGCACATTGGTTGGTTCATCGGTTTATCGTATGGGAACCGAGCTGTCTTATTTGCAGCAGCAGAAGATTG
GGGTCGATATAGCTTTGGAAGAATCTCGAGAACTTACGCGCCAACTTAGCCGACAGAAGGCAGAGCTCAAGAAAACGCTTGGTCAGATACAATCTGAGCATGACCGTGCG
ATGGCAGAAATTGAGGCTATGAGGGAGAATCTGACTAGACATCAGACGGAGTCAGATTGGGCTAAGGTCGAGCTTGCAAAATGCAAGGCCGAGCTCGAGGCTTTACATAC
TGATTGGCGAGAGCCTAGCAGACTGGAGGCCGCGATGCGAGAGGCAGCTCTAGAGGCGGAGTTGGTTCGACCAGGCGAGTCTAGTCATCCTGAGGTTGTGGTGGTCGAGG
CAGGTGACGTGGCGACATCTACCGCGGGTCCTACCAGTTCTTCCACTCGTCTAAATGACCTGGCCTCTCAGTTGGATGATACTATGGATACACAGGGTGAGACTTGGGAC
GATGCTTCAGGGAGGGATGAGGTGTGCTGGGGAGGAGCAAAAATGACGGATGTTGAGTTGCTCGCAGAAATCTTTAAAGTTGGCGTTATCAAACTACTTGCCGTTATCGG
AGATGCAGCTTCCTCTTGGTCCAGACACTTTAGCAGGGGTTGTGAATATCCTCGTTTGTACAATACATCATGGATCATGTATTGGGCTGCTTGTCGCTGTAATCGACGGG
TCTCGATCATGTTGGTTACCCTAGCCTCTGCAGCAACCAAAAGTGGTAGTGAAAGAGTAGAACTCAAATCCCAAGAAAAGTCAGGAATTGCGCCTGGTGCATTTTCCCAA
CATAGTGTGTTTTTCATGTTTTGCATCAAAACTAAGATAATCGAGATTGTAGCCATGCGGAATTATGTCACTCATGCGTTCCACAACCTAAATTCAGGGATAAATAATCC
TTTACCCCAAGCCGCACAGTTCGAGCTGAAGCCAGTCATGTTCCAGATGTTACAGACGATGGGCCACTTCGGAGGATTGACTAATGAAGATCCTTACTCCCATCTCAAAT
CCTTTATTGAAATAGCTAATGCATTTCAACTTCCTGGTGTTTCTGAGGATGCACTAAGATTAAAAATGTTTCCTTTTTCTCTCAGGGATGGTGCAAGGACTTGGCTAAAC
GCGTTAGAACCAAATTCTATCAACACATGGGCAGAACTGACGGAGAAATTTTTGGCCAAGTACCACACTTTGACCAGGAACGCAGACCTCCGAGAGGACATTGTGTCTTT
TAGACAGAAGGAGAACGAAGCAGTTCAAGAAGCTTGGGAGCGTTTTAAGGAGTTACTTAGAAGATGCCCGAGCCATGGATTGCCCACATGTGTGCAGATTGAACAATTCT
ATAGAGGATTGGATCGTTCATCAAGGATGATGTTGAACACTGCAGCCAATGGCTCGTTGTTAGAGAAGTCGGTAAATGAGATCGTTGATATCTTGAATAAGATGACAGAC
ATTAATGACCAAGGCGAAATAGGAAGGTCATTACCAAGAAACAAGTATCAGCTGGAATCTTTGAGTCAAGGTGCCCAGCGGAATTTCAACCCGTATTCAAACTCTTACAA
CCCTGGATGGAGGCACCATCCAAACTTTTCCTGGAGTAACCAAGGAGTAGCTAGTAGCAGTGCACAAGCATCCGCTCAACAATACAAGCAAAACTATACTCCTTCTGGTT
TTCCAACTCAACTGGCGTCGCAGCCTCAACAATATAATCAGCAAATAAAAAACCCAAAGCGAGATCGTGAGGGAAAGGAGCATTGTAAGGCGGTTATCACAAGAAGCGGA
TTAAGTTATGAAGGACCTTCACTTCCAGACGAAGGAACTCATGTAGTTACACCTGTTCCTGCACCACCTCCAATCCACAACAAGAAGAGAAAGCAGAACCTGCGTAAGAA
GTTAGGTGAGCATGAGACGGTAGCCTTGACAAAGTGTAGTAGTGATGCTCTAGGGAATCCATTGCCTGTTAAATGTAAGGACCCAGGAAGTTTTACCATCCCCTACTCAA
TAGGAGCCTTCAATGTCCTCGATGCGATGCATCTCCCGGATGAAGTCGAGGAGTGCTTTACAATAGGAGCAATCATGGAGGAACTCCAGCAAATGATGGTGGAAGACTTG
GAAGCAAATTTGGAGGCTGCAGAAAAAGAATCCAAAATTGCGCCTGACGCAATTTTGCCCCAACTTGAGCGTTTTGAGTTTTTGCAGCAGACAATAGCGGATTTGAAGGC
CTTGCAACCTTCCATCATTGAACCTCCATAA
mRNA sequenceShow/hide mRNA sequence
ATGCATGGGAGTTCGGTAGCCGTTAGCCCTTTAAACCCTTGTTATGAAGAATTTACGTTTGACTCTTTAGAATTCTTTGACCCAGTTGGGCTTCCTTCTCCCGTACTTTC
ATTAGACGTAGGAACCTCGAGGGTTGGGGGACATACCGATGAGTGTTCCTTTGCTCAGGCGGTGGTTGAGAGAGCGGCTGCTGGGGTAGCGGACAGTGAGTTTTCAGGTA
GAACTTCAGACGAACCTGAGAGTGTATATTTGAGTTGTGTCGATCAGTACTCATTTGCGGGGCCCGCCAGCATTCGCTCCACCATTACTCAGAGTACTCTGGACTTTTAT
CGTTTTACTTATGAAATTCCAGAGAACATTGTTATGCGCGTGTCGGGTCCTCATGAGCGGATTTGCAGCCCACCTCCAGGTGAAACAGTTTTGTATACTGCCATGTTTAG
TCATGGTTTGCGGATTCCTCTTCATCCATGTGTGCAGCACTTTTTGCGTGATGTTGGCTTAGCTCCCACCCAGCTGGTCCCTAATGGGTGGAATCTATTACTGGCATGTT
TCATAGCTTGGCAATACGGCTGCGGAGGAGGCTTGACCGTAGAGACTTCTGCTATTCTGCACACCTTTGTTTTGAAGGTTGTTCCTCATCGGGAAAACCAATATTTTTGT
TTATCAAAGAGGAAGGAGAAGAAGCCCTTAGTGGAGTCAGAAAGGGCTGGAAAGCAGAAGACCTCTAATATGGCTTCTAGTCGTCGTGATAGACAGAAAGTTACTATGTC
TAGCTCTAATAAAAACTGGAAAAGTGAGTGGTTTTCTATTAGTGGGGATTGGCTTTCCACTAGGGGAGGTATTTATGATATCCCGACTGATTTCAGGGCTCCTGAAGGCA
GTTTGCGTCACCACGGCCTATCAGATTTGACCCGACCTCTCTCCGAGGCTCAGAGTCGACTTGTTAGTGAGCCTCTATCTCCTAGTCATATTAGCGATTCCACTTTAGGA
CCTTACCAAGCACCATTGCTACCGGAAGAGATTGATAGGGTTGGGAGACATATTCAGGAGAAGGTGCAGAGGGGGATTCTTCCTGGCTCATCCTCCGATGGCGACCCTAA
TCGGGACATTTGTCATGCTTTGCGTGAGATACACAAGCATTGCACATTGGTTGGTTCATCGGTTTATCGTATGGGAACCGAGCTGTCTTATTTGCAGCAGCAGAAGATTG
GGGTCGATATAGCTTTGGAAGAATCTCGAGAACTTACGCGCCAACTTAGCCGACAGAAGGCAGAGCTCAAGAAAACGCTTGGTCAGATACAATCTGAGCATGACCGTGCG
ATGGCAGAAATTGAGGCTATGAGGGAGAATCTGACTAGACATCAGACGGAGTCAGATTGGGCTAAGGTCGAGCTTGCAAAATGCAAGGCCGAGCTCGAGGCTTTACATAC
TGATTGGCGAGAGCCTAGCAGACTGGAGGCCGCGATGCGAGAGGCAGCTCTAGAGGCGGAGTTGGTTCGACCAGGCGAGTCTAGTCATCCTGAGGTTGTGGTGGTCGAGG
CAGGTGACGTGGCGACATCTACCGCGGGTCCTACCAGTTCTTCCACTCGTCTAAATGACCTGGCCTCTCAGTTGGATGATACTATGGATACACAGGGTGAGACTTGGGAC
GATGCTTCAGGGAGGGATGAGGTGTGCTGGGGAGGAGCAAAAATGACGGATGTTGAGTTGCTCGCAGAAATCTTTAAAGTTGGCGTTATCAAACTACTTGCCGTTATCGG
AGATGCAGCTTCCTCTTGGTCCAGACACTTTAGCAGGGGTTGTGAATATCCTCGTTTGTACAATACATCATGGATCATGTATTGGGCTGCTTGTCGCTGTAATCGACGGG
TCTCGATCATGTTGGTTACCCTAGCCTCTGCAGCAACCAAAAGTGGTAGTGAAAGAGTAGAACTCAAATCCCAAGAAAAGTCAGGAATTGCGCCTGGTGCATTTTCCCAA
CATAGTGTGTTTTTCATGTTTTGCATCAAAACTAAGATAATCGAGATTGTAGCCATGCGGAATTATGTCACTCATGCGTTCCACAACCTAAATTCAGGGATAAATAATCC
TTTACCCCAAGCCGCACAGTTCGAGCTGAAGCCAGTCATGTTCCAGATGTTACAGACGATGGGCCACTTCGGAGGATTGACTAATGAAGATCCTTACTCCCATCTCAAAT
CCTTTATTGAAATAGCTAATGCATTTCAACTTCCTGGTGTTTCTGAGGATGCACTAAGATTAAAAATGTTTCCTTTTTCTCTCAGGGATGGTGCAAGGACTTGGCTAAAC
GCGTTAGAACCAAATTCTATCAACACATGGGCAGAACTGACGGAGAAATTTTTGGCCAAGTACCACACTTTGACCAGGAACGCAGACCTCCGAGAGGACATTGTGTCTTT
TAGACAGAAGGAGAACGAAGCAGTTCAAGAAGCTTGGGAGCGTTTTAAGGAGTTACTTAGAAGATGCCCGAGCCATGGATTGCCCACATGTGTGCAGATTGAACAATTCT
ATAGAGGATTGGATCGTTCATCAAGGATGATGTTGAACACTGCAGCCAATGGCTCGTTGTTAGAGAAGTCGGTAAATGAGATCGTTGATATCTTGAATAAGATGACAGAC
ATTAATGACCAAGGCGAAATAGGAAGGTCATTACCAAGAAACAAGTATCAGCTGGAATCTTTGAGTCAAGGTGCCCAGCGGAATTTCAACCCGTATTCAAACTCTTACAA
CCCTGGATGGAGGCACCATCCAAACTTTTCCTGGAGTAACCAAGGAGTAGCTAGTAGCAGTGCACAAGCATCCGCTCAACAATACAAGCAAAACTATACTCCTTCTGGTT
TTCCAACTCAACTGGCGTCGCAGCCTCAACAATATAATCAGCAAATAAAAAACCCAAAGCGAGATCGTGAGGGAAAGGAGCATTGTAAGGCGGTTATCACAAGAAGCGGA
TTAAGTTATGAAGGACCTTCACTTCCAGACGAAGGAACTCATGTAGTTACACCTGTTCCTGCACCACCTCCAATCCACAACAAGAAGAGAAAGCAGAACCTGCGTAAGAA
GTTAGGTGAGCATGAGACGGTAGCCTTGACAAAGTGTAGTAGTGATGCTCTAGGGAATCCATTGCCTGTTAAATGTAAGGACCCAGGAAGTTTTACCATCCCCTACTCAA
TAGGAGCCTTCAATGTCCTCGATGCGATGCATCTCCCGGATGAAGTCGAGGAGTGCTTTACAATAGGAGCAATCATGGAGGAACTCCAGCAAATGATGGTGGAAGACTTG
GAAGCAAATTTGGAGGCTGCAGAAAAAGAATCCAAAATTGCGCCTGACGCAATTTTGCCCCAACTTGAGCGTTTTGAGTTTTTGCAGCAGACAATAGCGGATTTGAAGGC
CTTGCAACCTTCCATCATTGAACCTCCATAA
Protein sequenceShow/hide protein sequence
MHGSSVAVSPLNPCYEEFTFDSLEFFDPVGLPSPVLSLDVGTSRVGGHTDECSFAQAVVERAAAGVADSEFSGRTSDEPESVYLSCVDQYSFAGPASIRSTITQSTLDFY
RFTYEIPENIVMRVSGPHERICSPPPGETVLYTAMFSHGLRIPLHPCVQHFLRDVGLAPTQLVPNGWNLLLACFIAWQYGCGGGLTVETSAILHTFVLKVVPHRENQYFC
LSKRKEKKPLVESERAGKQKTSNMASSRRDRQKVTMSSSNKNWKSEWFSISGDWLSTRGGIYDIPTDFRAPEGSLRHHGLSDLTRPLSEAQSRLVSEPLSPSHISDSTLG
PYQAPLLPEEIDRVGRHIQEKVQRGILPGSSSDGDPNRDICHALREIHKHCTLVGSSVYRMGTELSYLQQQKIGVDIALEESRELTRQLSRQKAELKKTLGQIQSEHDRA
MAEIEAMRENLTRHQTESDWAKVELAKCKAELEALHTDWREPSRLEAAMREAALEAELVRPGESSHPEVVVVEAGDVATSTAGPTSSSTRLNDLASQLDDTMDTQGETWD
DASGRDEVCWGGAKMTDVELLAEIFKVGVIKLLAVIGDAASSWSRHFSRGCEYPRLYNTSWIMYWAACRCNRRVSIMLVTLASAATKSGSERVELKSQEKSGIAPGAFSQ
HSVFFMFCIKTKIIEIVAMRNYVTHAFHNLNSGINNPLPQAAQFELKPVMFQMLQTMGHFGGLTNEDPYSHLKSFIEIANAFQLPGVSEDALRLKMFPFSLRDGARTWLN
ALEPNSINTWAELTEKFLAKYHTLTRNADLREDIVSFRQKENEAVQEAWERFKELLRRCPSHGLPTCVQIEQFYRGLDRSSRMMLNTAANGSLLEKSVNEIVDILNKMTD
INDQGEIGRSLPRNKYQLESLSQGAQRNFNPYSNSYNPGWRHHPNFSWSNQGVASSSAQASAQQYKQNYTPSGFPTQLASQPQQYNQQIKNPKRDREGKEHCKAVITRSG
LSYEGPSLPDEGTHVVTPVPAPPPIHNKKRKQNLRKKLGEHETVALTKCSSDALGNPLPVKCKDPGSFTIPYSIGAFNVLDAMHLPDEVEECFTIGAIMEELQQMMVEDL
EANLEAAEKESKIAPDAILPQLERFEFLQQTIADLKALQPSIIEPP