; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g16230 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g16230
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionINVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; BEST Arabidopsis thaliana protein match is: my s in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink).
Genome locationchr8:12470747..12474219
RNA-Seq ExpressionMoc08g16230
SyntenyMoc08g16230
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022142326.1 uncharacterized protein LOC111012467 [Momordica charantia]5.1e-13961.1Show/hide
Query:  SGRPLFDVPARFGNL-----VSIKPIPELTQATFDTLKYYKDNFPRGRKIGTLVTDTLMLESGLLDYNPLVRPSEASRPNSELAMVCGFTSNVKRKSKGS
        S +PL DV A+  ++     +SIKPIPEL QATFDTLK+YKDNFPRGRKIGTLVTD L+LESGLLDYNPLVRP EASRPNSELAMVCGFTS+VKRKSKG 
Subjt:  SGRPLFDVPARFGNL-----VSIKPIPELTQATFDTLKYYKDNFPRGRKIGTLVTDTLMLESGLLDYNPLVRPSEASRPNSELAMVCGFTSNVKRKSKGS

Query:  AHALKTVQSSDPATPAVDQHAAQDQAGPSSEVPTPVIELDSTGSSPGRSARGASPKHWTCHRFVSHADLVDDPKARMGGTFDVKMRFRVEPSSSEVKNQV
        AHALK VQSSDP TPAVDQ+AAQDQAGPSS  PTPVIELDSTG                                          R R + S SE     
Subjt:  AHALKTVQSSDPATPAVDQHAAQDQAGPSSEVPTPVIELDSTGSSPGRSARGASPKHWTCHRFVSHADLVDDPKARMGGTFDVKMRFRVEPSSSEVKNQV

Query:  SRISAACLDRCLRRASKFVSDPGSVLQRTIDHAIEAFTASIHSAIMIKAELDGREALAAKERANSSTALEAATTLKGKLLKARGEVDVLRAEVEANAELL
                                                              EAL                           +V  LR   EA AELL
Subjt:  SRISAACLDRCLRRASKFVSDPGSVLQRTIDHAIEAFTASIHSAIMIKAELDGREALAAKERANSSTALEAATTLKGKLLKARGEVDVLRAEVEANAELL

Query:  KRGDEKHKAHLRAAHAITKGLEKEKFQLLKEKDDMLQAFEGNDATIGRLTAELKAEKERLSNGTLLEAAFRQHPDFDGFAKDFSDAGFKFLMKGIAADMP
        KR DE+HKAHLRAAHAITKGLEKEKFQLLKEKDDMLQA E  DA IGRL AELKAEKERL+NG LLEAAFRQHPDFDGFAKDFSDAGFKFLMKGIAAD+P
Subjt:  KRGDEKHKAHLRAAHAITKGLEKEKFQLLKEKDDMLQAFEGNDATIGRLTAELKAEKERLSNGTLLEAAFRQHPDFDGFAKDFSDAGFKFLMKGIAADMP

Query:  HLQLDLGNLKKRYAEKWASGPNGTPGPQSLVDKYVRELDSDYSDVEEEDAPSQEPTEVGTTQEEAPSQQGGSQEVNLLGSQGELSSHLGSS
        HL++DLG+LKKRYAEKWASGPNGT GP SLVDKYVR+LDSDYSD++E++ PSQEPTEVGTTQE  PSQQ GSQEVNLLGSQGELSSHLGSS
Subjt:  HLQLDLGNLKKRYAEKWASGPNGTPGPQSLVDKYVRELDSDYSDVEEEDAPSQEPTEVGTTQEEAPSQQGGSQEVNLLGSQGELSSHLGSS

XP_022147182.1 uncharacterized protein LOC111016193 [Momordica charantia]1.4e-12582.93Show/hide
Query:  MRFRVEPSSSEVKNQVSRISAACLDRCLRRASKFVSDPGSVLQRTIDHAIEAFTASIHSAIMIKAELDGREALAAKERANSSTALEAATTLKGKLLKARG
        MRFR+E SSS VK+QVSRISA CLDRCLRRAS+FVSDPGSVLQRTID+A EAF ASIHSA+M+KAELDGREAL AKER N ST LEAATTLKG+LLKA+G
Subjt:  MRFRVEPSSSEVKNQVSRISAACLDRCLRRASKFVSDPGSVLQRTIDHAIEAFTASIHSAIMIKAELDGREALAAKERANSSTALEAATTLKGKLLKARG

Query:  EVDVLRAEVEANAELLKRGDEKHKAHLRAAHAITKGLEKEKFQLLKEKDDMLQAFEGNDATIGRLTAELKAEKERLSNGTLLEAAFRQHPDFDGFAKDFS
        EVD+LRAEV+A  +LLK+  EKHKAHLRAAHAITKGLEKEKFQLLKEKDD+ Q  E  DA+IGRLT ELK  KERL++G LLE +FRQHP+FDGFAKDFS
Subjt:  EVDVLRAEVEANAELLKRGDEKHKAHLRAAHAITKGLEKEKFQLLKEKDDMLQAFEGNDATIGRLTAELKAEKERLSNGTLLEAAFRQHPDFDGFAKDFS

Query:  DAGFKFLMKGIAADMPHLQLDLGNLKKRYAEKWASGPNGTPGPQSLVDKYVRELDSDYSDVEEEDAPSQEPTEVGTTQEEAPSQQGG
        DAGFKFLMKGIAADMPHLQ+DL +LKKRY+E WASGPNGTPGPQSLVDKYVRELDSDYSD+EEEDAPSQEPT+VGTTQEEAPSQ GG
Subjt:  DAGFKFLMKGIAADMPHLQLDLGNLKKRYAEKWASGPNGTPGPQSLVDKYVRELDSDYSDVEEEDAPSQEPTEVGTTQEEAPSQQGG

XP_022152119.1 uncharacterized protein LOC111019909 [Momordica charantia]1.1e-14185.08Show/hide
Query:  MGGTFDVKMRFRVEPSSSEVKNQVSRISAACLDRCLRRASKFVSDPGSVLQRTIDHAIEAFTASIHSAIMIKAELDGREALAAKERANSSTALEAATTLK
        MGGTFDV+ RFR+EPSSS VK+QVSRISA CLDRCL+RASKFVSDPGSVLQRTID+A EAF ASIHSAIM+KAELDGREALAAKER NSS ALEAATTLK
Subjt:  MGGTFDVKMRFRVEPSSSEVKNQVSRISAACLDRCLRRASKFVSDPGSVLQRTIDHAIEAFTASIHSAIMIKAELDGREALAAKERANSSTALEAATTLK

Query:  GKLLKARGEVDVLRAEVEANAELLKRGDEKHKAHLRAAHAITKGLEKEKFQLLKEKDDMLQAFEGNDATIGRLTAELKAEKERLSNGTLLEAAFRQHPDF
        G+LLKA+GEV +LRAEV+A AELLK+  EKHKAHLRAAHAITKGLEKEKFQLLKEKDD+ Q  EG D +IGRLTAELK  KERL+NG+LLE +FRQH DF
Subjt:  GKLLKARGEVDVLRAEVEANAELLKRGDEKHKAHLRAAHAITKGLEKEKFQLLKEKDDMLQAFEGNDATIGRLTAELKAEKERLSNGTLLEAAFRQHPDF

Query:  DGFAKDFSDAGFKFLMKGIAADMPHLQLDLGNLKKRYAEKWASGPNGTPGPQSLVDKYVRELDSDYSDVEEEDAPSQEPTEVGTTQEEAPSQQGGSQEVN
        DGFAKDFSDAGFKFLMKGIAADMPHLQ+DL NLKK+Y+EKWASGPNGTPGPQSLV KYVRELDSDYSD+EEEDAPSQEP E+GTTQEE PSQQ GSQEVN
Subjt:  DGFAKDFSDAGFKFLMKGIAADMPHLQLDLGNLKKRYAEKWASGPNGTPGPQSLVDKYVRELDSDYSDVEEEDAPSQEPTEVGTTQEEAPSQQGGSQEVN

Query:  LLGSQGELSSHLGSS
        LLGS+GELSSHLGSS
Subjt:  LLGSQGELSSHLGSS

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]7.1e-13372.43Show/hide
Query:  LARRLESELEEIENF---------------------------------RGFKIPNNILLRIPEEGERADNPPEGWVTLYLKIFEYGLRLPFHPFAQEFLN
        LARRLES+LEEIEN                                  RGF IP NILLR+PEEGERADNPPEGWVTLY K+FEYGLRLP HPF QEFL 
Subjt:  LARRLESELEEIENF---------------------------------RGFKIPNNILLRIPEEGERADNPPEGWVTLYLKIFEYGLRLPFHPFAQEFLN

Query:  RTGLAPAQVAPNGWGVIFALAILFWLRARDEDEAELLSVHQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTFIKGWVGKWFFASGEWLAKDESGRPL
        RTGLAPAQVAPNGWGVIFALAILFWLRARD +EAEL  V QLL CFEAKRIAKKPGR+YMCARKGAGGIVKGPT IKGWV KWF+ASGEWLAKDESGR  
Subjt:  RTGLAPAQVAPNGWGVIFALAILFWLRARDEDEAELLSVHQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTFIKGWVGKWFFASGEWLAKDESGRPL

Query:  FDVPARFGNLVSIKPIPELTQATFDTLKYYKDNFPRGRKIGTLVTDTLMLESGLLDYNPLVRPSEASRPNSELAMVCGFTSNVKRKSKGSAHALKTVQSS
        FDVP RFGNLVSI+P+PELTQA+FDTLKYYK+ FPRGRK+GTLVTD L+LESGLLDYNP VRP E+SRPNSELAMVCGF S VKRKSKG AHAL+  QSS
Subjt:  FDVPARFGNLVSIKPIPELTQATFDTLKYYKDNFPRGRKIGTLVTDTLMLESGLLDYNPLVRPSEASRPNSELAMVCGFTSNVKRKSKGSAHALKTVQSS

Query:  DPATPAVDQHAAQDQAGPSSEVPTPVIELDSTGSSPGRSAR
         PATPAV         GP+SE P  VIEL+S+G  P R  R
Subjt:  DPATPAVDQHAAQDQAGPSSEVPTPVIELDSTGSSPGRSAR

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]1.1e-21375.47Show/hide
Query:  MCARKGAGGIVKGPTFIKGWVGKWFFASGEWLAKDESGRPLFDVPARFGNLVSIKPIPELTQATFDTLKYYKDNFPRGRKIGTLVTDTLMLESGLLDYNP
        MCARKG GGIVKGPT IKGWVGKWFFASGEWLAKDESGR  FDVP RFGNLVSIK IPEL QATFDTLK+YKD+FPR RKI TLVTD L+LESGLLDYNP
Subjt:  MCARKGAGGIVKGPTFIKGWVGKWFFASGEWLAKDESGRPLFDVPARFGNLVSIKPIPELTQATFDTLKYYKDNFPRGRKIGTLVTDTLMLESGLLDYNP

Query:  LVRPSEASRPNSELAMVCGFTSNVKRKSKGSAHALKTVQSSDPATPAVDQHAAQDQAGPSSEVPTPVIELDSTGS----------------SPGRSARGA
        LVR  EASRPNSELAMVCGFT +VKRKSKG AHALKTV  ++P TP V +  AQ  +GPSS VPTPVIELD +G                 SP    RG 
Subjt:  LVRPSEASRPNSELAMVCGFTSNVKRKSKGSAHALKTVQSSDPATPAVDQHAAQDQAGPSSEVPTPVIELDSTGS----------------SPGRSARGA

Query:  SP------KHWTCHRF---------VSHADLVDDPKARMGGTFDVKMRFRVEPSSSEVKNQVSRISAACLDRCLRRASKFVSDPGSVLQRTIDHAIEAFT
        SP      K  T              SHADLVDDP+ARM GT +V+MRF +EPSSS VK+QVSRISA CLDR LRRASKFVSDPGSVLQRTID+  EAF 
Subjt:  SP------KHWTCHRF---------VSHADLVDDPKARMGGTFDVKMRFRVEPSSSEVKNQVSRISAACLDRCLRRASKFVSDPGSVLQRTIDHAIEAFT

Query:  ASIHSAIMIKAELDGREALAAKERANSSTALEAATTLKGKLLKARGEVDVLRAEVEANAELLKRGDEKHKAHLRAAHAITKGLEKEKFQLLKEKDDMLQA
        ASIH A+M+KAELDGREALAAKER NS  ALEAATTLKG+LLKA+GEVD+LRAEV+A  +LLK+  EKHKAHLRAAHAITKGLEKEKFQLLKEKDD+ Q 
Subjt:  ASIHSAIMIKAELDGREALAAKERANSSTALEAATTLKGKLLKARGEVDVLRAEVEANAELLKRGDEKHKAHLRAAHAITKGLEKEKFQLLKEKDDMLQA

Query:  FEGNDATIGRLTAELKAEKERLSNGTLLEAAFRQHPDFDGFAKDFSDAGFKFLMKGIAADMPHLQLDLGNLKKRYAEKWASGPNGTPGPQSLVDKYVREL
         E  DA+IGRLT ELK  KERL+NGTLLE +FRQHPDFDGFAKDFSDAGFKFLMKGIAADMPHLQ+DL  LKK+Y+EKWASGPNGTP PQSLVDKYVREL
Subjt:  FEGNDATIGRLTAELKAEKERLSNGTLLEAAFRQHPDFDGFAKDFSDAGFKFLMKGIAADMPHLQLDLGNLKKRYAEKWASGPNGTPGPQSLVDKYVREL

Query:  DSDYSDVEEEDAPSQEPTEVGTTQEEAPSQQGGS
        DSDYSD+EEEDAPSQEP EVGTTQEE PSQQGGS
Subjt:  DSDYSDVEEEDAPSQEPTEVGTTQEEAPSQQGGS

TrEMBL top hitse value%identityAlignment
A0A6J1CLV1 uncharacterized protein LOC1110124672.5e-13961.1Show/hide
Query:  SGRPLFDVPARFGNL-----VSIKPIPELTQATFDTLKYYKDNFPRGRKIGTLVTDTLMLESGLLDYNPLVRPSEASRPNSELAMVCGFTSNVKRKSKGS
        S +PL DV A+  ++     +SIKPIPEL QATFDTLK+YKDNFPRGRKIGTLVTD L+LESGLLDYNPLVRP EASRPNSELAMVCGFTS+VKRKSKG 
Subjt:  SGRPLFDVPARFGNL-----VSIKPIPELTQATFDTLKYYKDNFPRGRKIGTLVTDTLMLESGLLDYNPLVRPSEASRPNSELAMVCGFTSNVKRKSKGS

Query:  AHALKTVQSSDPATPAVDQHAAQDQAGPSSEVPTPVIELDSTGSSPGRSARGASPKHWTCHRFVSHADLVDDPKARMGGTFDVKMRFRVEPSSSEVKNQV
        AHALK VQSSDP TPAVDQ+AAQDQAGPSS  PTPVIELDSTG                                          R R + S SE     
Subjt:  AHALKTVQSSDPATPAVDQHAAQDQAGPSSEVPTPVIELDSTGSSPGRSARGASPKHWTCHRFVSHADLVDDPKARMGGTFDVKMRFRVEPSSSEVKNQV

Query:  SRISAACLDRCLRRASKFVSDPGSVLQRTIDHAIEAFTASIHSAIMIKAELDGREALAAKERANSSTALEAATTLKGKLLKARGEVDVLRAEVEANAELL
                                                              EAL                           +V  LR   EA AELL
Subjt:  SRISAACLDRCLRRASKFVSDPGSVLQRTIDHAIEAFTASIHSAIMIKAELDGREALAAKERANSSTALEAATTLKGKLLKARGEVDVLRAEVEANAELL

Query:  KRGDEKHKAHLRAAHAITKGLEKEKFQLLKEKDDMLQAFEGNDATIGRLTAELKAEKERLSNGTLLEAAFRQHPDFDGFAKDFSDAGFKFLMKGIAADMP
        KR DE+HKAHLRAAHAITKGLEKEKFQLLKEKDDMLQA E  DA IGRL AELKAEKERL+NG LLEAAFRQHPDFDGFAKDFSDAGFKFLMKGIAAD+P
Subjt:  KRGDEKHKAHLRAAHAITKGLEKEKFQLLKEKDDMLQAFEGNDATIGRLTAELKAEKERLSNGTLLEAAFRQHPDFDGFAKDFSDAGFKFLMKGIAADMP

Query:  HLQLDLGNLKKRYAEKWASGPNGTPGPQSLVDKYVRELDSDYSDVEEEDAPSQEPTEVGTTQEEAPSQQGGSQEVNLLGSQGELSSHLGSS
        HL++DLG+LKKRYAEKWASGPNGT GP SLVDKYVR+LDSDYSD++E++ PSQEPTEVGTTQE  PSQQ GSQEVNLLGSQGELSSHLGSS
Subjt:  HLQLDLGNLKKRYAEKWASGPNGTPGPQSLVDKYVRELDSDYSDVEEEDAPSQEPTEVGTTQEEAPSQQGGSQEVNLLGSQGELSSHLGSS

A0A6J1D1N9 uncharacterized protein LOC1110161937.0e-12682.93Show/hide
Query:  MRFRVEPSSSEVKNQVSRISAACLDRCLRRASKFVSDPGSVLQRTIDHAIEAFTASIHSAIMIKAELDGREALAAKERANSSTALEAATTLKGKLLKARG
        MRFR+E SSS VK+QVSRISA CLDRCLRRAS+FVSDPGSVLQRTID+A EAF ASIHSA+M+KAELDGREAL AKER N ST LEAATTLKG+LLKA+G
Subjt:  MRFRVEPSSSEVKNQVSRISAACLDRCLRRASKFVSDPGSVLQRTIDHAIEAFTASIHSAIMIKAELDGREALAAKERANSSTALEAATTLKGKLLKARG

Query:  EVDVLRAEVEANAELLKRGDEKHKAHLRAAHAITKGLEKEKFQLLKEKDDMLQAFEGNDATIGRLTAELKAEKERLSNGTLLEAAFRQHPDFDGFAKDFS
        EVD+LRAEV+A  +LLK+  EKHKAHLRAAHAITKGLEKEKFQLLKEKDD+ Q  E  DA+IGRLT ELK  KERL++G LLE +FRQHP+FDGFAKDFS
Subjt:  EVDVLRAEVEANAELLKRGDEKHKAHLRAAHAITKGLEKEKFQLLKEKDDMLQAFEGNDATIGRLTAELKAEKERLSNGTLLEAAFRQHPDFDGFAKDFS

Query:  DAGFKFLMKGIAADMPHLQLDLGNLKKRYAEKWASGPNGTPGPQSLVDKYVRELDSDYSDVEEEDAPSQEPTEVGTTQEEAPSQQGG
        DAGFKFLMKGIAADMPHLQ+DL +LKKRY+E WASGPNGTPGPQSLVDKYVRELDSDYSD+EEEDAPSQEPT+VGTTQEEAPSQ GG
Subjt:  DAGFKFLMKGIAADMPHLQLDLGNLKKRYAEKWASGPNGTPGPQSLVDKYVRELDSDYSDVEEEDAPSQEPTEVGTTQEEAPSQQGG

A0A6J1DF31 uncharacterized protein LOC1110199095.3e-14285.08Show/hide
Query:  MGGTFDVKMRFRVEPSSSEVKNQVSRISAACLDRCLRRASKFVSDPGSVLQRTIDHAIEAFTASIHSAIMIKAELDGREALAAKERANSSTALEAATTLK
        MGGTFDV+ RFR+EPSSS VK+QVSRISA CLDRCL+RASKFVSDPGSVLQRTID+A EAF ASIHSAIM+KAELDGREALAAKER NSS ALEAATTLK
Subjt:  MGGTFDVKMRFRVEPSSSEVKNQVSRISAACLDRCLRRASKFVSDPGSVLQRTIDHAIEAFTASIHSAIMIKAELDGREALAAKERANSSTALEAATTLK

Query:  GKLLKARGEVDVLRAEVEANAELLKRGDEKHKAHLRAAHAITKGLEKEKFQLLKEKDDMLQAFEGNDATIGRLTAELKAEKERLSNGTLLEAAFRQHPDF
        G+LLKA+GEV +LRAEV+A AELLK+  EKHKAHLRAAHAITKGLEKEKFQLLKEKDD+ Q  EG D +IGRLTAELK  KERL+NG+LLE +FRQH DF
Subjt:  GKLLKARGEVDVLRAEVEANAELLKRGDEKHKAHLRAAHAITKGLEKEKFQLLKEKDDMLQAFEGNDATIGRLTAELKAEKERLSNGTLLEAAFRQHPDF

Query:  DGFAKDFSDAGFKFLMKGIAADMPHLQLDLGNLKKRYAEKWASGPNGTPGPQSLVDKYVRELDSDYSDVEEEDAPSQEPTEVGTTQEEAPSQQGGSQEVN
        DGFAKDFSDAGFKFLMKGIAADMPHLQ+DL NLKK+Y+EKWASGPNGTPGPQSLV KYVRELDSDYSD+EEEDAPSQEP E+GTTQEE PSQQ GSQEVN
Subjt:  DGFAKDFSDAGFKFLMKGIAADMPHLQLDLGNLKKRYAEKWASGPNGTPGPQSLVDKYVRELDSDYSDVEEEDAPSQEPTEVGTTQEEAPSQQGGSQEVN

Query:  LLGSQGELSSHLGSS
        LLGS+GELSSHLGSS
Subjt:  LLGSQGELSSHLGSS

A0A6J1DXS5 uncharacterized protein LOC1110255023.4e-13372.43Show/hide
Query:  LARRLESELEEIENF---------------------------------RGFKIPNNILLRIPEEGERADNPPEGWVTLYLKIFEYGLRLPFHPFAQEFLN
        LARRLES+LEEIEN                                  RGF IP NILLR+PEEGERADNPPEGWVTLY K+FEYGLRLP HPF QEFL 
Subjt:  LARRLESELEEIENF---------------------------------RGFKIPNNILLRIPEEGERADNPPEGWVTLYLKIFEYGLRLPFHPFAQEFLN

Query:  RTGLAPAQVAPNGWGVIFALAILFWLRARDEDEAELLSVHQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTFIKGWVGKWFFASGEWLAKDESGRPL
        RTGLAPAQVAPNGWGVIFALAILFWLRARD +EAEL  V QLL CFEAKRIAKKPGR+YMCARKGAGGIVKGPT IKGWV KWF+ASGEWLAKDESGR  
Subjt:  RTGLAPAQVAPNGWGVIFALAILFWLRARDEDEAELLSVHQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTFIKGWVGKWFFASGEWLAKDESGRPL

Query:  FDVPARFGNLVSIKPIPELTQATFDTLKYYKDNFPRGRKIGTLVTDTLMLESGLLDYNPLVRPSEASRPNSELAMVCGFTSNVKRKSKGSAHALKTVQSS
        FDVP RFGNLVSI+P+PELTQA+FDTLKYYK+ FPRGRK+GTLVTD L+LESGLLDYNP VRP E+SRPNSELAMVCGF S VKRKSKG AHAL+  QSS
Subjt:  FDVPARFGNLVSIKPIPELTQATFDTLKYYKDNFPRGRKIGTLVTDTLMLESGLLDYNPLVRPSEASRPNSELAMVCGFTSNVKRKSKGSAHALKTVQSS

Query:  DPATPAVDQHAAQDQAGPSSEVPTPVIELDSTGSSPGRSAR
         PATPAV         GP+SE P  VIEL+S+G  P R  R
Subjt:  DPATPAVDQHAAQDQAGPSSEVPTPVIELDSTGSSPGRSAR

A0A6J1DZB3 uncharacterized protein LOC1110256655.2e-21475.47Show/hide
Query:  MCARKGAGGIVKGPTFIKGWVGKWFFASGEWLAKDESGRPLFDVPARFGNLVSIKPIPELTQATFDTLKYYKDNFPRGRKIGTLVTDTLMLESGLLDYNP
        MCARKG GGIVKGPT IKGWVGKWFFASGEWLAKDESGR  FDVP RFGNLVSIK IPEL QATFDTLK+YKD+FPR RKI TLVTD L+LESGLLDYNP
Subjt:  MCARKGAGGIVKGPTFIKGWVGKWFFASGEWLAKDESGRPLFDVPARFGNLVSIKPIPELTQATFDTLKYYKDNFPRGRKIGTLVTDTLMLESGLLDYNP

Query:  LVRPSEASRPNSELAMVCGFTSNVKRKSKGSAHALKTVQSSDPATPAVDQHAAQDQAGPSSEVPTPVIELDSTGS----------------SPGRSARGA
        LVR  EASRPNSELAMVCGFT +VKRKSKG AHALKTV  ++P TP V +  AQ  +GPSS VPTPVIELD +G                 SP    RG 
Subjt:  LVRPSEASRPNSELAMVCGFTSNVKRKSKGSAHALKTVQSSDPATPAVDQHAAQDQAGPSSEVPTPVIELDSTGS----------------SPGRSARGA

Query:  SP------KHWTCHRF---------VSHADLVDDPKARMGGTFDVKMRFRVEPSSSEVKNQVSRISAACLDRCLRRASKFVSDPGSVLQRTIDHAIEAFT
        SP      K  T              SHADLVDDP+ARM GT +V+MRF +EPSSS VK+QVSRISA CLDR LRRASKFVSDPGSVLQRTID+  EAF 
Subjt:  SP------KHWTCHRF---------VSHADLVDDPKARMGGTFDVKMRFRVEPSSSEVKNQVSRISAACLDRCLRRASKFVSDPGSVLQRTIDHAIEAFT

Query:  ASIHSAIMIKAELDGREALAAKERANSSTALEAATTLKGKLLKARGEVDVLRAEVEANAELLKRGDEKHKAHLRAAHAITKGLEKEKFQLLKEKDDMLQA
        ASIH A+M+KAELDGREALAAKER NS  ALEAATTLKG+LLKA+GEVD+LRAEV+A  +LLK+  EKHKAHLRAAHAITKGLEKEKFQLLKEKDD+ Q 
Subjt:  ASIHSAIMIKAELDGREALAAKERANSSTALEAATTLKGKLLKARGEVDVLRAEVEANAELLKRGDEKHKAHLRAAHAITKGLEKEKFQLLKEKDDMLQA

Query:  FEGNDATIGRLTAELKAEKERLSNGTLLEAAFRQHPDFDGFAKDFSDAGFKFLMKGIAADMPHLQLDLGNLKKRYAEKWASGPNGTPGPQSLVDKYVREL
         E  DA+IGRLT ELK  KERL+NGTLLE +FRQHPDFDGFAKDFSDAGFKFLMKGIAADMPHLQ+DL  LKK+Y+EKWASGPNGTP PQSLVDKYVREL
Subjt:  FEGNDATIGRLTAELKAEKERLSNGTLLEAAFRQHPDFDGFAKDFSDAGFKFLMKGIAADMPHLQLDLGNLKKRYAEKWASGPNGTPGPQSLVDKYVREL

Query:  DSDYSDVEEEDAPSQEPTEVGTTQEEAPSQQGGS
        DSDYSD+EEEDAPSQEP EVGTTQEE PSQQGGS
Subjt:  DSDYSDVEEEDAPSQEPTEVGTTQEEAPSQQGGS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G32010.1 myosin heavy chain-related3.8e-0728.15Show/hide
Query:  IPNNILLRIPEEGERADNPPEGWVTLYLKIF-EYGLRLPFHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDEDEAELLSVHQLLGCFEAKRI
        +P  + +RIP + +R  + PEG++ L+   F E GLR P   F   F     +A +Q+       I   A L  L AR       LSV  +       ++
Subjt:  IPNNILLRIPEEGERADNPPEGWVTLYLKIF-EYGLRLPFHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDEDEAELLSVHQLLGCFEAKRI

Query:  AKKPGRYYMCARKGAGGIVKGPTFIKGWVGKWFFA
          K G++Y+ + +G   +  GP+  + W+G +F+A
Subjt:  AKKPGRYYMCARKGAGGIVKGPTFIKGWVGKWFFA

AT2G15420.1 myosin heavy chain-related1.0e-0429.85Show/hide
Query:  PNNILLRIPEEGERADNPPEGWVTLYLKIF-EYGLRLPFHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDEDEAELLSVHQLLGCFEAKRIA
        P  I L  P+  +R   PPEG++ LY   F   GL  P   F  E+  R  +A +Q+          LAIL       E + +              R+ 
Subjt:  PNNILLRIPEEGERADNPPEGWVTLYLKIF-EYGLRLPFHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDEDEAELLSVHQLLGCFEAKRIA

Query:  KKPGRYYMCARKGAGGIVKGP-TFIKGWVGKWFF
        + PG YY  A K    IV G  + I GW  ++FF
Subjt:  KKPGRYYMCARKGAGGIVKGP-TFIKGWVGKWFF

AT5G38190.1 INVOLVED IN: biological_process unknown1.6e-0524.88Show/hide
Query:  FTFRIMVVFLSSPSSSDSIGSSGRTISSSPPKPSDSGEVLARRLE--------SELEEIENFRGF-KIPNNILLRIPEEGERADNPPEGWVTLYLKIF-E
        F+ R+  V  S    +DS G+  R          ++    +R+ E        S  E +     F  +P  + +RIP + +R  + PEG++ L+   F E
Subjt:  FTFRIMVVFLSSPSSSDSIGSSGRTISSSPPKPSDSGEVLARRLE--------SELEEIENFRGF-KIPNNILLRIPEEGERADNPPEGWVTLYLKIF-E

Query:  YGLRLPFHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDEDEAELLSVHQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTFIKGWVGKWF
         GLR P   F   F     +A +Q+       I   A L  L AR       LSV  +       ++  K G++Y+ + +G   +   P+  + W+G +F
Subjt:  YGLRLPFHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDEDEAELLSVHQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTFIKGWVGKWF

Query:  FASGEW-LAKDESGRPL
        +A  +  L +D S   L
Subjt:  FASGEW-LAKDESGRPL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTGTTGCTGGAATTTGCACAACGGTTCTTCACGAATCGAGCTCGAACCCGGTCTCCGGTTCCGACCTGAACACTAGGGTGGACCTGCACAAGAGGGTGATG
GATCCGACAGTAGACACGACCGGCGGTTATGTGTCTTTTCTCATATCAGGTCTGTCGGGTTCCGAGCAGGTCGGACCCCAGTCAGTCATTTTCACTTTTACTTTT
CGAATTATGGTGGTGTTCCTGTCTTCCCCCTCCAGTAGTGATAGCATAGGTAGTTCGGGTCGGACCATAAGTAGTTCGCCCCCCAAACCAAGTGATTCTGGGGAG
GTCTTAGCTCGTAGGTTAGAGTCCGAGCTTGAAGAAATAGAGAACTTTAGGGGGTTTAAAATTCCAAACAACATCCTCCTTAGGATCCCGGAGGAAGGGGAAAGA
GCTGACAATCCTCCAGAGGGATGGGTCACTCTTTACTTGAAGATATTTGAATACGGCCTCAGACTTCCCTTTCATCCCTTTGCTCAGGAGTTCTTAAACCGAACT
GGACTGGCTCCTGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTTGCTTTAGCCATTCTTTTTTGGTTGCGAGCTCGGGACGAGGATGAGGCCGAGCTGCTA
AGCGTTCACCAACTTCTTGGGTGTTTTGAGGCCAAGAGAATAGCCAAAAAACCAGGTCGGTACTATATGTGTGCGAGGAAGGGCGCGGGTGGTATAGTCAAGGGG
CCGACCTTCATAAAAGGATGGGTAGGTAAGTGGTTCTTTGCCTCAGGTGAATGGCTGGCAAAGGACGAGTCAGGTCGTCCCTTGTTTGACGTGCCTGCTAGGTTT
GGGAACCTAGTATCAATTAAGCCGATTCCCGAGCTCACTCAAGCCACCTTCGACACCCTCAAATACTACAAGGACAACTTCCCAAGGGGCCGGAAGATCGGGACC
TTAGTCACAGACACGCTGATGCTAGAATCAGGGCTATTGGACTACAATCCTTTAGTTCGTCCGAGTGAAGCTTCAAGGCCAAACTCCGAGCTCGCCATGGTGTGT
GGATTCACGAGCAACGTGAAACGCAAGTCTAAGGGCAGTGCTCATGCCCTTAAGACAGTTCAGAGCTCTGATCCAGCTACCCCTGCTGTGGATCAGCATGCAGCT
CAGGACCAGGCGGGTCCATCTTCTGAAGTTCCAACTCCAGTGATCGAGCTGGATTCTACTGGGAGCTCTCCAGGGAGAAGCGCTCGAGGAGCGAGTCCGAAGCAC
TGGACGTGTCACCGCTTCGTGAGCCACGCCGACCTGGTAGACGACCCTAAAGCTCGGATGGGGGGGACATTTGACGTGAAGATGCGGTTCAGAGTGGAACCGTCG
AGCTCCGAGGTGAAGAACCAAGTGTCACGCATCTCGGCTGCGTGCTTGGATCGCTGTCTAAGGAGAGCGTCCAAGTTTGTAAGTGACCCGGGGTCCGTGCTGCAA
CGGACCATCGACCACGCTATTGAGGCGTTCACTGCCTCCATCCACTCAGCAATCATGATCAAGGCCGAGCTGGATGGAAGGGAGGCCTTGGCAGCGAAAGAGAGG
GCGAACTCCTCTACTGCCTTGGAGGCTGCCACCACGCTCAAGGGCAAGCTGCTGAAGGCTCGGGGCGAGGTGGACGTACTGAGGGCCGAGGTAGAAGCCAATGCC
GAACTGCTGAAAAGGGGAGATGAAAAGCATAAGGCCCACCTTCGAGCTGCCCACGCCATCACCAAAGGGCTGGAGAAGGAAAAGTTCCAACTCCTTAAGGAGAAG
GACGACATGCTCCAGGCCTTCGAAGGGAATGACGCTACGATCGGGCGTCTCACTGCCGAGCTGAAGGCGGAGAAGGAGCGTCTTTCCAATGGAACTCTTCTGGAG
GCAGCCTTCAGGCAACACCCAGATTTTGATGGGTTTGCCAAGGACTTCAGCGATGCAGGCTTCAAGTTTCTGATGAAGGGCATTGCTGCCGATATGCCGCACCTC
CAGCTCGACCTCGGCAATCTGAAGAAGAGGTATGCTGAGAAATGGGCTTCTGGGCCTAACGGCACTCCTGGTCCCCAATCCCTGGTGGACAAGTACGTCAGGGAG
CTTGACTCTGACTACTCTGATGTGGAGGAAGAGGATGCCCCAAGCCAAGAGCCTACCGAGGTCGGCACAACTCAAGAGGAGGCTCCATCACAGCAGGGTGGATCC
CAGGAGGTCAACCTTCTAGGCTCCCAGGGCGAACTGTCCTCCCACCTCGGAAGTAGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCTGTTGCTGGAATTTGCACAACGGTTCTTCACGAATCGAGCTCGAACCCGGTCTCCGGTTCCGACCTGAACACTAGGGTGGACCTGCACAAGAGGGTGATG
GATCCGACAGTAGACACGACCGGCGGTTATGTGTCTTTTCTCATATCAGGTCTGTCGGGTTCCGAGCAGGTCGGACCCCAGTCAGTCATTTTCACTTTTACTTTT
CGAATTATGGTGGTGTTCCTGTCTTCCCCCTCCAGTAGTGATAGCATAGGTAGTTCGGGTCGGACCATAAGTAGTTCGCCCCCCAAACCAAGTGATTCTGGGGAG
GTCTTAGCTCGTAGGTTAGAGTCCGAGCTTGAAGAAATAGAGAACTTTAGGGGGTTTAAAATTCCAAACAACATCCTCCTTAGGATCCCGGAGGAAGGGGAAAGA
GCTGACAATCCTCCAGAGGGATGGGTCACTCTTTACTTGAAGATATTTGAATACGGCCTCAGACTTCCCTTTCATCCCTTTGCTCAGGAGTTCTTAAACCGAACT
GGACTGGCTCCTGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTTGCTTTAGCCATTCTTTTTTGGTTGCGAGCTCGGGACGAGGATGAGGCCGAGCTGCTA
AGCGTTCACCAACTTCTTGGGTGTTTTGAGGCCAAGAGAATAGCCAAAAAACCAGGTCGGTACTATATGTGTGCGAGGAAGGGCGCGGGTGGTATAGTCAAGGGG
CCGACCTTCATAAAAGGATGGGTAGGTAAGTGGTTCTTTGCCTCAGGTGAATGGCTGGCAAAGGACGAGTCAGGTCGTCCCTTGTTTGACGTGCCTGCTAGGTTT
GGGAACCTAGTATCAATTAAGCCGATTCCCGAGCTCACTCAAGCCACCTTCGACACCCTCAAATACTACAAGGACAACTTCCCAAGGGGCCGGAAGATCGGGACC
TTAGTCACAGACACGCTGATGCTAGAATCAGGGCTATTGGACTACAATCCTTTAGTTCGTCCGAGTGAAGCTTCAAGGCCAAACTCCGAGCTCGCCATGGTGTGT
GGATTCACGAGCAACGTGAAACGCAAGTCTAAGGGCAGTGCTCATGCCCTTAAGACAGTTCAGAGCTCTGATCCAGCTACCCCTGCTGTGGATCAGCATGCAGCT
CAGGACCAGGCGGGTCCATCTTCTGAAGTTCCAACTCCAGTGATCGAGCTGGATTCTACTGGGAGCTCTCCAGGGAGAAGCGCTCGAGGAGCGAGTCCGAAGCAC
TGGACGTGTCACCGCTTCGTGAGCCACGCCGACCTGGTAGACGACCCTAAAGCTCGGATGGGGGGGACATTTGACGTGAAGATGCGGTTCAGAGTGGAACCGTCG
AGCTCCGAGGTGAAGAACCAAGTGTCACGCATCTCGGCTGCGTGCTTGGATCGCTGTCTAAGGAGAGCGTCCAAGTTTGTAAGTGACCCGGGGTCCGTGCTGCAA
CGGACCATCGACCACGCTATTGAGGCGTTCACTGCCTCCATCCACTCAGCAATCATGATCAAGGCCGAGCTGGATGGAAGGGAGGCCTTGGCAGCGAAAGAGAGG
GCGAACTCCTCTACTGCCTTGGAGGCTGCCACCACGCTCAAGGGCAAGCTGCTGAAGGCTCGGGGCGAGGTGGACGTACTGAGGGCCGAGGTAGAAGCCAATGCC
GAACTGCTGAAAAGGGGAGATGAAAAGCATAAGGCCCACCTTCGAGCTGCCCACGCCATCACCAAAGGGCTGGAGAAGGAAAAGTTCCAACTCCTTAAGGAGAAG
GACGACATGCTCCAGGCCTTCGAAGGGAATGACGCTACGATCGGGCGTCTCACTGCCGAGCTGAAGGCGGAGAAGGAGCGTCTTTCCAATGGAACTCTTCTGGAG
GCAGCCTTCAGGCAACACCCAGATTTTGATGGGTTTGCCAAGGACTTCAGCGATGCAGGCTTCAAGTTTCTGATGAAGGGCATTGCTGCCGATATGCCGCACCTC
CAGCTCGACCTCGGCAATCTGAAGAAGAGGTATGCTGAGAAATGGGCTTCTGGGCCTAACGGCACTCCTGGTCCCCAATCCCTGGTGGACAAGTACGTCAGGGAG
CTTGACTCTGACTACTCTGATGTGGAGGAAGAGGATGCCCCAAGCCAAGAGCCTACCGAGGTCGGCACAACTCAAGAGGAGGCTCCATCACAGCAGGGTGGATCC
CAGGAGGTCAACCTTCTAGGCTCCCAGGGCGAACTGTCCTCCCACCTCGGAAGTAGCTGA
Protein sequenceShow/hide protein sequence
MSVAGICTTVLHESSSNPVSGSDLNTRVDLHKRVMDPTVDTTGGYVSFLISGLSGSEQVGPQSVIFTFTFRIMVVFLSSPSSSDSIGSSGRTISSSPPKPSDSGE
VLARRLESELEEIENFRGFKIPNNILLRIPEEGERADNPPEGWVTLYLKIFEYGLRLPFHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDEDEAELL
SVHQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTFIKGWVGKWFFASGEWLAKDESGRPLFDVPARFGNLVSIKPIPELTQATFDTLKYYKDNFPRGRKIGT
LVTDTLMLESGLLDYNPLVRPSEASRPNSELAMVCGFTSNVKRKSKGSAHALKTVQSSDPATPAVDQHAAQDQAGPSSEVPTPVIELDSTGSSPGRSARGASPKH
WTCHRFVSHADLVDDPKARMGGTFDVKMRFRVEPSSSEVKNQVSRISAACLDRCLRRASKFVSDPGSVLQRTIDHAIEAFTASIHSAIMIKAELDGREALAAKER
ANSSTALEAATTLKGKLLKARGEVDVLRAEVEANAELLKRGDEKHKAHLRAAHAITKGLEKEKFQLLKEKDDMLQAFEGNDATIGRLTAELKAEKERLSNGTLLE
AAFRQHPDFDGFAKDFSDAGFKFLMKGIAADMPHLQLDLGNLKKRYAEKWASGPNGTPGPQSLVDKYVRELDSDYSDVEEEDAPSQEPTEVGTTQEEAPSQQGGS
QEVNLLGSQGELSSHLGSS