; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10023359 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10023359
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionCytochrome c-type biogenesis protein
Genome locationChr05:33338742..33350640
RNA-Seq ExpressionHG10023359
SyntenyHG10023359
Gene Ontology termsGO:0009793 - embryo development ending in seed dormancy (biological process)
GO:0005743 - mitochondrial inner membrane (cellular component)
GO:0009579 - thylakoid (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0032991 - protein-containing complex (cellular component)
GO:0016491 - oxidoreductase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR005616 - CcmH/CycL/Ccl2/NrfF family
IPR025110 - AMP-binding enzyme, C-terminal domain
IPR025564 - Cyanobacterial aminoacyl-tRNA synthetase, CAAD domain
IPR038297 - CcmH/CycL/Ccl2/NrfF domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0059311.1 protein CURVATURE THYLAKOID 1C [Cucumis melo var. makuwa]2.7e-6591.67Show/hide
Query:  MASIVATLPPPLLAPRKSFTLLNTSQKLTVFPIASGRSGNVVVKAVGGSSESSTSLDIIKSVRNVWDQPEDRLALFGLGFAGVATAWTATNLVTAIDKLP
        MASIVATLPPPLLAPRKSFT+LN SQKL+V   A GR GNVVVKAVGGSSESSTSLDIIKSVRNVWDQPEDR ALFGLGFA VATAWTATNLVTAIDKLP
Subjt:  MASIVATLPPPLLAPRKSFTLLNTSQKLTVFPIASGRSGNVVVKAVGGSSESSTSLDIIKSVRNVWDQPEDRLALFGLGFAGVATAWTATNLVTAIDKLP

Query:  LLPGALEFIGALVSWWFVYRYLLFKPNREELLQIINKSIVDVFG
        LLPG LEFIGALVSWWFVYRYLLFKPNREELLQIINKSI+DVFG
Subjt:  LLPGALEFIGALVSWWFVYRYLLFKPNREELLQIINKSIVDVFG

KAG6593469.1 Malonate--CoA ligase, partial [Cucurbita argyrosperma subsp. sororia]2.5e-8737.81Show/hide
Query:  MASIVATLPPPLLAPRKSFTLLNTSQ-KLTVFPIASGRSGNVVVKAVGGSSESSTSLDIIKSVRNVWDQPEDRLALFGLGFAGVATAWTATNLVTAIDKL
        MASI ATLPPPLLAP KSFT L T Q KLTVFPIA GRS N +VKA+G SSESSTS+DIIKSVRNVWDQPEDRL LFGLGFA V  AWTATNLVTA+DKL
Subjt:  MASIVATLPPPLLAPRKSFTLLNTSQ-KLTVFPIASGRSGNVVVKAVGGSSESSTSLDIIKSVRNVWDQPEDRLALFGLGFAGVATAWTATNLVTAIDKL

Query:  PLLPGALEFIGALVSWWFVYRYLLFKPNREELLQIINKSIVDVFGHC--------SAVWSRSMVSK-MESDENAVKK----TQLVEARARNISHNVRCIE
        PLLPG LEFIG LVSWWFVYRYLLFKPNREELLQIINKS+ DV G          S ++S S  S  ME  ++A        + V  RA   SH+   + 
Subjt:  PLLPGALEFIGALVSWWFVYRYLLFKPNREELLQIINKSIVDVFGHC--------SAVWSRSMVSK-MESDENAVKK----TQLVEARARNISHNVRCIE

Query:  CGSQSIED-----------------------------------------------------------SQADIAILLRKLIRDEIRSGKTDKEIYK-----
          +  I D                                                           + ++++++L      EI      K   K     
Subjt:  CGSQSIED-----------------------------------------------------------SQADIAILLRKLIRDEIRSGKTDKEIYK-----

Query:  KLENDYGETILYAPKFDLQTAA----------IWLSPVLV------AGAAAGV-----------------WAYSKYKQKSNIHIMALNLVRGVPLTPK--
         + + Y ET       D++  A          +   P LV       G   GV                 W Y+   Q   +H + L+ +     T +  
Subjt:  KLENDYGETILYAPKFDLQTAA----------IWLSPVLV------AGAAAGV-----------------WAYSKYKQKSNIHIMALNLVRGVPLTPK--

Query:  EKQTMDGM---GRTSTLENSLQ--PACNCKQG-PDMYPAAVNPYTSYGENTQSDQVEFSENLKLPHSMSIAFEKQI---WKNTASFAI---FKLSRIARA
         K ++ G+    R S   N  +   A     G P MY   +  Y +   + Q      +  L+L    S A    I   WK      +   + ++    A
Subjt:  EKQTMDGM---GRTSTLENSLQ--PACNCKQG-PDMYPAAVNPYTSYGENTQSDQVEFSENLKLPHSMSIAFEKQI---WKNTASFAI---FKLSRIARA

Query:  SSE--------------------------------------------------SIPEMNRQFLVNG----------------------LSTDIMEVGGYK
         S                                                    +P++ ++  ++G                      +S DIM+VGGYK
Subjt:  SSE--------------------------------------------------SIPEMNRQFLVNG----------------------LSTDIMEVGGYK

Query:  LSALEIESVILQHLSVIECCVLGLLDKDYGERVCAIIVLQPNTKMMTPDESKPTMSSHEIPTWAKDKLAPYKSPTLLLSKDSPPRNAMGKVGCIFNRHCF
        LSALEIESVILQH SVIECCVLGL DKDYGERVCAIIVL PN+K   PD+SKP MS  ++ TWAKDKLAPYK PT+LL KDS PRNAMGK          
Subjt:  LSALEIESVILQHLSVIECCVLGLLDKDYGERVCAIIVLQPNTKMMTPDESKPTMSSHEIPTWAKDKLAPYKSPTLLLSKDSPPRNAMGKVGCIFNRHCF

Query:  PFFHSLELQVNKKELMKKLATE
                 VNKKEL KKLA E
Subjt:  PFFHSLELQVNKKELMKKLATE

XP_004141763.1 protein CURVATURE THYLAKOID 1C, chloroplastic [Cucumis sativus]1.1e-6692.36Show/hide
Query:  MASIVATLPPPLLAPRKSFTLLNTSQKLTVFPIASGRSGNVVVKAVGGSSESSTSLDIIKSVRNVWDQPEDRLALFGLGFAGVATAWTATNLVTAIDKLP
        MASIVATLPPPLLAPRKSFT+LN SQKL+VF  A+GRSGNVVVKAVGGSSESSTSLDI+KSVRNVWDQPEDRLALFGLGFA VATAWTATN+VTAIDKLP
Subjt:  MASIVATLPPPLLAPRKSFTLLNTSQKLTVFPIASGRSGNVVVKAVGGSSESSTSLDIIKSVRNVWDQPEDRLALFGLGFAGVATAWTATNLVTAIDKLP

Query:  LLPGALEFIGALVSWWFVYRYLLFKPNREELLQIINKSIVDVFG
        LLPG LEFIGALVSWWFVYRYLLFKPNREELLQIINKSI+DVFG
Subjt:  LLPGALEFIGALVSWWFVYRYLLFKPNREELLQIINKSIVDVFG

XP_008462170.1 PREDICTED: protein CURVATURE THYLAKOID 1C, chloroplastic [Cucumis melo]1.3e-6490.97Show/hide
Query:  MASIVATLPPPLLAPRKSFTLLNTSQKLTVFPIASGRSGNVVVKAVGGSSESSTSLDIIKSVRNVWDQPEDRLALFGLGFAGVATAWTATNLVTAIDKLP
        MASIVATLPPPLLAPRKSFT+LN SQKL+V   A GR GNVVVKAVGGSSESSTSLDIIKSVRNVWDQPEDR ALFGLGFA VATAWTATNLV AIDKLP
Subjt:  MASIVATLPPPLLAPRKSFTLLNTSQKLTVFPIASGRSGNVVVKAVGGSSESSTSLDIIKSVRNVWDQPEDRLALFGLGFAGVATAWTATNLVTAIDKLP

Query:  LLPGALEFIGALVSWWFVYRYLLFKPNREELLQIINKSIVDVFG
        LLPG LEFIGALVSWWFVYRYLLFKPNREELLQIINKSI+DVFG
Subjt:  LLPGALEFIGALVSWWFVYRYLLFKPNREELLQIINKSIVDVFG

XP_038897597.1 protein CURVATURE THYLAKOID 1C, chloroplastic [Benincasa hispida]2.0e-6895.14Show/hide
Query:  MASIVATLPPPLLAPRKSFTLLNTSQKLTVFPIASGRSGNVVVKAVGGSSESSTSLDIIKSVRNVWDQPEDRLALFGLGFAGVATAWTATNLVTAIDKLP
        MASIVATLPPPLLAPRKSFTLLNTSQKLT FPIA+ RSGNVVVKAVGGSSESSTSLDIIKSVRNVWDQPEDRLALFGLGFA VAT WTATNLVTAIDKLP
Subjt:  MASIVATLPPPLLAPRKSFTLLNTSQKLTVFPIASGRSGNVVVKAVGGSSESSTSLDIIKSVRNVWDQPEDRLALFGLGFAGVATAWTATNLVTAIDKLP

Query:  LLPGALEFIGALVSWWFVYRYLLFKPNREELLQIINKSIVDVFG
        LLPG LEFIGALVSWWFVYRYLLFKPNREELLQIINKSI+DVFG
Subjt:  LLPGALEFIGALVSWWFVYRYLLFKPNREELLQIINKSIVDVFG

TrEMBL top hitse value%identityAlignment
A0A0A0KC66 CAAD domain-containing protein5.3e-6792.36Show/hide
Query:  MASIVATLPPPLLAPRKSFTLLNTSQKLTVFPIASGRSGNVVVKAVGGSSESSTSLDIIKSVRNVWDQPEDRLALFGLGFAGVATAWTATNLVTAIDKLP
        MASIVATLPPPLLAPRKSFT+LN SQKL+VF  A+GRSGNVVVKAVGGSSESSTSLDI+KSVRNVWDQPEDRLALFGLGFA VATAWTATN+VTAIDKLP
Subjt:  MASIVATLPPPLLAPRKSFTLLNTSQKLTVFPIASGRSGNVVVKAVGGSSESSTSLDIIKSVRNVWDQPEDRLALFGLGFAGVATAWTATNLVTAIDKLP

Query:  LLPGALEFIGALVSWWFVYRYLLFKPNREELLQIINKSIVDVFG
        LLPG LEFIGALVSWWFVYRYLLFKPNREELLQIINKSI+DVFG
Subjt:  LLPGALEFIGALVSWWFVYRYLLFKPNREELLQIINKSIVDVFG

A0A1S3CHU0 protein CURVATURE THYLAKOID 1C, chloroplastic6.5e-6590.97Show/hide
Query:  MASIVATLPPPLLAPRKSFTLLNTSQKLTVFPIASGRSGNVVVKAVGGSSESSTSLDIIKSVRNVWDQPEDRLALFGLGFAGVATAWTATNLVTAIDKLP
        MASIVATLPPPLLAPRKSFT+LN SQKL+V   A GR GNVVVKAVGGSSESSTSLDIIKSVRNVWDQPEDR ALFGLGFA VATAWTATNLV AIDKLP
Subjt:  MASIVATLPPPLLAPRKSFTLLNTSQKLTVFPIASGRSGNVVVKAVGGSSESSTSLDIIKSVRNVWDQPEDRLALFGLGFAGVATAWTATNLVTAIDKLP

Query:  LLPGALEFIGALVSWWFVYRYLLFKPNREELLQIINKSIVDVFG
        LLPG LEFIGALVSWWFVYRYLLFKPNREELLQIINKSI+DVFG
Subjt:  LLPGALEFIGALVSWWFVYRYLLFKPNREELLQIINKSIVDVFG

A0A5D3BWY3 Protein CURVATURE THYLAKOID 1C1.3e-6591.67Show/hide
Query:  MASIVATLPPPLLAPRKSFTLLNTSQKLTVFPIASGRSGNVVVKAVGGSSESSTSLDIIKSVRNVWDQPEDRLALFGLGFAGVATAWTATNLVTAIDKLP
        MASIVATLPPPLLAPRKSFT+LN SQKL+V   A GR GNVVVKAVGGSSESSTSLDIIKSVRNVWDQPEDR ALFGLGFA VATAWTATNLVTAIDKLP
Subjt:  MASIVATLPPPLLAPRKSFTLLNTSQKLTVFPIASGRSGNVVVKAVGGSSESSTSLDIIKSVRNVWDQPEDRLALFGLGFAGVATAWTATNLVTAIDKLP

Query:  LLPGALEFIGALVSWWFVYRYLLFKPNREELLQIINKSIVDVFG
        LLPG LEFIGALVSWWFVYRYLLFKPNREELLQIINKSI+DVFG
Subjt:  LLPGALEFIGALVSWWFVYRYLLFKPNREELLQIINKSIVDVFG

A0A6J1GPV4 Cytochrome c-type biogenesis protein7.2e-6493.33Show/hide
Query:  MESDENAVKKTQLVEARARNISHNVRCIECGSQSIEDSQADIAILLRKLIRDEIRSGKTDKEIYKKLENDYGETILYAPKFDLQTAAIWLSPVLVAGAAA
        MESD +AVKKTQLVEARARNISHNVRCIECGSQSIEDSQADIAILLRKLIRDEIRSGK+DKEI+KKLENDYGETILY PKFDLQTAA+WLSPVLVAGAAA
Subjt:  MESDENAVKKTQLVEARARNISHNVRCIECGSQSIEDSQADIAILLRKLIRDEIRSGKTDKEIYKKLENDYGETILYAPKFDLQTAAIWLSPVLVAGAAA

Query:  GVWAYSKYKQKSNIHIMALNLVRGVPLTPKEKQTM
        GVWAYSKYKQKSN+HIMALN+VRGVPLTP+EKQTM
Subjt:  GVWAYSKYKQKSNIHIMALNLVRGVPLTPKEKQTM

A0A6J1JTI4 Cytochrome c-type biogenesis protein7.2e-6493.33Show/hide
Query:  MESDENAVKKTQLVEARARNISHNVRCIECGSQSIEDSQADIAILLRKLIRDEIRSGKTDKEIYKKLENDYGETILYAPKFDLQTAAIWLSPVLVAGAAA
        MESD +AVKKTQLVEARARNISHNVRCIECGSQSIEDSQADIAILLRKLIRDEIRSGK+DKEI+KKLENDYGETILY PKFDLQTAA+WLSPVLVAGAAA
Subjt:  MESDENAVKKTQLVEARARNISHNVRCIECGSQSIEDSQADIAILLRKLIRDEIRSGKTDKEIYKKLENDYGETILYAPKFDLQTAAIWLSPVLVAGAAA

Query:  GVWAYSKYKQKSNIHIMALNLVRGVPLTPKEKQTM
        GVWAYSKYKQKSN+HIMALN+VRGVPLTP+EKQTM
Subjt:  GVWAYSKYKQKSNIHIMALNLVRGVPLTPKEKQTM

SwissProt top hitse value%identityAlignment
B8AFK5 Cytochrome c-type biogenesis CcmH-like mitochondrial protein2.2e-5781.68Show/hide
Query:  ENAVKKTQLVEARARNISHNVRCIECGSQSIEDSQADIAILLRKLIRDEIRSGKTDKEIYKKLENDYGETILYAPKFDLQTAAIWLSPVLVAGAAAGVWA
        E  VK+ Q++E+RARNISHNVRC ECGSQSIEDSQADIAILLRKLIRDEI+SGK+DKEIYKKL+ DYGETILY PKFDLQTAAIWLSPV+V G AAGVWA
Subjt:  ENAVKKTQLVEARARNISHNVRCIECGSQSIEDSQADIAILLRKLIRDEIRSGKTDKEIYKKLENDYGETILYAPKFDLQTAAIWLSPVLVAGAAAGVWA

Query:  YSKYKQKSNIHIMALNLVRGVPLTPKEKQTM
        Y K++Q++N+HIMALNLVRGVPLTP+EK+TM
Subjt:  YSKYKQKSNIHIMALNLVRGVPLTPKEKQTM

M4IS90 Probable CoA ligase CCL81.1e-2957.14Show/hide
Query:  STDIMEVGGYKLSALEIESVILQHLSVIECCVLGLLDKDYGERVCAIIVLQPNTKMMTPDESKPTMSSHEIPTWAKDKLAPYKSPTLLLSKDSPPRNAMG
        S DIM+VGGYKLSALEIESV+L+H +V ECCVLGL DKDYGE V AIIV     K    +ESKP +S  E+ +WA+ KLAPYK PT L   DS PRNAMG
Subjt:  STDIMEVGGYKLSALEIESVILQHLSVIECCVLGLLDKDYGERVCAIIVLQPNTKMMTPDESKPTMSSHEIPTWAKDKLAPYKSPTLLLSKDSPPRNAMG

Query:  KVGCIFNRHCFPFFHSLELQVNKKELMKKLATE
        K                   VNKKEL KKL  E
Subjt:  KVGCIFNRHCFPFFHSLELQVNKKELMKKLATE

Q6K7S7 Cytochrome c-type biogenesis CcmH-like mitochondrial protein2.2e-5781.68Show/hide
Query:  ENAVKKTQLVEARARNISHNVRCIECGSQSIEDSQADIAILLRKLIRDEIRSGKTDKEIYKKLENDYGETILYAPKFDLQTAAIWLSPVLVAGAAAGVWA
        E  VK+ Q++E+RARNISHNVRC ECGSQSIEDSQADIAILLRKLIRDEI+SGK+DKEIYKKL+ DYGETILY PKFDLQTAAIWLSPV+V G AAGVWA
Subjt:  ENAVKKTQLVEARARNISHNVRCIECGSQSIEDSQADIAILLRKLIRDEIRSGKTDKEIYKKLENDYGETILYAPKFDLQTAAIWLSPVLVAGAAAGVWA

Query:  YSKYKQKSNIHIMALNLVRGVPLTPKEKQTM
        Y K++Q++N+HIMALNLVRGVPLTP+EK+TM
Subjt:  YSKYKQKSNIHIMALNLVRGVPLTPKEKQTM

Q9M812 Protein CURVATURE THYLAKOID 1C, chloroplastic2.2e-3349.03Show/hide
Query:  MASIVATLPPP-LLAPRKS---------FTLLNTSQKLTVFPIASGRSG-NVVVKAVGGSSESSTSLDIIKSVRNVWDQPEDRLALFGLGFAGVATAWTA
        MASI ATLP P LL  RKS         F+L   +  L+   +    S  +++VKA G SS+SST LD++ +++NVWD+ EDRL L GLGFAG+   W +
Subjt:  MASIVATLPPP-LLAPRKS---------FTLLNTSQKLTVFPIASGRSG-NVVVKAVGGSSESSTSLDIIKSVRNVWDQPEDRLALFGLGFAGVATAWTA

Query:  TNLVTAIDKLPLLPGALEFIGALVSWWFVYRYLLFKPNREELLQIINKSIVDVFG
         NL+TAIDKLP++    E +G L S WF YRYLLFKP+R+EL +I+ KS+ D+ G
Subjt:  TNLVTAIDKLPLLPGALEFIGALVSWWFVYRYLLFKPNREELLQIINKSIVDVFG

Q9XI46 Cytochrome c-type biogenesis CcmH-like mitochondrial protein1.8e-4868.89Show/hide
Query:  MESDENAVKKTQLVEARARNISHNVRCIECGSQSIEDSQADIAILLRKLIRDEIRSGKTDKEIYKKLENDYGETILYAPKFDLQTAAIWLSPVLVAGAAA
        ME  +   KK Q+++ARARNISHNVRC ECGSQSIEDSQADIAILLR+LIR+EI +GKTDKEIY KLE+++GET+LYAPKFDLQTAA+WL+PV++AG  A
Subjt:  MESDENAVKKTQLVEARARNISHNVRCIECGSQSIEDSQADIAILLRKLIRDEIRSGKTDKEIYKKLENDYGETILYAPKFDLQTAAIWLSPVLVAGAAA

Query:  GVWAYSKYKQKSNIHIMALNLVRGVPLTPKEKQTM
            Y K++ + N+ IMALNL+RGVPLTPKE+ T+
Subjt:  GVWAYSKYKQKSNIHIMALNLVRGVPLTPKEKQTM

Arabidopsis top hitse value%identityAlignment
AT1G15220.1 cytochrome c biogenesis protein family1.3e-4968.89Show/hide
Query:  MESDENAVKKTQLVEARARNISHNVRCIECGSQSIEDSQADIAILLRKLIRDEIRSGKTDKEIYKKLENDYGETILYAPKFDLQTAAIWLSPVLVAGAAA
        ME  +   KK Q+++ARARNISHNVRC ECGSQSIEDSQADIAILLR+LIR+EI +GKTDKEIY KLE+++GET+LYAPKFDLQTAA+WL+PV++AG  A
Subjt:  MESDENAVKKTQLVEARARNISHNVRCIECGSQSIEDSQADIAILLRKLIRDEIRSGKTDKEIYKKLENDYGETILYAPKFDLQTAAIWLSPVLVAGAAA

Query:  GVWAYSKYKQKSNIHIMALNLVRGVPLTPKEKQTM
            Y K++ + N+ IMALNL+RGVPLTPKE+ T+
Subjt:  GVWAYSKYKQKSNIHIMALNLVRGVPLTPKEKQTM

AT1G15220.2 cytochrome c biogenesis protein family1.3e-4968.89Show/hide
Query:  MESDENAVKKTQLVEARARNISHNVRCIECGSQSIEDSQADIAILLRKLIRDEIRSGKTDKEIYKKLENDYGETILYAPKFDLQTAAIWLSPVLVAGAAA
        ME  +   KK Q+++ARARNISHNVRC ECGSQSIEDSQADIAILLR+LIR+EI +GKTDKEIY KLE+++GET+LYAPKFDLQTAA+WL+PV++AG  A
Subjt:  MESDENAVKKTQLVEARARNISHNVRCIECGSQSIEDSQADIAILLRKLIRDEIRSGKTDKEIYKKLENDYGETILYAPKFDLQTAAIWLSPVLVAGAAA

Query:  GVWAYSKYKQKSNIHIMALNLVRGVPLTPKEKQTM
            Y K++ + N+ IMALNL+RGVPLTPKE+ T+
Subjt:  GVWAYSKYKQKSNIHIMALNLVRGVPLTPKEKQTM

AT1G52220.1 FUNCTIONS IN: molecular_function unknown1.5e-3449.03Show/hide
Query:  MASIVATLPPP-LLAPRKS---------FTLLNTSQKLTVFPIASGRSG-NVVVKAVGGSSESSTSLDIIKSVRNVWDQPEDRLALFGLGFAGVATAWTA
        MASI ATLP P LL  RKS         F+L   +  L+   +    S  +++VKA G SS+SST LD++ +++NVWD+ EDRL L GLGFAG+   W +
Subjt:  MASIVATLPPP-LLAPRKS---------FTLLNTSQKLTVFPIASGRSG-NVVVKAVGGSSESSTSLDIIKSVRNVWDQPEDRLALFGLGFAGVATAWTA

Query:  TNLVTAIDKLPLLPGALEFIGALVSWWFVYRYLLFKPNREELLQIINKSIVDVFG
         NL+TAIDKLP++    E +G L S WF YRYLLFKP+R+EL +I+ KS+ D+ G
Subjt:  TNLVTAIDKLPLLPGALEFIGALVSWWFVYRYLLFKPNREELLQIINKSIVDVFG

AT1G52220.2 FUNCTIONS IN: molecular_function unknown1.1e-3248.39Show/hide
Query:  MASIVATLPPP-LLAPRKS---------FTLLNTSQKLTVFPIASGRSG-NVVVKAVGGSSESSTSLDIIKSVRNVWDQPEDRLALFGLGFAGVATAWTA
        MASI ATLP P LL  RKS         F+L   +  L+   +    S  +++VKA G SS+SST LD++ +++N WD+ EDRL L GLGFAG+   W +
Subjt:  MASIVATLPPP-LLAPRKS---------FTLLNTSQKLTVFPIASGRSG-NVVVKAVGGSSESSTSLDIIKSVRNVWDQPEDRLALFGLGFAGVATAWTA

Query:  TNLVTAIDKLPLLPGALEFIGALVSWWFVYRYLLFKPNREELLQIINKSIVDVFG
         NL+TAIDKLP++    E +G L S WF YRYLLFKP+R+EL +I+ KS+ D+ G
Subjt:  TNLVTAIDKLPLLPGALEFIGALVSWWFVYRYLLFKPNREELLQIINKSIVDVFG

AT3G16170.1 AMP-dependent synthetase and ligase family protein1.0e-3054.89Show/hide
Query:  STDIMEVGGYKLSALEIESVILQHLSVIECCVLGLLDKDYGERVCAIIVLQPNTKMMTPDESKPTMSSHEIPTWAKDKLAPYKSPTLLLSKDSPPRNAMG
        S DIM+VGGYKLSALEIES +L+H +V ECCVLGL D DYGE V AII+ +   K    DESKP ++  E+  WAKDKLAPYK PT LL  +S PRNAMG
Subjt:  STDIMEVGGYKLSALEIESVILQHLSVIECCVLGLLDKDYGERVCAIIVLQPNTKMMTPDESKPTMSSHEIPTWAKDKLAPYKSPTLLLSKDSPPRNAMG

Query:  KVGCIFNRHCFPFFHSLELQVNKKELMKKLATE
        K                   VNKKEL K L  +
Subjt:  KVGCIFNRHCFPFFHSLELQVNKKELMKKLATE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCCATCGTTGCTACTCTACCTCCGCCATTGTTGGCGCCTCGCAAAAGCTTTACCCTTCTGAACACTTCTCAGAAGCTCACTGTTTTCCCCATTGCAAGT
GGACGTTCTGGCAATGTGGTTGTAAAGGCTGTTGGAGGCAGCTCTGAGTCTTCTACTTCCCTTGATATTATTAAGTCTGTTCGAAATGTTTGGGATCAACCTGAA
GACCGACTGGCGCTTTTTGGCCTGGGATTTGCAGGTGTAGCAACTGCATGGACAGCAACAAATCTTGTTACGGCCATCGACAAGCTACCACTGCTTCCAGGTGCA
TTAGAATTCATAGGAGCACTGGTTTCTTGGTGGTTCGTGTATCGCTACCTCCTGTTCAAGCCAAACCGGGAAGAGCTTTTGCAGATAATCAACAAGTCAATAGTA
GATGTATTTGGACATTGCAGTGCAGTCTGGTCACGCTCCATGGTGTCTAAAATGGAGAGTGATGAGAATGCTGTGAAAAAAACACAGCTGGTGGAAGCCCGAGCA
AGAAATATTAGTCACAATGTTAGGTGCATCGAGTGTGGTAGTCAATCCATTGAAGATTCACAAGCAGATATCGCGATTCTCCTCAGAAAGTTGATCCGTGATGAG
ATTCGGTCTGGGAAAACGGACAAGGAGATCTATAAAAAGCTTGAGAATGATTATGGGGAGACGATCCTTTATGCCCCAAAGTTTGACCTTCAAACTGCAGCGATA
TGGCTATCACCGGTATTAGTGGCTGGTGCTGCTGCAGGAGTATGGGCTTATAGTAAGTACAAGCAAAAGTCTAACATTCACATCATGGCTTTAAATCTAGTCAGG
GGTGTTCCATTGACCCCTAAAGAGAAGCAAACCATGGATGGAATGGGGAGGACTTCAACATTGGAAAACTCCTTACAACCTGCATGCAACTGCAAGCAAGGTCCA
GACATGTATCCAGCTGCAGTTAACCCTTATACTAGTTATGGTGAAAATACACAATCTGATCAAGTCGAATTCTCAGAAAACCTCAAGCTTCCTCACTCAATGTCG
ATTGCATTTGAAAAACAGATTTGGAAAAATACTGCTTCCTTTGCCATCTTCAAACTATCCCGAATCGCCAGAGCTTCGAGTGAATCAATCCCTGAAATGAATCGG
CAGTTCTTGGTGAATGGTCTGAGCACCGATATCATGGAAGTCGGAGGATATAAATTATCAGCATTAGAGATTGAATCAGTGATTTTGCAGCATCTGTCTGTCATA
GAATGTTGTGTGTTGGGATTACTAGACAAAGACTATGGAGAACGTGTTTGTGCAATTATTGTGCTTCAGCCTAACACAAAAATGATGACACCTGATGAGTCCAAG
CCTACTATGAGCTCTCATGAAATCCCCACATGGGCTAAAGACAAGCTTGCTCCATACAAGTCACCGACTCTGTTATTGTCGAAGGACTCACCACCTCGGAACGCC
ATGGGAAAGGTGGGTTGCATCTTCAACAGACATTGCTTCCCATTCTTTCATTCTCTTGAACTACAGGTCAACAAGAAAGAGTTGATGAAAAAGCTAGCAACCGAA
GGACTATCCTATTGGACCCATTGGCGATCAAAAGTGAGGCATACACATTTGCTCTTAGTAATAGTACTAGTTCTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCCATCGTTGCTACTCTACCTCCGCCATTGTTGGCGCCTCGCAAAAGCTTTACCCTTCTGAACACTTCTCAGAAGCTCACTGTTTTCCCCATTGCAAGT
GGACGTTCTGGCAATGTGGTTGTAAAGGCTGTTGGAGGCAGCTCTGAGTCTTCTACTTCCCTTGATATTATTAAGTCTGTTCGAAATGTTTGGGATCAACCTGAA
GACCGACTGGCGCTTTTTGGCCTGGGATTTGCAGGTGTAGCAACTGCATGGACAGCAACAAATCTTGTTACGGCCATCGACAAGCTACCACTGCTTCCAGGTGCA
TTAGAATTCATAGGAGCACTGGTTTCTTGGTGGTTCGTGTATCGCTACCTCCTGTTCAAGCCAAACCGGGAAGAGCTTTTGCAGATAATCAACAAGTCAATAGTA
GATGTATTTGGACATTGCAGTGCAGTCTGGTCACGCTCCATGGTGTCTAAAATGGAGAGTGATGAGAATGCTGTGAAAAAAACACAGCTGGTGGAAGCCCGAGCA
AGAAATATTAGTCACAATGTTAGGTGCATCGAGTGTGGTAGTCAATCCATTGAAGATTCACAAGCAGATATCGCGATTCTCCTCAGAAAGTTGATCCGTGATGAG
ATTCGGTCTGGGAAAACGGACAAGGAGATCTATAAAAAGCTTGAGAATGATTATGGGGAGACGATCCTTTATGCCCCAAAGTTTGACCTTCAAACTGCAGCGATA
TGGCTATCACCGGTATTAGTGGCTGGTGCTGCTGCAGGAGTATGGGCTTATAGTAAGTACAAGCAAAAGTCTAACATTCACATCATGGCTTTAAATCTAGTCAGG
GGTGTTCCATTGACCCCTAAAGAGAAGCAAACCATGGATGGAATGGGGAGGACTTCAACATTGGAAAACTCCTTACAACCTGCATGCAACTGCAAGCAAGGTCCA
GACATGTATCCAGCTGCAGTTAACCCTTATACTAGTTATGGTGAAAATACACAATCTGATCAAGTCGAATTCTCAGAAAACCTCAAGCTTCCTCACTCAATGTCG
ATTGCATTTGAAAAACAGATTTGGAAAAATACTGCTTCCTTTGCCATCTTCAAACTATCCCGAATCGCCAGAGCTTCGAGTGAATCAATCCCTGAAATGAATCGG
CAGTTCTTGGTGAATGGTCTGAGCACCGATATCATGGAAGTCGGAGGATATAAATTATCAGCATTAGAGATTGAATCAGTGATTTTGCAGCATCTGTCTGTCATA
GAATGTTGTGTGTTGGGATTACTAGACAAAGACTATGGAGAACGTGTTTGTGCAATTATTGTGCTTCAGCCTAACACAAAAATGATGACACCTGATGAGTCCAAG
CCTACTATGAGCTCTCATGAAATCCCCACATGGGCTAAAGACAAGCTTGCTCCATACAAGTCACCGACTCTGTTATTGTCGAAGGACTCACCACCTCGGAACGCC
ATGGGAAAGGTGGGTTGCATCTTCAACAGACATTGCTTCCCATTCTTTCATTCTCTTGAACTACAGGTCAACAAGAAAGAGTTGATGAAAAAGCTAGCAACCGAA
GGACTATCCTATTGGACCCATTGGCGATCAAAAGTGAGGCATACACATTTGCTCTTAGTAATAGTACTAGTTCTTTGA
Protein sequenceShow/hide protein sequence
MASIVATLPPPLLAPRKSFTLLNTSQKLTVFPIASGRSGNVVVKAVGGSSESSTSLDIIKSVRNVWDQPEDRLALFGLGFAGVATAWTATNLVTAIDKLPLLPGA
LEFIGALVSWWFVYRYLLFKPNREELLQIINKSIVDVFGHCSAVWSRSMVSKMESDENAVKKTQLVEARARNISHNVRCIECGSQSIEDSQADIAILLRKLIRDE
IRSGKTDKEIYKKLENDYGETILYAPKFDLQTAAIWLSPVLVAGAAAGVWAYSKYKQKSNIHIMALNLVRGVPLTPKEKQTMDGMGRTSTLENSLQPACNCKQGP
DMYPAAVNPYTSYGENTQSDQVEFSENLKLPHSMSIAFEKQIWKNTASFAIFKLSRIARASSESIPEMNRQFLVNGLSTDIMEVGGYKLSALEIESVILQHLSVI
ECCVLGLLDKDYGERVCAIIVLQPNTKMMTPDESKPTMSSHEIPTWAKDKLAPYKSPTLLLSKDSPPRNAMGKVGCIFNRHCFPFFHSLELQVNKKELMKKLATE
GLSYWTHWRSKVRHTHLLLVIVLVL