CuGenDBv2

Tan0011963 (gene) of Snake gourd v1 genome

Gene ID: Tan0011963
Organism: Trichosanthes anguina (Snake gourd v1)
Description: NADH-ubiquinone oxidoreductase chain 5
Genome location: Contig00124:292195..294922
RNA-Seq Expression: Tan0011963
Synteny: Tan0011963
Gene Ontology terms:
GO:0015986 - ATP synthesis coupled proton transport (biological process)
GO:0042773 - ATP synthesis coupled electron transport (biological process)
GO:0005739 - mitochondrion (cellular component)
GO:0009507 - chloroplast (cellular component)
GO:0009579 - thylakoid (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0045263 - proton-transporting ATP synthase complex, coupling factor F(o) (cellular component)
GO:0008137 - NADH dehydrogenase (ubiquinone) activity (molecular function)
GO:0015078 - proton transmembrane transporter activity (molecular function)
InterPro domains:
IPR000454 - ATP synthase, F0 complex, subunit C
IPR001750 - NADH:quinone oxidoreductase/Mrp antiporter, membrane subunit
IPR002379 - V-ATPase proteolipid subunit C-like domain
IPR020537 - ATP synthase, F0 complex, subunit C, DCCD-binding site
IPR035921 - F/V-ATP synthase subunit C superfamily
IPR038662 - F1F0 ATP synthase subunit C superfamily
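The genome location in this record has the form `contig:start..end`. As a minimal sketch, the string can be split into its parts and the span length computed. The helper names `parse_location` and `span_length` are hypothetical, and the length calculation assumes 1-based inclusive coordinates, which the page does not state.

```python
import re

# Assumed pattern for a "seqid:start..end" location string.
LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text):
    """Return (seqid, start, end) parsed from a 'seqid:start..end' string."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return match["seqid"], int(match["start"]), int(match["end"])


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("Contig00124:292195..294922")
print(seqid, start, end, span_length(start, end))
# prints: Contig00124 292195 294922 2728
```

Under that assumption the locus spans 2728 bp.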


Homology
GenBank top hits
GER38147.1 NADH-ubiquinone oxidoreductase chain 5 [Striga asiatica] | e-value: 3.2e-137 | identity: 77.62%
Query:  KAKTAPRIWVARGRLCPDGGALWGRAQPNRDSTPPTSSAPLYRLNHSKSLVGGTGEPREQAVDSFI------REGKKG--PKHGRCRTLEWQRKARSYLF
        KAKTAPRIWVARGRLCPDGGALWGRAQPNRDSTPPTSSAPLYRLNHS  LVGGTGEPRE A  + I      R G  G  P    C TL +Q      +F
Subjt:  KAKTAPRIWVARGRLCPDGGALWGRAQPNRDSTPPTSSAPLYRLNHSKSLVGGTGEPREQAVDSFI------REGKKG--PKHGRCRTLEWQRKARSYLF

Query:  FQACSDIRFPRKIKLCPRNSWISCNMRLNAITLICILLLIGAVGKSAQIGSHTWSPDAMEGPTPVSASIHAATMVTAGVFMIARS---YDVIP-------
          AC+           PRNSWISCNMRLNAITLICILLLIGAVGKSAQIGSHTWSPDAMEGPTPVSA IHAATMVTAGVFMIAR    ++  P       
Subjt:  FQACSDIRFPRKIKLCPRNSWISCNMRLNAITLICILLLIGAVGKSAQIGSHTWSPDAMEGPTPVSASIHAATMVTAGVFMIARS---YDVIP-------

Query:  ---------CGNHGILQNDLKRVIAYSTCSQLGYMIFACGISNYSVSVFHLMNHAFFKALLFLSAGSVIHAMSDEQDMRKMGA-CLLVPFTYAMMLMGSL
                     GILQNDLKRVIAYSTCSQLGYMIFACGISNYSVSVFHLMNHAFFKALLFLSAGSVIHAMSDEQDMRKMG      PFTYAMMLMGSL
Subjt:  ---------CGNHGILQNDLKRVIAYSTCSQLGYMIFACGISNYSVSVFHLMNHAFFKALLFLSAGSVIHAMSDEQDMRKMGA-CLLVPFTYAMMLMGSL

Query:  SLIGFPFLTGFYSKDVILELAYTKYTISGNFAFWLGSVSVLFTSYYSFRSLFLTFLVPTNSF
        SLIGFPFLTGFYSKDVILELAYTKYTISGNFAFWLGSVSVLFTSYYSFRSLFLTFLVPTNSF
Subjt:  SLIGFPFLTGFYSKDVILELAYTKYTISGNFAFWLGSVSVLFTSYYSFRSLFLTFLVPTNSF

KAB5511274.1 hypothetical protein DKX38_030069 [Salix brachista] | e-value: 1.0e-135 | identity: 77.29%
Query:  KAKTAPRIWVARGRLCPDGGALWGRAQPNRDSTPPTSSAPLYRLNHSKSLVGGTGEPREQAVDSFI------REGKKGPKHG-RCRTLEWQRKARSYLFF
        KAKTAPRIWVARGRLCPDGGA  GRAQPNRD TPPTSSAPLYRLNHSK LVGGTGEPRE A  + I      R G  G   G   R   +Q    S +F 
Subjt:  KAKTAPRIWVARGRLCPDGGALWGRAQPNRDSTPPTSSAPLYRLNHSKSLVGGTGEPREQAVDSFI------REGKKGPKHG-RCRTLEWQRKARSYLFF

Query:  QACSDIRFPRKIKLCPRNSWISCNMRLNAITLICILLLIGAVGKSAQIGSHTWSPDAMEGPTPVSASIHAATMVTAGVFMIARS---YDVIP--------
         AC+           PRNSWISCNMRLNAITLICILLLIGAVGKSAQIGSHTWSPDAMEGPTPVSASIHAATMVTAGVFMIAR    ++  P        
Subjt:  QACSDIRFPRKIKLCPRNSWISCNMRLNAITLICILLLIGAVGKSAQIGSHTWSPDAMEGPTPVSASIHAATMVTAGVFMIARS---YDVIP--------

Query:  --------CGNHGILQNDLKRVIAYSTCSQLGYMIFACGISNYSVSVFHLMNHAFFKALLFLSAGSVIHAMSDEQDMRKMGA-CLLVPFTYAMMLMGSLS
                    GILQNDLKRVIAYSTCSQLGYMIFACGISNYSVSVFHLMNHAFFKALLFLSAGSVIHAMSDEQDMRKMG      PFTYAMMLMGSLS
Subjt:  --------CGNHGILQNDLKRVIAYSTCSQLGYMIFACGISNYSVSVFHLMNHAFFKALLFLSAGSVIHAMSDEQDMRKMGA-CLLVPFTYAMMLMGSLS

Query:  LIGFPFLTGFYSKDVILELAYTKYTISGNFAFWLGSVSVLFTSYYSFRSLFLTFLVPTNSF
        LIGFPFLTGFYSKDVILELAYTKYTISGNFAFWLGSVSVLFTSYYSFRSLFLTFLVPTNSF
Subjt:  LIGFPFLTGFYSKDVILELAYTKYTISGNFAFWLGSVSVLFTSYYSFRSLFLTFLVPTNSF

KAB8477461.1 hypothetical protein FH972_025327 [Carpinus fangiana] | e-value: 6.3e-194 | identity: 69.79%
Query:  MLEGAKSMGAGAATIASAGAAVGIGNVFSSLIHSVARNPSLAKQSFGYAILGFALTEAIALFAPMMAFLISSVFRS-------KKEGPDNSL--RRVMGS
        MLEGAKS+GAGAATIASAGAAVGIGNVFSSLIHSVARNPSLAKQSFGYAILGFALTEAIALFA MMAFLISSVFRS        +  PD +L  RR    
Subjt:  MLEGAKSMGAGAATIASAGAAVGIGNVFSSLIHSVARNPSLAKQSFGYAILGFALTEAIALFAPMMAFLISSVFRS-------KKEGPDNSL--RRVMGS

Query:  GLVPENARSSDVGAYNSPCDSFLRSSVLRTSINLSQAALSLSDFCCLITLEIMYLLIVFLPLLGSSVRKAKTAPRIWVARGRLCPDGGALWGRAQPNRDS
         L+P     S  G      D + R                                        S   KAKTAPRIWVARGRLCPDGGALWGRAQPNRDS
Subjt:  GLVPENARSSDVGAYNSPCDSFLRSSVLRTSINLSQAALSLSDFCCLITLEIMYLLIVFLPLLGSSVRKAKTAPRIWVARGRLCPDGGALWGRAQPNRDS

Query:  TPPTSSAPLYRLNHSKSLVGGTGEPRE-----------QAVDSFIREGKKGPKHGRCRTLEWQRKARSYLFFQACSDIRFPRKIKL--------------
        TPPTSSAPLYRLNHSKSLVGGTGEPRE           QAVDSFIREGKKGPKHG CRTLEWQRKA SYLFFQACSDIRFP+KIKL              
Subjt:  TPPTSSAPLYRLNHSKSLVGGTGEPRE-----------QAVDSFIREGKKGPKHGRCRTLEWQRKARSYLFFQACSDIRFPRKIKL--------------

Query:  -----------------------------CPRNSWISCNMRLNAITLICILLLIGAVGKSAQIGSHTWSPDAMEGPTPVSASIHAATMVTAGVFMIARS-
                                      PRNSWISCNMRLNAITLICILLLIGAVGKSAQIGSHTWSPDAMEGPTPVSASIHAATMVTAGVFMIAR  
Subjt:  -----------------------------CPRNSWISCNMRLNAITLICILLLIGAVGKSAQIGSHTWSPDAMEGPTPVSASIHAATMVTAGVFMIARS-

Query:  --YDVIP----------------CGNHGILQNDLKRVIAYSTCSQLGYMIFACGISNYSVSVFHLMNHAFFKALLFLSAGSVIHAMSDEQDMRKMGA-CL
          ++  P                    GILQNDLKRVIAYSTCSQLGYMIFACGISNYSVSVFHLMNHAFFKALLFLSAGSVIHAMSDEQDMRKMG    
Subjt:  --YDVIP----------------CGNHGILQNDLKRVIAYSTCSQLGYMIFACGISNYSVSVFHLMNHAFFKALLFLSAGSVIHAMSDEQDMRKMGA-CL

Query:  LVPFTYAMMLMGSLSLIGFPFLTGFYSKDVILELAYTKYTISGNFAFWLGSVSVLFTSYYSFRSLFLTFLVPTNSF
          PFTYAMMLMGSLSLIGFPFLTGFYSKDVILELAYTKYTISGNFAFWLGSVSVLFTSYYSFRSLFLTFLVPTNSF
Subjt:  LVPFTYAMMLMGSLSLIGFPFLTGFYSKDVILELAYTKYTISGNFAFWLGSVSVLFTSYYSFRSLFLTFLVPTNSF

KJB09767.1 hypothetical protein B456_001G163400 [Gossypium raimondii] | e-value: 9.4e-166 | identity: 58.29%
Query:  MLEGAKSMGAGAATIASAGAAVGIGNVFSSLIHSVARNPSLAKQSFGYA-ILG-FALTEAIALFAPMMAFL--ISSVFRSKKEGPDNSLRRVMGSGLVPE
        MLEGAKSMGAGAATIASAGAAVGIGNVFSSLIHSVARNPSLA    G A   G F  +E  A+          I S+    +  P  S   +  +  +  
Subjt:  MLEGAKSMGAGAATIASAGAAVGIGNVFSSLIHSVARNPSLAKQSFGYA-ILG-FALTEAIALFAPMMAFL--ISSVFRSKKEGPDNSLRRVMGSGLVPE

Query:  NARSSDVGAYNSPCDSFLR------SSVLRTSINLSQAALSLSDFCCLITLEIMYLLIVFLPLLG-------SSVRKAKTAPRIWVARGRLCPDGGALWG
            +  G +    D+ +         +L T  + S + +               +L V  PL+G       S   KAKTAPRIWVARGRLCPDGGALWG
Subjt:  NARSSDVGAYNSPCDSFLR------SSVLRTSINLSQAALSLSDFCCLITLEIMYLLIVFLPLLG-------SSVRKAKTAPRIWVARGRLCPDGGALWG

Query:  RAQPNRDSTPPTSSAPLYRLNHSKSLVGGTGEPRE--------------------QAVDSFIREGKKGPKHGRCRTLEWQRKARSYLFFQACSDIRFPRK
        RAQPNRDSTPPTSSAPLYRLNHSKSLVGGTGEPRE                    Q VDSFIREGKKGPKHG CRTLEWQRKA SYLFFQAC DIRFPRK
Subjt:  RAQPNRDSTPPTSSAPLYRLNHSKSLVGGTGEPRE--------------------QAVDSFIREGKKGPKHGRCRTLEWQRKARSYLFFQACSDIRFPRK

Query:  IKL-----------------------------------------------------------------------------------------------CP
        IKL                                                                                                P
Subjt:  IKL-----------------------------------------------------------------------------------------------CP

Query:  RNSWISCNMRLNAITLICILLLIGAVGKSAQIGSHTWSPDAMEGPTPVSASIHAATMVTAGVFMIARS---YDVIP----------------CGNHGILQ
        RNSWISCNMRLNAITLICILLLIGAVGKSAQIGSHTWSPDAMEGPTPVSA IHAATMVTAGVFMIAR    ++  P                    GILQ
Subjt:  RNSWISCNMRLNAITLICILLLIGAVGKSAQIGSHTWSPDAMEGPTPVSASIHAATMVTAGVFMIARS---YDVIP----------------CGNHGILQ

Query:  NDLKRVIAYSTCSQLGYMIFACGISNYSVSVFHLMNHAFFKALLFLSAGSVIHAMSDEQDMRKMGA-CLLVPFTYAMMLMGSLSLIGFPFLTGFYSKDVI
        NDLKRVIAYSTCSQLGYMIFACGISNYSVSVFHLMNHAFFKALLFLSAGSVIHAMSDEQDMRKMG      PFTYAMMLMGSLSLIGFPFLTGFYSKDVI
Subjt:  NDLKRVIAYSTCSQLGYMIFACGISNYSVSVFHLMNHAFFKALLFLSAGSVIHAMSDEQDMRKMGA-CLLVPFTYAMMLMGSLSLIGFPFLTGFYSKDVI

Query:  LELAYTKYTISGNFAFWLGSVSVLFTSYYSFRSLFLTFLVPTNSF
        LELAYTKYTISGNFAFWLGSVSVLFTSYYSFR LFLTFLVPTNSF
Subjt:  LELAYTKYTISGNFAFWLGSVSVLFTSYYSFRSLFLTFLVPTNSF

YP_009526581.1 NADH dehydrogenase subunit 5 [Ammopiptanthus mongolicus] | e-value: 9.8e-163 | identity: 77.7%
Query:  KAKTAPRIWVARGRLCPDGGALWGRAQPNRDSTPPTSSAPLYRLNHSKSLVGGTGEPRE-----------QAVDSFIREGKKGPKHGRCRTLEWQRKARS
        KAKTAPRIWVARGRLCPDGGALWGRAQPNRDSTPPTSSAPLYRLNHSKSLVGGTGEPRE           QAVDSFIREGKKGPKHGRCRTLEWQRK RS
Subjt:  KAKTAPRIWVARGRLCPDGGALWGRAQPNRDSTPPTSSAPLYRLNHSKSLVGGTGEPRE-----------QAVDSFIREGKKGPKHGRCRTLEWQRKARS

Query:  YLFFQACSDIRFPRKIKL-------------------------------------------CPRNSWISCNMRLNAITLICILLLIGAVGKSAQIGSHTW
        YLFFQACSDIRFPRKIKL                                            PRNSWISCNMRLNAITLICILLLIGAVGKSAQIGSHTW
Subjt:  YLFFQACSDIRFPRKIKL-------------------------------------------CPRNSWISCNMRLNAITLICILLLIGAVGKSAQIGSHTW

Query:  SPDAMEGPTPVSASIHAATMVTAGVFMIARS---YDVIP----------------CGNHGILQNDLKRVIAYSTCSQLGYMIFACGISNYSVSVFHLMNH
        SPDAMEGPTPVSA IHAATMVTAGVFMIAR    ++  P                    GILQNDLKRVIAYSTCSQLGYMIFACGISNYSVSVFHLMNH
Subjt:  SPDAMEGPTPVSASIHAATMVTAGVFMIARS---YDVIP----------------CGNHGILQNDLKRVIAYSTCSQLGYMIFACGISNYSVSVFHLMNH

Query:  AFFKALLFLSAGSVIHAMSDEQDMRKMGA-CLLVPFTYAMMLMGSLSLIGFPFLTGFYSKDVILELAYTKYTISGNFAFWLGSVSVLFTSYYSFRSLFLT
        AFFKALLFLSAGSVIHAMSDEQDMRKMG      PFTYAMMLMGSLSLIGFPFLTGFYSKDVILELAYTKYTISGNFAFWLGSVSVLFTSYYSFRSLFLT
Subjt:  AFFKALLFLSAGSVIHAMSDEQDMRKMGA-CLLVPFTYAMMLMGSLSLIGFPFLTGFYSKDVILELAYTKYTISGNFAFWLGSVSVLFTSYYSFRSLFLT

Query:  FLVPTNSF
        FLVPTNSF
Subjt:  FLVPTNSF

TrEMBL top hits
A0A0D2LZE3 Proton_antipo_M domain-containing protein | e-value: 4.6e-166 | identity: 58.29%
Query:  MLEGAKSMGAGAATIASAGAAVGIGNVFSSLIHSVARNPSLAKQSFGYA-ILG-FALTEAIALFAPMMAFL--ISSVFRSKKEGPDNSLRRVMGSGLVPE
        MLEGAKSMGAGAATIASAGAAVGIGNVFSSLIHSVARNPSLA    G A   G F  +E  A+          I S+    +  P  S   +  +  +  
Subjt:  MLEGAKSMGAGAATIASAGAAVGIGNVFSSLIHSVARNPSLAKQSFGYA-ILG-FALTEAIALFAPMMAFL--ISSVFRSKKEGPDNSLRRVMGSGLVPE

Query:  NARSSDVGAYNSPCDSFLR------SSVLRTSINLSQAALSLSDFCCLITLEIMYLLIVFLPLLG-------SSVRKAKTAPRIWVARGRLCPDGGALWG
            +  G +    D+ +         +L T  + S + +               +L V  PL+G       S   KAKTAPRIWVARGRLCPDGGALWG
Subjt:  NARSSDVGAYNSPCDSFLR------SSVLRTSINLSQAALSLSDFCCLITLEIMYLLIVFLPLLG-------SSVRKAKTAPRIWVARGRLCPDGGALWG

Query:  RAQPNRDSTPPTSSAPLYRLNHSKSLVGGTGEPRE--------------------QAVDSFIREGKKGPKHGRCRTLEWQRKARSYLFFQACSDIRFPRK
        RAQPNRDSTPPTSSAPLYRLNHSKSLVGGTGEPRE                    Q VDSFIREGKKGPKHG CRTLEWQRKA SYLFFQAC DIRFPRK
Subjt:  RAQPNRDSTPPTSSAPLYRLNHSKSLVGGTGEPRE--------------------QAVDSFIREGKKGPKHGRCRTLEWQRKARSYLFFQACSDIRFPRK

Query:  IKL-----------------------------------------------------------------------------------------------CP
        IKL                                                                                                P
Subjt:  IKL-----------------------------------------------------------------------------------------------CP

Query:  RNSWISCNMRLNAITLICILLLIGAVGKSAQIGSHTWSPDAMEGPTPVSASIHAATMVTAGVFMIARS---YDVIP----------------CGNHGILQ
        RNSWISCNMRLNAITLICILLLIGAVGKSAQIGSHTWSPDAMEGPTPVSA IHAATMVTAGVFMIAR    ++  P                    GILQ
Subjt:  RNSWISCNMRLNAITLICILLLIGAVGKSAQIGSHTWSPDAMEGPTPVSASIHAATMVTAGVFMIARS---YDVIP----------------CGNHGILQ

Query:  NDLKRVIAYSTCSQLGYMIFACGISNYSVSVFHLMNHAFFKALLFLSAGSVIHAMSDEQDMRKMGA-CLLVPFTYAMMLMGSLSLIGFPFLTGFYSKDVI
        NDLKRVIAYSTCSQLGYMIFACGISNYSVSVFHLMNHAFFKALLFLSAGSVIHAMSDEQDMRKMG      PFTYAMMLMGSLSLIGFPFLTGFYSKDVI
Subjt:  NDLKRVIAYSTCSQLGYMIFACGISNYSVSVFHLMNHAFFKALLFLSAGSVIHAMSDEQDMRKMGA-CLLVPFTYAMMLMGSLSLIGFPFLTGFYSKDVI

Query:  LELAYTKYTISGNFAFWLGSVSVLFTSYYSFRSLFLTFLVPTNSF
        LELAYTKYTISGNFAFWLGSVSVLFTSYYSFR LFLTFLVPTNSF
Subjt:  LELAYTKYTISGNFAFWLGSVSVLFTSYYSFRSLFLTFLVPTNSF

A0A2N9G584 Proton_antipo_M domain-containing protein | e-value: 4.3e-140 | identity: 78.45%
Query:  KAKTAPRIWVARGRLCPDGGALWGRAQPNRDSTPPTSSAPLYRLNHSKSLVGGTGEPREQAVDSFI------REGKKG--PKHGRCRTLEWQRKARSYLF
        KAKTAPRIWVARGRLCPDGGALWGRAQPNRDSTPPTSSAPLYRLNHSKSLVGGTGEPRE A  + I      R G  G  P    C TL +Q    S +F
Subjt:  KAKTAPRIWVARGRLCPDGGALWGRAQPNRDSTPPTSSAPLYRLNHSKSLVGGTGEPREQAVDSFI------REGKKG--PKHGRCRTLEWQRKARSYLF

Query:  FQACSDIRFPRKIKLCPRNSWISCNMRLNAITLICILLLIGAVGKSAQIGSHTWSPDAMEGPTPVSASIHAATMVTAGVFMIARS---YDVIP-------
         +A +           PRNSWISCNMRLNAITLICILLLIGAVGKSAQIGSHTWSPDAMEGPTPVSASIHAATMVTAGVFMIAR    ++  P       
Subjt:  FQACSDIRFPRKIKLCPRNSWISCNMRLNAITLICILLLIGAVGKSAQIGSHTWSPDAMEGPTPVSASIHAATMVTAGVFMIARS---YDVIP-------

Query:  ---------CGNHGILQNDLKRVIAYSTCSQLGYMIFACGISNYSVSVFHLMNHAFFKALLFLSAGSVIHAMSDEQDMRKMGA-CLLVPFTYAMMLMGSL
                     GILQNDLKRVIAYSTCSQLGYMIFACGISNYSVSVFHLMNHAFFKALLFLSAGSVIHAMSDEQDMRKMG      PFTYAMMLMGSL
Subjt:  ---------CGNHGILQNDLKRVIAYSTCSQLGYMIFACGISNYSVSVFHLMNHAFFKALLFLSAGSVIHAMSDEQDMRKMGA-CLLVPFTYAMMLMGSL

Query:  SLIGFPFLTGFYSKDVILELAYTKYTISGNFAFWLGSVSVLFTSYYSFRSLFLTFLVPTNSF
        SLIGFPFLTGFYSKDVILELAYTKYTISGNFAFWLGSVSVLFTSYYSFRSLFLTFLVPTNSF
Subjt:  SLIGFPFLTGFYSKDVILELAYTKYTISGNFAFWLGSVSVLFTSYYSFRSLFLTFLVPTNSF

A0A385G2B9 NADH dehydrogenase subunit 5 | e-value: 4.7e-163 | identity: 77.7%
Query:  KAKTAPRIWVARGRLCPDGGALWGRAQPNRDSTPPTSSAPLYRLNHSKSLVGGTGEPRE-----------QAVDSFIREGKKGPKHGRCRTLEWQRKARS
        KAKTAPRIWVARGRLCPDGGALWGRAQPNRDSTPPTSSAPLYRLNHSKSLVGGTGEPRE           QAVDSFIREGKKGPKHGRCRTLEWQRK RS
Subjt:  KAKTAPRIWVARGRLCPDGGALWGRAQPNRDSTPPTSSAPLYRLNHSKSLVGGTGEPRE-----------QAVDSFIREGKKGPKHGRCRTLEWQRKARS

Query:  YLFFQACSDIRFPRKIKL-------------------------------------------CPRNSWISCNMRLNAITLICILLLIGAVGKSAQIGSHTW
        YLFFQACSDIRFPRKIKL                                            PRNSWISCNMRLNAITLICILLLIGAVGKSAQIGSHTW
Subjt:  YLFFQACSDIRFPRKIKL-------------------------------------------CPRNSWISCNMRLNAITLICILLLIGAVGKSAQIGSHTW

Query:  SPDAMEGPTPVSASIHAATMVTAGVFMIARS---YDVIP----------------CGNHGILQNDLKRVIAYSTCSQLGYMIFACGISNYSVSVFHLMNH
        SPDAMEGPTPVSA IHAATMVTAGVFMIAR    ++  P                    GILQNDLKRVIAYSTCSQLGYMIFACGISNYSVSVFHLMNH
Subjt:  SPDAMEGPTPVSASIHAATMVTAGVFMIARS---YDVIP----------------CGNHGILQNDLKRVIAYSTCSQLGYMIFACGISNYSVSVFHLMNH

Query:  AFFKALLFLSAGSVIHAMSDEQDMRKMGA-CLLVPFTYAMMLMGSLSLIGFPFLTGFYSKDVILELAYTKYTISGNFAFWLGSVSVLFTSYYSFRSLFLT
        AFFKALLFLSAGSVIHAMSDEQDMRKMG      PFTYAMMLMGSLSLIGFPFLTGFYSKDVILELAYTKYTISGNFAFWLGSVSVLFTSYYSFRSLFLT
Subjt:  AFFKALLFLSAGSVIHAMSDEQDMRKMGA-CLLVPFTYAMMLMGSLSLIGFPFLTGFYSKDVILELAYTKYTISGNFAFWLGSVSVLFTSYYSFRSLFLT

Query:  FLVPTNSF
        FLVPTNSF
Subjt:  FLVPTNSF

A0A5A7Q196 NADH-ubiquinone oxidoreductase chain 5 | e-value: 1.5e-137 | identity: 77.62%
Query:  KAKTAPRIWVARGRLCPDGGALWGRAQPNRDSTPPTSSAPLYRLNHSKSLVGGTGEPREQAVDSFI------REGKKG--PKHGRCRTLEWQRKARSYLF
        KAKTAPRIWVARGRLCPDGGALWGRAQPNRDSTPPTSSAPLYRLNHS  LVGGTGEPRE A  + I      R G  G  P    C TL +Q      +F
Subjt:  KAKTAPRIWVARGRLCPDGGALWGRAQPNRDSTPPTSSAPLYRLNHSKSLVGGTGEPREQAVDSFI------REGKKG--PKHGRCRTLEWQRKARSYLF

Query:  FQACSDIRFPRKIKLCPRNSWISCNMRLNAITLICILLLIGAVGKSAQIGSHTWSPDAMEGPTPVSASIHAATMVTAGVFMIARS---YDVIP-------
          AC+           PRNSWISCNMRLNAITLICILLLIGAVGKSAQIGSHTWSPDAMEGPTPVSA IHAATMVTAGVFMIAR    ++  P       
Subjt:  FQACSDIRFPRKIKLCPRNSWISCNMRLNAITLICILLLIGAVGKSAQIGSHTWSPDAMEGPTPVSASIHAATMVTAGVFMIARS---YDVIP-------

Query:  ---------CGNHGILQNDLKRVIAYSTCSQLGYMIFACGISNYSVSVFHLMNHAFFKALLFLSAGSVIHAMSDEQDMRKMGA-CLLVPFTYAMMLMGSL
                     GILQNDLKRVIAYSTCSQLGYMIFACGISNYSVSVFHLMNHAFFKALLFLSAGSVIHAMSDEQDMRKMG      PFTYAMMLMGSL
Subjt:  ---------CGNHGILQNDLKRVIAYSTCSQLGYMIFACGISNYSVSVFHLMNHAFFKALLFLSAGSVIHAMSDEQDMRKMGA-CLLVPFTYAMMLMGSL

Query:  SLIGFPFLTGFYSKDVILELAYTKYTISGNFAFWLGSVSVLFTSYYSFRSLFLTFLVPTNSF
        SLIGFPFLTGFYSKDVILELAYTKYTISGNFAFWLGSVSVLFTSYYSFRSLFLTFLVPTNSF
Subjt:  SLIGFPFLTGFYSKDVILELAYTKYTISGNFAFWLGSVSVLFTSYYSFRSLFLTFLVPTNSF

A0A5N6L1B2 Uncharacterized protein | e-value: 3.0e-194 | identity: 69.79%
Query:  MLEGAKSMGAGAATIASAGAAVGIGNVFSSLIHSVARNPSLAKQSFGYAILGFALTEAIALFAPMMAFLISSVFRS-------KKEGPDNSL--RRVMGS
        MLEGAKS+GAGAATIASAGAAVGIGNVFSSLIHSVARNPSLAKQSFGYAILGFALTEAIALFA MMAFLISSVFRS        +  PD +L  RR    
Subjt:  MLEGAKSMGAGAATIASAGAAVGIGNVFSSLIHSVARNPSLAKQSFGYAILGFALTEAIALFAPMMAFLISSVFRS-------KKEGPDNSL--RRVMGS

Query:  GLVPENARSSDVGAYNSPCDSFLRSSVLRTSINLSQAALSLSDFCCLITLEIMYLLIVFLPLLGSSVRKAKTAPRIWVARGRLCPDGGALWGRAQPNRDS
         L+P     S  G      D + R                                        S   KAKTAPRIWVARGRLCPDGGALWGRAQPNRDS
Subjt:  GLVPENARSSDVGAYNSPCDSFLRSSVLRTSINLSQAALSLSDFCCLITLEIMYLLIVFLPLLGSSVRKAKTAPRIWVARGRLCPDGGALWGRAQPNRDS

Query:  TPPTSSAPLYRLNHSKSLVGGTGEPRE-----------QAVDSFIREGKKGPKHGRCRTLEWQRKARSYLFFQACSDIRFPRKIKL--------------
        TPPTSSAPLYRLNHSKSLVGGTGEPRE           QAVDSFIREGKKGPKHG CRTLEWQRKA SYLFFQACSDIRFP+KIKL              
Subjt:  TPPTSSAPLYRLNHSKSLVGGTGEPRE-----------QAVDSFIREGKKGPKHGRCRTLEWQRKARSYLFFQACSDIRFPRKIKL--------------

Query:  -----------------------------CPRNSWISCNMRLNAITLICILLLIGAVGKSAQIGSHTWSPDAMEGPTPVSASIHAATMVTAGVFMIARS-
                                      PRNSWISCNMRLNAITLICILLLIGAVGKSAQIGSHTWSPDAMEGPTPVSASIHAATMVTAGVFMIAR  
Subjt:  -----------------------------CPRNSWISCNMRLNAITLICILLLIGAVGKSAQIGSHTWSPDAMEGPTPVSASIHAATMVTAGVFMIARS-

Query:  --YDVIP----------------CGNHGILQNDLKRVIAYSTCSQLGYMIFACGISNYSVSVFHLMNHAFFKALLFLSAGSVIHAMSDEQDMRKMGA-CL
          ++  P                    GILQNDLKRVIAYSTCSQLGYMIFACGISNYSVSVFHLMNHAFFKALLFLSAGSVIHAMSDEQDMRKMG    
Subjt:  --YDVIP----------------CGNHGILQNDLKRVIAYSTCSQLGYMIFACGISNYSVSVFHLMNHAFFKALLFLSAGSVIHAMSDEQDMRKMGA-CL

Query:  LVPFTYAMMLMGSLSLIGFPFLTGFYSKDVILELAYTKYTISGNFAFWLGSVSVLFTSYYSFRSLFLTFLVPTNSF
          PFTYAMMLMGSLSLIGFPFLTGFYSKDVILELAYTKYTISGNFAFWLGSVSVLFTSYYSFRSLFLTFLVPTNSF
Subjt:  LVPFTYAMMLMGSLSLIGFPFLTGFYSKDVILELAYTKYTISGNFAFWLGSVSVLFTSYYSFRSLFLTFLVPTNSF

SwissProt top hits
P10330 NADH-ubiquinone oxidoreductase chain 5 | e-value: 1.1e-105 | identity: 84.55%
Query:  PRNSWISCNMRLNAITLICILLLIGAVGKSAQIGSHTWSPDAMEGPTPVSASIHAATMVTAGVFMIARS---YDVIP----------------CGNHGIL
        PRNSWISCNMRLNAITLICILLLIGAVGKSAQIG HTW PDAMEGPTPVSA IHAATMVTAGVFMIAR    ++  P                    GIL
Subjt:  PRNSWISCNMRLNAITLICILLLIGAVGKSAQIGSHTWSPDAMEGPTPVSASIHAATMVTAGVFMIARS---YDVIP----------------CGNHGIL

Query:  QNDLKRVIAYSTCSQLGYMIFACGISNYSVSVFHLMNHAFFKALLFLSAGSVIHAMSDEQDMRKMGA-CLLVPFTYAMMLMGSLSLIGFPFLTGFYSKDV
        QNDLKRVIAYSTCSQLGYMIFACGISNYSVSVFHLMNHAFFKALLFLSAGSVIHAMSDEQDMRKMG      PFTYAMMLMGSLSLIGFPFLTGFYSKDV
Subjt:  QNDLKRVIAYSTCSQLGYMIFACGISNYSVSVFHLMNHAFFKALLFLSAGSVIHAMSDEQDMRKMGA-CLLVPFTYAMMLMGSLSLIGFPFLTGFYSKDV

Query:  ILELAYTKYTISGNFAFWLGSVSVLFTSYYSFRSLFLTFLVPTNSF
        ILELAYTKYTISGNFAFWLGSVSVLFTSYYSFR LFLTFLVPTNSF
Subjt:  ILELAYTKYTISGNFAFWLGSVSVLFTSYYSFRSLFLTFLVPTNSF

P26849 NADH-ubiquinone oxidoreductase chain 5 | e-value: 1.5e-97 | identity: 78.14%
Query:  PRNSWISCNMRLNAITLICILLLIGAVGKSAQIGSHTWSPDAMEGPTPVSASIHAATMVTAGVFMIARS---YDVIP----------------CGNHGIL
        P + ++ CNM  +AIT+ICIL+ IGAVGKSAQIG HTW PDAMEGPTPVSA IHAATMVTAGVFMIAR    ++  P                    GIL
Subjt:  PRNSWISCNMRLNAITLICILLLIGAVGKSAQIGSHTWSPDAMEGPTPVSASIHAATMVTAGVFMIARS---YDVIP----------------CGNHGIL

Query:  QNDLKRVIAYSTCSQLGYMIFACGISNYSVSVFHLMNHAFFKALLFLSAGSVIHAMSDEQDMRKMGA-CLLVPFTYAMMLMGSLSLIGFPFLTGFYSKDV
        QNDLKRVIAYSTCSQLGYMIFACGISNYSVSVFHLMNHA FKALLFLSAGSVIHAMSDEQDMRKMG    L+PFTYAMML+GSLSLIGFPFLTGFYSKDV
Subjt:  QNDLKRVIAYSTCSQLGYMIFACGISNYSVSVFHLMNHAFFKALLFLSAGSVIHAMSDEQDMRKMGA-CLLVPFTYAMMLMGSLSLIGFPFLTGFYSKDV

Query:  ILELAYTKYTISGNFAFWLGSVSVLFTSYYSFRSLFLTFLVPTNSFR
        ILELAYTKYTISGNFAFWLGSVSV FTSYYSFR LFLTFL PTNSF+
Subjt:  ILELAYTKYTISGNFAFWLGSVSVLFTSYYSFRSLFLTFLVPTNSFR

P29388 NADH-ubiquinone oxidoreductase chain 5 | e-value: 6.9e-103 | identity: 82.52%
Query:  PRNSWISCNMRLNAITLICILLLIGAVGKSAQIGSHTWSPDAMEGPTPVSASIHAATMVTAGVFMIARS---YDVIP----------------CGNHGIL
        PRNSWI CNMRLNAI+LICILL IGAVGKSAQIG HTW PDAMEGPTPVSA IHAATMVTAGVFMIAR    ++  P                    GIL
Subjt:  PRNSWISCNMRLNAITLICILLLIGAVGKSAQIGSHTWSPDAMEGPTPVSASIHAATMVTAGVFMIARS---YDVIP----------------CGNHGIL

Query:  QNDLKRVIAYSTCSQLGYMIFACGISNYSVSVFHLMNHAFFKALLFLSAGSVIHAMSDEQDMRKMGA-CLLVPFTYAMMLMGSLSLIGFPFLTGFYSKDV
        QNDLKRVIAYSTCSQLGYMIFACGISNYSVSVFHLMNHAFFKALLFLSAGSVIHAMSDEQDMRKMG      P TYAMML+GSLSLIGFPFLTGFYSKDV
Subjt:  QNDLKRVIAYSTCSQLGYMIFACGISNYSVSVFHLMNHAFFKALLFLSAGSVIHAMSDEQDMRKMGA-CLLVPFTYAMMLMGSLSLIGFPFLTGFYSKDV

Query:  ILELAYTKYTISGNFAFWLGSVSVLFTSYYSFRSLFLTFLVPTNSF
        ILELAYTKYTISGNFAFWLGSVSVLFTSYYSFR LFLTFLVPTNSF
Subjt:  ILELAYTKYTISGNFAFWLGSVSVLFTSYYSFRSLFLTFLVPTNSF

Q35099 NADH-ubiquinone oxidoreductase chain 5 | e-value: 1.2e-73 | identity: 63.48%
Query:  TLICILLLIGAVGKSAQIGSHTWSPDAMEGPTPVSASIHAATMVTAGVFMIARSYDVIP-------------------CGNHGILQNDLKRVIAYSTCSQ
        TLIC+ L IGAVGKSAQ+G HTW PDAMEGPTPVSA IHAATMVTAGVF++ RS  ++                        G++QNDLK+VIAYSTCSQ
Subjt:  TLICILLLIGAVGKSAQIGSHTWSPDAMEGPTPVSASIHAATMVTAGVFMIARSYDVIP-------------------CGNHGILQNDLKRVIAYSTCSQ

Query:  LGYMIFACGISNYSVSVFHLMNHAFFKALLFLSAGSVIHAMSDEQDMRKMGACL-LVPFTYAMMLMGSLSLIGFPFLTGFYSKDVILELAYTKYTISGNF
        LGYM+ ACG+S YS+S+FHLMNHAFFKALLFLSAGSVIHA++DEQDMRKMG  +  +PFTY M+++GSLSL+GFP+LTGFYSKD+ILELAY +Y ++  F
Subjt:  LGYMIFACGISNYSVSVFHLMNHAFFKALLFLSAGSVIHAMSDEQDMRKMGACL-LVPFTYAMMLMGSLSLIGFPFLTGFYSKDVILELAYTKYTISGNF

Query:  AFWLGSVSVLFTSYYSFRSLFLTFLVPTNS
        A WLG  S L T+ YS R ++LTF+  TN+
Subjt:  AFWLGSVSVLFTSYYSFRSLFLTFLVPTNS

Q37680 NADH-ubiquinone oxidoreductase chain 5 | e-value: 2.4e-103 | identity: 82.93%
Query:  PRNSWISCNMRLNAITLICILLLIGAVGKSAQIGSHTWSPDAMEGPTPVSASIHAATMVTAGVFMIARS---YDVIP----------------CGNHGIL
        PRN WI CNMRLNAITLICILL IGAVGKSAQIG HTW PDAMEGPTPVSA IHAATMVTAGVFMIAR    ++  P                    GIL
Subjt:  PRNSWISCNMRLNAITLICILLLIGAVGKSAQIGSHTWSPDAMEGPTPVSASIHAATMVTAGVFMIARS---YDVIP----------------CGNHGIL

Query:  QNDLKRVIAYSTCSQLGYMIFACGISNYSVSVFHLMNHAFFKALLFLSAGSVIHAMSDEQDMRKMGA-CLLVPFTYAMMLMGSLSLIGFPFLTGFYSKDV
        QNDLKRVIAYSTCSQLGYMIFACGISNYSVSVFHLMNHAFFKALLFLSAGSVIHAMSDEQDMRKMG      P TYAMMLMGSLSLIGFPFLTGFYSKDV
Subjt:  QNDLKRVIAYSTCSQLGYMIFACGISNYSVSVFHLMNHAFFKALLFLSAGSVIHAMSDEQDMRKMGA-CLLVPFTYAMMLMGSLSLIGFPFLTGFYSKDV

Query:  ILELAYTKYTISGNFAFWLGSVSVLFTSYYSFRSLFLTFLVPTNSF
        ILELAYTKYTISGNFAFWLGSVSVLFTSYYSFR LFLTFLVPTNSF
Subjt:  ILELAYTKYTISGNFAFWLGSVSVLFTSYYSFRSLFLTFLVPTNSF

Arabidopsis top hits
AT2G07671.1 ATP synthase subunit C family protein | e-value: 8.9e-29 | identity: 94.59%
Query:  MLEGAKSMGAGAATIASAGAAVGIGNVFSSLIHSVARNPSLAKQSFGYAILGFALTEAIALFAPMMAFLISSVF
        MLEGAKS+GAGAATIASAGAA+GIGNVFSSLIHSVARNPSLAKQSFGYAILGFALTEAIALFAPMMAFLI  VF
Subjt:  MLEGAKSMGAGAATIASAGAAVGIGNVFSSLIHSVARNPSLAKQSFGYAILGFALTEAIALFAPMMAFLISSVF
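The percent identity reported for a hit can be recomputed from the middle line of its alignment block. The sketch below assumes the usual BLAST-style convention, in which the middle line shows a letter where query and subject are identical, `+` for a conservative substitution, and a space for a mismatch or gap. The function name `percent_identity` is hypothetical. The two strings are the query and middle line of the AT2G07671.1 block above.

```python
def percent_identity(query, midline):
    """Percent identity over the aligned length.

    ASSUMES a BLAST-style midline: letters mark identical positions,
    '+' marks conservative substitutions, spaces mark mismatches or gaps.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    identities = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identities / len(query)


query = "MLEGAKSMGAGAATIASAGAAVGIGNVFSSLIHSVARNPSLAKQSFGYAILGFALTEAIALFAPMMAFLISSVF"
midline = "MLEGAKS+GAGAATIASAGAA+GIGNVFSSLIHSVARNPSLAKQSFGYAILGFALTEAIALFAPMMAFLI  VF"

print(round(percent_identity(query, midline), 2))
# prints: 94.59
```

Here 70 of the 74 aligned positions are identical, which matches the identity listed for this hit.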

ATCG01010.1 NADH-Ubiquinone oxidoreductase (complex I), chain 5 protein | e-value: 1.0e-37 | identity: 43.15%
Query:  NMRLNA--ITLICILLLIGAVGKSAQIGSHTWSPDAMEGPTPVSASIHAATMVTAGVFMIAR---SYDVIPC----------------GNHGILQNDLKR
        N R+N   +TL   LL +G + KSAQ   H W PDAMEGPTP+SA IHAATMV AG+F++AR    + VIP                     + Q D+KR
Subjt:  NMRLNA--ITLICILLLIGAVGKSAQIGSHTWSPDAMEGPTPVSASIHAATMVTAGVFMIAR---SYDVIPC----------------GNHGILQNDLKR

Query:  VIAYSTCSQLGYMIFACGISNYSVSVFHLMNHAFFKALLFLSAGSVIHAM--------SDEQDMRKMGACLL-VPFTYAMMLMGSLSLIGFPFLTGFYSK
         +AYST SQLGYM+ A G+ +Y  ++FHL+ HA+ KALLFL +GS+IH+M           Q+M  MG     VP T    L+G+LSL G P L  F+SK
Subjt:  VIAYSTCSQLGYMIFACGISNYSVSVFHLMNHAFFKALLFLSAGSVIHAM--------SDEQDMRKMGACLL-VPFTYAMMLMGSLSLIGFPFLTGFYSK

Query:  DVILELAYTKYTISGNFAFWLGSVSVLFTSYYSFRSLFLTFLVPTNSF
        D IL         S  FA    S + L T++Y FR   LTF    N++
Subjt:  DVILELAYTKYTISGNFAFWLGSVSVLFTSYYSFRSLFLTFLVPTNSF

ATMG00060.1 NADH dehydrogenase subunit 5C | e-value: 3.6e-107 | identity: 84.55%
Query:  PRNSWISCNMRLNAITLICILLLIGAVGKSAQIGSHTWSPDAMEGPTPVSASIHAATMVTAGVFMIARS---YDVIP----------------CGNHGIL
        PRNSWISCNMRLNAI+LICILLLIGAVGKSAQIGSHTWSPDAMEGPTPVSASIHAATMVTAGVFMIAR    ++  P                    GIL
Subjt:  PRNSWISCNMRLNAITLICILLLIGAVGKSAQIGSHTWSPDAMEGPTPVSASIHAATMVTAGVFMIARS---YDVIP----------------CGNHGIL

Query:  QNDLKRVIAYSTCSQLGYMIFACGISNYSVSVFHLMNHAFFKALLFLSAGSVIHAMSDEQDMRKMGA-CLLVPFTYAMMLMGSLSLIGFPFLTGFYSKDV
        QNDLKRVIAYSTCSQLGYMIFACGISNYSVSVFHLMNHAFFKALLFLSAGSVIHAMSDEQDMRKMG      P TYAMML+GSLSLIGFPFLTGFYSKDV
Subjt:  QNDLKRVIAYSTCSQLGYMIFACGISNYSVSVFHLMNHAFFKALLFLSAGSVIHAMSDEQDMRKMGA-CLLVPFTYAMMLMGSLSLIGFPFLTGFYSKDV

Query:  ILELAYTKYTISGNFAFWLGSVSVLFTSYYSFRSLFLTFLVPTNSF
        ILELAYTKYTISGNFAFWLGSVSVLFTSYYSFR LFLTFLVPTNSF
Subjt:  ILELAYTKYTISGNFAFWLGSVSVLFTSYYSFRSLFLTFLVPTNSF

ATMG00513.1 NADH dehydrogenase 5A | e-value: 3.6e-107 | identity: 84.55%
Query:  PRNSWISCNMRLNAITLICILLLIGAVGKSAQIGSHTWSPDAMEGPTPVSASIHAATMVTAGVFMIARS---YDVIP----------------CGNHGIL
        PRNSWISCNMRLNAI+LICILLLIGAVGKSAQIGSHTWSPDAMEGPTPVSASIHAATMVTAGVFMIAR    ++  P                    GIL
Subjt:  PRNSWISCNMRLNAITLICILLLIGAVGKSAQIGSHTWSPDAMEGPTPVSASIHAATMVTAGVFMIARS---YDVIP----------------CGNHGIL

Query:  QNDLKRVIAYSTCSQLGYMIFACGISNYSVSVFHLMNHAFFKALLFLSAGSVIHAMSDEQDMRKMGA-CLLVPFTYAMMLMGSLSLIGFPFLTGFYSKDV
        QNDLKRVIAYSTCSQLGYMIFACGISNYSVSVFHLMNHAFFKALLFLSAGSVIHAMSDEQDMRKMG      P TYAMML+GSLSLIGFPFLTGFYSKDV
Subjt:  QNDLKRVIAYSTCSQLGYMIFACGISNYSVSVFHLMNHAFFKALLFLSAGSVIHAMSDEQDMRKMGA-CLLVPFTYAMMLMGSLSLIGFPFLTGFYSKDV

Query:  ILELAYTKYTISGNFAFWLGSVSVLFTSYYSFRSLFLTFLVPTNSF
        ILELAYTKYTISGNFAFWLGSVSVLFTSYYSFR LFLTFLVPTNSF
Subjt:  ILELAYTKYTISGNFAFWLGSVSVLFTSYYSFRSLFLTFLVPTNSF

ATMG00665.1 NADH dehydrogenase 5B | e-value: 3.6e-107 | identity: 84.55%
Query:  PRNSWISCNMRLNAITLICILLLIGAVGKSAQIGSHTWSPDAMEGPTPVSASIHAATMVTAGVFMIARS---YDVIP----------------CGNHGIL
        PRNSWISCNMRLNAI+LICILLLIGAVGKSAQIGSHTWSPDAMEGPTPVSASIHAATMVTAGVFMIAR    ++  P                    GIL
Subjt:  PRNSWISCNMRLNAITLICILLLIGAVGKSAQIGSHTWSPDAMEGPTPVSASIHAATMVTAGVFMIARS---YDVIP----------------CGNHGIL

Query:  QNDLKRVIAYSTCSQLGYMIFACGISNYSVSVFHLMNHAFFKALLFLSAGSVIHAMSDEQDMRKMGA-CLLVPFTYAMMLMGSLSLIGFPFLTGFYSKDV
        QNDLKRVIAYSTCSQLGYMIFACGISNYSVSVFHLMNHAFFKALLFLSAGSVIHAMSDEQDMRKMG      P TYAMML+GSLSLIGFPFLTGFYSKDV
Subjt:  QNDLKRVIAYSTCSQLGYMIFACGISNYSVSVFHLMNHAFFKALLFLSAGSVIHAMSDEQDMRKMGA-CLLVPFTYAMMLMGSLSLIGFPFLTGFYSKDV

Query:  ILELAYTKYTISGNFAFWLGSVSVLFTSYYSFRSLFLTFLVPTNSF
        ILELAYTKYTISGNFAFWLGSVSVLFTSYYSFR LFLTFLVPTNSF
Subjt:  ILELAYTKYTISGNFAFWLGSVSVLFTSYYSFRSLFLTFLVPTNSF


Sequences
CDS sequence
ATGTTAGAAGGTGCAAAATCAATGGGTGCCGGAGCAGCTACAATTGCTTCAGCGGGAGCTGCAGTCGGTATTGGAAACGTGTTCAGTTCTTTGATCCATTCCGTGGCGCG
AAATCCATCATTGGCTAAACAATCATTTGGTTATGCCATTTTGGGCTTTGCTCTAACCGAAGCTATTGCATTGTTTGCTCCAATGATGGCCTTTTTGATCTCATCCGTAT
TCCGATCGAAGAAAGAAGGACCGGACAATAGCCTTCGTCGAGTAATGGGAAGCGGGCTGGTCCCCGAAAATGCTCGTTCCTCTGACGTTGGGGCTTACAATTCACCTTGT
GACTCATTCCTTCGGTCGAGCGTTCTTCGGACGTCGATCAATCTATCACAGGCCGCTCTGTCATTGTCTGATTTTTGTTGTCTGATCACACTCGAAATTATGTATCTACT
TATCGTATTTTTGCCCCTGCTCGGTAGTTCCGTCAGGAAGGCTAAGACGGCGCCTCGCATATGGGTAGCAAGAGGGCGCTTATGCCCCGACGGTGGGGCCTTATGGGGAA
GGGCCCAGCCCAATAGGGACAGCACACCCCCCACTTCAAGCGCACCTCTGTATCGACTGAATCACTCTAAGAGTCTAGTCGGTGGAACCGGTGAACCACGCGAGCAGGCG
GTGGACTCTTTCATTAGGGAAGGGAAGAAGGGGCCTAAGCACGGCAGATGCCGTACACTTGAGTGGCAAAGGAAAGCGAGATCGTACCTCTTTTTCCAGGCCTGTTCGGA
CATACGGTTCCCGCGGAAGATCAAGTTGTGCCCCAGAAATTCTTGGATTTCTTGCAATATGAGATTGAATGCCATAACTCTGATTTGTATTTTACTTCTTATTGGTGCTG
TTGGGAAATCTGCACAGATAGGATCGCATACTTGGTCACCCGATGCTATGGAGGGTCCCACTCCAGTATCCGCTTCGATTCATGCAGCAACTATGGTAACAGCTGGCGTT
TTCATGATAGCAAGGAGCTACGACGTCATTCCTTGCGGCAACCACGGAATATTACAGAACGATCTAAAGAGGGTCATAGCTTATTCAACTTGCAGTCAATTAGGCTATAT
GATCTTTGCTTGCGGCATCTCTAACTATTCGGTTAGCGTCTTTCACTTAATGAATCACGCGTTTTTCAAAGCATTACTATTCCTGAGTGCAGGTTCGGTGATTCATGCCA
TGTCGGATGAGCAAGATATGCGGAAGATGGGGGCTTGCCTCCTCGTTCCCTTTACCTATGCCATGATGCTCATGGGCAGCTTATCTCTAATTGGATTTCCTTTTCTAACT
GGATTTTATTCCAAAGATGTGATCTTAGAGCTCGCTTACACTAAGTATACCATCAGTGGGAACTTTGCTTTCTGGTTGGGAAGTGTCTCTGTCCTTTTCACTTCTTATTA
CTCTTTTCGTTCACTTTTTCTAACATTTCTAGTACCAACTAATTCATTCCGGGCGAGACATCTTACGATGTCATGA
mRNA sequence
ATGTTAGAAGGTGCAAAATCAATGGGTGCCGGAGCAGCTACAATTGCTTCAGCGGGAGCTGCAGTCGGTATTGGAAACGTGTTCAGTTCTTTGATCCATTCCGTGGCGCG
AAATCCATCATTGGCTAAACAATCATTTGGTTATGCCATTTTGGGCTTTGCTCTAACCGAAGCTATTGCATTGTTTGCTCCAATGATGGCCTTTTTGATCTCATCCGTAT
TCCGATCGAAGAAAGAAGGACCGGACAATAGCCTTCGTCGAGTAATGGGAAGCGGGCTGGTCCCCGAAAATGCTCGTTCCTCTGACGTTGGGGCTTACAATTCACCTTGT
GACTCATTCCTTCGGTCGAGCGTTCTTCGGACGTCGATCAATCTATCACAGGCCGCTCTGTCATTGTCTGATTTTTGTTGTCTGATCACACTCGAAATTATGTATCTACT
TATCGTATTTTTGCCCCTGCTCGGTAGTTCCGTCAGGAAGGCTAAGACGGCGCCTCGCATATGGGTAGCAAGAGGGCGCTTATGCCCCGACGGTGGGGCCTTATGGGGAA
GGGCCCAGCCCAATAGGGACAGCACACCCCCCACTTCAAGCGCACCTCTGTATCGACTGAATCACTCTAAGAGTCTAGTCGGTGGAACCGGTGAACCACGCGAGCAGGCG
GTGGACTCTTTCATTAGGGAAGGGAAGAAGGGGCCTAAGCACGGCAGATGCCGTACACTTGAGTGGCAAAGGAAAGCGAGATCGTACCTCTTTTTCCAGGCCTGTTCGGA
CATACGGTTCCCGCGGAAGATCAAGTTGTGCCCCAGAAATTCTTGGATTTCTTGCAATATGAGATTGAATGCCATAACTCTGATTTGTATTTTACTTCTTATTGGTGCTG
TTGGGAAATCTGCACAGATAGGATCGCATACTTGGTCACCCGATGCTATGGAGGGTCCCACTCCAGTATCCGCTTCGATTCATGCAGCAACTATGGTAACAGCTGGCGTT
TTCATGATAGCAAGGAGCTACGACGTCATTCCTTGCGGCAACCACGGAATATTACAGAACGATCTAAAGAGGGTCATAGCTTATTCAACTTGCAGTCAATTAGGCTATAT
GATCTTTGCTTGCGGCATCTCTAACTATTCGGTTAGCGTCTTTCACTTAATGAATCACGCGTTTTTCAAAGCATTACTATTCCTGAGTGCAGGTTCGGTGATTCATGCCA
TGTCGGATGAGCAAGATATGCGGAAGATGGGGGCTTGCCTCCTCGTTCCCTTTACCTATGCCATGATGCTCATGGGCAGCTTATCTCTAATTGGATTTCCTTTTCTAACT
GGATTTTATTCCAAAGATGTGATCTTAGAGCTCGCTTACACTAAGTATACCATCAGTGGGAACTTTGCTTTCTGGTTGGGAAGTGTCTCTGTCCTTTTCACTTCTTATTA
CTCTTTTCGTTCACTTTTTCTAACATTTCTAGTACCAACTAATTCATTCCGGGCGAGACATCTTACGATGTCATGA
Protein sequence
MLEGAKSMGAGAATIASAGAAVGIGNVFSSLIHSVARNPSLAKQSFGYAILGFALTEAIALFAPMMAFLISSVFRSKKEGPDNSLRRVMGSGLVPENARSSDVGAYNSPC
DSFLRSSVLRTSINLSQAALSLSDFCCLITLEIMYLLIVFLPLLGSSVRKAKTAPRIWVARGRLCPDGGALWGRAQPNRDSTPPTSSAPLYRLNHSKSLVGGTGEPREQA
VDSFIREGKKGPKHGRCRTLEWQRKARSYLFFQACSDIRFPRKIKLCPRNSWISCNMRLNAITLICILLLIGAVGKSAQIGSHTWSPDAMEGPTPVSASIHAATMVTAGV
FMIARSYDVIPCGNHGILQNDLKRVIAYSTCSQLGYMIFACGISNYSVSVFHLMNHAFFKALLFLSAGSVIHAMSDEQDMRKMGACLLVPFTYAMMLMGSLSLIGFPFLT
GFYSKDVILELAYTKYTISGNFAFWLGSVSVLFTSYYSFRSLFLTFLVPTNSFRARHLTMS
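The protein sequence listed here can be checked against the CDS by translating it codon by codon. The following is a minimal sketch that assumes the standard genetic code (NCBI translation table 1); the page does not say which table the annotation used. The `translate` helper is hypothetical, and the example inputs are the first 24 and last 12 nucleotides of the CDS above.

```python
# Standard genetic code (NCBI translation table 1), with codons ordered by
# first, second, then third base, each in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for pos in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


print(translate("ATGTTAGAAGGTGCAAAATCAATG"))  # first 8 codons of the CDS
# prints: MLEGAKSM
print(translate("ACGATGTCATGA"))  # last 4 codons of the CDS
# prints: TMS
```

Both results agree with the start (`MLEGAKSM`) and end (`TMS`) of the protein sequence above.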