; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc01g04400 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc01g04400
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionReverse transcriptase
Genome locationchr1:2879338..2893774
RNA-Seq ExpressionMoc01g04400
SyntenyMoc01g04400
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001969 - Aspartic peptidase, active site
IPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022155341.1 uncharacterized protein LOC111022474 [Momordica charantia]2.9e-18173.71Show/hide
Query:  MAFRRNTRAHNYEDPNPRGEEATDPNVPPAVPGGVAPPVPQAAPEGVPQVNPQVALLAEALQVLLDNANEAGGAQAQQPRRAQIQQEEVQFIRDFKRFGP
        MAFRRNTRAHNYEDPNPRGE A D NVPP VP GVAPPVPQ AP+GVPQVNPQVALLAEALQVLLDNAN AGGAQ QQPRRAQIQQEEVQFIRDFKRFGP
Subjt:  MAFRRNTRAHNYEDPNPRGEEATDPNVPPAVPGGVAPPVPQAAPEGVPQVNPQVALLAEALQVLLDNANEAGGAQAQQPRRAQIQQEEVQFIRDFKRFGP

Query:  PVFN-----------------------------RVRGAVFMLQGEAVNWWESVAAAEDHGNAPVTWARFKDLLHEYYFPVTVRNEKRAEFLRLTQGSLTV
        PVFN                             +VRGA+FML+GEAVNWWESVAAAEDH N PVTWARFKDLL+EYYFPVTVRNEKRAEFLRLTQGSLTV
Subjt:  PVFN-----------------------------RVRGAVFMLQGEAVNWWESVAAAEDHGNAPVTWARFKDLLHEYYFPVTVRNEKRAEFLRLTQGSLTV

Query:  AQYERKFTELSRFGMQYIPTEQLKIDKFIDSLRREIKGLLVLKEPTTYAAAVRCALVMDKCLEEPQSQQVMGSSSGVKRKFASFSSSQPSRGHLQLVQRQ
         QYERKFTELSRFGMQYIPTEQLKIDKFID LRREIKGLLVLKEPTTYAAAVRCALVMDKCLEEPQSQ                                
Subjt:  AQYERKFTELSRFGMQYIPTEQLKIDKFIDSLRREIKGLLVLKEPTTYAAAVRCALVMDKCLEEPQSQQVMGSSSGVKRKFASFSSSQPSRGHLQLVQRQ

Query:  TVSPVSPLCPMTGSNTQALGQRIPATAATQGGTHRARVFALTRGDVEHGEAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHADLELESLGFLLSIST
                            QRIPATAATQGGTHRAR+FALTRGDVEH EAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHADLELESLGFL S+ST
Subjt:  TVSPVSPLCPMTGSNTQALGQRIPATAATQGGTHRARVFALTRGDVEHGEAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHADLELESLGFLLSIST

Query:  PLGSVLVNSQVVKGGQLSFD--------------------GMDWLAANRANIDCSKKEVSFRLPSGQNFTFKGVKAGVPRVVS
          GSVL  SQVVKGGQLSFD                    GMDWLAANRANIDCSKKEVSFRLPSGQNF FKGVKAGVPRVVS
Subjt:  PLGSVLVNSQVVKGGQLSFD--------------------GMDWLAANRANIDCSKKEVSFRLPSGQNFTFKGVKAGVPRVVS

XP_022156328.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111023249 [Momordica charantia]1.3e-19773.6Show/hide
Query:  MAFRRNTRAHNYEDPNPRGEEATDPNVPPAVPGGVAPPVPQAAPEGVPQVNPQVALLAEALQVLLDNANEAGGAQAQQPRRAQIQQEEVQFIRDFKRFGP
        MAFRRNTRAHNYEDPN RGE A DPNV P VPGGV PPVPQAAP+GVPQVNPQVALLAEALQVLL NAN AGGAQ QQPRRAQI Q+EVQFIRDFK FGP
Subjt:  MAFRRNTRAHNYEDPNPRGEEATDPNVPPAVPGGVAPPVPQAAPEGVPQVNPQVALLAEALQVLLDNANEAGGAQAQQPRRAQIQQEEVQFIRDFKRFGP

Query:  PVFN-----------------------------RVRGAVFMLQGEAVNWWESVAAAEDHGNAPVTWARFKDLLHEYYFPVTVRNEKRAEFLRLTQGSLTV
        PVFN                             +VRGAVFML+GEAVNWWESVAAAEDH N PVTWARFKDLL+EYYFPV  RNEKR EFLRLTQGSLTV
Subjt:  PVFN-----------------------------RVRGAVFMLQGEAVNWWESVAAAEDHGNAPVTWARFKDLLHEYYFPVTVRNEKRAEFLRLTQGSLTV

Query:  AQYERKFTELSRFGMQYIPTEQLKIDKFIDSLRREIKGLLVLKEPTTYAAAVRCALVMDKCLEEPQSQQVMGSSSGVKRKFASFSSSQPSRGHLQLVQRQ
        AQYERKFTELSRFG QY+PTEQLKIDKFID LRREIKGLLVLKEPTTYAAAVRCALVMDKCLEEPQSQQV+GS+SGVKRKFASFS+SQ SRGH    QRQ
Subjt:  AQYERKFTELSRFGMQYIPTEQLKIDKFIDSLRREIKGLLVLKEPTTYAAAVRCALVMDKCLEEPQSQQVMGSSSGVKRKFASFSSSQPSRGHLQLVQRQ

Query:  TVSPVSPLCP----------------------------MTGSNTQALGQRIPATAATQGGTHRARVFALTRGDVEHGEAVVTGTVLVLSMPAYALFDSGS
        T  PV P C                             MTGSNTQAL Q+ P   ATQGGT  ARVFALTRGDVEH EAVVTGT+L+LS+PAYALFDSGS
Subjt:  TVSPVSPLCP----------------------------MTGSNTQALGQRIPATAATQGGTHRARVFALTRGDVEHGEAVVTGTVLVLSMPAYALFDSGS

Query:  SHSFIASTFVRHADLELESLGFLLSISTPLGSVLVNSQVVKGGQLSFD--------------------GMDWLAANRANIDCSKKEVSFRLPSGQNFTFK
        SHSFIASTFVRHADLELES GF LS+STP GSVLV SQVVKGGQLSF                     GMDWLAANRANI+CSKKEVSF L SGQNFTFK
Subjt:  SHSFIASTFVRHADLELESLGFLLSISTPLGSVLVNSQVVKGGQLSFD--------------------GMDWLAANRANIDCSKKEVSFRLPSGQNFTFK

Query:  GVKAGVPRVVSALKANHLL
        GVKAGVPRVVSALKA++LL
Subjt:  GVKAGVPRVVSALKANHLL

XP_022156992.1 uncharacterized protein LOC111023821 [Momordica charantia]2.9e-16569.62Show/hide
Query:  TMAFRRNTRAHNYEDPNPRGEEATDPNVPPAVPGGVAPPVPQAAPEGVPQVNPQVALLAEALQVLLDNANEAGGAQAQQPRRAQIQQEEVQFIRDFKRFG
        TMAFRRNTRAHNYEDPNPRGE A DPNVP AVPGGVAP VPQAAP+GVPQ                   N AGGAQ QQPRRAQ  QEEVQFIRDFKRFG
Subjt:  TMAFRRNTRAHNYEDPNPRGEEATDPNVPPAVPGGVAPPVPQAAPEGVPQVNPQVALLAEALQVLLDNANEAGGAQAQQPRRAQIQQEEVQFIRDFKRFG

Query:  PPVFNRVRGAVFMLQGEAVNWWESVAAAEDHGNAPVTWARFKDLLHEYY-----FPVT-VRNEKRAEFLRLTQGSLTVAQYERKFTELSRFGMQYIPTEQ
        PPVFN V               E   AAE+       W R  + L+ Y      F V    NEKRAEFLRLTQGSLTVAQYERKFTELSRF MQYIP EQ
Subjt:  PPVFNRVRGAVFMLQGEAVNWWESVAAAEDHGNAPVTWARFKDLLHEYY-----FPVT-VRNEKRAEFLRLTQGSLTVAQYERKFTELSRFGMQYIPTEQ

Query:  LKIDKFIDSLRREIKGLLVLKEPTTYAAAVRCALVMDKCLEEPQSQQVMGSSSGVKRKFASFSSSQPSRGHLQLVQRQTVSPVSPL--------------
        LKIDKFID L REIKGLLVLKEPTTYAAAVRCALVMDKCLEEPQSQQVMGSSSGVKRKFASFSSSQPSRGH   VQRQT  PV P               
Subjt:  LKIDKFIDSLRREIKGLLVLKEPTTYAAAVRCALVMDKCLEEPQSQQVMGSSSGVKRKFASFSSSQPSRGHLQLVQRQTVSPVSPL--------------

Query:  --------------CPMTGSNTQALGQRIPATAATQGGTHRARVFALTRGDVEHGEAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHADLELESLGF
                      CPMTG NTQ LGQRIP T A QGGTHRARVFALTRGDV H EAVV GTVLVLSMPAYALFDS SSHSFIASTFVRHADLELESLGF
Subjt:  --------------CPMTGSNTQALGQRIPATAATQGGTHRARVFALTRGDVEHGEAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHADLELESLGF

Query:  LLSISTPLGSVLVNSQVVKGGQLSFD--------------------GMDWLAANRANIDCSKKEVSFRLPSGQNFTFKGVKAGVPRVVSALKANHLL
        LLS+STP GSVLV SQ+VKGGQLSFD                    GMDWLAAN+ANIDCSKKE SFRLPS QNFTFKGVKA VPRVVSALKA+H L
Subjt:  LLSISTPLGSVLVNSQVVKGGQLSFD--------------------GMDWLAANRANIDCSKKEVSFRLPSGQNFTFKGVKAGVPRVVSALKANHLL

XP_022157413.1 uncharacterized protein LOC111024114 [Momordica charantia]1.6e-20378.31Show/hide
Query:  MAFRRNTRAHNYEDPNPRGEEATDPNVPPAVPGGVAPPVPQAAPEGVPQVNPQVALLAEALQVLLDNANEAGGAQAQQPRRAQIQQEEVQFIRDFKRFGP
        MAFRRNTRAHNY+DPNPRGE A DPNVP  VPG VAPPVPQAAP+GVPQVNPQVALLAEALQVLLDNAN AGGAQ QQPRRAQI Q+EVQFIRDFKRFGP
Subjt:  MAFRRNTRAHNYEDPNPRGEEATDPNVPPAVPGGVAPPVPQAAPEGVPQVNPQVALLAEALQVLLDNANEAGGAQAQQPRRAQIQQEEVQFIRDFKRFGP

Query:  PVFN-----------------------------RVRGAVFMLQGEAVNWWESVAAAEDHGNAPVTWARFKDLLHEYYFPVTVRNEKRAEFLRLTQGSLTV
        PVFN                             +VRGAVFML+GEAVNWWESVAAAEDH N PVTWARFKDLL+EYYFPVTVRNEKRAEFLRLTQGSLTV
Subjt:  PVFN-----------------------------RVRGAVFMLQGEAVNWWESVAAAEDHGNAPVTWARFKDLLHEYYFPVTVRNEKRAEFLRLTQGSLTV

Query:  AQYERKFTELSRFGMQYIPTEQLKIDKFIDSLRREIKGLLVLKEPTTYAAAVRCALVMDKCLEEPQSQQVMGSSSGVKRKFASFSSSQPSRGHLQLVQRQ
        AQYERKFTELSRFGMQYIPTEQLKIDKFID LR EIKGLLV+KEPTTYAAA+RCALVMDKCLEEPQSQQVMGSSSGVKRKFA FSSSQ SRGH   VQRQ
Subjt:  AQYERKFTELSRFGMQYIPTEQLKIDKFIDSLRREIKGLLVLKEPTTYAAAVRCALVMDKCLEEPQSQQVMGSSSGVKRKFASFSSSQPSRGHLQLVQRQ

Query:  TVSPVSPLCPMTGSNTQALGQRI-------PATAATQGGTHRARVFALTRGDVEHGEAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHADLELESLG
        T  PV P C    +    LG+RI       PA AA QGGT RARVFALTRGDVEH EAVVTGT+LV+SMPAYALFDSGSSHSFIASTFVRHADLELESLG
Subjt:  TVSPVSPLCPMTGSNTQALGQRI-------PATAATQGGTHRARVFALTRGDVEHGEAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHADLELESLG

Query:  FLLSISTPLGSVLVNSQVVKGGQLSFD--------------------GMDWLAANRANIDCSKKEVSFRLPSGQNFTFKGVKAGVPRVVSALKANHLL
        FLLS+STP GSVLV SQVVKGGQLSFD                    GMDWLAANRANI+CSKKEVSFRLPSGQNFTFK VK GVPRVVSALKAN+LL
Subjt:  FLLSISTPLGSVLVNSQVVKGGQLSFD--------------------GMDWLAANRANIDCSKKEVSFRLPSGQNFTFKGVKAGVPRVVSALKANHLL

XP_022158750.1 uncharacterized protein LOC111025215 [Momordica charantia]8.6e-20277.89Show/hide
Query:  MAFRRNTRAHNYEDPNPRGEEATDPNVPPAVPGGVAPPVPQAAPEGVPQVNPQVALLAEALQVLLDNANEAGGAQAQQPRRAQIQQEEV--------QFI
        MAFRRNTRAHNYEDPNPRGE A DPNVPPAVPGGVAPP PQAA +GVPQVNPQVALLAEALQVLLDNAN AGGAQ QQPR AQI QEEV        +++
Subjt:  MAFRRNTRAHNYEDPNPRGEEATDPNVPPAVPGGVAPPVPQAAPEGVPQVNPQVALLAEALQVLLDNANEAGGAQAQQPRRAQIQQEEV--------QFI

Query:  RDFKR----FGPPVFNRVRGAVFMLQGEAVNWWESVAAAEDHGNAPVTWARFKDLLHEYYFPVTVRNEKRAEFLRLTQGSLTVAQYERKFTELSRFGMQY
        R+ +      G     +VRGAVFML+GEAVNWWESVAAAEDH N PVTWARFKDLL+EYYFPVTVRNEKR EFLRLTQGSLTVA+YERKFTELSRFGMQY
Subjt:  RDFKR----FGPPVFNRVRGAVFMLQGEAVNWWESVAAAEDHGNAPVTWARFKDLLHEYYFPVTVRNEKRAEFLRLTQGSLTVAQYERKFTELSRFGMQY

Query:  IPTEQLKIDKFIDSLRREIKGLLVLKEPTTYAAAVRCALVMDKCLEEPQSQQVMGSSSGVKRKFASFSSSQPSRGHLQLVQRQTVSPVSPL---------
        IPT+QLKIDKFID LRREIKGLLVLKEPTTYAAAVRCALVMDKCLEEPQSQQV+GSSSGVKRKFASFSSSQPSR H   VQRQT  PV P          
Subjt:  IPTEQLKIDKFIDSLRREIKGLLVLKEPTTYAAAVRCALVMDKCLEEPQSQQVMGSSSGVKRKFASFSSSQPSRGHLQLVQRQTVSPVSPL---------

Query:  -------------------CPMTGSNTQALGQRIPATAATQGGTHRARVFALTRGDVEHGEAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHADLEL
                           CPMTGSNTQALGQRIPATAA QGGTHRARVFALTRGDVE+ EAVVT TVLVLSMPAYALFDSGSSHSFIASTFV HADLEL
Subjt:  -------------------CPMTGSNTQALGQRIPATAATQGGTHRARVFALTRGDVEHGEAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHADLEL

Query:  ESLGFLLSISTPLGSVLVNSQVVKGGQLSFD--------------------GMDWLAANRANIDCSKKEVSFRLPSGQNFTFKGVKAGVPRVVSALKANH
        ESLGFLLS+STP GSVLV SQVVKGGQLSFD                    GMDWLAANRANIDCSKK+VSFRLPSGQNFTFKGVKAGVPRVV ALKA+H
Subjt:  ESLGFLLSISTPLGSVLVNSQVVKGGQLSFD--------------------GMDWLAANRANIDCSKKEVSFRLPSGQNFTFKGVKAGVPRVVSALKANH

Query:  LL
        LL
Subjt:  LL

TrEMBL top hitse value%identityAlignment
A0A6J1DQB9 Reverse transcriptase6.2e-19873.6Show/hide
Query:  MAFRRNTRAHNYEDPNPRGEEATDPNVPPAVPGGVAPPVPQAAPEGVPQVNPQVALLAEALQVLLDNANEAGGAQAQQPRRAQIQQEEVQFIRDFKRFGP
        MAFRRNTRAHNYEDPN RGE A DPNV P VPGGV PPVPQAAP+GVPQVNPQVALLAEALQVLL NAN AGGAQ QQPRRAQI Q+EVQFIRDFK FGP
Subjt:  MAFRRNTRAHNYEDPNPRGEEATDPNVPPAVPGGVAPPVPQAAPEGVPQVNPQVALLAEALQVLLDNANEAGGAQAQQPRRAQIQQEEVQFIRDFKRFGP

Query:  PVFN-----------------------------RVRGAVFMLQGEAVNWWESVAAAEDHGNAPVTWARFKDLLHEYYFPVTVRNEKRAEFLRLTQGSLTV
        PVFN                             +VRGAVFML+GEAVNWWESVAAAEDH N PVTWARFKDLL+EYYFPV  RNEKR EFLRLTQGSLTV
Subjt:  PVFN-----------------------------RVRGAVFMLQGEAVNWWESVAAAEDHGNAPVTWARFKDLLHEYYFPVTVRNEKRAEFLRLTQGSLTV

Query:  AQYERKFTELSRFGMQYIPTEQLKIDKFIDSLRREIKGLLVLKEPTTYAAAVRCALVMDKCLEEPQSQQVMGSSSGVKRKFASFSSSQPSRGHLQLVQRQ
        AQYERKFTELSRFG QY+PTEQLKIDKFID LRREIKGLLVLKEPTTYAAAVRCALVMDKCLEEPQSQQV+GS+SGVKRKFASFS+SQ SRGH    QRQ
Subjt:  AQYERKFTELSRFGMQYIPTEQLKIDKFIDSLRREIKGLLVLKEPTTYAAAVRCALVMDKCLEEPQSQQVMGSSSGVKRKFASFSSSQPSRGHLQLVQRQ

Query:  TVSPVSPLCP----------------------------MTGSNTQALGQRIPATAATQGGTHRARVFALTRGDVEHGEAVVTGTVLVLSMPAYALFDSGS
        T  PV P C                             MTGSNTQAL Q+ P   ATQGGT  ARVFALTRGDVEH EAVVTGT+L+LS+PAYALFDSGS
Subjt:  TVSPVSPLCP----------------------------MTGSNTQALGQRIPATAATQGGTHRARVFALTRGDVEHGEAVVTGTVLVLSMPAYALFDSGS

Query:  SHSFIASTFVRHADLELESLGFLLSISTPLGSVLVNSQVVKGGQLSFD--------------------GMDWLAANRANIDCSKKEVSFRLPSGQNFTFK
        SHSFIASTFVRHADLELES GF LS+STP GSVLV SQVVKGGQLSF                     GMDWLAANRANI+CSKKEVSF L SGQNFTFK
Subjt:  SHSFIASTFVRHADLELESLGFLLSISTPLGSVLVNSQVVKGGQLSFD--------------------GMDWLAANRANIDCSKKEVSFRLPSGQNFTFK

Query:  GVKAGVPRVVSALKANHLL
        GVKAGVPRVVSALKA++LL
Subjt:  GVKAGVPRVVSALKANHLL

A0A6J1DRF5 uncharacterized protein LOC1110224741.4e-18173.71Show/hide
Query:  MAFRRNTRAHNYEDPNPRGEEATDPNVPPAVPGGVAPPVPQAAPEGVPQVNPQVALLAEALQVLLDNANEAGGAQAQQPRRAQIQQEEVQFIRDFKRFGP
        MAFRRNTRAHNYEDPNPRGE A D NVPP VP GVAPPVPQ AP+GVPQVNPQVALLAEALQVLLDNAN AGGAQ QQPRRAQIQQEEVQFIRDFKRFGP
Subjt:  MAFRRNTRAHNYEDPNPRGEEATDPNVPPAVPGGVAPPVPQAAPEGVPQVNPQVALLAEALQVLLDNANEAGGAQAQQPRRAQIQQEEVQFIRDFKRFGP

Query:  PVFN-----------------------------RVRGAVFMLQGEAVNWWESVAAAEDHGNAPVTWARFKDLLHEYYFPVTVRNEKRAEFLRLTQGSLTV
        PVFN                             +VRGA+FML+GEAVNWWESVAAAEDH N PVTWARFKDLL+EYYFPVTVRNEKRAEFLRLTQGSLTV
Subjt:  PVFN-----------------------------RVRGAVFMLQGEAVNWWESVAAAEDHGNAPVTWARFKDLLHEYYFPVTVRNEKRAEFLRLTQGSLTV

Query:  AQYERKFTELSRFGMQYIPTEQLKIDKFIDSLRREIKGLLVLKEPTTYAAAVRCALVMDKCLEEPQSQQVMGSSSGVKRKFASFSSSQPSRGHLQLVQRQ
         QYERKFTELSRFGMQYIPTEQLKIDKFID LRREIKGLLVLKEPTTYAAAVRCALVMDKCLEEPQSQ                                
Subjt:  AQYERKFTELSRFGMQYIPTEQLKIDKFIDSLRREIKGLLVLKEPTTYAAAVRCALVMDKCLEEPQSQQVMGSSSGVKRKFASFSSSQPSRGHLQLVQRQ

Query:  TVSPVSPLCPMTGSNTQALGQRIPATAATQGGTHRARVFALTRGDVEHGEAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHADLELESLGFLLSIST
                            QRIPATAATQGGTHRAR+FALTRGDVEH EAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHADLELESLGFL S+ST
Subjt:  TVSPVSPLCPMTGSNTQALGQRIPATAATQGGTHRARVFALTRGDVEHGEAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHADLELESLGFLLSIST

Query:  PLGSVLVNSQVVKGGQLSFD--------------------GMDWLAANRANIDCSKKEVSFRLPSGQNFTFKGVKAGVPRVVS
          GSVL  SQVVKGGQLSFD                    GMDWLAANRANIDCSKKEVSFRLPSGQNF FKGVKAGVPRVVS
Subjt:  PLGSVLVNSQVVKGGQLSFD--------------------GMDWLAANRANIDCSKKEVSFRLPSGQNFTFKGVKAGVPRVVS

A0A6J1DTA8 uncharacterized protein LOC1110241147.6e-20478.31Show/hide
Query:  MAFRRNTRAHNYEDPNPRGEEATDPNVPPAVPGGVAPPVPQAAPEGVPQVNPQVALLAEALQVLLDNANEAGGAQAQQPRRAQIQQEEVQFIRDFKRFGP
        MAFRRNTRAHNY+DPNPRGE A DPNVP  VPG VAPPVPQAAP+GVPQVNPQVALLAEALQVLLDNAN AGGAQ QQPRRAQI Q+EVQFIRDFKRFGP
Subjt:  MAFRRNTRAHNYEDPNPRGEEATDPNVPPAVPGGVAPPVPQAAPEGVPQVNPQVALLAEALQVLLDNANEAGGAQAQQPRRAQIQQEEVQFIRDFKRFGP

Query:  PVFN-----------------------------RVRGAVFMLQGEAVNWWESVAAAEDHGNAPVTWARFKDLLHEYYFPVTVRNEKRAEFLRLTQGSLTV
        PVFN                             +VRGAVFML+GEAVNWWESVAAAEDH N PVTWARFKDLL+EYYFPVTVRNEKRAEFLRLTQGSLTV
Subjt:  PVFN-----------------------------RVRGAVFMLQGEAVNWWESVAAAEDHGNAPVTWARFKDLLHEYYFPVTVRNEKRAEFLRLTQGSLTV

Query:  AQYERKFTELSRFGMQYIPTEQLKIDKFIDSLRREIKGLLVLKEPTTYAAAVRCALVMDKCLEEPQSQQVMGSSSGVKRKFASFSSSQPSRGHLQLVQRQ
        AQYERKFTELSRFGMQYIPTEQLKIDKFID LR EIKGLLV+KEPTTYAAA+RCALVMDKCLEEPQSQQVMGSSSGVKRKFA FSSSQ SRGH   VQRQ
Subjt:  AQYERKFTELSRFGMQYIPTEQLKIDKFIDSLRREIKGLLVLKEPTTYAAAVRCALVMDKCLEEPQSQQVMGSSSGVKRKFASFSSSQPSRGHLQLVQRQ

Query:  TVSPVSPLCPMTGSNTQALGQRI-------PATAATQGGTHRARVFALTRGDVEHGEAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHADLELESLG
        T  PV P C    +    LG+RI       PA AA QGGT RARVFALTRGDVEH EAVVTGT+LV+SMPAYALFDSGSSHSFIASTFVRHADLELESLG
Subjt:  TVSPVSPLCPMTGSNTQALGQRI-------PATAATQGGTHRARVFALTRGDVEHGEAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHADLELESLG

Query:  FLLSISTPLGSVLVNSQVVKGGQLSFD--------------------GMDWLAANRANIDCSKKEVSFRLPSGQNFTFKGVKAGVPRVVSALKANHLL
        FLLS+STP GSVLV SQVVKGGQLSFD                    GMDWLAANRANI+CSKKEVSFRLPSGQNFTFK VK GVPRVVSALKAN+LL
Subjt:  FLLSISTPLGSVLVNSQVVKGGQLSFD--------------------GMDWLAANRANIDCSKKEVSFRLPSGQNFTFKGVKAGVPRVVSALKANHLL

A0A6J1DTE5 uncharacterized protein LOC1110238211.4e-16569.62Show/hide
Query:  TMAFRRNTRAHNYEDPNPRGEEATDPNVPPAVPGGVAPPVPQAAPEGVPQVNPQVALLAEALQVLLDNANEAGGAQAQQPRRAQIQQEEVQFIRDFKRFG
        TMAFRRNTRAHNYEDPNPRGE A DPNVP AVPGGVAP VPQAAP+GVPQ                   N AGGAQ QQPRRAQ  QEEVQFIRDFKRFG
Subjt:  TMAFRRNTRAHNYEDPNPRGEEATDPNVPPAVPGGVAPPVPQAAPEGVPQVNPQVALLAEALQVLLDNANEAGGAQAQQPRRAQIQQEEVQFIRDFKRFG

Query:  PPVFNRVRGAVFMLQGEAVNWWESVAAAEDHGNAPVTWARFKDLLHEYY-----FPVT-VRNEKRAEFLRLTQGSLTVAQYERKFTELSRFGMQYIPTEQ
        PPVFN V               E   AAE+       W R  + L+ Y      F V    NEKRAEFLRLTQGSLTVAQYERKFTELSRF MQYIP EQ
Subjt:  PPVFNRVRGAVFMLQGEAVNWWESVAAAEDHGNAPVTWARFKDLLHEYY-----FPVT-VRNEKRAEFLRLTQGSLTVAQYERKFTELSRFGMQYIPTEQ

Query:  LKIDKFIDSLRREIKGLLVLKEPTTYAAAVRCALVMDKCLEEPQSQQVMGSSSGVKRKFASFSSSQPSRGHLQLVQRQTVSPVSPL--------------
        LKIDKFID L REIKGLLVLKEPTTYAAAVRCALVMDKCLEEPQSQQVMGSSSGVKRKFASFSSSQPSRGH   VQRQT  PV P               
Subjt:  LKIDKFIDSLRREIKGLLVLKEPTTYAAAVRCALVMDKCLEEPQSQQVMGSSSGVKRKFASFSSSQPSRGHLQLVQRQTVSPVSPL--------------

Query:  --------------CPMTGSNTQALGQRIPATAATQGGTHRARVFALTRGDVEHGEAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHADLELESLGF
                      CPMTG NTQ LGQRIP T A QGGTHRARVFALTRGDV H EAVV GTVLVLSMPAYALFDS SSHSFIASTFVRHADLELESLGF
Subjt:  --------------CPMTGSNTQALGQRIPATAATQGGTHRARVFALTRGDVEHGEAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHADLELESLGF

Query:  LLSISTPLGSVLVNSQVVKGGQLSFD--------------------GMDWLAANRANIDCSKKEVSFRLPSGQNFTFKGVKAGVPRVVSALKANHLL
        LLS+STP GSVLV SQ+VKGGQLSFD                    GMDWLAAN+ANIDCSKKE SFRLPS QNFTFKGVKA VPRVVSALKA+H L
Subjt:  LLSISTPLGSVLVNSQVVKGGQLSFD--------------------GMDWLAANRANIDCSKKEVSFRLPSGQNFTFKGVKAGVPRVVSALKANHLL

A0A6J1DWP4 uncharacterized protein LOC1110252154.2e-20277.89Show/hide
Query:  MAFRRNTRAHNYEDPNPRGEEATDPNVPPAVPGGVAPPVPQAAPEGVPQVNPQVALLAEALQVLLDNANEAGGAQAQQPRRAQIQQEEV--------QFI
        MAFRRNTRAHNYEDPNPRGE A DPNVPPAVPGGVAPP PQAA +GVPQVNPQVALLAEALQVLLDNAN AGGAQ QQPR AQI QEEV        +++
Subjt:  MAFRRNTRAHNYEDPNPRGEEATDPNVPPAVPGGVAPPVPQAAPEGVPQVNPQVALLAEALQVLLDNANEAGGAQAQQPRRAQIQQEEV--------QFI

Query:  RDFKR----FGPPVFNRVRGAVFMLQGEAVNWWESVAAAEDHGNAPVTWARFKDLLHEYYFPVTVRNEKRAEFLRLTQGSLTVAQYERKFTELSRFGMQY
        R+ +      G     +VRGAVFML+GEAVNWWESVAAAEDH N PVTWARFKDLL+EYYFPVTVRNEKR EFLRLTQGSLTVA+YERKFTELSRFGMQY
Subjt:  RDFKR----FGPPVFNRVRGAVFMLQGEAVNWWESVAAAEDHGNAPVTWARFKDLLHEYYFPVTVRNEKRAEFLRLTQGSLTVAQYERKFTELSRFGMQY

Query:  IPTEQLKIDKFIDSLRREIKGLLVLKEPTTYAAAVRCALVMDKCLEEPQSQQVMGSSSGVKRKFASFSSSQPSRGHLQLVQRQTVSPVSPL---------
        IPT+QLKIDKFID LRREIKGLLVLKEPTTYAAAVRCALVMDKCLEEPQSQQV+GSSSGVKRKFASFSSSQPSR H   VQRQT  PV P          
Subjt:  IPTEQLKIDKFIDSLRREIKGLLVLKEPTTYAAAVRCALVMDKCLEEPQSQQVMGSSSGVKRKFASFSSSQPSRGHLQLVQRQTVSPVSPL---------

Query:  -------------------CPMTGSNTQALGQRIPATAATQGGTHRARVFALTRGDVEHGEAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHADLEL
                           CPMTGSNTQALGQRIPATAA QGGTHRARVFALTRGDVE+ EAVVT TVLVLSMPAYALFDSGSSHSFIASTFV HADLEL
Subjt:  -------------------CPMTGSNTQALGQRIPATAATQGGTHRARVFALTRGDVEHGEAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHADLEL

Query:  ESLGFLLSISTPLGSVLVNSQVVKGGQLSFD--------------------GMDWLAANRANIDCSKKEVSFRLPSGQNFTFKGVKAGVPRVVSALKANH
        ESLGFLLS+STP GSVLV SQVVKGGQLSFD                    GMDWLAANRANIDCSKK+VSFRLPSGQNFTFKGVKAGVPRVV ALKA+H
Subjt:  ESLGFLLSISTPLGSVLVNSQVVKGGQLSFD--------------------GMDWLAANRANIDCSKKEVSFRLPSGQNFTFKGVKAGVPRVVSALKANH

Query:  LL
        LL
Subjt:  LL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCCAACAGCAAAGATTCACGGCGGCAGCAGACCCGCGCGACCCTGACAATGACTCCACGGCGGCGACGATCTCACAACTCTGCGGTAATGGCAAGAACATATGAGGC
AAGTGACACACTTGAGTGTGCCATGGGGACCGTGTCGCAGGGGAATGCCGGGGCGAAGAGGTGCCGAATCCGGATTTCGAATCCTGGGTCTGGGGCGTTACAGTTGAATA
TGCCTCCACGTAGAAGAAGGTCTGTGAGACGGGGTGGGCTAAATAGAGATGTTGTCCCTGATACAGTAGAGCCGACGGAGAAGAACCAAACTTCAGCCCACACATCAGCC
TTGATCACCACCACGACCCATATACTCCGTGATTTGGAACATGCGGTGGTTGCCGTGGGTGAGGTTACTGCACATTTGGGCCAGTGGGCTGATACGTCACTAGTTGGGTA
CCGAGGCTCTGGGTACAAAGGTCGGGGATCGATACGTCAACTCTGGACAAAGATGAGTGTCGAGGCTTCGAATAAATGGTCGGGGGTCGGTGCAATGAATCTTAGAGGGA
AGGCAAGTATTGGGGCCTCGTGGGAAAAAGACAAGGACCAATATCCGGCGGCTCCTTTTTCTCCGGCGAAGCGGCACAGCAGTAGCAGTGAGCCCCACGCCGGCGTCCTC
CCGCAGCAGCTCGTGGCGGCGCACGAACGACGGTGTCCACTGCGGCCGCGTTCCTCAGATCCGCGACGACCCACGCGATTTTGGCTCCATCTGAACGGGTCTGTTGGAGT
TCGATGGTCGGTGACAAGCTCGGGTCGGATCCGGCAGTGCAACAGCATCTCCACGGCGGCGCGACGGTTTCTGCACCACCCTGCAGCCCTACTCGCGGCGGTGTGCGACG
AGCGGCGTTCCTTCACGGCGGCGGACGACGCACAGGTCAACTCCACGGCGGTCCCGACGACCTGCTGCTACAACCTCAACCCCGACGACGTGCGCAACTCCCGACGGCTG
CCTCCGACGAACAGCAGCTCCGGCGACTCGTTCTTTTTGCAATCCGACGCGAACAGTAGCAGATTCGCGAAGACTGTAACTCTTGTTTATGTTGTTTTGTATGCCTGGTT
AGTGTTGAATGTCCGTCGGCGAAGGATGTTTATATTGTGGAAGCACGCTGAATTGTGTTGGTCAGCTGAGAGATGCGTTGAGATCATACTAGTGGTGGTGTTGTTGATGG
GTGTTGTGATGGTTGGTGCTATTAGGGAAGTAGTACTGAAATCTTGGTTGGAGGCTAGAAATGTTATGTGGAATTATGAAAGTTATGAAGTGAATGGGTCGTCAGTGTAC
GGTCCTTGTCAGTCTCCCCATCATCACCAGATACGTTTCCGAGTTCTGGATACAATGGCTTTTCGACGGAACACGAGAGCTCACAACTACGAGGATCCGAACCCTAGGGG
TGAGGAAGCAACGGATCCAAATGTTCCCCCGGCAGTTCCTGGAGGGGTTGCACCCCCGGTCCCGCAGGCAGCACCTGAAGGAGTTCCCCAGGTGAATCCCCAGGTGGCGT
TACTAGCTGAGGCATTGCAAGTATTGCTGGATAATGCGAATGAAGCCGGTGGGGCTCAGGCGCAGCAGCCTCGCCGGGCACAGATTCAACAAGAGGAGGTCCAGTTTATC
AGGGATTTCAAACGCTTCGGACCACCCGTTTTTAACAGAGTCCGGGGAGCAGTGTTTATGCTTCAAGGAGAAGCAGTAAACTGGTGGGAGTCGGTGGCGGCAGCGGAGGA
TCACGGCAACGCACCCGTCACATGGGCGAGGTTTAAGGACCTACTCCATGAGTACTATTTCCCCGTGACTGTCAGGAACGAAAAACGGGCAGAGTTTCTCCGTCTCACTC
AAGGGAGCCTAACTGTGGCCCAATACGAGAGGAAGTTCACTGAGTTGTCCCGTTTTGGAATGCAATATATTCCTACTGAACAATTAAAGATTGACAAGTTCATTGACAGT
TTGCGTAGGGAGATCAAGGGGCTACTTGTTCTTAAGGAGCCAACTACTTATGCAGCAGCAGTCAGGTGTGCGTTGGTTATGGACAAATGTCTCGAGGAGCCTCAATCTCA
ACAGGTGATGGGCTCTAGCTCGGGGGTCAAGAGGAAATTTGCATCGTTCTCCTCCAGTCAACCCTCGAGGGGACACCTGCAGCTTGTGCAAAGGCAGACTGTTTCTCCGG
TGTCCCCTCTTTGTCCGATGACCGGCTCGAATACCCAAGCTTTAGGCCAGAGGATCCCTGCGACGGCAGCAACTCAAGGTGGAACCCATAGGGCGCGCGTCTTCGCTCTT
ACGAGGGGGGATGTTGAGCATGGCGAGGCGGTGGTCACAGGGACTGTTTTAGTACTCAGTATGCCTGCTTACGCTTTATTTGACTCGGGGTCTAGTCACTCTTTCATTGC
TTCTACCTTTGTTCGGCATGCGGACCTAGAGCTAGAATCGTTAGGCTTTTTGTTGTCGATATCCACTCCGTTAGGATCTGTGTTGGTCAATAGTCAAGTGGTGAAAGGAG
GCCAGCTCTCTTTCGATGGCATGGATTGGTTAGCTGCTAACCGGGCTAATATCGATTGCTCGAAGAAGGAAGTTAGCTTTCGCTTGCCCTCCGGACAAAACTTTACCTTT
AAAGGAGTCAAGGCCGGGGTCCCGAGGGTGGTGTCGGCATTGAAGGCCAACCATCTGCTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCCAACAGCAAAGATTCACGGCGGCAGCAGACCCGCGCGACCCTGACAATGACTCCACGGCGGCGACGATCTCACAACTCTGCGGTAATGGCAAGAACATATGAGGC
AAGTGACACACTTGAGTGTGCCATGGGGACCGTGTCGCAGGGGAATGCCGGGGCGAAGAGGTGCCGAATCCGGATTTCGAATCCTGGGTCTGGGGCGTTACAGTTGAATA
TGCCTCCACGTAGAAGAAGGTCTGTGAGACGGGGTGGGCTAAATAGAGATGTTGTCCCTGATACAGTAGAGCCGACGGAGAAGAACCAAACTTCAGCCCACACATCAGCC
TTGATCACCACCACGACCCATATACTCCGTGATTTGGAACATGCGGTGGTTGCCGTGGGTGAGGTTACTGCACATTTGGGCCAGTGGGCTGATACGTCACTAGTTGGGTA
CCGAGGCTCTGGGTACAAAGGTCGGGGATCGATACGTCAACTCTGGACAAAGATGAGTGTCGAGGCTTCGAATAAATGGTCGGGGGTCGGTGCAATGAATCTTAGAGGGA
AGGCAAGTATTGGGGCCTCGTGGGAAAAAGACAAGGACCAATATCCGGCGGCTCCTTTTTCTCCGGCGAAGCGGCACAGCAGTAGCAGTGAGCCCCACGCCGGCGTCCTC
CCGCAGCAGCTCGTGGCGGCGCACGAACGACGGTGTCCACTGCGGCCGCGTTCCTCAGATCCGCGACGACCCACGCGATTTTGGCTCCATCTGAACGGGTCTGTTGGAGT
TCGATGGTCGGTGACAAGCTCGGGTCGGATCCGGCAGTGCAACAGCATCTCCACGGCGGCGCGACGGTTTCTGCACCACCCTGCAGCCCTACTCGCGGCGGTGTGCGACG
AGCGGCGTTCCTTCACGGCGGCGGACGACGCACAGGTCAACTCCACGGCGGTCCCGACGACCTGCTGCTACAACCTCAACCCCGACGACGTGCGCAACTCCCGACGGCTG
CCTCCGACGAACAGCAGCTCCGGCGACTCGTTCTTTTTGCAATCCGACGCGAACAGTAGCAGATTCGCGAAGACTGTAACTCTTGTTTATGTTGTTTTGTATGCCTGGTT
AGTGTTGAATGTCCGTCGGCGAAGGATGTTTATATTGTGGAAGCACGCTGAATTGTGTTGGTCAGCTGAGAGATGCGTTGAGATCATACTAGTGGTGGTGTTGTTGATGG
GTGTTGTGATGGTTGGTGCTATTAGGGAAGTAGTACTGAAATCTTGGTTGGAGGCTAGAAATGTTATGTGGAATTATGAAAGTTATGAAGTGAATGGGTCGTCAGTGTAC
GGTCCTTGTCAGTCTCCCCATCATCACCAGATACGTTTCCGAGTTCTGGATACAATGGCTTTTCGACGGAACACGAGAGCTCACAACTACGAGGATCCGAACCCTAGGGG
TGAGGAAGCAACGGATCCAAATGTTCCCCCGGCAGTTCCTGGAGGGGTTGCACCCCCGGTCCCGCAGGCAGCACCTGAAGGAGTTCCCCAGGTGAATCCCCAGGTGGCGT
TACTAGCTGAGGCATTGCAAGTATTGCTGGATAATGCGAATGAAGCCGGTGGGGCTCAGGCGCAGCAGCCTCGCCGGGCACAGATTCAACAAGAGGAGGTCCAGTTTATC
AGGGATTTCAAACGCTTCGGACCACCCGTTTTTAACAGAGTCCGGGGAGCAGTGTTTATGCTTCAAGGAGAAGCAGTAAACTGGTGGGAGTCGGTGGCGGCAGCGGAGGA
TCACGGCAACGCACCCGTCACATGGGCGAGGTTTAAGGACCTACTCCATGAGTACTATTTCCCCGTGACTGTCAGGAACGAAAAACGGGCAGAGTTTCTCCGTCTCACTC
AAGGGAGCCTAACTGTGGCCCAATACGAGAGGAAGTTCACTGAGTTGTCCCGTTTTGGAATGCAATATATTCCTACTGAACAATTAAAGATTGACAAGTTCATTGACAGT
TTGCGTAGGGAGATCAAGGGGCTACTTGTTCTTAAGGAGCCAACTACTTATGCAGCAGCAGTCAGGTGTGCGTTGGTTATGGACAAATGTCTCGAGGAGCCTCAATCTCA
ACAGGTGATGGGCTCTAGCTCGGGGGTCAAGAGGAAATTTGCATCGTTCTCCTCCAGTCAACCCTCGAGGGGACACCTGCAGCTTGTGCAAAGGCAGACTGTTTCTCCGG
TGTCCCCTCTTTGTCCGATGACCGGCTCGAATACCCAAGCTTTAGGCCAGAGGATCCCTGCGACGGCAGCAACTCAAGGTGGAACCCATAGGGCGCGCGTCTTCGCTCTT
ACGAGGGGGGATGTTGAGCATGGCGAGGCGGTGGTCACAGGGACTGTTTTAGTACTCAGTATGCCTGCTTACGCTTTATTTGACTCGGGGTCTAGTCACTCTTTCATTGC
TTCTACCTTTGTTCGGCATGCGGACCTAGAGCTAGAATCGTTAGGCTTTTTGTTGTCGATATCCACTCCGTTAGGATCTGTGTTGGTCAATAGTCAAGTGGTGAAAGGAG
GCCAGCTCTCTTTCGATGGCATGGATTGGTTAGCTGCTAACCGGGCTAATATCGATTGCTCGAAGAAGGAAGTTAGCTTTCGCTTGCCCTCCGGACAAAACTTTACCTTT
AAAGGAGTCAAGGCCGGGGTCCCGAGGGTGGTGTCGGCATTGAAGGCCAACCATCTGCTCTAG
Protein sequenceShow/hide protein sequence
MSNSKDSRRQQTRATLTMTPRRRRSHNSAVMARTYEASDTLECAMGTVSQGNAGAKRCRIRISNPGSGALQLNMPPRRRRSVRRGGLNRDVVPDTVEPTEKNQTSAHTSA
LITTTTHILRDLEHAVVAVGEVTAHLGQWADTSLVGYRGSGYKGRGSIRQLWTKMSVEASNKWSGVGAMNLRGKASIGASWEKDKDQYPAAPFSPAKRHSSSSEPHAGVL
PQQLVAAHERRCPLRPRSSDPRRPTRFWLHLNGSVGVRWSVTSSGRIRQCNSISTAARRFLHHPAALLAAVCDERRSFTAADDAQVNSTAVPTTCCYNLNPDDVRNSRRL
PPTNSSSGDSFFLQSDANSSRFAKTVTLVYVVLYAWLVLNVRRRRMFILWKHAELCWSAERCVEIILVVVLLMGVVMVGAIREVVLKSWLEARNVMWNYESYEVNGSSVY
GPCQSPHHHQIRFRVLDTMAFRRNTRAHNYEDPNPRGEEATDPNVPPAVPGGVAPPVPQAAPEGVPQVNPQVALLAEALQVLLDNANEAGGAQAQQPRRAQIQQEEVQFI
RDFKRFGPPVFNRVRGAVFMLQGEAVNWWESVAAAEDHGNAPVTWARFKDLLHEYYFPVTVRNEKRAEFLRLTQGSLTVAQYERKFTELSRFGMQYIPTEQLKIDKFIDS
LRREIKGLLVLKEPTTYAAAVRCALVMDKCLEEPQSQQVMGSSSGVKRKFASFSSSQPSRGHLQLVQRQTVSPVSPLCPMTGSNTQALGQRIPATAATQGGTHRARVFAL
TRGDVEHGEAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHADLELESLGFLLSISTPLGSVLVNSQVVKGGQLSFDGMDWLAANRANIDCSKKEVSFRLPSGQNFTF
KGVKAGVPRVVSALKANHLL