; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g19480 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g19480
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionReverse transcriptase
Genome locationchr9:15103998..15109369
RNA-Seq ExpressionMoc09g19480
SyntenyMoc09g19480
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001969 - Aspartic peptidase, active site
IPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022155341.1 uncharacterized protein LOC111022474 [Momordica charantia]1.5e-19278.04Show/hide
Query:  MAFRRNTMARNYEDPNPRGKEAADPNVPPAVPGGEAPPVPQAASQGVPQVNAQVALLAEALQALLNNANGAGGAQAQQPRRAQIQQEEIQFIRDFKRFGP
        MAFRRNT A NYEDPNPRG+ AAD NVPP VP G APPVPQ A QGVPQVN QVALLAEALQ LL+NANGAGGAQ QQPRRAQIQQEE+QFIRDFKRFGP
Subjt:  MAFRRNTMARNYEDPNPRGKEAADPNVPPAVPGGEAPPVPQAASQGVPQVNAQVALLAEALQALLNNANGAGGAQAQQPRRAQIQQEEIQFIRDFKRFGP

Query:  PVFNGVSERPTAAEEWVRELEALYVYLGCSDEFKVRGAVFMLRGEAVNWWESVAAAEDHTNAPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLSQGSLTV
        PVFNGVSERPTA EEWVRELEALYVYLGCSDEFKVRGA+FMLRGEAVNWWESVAAAEDH N PVTWARFKDLLYEYYFPVTVRNEKRAEFLRL+QGSLTV
Subjt:  PVFNGVSERPTAAEEWVRELEALYVYLGCSDEFKVRGAVFMLRGEAVNWWESVAAAEDHTNAPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLSQGSLTV

Query:  AQYERKFTELSRFGMQYIPTEQLKIDKFIDGLRSQVRVGYGQMSRGTSVSTDDGLQLGGQEEICIVLLQSTLEGTPAACAKAY--CFSGVPHLECPMTGS
         QYERKFTELSRFGMQYIPTEQLKIDKFID LR +++                GL         +VL + T       CA     C      LE P +  
Subjt:  AQYERKFTELSRFGMQYIPTEQLKIDKFIDGLRSQVRVGYGQMSRGTSVSTDDGLQLGGQEEICIVLLQSTLEGTPAACAKAY--CFSGVPHLECPMTGS

Query:  NTQALGQGIPATAATQGGTHRARVFALTRGDAEHAEAVVTGTVLVLNMPAYALFDSGSSHSFIASTFFQHADLELESLGFLLSVSTPSGSVLVTNQVVKG
              Q IPATAATQGGTHRAR+FALTRGD EHAEAVVTGTVLVL+MPAYALFDSGSSHSFIASTF +HADLELESLGFL SVST SGSVL T+QVVKG
Subjt:  NTQALGQGIPATAATQGGTHRARVFALTRGDAEHAEAVVTGTVLVLNMPAYALFDSGSSHSFIASTFFQHADLELESLGFLLSVSTPSGSVLVTNQVVKG

Query:  GQLSFDGQTVEVKLIQLDMRDFNVILGMDWLAAYRANIDCSKKEVSFRLPSGQNFTFKGAKVGVPRVVS
        GQLSFDGQ ++VKLIQLDM+DF+VILGMDWLAA RANIDCSKKEVSFRLPSGQNF FKG K GVPRVVS
Subjt:  GQLSFDGQTVEVKLIQLDMRDFNVILGMDWLAAYRANIDCSKKEVSFRLPSGQNFTFKGAKVGVPRVVS

XP_022156328.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111023249 [Momordica charantia]9.6e-21672.24Show/hide
Query:  MAFRRNTMARNYEDPNPRGKEAADPNVPPAVPGGEAPPVPQAASQGVPQVNAQVALLAEALQALLNNANGAGGAQAQQPRRAQIQQEEIQFIRDFKRFGP
        MAFRRNT A NYEDPN RG+ AADPNV P VPGG  PPVPQAA QGVPQVN QVALLAEALQ LL+NANGAGGAQ QQPRRAQI Q+E+QFIRDFK FGP
Subjt:  MAFRRNTMARNYEDPNPRGKEAADPNVPPAVPGGEAPPVPQAASQGVPQVNAQVALLAEALQALLNNANGAGGAQAQQPRRAQIQQEEIQFIRDFKRFGP

Query:  PVFNGVSERPTAAEEWVRELEALYVYLGCSDEFKVRGAVFMLRGEAVNWWESVAAAEDHTNAPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLSQGSLTV
        PVFNGVSERPTAAEEWVRELEALYVYLGCSD+FKVRGAVFMLRGEAVNWWESVAAAEDH N PVTWARFKDLLYEYYFPV  RNEKR EFLRL+QGSLTV
Subjt:  PVFNGVSERPTAAEEWVRELEALYVYLGCSDEFKVRGAVFMLRGEAVNWWESVAAAEDHTNAPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLSQGSLTV

Query:  AQYERKFTELSRFGMQYIPTEQLKIDKFIDGLRSQVR---------------------------------VGYG-----QMSRGTSVSTDDGLQLGGQEE
        AQYERKFTELSRFG QY+PTEQLKIDKFIDGLR +++                                 +G       + +  ++  +  G Q   Q +
Subjt:  AQYERKFTELSRFGMQYIPTEQLKIDKFIDGLRSQVR---------------------------------VGYG-----QMSRGTSVSTDDGLQLGGQEE

Query:  ICIVLLQSTLEGTPAAC--AKAYCF----SGVPHLECPMTGSNTQALGQGIPATAATQGGTHRARVFALTRGDAEHAEAVVTGTVLVLNMPAYALFDSGS
            +  S  +     C   K  CF     G    EC MTGSNTQAL Q  P   ATQGGT  ARVFALTRGD EHAEAVVTGT+L+L++PAYALFDSGS
Subjt:  ICIVLLQSTLEGTPAAC--AKAYCF----SGVPHLECPMTGSNTQALGQGIPATAATQGGTHRARVFALTRGDAEHAEAVVTGTVLVLNMPAYALFDSGS

Query:  SHSFIASTFFQHADLELESLGFLLSVSTPSGSVLVTNQVVKGGQLSFDGQTVEVKLIQLDMRDFNVILGMDWLAAYRANIDCSKKEVSFRLPSGQNFTFK
        SHSFIASTF +HADLELES GF LSVSTPSGSVLVT+QVVKGGQLSF GQT+EV LIQL+M+DF+VILGMDWLAA RANI+CSKKEVSF L SGQNFTFK
Subjt:  SHSFIASTFFQHADLELESLGFLLSVSTPSGSVLVTNQVVKGGQLSFDGQTVEVKLIQLDMRDFNVILGMDWLAAYRANIDCSKKEVSFRLPSGQNFTFK

Query:  GAKVGVPRVVSALKASHLLQRGVWAYLASIVDTRKVVPTIEAVRVVNEFSDVFPEDLPGLPP
        G K GVPRVVSALKAS+LLQRGVWAYLAS+VD RKVVP+IE VRVVNEF+DVFPEDLPGLPP
Subjt:  GAKVGVPRVVSALKASHLLQRGVWAYLASIVDTRKVVPTIEAVRVVNEFSDVFPEDLPGLPP

XP_022156328.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111023249 [Momordica charantia]3.5e-0868.85Show/hide
Query:  KLERLEVELTVDDVSALLARLSVEPGLRQRITVAQKEDPSWAMGISLSRVEHRAACFAGSA
        +LE  EVELTVDDVSALLARLSVEP LRQRI VAQKEDPS A G S+  V H     +G A
Subjt:  KLERLEVELTVDDVSALLARLSVEPGLRQRITVAQKEDPSWAMGISLSRVEHRAACFAGSA

XP_022156328.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111023249 [Momordica charantia]3.4e-21375.66Show/hide
Query:  MAFRRNTMARNYEDPNPRGKEAADPNVPPAVPGGEAPPVPQAASQGVPQVNAQVALLAEALQALLNNANGAGGAQAQQPRRAQIQQEEIQFIRDFKRFGP
        MAFRRNT A NY+DPNPRG+ AADPNVP  VPG  APPVPQAA QGVPQVN QVALLAEALQ LL+NANGAGGAQ QQPRRAQI Q+E+QFIRDFKRFGP
Subjt:  MAFRRNTMARNYEDPNPRGKEAADPNVPPAVPGGEAPPVPQAASQGVPQVNAQVALLAEALQALLNNANGAGGAQAQQPRRAQIQQEEIQFIRDFKRFGP

Query:  PVFNGVSERPTAAEEWVRELEALYVYLGCSDEFKVRGAVFMLRGEAVNWWESVAAAEDHTNAPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLSQGSLTV
        PVFNGVSERPTA EEWVRELEALYVYLGCSD+FKVRGAVFMLRGEAVNWWESVAAAEDHTN PVTWARFKDLLYEYYFPVTVRNEKRAEFLRL+QGSLTV
Subjt:  PVFNGVSERPTAAEEWVRELEALYVYLGCSDEFKVRGAVFMLRGEAVNWWESVAAAEDHTNAPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLSQGSLTV

Query:  AQYERKFTELSRFGMQYIPTEQLKIDKFIDGLRSQVRVGYGQMSRGTSVSTDDGLQLGGQEEICIVLLQSTLEGTPAACAKAYCFSGVP-------HLE-
        AQYERKFTELSRFGMQYIPTEQLKIDKFIDGLR +++ G   +   T+ +      L   + +     Q  +  +     K   FS          H++ 
Subjt:  AQYERKFTELSRFGMQYIPTEQLKIDKFIDGLRSQVRVGYGQMSRGTSVSTDDGLQLGGQEEICIVLLQSTLEGTPAACAKAYCFSGVP-------HLE-

Query:  ------CPMTGSNTQA---LGQGI-------PATAATQGGTHRARVFALTRGDAEHAEAVVTGTVLVLNMPAYALFDSGSSHSFIASTFFQHADLELESL
              CP    N      LG+ I       PA AA QGGT RARVFALTRGD EHAEAVVTGT+LV++MPAYALFDSGSSHSFIASTF +HADLELESL
Subjt:  ------CPMTGSNTQA---LGQGI-------PATAATQGGTHRARVFALTRGDAEHAEAVVTGTVLVLNMPAYALFDSGSSHSFIASTFFQHADLELESL

Query:  GFLLSVSTPSGSVLVTNQVVKGGQLSFDGQTVEVKLIQLDMRDFNVILGMDWLAAYRANIDCSKKEVSFRLPSGQNFTFKGAKVGVPRVVSALKASHLLQ
        GFLLSVSTPSGSVLV +QVVKGGQLSFDGQT EVKLIQLDM+DF+VILGMDWLAA RANI+CSKKEVSFRLPSGQNFTFK  KVGVPRVVSALKA++LLQ
Subjt:  GFLLSVSTPSGSVLVTNQVVKGGQLSFDGQTVEVKLIQLDMRDFNVILGMDWLAAYRANIDCSKKEVSFRLPSGQNFTFKGAKVGVPRVVSALKASHLLQ

Query:  RGVWAYLASIVDTRKVVPTIEAVRVVNEFSDVFP
        RG WAYLAS+VD RKVVP+IEAVRVVNEF+DVFP
Subjt:  RGVWAYLASIVDTRKVVPTIEAVRVVNEFSDVFP

XP_022156992.1 uncharacterized protein LOC111023821 [Momordica charantia]3.4e-17663.07Show/hide
Query:  RQSPRHHQTMAFRRNTMARNYEDPNPRGKEAADPNVPPAVPGGEAPPVPQAASQGVPQVNAQVALLAEALQALLNNANGAGGAQAQQPRRAQIQQEEIQF
        R+     +TMAFRRNT A NYEDPNPRG+ AADPNVP AVPGG AP VPQAA QGVPQ                   NGAGGAQ QQPRRAQ  QEE+QF
Subjt:  RQSPRHHQTMAFRRNTMARNYEDPNPRGKEAADPNVPPAVPGGEAPPVPQAASQGVPQVNAQVALLAEALQALLNNANGAGGAQAQQPRRAQIQQEEIQF

Query:  IRDFKRFGPPVFNGVSERPTAAEEWVRELEALYVYLGCSDEFKVRGAVFMLRGEAVNWWESVAAAEDHTNAPVTWARFKDLLYEYYFPVTVRNEKRAEFL
        IRDFKRFGPPVFNGVSERPTAAEEWVRELEALYVYLGCSD+FKV+GAV                                            NEKRAEFL
Subjt:  IRDFKRFGPPVFNGVSERPTAAEEWVRELEALYVYLGCSDEFKVRGAVFMLRGEAVNWWESVAAAEDHTNAPVTWARFKDLLYEYYFPVTVRNEKRAEFL

Query:  RLSQGSLTVAQYERKFTELSRFGMQYIPTEQLKIDKFIDGLRSQVR-----------------------------------VGYGQMSRGTSVSTDD---
        RL+QGSLTVAQYERKFTELSRF MQYIP EQLKIDKFIDGL  +++                                      G   +  S S+     
Subjt:  RLSQGSLTVAQYERKFTELSRFGMQYIPTEQLKIDKFIDGLRSQVR-----------------------------------VGYGQMSRGTSVSTDD---

Query:  GLQLGGQEEICIVLLQSTLEGTPAAC--AKAYCF----SGVPHLECPMTGSNTQALGQGIPATAATQGGTHRARVFALTRGDAEHAEAVVTGTVLVLNMP
        G Q   Q +    +  S  +     C   K  C+     G    ECPMTG NTQ LGQ IP T A QGGTHRARVFALTRGD  HAEAVV GTVLVL+MP
Subjt:  GLQLGGQEEICIVLLQSTLEGTPAAC--AKAYCF----SGVPHLECPMTGSNTQALGQGIPATAATQGGTHRARVFALTRGDAEHAEAVVTGTVLVLNMP

Query:  AYALFDSGSSHSFIASTFFQHADLELESLGFLLSVSTPSGSVLVTNQVVKGGQLSFDGQTVEVKLIQLDMRDFNVILGMDWLAAYRANIDCSKKEVSFRL
        AYALFDS SSHSFIASTF +HADLELESLGFLLSVSTPSGSVLVT+Q+VKGGQLSFDGQT+EVKLIQLDM+DF+VILGMDWLAA +ANIDCSKKE SFRL
Subjt:  AYALFDSGSSHSFIASTFFQHADLELESLGFLLSVSTPSGSVLVTNQVVKGGQLSFDGQTVEVKLIQLDMRDFNVILGMDWLAAYRANIDCSKKEVSFRL

Query:  PSGQNFTFKGAKVGVPRVVSALKASHLLQRGVWAYLASIVDTRKVVPTIEAVRVVNEFSDVFPEDLPGLPPKLE
        PS QNFTFKG K  VPRVVSALKASH LQRG WAYLAS+VD RKVVP+IEAVRVVNEF+DVFPEDLPGLPP  E
Subjt:  PSGQNFTFKGAKVGVPRVVSALKASHLLQRGVWAYLASIVDTRKVVPTIEAVRVVNEFSDVFPEDLPGLPPKLE

XP_022158750.1 uncharacterized protein LOC111025215 [Momordica charantia]7.9e-20272.38Show/hide
Query:  MAFRRNTMARNYEDPNPRGKEAADPNVPPAVPGGEAPPVPQAASQGVPQVNAQVALLAEALQALLNNANGAGGAQAQQPRRAQIQQEEIQFIRDFKRFGP
        MAFRRNT A NYEDPNPRG+ AADPNVPPAVPGG APP PQAASQGVPQVN QVALLAEALQ LL+NANGAGGAQ QQPR AQI QEE            
Subjt:  MAFRRNTMARNYEDPNPRGKEAADPNVPPAVPGGEAPPVPQAASQGVPQVNAQVALLAEALQALLNNANGAGGAQAQQPRRAQIQQEEIQFIRDFKRFGP

Query:  PVFNGVSERPTAAEEWVRELEALYVYLGCSDEFKVRGAVFMLRGEAVNWWESVAAAEDHTNAPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLSQGSLTV
             VSERPTAAEEWVRELEALYVYLGCSD+FKVRGAVFMLRGEAVNWWESVAAAEDH N PVTWARFKDLLYEYYFPVTVRNEKR EFLRL+QGSLTV
Subjt:  PVFNGVSERPTAAEEWVRELEALYVYLGCSDEFKVRGAVFMLRGEAVNWWESVAAAEDHTNAPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLSQGSLTV

Query:  AQYERKFTELSRFGMQYIPTEQLKIDKFIDGLRSQVRVGYGQMSRGTS-----------------------VSTDDGL----------------QLGGQE
        A+YERKFTELSRFGMQYIPT+QLKIDKFIDGLR +++ G   +   T+                       + +  G+                Q   Q 
Subjt:  AQYERKFTELSRFGMQYIPTEQLKIDKFIDGLRSQVRVGYGQMSRGTS-----------------------VSTDDGL----------------QLGGQE

Query:  EICIVLLQSTLEGTPAAC--AKAYCF----SGVPHLECPMTGSNTQALGQGIPATAATQGGTHRARVFALTRGDAEHAEAVVTGTVLVLNMPAYALFDSG
        +    +  S  +     C   K  C+     G    ECPMTGSNTQALGQ IPATAA QGGTHRARVFALTRGD E+AEAVVT TVLVL+MPAYALFDSG
Subjt:  EICIVLLQSTLEGTPAAC--AKAYCF----SGVPHLECPMTGSNTQALGQGIPATAATQGGTHRARVFALTRGDAEHAEAVVTGTVLVLNMPAYALFDSG

Query:  SSHSFIASTFFQHADLELESLGFLLSVSTPSGSVLVTNQVVKGGQLSFDGQTVEVKLIQLDMRDFNVILGMDWLAAYRANIDCSKKEVSFRLPSGQNFTF
        SSHSFIASTF  HADLELESLGFLLSVSTPSGSVLVT+QVVKGGQLSFDGQT+EVKLIQLDM+DF+VILGMDWLAA RANIDCSKK+VSFRLPSGQNFTF
Subjt:  SSHSFIASTFFQHADLELESLGFLLSVSTPSGSVLVTNQVVKGGQLSFDGQTVEVKLIQLDMRDFNVILGMDWLAAYRANIDCSKKEVSFRLPSGQNFTF

Query:  KGAKVGVPRVVSALKASHLLQRGVWAYLASIVDTRKVVPTIEA
        KG K GVPRVV ALKASHLLQRG WAYLAS+VD RKVVP+IEA
Subjt:  KGAKVGVPRVVSALKASHLLQRGVWAYLASIVDTRKVVPTIEA

TrEMBL top hitse value%identityAlignment
A0A6J1DQB9 Reverse transcriptase4.7e-21672.24Show/hide
Query:  MAFRRNTMARNYEDPNPRGKEAADPNVPPAVPGGEAPPVPQAASQGVPQVNAQVALLAEALQALLNNANGAGGAQAQQPRRAQIQQEEIQFIRDFKRFGP
        MAFRRNT A NYEDPN RG+ AADPNV P VPGG  PPVPQAA QGVPQVN QVALLAEALQ LL+NANGAGGAQ QQPRRAQI Q+E+QFIRDFK FGP
Subjt:  MAFRRNTMARNYEDPNPRGKEAADPNVPPAVPGGEAPPVPQAASQGVPQVNAQVALLAEALQALLNNANGAGGAQAQQPRRAQIQQEEIQFIRDFKRFGP

Query:  PVFNGVSERPTAAEEWVRELEALYVYLGCSDEFKVRGAVFMLRGEAVNWWESVAAAEDHTNAPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLSQGSLTV
        PVFNGVSERPTAAEEWVRELEALYVYLGCSD+FKVRGAVFMLRGEAVNWWESVAAAEDH N PVTWARFKDLLYEYYFPV  RNEKR EFLRL+QGSLTV
Subjt:  PVFNGVSERPTAAEEWVRELEALYVYLGCSDEFKVRGAVFMLRGEAVNWWESVAAAEDHTNAPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLSQGSLTV

Query:  AQYERKFTELSRFGMQYIPTEQLKIDKFIDGLRSQVR---------------------------------VGYG-----QMSRGTSVSTDDGLQLGGQEE
        AQYERKFTELSRFG QY+PTEQLKIDKFIDGLR +++                                 +G       + +  ++  +  G Q   Q +
Subjt:  AQYERKFTELSRFGMQYIPTEQLKIDKFIDGLRSQVR---------------------------------VGYG-----QMSRGTSVSTDDGLQLGGQEE

Query:  ICIVLLQSTLEGTPAAC--AKAYCF----SGVPHLECPMTGSNTQALGQGIPATAATQGGTHRARVFALTRGDAEHAEAVVTGTVLVLNMPAYALFDSGS
            +  S  +     C   K  CF     G    EC MTGSNTQAL Q  P   ATQGGT  ARVFALTRGD EHAEAVVTGT+L+L++PAYALFDSGS
Subjt:  ICIVLLQSTLEGTPAAC--AKAYCF----SGVPHLECPMTGSNTQALGQGIPATAATQGGTHRARVFALTRGDAEHAEAVVTGTVLVLNMPAYALFDSGS

Query:  SHSFIASTFFQHADLELESLGFLLSVSTPSGSVLVTNQVVKGGQLSFDGQTVEVKLIQLDMRDFNVILGMDWLAAYRANIDCSKKEVSFRLPSGQNFTFK
        SHSFIASTF +HADLELES GF LSVSTPSGSVLVT+QVVKGGQLSF GQT+EV LIQL+M+DF+VILGMDWLAA RANI+CSKKEVSF L SGQNFTFK
Subjt:  SHSFIASTFFQHADLELESLGFLLSVSTPSGSVLVTNQVVKGGQLSFDGQTVEVKLIQLDMRDFNVILGMDWLAAYRANIDCSKKEVSFRLPSGQNFTFK

Query:  GAKVGVPRVVSALKASHLLQRGVWAYLASIVDTRKVVPTIEAVRVVNEFSDVFPEDLPGLPP
        G K GVPRVVSALKAS+LLQRGVWAYLAS+VD RKVVP+IE VRVVNEF+DVFPEDLPGLPP
Subjt:  GAKVGVPRVVSALKASHLLQRGVWAYLASIVDTRKVVPTIEAVRVVNEFSDVFPEDLPGLPP

A0A6J1DQB9 Reverse transcriptase1.7e-0868.85Show/hide
Query:  KLERLEVELTVDDVSALLARLSVEPGLRQRITVAQKEDPSWAMGISLSRVEHRAACFAGSA
        +LE  EVELTVDDVSALLARLSVEP LRQRI VAQKEDPS A G S+  V H     +G A
Subjt:  KLERLEVELTVDDVSALLARLSVEPGLRQRITVAQKEDPSWAMGISLSRVEHRAACFAGSA

A0A6J1DQB9 Reverse transcriptase1.7e-21375.66Show/hide
Query:  MAFRRNTMARNYEDPNPRGKEAADPNVPPAVPGGEAPPVPQAASQGVPQVNAQVALLAEALQALLNNANGAGGAQAQQPRRAQIQQEEIQFIRDFKRFGP
        MAFRRNT A NY+DPNPRG+ AADPNVP  VPG  APPVPQAA QGVPQVN QVALLAEALQ LL+NANGAGGAQ QQPRRAQI Q+E+QFIRDFKRFGP
Subjt:  MAFRRNTMARNYEDPNPRGKEAADPNVPPAVPGGEAPPVPQAASQGVPQVNAQVALLAEALQALLNNANGAGGAQAQQPRRAQIQQEEIQFIRDFKRFGP

Query:  PVFNGVSERPTAAEEWVRELEALYVYLGCSDEFKVRGAVFMLRGEAVNWWESVAAAEDHTNAPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLSQGSLTV
        PVFNGVSERPTA EEWVRELEALYVYLGCSD+FKVRGAVFMLRGEAVNWWESVAAAEDHTN PVTWARFKDLLYEYYFPVTVRNEKRAEFLRL+QGSLTV
Subjt:  PVFNGVSERPTAAEEWVRELEALYVYLGCSDEFKVRGAVFMLRGEAVNWWESVAAAEDHTNAPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLSQGSLTV

Query:  AQYERKFTELSRFGMQYIPTEQLKIDKFIDGLRSQVRVGYGQMSRGTSVSTDDGLQLGGQEEICIVLLQSTLEGTPAACAKAYCFSGVP-------HLE-
        AQYERKFTELSRFGMQYIPTEQLKIDKFIDGLR +++ G   +   T+ +      L   + +     Q  +  +     K   FS          H++ 
Subjt:  AQYERKFTELSRFGMQYIPTEQLKIDKFIDGLRSQVRVGYGQMSRGTSVSTDDGLQLGGQEEICIVLLQSTLEGTPAACAKAYCFSGVP-------HLE-

Query:  ------CPMTGSNTQA---LGQGI-------PATAATQGGTHRARVFALTRGDAEHAEAVVTGTVLVLNMPAYALFDSGSSHSFIASTFFQHADLELESL
              CP    N      LG+ I       PA AA QGGT RARVFALTRGD EHAEAVVTGT+LV++MPAYALFDSGSSHSFIASTF +HADLELESL
Subjt:  ------CPMTGSNTQA---LGQGI-------PATAATQGGTHRARVFALTRGDAEHAEAVVTGTVLVLNMPAYALFDSGSSHSFIASTFFQHADLELESL

Query:  GFLLSVSTPSGSVLVTNQVVKGGQLSFDGQTVEVKLIQLDMRDFNVILGMDWLAAYRANIDCSKKEVSFRLPSGQNFTFKGAKVGVPRVVSALKASHLLQ
        GFLLSVSTPSGSVLV +QVVKGGQLSFDGQT EVKLIQLDM+DF+VILGMDWLAA RANI+CSKKEVSFRLPSGQNFTFK  KVGVPRVVSALKA++LLQ
Subjt:  GFLLSVSTPSGSVLVTNQVVKGGQLSFDGQTVEVKLIQLDMRDFNVILGMDWLAAYRANIDCSKKEVSFRLPSGQNFTFKGAKVGVPRVVSALKASHLLQ

Query:  RGVWAYLASIVDTRKVVPTIEAVRVVNEFSDVFP
        RG WAYLAS+VD RKVVP+IEAVRVVNEF+DVFP
Subjt:  RGVWAYLASIVDTRKVVPTIEAVRVVNEFSDVFP

A0A6J1DRF5 uncharacterized protein LOC1110224747.3e-19378.04Show/hide
Query:  MAFRRNTMARNYEDPNPRGKEAADPNVPPAVPGGEAPPVPQAASQGVPQVNAQVALLAEALQALLNNANGAGGAQAQQPRRAQIQQEEIQFIRDFKRFGP
        MAFRRNT A NYEDPNPRG+ AAD NVPP VP G APPVPQ A QGVPQVN QVALLAEALQ LL+NANGAGGAQ QQPRRAQIQQEE+QFIRDFKRFGP
Subjt:  MAFRRNTMARNYEDPNPRGKEAADPNVPPAVPGGEAPPVPQAASQGVPQVNAQVALLAEALQALLNNANGAGGAQAQQPRRAQIQQEEIQFIRDFKRFGP

Query:  PVFNGVSERPTAAEEWVRELEALYVYLGCSDEFKVRGAVFMLRGEAVNWWESVAAAEDHTNAPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLSQGSLTV
        PVFNGVSERPTA EEWVRELEALYVYLGCSDEFKVRGA+FMLRGEAVNWWESVAAAEDH N PVTWARFKDLLYEYYFPVTVRNEKRAEFLRL+QGSLTV
Subjt:  PVFNGVSERPTAAEEWVRELEALYVYLGCSDEFKVRGAVFMLRGEAVNWWESVAAAEDHTNAPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLSQGSLTV

Query:  AQYERKFTELSRFGMQYIPTEQLKIDKFIDGLRSQVRVGYGQMSRGTSVSTDDGLQLGGQEEICIVLLQSTLEGTPAACAKAY--CFSGVPHLECPMTGS
         QYERKFTELSRFGMQYIPTEQLKIDKFID LR +++                GL         +VL + T       CA     C      LE P +  
Subjt:  AQYERKFTELSRFGMQYIPTEQLKIDKFIDGLRSQVRVGYGQMSRGTSVSTDDGLQLGGQEEICIVLLQSTLEGTPAACAKAY--CFSGVPHLECPMTGS

Query:  NTQALGQGIPATAATQGGTHRARVFALTRGDAEHAEAVVTGTVLVLNMPAYALFDSGSSHSFIASTFFQHADLELESLGFLLSVSTPSGSVLVTNQVVKG
              Q IPATAATQGGTHRAR+FALTRGD EHAEAVVTGTVLVL+MPAYALFDSGSSHSFIASTF +HADLELESLGFL SVST SGSVL T+QVVKG
Subjt:  NTQALGQGIPATAATQGGTHRARVFALTRGDAEHAEAVVTGTVLVLNMPAYALFDSGSSHSFIASTFFQHADLELESLGFLLSVSTPSGSVLVTNQVVKG

Query:  GQLSFDGQTVEVKLIQLDMRDFNVILGMDWLAAYRANIDCSKKEVSFRLPSGQNFTFKGAKVGVPRVVS
        GQLSFDGQ ++VKLIQLDM+DF+VILGMDWLAA RANIDCSKKEVSFRLPSGQNF FKG K GVPRVVS
Subjt:  GQLSFDGQTVEVKLIQLDMRDFNVILGMDWLAAYRANIDCSKKEVSFRLPSGQNFTFKGAKVGVPRVVS

A0A6J1DTE5 uncharacterized protein LOC1110238211.6e-17663.07Show/hide
Query:  RQSPRHHQTMAFRRNTMARNYEDPNPRGKEAADPNVPPAVPGGEAPPVPQAASQGVPQVNAQVALLAEALQALLNNANGAGGAQAQQPRRAQIQQEEIQF
        R+     +TMAFRRNT A NYEDPNPRG+ AADPNVP AVPGG AP VPQAA QGVPQ                   NGAGGAQ QQPRRAQ  QEE+QF
Subjt:  RQSPRHHQTMAFRRNTMARNYEDPNPRGKEAADPNVPPAVPGGEAPPVPQAASQGVPQVNAQVALLAEALQALLNNANGAGGAQAQQPRRAQIQQEEIQF

Query:  IRDFKRFGPPVFNGVSERPTAAEEWVRELEALYVYLGCSDEFKVRGAVFMLRGEAVNWWESVAAAEDHTNAPVTWARFKDLLYEYYFPVTVRNEKRAEFL
        IRDFKRFGPPVFNGVSERPTAAEEWVRELEALYVYLGCSD+FKV+GAV                                            NEKRAEFL
Subjt:  IRDFKRFGPPVFNGVSERPTAAEEWVRELEALYVYLGCSDEFKVRGAVFMLRGEAVNWWESVAAAEDHTNAPVTWARFKDLLYEYYFPVTVRNEKRAEFL

Query:  RLSQGSLTVAQYERKFTELSRFGMQYIPTEQLKIDKFIDGLRSQVR-----------------------------------VGYGQMSRGTSVSTDD---
        RL+QGSLTVAQYERKFTELSRF MQYIP EQLKIDKFIDGL  +++                                      G   +  S S+     
Subjt:  RLSQGSLTVAQYERKFTELSRFGMQYIPTEQLKIDKFIDGLRSQVR-----------------------------------VGYGQMSRGTSVSTDD---

Query:  GLQLGGQEEICIVLLQSTLEGTPAAC--AKAYCF----SGVPHLECPMTGSNTQALGQGIPATAATQGGTHRARVFALTRGDAEHAEAVVTGTVLVLNMP
        G Q   Q +    +  S  +     C   K  C+     G    ECPMTG NTQ LGQ IP T A QGGTHRARVFALTRGD  HAEAVV GTVLVL+MP
Subjt:  GLQLGGQEEICIVLLQSTLEGTPAAC--AKAYCF----SGVPHLECPMTGSNTQALGQGIPATAATQGGTHRARVFALTRGDAEHAEAVVTGTVLVLNMP

Query:  AYALFDSGSSHSFIASTFFQHADLELESLGFLLSVSTPSGSVLVTNQVVKGGQLSFDGQTVEVKLIQLDMRDFNVILGMDWLAAYRANIDCSKKEVSFRL
        AYALFDS SSHSFIASTF +HADLELESLGFLLSVSTPSGSVLVT+Q+VKGGQLSFDGQT+EVKLIQLDM+DF+VILGMDWLAA +ANIDCSKKE SFRL
Subjt:  AYALFDSGSSHSFIASTFFQHADLELESLGFLLSVSTPSGSVLVTNQVVKGGQLSFDGQTVEVKLIQLDMRDFNVILGMDWLAAYRANIDCSKKEVSFRL

Query:  PSGQNFTFKGAKVGVPRVVSALKASHLLQRGVWAYLASIVDTRKVVPTIEAVRVVNEFSDVFPEDLPGLPPKLE
        PS QNFTFKG K  VPRVVSALKASH LQRG WAYLAS+VD RKVVP+IEAVRVVNEF+DVFPEDLPGLPP  E
Subjt:  PSGQNFTFKGAKVGVPRVVSALKASHLLQRGVWAYLASIVDTRKVVPTIEAVRVVNEFSDVFPEDLPGLPPKLE

A0A6J1DWP4 uncharacterized protein LOC1110252153.8e-20272.38Show/hide
Query:  MAFRRNTMARNYEDPNPRGKEAADPNVPPAVPGGEAPPVPQAASQGVPQVNAQVALLAEALQALLNNANGAGGAQAQQPRRAQIQQEEIQFIRDFKRFGP
        MAFRRNT A NYEDPNPRG+ AADPNVPPAVPGG APP PQAASQGVPQVN QVALLAEALQ LL+NANGAGGAQ QQPR AQI QEE            
Subjt:  MAFRRNTMARNYEDPNPRGKEAADPNVPPAVPGGEAPPVPQAASQGVPQVNAQVALLAEALQALLNNANGAGGAQAQQPRRAQIQQEEIQFIRDFKRFGP

Query:  PVFNGVSERPTAAEEWVRELEALYVYLGCSDEFKVRGAVFMLRGEAVNWWESVAAAEDHTNAPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLSQGSLTV
             VSERPTAAEEWVRELEALYVYLGCSD+FKVRGAVFMLRGEAVNWWESVAAAEDH N PVTWARFKDLLYEYYFPVTVRNEKR EFLRL+QGSLTV
Subjt:  PVFNGVSERPTAAEEWVRELEALYVYLGCSDEFKVRGAVFMLRGEAVNWWESVAAAEDHTNAPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLSQGSLTV

Query:  AQYERKFTELSRFGMQYIPTEQLKIDKFIDGLRSQVRVGYGQMSRGTS-----------------------VSTDDGL----------------QLGGQE
        A+YERKFTELSRFGMQYIPT+QLKIDKFIDGLR +++ G   +   T+                       + +  G+                Q   Q 
Subjt:  AQYERKFTELSRFGMQYIPTEQLKIDKFIDGLRSQVRVGYGQMSRGTS-----------------------VSTDDGL----------------QLGGQE

Query:  EICIVLLQSTLEGTPAAC--AKAYCF----SGVPHLECPMTGSNTQALGQGIPATAATQGGTHRARVFALTRGDAEHAEAVVTGTVLVLNMPAYALFDSG
        +    +  S  +     C   K  C+     G    ECPMTGSNTQALGQ IPATAA QGGTHRARVFALTRGD E+AEAVVT TVLVL+MPAYALFDSG
Subjt:  EICIVLLQSTLEGTPAAC--AKAYCF----SGVPHLECPMTGSNTQALGQGIPATAATQGGTHRARVFALTRGDAEHAEAVVTGTVLVLNMPAYALFDSG

Query:  SSHSFIASTFFQHADLELESLGFLLSVSTPSGSVLVTNQVVKGGQLSFDGQTVEVKLIQLDMRDFNVILGMDWLAAYRANIDCSKKEVSFRLPSGQNFTF
        SSHSFIASTF  HADLELESLGFLLSVSTPSGSVLVT+QVVKGGQLSFDGQT+EVKLIQLDM+DF+VILGMDWLAA RANIDCSKK+VSFRLPSGQNFTF
Subjt:  SSHSFIASTFFQHADLELESLGFLLSVSTPSGSVLVTNQVVKGGQLSFDGQTVEVKLIQLDMRDFNVILGMDWLAAYRANIDCSKKEVSFRLPSGQNFTF

Query:  KGAKVGVPRVVSALKASHLLQRGVWAYLASIVDTRKVVPTIEA
        KG K GVPRVV ALKASHLLQRG WAYLAS+VD RKVVP+IEA
Subjt:  KGAKVGVPRVVSALKASHLLQRGVWAYLASIVDTRKVVPTIEA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTCGAGGGTCGATGCACTAGTGTCGGGGCTCTGGGTATAAATGGCCAGGGGCTGACACTACCAAAAGCATGGTTCCGAAGAGCTCGAACAACTGAGACTGTCCTAGA
ATCTAGGTCGTCAGTGTACGGTCCTCGTCAGTCTCCTCGCCATCACCAGACAATGGCTTTTCGACGCAACACGATGGCTCGCAACTACGAGGATCCGAACCCTAGGGGCA
AGGAGGCAGCGGATCCAAATGTTCCCCCGGCAGTTCCTGGAGGGGAAGCACCCCCGGTTCCGCAGGCAGCATCTCAAGGAGTTCCCCAGGTGAATGCCCAGGTGGCGTTA
CTAGCTGAGGCATTGCAAGCATTGCTGAATAATGCGAATGGAGCCGGTGGGGCTCAGGCGCAGCAGCCACGCCGGGCACAGATTCAGCAAGAGGAGATCCAGTTCATCAG
AGATTTCAAACGCTTCGGACCACCCGTTTTCAACGGAGTGAGTGAGAGGCCTACCGCGGCGGAGGAATGGGTCAGGGAGTTGGAAGCCCTTTATGTGTATTTGGGGTGCT
CCGACGAATTCAAGGTCCGGGGAGCAGTGTTTATGCTTCGGGGAGAAGCAGTAAACTGGTGGGAGTCGGTGGCGGCAGCGGAGGATCACACCAACGCACCCGTCACATGG
GCGAGGTTTAAGGACCTACTCTATGAGTACTATTTTCCCGTGACTGTCAGGAACGAAAAACGGGCAGAGTTTCTTCGTCTCAGTCAAGGGAGCCTAACTGTGGCCCAATA
CGAGAGGAAGTTCACTGAGCTGTCCCGGTTTGGAATGCAATATATTCCTACTGAACAATTAAAGATTGACAAGTTCATTGACGGTTTGCGTAGTCAGGTGCGCGTTGGTT
ATGGACAAATGTCTCGAGGAACCTCAGTCTCGACAGATGACGGGCTCCAGCTCGGGGGTCAAGAGGAAATTTGCATCGTTCTCCTCCAATCAACCCTCGAGGGGACACCA
GCAGCTTGTGCAAAGGCATACTGTTTCTCCGGTGTGCCCCACTTGGAGTGTCCGATGACCGGCTCGAATACCCAAGCTTTAGGCCAGGGGATCCCTGCGACGGCGGCAAC
TCAAGGTGGAACCCATAGGGCGCGCGTCTTCGCTCTTACCAGGGGGGATGCTGAACATGCCGAGGCGGTGGTCACAGGGACTGTTTTAGTACTCAACATGCCTGCTTACG
CTTTATTTGATTCGGGGTCTAGTCATTCTTTCATTGCTTCTACTTTTTTTCAGCATGCGGACTTAGAGTTAGAATCGTTGGGCTTTTTGCTGTCGGTATCCACTCCGTCA
GGATCTGTGTTGGTCACTAATCAAGTGGTTAAAGGAGGCCAGCTCTCTTTCGATGGTCAGACCGTGGAGGTGAAGTTAATCCAACTGGATATGCGAGATTTCAATGTGAT
CCTAGGCATGGATTGGTTAGCTGCCTATCGGGCTAACATTGATTGCTCGAAGAAGGAAGTTAGCTTCCGCTTGCCCTCCGGACAGAACTTTACCTTTAAAGGAGCCAAGG
TCGGGGTCCCGAGAGTGGTGTCGGCATTGAAGGCCAGCCATCTTCTCCAGCGTGGTGTCTGGGCCTATTTGGCTAGCATTGTGGATACAAGGAAGGTTGTACCGACCATT
GAGGCAGTTCGTGTGGTCAATGAGTTCTCTGACGTGTTCCCTGAGGACCTCCCCGGCTTGCCTCCGAAGTTGGAACGCTTGGAGGTGGAGCTGACGGTGGATGATGTCTC
CGCGTTGTTGGCTCGACTCTCAGTGGAACCTGGCCTGAGACAGAGGATCACTGTTGCCCAAAAGGAAGACCCTAGCTGGGCCATGGGGATTTCACTCTCTCGGGTGGAAC
ATAGGGCTGCTTGTTTTGCTGGCTCTGCCCTTGATTAA
mRNA sequenceShow/hide mRNA sequence
ATGGTCGAGGGTCGATGCACTAGTGTCGGGGCTCTGGGTATAAATGGCCAGGGGCTGACACTACCAAAAGCATGGTTCCGAAGAGCTCGAACAACTGAGACTGTCCTAGA
ATCTAGGTCGTCAGTGTACGGTCCTCGTCAGTCTCCTCGCCATCACCAGACAATGGCTTTTCGACGCAACACGATGGCTCGCAACTACGAGGATCCGAACCCTAGGGGCA
AGGAGGCAGCGGATCCAAATGTTCCCCCGGCAGTTCCTGGAGGGGAAGCACCCCCGGTTCCGCAGGCAGCATCTCAAGGAGTTCCCCAGGTGAATGCCCAGGTGGCGTTA
CTAGCTGAGGCATTGCAAGCATTGCTGAATAATGCGAATGGAGCCGGTGGGGCTCAGGCGCAGCAGCCACGCCGGGCACAGATTCAGCAAGAGGAGATCCAGTTCATCAG
AGATTTCAAACGCTTCGGACCACCCGTTTTCAACGGAGTGAGTGAGAGGCCTACCGCGGCGGAGGAATGGGTCAGGGAGTTGGAAGCCCTTTATGTGTATTTGGGGTGCT
CCGACGAATTCAAGGTCCGGGGAGCAGTGTTTATGCTTCGGGGAGAAGCAGTAAACTGGTGGGAGTCGGTGGCGGCAGCGGAGGATCACACCAACGCACCCGTCACATGG
GCGAGGTTTAAGGACCTACTCTATGAGTACTATTTTCCCGTGACTGTCAGGAACGAAAAACGGGCAGAGTTTCTTCGTCTCAGTCAAGGGAGCCTAACTGTGGCCCAATA
CGAGAGGAAGTTCACTGAGCTGTCCCGGTTTGGAATGCAATATATTCCTACTGAACAATTAAAGATTGACAAGTTCATTGACGGTTTGCGTAGTCAGGTGCGCGTTGGTT
ATGGACAAATGTCTCGAGGAACCTCAGTCTCGACAGATGACGGGCTCCAGCTCGGGGGTCAAGAGGAAATTTGCATCGTTCTCCTCCAATCAACCCTCGAGGGGACACCA
GCAGCTTGTGCAAAGGCATACTGTTTCTCCGGTGTGCCCCACTTGGAGTGTCCGATGACCGGCTCGAATACCCAAGCTTTAGGCCAGGGGATCCCTGCGACGGCGGCAAC
TCAAGGTGGAACCCATAGGGCGCGCGTCTTCGCTCTTACCAGGGGGGATGCTGAACATGCCGAGGCGGTGGTCACAGGGACTGTTTTAGTACTCAACATGCCTGCTTACG
CTTTATTTGATTCGGGGTCTAGTCATTCTTTCATTGCTTCTACTTTTTTTCAGCATGCGGACTTAGAGTTAGAATCGTTGGGCTTTTTGCTGTCGGTATCCACTCCGTCA
GGATCTGTGTTGGTCACTAATCAAGTGGTTAAAGGAGGCCAGCTCTCTTTCGATGGTCAGACCGTGGAGGTGAAGTTAATCCAACTGGATATGCGAGATTTCAATGTGAT
CCTAGGCATGGATTGGTTAGCTGCCTATCGGGCTAACATTGATTGCTCGAAGAAGGAAGTTAGCTTCCGCTTGCCCTCCGGACAGAACTTTACCTTTAAAGGAGCCAAGG
TCGGGGTCCCGAGAGTGGTGTCGGCATTGAAGGCCAGCCATCTTCTCCAGCGTGGTGTCTGGGCCTATTTGGCTAGCATTGTGGATACAAGGAAGGTTGTACCGACCATT
GAGGCAGTTCGTGTGGTCAATGAGTTCTCTGACGTGTTCCCTGAGGACCTCCCCGGCTTGCCTCCGAAGTTGGAACGCTTGGAGGTGGAGCTGACGGTGGATGATGTCTC
CGCGTTGTTGGCTCGACTCTCAGTGGAACCTGGCCTGAGACAGAGGATCACTGTTGCCCAAAAGGAAGACCCTAGCTGGGCCATGGGGATTTCACTCTCTCGGGTGGAAC
ATAGGGCTGCTTGTTTTGCTGGCTCTGCCCTTGATTAA
Protein sequenceShow/hide protein sequence
MVEGRCTSVGALGINGQGLTLPKAWFRRARTTETVLESRSSVYGPRQSPRHHQTMAFRRNTMARNYEDPNPRGKEAADPNVPPAVPGGEAPPVPQAASQGVPQVNAQVAL
LAEALQALLNNANGAGGAQAQQPRRAQIQQEEIQFIRDFKRFGPPVFNGVSERPTAAEEWVRELEALYVYLGCSDEFKVRGAVFMLRGEAVNWWESVAAAEDHTNAPVTW
ARFKDLLYEYYFPVTVRNEKRAEFLRLSQGSLTVAQYERKFTELSRFGMQYIPTEQLKIDKFIDGLRSQVRVGYGQMSRGTSVSTDDGLQLGGQEEICIVLLQSTLEGTP
AACAKAYCFSGVPHLECPMTGSNTQALGQGIPATAATQGGTHRARVFALTRGDAEHAEAVVTGTVLVLNMPAYALFDSGSSHSFIASTFFQHADLELESLGFLLSVSTPS
GSVLVTNQVVKGGQLSFDGQTVEVKLIQLDMRDFNVILGMDWLAAYRANIDCSKKEVSFRLPSGQNFTFKGAKVGVPRVVSALKASHLLQRGVWAYLASIVDTRKVVPTI
EAVRVVNEFSDVFPEDLPGLPPKLERLEVELTVDDVSALLARLSVEPGLRQRITVAQKEDPSWAMGISLSRVEHRAACFAGSALD