; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g09680 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g09680
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionGag/pol protein
Genome locationchr7:7440878..7448874
RNA-Seq ExpressionMoc07g09680
SyntenyMoc07g09680
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022156067.1 uncharacterized protein LOC111023035 [Momordica charantia]7.6e-17285.41Show/hide
Query:  MLRGEAVNWWESVAAAEDHANVSVTWSRFNDLLYEYYFAVTVRNEKRAEFLRLTQRSLTVAQYERKFTEL------------------------EIKRLL
        MLRGEAVNWWESVAAAEDHANV VTW+RF DLLYEYYF VTVRNEKRAEFLRLTQ SLTVAQY+RKFTEL                        EIK LL
Subjt:  MLRGEAVNWWESVAAAEDHANVSVTWSRFNDLLYEYYFAVTVRNEKRAEFLRLTQRSLTVAQYERKFTEL------------------------EIKRLL

Query:  VLKEPTTYAAAVRCTLVMDKCLEEPQTQQVMGSSSGVKRKFASFSFSQPSKGHQHHVQRQTAPPVCPSCKKSHAGPCWLGKRICFKCQKEEHFARECPMT
        +LKE TTYAAAVRC LVMDKCLEEPQ+QQVMGSSSGVKRKFASFS SQ S GHQH+VQRQTAPP CPSCKK+HAGPCWLGKRICF+CQKE HFARECPMT
Subjt:  VLKEPTTYAAAVRCTLVMDKCLEEPQTQQVMGSSSGVKRKFASFSFSQPSKGHQHHVQRQTAPPVCPSCKKSHAGPCWLGKRICFKCQKEEHFARECPMT

Query:  GSNTQALGQKTPAAAAAQGGTHRARVFALTRGDVEHAEAVVTGTVLVLSIPAYTLFDSGSSHSFIASTFIRHADLELESLGFLLSVSTPSGSVLVTSQMV
        GSNTQALGQKTP AAAAQGGT RARVFALTRGDVEHAEAVVTGT+LVLS+PAY LFDSGSSHSFIASTF++HADLELESLGFLLSVSTPSGSVLVTSQ+V
Subjt:  GSNTQALGQKTPAAAAAQGGTHRARVFALTRGDVEHAEAVVTGTVLVLSIPAYTLFDSGSSHSFIASTFIRHADLELESLGFLLSVSTPSGSVLVTSQMV

Query:  KGGQLSFDGQTLEVKLIQLDIQDFDVILGMDWLAANQANINCSKKEVSFRLPSGQNFTFKGVKAGVPRVV
        KGGQLSFDGQTLEVKLIQLD+QDFDVILGMDWLAAN+ANINCSKKEV+FRLPSGQNFTFKGVKAGVPRVV
Subjt:  KGGQLSFDGQTLEVKLIQLDIQDFDVILGMDWLAANQANINCSKKEVSFRLPSGQNFTFKGVKAGVPRVV

XP_022156328.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111023249 [Momordica charantia]1.3e-21183.01Show/hide
Query:  PVELKVSERPTAAEEWVRVLEALYVYLGCSDDFKVRGAVFMLRGEAVNWWESVAAAEDHANVSVTWSRFNDLLYEYYFAVTVRNEKRAEFLRLTQRSLTV
        PV   VSERPTAAEEWVR LEALYVYLGCSDDFKVRGAVFMLRGEAVNWWESVAAAEDHANV VTW+RF DLLYEYYF V  RNEKR EFLRLTQ SLTV
Subjt:  PVELKVSERPTAAEEWVRVLEALYVYLGCSDDFKVRGAVFMLRGEAVNWWESVAAAEDHANVSVTWSRFNDLLYEYYFAVTVRNEKRAEFLRLTQRSLTV

Query:  AQYERKFTEL------------------------EIKRLLVLKEPTTYAAAVRCTLVMDKCLEEPQTQQVMGSSSGVKRKFASFSFSQPSKGHQHHVQRQ
        AQYERKFTEL                        EIK LLVLKEPTTYAAAVRC LVMDKCLEEPQ+QQV+GS+SGVKRKFASFS SQ S+GHQHH QRQ
Subjt:  AQYERKFTEL------------------------EIKRLLVLKEPTTYAAAVRCTLVMDKCLEEPQTQQVMGSSSGVKRKFASFSFSQPSKGHQHHVQRQ

Query:  TAPPVCPSCKKSHAGPCWLGKRICFKCQKEEHFARECPMTGSNTQALGQKTPAAAAAQGGTHRARVFALTRGDVEHAEAVVTGTVLVLSIPAYTLFDSGS
        TAPPVCPSCKK+HA PCWLGK+ICFKCQKE HF REC MTGSNTQAL QKTP A A QGGT  ARVFALTRGDVEHAEAVVTGT+L+LSIPAY LFDSGS
Subjt:  TAPPVCPSCKKSHAGPCWLGKRICFKCQKEEHFARECPMTGSNTQALGQKTPAAAAAQGGTHRARVFALTRGDVEHAEAVVTGTVLVLSIPAYTLFDSGS

Query:  SHSFIASTFIRHADLELESLGFLLSVSTPSGSVLVTSQMVKGGQLSFDGQTLEVKLIQLDIQDFDVILGMDWLAANQANINCSKKEVSFRLPSGQNFTFK
        SHSFIASTF+RHADLELES GF LSVSTPSGSVLVTSQ+VKGGQLSF GQTLEV LIQL++QDFDVILGMDWLAAN+ANINCSKKEVSF L SGQNFTFK
Subjt:  SHSFIASTFIRHADLELESLGFLLSVSTPSGSVLVTSQMVKGGQLSFDGQTLEVKLIQLDIQDFDVILGMDWLAANQANINCSKKEVSFRLPSGQNFTFK

Query:  GVKAGVPRVVLALKASHLLQRGAWAYLASIVDATKVVPSIEAVRVVNDFTDVFPEDLPGLPLFRE
        GVKAGVPRVV ALKAS+LLQRG WAYLAS+VDA KVVPSIE VRVVN+FTDVFPEDLPGLP FRE
Subjt:  GVKAGVPRVVLALKASHLLQRGAWAYLASIVDATKVVPSIEAVRVVNDFTDVFPEDLPGLPLFRE

XP_022156992.1 uncharacterized protein LOC111023821 [Momordica charantia]1.6e-18576.56Show/hide
Query:  PVELKVSERPTAAEEWVRVLEALYVYLGCSDDFKVRGAVFMLRGEAVNWWESVAAAEDHANVSVTWSRFNDLLYEYYFAVTVRNEKRAEFLRLTQRSLTV
        PV   VSERPTAAEEWVR LEALYVYLGCSDDFKV+GAV                                            NEKRAEFLRLTQ SLTV
Subjt:  PVELKVSERPTAAEEWVRVLEALYVYLGCSDDFKVRGAVFMLRGEAVNWWESVAAAEDHANVSVTWSRFNDLLYEYYFAVTVRNEKRAEFLRLTQRSLTV

Query:  AQYERKFTEL------------------------EIKRLLVLKEPTTYAAAVRCTLVMDKCLEEPQTQQVMGSSSGVKRKFASFSFSQPSKGHQHHVQRQ
        AQYERKFTEL                        EIK LLVLKEPTTYAAAVRC LVMDKCLEEPQ+QQVMGSSSGVKRKFASFS SQPS+GHQHHVQRQ
Subjt:  AQYERKFTEL------------------------EIKRLLVLKEPTTYAAAVRCTLVMDKCLEEPQTQQVMGSSSGVKRKFASFSFSQPSKGHQHHVQRQ

Query:  TAPPVCPSCKKSHAGPCWLGKRICFKCQKEEHFARECPMTGSNTQALGQKTPAAAAAQGGTHRARVFALTRGDVEHAEAVVTGTVLVLSIPAYTLFDSGS
        TAPPVCPSCKKSH GPCWLGK IC++CQKE HFARECPMTG NTQ LGQ+ P   AAQGGTHRARVFALTRGDV HAEAVV GTVLVLS+PAY LFDS S
Subjt:  TAPPVCPSCKKSHAGPCWLGKRICFKCQKEEHFARECPMTGSNTQALGQKTPAAAAAQGGTHRARVFALTRGDVEHAEAVVTGTVLVLSIPAYTLFDSGS

Query:  SHSFIASTFIRHADLELESLGFLLSVSTPSGSVLVTSQMVKGGQLSFDGQTLEVKLIQLDIQDFDVILGMDWLAANQANINCSKKEVSFRLPSGQNFTFK
        SHSFIASTF+RHADLELESLGFLLSVSTPSGSVLVTSQMVKGGQLSFDGQTLEVKLIQLD+QDFDVILGMDWLAANQANI+CSKKE SFRLPS QNFTFK
Subjt:  SHSFIASTFIRHADLELESLGFLLSVSTPSGSVLVTSQMVKGGQLSFDGQTLEVKLIQLDIQDFDVILGMDWLAANQANINCSKKEVSFRLPSGQNFTFK

Query:  GVKAGVPRVVLALKASHLLQRGAWAYLASIVDATKVVPSIEAVRVVNDFTDVFPEDLPGLPLFRE
        GVKA VPRVV ALKASH LQRGAWAYLAS+VDA KVVPSIEAVRVVN+FTDVFPEDLPGLP  RE
Subjt:  GVKAGVPRVVLALKASHLLQRGAWAYLASIVDATKVVPSIEAVRVVNDFTDVFPEDLPGLPLFRE

XP_022157413.1 uncharacterized protein LOC111024114 [Momordica charantia]1.2e-19881.06Show/hide
Query:  PVELKVSERPTAAEEWVRVLEALYVYLGCSDDFKVRGAVFMLRGEAVNWWESVAAAEDHANVSVTWSRFNDLLYEYYFAVTVRNEKRAEFLRLTQRSLTV
        PV   VSERPTA EEWVR LEALYVYLGCSDDFKVRGAVFMLRGEAVNWWESVAAAEDH NV VTW+RF DLLYEYYF VTVRNEKRAEFLRLTQ SLTV
Subjt:  PVELKVSERPTAAEEWVRVLEALYVYLGCSDDFKVRGAVFMLRGEAVNWWESVAAAEDHANVSVTWSRFNDLLYEYYFAVTVRNEKRAEFLRLTQRSLTV

Query:  AQYERKFTEL------------------------EIKRLLVLKEPTTYAAAVRCTLVMDKCLEEPQTQQVMGSSSGVKRKFASFSFSQPSKGHQHHVQRQ
        AQYERKFTEL                        EIK LLV+KEPTTYAAA+RC LVMDKCLEEPQ+QQVMGSSSGVKRKFA FS SQ S+GHQHHVQRQ
Subjt:  AQYERKFTEL------------------------EIKRLLVLKEPTTYAAAVRCTLVMDKCLEEPQTQQVMGSSSGVKRKFASFSFSQPSKGHQHHVQRQ

Query:  TAPPVCPSCKKSHAGPCWLGKRICFKCQKEEHFARECPMTGSNTQALGQKTPAAAAAQGGTHRARVFALTRGDVEHAEAVVTGTVLVLSIPAYTLFDSGS
        TAPPVCPSCKK+HAGPCWLGKRICF+C                     QKTPAAAAAQGGT RARVFALTRGDVEHAEAVVTGT+LV+S+PAY LFDSGS
Subjt:  TAPPVCPSCKKSHAGPCWLGKRICFKCQKEEHFARECPMTGSNTQALGQKTPAAAAAQGGTHRARVFALTRGDVEHAEAVVTGTVLVLSIPAYTLFDSGS

Query:  SHSFIASTFIRHADLELESLGFLLSVSTPSGSVLVTSQMVKGGQLSFDGQTLEVKLIQLDIQDFDVILGMDWLAANQANINCSKKEVSFRLPSGQNFTFK
        SHSFIASTF+RHADLELESLGFLLSVSTPSGSVLV SQ+VKGGQLSFDGQT EVKLIQLD+QDFDVILGMDWLAAN+ANINCSKKEVSFRLPSGQNFTFK
Subjt:  SHSFIASTFIRHADLELESLGFLLSVSTPSGSVLVTSQMVKGGQLSFDGQTLEVKLIQLDIQDFDVILGMDWLAANQANINCSKKEVSFRLPSGQNFTFK

Query:  GVKAGVPRVVLALKASHLLQRGAWAYLASIVDATKVVPSIEAVRVVNDFTDVFP
         VK GVPRVV ALKA++LLQRGAWAYLAS+VDA KVVPSIEAVRVVN+FTDVFP
Subjt:  GVKAGVPRVVLALKASHLLQRGAWAYLASIVDATKVVPSIEAVRVVNDFTDVFP

XP_022158750.1 uncharacterized protein LOC111025215 [Momordica charantia]7.8e-20986.3Show/hide
Query:  KVSERPTAAEEWVRVLEALYVYLGCSDDFKVRGAVFMLRGEAVNWWESVAAAEDHANVSVTWSRFNDLLYEYYFAVTVRNEKRAEFLRLTQRSLTVAQYE
        +VSERPTAAEEWVR LEALYVYLGCSDDFKVRGAVFMLRGEAVNWWESVAAAEDHANV VTW+RF DLLYEYYF VTVRNEKR EFLRLTQ SLTVA+YE
Subjt:  KVSERPTAAEEWVRVLEALYVYLGCSDDFKVRGAVFMLRGEAVNWWESVAAAEDHANVSVTWSRFNDLLYEYYFAVTVRNEKRAEFLRLTQRSLTVAQYE

Query:  RKFTEL------------------------EIKRLLVLKEPTTYAAAVRCTLVMDKCLEEPQTQQVMGSSSGVKRKFASFSFSQPSKGHQHHVQRQTAPP
        RKFTEL                        EIK LLVLKEPTTYAAAVRC LVMDKCLEEPQ+QQV+GSSSGVKRKFASFS SQPS+ HQHHVQRQTAPP
Subjt:  RKFTEL------------------------EIKRLLVLKEPTTYAAAVRCTLVMDKCLEEPQTQQVMGSSSGVKRKFASFSFSQPSKGHQHHVQRQTAPP

Query:  VCPSCKKSHAGPCWLGKRICFKCQKEEHFARECPMTGSNTQALGQKTPAAAAAQGGTHRARVFALTRGDVEHAEAVVTGTVLVLSIPAYTLFDSGSSHSF
        VCPSCKKSHAGPCW+GKRIC++CQKE HFARECPMTGSNTQALGQ+ PA AAAQGGTHRARVFALTRGDVE+AEAVVT TVLVLS+PAY LFDSGSSHSF
Subjt:  VCPSCKKSHAGPCWLGKRICFKCQKEEHFARECPMTGSNTQALGQKTPAAAAAQGGTHRARVFALTRGDVEHAEAVVTGTVLVLSIPAYTLFDSGSSHSF

Query:  IASTFIRHADLELESLGFLLSVSTPSGSVLVTSQMVKGGQLSFDGQTLEVKLIQLDIQDFDVILGMDWLAANQANINCSKKEVSFRLPSGQNFTFKGVKA
        IASTF+ HADLELESLGFLLSVSTPSGSVLVTSQ+VKGGQLSFDGQTLEVKLIQLD+QDFDVILGMDWLAAN+ANI+CSKK+VSFRLPSGQNFTFKGVKA
Subjt:  IASTFIRHADLELESLGFLLSVSTPSGSVLVTSQMVKGGQLSFDGQTLEVKLIQLDIQDFDVILGMDWLAANQANINCSKKEVSFRLPSGQNFTFKGVKA

Query:  GVPRVVLALKASHLLQRGAWAYLASIVDATKVVPSIEA
        GVPRVVLALKASHLLQRGAWAYLAS+VDA KVVPSIEA
Subjt:  GVPRVVLALKASHLLQRGAWAYLASIVDATKVVPSIEA

TrEMBL top hitse value%identityAlignment
A0A6J1DQB9 Reverse transcriptase6.2e-21283.01Show/hide
Query:  PVELKVSERPTAAEEWVRVLEALYVYLGCSDDFKVRGAVFMLRGEAVNWWESVAAAEDHANVSVTWSRFNDLLYEYYFAVTVRNEKRAEFLRLTQRSLTV
        PV   VSERPTAAEEWVR LEALYVYLGCSDDFKVRGAVFMLRGEAVNWWESVAAAEDHANV VTW+RF DLLYEYYF V  RNEKR EFLRLTQ SLTV
Subjt:  PVELKVSERPTAAEEWVRVLEALYVYLGCSDDFKVRGAVFMLRGEAVNWWESVAAAEDHANVSVTWSRFNDLLYEYYFAVTVRNEKRAEFLRLTQRSLTV

Query:  AQYERKFTEL------------------------EIKRLLVLKEPTTYAAAVRCTLVMDKCLEEPQTQQVMGSSSGVKRKFASFSFSQPSKGHQHHVQRQ
        AQYERKFTEL                        EIK LLVLKEPTTYAAAVRC LVMDKCLEEPQ+QQV+GS+SGVKRKFASFS SQ S+GHQHH QRQ
Subjt:  AQYERKFTEL------------------------EIKRLLVLKEPTTYAAAVRCTLVMDKCLEEPQTQQVMGSSSGVKRKFASFSFSQPSKGHQHHVQRQ

Query:  TAPPVCPSCKKSHAGPCWLGKRICFKCQKEEHFARECPMTGSNTQALGQKTPAAAAAQGGTHRARVFALTRGDVEHAEAVVTGTVLVLSIPAYTLFDSGS
        TAPPVCPSCKK+HA PCWLGK+ICFKCQKE HF REC MTGSNTQAL QKTP A A QGGT  ARVFALTRGDVEHAEAVVTGT+L+LSIPAY LFDSGS
Subjt:  TAPPVCPSCKKSHAGPCWLGKRICFKCQKEEHFARECPMTGSNTQALGQKTPAAAAAQGGTHRARVFALTRGDVEHAEAVVTGTVLVLSIPAYTLFDSGS

Query:  SHSFIASTFIRHADLELESLGFLLSVSTPSGSVLVTSQMVKGGQLSFDGQTLEVKLIQLDIQDFDVILGMDWLAANQANINCSKKEVSFRLPSGQNFTFK
        SHSFIASTF+RHADLELES GF LSVSTPSGSVLVTSQ+VKGGQLSF GQTLEV LIQL++QDFDVILGMDWLAAN+ANINCSKKEVSF L SGQNFTFK
Subjt:  SHSFIASTFIRHADLELESLGFLLSVSTPSGSVLVTSQMVKGGQLSFDGQTLEVKLIQLDIQDFDVILGMDWLAANQANINCSKKEVSFRLPSGQNFTFK

Query:  GVKAGVPRVVLALKASHLLQRGAWAYLASIVDATKVVPSIEAVRVVNDFTDVFPEDLPGLPLFRE
        GVKAGVPRVV ALKAS+LLQRG WAYLAS+VDA KVVPSIE VRVVN+FTDVFPEDLPGLP FRE
Subjt:  GVKAGVPRVVLALKASHLLQRGAWAYLASIVDATKVVPSIEAVRVVNDFTDVFPEDLPGLPLFRE

A0A6J1DR22 uncharacterized protein LOC1110230353.7e-17285.41Show/hide
Query:  MLRGEAVNWWESVAAAEDHANVSVTWSRFNDLLYEYYFAVTVRNEKRAEFLRLTQRSLTVAQYERKFTEL------------------------EIKRLL
        MLRGEAVNWWESVAAAEDHANV VTW+RF DLLYEYYF VTVRNEKRAEFLRLTQ SLTVAQY+RKFTEL                        EIK LL
Subjt:  MLRGEAVNWWESVAAAEDHANVSVTWSRFNDLLYEYYFAVTVRNEKRAEFLRLTQRSLTVAQYERKFTEL------------------------EIKRLL

Query:  VLKEPTTYAAAVRCTLVMDKCLEEPQTQQVMGSSSGVKRKFASFSFSQPSKGHQHHVQRQTAPPVCPSCKKSHAGPCWLGKRICFKCQKEEHFARECPMT
        +LKE TTYAAAVRC LVMDKCLEEPQ+QQVMGSSSGVKRKFASFS SQ S GHQH+VQRQTAPP CPSCKK+HAGPCWLGKRICF+CQKE HFARECPMT
Subjt:  VLKEPTTYAAAVRCTLVMDKCLEEPQTQQVMGSSSGVKRKFASFSFSQPSKGHQHHVQRQTAPPVCPSCKKSHAGPCWLGKRICFKCQKEEHFARECPMT

Query:  GSNTQALGQKTPAAAAAQGGTHRARVFALTRGDVEHAEAVVTGTVLVLSIPAYTLFDSGSSHSFIASTFIRHADLELESLGFLLSVSTPSGSVLVTSQMV
        GSNTQALGQKTP AAAAQGGT RARVFALTRGDVEHAEAVVTGT+LVLS+PAY LFDSGSSHSFIASTF++HADLELESLGFLLSVSTPSGSVLVTSQ+V
Subjt:  GSNTQALGQKTPAAAAAQGGTHRARVFALTRGDVEHAEAVVTGTVLVLSIPAYTLFDSGSSHSFIASTFIRHADLELESLGFLLSVSTPSGSVLVTSQMV

Query:  KGGQLSFDGQTLEVKLIQLDIQDFDVILGMDWLAANQANINCSKKEVSFRLPSGQNFTFKGVKAGVPRVV
        KGGQLSFDGQTLEVKLIQLD+QDFDVILGMDWLAAN+ANINCSKKEV+FRLPSGQNFTFKGVKAGVPRVV
Subjt:  KGGQLSFDGQTLEVKLIQLDIQDFDVILGMDWLAANQANINCSKKEVSFRLPSGQNFTFKGVKAGVPRVV

A0A6J1DTA8 uncharacterized protein LOC1110241146.0e-19981.06Show/hide
Query:  PVELKVSERPTAAEEWVRVLEALYVYLGCSDDFKVRGAVFMLRGEAVNWWESVAAAEDHANVSVTWSRFNDLLYEYYFAVTVRNEKRAEFLRLTQRSLTV
        PV   VSERPTA EEWVR LEALYVYLGCSDDFKVRGAVFMLRGEAVNWWESVAAAEDH NV VTW+RF DLLYEYYF VTVRNEKRAEFLRLTQ SLTV
Subjt:  PVELKVSERPTAAEEWVRVLEALYVYLGCSDDFKVRGAVFMLRGEAVNWWESVAAAEDHANVSVTWSRFNDLLYEYYFAVTVRNEKRAEFLRLTQRSLTV

Query:  AQYERKFTEL------------------------EIKRLLVLKEPTTYAAAVRCTLVMDKCLEEPQTQQVMGSSSGVKRKFASFSFSQPSKGHQHHVQRQ
        AQYERKFTEL                        EIK LLV+KEPTTYAAA+RC LVMDKCLEEPQ+QQVMGSSSGVKRKFA FS SQ S+GHQHHVQRQ
Subjt:  AQYERKFTEL------------------------EIKRLLVLKEPTTYAAAVRCTLVMDKCLEEPQTQQVMGSSSGVKRKFASFSFSQPSKGHQHHVQRQ

Query:  TAPPVCPSCKKSHAGPCWLGKRICFKCQKEEHFARECPMTGSNTQALGQKTPAAAAAQGGTHRARVFALTRGDVEHAEAVVTGTVLVLSIPAYTLFDSGS
        TAPPVCPSCKK+HAGPCWLGKRICF+C                     QKTPAAAAAQGGT RARVFALTRGDVEHAEAVVTGT+LV+S+PAY LFDSGS
Subjt:  TAPPVCPSCKKSHAGPCWLGKRICFKCQKEEHFARECPMTGSNTQALGQKTPAAAAAQGGTHRARVFALTRGDVEHAEAVVTGTVLVLSIPAYTLFDSGS

Query:  SHSFIASTFIRHADLELESLGFLLSVSTPSGSVLVTSQMVKGGQLSFDGQTLEVKLIQLDIQDFDVILGMDWLAANQANINCSKKEVSFRLPSGQNFTFK
        SHSFIASTF+RHADLELESLGFLLSVSTPSGSVLV SQ+VKGGQLSFDGQT EVKLIQLD+QDFDVILGMDWLAAN+ANINCSKKEVSFRLPSGQNFTFK
Subjt:  SHSFIASTFIRHADLELESLGFLLSVSTPSGSVLVTSQMVKGGQLSFDGQTLEVKLIQLDIQDFDVILGMDWLAANQANINCSKKEVSFRLPSGQNFTFK

Query:  GVKAGVPRVVLALKASHLLQRGAWAYLASIVDATKVVPSIEAVRVVNDFTDVFP
         VK GVPRVV ALKA++LLQRGAWAYLAS+VDA KVVPSIEAVRVVN+FTDVFP
Subjt:  GVKAGVPRVVLALKASHLLQRGAWAYLASIVDATKVVPSIEAVRVVNDFTDVFP

A0A6J1DTE5 uncharacterized protein LOC1110238217.7e-18676.56Show/hide
Query:  PVELKVSERPTAAEEWVRVLEALYVYLGCSDDFKVRGAVFMLRGEAVNWWESVAAAEDHANVSVTWSRFNDLLYEYYFAVTVRNEKRAEFLRLTQRSLTV
        PV   VSERPTAAEEWVR LEALYVYLGCSDDFKV+GAV                                            NEKRAEFLRLTQ SLTV
Subjt:  PVELKVSERPTAAEEWVRVLEALYVYLGCSDDFKVRGAVFMLRGEAVNWWESVAAAEDHANVSVTWSRFNDLLYEYYFAVTVRNEKRAEFLRLTQRSLTV

Query:  AQYERKFTEL------------------------EIKRLLVLKEPTTYAAAVRCTLVMDKCLEEPQTQQVMGSSSGVKRKFASFSFSQPSKGHQHHVQRQ
        AQYERKFTEL                        EIK LLVLKEPTTYAAAVRC LVMDKCLEEPQ+QQVMGSSSGVKRKFASFS SQPS+GHQHHVQRQ
Subjt:  AQYERKFTEL------------------------EIKRLLVLKEPTTYAAAVRCTLVMDKCLEEPQTQQVMGSSSGVKRKFASFSFSQPSKGHQHHVQRQ

Query:  TAPPVCPSCKKSHAGPCWLGKRICFKCQKEEHFARECPMTGSNTQALGQKTPAAAAAQGGTHRARVFALTRGDVEHAEAVVTGTVLVLSIPAYTLFDSGS
        TAPPVCPSCKKSH GPCWLGK IC++CQKE HFARECPMTG NTQ LGQ+ P   AAQGGTHRARVFALTRGDV HAEAVV GTVLVLS+PAY LFDS S
Subjt:  TAPPVCPSCKKSHAGPCWLGKRICFKCQKEEHFARECPMTGSNTQALGQKTPAAAAAQGGTHRARVFALTRGDVEHAEAVVTGTVLVLSIPAYTLFDSGS

Query:  SHSFIASTFIRHADLELESLGFLLSVSTPSGSVLVTSQMVKGGQLSFDGQTLEVKLIQLDIQDFDVILGMDWLAANQANINCSKKEVSFRLPSGQNFTFK
        SHSFIASTF+RHADLELESLGFLLSVSTPSGSVLVTSQMVKGGQLSFDGQTLEVKLIQLD+QDFDVILGMDWLAANQANI+CSKKE SFRLPS QNFTFK
Subjt:  SHSFIASTFIRHADLELESLGFLLSVSTPSGSVLVTSQMVKGGQLSFDGQTLEVKLIQLDIQDFDVILGMDWLAANQANINCSKKEVSFRLPSGQNFTFK

Query:  GVKAGVPRVVLALKASHLLQRGAWAYLASIVDATKVVPSIEAVRVVNDFTDVFPEDLPGLPLFRE
        GVKA VPRVV ALKASH LQRGAWAYLAS+VDA KVVPSIEAVRVVN+FTDVFPEDLPGLP  RE
Subjt:  GVKAGVPRVVLALKASHLLQRGAWAYLASIVDATKVVPSIEAVRVVNDFTDVFPEDLPGLPLFRE

A0A6J1DWP4 uncharacterized protein LOC1110252153.8e-20986.3Show/hide
Query:  KVSERPTAAEEWVRVLEALYVYLGCSDDFKVRGAVFMLRGEAVNWWESVAAAEDHANVSVTWSRFNDLLYEYYFAVTVRNEKRAEFLRLTQRSLTVAQYE
        +VSERPTAAEEWVR LEALYVYLGCSDDFKVRGAVFMLRGEAVNWWESVAAAEDHANV VTW+RF DLLYEYYF VTVRNEKR EFLRLTQ SLTVA+YE
Subjt:  KVSERPTAAEEWVRVLEALYVYLGCSDDFKVRGAVFMLRGEAVNWWESVAAAEDHANVSVTWSRFNDLLYEYYFAVTVRNEKRAEFLRLTQRSLTVAQYE

Query:  RKFTEL------------------------EIKRLLVLKEPTTYAAAVRCTLVMDKCLEEPQTQQVMGSSSGVKRKFASFSFSQPSKGHQHHVQRQTAPP
        RKFTEL                        EIK LLVLKEPTTYAAAVRC LVMDKCLEEPQ+QQV+GSSSGVKRKFASFS SQPS+ HQHHVQRQTAPP
Subjt:  RKFTEL------------------------EIKRLLVLKEPTTYAAAVRCTLVMDKCLEEPQTQQVMGSSSGVKRKFASFSFSQPSKGHQHHVQRQTAPP

Query:  VCPSCKKSHAGPCWLGKRICFKCQKEEHFARECPMTGSNTQALGQKTPAAAAAQGGTHRARVFALTRGDVEHAEAVVTGTVLVLSIPAYTLFDSGSSHSF
        VCPSCKKSHAGPCW+GKRIC++CQKE HFARECPMTGSNTQALGQ+ PA AAAQGGTHRARVFALTRGDVE+AEAVVT TVLVLS+PAY LFDSGSSHSF
Subjt:  VCPSCKKSHAGPCWLGKRICFKCQKEEHFARECPMTGSNTQALGQKTPAAAAAQGGTHRARVFALTRGDVEHAEAVVTGTVLVLSIPAYTLFDSGSSHSF

Query:  IASTFIRHADLELESLGFLLSVSTPSGSVLVTSQMVKGGQLSFDGQTLEVKLIQLDIQDFDVILGMDWLAANQANINCSKKEVSFRLPSGQNFTFKGVKA
        IASTF+ HADLELESLGFLLSVSTPSGSVLVTSQ+VKGGQLSFDGQTLEVKLIQLD+QDFDVILGMDWLAAN+ANI+CSKK+VSFRLPSGQNFTFKGVKA
Subjt:  IASTFIRHADLELESLGFLLSVSTPSGSVLVTSQMVKGGQLSFDGQTLEVKLIQLDIQDFDVILGMDWLAANQANINCSKKEVSFRLPSGQNFTFKGVKA

Query:  GVPRVVLALKASHLLQRGAWAYLASIVDATKVVPSIEA
        GVPRVVLALKASHLLQRGAWAYLAS+VDA KVVPSIEA
Subjt:  GVPRVVLALKASHLLQRGAWAYLASIVDATKVVPSIEA

SwissProt top hitse value%identityAlignment
Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE23.7e-0421.22Show/hide
Query:  PAPNATVA------ARNAYDRWIKANDKANVYILASISDVLAKKDEDTVTAKEIMDSLQRMFGQPSSQARHEALKFIYNSRMKESSLMREHVLNLMVHFN
        P P AT+           Y RW + +      IL +IS  +        TA +I ++L++++  P S      L+FI  +R  + +L+            
Subjt:  PAPNATVA------ARNAYDRWIKANDKANVYILASISDVLAKKDEDTVTAKEIMDSLQRMFGQPSSQARHEALKFIYNSRMKESSLMREHVLNLMVHFN

Query:  VAESNGAVIDEQSQVSFILESFPKSFLPFRNNAVMTKLEYTLT----TLLNELQTYQSLMKCKGQEGEANVATSKRFNKGSSSGTRSAPSSFGSKTFKKK
             G  +D   QV  +LE+ P  + P  +         +LT     L+N      +L   +     ANV T +  N   +   R       ++ +   
Subjt:  VAESNGAVIDEQSQVSFILESFPKSFLPFRNNAVMTKLEYTLT----TLLNELQTYQSLMKCKGQEGEANVATSKRFNKGSSSGTRSAPSSFGSKTFKKK

Query:  KAAGKGSKPDSAAAAQKGKVKVPEKGKCFHCNMDGHWKRNCPKYLAEKKKAN-----------EGKYDLLVLETCLVENDDSAWILDSGATNHVCYSFQG
               +P S+ +    +   P  G+C  C++ GH  + CP+    +   N           + + +L V       N    W+LDSGAT+H+   F  
Subjt:  KAAGKGSKPDSAAAAQKGKVKVPEKGKCFHCNMDGHWKRNCPKYLAEKKKAN-----------EGKYDLLVLETCLVENDDSAWILDSGATNHVCYSFQG

Query:  ISSWRQLDAGE
        +S  +    G+
Subjt:  ISSWRQLDAGE

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGAATGGAGCCGGTGGAGCTCAAGGTGAGTGAGAGGCCTACTGCGGCCGAGGAATGGGTCAGGGTGTTGGAAGCCCTTTATGTGTATTTGGGATGCTCCGACGATTT
CAAGGTCCGGGGAGCAGTGTTTATGCTTCGGGGAGAAGCAGTAAATTGGTGGGAGTCGGTGGCGGCAGCGGAGGATCACGCCAACGTATCCGTCACGTGGTCAAGATTTA
ATGACCTACTTTATGAGTACTATTTTGCCGTGACTGTCAGGAATGAAAAACGGGCAGAGTTTCTCCGTCTCACTCAAAGGAGCCTAACCGTGGCCCAATACGAGAGGAAG
TTCACTGAGCTGGAGATCAAGAGGCTACTCGTTCTCAAAGAACCAACTACTTATGCAGCGGCAGTCAGGTGTACGTTGGTTATGGATAAGTGTCTCGAGGAACCTCAGAC
TCAGCAGGTAATGGGCTCCAGCTCGGGGGTCAAGAGGAAATTTGCATCGTTCTCCTTCAGTCAACCCTCAAAAGGACACCAGCACCATGTGCAAAGGCAGACTGCTCCTC
CGGTGTGCCCCTCTTGTAAGAAGAGCCATGCTGGGCCATGTTGGTTAGGAAAAAGAATATGTTTCAAGTGCCAGAAGGAAGAACATTTCGCAAGGGAGTGTCCGATGACC
GGCTCGAATACCCAAGCGTTAGGCCAGAAGACCCCTGCGGCAGCTGCAGCTCAAGGTGGAACCCATAGGGCGCGCGTCTTCGCTCTCACCAGGGGCGATGTTGAGCATGC
CGAGGCGGTGGTCACAGGGACTGTTTTAGTGCTTAGTATACCTGCTTACACATTATTTGACTCTGGATCTAGTCATTCTTTCATTGCTTCTACCTTTATTCGACATGCGG
ACCTAGAACTAGAATCGTTAGGCTTTTTGTTGTCAGTATCCACACCGTCAGGATCTGTGTTGGTCACTAGTCAAATGGTAAAAGGAGGCCAGCTCTCTTTCGATGGTCAG
ACCTTGGAGGTAAAATTAATTCAACTGGATATTCAGGATTTCGATGTGATACTAGGCATGGATTGGTTAGCTGCTAACCAGGCTAATATTAATTGCTCAAAGAAGGAAGT
TAGTTTTCGCTTGCCCTCCGGACAAAACTTTACCTTTAAAGGAGTTAAGGCCGGGGTCCCAAGGGTGGTGTTGGCATTGAAGGCCAGCCATCTTCTCCAACGTGGTGCTT
GGGCCTATTTGGCTAGCATTGTGGATGCAACGAAGGTTGTGCCAAGCATTGAGGCGGTTCGTGTGGTTAATGACTTCACTGACGTGTTCCCTGAGGACCTCCCCGGCTTG
CCTCTGTTTCGCGAAAGGATCATTGTTGCCCAAAAGGAAGATCCTAGCTTGGCCAAAGGCTTTAGTATGGTGGGCCATGGAGATTTCACTCTTTCGGTGGGAGAAGGACG
AGTGACAACACATCCTGCGGTCTCCGCCTTTGGTTTGCACCGTGAGGTTTCATACATGACCTGCGTGTCGTCCTGGAGCGACCATCCCTACGGAGGGTTTATTGTATGGA
AATCAAAACCAAGGCAAACTCCAGAAATGGATAGGAGTCTCTTAGCTCCTGCACCTAACGCCACTGTGGCGGCGCGCAACGCCTATGACAGGTGGATCAAGGCTAATGAC
AAGGCTAATGTCTACATCTTGGCGAGCATATCTGATGTGCTTGCTAAGAAGGACGAGGACACGGTCACCGCTAAGGAGATCATGGACTCGCTGCAGAGAATGTTTGGACA
ACCGTCCTCACAGGCTCGACATGAAGCCCTTAAGTTCATTTACAACTCCCGCATGAAGGAGAGCTCCTTAATGCGAGAACACGTTCTCAACCTGATGGTCCACTTCAACG
TGGCTGAGTCGAACGGGGCCGTCATAGACGAGCAAAGTCAGGTCAGCTTCATTCTGGAATCTTTTCCGAAGAGTTTCCTGCCATTCCGCAACAATGCGGTTATGACTAAG
CTGGAGTACACTCTTACCACGCTCCTAAACGAGCTGCAGACCTACCAGTCTCTTATGAAATGTAAGGGACAAGAAGGGGAGGCAAATGTTGCCACCTCAAAGAGGTTCAA
TAAAGGATCGTCCTCTGGAACCAGGTCTGCGCCCTCTTCTTTTGGAAGTAAGACTTTTAAGAAGAAGAAGGCTGCTGGTAAGGGGTCTAAACCTGACTCAGCTGCTGCTG
CCCAGAAAGGCAAGGTCAAGGTTCCAGAGAAAGGAAAGTGTTTCCACTGCAATATGGACGGGCATTGGAAGCGCAACTGCCCAAAGTACTTGGCCGAAAAGAAGAAAGCC
AACGAAGGTAAATATGATTTACTTGTATTGGAAACATGTTTAGTGGAGAATGATGACTCCGCCTGGATACTGGATTCAGGAGCCACTAATCACGTTTGTTATTCATTTCA
AGGAATTAGTTCCTGGAGGCAGCTTGACGCCGGAGAGATGACTCTCAAGGTCGGAACGGAAGAGGTCGTCTTAGCTGTGGCGGGGTGGGTACCGCGAATCGAACTCCAAA
TAGGTGTGGGTAAGCGTCTAAAGTCGTCGAGGGGAAACAAAGGCCACAGATCGGAGCTGTTTGTGGTCGTCGAAGCGCGCACGGCCAAGGGCCGCCGTCGGAACGCCGCA
GAAACGACACTGCCACTGCAGAAACACTGGAACCGCCCCGGGTTTGGGTTCCAGACCCGCGTGAGATGCCGGAAATGGGCTGAAAGCGCGCCGCTACTGCCTTCGACGAC
TGCCGAAGGAGTTCCGAAGTCGCGCCGCAGTTCGGGATTAAGAGTCGCCGCTGCTGTTCGATCCGAAGAAATGCCACTGCTCGTGGGTTGTTATTCCACCGTTGGGTCGT
CGTCGGCCGCCGAAAATGAACTTGTGGCTGCCAAAAATTAA
mRNA sequenceShow/hide mRNA sequence
ATGCGAATGGAGCCGGTGGAGCTCAAGGTGAGTGAGAGGCCTACTGCGGCCGAGGAATGGGTCAGGGTGTTGGAAGCCCTTTATGTGTATTTGGGATGCTCCGACGATTT
CAAGGTCCGGGGAGCAGTGTTTATGCTTCGGGGAGAAGCAGTAAATTGGTGGGAGTCGGTGGCGGCAGCGGAGGATCACGCCAACGTATCCGTCACGTGGTCAAGATTTA
ATGACCTACTTTATGAGTACTATTTTGCCGTGACTGTCAGGAATGAAAAACGGGCAGAGTTTCTCCGTCTCACTCAAAGGAGCCTAACCGTGGCCCAATACGAGAGGAAG
TTCACTGAGCTGGAGATCAAGAGGCTACTCGTTCTCAAAGAACCAACTACTTATGCAGCGGCAGTCAGGTGTACGTTGGTTATGGATAAGTGTCTCGAGGAACCTCAGAC
TCAGCAGGTAATGGGCTCCAGCTCGGGGGTCAAGAGGAAATTTGCATCGTTCTCCTTCAGTCAACCCTCAAAAGGACACCAGCACCATGTGCAAAGGCAGACTGCTCCTC
CGGTGTGCCCCTCTTGTAAGAAGAGCCATGCTGGGCCATGTTGGTTAGGAAAAAGAATATGTTTCAAGTGCCAGAAGGAAGAACATTTCGCAAGGGAGTGTCCGATGACC
GGCTCGAATACCCAAGCGTTAGGCCAGAAGACCCCTGCGGCAGCTGCAGCTCAAGGTGGAACCCATAGGGCGCGCGTCTTCGCTCTCACCAGGGGCGATGTTGAGCATGC
CGAGGCGGTGGTCACAGGGACTGTTTTAGTGCTTAGTATACCTGCTTACACATTATTTGACTCTGGATCTAGTCATTCTTTCATTGCTTCTACCTTTATTCGACATGCGG
ACCTAGAACTAGAATCGTTAGGCTTTTTGTTGTCAGTATCCACACCGTCAGGATCTGTGTTGGTCACTAGTCAAATGGTAAAAGGAGGCCAGCTCTCTTTCGATGGTCAG
ACCTTGGAGGTAAAATTAATTCAACTGGATATTCAGGATTTCGATGTGATACTAGGCATGGATTGGTTAGCTGCTAACCAGGCTAATATTAATTGCTCAAAGAAGGAAGT
TAGTTTTCGCTTGCCCTCCGGACAAAACTTTACCTTTAAAGGAGTTAAGGCCGGGGTCCCAAGGGTGGTGTTGGCATTGAAGGCCAGCCATCTTCTCCAACGTGGTGCTT
GGGCCTATTTGGCTAGCATTGTGGATGCAACGAAGGTTGTGCCAAGCATTGAGGCGGTTCGTGTGGTTAATGACTTCACTGACGTGTTCCCTGAGGACCTCCCCGGCTTG
CCTCTGTTTCGCGAAAGGATCATTGTTGCCCAAAAGGAAGATCCTAGCTTGGCCAAAGGCTTTAGTATGGTGGGCCATGGAGATTTCACTCTTTCGGTGGGAGAAGGACG
AGTGACAACACATCCTGCGGTCTCCGCCTTTGGTTTGCACCGTGAGGTTTCATACATGACCTGCGTGTCGTCCTGGAGCGACCATCCCTACGGAGGGTTTATTGTATGGA
AATCAAAACCAAGGCAAACTCCAGAAATGGATAGGAGTCTCTTAGCTCCTGCACCTAACGCCACTGTGGCGGCGCGCAACGCCTATGACAGGTGGATCAAGGCTAATGAC
AAGGCTAATGTCTACATCTTGGCGAGCATATCTGATGTGCTTGCTAAGAAGGACGAGGACACGGTCACCGCTAAGGAGATCATGGACTCGCTGCAGAGAATGTTTGGACA
ACCGTCCTCACAGGCTCGACATGAAGCCCTTAAGTTCATTTACAACTCCCGCATGAAGGAGAGCTCCTTAATGCGAGAACACGTTCTCAACCTGATGGTCCACTTCAACG
TGGCTGAGTCGAACGGGGCCGTCATAGACGAGCAAAGTCAGGTCAGCTTCATTCTGGAATCTTTTCCGAAGAGTTTCCTGCCATTCCGCAACAATGCGGTTATGACTAAG
CTGGAGTACACTCTTACCACGCTCCTAAACGAGCTGCAGACCTACCAGTCTCTTATGAAATGTAAGGGACAAGAAGGGGAGGCAAATGTTGCCACCTCAAAGAGGTTCAA
TAAAGGATCGTCCTCTGGAACCAGGTCTGCGCCCTCTTCTTTTGGAAGTAAGACTTTTAAGAAGAAGAAGGCTGCTGGTAAGGGGTCTAAACCTGACTCAGCTGCTGCTG
CCCAGAAAGGCAAGGTCAAGGTTCCAGAGAAAGGAAAGTGTTTCCACTGCAATATGGACGGGCATTGGAAGCGCAACTGCCCAAAGTACTTGGCCGAAAAGAAGAAAGCC
AACGAAGGTAAATATGATTTACTTGTATTGGAAACATGTTTAGTGGAGAATGATGACTCCGCCTGGATACTGGATTCAGGAGCCACTAATCACGTTTGTTATTCATTTCA
AGGAATTAGTTCCTGGAGGCAGCTTGACGCCGGAGAGATGACTCTCAAGGTCGGAACGGAAGAGGTCGTCTTAGCTGTGGCGGGGTGGGTACCGCGAATCGAACTCCAAA
TAGGTGTGGGTAAGCGTCTAAAGTCGTCGAGGGGAAACAAAGGCCACAGATCGGAGCTGTTTGTGGTCGTCGAAGCGCGCACGGCCAAGGGCCGCCGTCGGAACGCCGCA
GAAACGACACTGCCACTGCAGAAACACTGGAACCGCCCCGGGTTTGGGTTCCAGACCCGCGTGAGATGCCGGAAATGGGCTGAAAGCGCGCCGCTACTGCCTTCGACGAC
TGCCGAAGGAGTTCCGAAGTCGCGCCGCAGTTCGGGATTAAGAGTCGCCGCTGCTGTTCGATCCGAAGAAATGCCACTGCTCGTGGGTTGTTATTCCACCGTTGGGTCGT
CGTCGGCCGCCGAAAATGAACTTGTGGCTGCCAAAAATTAA
Protein sequenceShow/hide protein sequence
MRMEPVELKVSERPTAAEEWVRVLEALYVYLGCSDDFKVRGAVFMLRGEAVNWWESVAAAEDHANVSVTWSRFNDLLYEYYFAVTVRNEKRAEFLRLTQRSLTVAQYERK
FTELEIKRLLVLKEPTTYAAAVRCTLVMDKCLEEPQTQQVMGSSSGVKRKFASFSFSQPSKGHQHHVQRQTAPPVCPSCKKSHAGPCWLGKRICFKCQKEEHFARECPMT
GSNTQALGQKTPAAAAAQGGTHRARVFALTRGDVEHAEAVVTGTVLVLSIPAYTLFDSGSSHSFIASTFIRHADLELESLGFLLSVSTPSGSVLVTSQMVKGGQLSFDGQ
TLEVKLIQLDIQDFDVILGMDWLAANQANINCSKKEVSFRLPSGQNFTFKGVKAGVPRVVLALKASHLLQRGAWAYLASIVDATKVVPSIEAVRVVNDFTDVFPEDLPGL
PLFRERIIVAQKEDPSLAKGFSMVGHGDFTLSVGEGRVTTHPAVSAFGLHREVSYMTCVSSWSDHPYGGFIVWKSKPRQTPEMDRSLLAPAPNATVAARNAYDRWIKAND
KANVYILASISDVLAKKDEDTVTAKEIMDSLQRMFGQPSSQARHEALKFIYNSRMKESSLMREHVLNLMVHFNVAESNGAVIDEQSQVSFILESFPKSFLPFRNNAVMTK
LEYTLTTLLNELQTYQSLMKCKGQEGEANVATSKRFNKGSSSGTRSAPSSFGSKTFKKKKAAGKGSKPDSAAAAQKGKVKVPEKGKCFHCNMDGHWKRNCPKYLAEKKKA
NEGKYDLLVLETCLVENDDSAWILDSGATNHVCYSFQGISSWRQLDAGEMTLKVGTEEVVLAVAGWVPRIELQIGVGKRLKSSRGNKGHRSELFVVVEARTAKGRRRNAA
ETTLPLQKHWNRPGFGFQTRVRCRKWAESAPLLPSTTAEGVPKSRRSSGLRVAAAVRSEEMPLLVGCYSTVGSSSAAENELVAAKN