; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g18960 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g18960
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionReverse transcriptase
Genome locationchr4:13866863..13878200
RNA-Seq ExpressionMoc04g18960
SyntenyMoc04g18960
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR001969 - Aspartic peptidase, active site
IPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022155925.1 uncharacterized protein LOC111022925 [Momordica charantia]5.0e-20780.21Show/hide
Query:  VALLAEALQVLLDNANGAGGAQVQQPRRAQIPQEEVQFIRDFKRFGPPVFNGVSERPTTTEEWVRELEALYVYLGCSDDFKVRGAVFMLRGEVVNWWESV
        V LLAEALQVLLDNANGAGGAQVQQP R QI QEEVQFIRDFKRFGPPVFNGVSERPT  EEWVRELEALYVYLGCSDDFKVRGAVFML+GE VNWWESV
Subjt:  VALLAEALQVLLDNANGAGGAQVQQPRRAQIPQEEVQFIRDFKRFGPPVFNGVSERPTTTEEWVRELEALYVYLGCSDDFKVRGAVFMLRGEVVNWWESV

Query:  AAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTVAQYERKFTKLSRFGMQYIPTEQLKIDKFIDGLRREIKGLLVLKEPTTYAAAVR
        AAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQ SL VAQYERKFT+LSRFGMQYIPTEQLKIDKFIDGLRREIKGLLVLKEPTTYAAAVR
Subjt:  AAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTVAQYERKFTKLSRFGMQYIPTEQLKIDKFIDGLRREIKGLLVLKEPTTYAAAVR

Query:  CALVMDKCLEEPQSQQVIGCSSTVKRKFASFSSSQPSRGHQHHVQRQTAPPVCPSCKKSHAGPCWLGKRICFKCQKEGHFARECPMTSSNTQALGQKTPA
        CALVMDKCLEEPQSQQVIG SS VKRKFASFSSSQPSRGHQH+ QRQT PP CPSCKK+HAGPCW+GKRIC++CQKEGHFAREC MT SNTQALGQ+ PA
Subjt:  CALVMDKCLEEPQSQQVIGCSSTVKRKFASFSSSQPSRGHQHHVQRQTAPPVCPSCKKSHAGPCWLGKRICFKCQKEGHFARECPMTSSNTQALGQKTPA

Query:  TAAAQDGTHRARVFAHTRGDVEHAEAVVTGTVLVL-SMPAYALFDSGSSHSFIASTFVRHADLKLESLGFLLSDFDVILGMDWLAANRANINCSKKEVSF
        TA  Q                        GTVLVL S+   +    G   SF          LK+  +   + DFDVILGMDWLAANRANI+CSKKEVSF
Subjt:  TAAAQDGTHRARVFAHTRGDVEHAEAVVTGTVLVL-SMPAYALFDSGSSHSFIASTFVRHADLKLESLGFLLSDFDVILGMDWLAANRANINCSKKEVSF

Query:  HLPSGQNFTFKGVKAGVPRVVSVLKASHLLQRGAWAYLASVVDARKVVPSLEAVRVVNEFTDVFPEDLPG
         LPSGQNFTFKG+K GVPRVVS LKASHLLQRG WAYLASV+DA KVVPS+EAVRVVNEFTDVFPEDL G
Subjt:  HLPSGQNFTFKGVKAGVPRVVSVLKASHLLQRGAWAYLASVVDARKVVPSLEAVRVVNEFTDVFPEDLPG

XP_022156328.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111023249 [Momordica charantia]6.1e-23783.6Show/hide
Query:  VALLAEALQVLLDNANGAGGAQVQQPRRAQIPQEEVQFIRDFKRFGPPVFNGVSERPTTTEEWVRELEALYVYLGCSDDFKVRGAVFMLRGEVVNWWESV
        VALLAEALQVLL NANGAGGAQVQQPRRAQIPQ+EVQFIRDFK FGPPVFNGVSERPT  EEWVRELEALYVYLGCSDDFKVRGAVFMLRGE VNWWESV
Subjt:  VALLAEALQVLLDNANGAGGAQVQQPRRAQIPQEEVQFIRDFKRFGPPVFNGVSERPTTTEEWVRELEALYVYLGCSDDFKVRGAVFMLRGEVVNWWESV

Query:  AAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTVAQYERKFTKLSRFGMQYIPTEQLKIDKFIDGLRREIKGLLVLKEPTTYAAAVR
        AAAEDHANVPVTWARFKDLLYEYYFPV  RNEKR EFLRLTQGSLTVAQYERKFT+LSRFG QY+PTEQLKIDKFIDGLRREIKGLLVLKEPTTYAAAVR
Subjt:  AAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTVAQYERKFTKLSRFGMQYIPTEQLKIDKFIDGLRREIKGLLVLKEPTTYAAAVR

Query:  CALVMDKCLEEPQSQQVIGCSSTVKRKFASFSSSQPSRGHQHHVQRQTAPPVCPSCKKSHAGPCWLGKRICFKCQKEGHFARECPMTSSNTQALGQKTPA
        CALVMDKCLEEPQSQQVIG +S VKRKFASFS+SQ SRGHQHH QRQTAPPVCPSCKK+HA PCWLGK+ICFKCQKEGHF REC MT SNTQAL QKTP 
Subjt:  CALVMDKCLEEPQSQQVIGCSSTVKRKFASFSSSQPSRGHQHHVQRQTAPPVCPSCKKSHAGPCWLGKRICFKCQKEGHFARECPMTSSNTQALGQKTPA

Query:  TAAAQDGTHRARVFAHTRGDVEHAEAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHADLKLESLGFLLS----------------------------
          A Q GT  ARVFA TRGDVEHAEAVVTGT+L+LS+PAYALFDSGSSHSFIASTFVRHADL+LES GF LS                            
Subjt:  TAAAQDGTHRARVFAHTRGDVEHAEAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHADLKLESLGFLLS----------------------------

Query:  ---------DFDVILGMDWLAANRANINCSKKEVSFHLPSGQNFTFKGVKAGVPRVVSVLKASHLLQRGAWAYLASVVDARKVVPSLEAVRVVNEFTDVF
                 DFDVILGMDWLAANRANINCSKKEVSF L SGQNFTFKGVKAGVPRVVS LKAS+LLQRG WAYLASVVDARKVVPS+E VRVVNEFTDVF
Subjt:  ---------DFDVILGMDWLAANRANINCSKKEVSFHLPSGQNFTFKGVKAGVPRVVSVLKASHLLQRGAWAYLASVVDARKVVPSLEAVRVVNEFTDVF

Query:  PEDLPG
        PEDLPG
Subjt:  PEDLPG

XP_022156328.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111023249 [Momordica charantia]5.5e-0490.62Show/hide
Query:  PSLRQRIIVAQKEDLSLAKGFSMVGHEDFTHS
        PSLRQRIIVAQKED SLAKGFSMVGH DFT S
Subjt:  PSLRQRIIVAQKEDLSLAKGFSMVGHEDFTHS

XP_022156328.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111023249 [Momordica charantia]3.0e-22882.63Show/hide
Query:  VALLAEALQVLLDNANGAGGAQVQQPRRAQIPQEEVQFIRDFKRFGPPVFNGVSERPTTTEEWVRELEALYVYLGCSDDFKVRGAVFMLRGEVVNWWESV
        VALLAEALQVLLDNANGAGGAQVQQPRRAQIPQ+EVQFIRDFKRFGPPVFNGVSERPT TEEWVRELEALYVYLGCSDDFKVRGAVFMLRGE VNWWESV
Subjt:  VALLAEALQVLLDNANGAGGAQVQQPRRAQIPQEEVQFIRDFKRFGPPVFNGVSERPTTTEEWVRELEALYVYLGCSDDFKVRGAVFMLRGEVVNWWESV

Query:  AAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTVAQYERKFTKLSRFGMQYIPTEQLKIDKFIDGLRREIKGLLVLKEPTTYAAAVR
        AAAEDH NVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTVAQYERKFT+LSRFGMQYIPTEQLKIDKFIDGLR EIKGLLV+KEPTTYAAA+R
Subjt:  AAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTVAQYERKFTKLSRFGMQYIPTEQLKIDKFIDGLRREIKGLLVLKEPTTYAAAVR

Query:  CALVMDKCLEEPQSQQVIGCSSTVKRKFASFSSSQPSRGHQHHVQRQTAPPVCPSCKKSHAGPCWLGKRICFKCQKEGHFARECPMTSSNTQALGQKTPA
        CALVMDKCLEEPQSQQV+G SS VKRKFA FSSSQ SRGHQHHVQRQTAPPVCPSCKK+HAGPCWLGKRICF+C                     QKTPA
Subjt:  CALVMDKCLEEPQSQQVIGCSSTVKRKFASFSSSQPSRGHQHHVQRQTAPPVCPSCKKSHAGPCWLGKRICFKCQKEGHFARECPMTSSNTQALGQKTPA

Query:  TAAAQDGTHRARVFAHTRGDVEHAEAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHADLKLESLGFLLS----------------------------
         AAAQ GT RARVFA TRGDVEHAEAVVTGT+LV+SMPAYALFDSGSSHSFIASTFVRHADL+LESLGFLLS                            
Subjt:  TAAAQDGTHRARVFAHTRGDVEHAEAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHADLKLESLGFLLS----------------------------

Query:  ---------DFDVILGMDWLAANRANINCSKKEVSFHLPSGQNFTFKGVKAGVPRVVSVLKASHLLQRGAWAYLASVVDARKVVPSLEAVRVVNEFTDVF
                 DFDVILGMDWLAANRANINCSKKEVSF LPSGQNFTFK VK GVPRVVS LKA++LLQRGAWAYLASVVDARKVVPS+EAVRVVNEFTDVF
Subjt:  ---------DFDVILGMDWLAANRANINCSKKEVSFHLPSGQNFTFKGVKAGVPRVVSVLKASHLLQRGAWAYLASVVDARKVVPSLEAVRVVNEFTDVF

Query:  P
        P
Subjt:  P

XP_022156992.1 uncharacterized protein LOC111023821 [Momordica charantia]1.7e-19975.97Show/hide
Query:  NGAGGAQVQQPRRAQIPQEEVQFIRDFKRFGPPVFNGVSERPTTTEEWVRELEALYVYLGCSDDFKVRGAVFMLRGEVVNWWESVAAAEDHANVPVTWAR
        NGAGGAQVQQPRRAQ PQEEVQFIRDFKRFGPPVFNGVSERPT  EEWVRELEALYVYLGCSDDFKV+GAV                             
Subjt:  NGAGGAQVQQPRRAQIPQEEVQFIRDFKRFGPPVFNGVSERPTTTEEWVRELEALYVYLGCSDDFKVRGAVFMLRGEVVNWWESVAAAEDHANVPVTWAR

Query:  FKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTVAQYERKFTKLSRFGMQYIPTEQLKIDKFIDGLRREIKGLLVLKEPTTYAAAVRCALVMDKCLEEPQSQ
                       NEKRAEFLRLTQGSLTVAQYERKFT+LSRF MQYIP EQLKIDKFIDGL REIKGLLVLKEPTTYAAAVRCALVMDKCLEEPQSQ
Subjt:  FKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTVAQYERKFTKLSRFGMQYIPTEQLKIDKFIDGLRREIKGLLVLKEPTTYAAAVRCALVMDKCLEEPQSQ

Query:  QVIGCSSTVKRKFASFSSSQPSRGHQHHVQRQTAPPVCPSCKKSHAGPCWLGKRICFKCQKEGHFARECPMTSSNTQALGQKTPATAAAQDGTHRARVFA
        QV+G SS VKRKFASFSSSQPSRGHQHHVQRQTAPPVCPSCKKSH GPCWLGK IC++CQKEGHFARECPMT  NTQ LGQ+ P T AAQ GTHRARVFA
Subjt:  QVIGCSSTVKRKFASFSSSQPSRGHQHHVQRQTAPPVCPSCKKSHAGPCWLGKRICFKCQKEGHFARECPMTSSNTQALGQKTPATAAAQDGTHRARVFA

Query:  HTRGDVEHAEAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHADLKLESLGFLLS-------------------------------------DFDVIL
         TRGDV HAEAVV GTVLVLSMPAYALFDS SSHSFIASTFVRHADL+LESLGFLLS                                     DFDVIL
Subjt:  HTRGDVEHAEAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHADLKLESLGFLLS-------------------------------------DFDVIL

Query:  GMDWLAANRANINCSKKEVSFHLPSGQNFTFKGVKAGVPRVVSVLKASHLLQRGAWAYLASVVDARKVVPSLEAVRVVNEFTDVFPEDLPG
        GMDWLAAN+ANI+CSKKE SF LPS QNFTFKGVKA VPRVVS LKASH LQRGAWAYLASVVDARKVVPS+EAVRVVNEFTDVFPEDLPG
Subjt:  GMDWLAANRANINCSKKEVSFHLPSGQNFTFKGVKAGVPRVVSVLKASHLLQRGAWAYLASVVDARKVVPSLEAVRVVNEFTDVFPEDLPG

XP_022158750.1 uncharacterized protein LOC111025215 [Momordica charantia]9.4e-22283.03Show/hide
Query:  VALLAEALQVLLDNANGAGGAQVQQPRRAQIPQEEVQFIRDFKRFGPPVFNGVSERPTTTEEWVRELEALYVYLGCSDDFKVRGAVFMLRGEVVNWWESV
        VALLAEALQVLLDNANGAGGAQVQQPR AQIPQEE                 VSERPT  EEWVRELEALYVYLGCSDDFKVRGAVFMLRGE VNWWESV
Subjt:  VALLAEALQVLLDNANGAGGAQVQQPRRAQIPQEEVQFIRDFKRFGPPVFNGVSERPTTTEEWVRELEALYVYLGCSDDFKVRGAVFMLRGEVVNWWESV

Query:  AAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTVAQYERKFTKLSRFGMQYIPTEQLKIDKFIDGLRREIKGLLVLKEPTTYAAAVR
        AAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKR EFLRLTQGSLTVA+YERKFT+LSRFGMQYIPT+QLKIDKFIDGLRREIKGLLVLKEPTTYAAAVR
Subjt:  AAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTVAQYERKFTKLSRFGMQYIPTEQLKIDKFIDGLRREIKGLLVLKEPTTYAAAVR

Query:  CALVMDKCLEEPQSQQVIGCSSTVKRKFASFSSSQPSRGHQHHVQRQTAPPVCPSCKKSHAGPCWLGKRICFKCQKEGHFARECPMTSSNTQALGQKTPA
        CALVMDKCLEEPQSQQVIG SS VKRKFASFSSSQPSR HQHHVQRQTAPPVCPSCKKSHAGPCW+GKRIC++CQKEGHFARECPMT SNTQALGQ+ PA
Subjt:  CALVMDKCLEEPQSQQVIGCSSTVKRKFASFSSSQPSRGHQHHVQRQTAPPVCPSCKKSHAGPCWLGKRICFKCQKEGHFARECPMTSSNTQALGQKTPA

Query:  TAAAQDGTHRARVFAHTRGDVEHAEAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHADLKLESLGFLLS----------------------------
        TAAAQ GTHRARVFA TRGDVE+AEAVVT TVLVLSMPAYALFDSGSSHSFIASTFV HADL+LESLGFLLS                            
Subjt:  TAAAQDGTHRARVFAHTRGDVEHAEAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHADLKLESLGFLLS----------------------------

Query:  ---------DFDVILGMDWLAANRANINCSKKEVSFHLPSGQNFTFKGVKAGVPRVVSVLKASHLLQRGAWAYLASVVDARKVVPSLEA
                 DFDVILGMDWLAANRANI+CSKK+VSF LPSGQNFTFKGVKAGVPRVV  LKASHLLQRGAWAYLASVVDARKVVPS+EA
Subjt:  ---------DFDVILGMDWLAANRANINCSKKEVSFHLPSGQNFTFKGVKAGVPRVVSVLKASHLLQRGAWAYLASVVDARKVVPSLEA

TrEMBL top hitse value%identityAlignment
A0A6J1DNV8 uncharacterized protein LOC1110229252.4e-20780.21Show/hide
Query:  VALLAEALQVLLDNANGAGGAQVQQPRRAQIPQEEVQFIRDFKRFGPPVFNGVSERPTTTEEWVRELEALYVYLGCSDDFKVRGAVFMLRGEVVNWWESV
        V LLAEALQVLLDNANGAGGAQVQQP R QI QEEVQFIRDFKRFGPPVFNGVSERPT  EEWVRELEALYVYLGCSDDFKVRGAVFML+GE VNWWESV
Subjt:  VALLAEALQVLLDNANGAGGAQVQQPRRAQIPQEEVQFIRDFKRFGPPVFNGVSERPTTTEEWVRELEALYVYLGCSDDFKVRGAVFMLRGEVVNWWESV

Query:  AAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTVAQYERKFTKLSRFGMQYIPTEQLKIDKFIDGLRREIKGLLVLKEPTTYAAAVR
        AAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQ SL VAQYERKFT+LSRFGMQYIPTEQLKIDKFIDGLRREIKGLLVLKEPTTYAAAVR
Subjt:  AAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTVAQYERKFTKLSRFGMQYIPTEQLKIDKFIDGLRREIKGLLVLKEPTTYAAAVR

Query:  CALVMDKCLEEPQSQQVIGCSSTVKRKFASFSSSQPSRGHQHHVQRQTAPPVCPSCKKSHAGPCWLGKRICFKCQKEGHFARECPMTSSNTQALGQKTPA
        CALVMDKCLEEPQSQQVIG SS VKRKFASFSSSQPSRGHQH+ QRQT PP CPSCKK+HAGPCW+GKRIC++CQKEGHFAREC MT SNTQALGQ+ PA
Subjt:  CALVMDKCLEEPQSQQVIGCSSTVKRKFASFSSSQPSRGHQHHVQRQTAPPVCPSCKKSHAGPCWLGKRICFKCQKEGHFARECPMTSSNTQALGQKTPA

Query:  TAAAQDGTHRARVFAHTRGDVEHAEAVVTGTVLVL-SMPAYALFDSGSSHSFIASTFVRHADLKLESLGFLLSDFDVILGMDWLAANRANINCSKKEVSF
        TA  Q                        GTVLVL S+   +    G   SF          LK+  +   + DFDVILGMDWLAANRANI+CSKKEVSF
Subjt:  TAAAQDGTHRARVFAHTRGDVEHAEAVVTGTVLVL-SMPAYALFDSGSSHSFIASTFVRHADLKLESLGFLLSDFDVILGMDWLAANRANINCSKKEVSF

Query:  HLPSGQNFTFKGVKAGVPRVVSVLKASHLLQRGAWAYLASVVDARKVVPSLEAVRVVNEFTDVFPEDLPG
         LPSGQNFTFKG+K GVPRVVS LKASHLLQRG WAYLASV+DA KVVPS+EAVRVVNEFTDVFPEDL G
Subjt:  HLPSGQNFTFKGVKAGVPRVVSVLKASHLLQRGAWAYLASVVDARKVVPSLEAVRVVNEFTDVFPEDLPG

A0A6J1DQB9 Reverse transcriptase2.9e-23783.6Show/hide
Query:  VALLAEALQVLLDNANGAGGAQVQQPRRAQIPQEEVQFIRDFKRFGPPVFNGVSERPTTTEEWVRELEALYVYLGCSDDFKVRGAVFMLRGEVVNWWESV
        VALLAEALQVLL NANGAGGAQVQQPRRAQIPQ+EVQFIRDFK FGPPVFNGVSERPT  EEWVRELEALYVYLGCSDDFKVRGAVFMLRGE VNWWESV
Subjt:  VALLAEALQVLLDNANGAGGAQVQQPRRAQIPQEEVQFIRDFKRFGPPVFNGVSERPTTTEEWVRELEALYVYLGCSDDFKVRGAVFMLRGEVVNWWESV

Query:  AAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTVAQYERKFTKLSRFGMQYIPTEQLKIDKFIDGLRREIKGLLVLKEPTTYAAAVR
        AAAEDHANVPVTWARFKDLLYEYYFPV  RNEKR EFLRLTQGSLTVAQYERKFT+LSRFG QY+PTEQLKIDKFIDGLRREIKGLLVLKEPTTYAAAVR
Subjt:  AAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTVAQYERKFTKLSRFGMQYIPTEQLKIDKFIDGLRREIKGLLVLKEPTTYAAAVR

Query:  CALVMDKCLEEPQSQQVIGCSSTVKRKFASFSSSQPSRGHQHHVQRQTAPPVCPSCKKSHAGPCWLGKRICFKCQKEGHFARECPMTSSNTQALGQKTPA
        CALVMDKCLEEPQSQQVIG +S VKRKFASFS+SQ SRGHQHH QRQTAPPVCPSCKK+HA PCWLGK+ICFKCQKEGHF REC MT SNTQAL QKTP 
Subjt:  CALVMDKCLEEPQSQQVIGCSSTVKRKFASFSSSQPSRGHQHHVQRQTAPPVCPSCKKSHAGPCWLGKRICFKCQKEGHFARECPMTSSNTQALGQKTPA

Query:  TAAAQDGTHRARVFAHTRGDVEHAEAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHADLKLESLGFLLS----------------------------
          A Q GT  ARVFA TRGDVEHAEAVVTGT+L+LS+PAYALFDSGSSHSFIASTFVRHADL+LES GF LS                            
Subjt:  TAAAQDGTHRARVFAHTRGDVEHAEAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHADLKLESLGFLLS----------------------------

Query:  ---------DFDVILGMDWLAANRANINCSKKEVSFHLPSGQNFTFKGVKAGVPRVVSVLKASHLLQRGAWAYLASVVDARKVVPSLEAVRVVNEFTDVF
                 DFDVILGMDWLAANRANINCSKKEVSF L SGQNFTFKGVKAGVPRVVS LKAS+LLQRG WAYLASVVDARKVVPS+E VRVVNEFTDVF
Subjt:  ---------DFDVILGMDWLAANRANINCSKKEVSFHLPSGQNFTFKGVKAGVPRVVSVLKASHLLQRGAWAYLASVVDARKVVPSLEAVRVVNEFTDVF

Query:  PEDLPG
        PEDLPG
Subjt:  PEDLPG

A0A6J1DQB9 Reverse transcriptase2.7e-0490.62Show/hide
Query:  PSLRQRIIVAQKEDLSLAKGFSMVGHEDFTHS
        PSLRQRIIVAQKED SLAKGFSMVGH DFT S
Subjt:  PSLRQRIIVAQKEDLSLAKGFSMVGHEDFTHS

A0A6J1DQB9 Reverse transcriptase1.5e-22882.63Show/hide
Query:  VALLAEALQVLLDNANGAGGAQVQQPRRAQIPQEEVQFIRDFKRFGPPVFNGVSERPTTTEEWVRELEALYVYLGCSDDFKVRGAVFMLRGEVVNWWESV
        VALLAEALQVLLDNANGAGGAQVQQPRRAQIPQ+EVQFIRDFKRFGPPVFNGVSERPT TEEWVRELEALYVYLGCSDDFKVRGAVFMLRGE VNWWESV
Subjt:  VALLAEALQVLLDNANGAGGAQVQQPRRAQIPQEEVQFIRDFKRFGPPVFNGVSERPTTTEEWVRELEALYVYLGCSDDFKVRGAVFMLRGEVVNWWESV

Query:  AAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTVAQYERKFTKLSRFGMQYIPTEQLKIDKFIDGLRREIKGLLVLKEPTTYAAAVR
        AAAEDH NVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTVAQYERKFT+LSRFGMQYIPTEQLKIDKFIDGLR EIKGLLV+KEPTTYAAA+R
Subjt:  AAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTVAQYERKFTKLSRFGMQYIPTEQLKIDKFIDGLRREIKGLLVLKEPTTYAAAVR

Query:  CALVMDKCLEEPQSQQVIGCSSTVKRKFASFSSSQPSRGHQHHVQRQTAPPVCPSCKKSHAGPCWLGKRICFKCQKEGHFARECPMTSSNTQALGQKTPA
        CALVMDKCLEEPQSQQV+G SS VKRKFA FSSSQ SRGHQHHVQRQTAPPVCPSCKK+HAGPCWLGKRICF+C                     QKTPA
Subjt:  CALVMDKCLEEPQSQQVIGCSSTVKRKFASFSSSQPSRGHQHHVQRQTAPPVCPSCKKSHAGPCWLGKRICFKCQKEGHFARECPMTSSNTQALGQKTPA

Query:  TAAAQDGTHRARVFAHTRGDVEHAEAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHADLKLESLGFLLS----------------------------
         AAAQ GT RARVFA TRGDVEHAEAVVTGT+LV+SMPAYALFDSGSSHSFIASTFVRHADL+LESLGFLLS                            
Subjt:  TAAAQDGTHRARVFAHTRGDVEHAEAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHADLKLESLGFLLS----------------------------

Query:  ---------DFDVILGMDWLAANRANINCSKKEVSFHLPSGQNFTFKGVKAGVPRVVSVLKASHLLQRGAWAYLASVVDARKVVPSLEAVRVVNEFTDVF
                 DFDVILGMDWLAANRANINCSKKEVSF LPSGQNFTFK VK GVPRVVS LKA++LLQRGAWAYLASVVDARKVVPS+EAVRVVNEFTDVF
Subjt:  ---------DFDVILGMDWLAANRANINCSKKEVSFHLPSGQNFTFKGVKAGVPRVVSVLKASHLLQRGAWAYLASVVDARKVVPSLEAVRVVNEFTDVF

Query:  P
        P
Subjt:  P

A0A6J1DTE5 uncharacterized protein LOC1110238218.4e-20075.97Show/hide
Query:  NGAGGAQVQQPRRAQIPQEEVQFIRDFKRFGPPVFNGVSERPTTTEEWVRELEALYVYLGCSDDFKVRGAVFMLRGEVVNWWESVAAAEDHANVPVTWAR
        NGAGGAQVQQPRRAQ PQEEVQFIRDFKRFGPPVFNGVSERPT  EEWVRELEALYVYLGCSDDFKV+GAV                             
Subjt:  NGAGGAQVQQPRRAQIPQEEVQFIRDFKRFGPPVFNGVSERPTTTEEWVRELEALYVYLGCSDDFKVRGAVFMLRGEVVNWWESVAAAEDHANVPVTWAR

Query:  FKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTVAQYERKFTKLSRFGMQYIPTEQLKIDKFIDGLRREIKGLLVLKEPTTYAAAVRCALVMDKCLEEPQSQ
                       NEKRAEFLRLTQGSLTVAQYERKFT+LSRF MQYIP EQLKIDKFIDGL REIKGLLVLKEPTTYAAAVRCALVMDKCLEEPQSQ
Subjt:  FKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTVAQYERKFTKLSRFGMQYIPTEQLKIDKFIDGLRREIKGLLVLKEPTTYAAAVRCALVMDKCLEEPQSQ

Query:  QVIGCSSTVKRKFASFSSSQPSRGHQHHVQRQTAPPVCPSCKKSHAGPCWLGKRICFKCQKEGHFARECPMTSSNTQALGQKTPATAAAQDGTHRARVFA
        QV+G SS VKRKFASFSSSQPSRGHQHHVQRQTAPPVCPSCKKSH GPCWLGK IC++CQKEGHFARECPMT  NTQ LGQ+ P T AAQ GTHRARVFA
Subjt:  QVIGCSSTVKRKFASFSSSQPSRGHQHHVQRQTAPPVCPSCKKSHAGPCWLGKRICFKCQKEGHFARECPMTSSNTQALGQKTPATAAAQDGTHRARVFA

Query:  HTRGDVEHAEAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHADLKLESLGFLLS-------------------------------------DFDVIL
         TRGDV HAEAVV GTVLVLSMPAYALFDS SSHSFIASTFVRHADL+LESLGFLLS                                     DFDVIL
Subjt:  HTRGDVEHAEAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHADLKLESLGFLLS-------------------------------------DFDVIL

Query:  GMDWLAANRANINCSKKEVSFHLPSGQNFTFKGVKAGVPRVVSVLKASHLLQRGAWAYLASVVDARKVVPSLEAVRVVNEFTDVFPEDLPG
        GMDWLAAN+ANI+CSKKE SF LPS QNFTFKGVKA VPRVVS LKASH LQRGAWAYLASVVDARKVVPS+EAVRVVNEFTDVFPEDLPG
Subjt:  GMDWLAANRANINCSKKEVSFHLPSGQNFTFKGVKAGVPRVVSVLKASHLLQRGAWAYLASVVDARKVVPSLEAVRVVNEFTDVFPEDLPG

A0A6J1DWP4 uncharacterized protein LOC1110252154.6e-22283.03Show/hide
Query:  VALLAEALQVLLDNANGAGGAQVQQPRRAQIPQEEVQFIRDFKRFGPPVFNGVSERPTTTEEWVRELEALYVYLGCSDDFKVRGAVFMLRGEVVNWWESV
        VALLAEALQVLLDNANGAGGAQVQQPR AQIPQEE                 VSERPT  EEWVRELEALYVYLGCSDDFKVRGAVFMLRGE VNWWESV
Subjt:  VALLAEALQVLLDNANGAGGAQVQQPRRAQIPQEEVQFIRDFKRFGPPVFNGVSERPTTTEEWVRELEALYVYLGCSDDFKVRGAVFMLRGEVVNWWESV

Query:  AAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTVAQYERKFTKLSRFGMQYIPTEQLKIDKFIDGLRREIKGLLVLKEPTTYAAAVR
        AAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKR EFLRLTQGSLTVA+YERKFT+LSRFGMQYIPT+QLKIDKFIDGLRREIKGLLVLKEPTTYAAAVR
Subjt:  AAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTVAQYERKFTKLSRFGMQYIPTEQLKIDKFIDGLRREIKGLLVLKEPTTYAAAVR

Query:  CALVMDKCLEEPQSQQVIGCSSTVKRKFASFSSSQPSRGHQHHVQRQTAPPVCPSCKKSHAGPCWLGKRICFKCQKEGHFARECPMTSSNTQALGQKTPA
        CALVMDKCLEEPQSQQVIG SS VKRKFASFSSSQPSR HQHHVQRQTAPPVCPSCKKSHAGPCW+GKRIC++CQKEGHFARECPMT SNTQALGQ+ PA
Subjt:  CALVMDKCLEEPQSQQVIGCSSTVKRKFASFSSSQPSRGHQHHVQRQTAPPVCPSCKKSHAGPCWLGKRICFKCQKEGHFARECPMTSSNTQALGQKTPA

Query:  TAAAQDGTHRARVFAHTRGDVEHAEAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHADLKLESLGFLLS----------------------------
        TAAAQ GTHRARVFA TRGDVE+AEAVVT TVLVLSMPAYALFDSGSSHSFIASTFV HADL+LESLGFLLS                            
Subjt:  TAAAQDGTHRARVFAHTRGDVEHAEAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHADLKLESLGFLLS----------------------------

Query:  ---------DFDVILGMDWLAANRANINCSKKEVSFHLPSGQNFTFKGVKAGVPRVVSVLKASHLLQRGAWAYLASVVDARKVVPSLEA
                 DFDVILGMDWLAANRANI+CSKK+VSF LPSGQNFTFKGVKAGVPRVV  LKASHLLQRGAWAYLASVVDARKVVPS+EA
Subjt:  ---------DFDVILGMDWLAANRANINCSKKEVSFHLPSGQNFTFKGVKAGVPRVVSVLKASHLLQRGAWAYLASVVDARKVVPSLEA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGATCCGAATCTTAGGGGTGGCGTTGCTAGCTGAGGCATTGCAAGTACTGCTGGATAATGCGAATGGAGCCGGTGGAGCTCAAGTGCAACAACCTCGCCGGGCACA
AATTCCGCAAGAGGAGGTTCAGTTTATCAGGGATTTCAAACGCTTTGGGCCACCCGTTTTCAACGGGGTAAGTGAGAGGCCTACTACGACCGAGGAATGGGTCAGGGAGT
TGGAAGCCCTTTATGTGTATTTGGGATGCTCCGACGATTTCAAGGTCCGGGGAGCAGTGTTTATGCTTCGGGGAGAAGTAGTAAATTGGTGGGAGTCGGTGGCGGCAGCG
GAGGATCACGCCAACGTACCCGTCACGTGGGCAAGATTTAAGGACCTACTTTATGAGTACTATTTCCCCGTGACTGTCAGGAATGAAAAACGGGCAGAGTTTCTCCGTCT
CACTCAGGGGAGCCTAACTGTGGCCCAATACGAGAGAAAGTTCACTAAGCTGTCCCGTTTTGGAATGCAATATATTCCTACTGAGCAATTAAAGATTGACAAGTTCATTG
ACGGTTTGCGTAGGGAGATCAAGGGGCTACTTGTTCTCAAGGAACCAACTACATATGCAGCGGCAGTCAGATGTGCGTTGGTTATGGACAAATGTCTCGAGGAGCCTCAA
TCTCAGCAGGTGATAGGCTGCAGCTCGACGGTCAAGAGGAAATTTGCATCGTTCTCCTCCAGTCAACCCTCAAGAGGACACCAGCATCATGTGCAAAGGCAGACTGCTCC
TCCGGTGTGCCCCTCTTGTAAGAAGAGTCATGCTGGGCCATGTTGGTTGGGAAAAAGAATATGTTTCAAGTGCCAGAAGGAAGGACATTTCGCAAGGGAGTGTCCGATGA
CCAGCTCGAATACCCAAGCATTAGGCCAGAAGACCCCTGCGACGGCGGCAGCTCAAGATGGAACCCATAGGGCACGCGTCTTTGCTCACACCAGGGGGGATGTTGAGCAT
GCCGAGGCGGTGGTCACAGGGACTGTTTTAGTGCTTAGTATGCCTGCTTACGCATTATTTGACTCTGGATCTAGTCATTCTTTCATTGCTTCTACCTTTGTTCGACATGC
GGACCTAAAGCTAGAATCATTAGGCTTTTTGTTGTCGGATTTCGATGTGATACTAGGCATGGATTGGTTAGCTGCTAACCGGGCTAATATTAATTGCTCAAAGAAGGAAG
TTAGTTTTCACTTGCCCTCCGGACAAAACTTTACCTTTAAGGGAGTTAAGGCCGGGGTCCCAAGGGTGGTGTCGGTATTGAAGGCCAGCCATCTTCTCCAGCGTGGTGCT
TGGGCCTATTTGGCTAGCGTTGTGGATGCAAGGAAGGTTGTGCCAAGCCTTGAGGCGGTTCGTGTGGTTAATGAGTTCACTGACGTGTTCCCTGAAGACCTCCCCGGCTG
CCTCCTGGGACCTAGCCTGAGACAGAGGATCATCGTTGCCCAAAAGGAAGATCTCAGCTTGGCCAAAGGCTTTAGTATGGTGGGCCATGAAGATTTCACTCATTCGGCCG
TCACCCACCCCTCTTCTTCTTCTTGTTCACCGGCGACCACACCTCACGGCGGCGACGGAAGCGGCAGCTCCTCCTCGAAACAGCAGCAGCGGCGGCCGGCGATCACCTCC
GTAACAGCAGCAGCGGCGGACAGCGACCCATCCAGCAGCGGCGCGACTTCGACCCGACATCCCGGCGATCCACAGCAGCGGCGACGCGATTTCCTCCGGCGACGTGTTCT
CCGGCGAACGACCCAGATCTGCCAGCACCGTTCCTGCAGCGTTCCTCGCGGCGGCGCACGACGAACAGATCAGATCCGCGGCGAACCTCGACGACCTGCTGCTACGAAAA
CCAGCGGCGGGATTAGTGCGTTACTGCAGCGTTTTAGGACGTTTGGCGGTGACCCACATCCGTTCGAAGCTCGATTTAGACTACCCACACCTTGGCGAGCTAGATCTAGG
TGGGAGATCGACGAAAAAATTAAGGGCGCGGTTGTGTGGGCCGACTCTACTCGAAGGAGTTATCTGTGGGGGACTAGGATAGCAGTGATAAGCGTAGAGGTACTTGAGAT
GTTAGGGAGGCTCGAGGCGGGAGTGGAGGCCGATGATGTGAATGTAGTGACATGCAAGAAGCCGGTCACATCTCTAAGGCGAGAGTTCGGGGTTATTCGATCGTTCGGGA
GGAATAATCTCCCATACCGCTGCGGATCCCAAGACCCGAGGAGTGACAGCGAAGGTGATGACACCATTGCTGGAATCACACGAGCTCGAGGGGAAACTCGTGGCGAATTA
TTGTCACGAGTCGTTCATCGTACTGGGAAAATCAAGCTCCAATGGACGGAGCAACAGGGAAGAGGACCTATGATAAGATTGTTGTCGATATATCTCAGCATCATTGATAT
GATCATAAAGCCACCAAGCTTACCACAAAGACCAATCACGAGAGCCCGAGCCAAAAGCTTCAAGGAGCCGTGGAAAAATACGTGCAAAGCTATATTGGCTTCATACAAGA
AGAAGGGAATGAAGGCACCAAGATCCAGCATCGGTGTCTCATCAAGGTGCGACCAAGTGAAGAACCCATAG
mRNA sequenceShow/hide mRNA sequence
ATGAGGATCCGAATCTTAGGGGTGGCGTTGCTAGCTGAGGCATTGCAAGTACTGCTGGATAATGCGAATGGAGCCGGTGGAGCTCAAGTGCAACAACCTCGCCGGGCACA
AATTCCGCAAGAGGAGGTTCAGTTTATCAGGGATTTCAAACGCTTTGGGCCACCCGTTTTCAACGGGGTAAGTGAGAGGCCTACTACGACCGAGGAATGGGTCAGGGAGT
TGGAAGCCCTTTATGTGTATTTGGGATGCTCCGACGATTTCAAGGTCCGGGGAGCAGTGTTTATGCTTCGGGGAGAAGTAGTAAATTGGTGGGAGTCGGTGGCGGCAGCG
GAGGATCACGCCAACGTACCCGTCACGTGGGCAAGATTTAAGGACCTACTTTATGAGTACTATTTCCCCGTGACTGTCAGGAATGAAAAACGGGCAGAGTTTCTCCGTCT
CACTCAGGGGAGCCTAACTGTGGCCCAATACGAGAGAAAGTTCACTAAGCTGTCCCGTTTTGGAATGCAATATATTCCTACTGAGCAATTAAAGATTGACAAGTTCATTG
ACGGTTTGCGTAGGGAGATCAAGGGGCTACTTGTTCTCAAGGAACCAACTACATATGCAGCGGCAGTCAGATGTGCGTTGGTTATGGACAAATGTCTCGAGGAGCCTCAA
TCTCAGCAGGTGATAGGCTGCAGCTCGACGGTCAAGAGGAAATTTGCATCGTTCTCCTCCAGTCAACCCTCAAGAGGACACCAGCATCATGTGCAAAGGCAGACTGCTCC
TCCGGTGTGCCCCTCTTGTAAGAAGAGTCATGCTGGGCCATGTTGGTTGGGAAAAAGAATATGTTTCAAGTGCCAGAAGGAAGGACATTTCGCAAGGGAGTGTCCGATGA
CCAGCTCGAATACCCAAGCATTAGGCCAGAAGACCCCTGCGACGGCGGCAGCTCAAGATGGAACCCATAGGGCACGCGTCTTTGCTCACACCAGGGGGGATGTTGAGCAT
GCCGAGGCGGTGGTCACAGGGACTGTTTTAGTGCTTAGTATGCCTGCTTACGCATTATTTGACTCTGGATCTAGTCATTCTTTCATTGCTTCTACCTTTGTTCGACATGC
GGACCTAAAGCTAGAATCATTAGGCTTTTTGTTGTCGGATTTCGATGTGATACTAGGCATGGATTGGTTAGCTGCTAACCGGGCTAATATTAATTGCTCAAAGAAGGAAG
TTAGTTTTCACTTGCCCTCCGGACAAAACTTTACCTTTAAGGGAGTTAAGGCCGGGGTCCCAAGGGTGGTGTCGGTATTGAAGGCCAGCCATCTTCTCCAGCGTGGTGCT
TGGGCCTATTTGGCTAGCGTTGTGGATGCAAGGAAGGTTGTGCCAAGCCTTGAGGCGGTTCGTGTGGTTAATGAGTTCACTGACGTGTTCCCTGAAGACCTCCCCGGCTG
CCTCCTGGGACCTAGCCTGAGACAGAGGATCATCGTTGCCCAAAAGGAAGATCTCAGCTTGGCCAAAGGCTTTAGTATGGTGGGCCATGAAGATTTCACTCATTCGGCCG
TCACCCACCCCTCTTCTTCTTCTTGTTCACCGGCGACCACACCTCACGGCGGCGACGGAAGCGGCAGCTCCTCCTCGAAACAGCAGCAGCGGCGGCCGGCGATCACCTCC
GTAACAGCAGCAGCGGCGGACAGCGACCCATCCAGCAGCGGCGCGACTTCGACCCGACATCCCGGCGATCCACAGCAGCGGCGACGCGATTTCCTCCGGCGACGTGTTCT
CCGGCGAACGACCCAGATCTGCCAGCACCGTTCCTGCAGCGTTCCTCGCGGCGGCGCACGACGAACAGATCAGATCCGCGGCGAACCTCGACGACCTGCTGCTACGAAAA
CCAGCGGCGGGATTAGTGCGTTACTGCAGCGTTTTAGGACGTTTGGCGGTGACCCACATCCGTTCGAAGCTCGATTTAGACTACCCACACCTTGGCGAGCTAGATCTAGG
TGGGAGATCGACGAAAAAATTAAGGGCGCGGTTGTGTGGGCCGACTCTACTCGAAGGAGTTATCTGTGGGGGACTAGGATAGCAGTGATAAGCGTAGAGGTACTTGAGAT
GTTAGGGAGGCTCGAGGCGGGAGTGGAGGCCGATGATGTGAATGTAGTGACATGCAAGAAGCCGGTCACATCTCTAAGGCGAGAGTTCGGGGTTATTCGATCGTTCGGGA
GGAATAATCTCCCATACCGCTGCGGATCCCAAGACCCGAGGAGTGACAGCGAAGGTGATGACACCATTGCTGGAATCACACGAGCTCGAGGGGAAACTCGTGGCGAATTA
TTGTCACGAGTCGTTCATCGTACTGGGAAAATCAAGCTCCAATGGACGGAGCAACAGGGAAGAGGACCTATGATAAGATTGTTGTCGATATATCTCAGCATCATTGATAT
GATCATAAAGCCACCAAGCTTACCACAAAGACCAATCACGAGAGCCCGAGCCAAAAGCTTCAAGGAGCCGTGGAAAAATACGTGCAAAGCTATATTGGCTTCATACAAGA
AGAAGGGAATGAAGGCACCAAGATCCAGCATCGGTGTCTCATCAAGGTGCGACCAAGTGAAGAACCCATAG
Protein sequenceShow/hide protein sequence
MRIRILGVALLAEALQVLLDNANGAGGAQVQQPRRAQIPQEEVQFIRDFKRFGPPVFNGVSERPTTTEEWVRELEALYVYLGCSDDFKVRGAVFMLRGEVVNWWESVAAA
EDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTVAQYERKFTKLSRFGMQYIPTEQLKIDKFIDGLRREIKGLLVLKEPTTYAAAVRCALVMDKCLEEPQ
SQQVIGCSSTVKRKFASFSSSQPSRGHQHHVQRQTAPPVCPSCKKSHAGPCWLGKRICFKCQKEGHFARECPMTSSNTQALGQKTPATAAAQDGTHRARVFAHTRGDVEH
AEAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHADLKLESLGFLLSDFDVILGMDWLAANRANINCSKKEVSFHLPSGQNFTFKGVKAGVPRVVSVLKASHLLQRGA
WAYLASVVDARKVVPSLEAVRVVNEFTDVFPEDLPGCLLGPSLRQRIIVAQKEDLSLAKGFSMVGHEDFTHSAVTHPSSSSCSPATTPHGGDGSGSSSSKQQQRRPAITS
VTAAAADSDPSSSGATSTRHPGDPQQRRRDFLRRRVLRRTTQICQHRSCSVPRGGARRTDQIRGEPRRPAATKTSGGISALLQRFRTFGGDPHPFEARFRLPTPWRARSR
WEIDEKIKGAVVWADSTRRSYLWGTRIAVISVEVLEMLGRLEAGVEADDVNVVTCKKPVTSLRREFGVIRSFGRNNLPYRCGSQDPRSDSEGDDTIAGITRARGETRGEL
LSRVVHRTGKIKLQWTEQQGRGPMIRLLSIYLSIIDMIIKPPSLPQRPITRARAKSFKEPWKNTCKAILASYKKKGMKAPRSSIGVSSRCDQVKNP