CuGenDBv2

Moc03g20910 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc03g20910
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr3:14292299..14296933
RNA-Seq Expression: Moc03g20910
Synteny: Moc03g20910
Gene Ontology terms:
GO:0034641 - cellular nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0016787 - hydrolase activity (molecular function)
InterPro domains: IPR005162 - Retrotransposon gag domain
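
The genome location above follows a `sequence:start..end` pattern. As a minimal sketch, the snippet below splits such a string into its parts and computes the span length. The function names `parse_location` and `span_length` are hypothetical, and the length assumes 1-based inclusive coordinates, which this page does not state.

```python
import re

# Assumed layout of a location string: <sequence>:<start>..<end>
LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(location):
    """Split a 'chr:start..end' string into (seqid, start, end)."""
    match = LOCATION_RE.match(location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    start = int(match["start"])
    end = int(match["end"])
    if start > end:
        raise ValueError(f"start is greater than end in {location!r}")
    return match["seqid"], start, end


def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


if __name__ == "__main__":
    seqid, start, end = parse_location("chr3:14292299..14296933")
    print(seqid, start, end, span_length(start, end))
```

If the coordinates turn out to be 0-based or half-open, only `span_length` needs to change.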


Homology
GenBank top hits (e value, % identity, alignment)
GFS39075.1 hypothetical protein Acr_00g0061040 [Actinidia rufa] (e value: 4.9e-84; % identity: 37.97)
Query:  IIRGCSRLGYINGTIAEPDEADPSFSVWDAQNSMVMAWLINSMEEDIKESFIFYSTTKDLWNALTMAFSDFDNSTQLFELRNKARSLRQGESDVTQYYSS
        IIRG  R GY++G+I +P   DPSF +WD QNSMVMAWL+NSM++ I E ++ Y T K +W+A+T+A+SD ++S+Q+F+LR+++R+LRQ E  VTQY+SS
Subjt:  IIRGCSRLGYINGTIAEPDEADPSFSVWDAQNSMVMAWLINSMEEDIKESFIFYSTTKDLWNALTMAFSDFDNSTQLFELRNKARSLRQGESDVTQYYSS

Query:  LRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLRPELDDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMGDTP-TKPLSLSLESLALEAR
        L +LW ELDL     W  + + E +RK + KER Y+FL GL P LDDVRGR+L+ KP+P++D IF+EVR E  R+R+M+G  P     S++ ++ A+ AR
Subjt:  LRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLRPELDDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMGDTP-TKPLSLSLESLALEAR

Query:  GPPPPSSRSTRRNNLWCDHCKRTNHTKDRCWELHGRPQRKNLSGDYRPPPPTNTPPSRTSSSGYQVGPSVSNSQDLATSLPL-FSKAQLEQLYRLLTPPV
            P SR  R+  LWCDHC R +HTK+ CW+LHG+P       D+        P S+  S G +  P V+   D A+S  L F++AQL+Q+ +L +   
Subjt:  GPPPPSSRSTRRNNLWCDHCKRTNHTKDRCWELHGRPQRKNLSGDYRPPPPTNTPPSRTSSSGYQVGPSVSNSQDLATSLPL-FSKAQLEQLYRLLTPPV

Query:  ESTPSSSFVAQRGIFSAALTSQQHSDQ-WILDSGATDHMTAFHDMFTLYSPNPIQTHVKLADGSSAIIKGFGSVILSPNITLHSVL--------------
            SS+ +   GI S+ +     S + WI+DSGA++HM++   +F+ YS       V LADGS + + G G+V LSPN+ LH+VL              
Subjt:  ESTPSSSFVAQRGIFSAALTSQQHSDQ-WILDSGATDHMTAFHDMFTLYSPNPIQTHVKLADGSSAIIKGFGSVILSPNITLHSVL--------------

Query:  ------------------HDLITGTTIGNADGFEGLYYF--RGPSLRNKQVLQGETEPITSSLDGNFWEIDDLNTRIESPQSKIPEIDGLNTESPQPKIL
                           D  +G TIG A    GLYYF      +R  QV +G      S       +I  L+ R+  P+S        N +S +  I 
Subjt:  ------------------HDLITGTTIGNADGFEGLYYF--RGPSLRNKQVLQGETEPITSSLDGNFWEIDDLNTRIESPQSKIPEIDGLNTESPQPKIL

Query:  IPIIPITQIEESVPIISCNNEDDQVNPNRSDK-QPETLVYSRRQTVQRGVEPPQPQQQSHESISSLGIEQSTLVPQDNTNDLDLPIALMK
                 +++   ++ N       PN S   +P  L Y RR+ V    +P  P    H S SS G + + L PQ     LD+PIAL K
Subjt:  IPIIPITQIEESVPIISCNNEDDQVNPNRSDK-QPETLVYSRRQTVQRGVEPPQPQQQSHESISSLGIEQSTLVPQDNTNDLDLPIALMK

KAE8690376.1 pentatricopeptide repeat-containing protein [Hibiscus syriacus] (e value: 1.2e-82; % identity: 36.27)
Query:  IRGCSRLGYINGTIAEPDEADPSFSVWDAQNSMVMAWLINSMEEDIKESFIFYSTTKDLWNALTMAFSDFDNSTQLFELRNKARSLRQGESDVTQYYSSL
        IRG  + GY++GT  +  E D     W+A+NSM+M+WLINSM+  +  +++F  T  D+WNA+   +SD  N+ Q FEL+ +   L+QGE  VTQYY+ L
Subjt:  IRGCSRLGYINGTIAEPDEADPSFSVWDAQNSMVMAWLINSMEEDIKESFIFYSTTKDLWNALTMAFSDFDNSTQLFELRNKARSLRQGESDVTQYYSSL

Query:  RRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLRPELDDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMGDTPTKPLSLSLESLALEARGP
        + LW E+D+  + EW  +KD   F+K VEKE +++FL GL  ELD+VRGR+L  +P+P+  E+F+EVR E SR+ +M+G     P     ES AL +  P
Subjt:  RRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLRPELDDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMGDTPTKPLSLSLESLALEARGP

Query:  PPPSSRSTRRNNLWCDHCKRTNHTKDRCWELHGRPQRKNLSGDYRPPPPTNTPPSRTSSSGYQVGPSVSNSQDLATSLPLFSKAQLEQLYRLL-------
           + R   R+   C+HC +  HTK +CW+LHG+P  +N S +           SR +   Y      S     AT L  FSK QLEQLY+L+       
Subjt:  PPPSSRSTRRNNLWCDHCKRTNHTKDRCWELHGRPQRKNLSGDYRPPPPTNTPPSRTSSSGYQVGPSVSNSQDLATSLPLFSKAQLEQLYRLL-------

Query:  TPPVESTPSSSFVAQRGIFSAALTSQQHSDQWILDSGATDHMTAFHDMFTLYSPNPIQTHVKLADGSSAIIKGFGSVILSPNITLHSVLH------DLIT
        TP   S  +SS +AQ+G +  A  +   S+ WI+DSGATDHMT    +F+ Y P      VK+ADGS   I G GS+I+SP++TL +VLH      +LI+
Subjt:  TPPVESTPSSSFVAQRGIFSAALTSQQHSDQWILDSGATDHMTAFHDMFTLYSPNPIQTHVKLADGSSAIIKGFGSVILSPNITLHSVLH------DLIT

Query:  GTTIGNADGFEGLYYFRGPSLRNKQVLQGETEPITSSLDGNFWEIDDLNTRI-ESPQSK-IPEIDGLNTESPQPKILIPIIPITQIEESVPIISCNNEDD
           I +         F G   +         +P +  + GN  E+D L   + ++P  K + +     T   + +I++    I+Q   S P       + 
Subjt:  GTTIGNADGFEGLYYFRGPSLRNKQVLQGETEPITSSLDGNFWEIDDLNTRI-ESPQSK-IPEIDGLNTESPQPKILIPIIPITQIEESVPIISCNNEDD

Query:  QVNPNRSDKQPETLVYSRRQTVQRGVEPPQPQQQSHESISSLGIEQSTLVPQDN---TNDLDLPIALMKGVRLC-TQHPIARCIGYSHLS
        QV  N  +   E  VYSRR T    V P     Q  +S    G    + +   N   + +   PIAL KGVR C T+HPI+  + Y+ LS
Subjt:  QVNPNRSDKQPETLVYSRRQTVQRGVEPPQPQQQSHESISSLGIEQSTLVPQDN---TNDLDLPIALMKGVRLC-TQHPIARCIGYSHLS

XP_022154801.1 uncharacterized protein LOC111021967 [Momordica charantia] (e value: 2.8e-87; % identity: 94.86)
Query:  MEEDIKESFIFYSTTKDLWNALTMAFSDFDNSTQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLR
        MEEDIKESFIFYST KDLWNALTM FSDFDNS QLFELRNKA SLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAG+R
Subjt:  MEEDIKESFIFYSTTKDLWNALTMAFSDFDNSTQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLR

Query:  PELDDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMGDTPTKPLSLSLESLALEARGPPPPSSRSTRRNNLWCD
        PELDDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMGDT TKPLSLSLES AL ARGPPPPSSRSTRRNNLW D
Subjt:  PELDDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMGDTPTKPLSLSLESLALEARGPPPPSSRSTRRNNLWCD

XP_022159153.1 uncharacterized protein LOC111025577 [Momordica charantia] (e value: 5.6e-165; % identity: 64.44)
Query:  MVKTAMVTDVRKDE---------------------------------------------------IIRGCSRLGYINGTIAEPDEADPSFSVWDAQNSMV
        MVKTAM+TDVRKDE                                                   +IRG SRLGYINGTIAEPDEADPSFSVWDAQNSMV
Subjt:  MVKTAMVTDVRKDE---------------------------------------------------IIRGCSRLGYINGTIAEPDEADPSFSVWDAQNSMV

Query:  MAWLINSMEEDIKESFIFYSTTKDLWNALTMAFSDFDNSTQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIY
        MAWLINSMEEDIKESFIFYST KDLWNALTMAFSDFDNS QLFEL NKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIY
Subjt:  MAWLINSMEEDIKESFIFYSTTKDLWNALTMAFSDFDNSTQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIY

Query:  DFLAGLRPELDDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMGDTPTKPLSLSLESLALEARGPPPPSSRSTRRNNLWCDHCKRTNHTKDRCWELHGR
        DFLAGLRPELDDVRGRLLATKPIPAIDEIFAEV WESSRKRVMMGDT TKPLSLSLES AL ARGPPP                                
Subjt:  DFLAGLRPELDDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMGDTPTKPLSLSLESLALEARGPPPPSSRSTRRNNLWCDHCKRTNHTKDRCWELHGR

Query:  PQRKNLSGDYRPPPPTNTPPSRTSSSGYQVGPSVSNSQDLATSLPLFSKAQLEQLYRLLTPPVESTPSSSFVAQRGIFSAALTSQQHSDQWILDSGATDH
                                                 +  P+   AQLEQLYRLLT PVESTPSSSFVAQRGI SAALT QQHSDQWILDSGATDH
Subjt:  PQRKNLSGDYRPPPPTNTPPSRTSSSGYQVGPSVSNSQDLATSLPLFSKAQLEQLYRLLTPPVESTPSSSFVAQRGIFSAALTSQQHSDQWILDSGATDH

Query:  MTAFHDMFTLYSPNPIQTHVKLADGSSAIIKGFGSVILSPNITLHSVL--------------------------------HDLITGTTIGNADGFEGLYY
        MTAFHDMFT+YSPNPIQTHVKLADGSSAIIKGFGSVILSPNITLHSVL                                 DLITGTTIG+ADGFEGLYY
Subjt:  MTAFHDMFTLYSPNPIQTHVKLADGSSAIIKGFGSVILSPNITLHSVL--------------------------------HDLITGTTIGNADGFEGLYY

Query:  FRGPSLRNKQVLQGETEPITSSL
        FRGPSLRNKQVLQG T   +S +
Subjt:  FRGPSLRNKQVLQGETEPITSSL

XP_024044152.1 uncharacterized protein LOC18046468 isoform X2 [Citrus clementina] (e value: 8.6e-81; % identity: 39.64)
Query:  IRGCSRLGYINGTIAEPDEADPSFSVWDAQNSMVMAWLINSMEEDIKESFIFYSTTKDLWNALTMAFSDFDNSTQLFELRNKARSLRQGESDVTQYYSSL
        IRG  ++GY+ G+I EP E DP F  WDA NSM+M+WL+NSME++I ++++F  T KDLW+A+T  +SD  NS Q+++L+ + R  +QG   VT+YY+ L
Subjt:  IRGCSRLGYINGTIAEPDEADPSFSVWDAQNSMVMAWLINSMEEDIKESFIFYSTTKDLWNALTMAFSDFDNSTQLFELRNKARSLRQGESDVTQYYSSL

Query:  RRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLRPELDDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMGDTPTKPLSLSLESLALEARGP
        + LW ELD   + EWE + D+ +++K +EKER+++FLAGL  +LD+VRGR+L  +P+P+  E+F+ VR E SRK VMMG +  +  +L   +      G 
Subjt:  RRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLRPELDDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMGDTPTKPLSLSLESLALEARGP

Query:  PPPSSRSTRRNNLWCDHCKRTNHTKDRCWELHGRPQRKNLSGDYRPPPPTNTPPSRTSSSGYQ-VGPSVSNSQDLATSLPLFSKAQLEQLYRLLTPPVES
             +S  ++ +WCD+C +  HT+D CW+LHG           +PP   N   S   S G+Q VG +   +    T   LF+K QLEQLYR L      
Subjt:  PPPSSRSTRRNNLWCDHCKRTNHTKDRCWELHGRPQRKNLSGDYRPPPPTNTPPSRTSSSGYQ-VGPSVSNSQDLATSLPLFSKAQLEQLYRLLTPPVES

Query:  TPSSSF--VAQRG-IFSAALTSQQHSDQWILDSGATDHMTAFHDMFTLYSPNPIQTHVKLADGSSAIIKGFGSVILSPNITLHSVLH-------------
           SSF  +AQ+G  F+A     +  D WI+DSGATDHMT+   +F+ Y P      +K+ADGS + + G GS+ +S N+ L SVLH             
Subjt:  TPSSSF--VAQRG-IFSAALTSQQHSDQWILDSGATDHMTAFHDMFTLYSPNPIQTHVKLADGSSAIIKGFGSVILSPNITLHSVLH-------------

Query:  -------------------DLITGTTIGNADGFEGLYYF
                           DL +G  IG+A   +GLYYF
Subjt:  -------------------DLITGTTIGNADGFEGLYYF

TrEMBL top hits (e value, % identity, alignment)
A0A2N9GQ49 Uncharacterized protein (e value: 4.8e-85; % identity: 34.65)
Query:  IRGCSRLGYINGTIAEPDEADPSFSVWDAQNSMVMAWLINSMEEDIKESFIFYSTTKDLWNALTMAFSDFDNSTQLFELRNKARSLRQGESDVTQYYSSL
        IRG  ++GY+ G    P EADP+++ WDA+NSMVM WL+NSMEEDI  +++ Y T ++LW  +   +SD  N +Q+FEL  K   +RQGE  VT+Y++SL
Subjt:  IRGCSRLGYINGTIAEPDEADPSFSVWDAQNSMVMAWLINSMEEDIKESFIFYSTTKDLWNALTMAFSDFDNSTQLFELRNKARSLRQGESDVTQYYSSL

Query:  RRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLRPELDDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMGDTPTKPLSLSLESLALEARGP
        +R+W +LDL    EW++ +D+   +K VE  RI+ FLAGL  E D+VRGR++  +P+P I ++F+EVR E SR+ VM+G    K   +++ES AL A   
Subjt:  RRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLRPELDDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMGDTPTKPLSLSLESLALEARGP

Query:  PPPSS-----RSTRRNNLWCDHCKRTNHTKDRCWELHGRPQRKNLSGDYRPPPPTNTPPSRTSSSGYQVGPSVSNSQDLATSLPLFSKAQLEQLYRLLTP
            +     R+  +  +WCD+C +  HT++ CW++HG+              P N   S+      +  P+ + ++     +  F+K Q+E L  LL  
Subjt:  PPPSS-----RSTRRNNLWCDHCKRTNHTKDRCWELHGRPQRKNLSGDYRPPPPTNTPPSRTSSSGYQVGPSVSNSQDLATSLPLFSKAQLEQLYRLLTP

Query:  PVESTPSSSFVAQRGIFSAALTSQQHSD-QWILDSGATDHMTAFHDMFTLYSPNPIQTHVKLADGSSAIIKGFGSVILSPNITLHSVLH----------D
           S   S  VAQ G    AL+   +S   WI+DSGA+DHMT+ H+ F  YSP      V++ADGS + I G G + +S  I L SVLH          D
Subjt:  PVESTPSSSFVAQRGIFSAALTSQQHSD-QWILDSGATDHMTAFHDMFTLYSPNPIQTHVKLADGSSAIIKGFGSVILSPNITLHSVLH----------D

Query:  LITGTTIGNADGFEGLYYFRGPSLRNKQVLQGET--------EPITSSLDG---NFWEIDDLNTRIESPQSKIPEIDGLNTESPQPKILIPIIPITQIEE
          +G TIG+A    GLYYF   +L + +  QG +        E I   + G   NFWE      +     + I +    +TES  P+I   I    Q   
Subjt:  LITGTTIGNADGFEGLYYFRGPSLRNKQVLQGET--------EPITSSLDG---NFWEIDDLNTRIESPQSKIPEIDGLNTESPQPKILIPIIPITQIEE

Query:  SVPIISCNNEDDQVNPNRSDKQPETLVYSRRQTVQRGVE-PPQPQQQSHESIS--SLGIEQST------------------------LVPQDNTNDLDLP
        S  I     E + +  +  +   E LVY+R++  +R  + P  P Q   ES++  SL I  ++                        + P++ T+DLD+P
Subjt:  SVPIISCNNEDDQVNPNRSDKQPETLVYSRRQTVQRGVE-PPQPQQQSHESIS--SLGIEQST------------------------LVPQDNTNDLDLP

Query:  IALMKGVRLCTQHPIARCIGYSHLSSAVQTLALNL
        IA+ KG+R CT++PIA+ I Y  LS+  +    N+
Subjt:  IALMKGVRLCTQHPIARCIGYSHLSSAVQTLALNL

A0A6A2ZFN0 Gag-Pol-p199 (e value: 5.8e-83; % identity: 36.27)
Query:  IRGCSRLGYINGTIAEPDEADPSFSVWDAQNSMVMAWLINSMEEDIKESFIFYSTTKDLWNALTMAFSDFDNSTQLFELRNKARSLRQGESDVTQYYSSL
        IRG  + GY++GT  +  E D     W+A+NSM+M+WLINSM+  +  +++F  T  D+WNA+   +SD  N+ Q FEL+ +   L+QGE  VTQYY+ L
Subjt:  IRGCSRLGYINGTIAEPDEADPSFSVWDAQNSMVMAWLINSMEEDIKESFIFYSTTKDLWNALTMAFSDFDNSTQLFELRNKARSLRQGESDVTQYYSSL

Query:  RRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLRPELDDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMGDTPTKPLSLSLESLALEARGP
        + LW E+D+  + EW  +KD   F+K VEKE +++FL GL  ELD+VRGR+L  +P+P+  E+F+EVR E SR+ +M+G     P     ES AL +  P
Subjt:  RRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLRPELDDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMGDTPTKPLSLSLESLALEARGP

Query:  PPPSSRSTRRNNLWCDHCKRTNHTKDRCWELHGRPQRKNLSGDYRPPPPTNTPPSRTSSSGYQVGPSVSNSQDLATSLPLFSKAQLEQLYRLL-------
           + R   R+   C+HC +  HTK +CW+LHG+P  +N S +           SR +   Y      S     AT L  FSK QLEQLY+L+       
Subjt:  PPPSSRSTRRNNLWCDHCKRTNHTKDRCWELHGRPQRKNLSGDYRPPPPTNTPPSRTSSSGYQVGPSVSNSQDLATSLPLFSKAQLEQLYRLL-------

Query:  TPPVESTPSSSFVAQRGIFSAALTSQQHSDQWILDSGATDHMTAFHDMFTLYSPNPIQTHVKLADGSSAIIKGFGSVILSPNITLHSVLH------DLIT
        TP   S  +SS +AQ+G +  A  +   S+ WI+DSGATDHMT    +F+ Y P      VK+ADGS   I G GS+I+SP++TL +VLH      +LI+
Subjt:  TPPVESTPSSSFVAQRGIFSAALTSQQHSDQWILDSGATDHMTAFHDMFTLYSPNPIQTHVKLADGSSAIIKGFGSVILSPNITLHSVLH------DLIT

Query:  GTTIGNADGFEGLYYFRGPSLRNKQVLQGETEPITSSLDGNFWEIDDLNTRI-ESPQSK-IPEIDGLNTESPQPKILIPIIPITQIEESVPIISCNNEDD
           I +         F G   +         +P +  + GN  E+D L   + ++P  K + +     T   + +I++    I+Q   S P       + 
Subjt:  GTTIGNADGFEGLYYFRGPSLRNKQVLQGETEPITSSLDGNFWEIDDLNTRI-ESPQSK-IPEIDGLNTESPQPKILIPIIPITQIEESVPIISCNNEDD

Query:  QVNPNRSDKQPETLVYSRRQTVQRGVEPPQPQQQSHESISSLGIEQSTLVPQDN---TNDLDLPIALMKGVRLC-TQHPIARCIGYSHLS
        QV  N  +   E  VYSRR T    V P     Q  +S    G    + +   N   + +   PIAL KGVR C T+HPI+  + Y+ LS
Subjt:  QVNPNRSDKQPETLVYSRRQTVQRGVEPPQPQQQSHESISSLGIEQSTLVPQDN---TNDLDLPIALMKGVRLC-TQHPIARCIGYSHLS

A0A6J1DPT5 uncharacterized protein LOC111021967 (e value: 1.3e-87; % identity: 94.86)
Query:  MEEDIKESFIFYSTTKDLWNALTMAFSDFDNSTQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLR
        MEEDIKESFIFYST KDLWNALTM FSDFDNS QLFELRNKA SLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAG+R
Subjt:  MEEDIKESFIFYSTTKDLWNALTMAFSDFDNSTQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLR

Query:  PELDDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMGDTPTKPLSLSLESLALEARGPPPPSSRSTRRNNLWCD
        PELDDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMGDT TKPLSLSLES AL ARGPPPPSSRSTRRNNLW D
Subjt:  PELDDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMGDTPTKPLSLSLESLALEARGPPPPSSRSTRRNNLWCD

A0A6J1DY12 uncharacterized protein LOC111025577 (e value: 2.7e-165; % identity: 64.44)
Query:  MVKTAMVTDVRKDE---------------------------------------------------IIRGCSRLGYINGTIAEPDEADPSFSVWDAQNSMV
        MVKTAM+TDVRKDE                                                   +IRG SRLGYINGTIAEPDEADPSFSVWDAQNSMV
Subjt:  MVKTAMVTDVRKDE---------------------------------------------------IIRGCSRLGYINGTIAEPDEADPSFSVWDAQNSMV

Query:  MAWLINSMEEDIKESFIFYSTTKDLWNALTMAFSDFDNSTQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIY
        MAWLINSMEEDIKESFIFYST KDLWNALTMAFSDFDNS QLFEL NKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIY
Subjt:  MAWLINSMEEDIKESFIFYSTTKDLWNALTMAFSDFDNSTQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIY

Query:  DFLAGLRPELDDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMGDTPTKPLSLSLESLALEARGPPPPSSRSTRRNNLWCDHCKRTNHTKDRCWELHGR
        DFLAGLRPELDDVRGRLLATKPIPAIDEIFAEV WESSRKRVMMGDT TKPLSLSLES AL ARGPPP                                
Subjt:  DFLAGLRPELDDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMGDTPTKPLSLSLESLALEARGPPPPSSRSTRRNNLWCDHCKRTNHTKDRCWELHGR

Query:  PQRKNLSGDYRPPPPTNTPPSRTSSSGYQVGPSVSNSQDLATSLPLFSKAQLEQLYRLLTPPVESTPSSSFVAQRGIFSAALTSQQHSDQWILDSGATDH
                                                 +  P+   AQLEQLYRLLT PVESTPSSSFVAQRGI SAALT QQHSDQWILDSGATDH
Subjt:  PQRKNLSGDYRPPPPTNTPPSRTSSSGYQVGPSVSNSQDLATSLPLFSKAQLEQLYRLLTPPVESTPSSSFVAQRGIFSAALTSQQHSDQWILDSGATDH

Query:  MTAFHDMFTLYSPNPIQTHVKLADGSSAIIKGFGSVILSPNITLHSVL--------------------------------HDLITGTTIGNADGFEGLYY
        MTAFHDMFT+YSPNPIQTHVKLADGSSAIIKGFGSVILSPNITLHSVL                                 DLITGTTIG+ADGFEGLYY
Subjt:  MTAFHDMFTLYSPNPIQTHVKLADGSSAIIKGFGSVILSPNITLHSVL--------------------------------HDLITGTTIGNADGFEGLYY

Query:  FRGPSLRNKQVLQGETEPITSSL
        FRGPSLRNKQVLQG T   +S +
Subjt:  FRGPSLRNKQVLQGETEPITSSL

A0A7J0DNJ6 Uncharacterized protein (e value: 2.4e-84; % identity: 37.97)
Query:  IIRGCSRLGYINGTIAEPDEADPSFSVWDAQNSMVMAWLINSMEEDIKESFIFYSTTKDLWNALTMAFSDFDNSTQLFELRNKARSLRQGESDVTQYYSS
        IIRG  R GY++G+I +P   DPSF +WD QNSMVMAWL+NSM++ I E ++ Y T K +W+A+T+A+SD ++S+Q+F+LR+++R+LRQ E  VTQY+SS
Subjt:  IIRGCSRLGYINGTIAEPDEADPSFSVWDAQNSMVMAWLINSMEEDIKESFIFYSTTKDLWNALTMAFSDFDNSTQLFELRNKARSLRQGESDVTQYYSS

Query:  LRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLRPELDDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMGDTP-TKPLSLSLESLALEAR
        L +LW ELDL     W  + + E +RK + KER Y+FL GL P LDDVRGR+L+ KP+P++D IF+EVR E  R+R+M+G  P     S++ ++ A+ AR
Subjt:  LRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLRPELDDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMGDTP-TKPLSLSLESLALEAR

Query:  GPPPPSSRSTRRNNLWCDHCKRTNHTKDRCWELHGRPQRKNLSGDYRPPPPTNTPPSRTSSSGYQVGPSVSNSQDLATSLPL-FSKAQLEQLYRLLTPPV
            P SR  R+  LWCDHC R +HTK+ CW+LHG+P       D+        P S+  S G +  P V+   D A+S  L F++AQL+Q+ +L +   
Subjt:  GPPPPSSRSTRRNNLWCDHCKRTNHTKDRCWELHGRPQRKNLSGDYRPPPPTNTPPSRTSSSGYQVGPSVSNSQDLATSLPL-FSKAQLEQLYRLLTPPV

Query:  ESTPSSSFVAQRGIFSAALTSQQHSDQ-WILDSGATDHMTAFHDMFTLYSPNPIQTHVKLADGSSAIIKGFGSVILSPNITLHSVL--------------
            SS+ +   GI S+ +     S + WI+DSGA++HM++   +F+ YS       V LADGS + + G G+V LSPN+ LH+VL              
Subjt:  ESTPSSSFVAQRGIFSAALTSQQHSDQ-WILDSGATDHMTAFHDMFTLYSPNPIQTHVKLADGSSAIIKGFGSVILSPNITLHSVL--------------

Query:  ------------------HDLITGTTIGNADGFEGLYYF--RGPSLRNKQVLQGETEPITSSLDGNFWEIDDLNTRIESPQSKIPEIDGLNTESPQPKIL
                           D  +G TIG A    GLYYF      +R  QV +G      S       +I  L+ R+  P+S        N +S +  I 
Subjt:  ------------------HDLITGTTIGNADGFEGLYYF--RGPSLRNKQVLQGETEPITSSLDGNFWEIDDLNTRIESPQSKIPEIDGLNTESPQPKIL

Query:  IPIIPITQIEESVPIISCNNEDDQVNPNRSDK-QPETLVYSRRQTVQRGVEPPQPQQQSHESISSLGIEQSTLVPQDNTNDLDLPIALMK
                 +++   ++ N       PN S   +P  L Y RR+ V    +P  P    H S SS G + + L PQ     LD+PIAL K
Subjt:  IPIIPITQIEESVPIISCNNEDDQVNPNRSDK-QPETLVYSRRQTVQRGVEPPQPQQQSHESISSLGIEQSTLVPQDNTNDLDLPIALMK

SwissProt top hits (e value, % identity, alignment)
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 2.2e-07; % identity: 21.65)
Query:  GTIAEPDEADPSFSVWDAQNSMVMAWLINSMEEDIKESFIFYSTTKDLWNALTMAFSDFDNSTQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCL
        GT A P   +P ++ W  Q+ ++ + ++ ++   ++ +    +T   +W  L   +++  +   + +LR + +   +G   +  Y   L   + +L L L
Subjt:  GTIAEPDEADPSFSVWDAQNSMVMAWLINSMEEDIKESFIFYSTTKDLWNALTMAFSDFDNSTQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCL

Query:  NLEWENSKDAERFRKHVEKERIYDFLAGLRPELDDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMGDTPTKPLSLSLESLALEARGPPPPSSRSTRRN
            ++ +  ER  +++ +E         +P +D    ++ A    P + EI          +R++  ++    +S S   + + A      ++ +T  N
Subjt:  NLEWENSKDAERFRKHVEKERIYDFLAGLRPELDDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMGDTPTKPLSLSLESLALEARGPPPPSSRSTRRN

Query:  NLWCDHCKRTNHTKDRCWELHGRPQRKNLSGDYRPPPPTNTPPSRTSSSGYQVGPSVSNSQDLATSLPLFSKAQLEQLYRLLTPPVESTPSSSFVAQRGI
        N    +  R N   +R    + +P +++          TN  P+   S  Y     +   Q         S  +  QL   L+      P S F   +  
Subjt:  NLWCDHCKRTNHTKDRCWELHGRPQRKNLSGDYRPPPPTNTPPSRTSSSGYQVGPSVSNSQDLATSLPLFSKAQLEQLYRLLTPPVESTPSSSFVAQRGI

Query:  FSAALTSQQHSDQWILDSGATDHMTAFHDMFTLYSPNPIQTHVKLADGSSAIIKGFGSVILSPN---ITLHSVLH------DLITGTTIGNADGFEGLYY
         + AL S   S+ W+LDSGAT H+T+  +  +L+ P      V +ADGS+  I   GS  LS     + LH++L+      +LI+   + NA+G    ++
Subjt:  FSAALTSQQHSDQWILDSGATDHMTAFHDMFTLYSPNPIQTHVKLADGSSAIIKGFGSVILSPN---ITLHSVLH------DLITGTTIGNADGFEGLYY

Query:  ---FRGPSLR-NKQVLQGETE------PITSSLDGNFWEIDDLNTRIESPQSKIPEID-GLNTESPQPKILIPII---PITQIEESVPIISCNNEDDQVN
           F+   L     +LQG+T+      PI SS   + +          SP SK            P P IL  +I    ++ +  S   +SC++      
Subjt:  ---FRGPSLR-NKQVLQGETE------PITSSLDGNFWEIDDLNTRIESPQSKIPEID-GLNTESPQPKILIPII---PITQIEESVPIISCNNEDDQVN

Query:  PNRSDKQP
         N+S+K P
Subjt:  PNRSDKQP

Arabidopsis top hits (e value, % identity, alignment)
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). (e value: 2.8e-21; % identity: 32.07)
Query:  VRKDEIIRGCSRLGYINGTIAEPDEADPSFSVWDAQNSMVMAWLINSMEEDIKESFIFYSTTKDLWNALTMAFSDFDNSTQLFELRNKARSLRQGESDVT
        +R    +R   + G+I+GT+ +PD   P +  W+  N+MVM WL+NSM + + ES ++  T   +W  L   F    +  ++++LR +  +LRQG   V 
Subjt:  VRKDEIIRGCSRLGYINGTIAEPDEADPSFSVWDAQNSMVMAWLINSMEEDIKESFIFYSTTKDLWNALTMAFSDFDNSTQLFELRNKARSLRQGESDVT

Query:  QYYSSLRRLWAELD--------LCLNLEWENSKDAERFRKHVEKERIYDFLAGLR--PELDDVRGRLLATKPIPAIDEIFAEVR
        +Y+  L ++W EL          C     E +K AE  R   EKE+ Y+FL GL+     + V  +++  KP P++ E FA V+
Subjt:  QYYSSLRRLWAELD--------LCLNLEWENSKDAERFRKHVEKERIYDFLAGLR--PELDDVRGRLLATKPIPAIDEIFAEVR


Sequences
CDS sequence
ATGGTTAAAACAGCCATGGTGACTGATGTGCGCAAGGATGAGATTATTCGTGGCTGCAGTCGACTCGGGTACATCAATGGCACGATAGCAGAACCAGATGAAGCTGACCC
TTCTTTTTCTGTGTGGGATGCACAAAACTCAATGGTCATGGCATGGCTCATTAACTCGATGGAGGAGGACATTAAAGAATCCTTCATCTTCTACTCAACAACAAAGGATC
TTTGGAATGCGCTCACTATGGCTTTTTCTGATTTTGATAACTCGACTCAATTGTTTGAATTACGCAATAAGGCACGTTCCTTACGACAAGGTGAATCCGATGTCACCCAA
TACTACAGCTCATTACGTAGGTTGTGGGCTGAACTTGATTTATGTCTCAATCTCGAATGGGAGAACTCGAAGGATGCTGAACGCTTTCGCAAACACGTCGAGAAGGAACG
AATTTATGATTTTCTTGCAGGTCTTCGTCCGGAATTAGATGATGTGCGTGGCCGCTTACTTGCCACAAAGCCAATCCCAGCCATTGACGAAATCTTCGCAGAAGTTCGCT
GGGAGTCAAGCCGTAAACGTGTAATGATGGGTGATACACCCACAAAACCTCTGTCCCTCTCACTGGAATCATTAGCTCTGGAGGCACGAGGTCCACCACCACCTTCATCC
CGATCTACTCGTCGGAACAACCTATGGTGTGATCATTGTAAGCGCACCAACCATACAAAAGATCGGTGTTGGGAACTCCATGGTCGTCCTCAACGGAAGAATTTATCAGG
GGATTATCGGCCACCTCCACCTACAAATACTCCACCTTCTCGAACCAGCTCCTCCGGCTATCAAGTGGGCCCTAGTGTGTCAAACTCCCAAGATTTGGCCACATCTCTCC
CTCTATTTTCGAAGGCACAACTTGAACAGCTCTATCGCCTCTTAACACCGCCGGTTGAGTCTACTCCTTCGTCAAGTTTTGTGGCACAACGAGGTATTTTTAGTGCAGCT
TTAACAAGTCAGCAGCATTCCGATCAGTGGATCTTAGATTCGGGTGCAACTGATCATATGACCGCTTTTCATGATATGTTTACCCTGTACTCACCCAACCCGATTCAGAC
ACATGTCAAGCTTGCAGATGGGTCATCGGCCATTATTAAGGGCTTTGGTTCTGTTATTCTTAGCCCAAACATTACATTGCACTCAGTGCTCCATGACTTGATAACGGGAA
CGACGATTGGCAATGCTGATGGCTTTGAAGGGCTCTACTACTTCAGAGGACCAAGTCTAAGAAATAAACAAGTTCTCCAGGGGGAGACTGAGCCTATTACAAGTAGTCTT
GATGGAAATTTTTGGGAGATTGACGATCTAAATACCAGAATTGAGTCCCCTCAGTCAAAAATACCTGAGATTGATGGTCTAAATACCGAGTCACCTCAGCCAAAAATACT
TATCCCGATCATCCCAATTACTCAGATAGAAGAGTCAGTTCCTATTATTTCCTGTAATAATGAAGATGATCAAGTCAACCCAAATCGAAGTGACAAGCAACCTGAGACTC
TTGTTTATTCTCGGCGACAAACGGTTCAAAGAGGAGTGGAGCCACCACAGCCTCAACAGCAAAGTCATGAATCCATCTCGTCCTTAGGTATTGAACAATCTACCCTTGTG
CCTCAAGACAATACTAATGATCTTGATCTTCCTATTGCACTTATGAAGGGTGTGAGATTGTGTACTCAACATCCAATTGCTCGATGTATTGGTTATAGTCATTTATCTTC
AGCAGTTCAGACCTTGGCTCTCAATCTGGAAGCCAATGAGGGAAGTTAG
mRNA sequence
ATGGTTAAAACAGCCATGGTGACTGATGTGCGCAAGGATGAGATTATTCGTGGCTGCAGTCGACTCGGGTACATCAATGGCACGATAGCAGAACCAGATGAAGCTGACCC
TTCTTTTTCTGTGTGGGATGCACAAAACTCAATGGTCATGGCATGGCTCATTAACTCGATGGAGGAGGACATTAAAGAATCCTTCATCTTCTACTCAACAACAAAGGATC
TTTGGAATGCGCTCACTATGGCTTTTTCTGATTTTGATAACTCGACTCAATTGTTTGAATTACGCAATAAGGCACGTTCCTTACGACAAGGTGAATCCGATGTCACCCAA
TACTACAGCTCATTACGTAGGTTGTGGGCTGAACTTGATTTATGTCTCAATCTCGAATGGGAGAACTCGAAGGATGCTGAACGCTTTCGCAAACACGTCGAGAAGGAACG
AATTTATGATTTTCTTGCAGGTCTTCGTCCGGAATTAGATGATGTGCGTGGCCGCTTACTTGCCACAAAGCCAATCCCAGCCATTGACGAAATCTTCGCAGAAGTTCGCT
GGGAGTCAAGCCGTAAACGTGTAATGATGGGTGATACACCCACAAAACCTCTGTCCCTCTCACTGGAATCATTAGCTCTGGAGGCACGAGGTCCACCACCACCTTCATCC
CGATCTACTCGTCGGAACAACCTATGGTGTGATCATTGTAAGCGCACCAACCATACAAAAGATCGGTGTTGGGAACTCCATGGTCGTCCTCAACGGAAGAATTTATCAGG
GGATTATCGGCCACCTCCACCTACAAATACTCCACCTTCTCGAACCAGCTCCTCCGGCTATCAAGTGGGCCCTAGTGTGTCAAACTCCCAAGATTTGGCCACATCTCTCC
CTCTATTTTCGAAGGCACAACTTGAACAGCTCTATCGCCTCTTAACACCGCCGGTTGAGTCTACTCCTTCGTCAAGTTTTGTGGCACAACGAGGTATTTTTAGTGCAGCT
TTAACAAGTCAGCAGCATTCCGATCAGTGGATCTTAGATTCGGGTGCAACTGATCATATGACCGCTTTTCATGATATGTTTACCCTGTACTCACCCAACCCGATTCAGAC
ACATGTCAAGCTTGCAGATGGGTCATCGGCCATTATTAAGGGCTTTGGTTCTGTTATTCTTAGCCCAAACATTACATTGCACTCAGTGCTCCATGACTTGATAACGGGAA
CGACGATTGGCAATGCTGATGGCTTTGAAGGGCTCTACTACTTCAGAGGACCAAGTCTAAGAAATAAACAAGTTCTCCAGGGGGAGACTGAGCCTATTACAAGTAGTCTT
GATGGAAATTTTTGGGAGATTGACGATCTAAATACCAGAATTGAGTCCCCTCAGTCAAAAATACCTGAGATTGATGGTCTAAATACCGAGTCACCTCAGCCAAAAATACT
TATCCCGATCATCCCAATTACTCAGATAGAAGAGTCAGTTCCTATTATTTCCTGTAATAATGAAGATGATCAAGTCAACCCAAATCGAAGTGACAAGCAACCTGAGACTC
TTGTTTATTCTCGGCGACAAACGGTTCAAAGAGGAGTGGAGCCACCACAGCCTCAACAGCAAAGTCATGAATCCATCTCGTCCTTAGGTATTGAACAATCTACCCTTGTG
CCTCAAGACAATACTAATGATCTTGATCTTCCTATTGCACTTATGAAGGGTGTGAGATTGTGTACTCAACATCCAATTGCTCGATGTATTGGTTATAGTCATTTATCTTC
AGCAGTTCAGACCTTGGCTCTCAATCTGGAAGCCAATGAGGGAAGTTAG
Protein sequence
MVKTAMVTDVRKDEIIRGCSRLGYINGTIAEPDEADPSFSVWDAQNSMVMAWLINSMEEDIKESFIFYSTTKDLWNALTMAFSDFDNSTQLFELRNKARSLRQGESDVTQ
YYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLRPELDDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMGDTPTKPLSLSLESLALEARGPPPPSS
RSTRRNNLWCDHCKRTNHTKDRCWELHGRPQRKNLSGDYRPPPPTNTPPSRTSSSGYQVGPSVSNSQDLATSLPLFSKAQLEQLYRLLTPPVESTPSSSFVAQRGIFSAA
LTSQQHSDQWILDSGATDHMTAFHDMFTLYSPNPIQTHVKLADGSSAIIKGFGSVILSPNITLHSVLHDLITGTTIGNADGFEGLYYFRGPSLRNKQVLQGETEPITSSL
DGNFWEIDDLNTRIESPQSKIPEIDGLNTESPQPKILIPIIPITQIEESVPIISCNNEDDQVNPNRSDKQPETLVYSRRQTVQRGVEPPQPQQQSHESISSLGIEQSTLV
PQDNTNDLDLPIALMKGVRLCTQHPIARCIGYSHLSSAVQTLALNLEANEGS