; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc01g04060 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc01g04060
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionGypsy retrotransposon integrase-like protein 1
Genome locationchr1:2653916..2658466
RNA-Seq ExpressionMoc01g04060
SyntenyMoc01g04060
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily
IPR041588 - Integrase zinc-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022144033.1 uncharacterized protein LOC111013825 [Momordica charantia]3.5e-11660.24Show/hide
Query:  PYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSSRHYNRKTATHLATIRQKEGGTLREYVT----
        PYDGSKDPKDYVEV E LM+FQAA D IKC AFQIALTGS                              HY+RKTATHLA I+QK+GGTLREYVT    
Subjt:  PYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSSRHYNRKTATHLATIRQKEGGTLREYVT----

Query:  --------SHNTSSCWEL--------------------------------------------------KRQIEDLIQDGYFKKFVGKP---------RSN
                S +++ C+ L                                                  K   ++  + G   K  G           RSN
Subjt:  --------SHNTSSCWEL--------------------------------------------------KRQIEDLIQDGYFKKFVGKP---------RSN

Query:  SVEKKEERKRSRMPPRPDDRPAVINTILRGPSGGQSGNKRKELVREARREVCIIREQRPTSFITFNDADLEGVHLPHNDAFVIAPLIDHVLVRRVLVDGG
        SVEKKEERKRSR PPR DDRPAVINTI  GPSGGQ GNKRKEL REARREVCIIREQR TS ITF+D DLEGVHLPHNDA VIAPLIDHVLVRRVLVDGG
Subjt:  SVEKKEERKRSRMPPRPDDRPAVINTILRGPSGGQSGNKRKELVREARREVCIIREQRPTSFITFNDADLEGVHLPHNDAFVIAPLIDHVLVRRVLVDGG

Query:  ASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVSPEGCIDLPVTIGQDDTQVSQMVEFVVIDGRSAYNTIFWRPIIHSFRAVPSTLHQVLKYSTPNG
        ASANILSLPTYL LGWTRSQLKKSPTPLVGFSGES    GCI+L V IGQDDTQ +QM EFVVI GRSAY  IF RPIIHSFR VPSTL+QVLKYSTPNG
Subjt:  ASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVSPEGCIDLPVTIGQDDTQVSQMVEFVVIDGRSAYNTIFWRPIIHSFRAVPSTLHQVLKYSTPNG

Query:  VGMVRREQKTSRECYASPLK
        VG V+ EQKTSREC AS LK
Subjt:  VGMVRREQKTSRECYASPLK

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]3.5e-12455.69Show/hide
Query:  YDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRR------------------------------------------------------
        YDGSKDPKDYVEVFEGLMDFQAA+DAIKCRAFQIALTGSARLW++                                                       
Subjt:  YDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRR------------------------------------------------------

Query:  ------------------------LPARSISTYSQLRKEFISQFS----SRHYNRKTATHLATI---------------------------RQKEGGTLR
                                L ++   ++S  R EF    +    SR Y R T T +                              R K+     
Subjt:  ------------------------LPARSISTYSQLRKEFISQFS----SRHYNRKTATHLATI---------------------------RQKEGGTLR

Query:  EYVTSHNTSSCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRMPPRPDDRPAVINTILRGPSGGQSGNKRKELVREARREVCIIREQRPTSFI
             HNTS  WELKRQIEDLIQD YFKKFVGKPR++S EKKEERK SR P R  DRPAVINTI  GPSGGQSG+KRKEL R ARREVCIIREQRPT  I
Subjt:  EYVTSHNTSSCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRMPPRPDDRPAVINTILRGPSGGQSGNKRKELVREARREVCIIREQRPTSFI

Query:  TFNDADLEGVHLPHNDAFVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVSPEGCIDLPVTIGQDDTQVSQMVEFVV
        TF+ ADLE VHLPHNDA VIAPLIDHV+VRRVLVD G SANI+SL TYLALGWTRSQLKKS TPLVGFS ESV PEGCIDLPVT+G D TQV+QM EFVV
Subjt:  TFNDADLEGVHLPHNDAFVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVSPEGCIDLPVTIGQDDTQVSQMVEFVV

Query:  IDGRSAYNTIFWRPIIHSFRAVPSTLHQVLKYSTPNGVGMVRREQKTSRECYASPLKRSILVEILDNLSILEPDLMKVNTPALSWMDPTVEL
        IDGRSAYN IF RPIIHSFRA+PSTLHQVLKYSTPNGVGMVR EQ  SRECYAS LK S +  +   +S       K N P   +  PT EL
Subjt:  IDGRSAYNTIFWRPIIHSFRAVPSTLHQVLKYSTPNGVGMVRREQKTSRECYASPLKRSILVEILDNLSILEPDLMKVNTPALSWMDPTVEL

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]1.9e-14158.83Show/hide
Query:  PYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSSRHYNRKTATHLATIRQKEGGTLREYVT----
        PYDGSKDPKDYVEVFE LMDFQAATDAIKC AFQIALTGSARLWYRRLPAR ISTYSQLRKEFISQFSSRHY+RKT THLATIRQKEG TLREYVT    
Subjt:  PYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSSRHYNRKTATHLATIRQKEGGTLREYVT----

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------------------SHNTSSCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEE
                                                                     HNTS+ WELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEE
Subjt:  ------------------------------------------------------------SHNTSSCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEE

Query:  RKRSRMPPRPDDRPAVINTILRGPSGGQSGNKRKELVREARREVCIIREQRPTSFITFNDADLEGVHLPHNDAFVIAPLIDHVLVRRVLVDGGASANILS
        RKR R PPR DDRPAVI             NK+KEL REARREVCIIREQRPTS I FN ADLEGVHLPHNDA VIAPLID VLVRR+LVDGGASANILS
Subjt:  RKRSRMPPRPDDRPAVINTILRGPSGGQSGNKRKELVREARREVCIIREQRPTSFITFNDADLEGVHLPHNDAFVIAPLIDHVLVRRVLVDGGASANILS

Query:  LPTYLALGWTRSQLKKSPTPLVGFSGESVSPEGCIDLPVTIGQDDTQVSQMVEFVVIDGRSAYNTIFWRPIIHSFRAVPSTLHQVLKYSTPNGVGMVRRE
        L TYLALGWTRSQLKKSPTPLVGFSGES+S EGCIDLPV+I QDDTQV+QM EFVVIDGRSAYN IF RPIIHSFRAVPSTLHQVLKYST NGVG VR E
Subjt:  LPTYLALGWTRSQLKKSPTPLVGFSGESVSPEGCIDLPVTIGQDDTQVSQMVEFVVIDGRSAYNTIFWRPIIHSFRAVPSTLHQVLKYSTPNGVGMVRRE

Query:  QKTSRECYASPLKRS
         KTSRECYAS  KRS
Subjt:  QKTSRECYASPLKRS

XP_022155139.1 uncharacterized protein LOC111022280 [Momordica charantia]2.8e-15370.34Show/hide
Query:  PYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSSRHYNRKTATHLATIRQKEGGTL---------
        PYDGSKDPKDYVEVFEGLMDFQAATDAIKC AFQIALTGSARLW RRLPARSISTYSQLRKEFI QFS RHY+RKTATHLATIRQKE  TL         
Subjt:  PYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSSRHYNRKTATHLATIRQKEGGTL---------

Query:  ----------------------------------------------------------REYVTSHNTSS-------CWELKRQIEDLIQDGYFKKFVGKP
                                                                   EY  S +  S       CWELKRQIEDLIQD YFKKFVGKP
Subjt:  ----------------------------------------------------------REYVTSHNTSS-------CWELKRQIEDLIQDGYFKKFVGKP

Query:  RSNSVEKKEERKRSRMPPRPDDRPAVINTILRGPSGGQSGNKRKELVREARREVCIIREQRPTSFITFNDADLEGVHLPHNDAFVIAPLIDHVLVRRVLV
        RSNSVEKKEERKRSR PPR +DRPAVINTI  GPSGGQ  NKRKEL  EARR+V IIREQ+PT  ITF D DLEGVHLPHNDA VIAPLIDHVLVRRVLV
Subjt:  RSNSVEKKEERKRSRMPPRPDDRPAVINTILRGPSGGQSGNKRKELVREARREVCIIREQRPTSFITFNDADLEGVHLPHNDAFVIAPLIDHVLVRRVLV

Query:  DGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVSPEGCIDLPVTIGQDDTQVSQMVEFVVIDGRSAYNTIFWRPIIHSFRAVPSTLHQVLKYST
        DGGASANILSLPTYLAL  TRSQLKKSPTPLVGFS ESVSPEGCIDLPVTIGQD TQV+QM EFVVIDGR AYN IF RPIIHSF+AVPS LHQVLKYST
Subjt:  DGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVSPEGCIDLPVTIGQDDTQVSQMVEFVVIDGRSAYNTIFWRPIIHSFRAVPSTLHQVLKYST

Query:  PNGVGMVRREQKTSRECYASPLKRSILVEILDNLS
        PNGVG VR EQKTSRECYAS LKRS +  + +  S
Subjt:  PNGVGMVRREQKTSRECYASPLKRSILVEILDNLS

XP_022157474.1 uncharacterized protein LOC111024166 [Momordica charantia]8.4e-11890.04Show/hide
Query:  HNTSSCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRMPPRPDDRPAVINTILRGPSGGQSGNKRKELVREARREVCIIREQRPTSFITFNDA
        HNTSSCWELKRQIEDLIQD YFKKFVGKPRSN  EKKEERKRSR PPR DDRP VINTI  GPSGGQSGNKRKEL REARREVCIIREQ+PT  ITF DA
Subjt:  HNTSSCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRMPPRPDDRPAVINTILRGPSGGQSGNKRKELVREARREVCIIREQRPTSFITFNDA

Query:  DLEGVHLPHNDAFVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVSPEGCIDLPVTIGQDDTQVSQMVEFVVIDGRS
        DLEGVHLPHNDA VIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFS ESVSPEGCIDLP+TIGQD TQV+QM EFVVIDGRS
Subjt:  DLEGVHLPHNDAFVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVSPEGCIDLPVTIGQDDTQVSQMVEFVVIDGRS

Query:  AYNTIFWRPIIHSFRAVPSTLHQVLKYSTPNGVGMVRREQK
        AYN IF RPIIHSFRAVPSTLHQVLKYSTPNGVGMVR  +K
Subjt:  AYNTIFWRPIIHSFRAVPSTLHQVLKYSTPNGVGMVRREQK

TrEMBL top hitse value%identityAlignment
A0A6J1CS66 uncharacterized protein LOC1110138251.7e-11660.24Show/hide
Query:  PYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSSRHYNRKTATHLATIRQKEGGTLREYVT----
        PYDGSKDPKDYVEV E LM+FQAA D IKC AFQIALTGS                              HY+RKTATHLA I+QK+GGTLREYVT    
Subjt:  PYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSSRHYNRKTATHLATIRQKEGGTLREYVT----

Query:  --------SHNTSSCWEL--------------------------------------------------KRQIEDLIQDGYFKKFVGKP---------RSN
                S +++ C+ L                                                  K   ++  + G   K  G           RSN
Subjt:  --------SHNTSSCWEL--------------------------------------------------KRQIEDLIQDGYFKKFVGKP---------RSN

Query:  SVEKKEERKRSRMPPRPDDRPAVINTILRGPSGGQSGNKRKELVREARREVCIIREQRPTSFITFNDADLEGVHLPHNDAFVIAPLIDHVLVRRVLVDGG
        SVEKKEERKRSR PPR DDRPAVINTI  GPSGGQ GNKRKEL REARREVCIIREQR TS ITF+D DLEGVHLPHNDA VIAPLIDHVLVRRVLVDGG
Subjt:  SVEKKEERKRSRMPPRPDDRPAVINTILRGPSGGQSGNKRKELVREARREVCIIREQRPTSFITFNDADLEGVHLPHNDAFVIAPLIDHVLVRRVLVDGG

Query:  ASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVSPEGCIDLPVTIGQDDTQVSQMVEFVVIDGRSAYNTIFWRPIIHSFRAVPSTLHQVLKYSTPNG
        ASANILSLPTYL LGWTRSQLKKSPTPLVGFSGES    GCI+L V IGQDDTQ +QM EFVVI GRSAY  IF RPIIHSFR VPSTL+QVLKYSTPNG
Subjt:  ASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVSPEGCIDLPVTIGQDDTQVSQMVEFVVIDGRSAYNTIFWRPIIHSFRAVPSTLHQVLKYSTPNG

Query:  VGMVRREQKTSRECYASPLK
        VG V+ EQKTSREC AS LK
Subjt:  VGMVRREQKTSRECYASPLK

A0A6J1D9E1 uncharacterized protein LOC1110188231.7e-12455.69Show/hide
Query:  YDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRR------------------------------------------------------
        YDGSKDPKDYVEVFEGLMDFQAA+DAIKCRAFQIALTGSARLW++                                                       
Subjt:  YDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRR------------------------------------------------------

Query:  ------------------------LPARSISTYSQLRKEFISQFS----SRHYNRKTATHLATI---------------------------RQKEGGTLR
                                L ++   ++S  R EF    +    SR Y R T T +                              R K+     
Subjt:  ------------------------LPARSISTYSQLRKEFISQFS----SRHYNRKTATHLATI---------------------------RQKEGGTLR

Query:  EYVTSHNTSSCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRMPPRPDDRPAVINTILRGPSGGQSGNKRKELVREARREVCIIREQRPTSFI
             HNTS  WELKRQIEDLIQD YFKKFVGKPR++S EKKEERK SR P R  DRPAVINTI  GPSGGQSG+KRKEL R ARREVCIIREQRPT  I
Subjt:  EYVTSHNTSSCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRMPPRPDDRPAVINTILRGPSGGQSGNKRKELVREARREVCIIREQRPTSFI

Query:  TFNDADLEGVHLPHNDAFVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVSPEGCIDLPVTIGQDDTQVSQMVEFVV
        TF+ ADLE VHLPHNDA VIAPLIDHV+VRRVLVD G SANI+SL TYLALGWTRSQLKKS TPLVGFS ESV PEGCIDLPVT+G D TQV+QM EFVV
Subjt:  TFNDADLEGVHLPHNDAFVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVSPEGCIDLPVTIGQDDTQVSQMVEFVV

Query:  IDGRSAYNTIFWRPIIHSFRAVPSTLHQVLKYSTPNGVGMVRREQKTSRECYASPLKRSILVEILDNLSILEPDLMKVNTPALSWMDPTVEL
        IDGRSAYN IF RPIIHSFRA+PSTLHQVLKYSTPNGVGMVR EQ  SRECYAS LK S +  +   +S       K N P   +  PT EL
Subjt:  IDGRSAYNTIFWRPIIHSFRAVPSTLHQVLKYSTPNGVGMVRREQKTSRECYASPLKRSILVEILDNLSILEPDLMKVNTPALSWMDPTVEL

A0A6J1DHB3 uncharacterized protein LOC1110204799.0e-14258.83Show/hide
Query:  PYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSSRHYNRKTATHLATIRQKEGGTLREYVT----
        PYDGSKDPKDYVEVFE LMDFQAATDAIKC AFQIALTGSARLWYRRLPAR ISTYSQLRKEFISQFSSRHY+RKT THLATIRQKEG TLREYVT    
Subjt:  PYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSSRHYNRKTATHLATIRQKEGGTLREYVT----

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------------------SHNTSSCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEE
                                                                     HNTS+ WELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEE
Subjt:  ------------------------------------------------------------SHNTSSCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEE

Query:  RKRSRMPPRPDDRPAVINTILRGPSGGQSGNKRKELVREARREVCIIREQRPTSFITFNDADLEGVHLPHNDAFVIAPLIDHVLVRRVLVDGGASANILS
        RKR R PPR DDRPAVI             NK+KEL REARREVCIIREQRPTS I FN ADLEGVHLPHNDA VIAPLID VLVRR+LVDGGASANILS
Subjt:  RKRSRMPPRPDDRPAVINTILRGPSGGQSGNKRKELVREARREVCIIREQRPTSFITFNDADLEGVHLPHNDAFVIAPLIDHVLVRRVLVDGGASANILS

Query:  LPTYLALGWTRSQLKKSPTPLVGFSGESVSPEGCIDLPVTIGQDDTQVSQMVEFVVIDGRSAYNTIFWRPIIHSFRAVPSTLHQVLKYSTPNGVGMVRRE
        L TYLALGWTRSQLKKSPTPLVGFSGES+S EGCIDLPV+I QDDTQV+QM EFVVIDGRSAYN IF RPIIHSFRAVPSTLHQVLKYST NGVG VR E
Subjt:  LPTYLALGWTRSQLKKSPTPLVGFSGESVSPEGCIDLPVTIGQDDTQVSQMVEFVVIDGRSAYNTIFWRPIIHSFRAVPSTLHQVLKYSTPNGVGMVRRE

Query:  QKTSRECYASPLKRS
         KTSRECYAS  KRS
Subjt:  QKTSRECYASPLKRS

A0A6J1DPC9 uncharacterized protein LOC1110222801.3e-15370.34Show/hide
Query:  PYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSSRHYNRKTATHLATIRQKEGGTL---------
        PYDGSKDPKDYVEVFEGLMDFQAATDAIKC AFQIALTGSARLW RRLPARSISTYSQLRKEFI QFS RHY+RKTATHLATIRQKE  TL         
Subjt:  PYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSSRHYNRKTATHLATIRQKEGGTL---------

Query:  ----------------------------------------------------------REYVTSHNTSS-------CWELKRQIEDLIQDGYFKKFVGKP
                                                                   EY  S +  S       CWELKRQIEDLIQD YFKKFVGKP
Subjt:  ----------------------------------------------------------REYVTSHNTSS-------CWELKRQIEDLIQDGYFKKFVGKP

Query:  RSNSVEKKEERKRSRMPPRPDDRPAVINTILRGPSGGQSGNKRKELVREARREVCIIREQRPTSFITFNDADLEGVHLPHNDAFVIAPLIDHVLVRRVLV
        RSNSVEKKEERKRSR PPR +DRPAVINTI  GPSGGQ  NKRKEL  EARR+V IIREQ+PT  ITF D DLEGVHLPHNDA VIAPLIDHVLVRRVLV
Subjt:  RSNSVEKKEERKRSRMPPRPDDRPAVINTILRGPSGGQSGNKRKELVREARREVCIIREQRPTSFITFNDADLEGVHLPHNDAFVIAPLIDHVLVRRVLV

Query:  DGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVSPEGCIDLPVTIGQDDTQVSQMVEFVVIDGRSAYNTIFWRPIIHSFRAVPSTLHQVLKYST
        DGGASANILSLPTYLAL  TRSQLKKSPTPLVGFS ESVSPEGCIDLPVTIGQD TQV+QM EFVVIDGR AYN IF RPIIHSF+AVPS LHQVLKYST
Subjt:  DGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVSPEGCIDLPVTIGQDDTQVSQMVEFVVIDGRSAYNTIFWRPIIHSFRAVPSTLHQVLKYST

Query:  PNGVGMVRREQKTSRECYASPLKRSILVEILDNLS
        PNGVG VR EQKTSRECYAS LKRS +  + +  S
Subjt:  PNGVGMVRREQKTSRECYASPLKRSILVEILDNLS

A0A6J1DWK7 uncharacterized protein LOC1110241664.1e-11890.04Show/hide
Query:  HNTSSCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRMPPRPDDRPAVINTILRGPSGGQSGNKRKELVREARREVCIIREQRPTSFITFNDA
        HNTSSCWELKRQIEDLIQD YFKKFVGKPRSN  EKKEERKRSR PPR DDRP VINTI  GPSGGQSGNKRKEL REARREVCIIREQ+PT  ITF DA
Subjt:  HNTSSCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRMPPRPDDRPAVINTILRGPSGGQSGNKRKELVREARREVCIIREQRPTSFITFNDA

Query:  DLEGVHLPHNDAFVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVSPEGCIDLPVTIGQDDTQVSQMVEFVVIDGRS
        DLEGVHLPHNDA VIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFS ESVSPEGCIDLP+TIGQD TQV+QM EFVVIDGRS
Subjt:  DLEGVHLPHNDAFVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVSPEGCIDLPVTIGQDDTQVSQMVEFVVIDGRS

Query:  AYNTIFWRPIIHSFRAVPSTLHQVLKYSTPNGVGMVRREQK
        AYN IF RPIIHSFRAVPSTLHQVLKYSTPNGVGMVR  +K
Subjt:  AYNTIFWRPIIHSFRAVPSTLHQVLKYSTPNGVGMVRREQK

SwissProt top hitse value%identityAlignment
Q8K259 Gypsy retrotransposon integrase-like protein 13.8e-0434.74Show/hide
Query:  SKEHKKMARKATRFILRDGPLYRRGFSLPLLKSLSGKVVRQGYYWPTVEQDAKQFVKTCDNCQHFAN-IIHQPPELLTPISAPWPLA---YHGPF
        S+E KK   K  R    +GP    G S  L       +V  GYYW +V  D KQ+V  C +CQ   N +I  P + L  +  PW +      GPF
Subjt:  SKEHKKMARKATRFILRDGPLYRRGFSLPLLKSLSGKVVRQGYYWPTVEQDAKQFVKTCDNCQHFAN-IIHQPPELLTPISAPWPLA---YHGPF

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGCGTCGTATGCCGCCGTGAGGACGCCGTTCGTCGTGCGCCGCCGCGAGAAGTGCTGCAGGACGTTGCAGAAACGGTTGCGTCGCCGTGGGTCACCATCGGGTCCGT
GAGCCGCGCGCTGCAGGCGTCGTTGGAGGGTGCCGCTGCTGCTGTCGAAGACCGCCGCAACTGCTTGGTTGGATTCCGCCGTCGTGCTGTCATAGACCGCTGCTACTGCC
GCCCTGTGGCTTCGTCGGAGTGTGCCGCCGCCGGAGAAGAAGAAGGTGCCAGCTGGCCCTATGATGGGTCTAAGGACCCAAAAGACTATGTTGAGGTCTTCGAAGGCCTC
ATGGATTTTCAAGCGGCAACAGATGCAATCAAGTGCCGCGCTTTTCAGATCGCACTCACTGGCAGCGCGCGCCTGTGGTATCGAAGACTGCCGGCCAGGTCAATCTCGAC
CTACTCCCAGCTGAGGAAAGAATTTATTAGTCAATTCTCTTCTCGGCACTATAATAGAAAAACAGCGACTCACCTCGCCACCATCAGGCAGAAAGAAGGTGGAACGCTGA
GGGAATATGTCACAAGCCATAACACATCAAGTTGTTGGGAACTTAAGCGCCAGATTGAAGACCTCATTCAAGATGGCTACTTCAAAAAGTTCGTGGGCAAACCGAGGTCC
AACTCGGTAGAAAAAAAGGAAGAGAGGAAGCGTTCGAGGATGCCACCTCGTCCAGATGACCGACCTGCGGTCATCAACACTATTTTAAGAGGCCCAAGTGGGGGCCAATC
TGGGAATAAAAGAAAAGAGCTAGTTCGCGAAGCCAGGCGCGAGGTGTGCATTATTAGGGAGCAGAGACCGACCTCCTTCATTACCTTCAACGACGCCGACTTGGAAGGGG
TCCATTTGCCCCATAATGACGCGTTCGTGATCGCTCCTCTTATCGACCACGTCCTGGTCCGAAGAGTACTGGTAGATGGAGGCGCATCTGCCAACATCCTGTCCCTCCCA
ACATATCTCGCCTTGGGATGGACCAGGTCACAGCTGAAGAAGAGTCCGACGCCCTTGGTTGGATTCTCTGGAGAATCGGTCTCTCCAGAAGGGTGTATTGACTTGCCGGT
CACGATTGGGCAAGATGATACACAAGTAAGTCAGATGGTCGAGTTCGTCGTGATTGACGGCAGGTCGGCCTACAACACCATCTTTTGGAGACCCATCATCCATTCGTTCC
GGGCTGTTCCCTCAACACTTCATCAAGTCCTGAAGTACTCAACTCCTAATGGAGTGGGCATGGTCCGAAGAGAGCAGAAAACTTCAAGGGAGTGCTACGCCTCCCCGCTC
AAAAGGTCGATCCTAGTCGAGATTCTGGACAATCTGTCAATCTTGGAGCCAGATTTGATGAAGGTTAACACTCCAGCACTCTCGTGGATGGATCCAACCGTGGAGCTTAT
CAAAGGGAGTCCACCACAAGATTCAAAGGAGCACAAGAAAATGGCGCGAAAAGCAACTCGATTCATACTCCGAGATGGACCATTGTACCGACGTGGCTTCTCCCTACCTC
TGCTTAAGTCGTTATCGGGCAAGGTGGTTCGACAAGGGTACTATTGGCCCACTGTGGAGCAGGATGCAAAGCAGTTTGTGAAAACCTGCGATAACTGCCAGCACTTTGCA
AATATCATCCATCAGCCTCCCGAACTACTCACCCCCATCTCGGCCCCATGGCCATTGGCATATCACGGTCCTTTTCCCTAA
mRNA sequenceShow/hide mRNA sequence
ATGCGCGTCGTATGCCGCCGTGAGGACGCCGTTCGTCGTGCGCCGCCGCGAGAAGTGCTGCAGGACGTTGCAGAAACGGTTGCGTCGCCGTGGGTCACCATCGGGTCCGT
GAGCCGCGCGCTGCAGGCGTCGTTGGAGGGTGCCGCTGCTGCTGTCGAAGACCGCCGCAACTGCTTGGTTGGATTCCGCCGTCGTGCTGTCATAGACCGCTGCTACTGCC
GCCCTGTGGCTTCGTCGGAGTGTGCCGCCGCCGGAGAAGAAGAAGGTGCCAGCTGGCCCTATGATGGGTCTAAGGACCCAAAAGACTATGTTGAGGTCTTCGAAGGCCTC
ATGGATTTTCAAGCGGCAACAGATGCAATCAAGTGCCGCGCTTTTCAGATCGCACTCACTGGCAGCGCGCGCCTGTGGTATCGAAGACTGCCGGCCAGGTCAATCTCGAC
CTACTCCCAGCTGAGGAAAGAATTTATTAGTCAATTCTCTTCTCGGCACTATAATAGAAAAACAGCGACTCACCTCGCCACCATCAGGCAGAAAGAAGGTGGAACGCTGA
GGGAATATGTCACAAGCCATAACACATCAAGTTGTTGGGAACTTAAGCGCCAGATTGAAGACCTCATTCAAGATGGCTACTTCAAAAAGTTCGTGGGCAAACCGAGGTCC
AACTCGGTAGAAAAAAAGGAAGAGAGGAAGCGTTCGAGGATGCCACCTCGTCCAGATGACCGACCTGCGGTCATCAACACTATTTTAAGAGGCCCAAGTGGGGGCCAATC
TGGGAATAAAAGAAAAGAGCTAGTTCGCGAAGCCAGGCGCGAGGTGTGCATTATTAGGGAGCAGAGACCGACCTCCTTCATTACCTTCAACGACGCCGACTTGGAAGGGG
TCCATTTGCCCCATAATGACGCGTTCGTGATCGCTCCTCTTATCGACCACGTCCTGGTCCGAAGAGTACTGGTAGATGGAGGCGCATCTGCCAACATCCTGTCCCTCCCA
ACATATCTCGCCTTGGGATGGACCAGGTCACAGCTGAAGAAGAGTCCGACGCCCTTGGTTGGATTCTCTGGAGAATCGGTCTCTCCAGAAGGGTGTATTGACTTGCCGGT
CACGATTGGGCAAGATGATACACAAGTAAGTCAGATGGTCGAGTTCGTCGTGATTGACGGCAGGTCGGCCTACAACACCATCTTTTGGAGACCCATCATCCATTCGTTCC
GGGCTGTTCCCTCAACACTTCATCAAGTCCTGAAGTACTCAACTCCTAATGGAGTGGGCATGGTCCGAAGAGAGCAGAAAACTTCAAGGGAGTGCTACGCCTCCCCGCTC
AAAAGGTCGATCCTAGTCGAGATTCTGGACAATCTGTCAATCTTGGAGCCAGATTTGATGAAGGTTAACACTCCAGCACTCTCGTGGATGGATCCAACCGTGGAGCTTAT
CAAAGGGAGTCCACCACAAGATTCAAAGGAGCACAAGAAAATGGCGCGAAAAGCAACTCGATTCATACTCCGAGATGGACCATTGTACCGACGTGGCTTCTCCCTACCTC
TGCTTAAGTCGTTATCGGGCAAGGTGGTTCGACAAGGGTACTATTGGCCCACTGTGGAGCAGGATGCAAAGCAGTTTGTGAAAACCTGCGATAACTGCCAGCACTTTGCA
AATATCATCCATCAGCCTCCCGAACTACTCACCCCCATCTCGGCCCCATGGCCATTGGCATATCACGGTCCTTTTCCCTAA
Protein sequenceShow/hide protein sequence
MRVVCRREDAVRRAPPREVLQDVAETVASPWVTIGSVSRALQASLEGAAAAVEDRRNCLVGFRRRAVIDRCYCRPVASSECAAAGEEEGASWPYDGSKDPKDYVEVFEGL
MDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSSRHYNRKTATHLATIRQKEGGTLREYVTSHNTSSCWELKRQIEDLIQDGYFKKFVGKPRS
NSVEKKEERKRSRMPPRPDDRPAVINTILRGPSGGQSGNKRKELVREARREVCIIREQRPTSFITFNDADLEGVHLPHNDAFVIAPLIDHVLVRRVLVDGGASANILSLP
TYLALGWTRSQLKKSPTPLVGFSGESVSPEGCIDLPVTIGQDDTQVSQMVEFVVIDGRSAYNTIFWRPIIHSFRAVPSTLHQVLKYSTPNGVGMVRREQKTSRECYASPL
KRSILVEILDNLSILEPDLMKVNTPALSWMDPTVELIKGSPPQDSKEHKKMARKATRFILRDGPLYRRGFSLPLLKSLSGKVVRQGYYWPTVEQDAKQFVKTCDNCQHFA
NIIHQPPELLTPISAPWPLAYHGPFP