; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0013178 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0013178
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationchr10:24308616..24310056
RNA-Seq ExpressionIVF0013178
SyntenyIVF0013178
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0016310 - phosphorylation (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0016301 - kinase activity (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR029472 - Retrotransposon Copia-like, N-terminal
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035612.1 No apical meristem (NAM) protein [Cucumis melo var. makuwa]2.73e-11158.22Show/hide
Query:  LNPYFIHHSLGPTAAIVTQPLTGAINYTSWSRAMLMAISGRNKAGFITGKIQKPSDGVLLDAWICNNDILASWILNSVSKEIAASIIYIGSIKEIWDELR
        +NPYFIHHSLGPTAAIV QPLTGAINY SWSRAMLMAISGRNKAGFITGKIQKPSDGVLLDAWICNNDILASWILNSVSKEIAASIIY GS         
Subjt:  LNPYFIHHSLGPTAAIVTQPLTGAINYTSWSRAMLMAISGRNKAGFITGKIQKPSDGVLLDAWICNNDILASWILNSVSKEIAASIIYIGSIKEIWDELR

Query:  QRFKQSNGPSIYQLRKEFVTLRQGNLTIETYYTKLKTIWQNLNEYRFTNDCTCGGLKPFIDHLESEYIMAFLMGLNDSYAAVRAQILLMQPLPSINTVFS
             +N P                     Y T           + F   C                                                S
Subjt:  QRFKQSNGPSIYQLRKEFVTLRQGNLTIETYYTKLKTIWQNLNEYRFTNDCTCGGLKPFIDHLESEYIMAFLMGLNDSYAAVRAQILLMQPLPSINTVFS

Query:  LLIQEEQQRSAGILTPPIDPVAL----NIASSMAISTDRNRKKERPTCSYCGIKGHIADKCYKKHGYPPGYKPRNSNSITTAPDTSKTNNVANTNSAAAN
        LLIQEEQQRSAGILTPPIDPVAL    NIA +MAISTDRNRKKERPTC YCGIKGHIADKCYKKHGYPPGYKPRNSN IT   +TSKTN VANTNS AAN
Subjt:  LLIQEEQQRSAGILTPPIDPVAL----NIASSMAISTDRNRKKERPTCSYCGIKGHIADKCYKKHGYPPGYKPRNSNSITTAPDTSKTNNVANTNSAAAN

Query:  -----------QQYSQLMTLLNNHLQAATTAPITTATAITHTSGSFP-------TNDDW
                   +QYSQLMTLLNNHLQAA TAPIT  T ITHT   F        ++D+W
Subjt:  -----------QQYSQLMTLLNNHLQAATTAPITTATAITHTSGSFP-------TNDDW

KAA0065480.1 Cysteine-rich RLK (receptor-like protein kinase) 8 [Cucumis melo var. makuwa]1.69e-24693.68Show/hide
Query:  MENQSSTNGSRFDPLTAANSQNDAQLNPYFIHHSLGPTAAIVTQPLTGAINYTSWSRAMLMAISGRNKAGFITGKIQKPSDGVLLDAWICNNDILASWIL
        MENQSSTNGSRFDPLTAANSQNDAQLNPYFIHHSLGPTAAIVTQPLTGAINYTSWSRAMLMAISGRNKAGFITGKIQKPSDGVLLDAWICNNDILASWIL
Subjt:  MENQSSTNGSRFDPLTAANSQNDAQLNPYFIHHSLGPTAAIVTQPLTGAINYTSWSRAMLMAISGRNKAGFITGKIQKPSDGVLLDAWICNNDILASWIL

Query:  NSVSKEIAASIIYIGSIKEIWDELRQRFKQSNGPSIYQLRKEFVTLRQGNLTIETYYTKLKTIWQNLNEYRFTNDCTCGGLKPFIDHLESEYIMAFLMGL
        NSVSKEIAASIIYIGSIKEIWDELRQRFKQSNGPSIYQLRKEFVTLRQGNLTIETYYTKLKTIWQNLNEYRFTNDCTCGGLKPFIDHLESEYIMAFLMGL
Subjt:  NSVSKEIAASIIYIGSIKEIWDELRQRFKQSNGPSIYQLRKEFVTLRQGNLTIETYYTKLKTIWQNLNEYRFTNDCTCGGLKPFIDHLESEYIMAFLMGL

Query:  NDSYAAVRAQILLMQPLPSINTVFSLLIQEEQQRSAGILTPPIDPVALNIASSMAISTDRNRKKERPTCSYCGIKGHIADKCYKKHGYPPGYKPRNSNSI
        NDSYAAVRAQILLMQPLPSINTVFSLLIQEEQQRSAGILTPPIDPVALNIASSMAISTDRNRKKERPTCSYCGIKGHIADKCYKKHGYPPGYKPRNSNSI
Subjt:  NDSYAAVRAQILLMQPLPSINTVFSLLIQEEQQRSAGILTPPIDPVALNIASSMAISTDRNRKKERPTCSYCGIKGHIADKCYKKHGYPPGYKPRNSNSI

Query:  TTAPDTSKTNNVANTNSAAAN-----------QQYSQLMTLLNNHLQAATTAPITTATAITHTSGSFP-------TNDDW
        TTAPDTSKTNNVANTNSAAAN           +QYSQLMTLLNNHLQAATTAPITTATAITHTSG F        ++D+W
Subjt:  TTAPDTSKTNNVANTNSAAAN-----------QQYSQLMTLLNNHLQAATTAPITTATAITHTSGSFP-------TNDDW

KAF5444131.1 hypothetical protein F2P56_036633 [Juglans regia]1.60e-9344.44Show/hide
Query:  NPYFIHHSLGPTAAIVTQPLTGAINYTSWSRAMLMAISGRNKAGFITGKIQKPSD-GVLLDAWICNNDILASWILNSVSKEIAASIIYIGSIKEIWDELR
        +PY++H+   P + +VTQ L G  NY +W R+M+MA++ +NK GF+ G I KPSD   L   W   N+++ SWILNS+SKEIAAS+IY+ S  ++W +L+
Subjt:  NPYFIHHSLGPTAAIVTQPLTGAINYTSWSRAMLMAISGRNKAGFITGKIQKPSD-GVLLDAWICNNDILASWILNSVSKEIAASIIYIGSIKEIWDELR

Query:  QRFKQSNGPSIYQLRKEFVTLRQGNLTIETYYTKLKTIWQNLNEYRFTNDCTCGGLKPFIDHLESEYIMAFLMGLNDSYAAVRAQILLMQPLPSINTVFS
        +RF QSNGP IYQL+K   +L Q +L++  Y+T++K +W  L  YR    C+CGGL+  +D  + +Y+M FLMGLNDS++ +R QILL+ PLP IN VFS
Subjt:  QRFKQSNGPSIYQLRKEFVTLRQGNLTIETYYTKLKTIWQNLNEYRFTNDCTCGGLKPFIDHLESEYIMAFLMGLNDSYAAVRAQILLMQPLPSINTVFS

Query:  LLIQEEQQRSAG-ILTPPIDPVALNIASSMAISTDRNRKKERPTCSYCGIKGHIADKCYKKHGYPPGYKPRNSNSITTAPDTSKTNNVANTNSA--AANQ
         ++QEE+QR  G IL P I   AL+ A+     T  + +K+RP CS+C + GH  +KC+K H YPPGY+ +   S    P +S    VA+++S      Q
Subjt:  LLIQEEQQRSAG-ILTPPIDPVALNIASSMAISTDRNRKKERPTCSYCGIKGHIADKCYKKHGYPPGYKPRNSNSITTAPDTSKTNNVANTNSA--AANQ

Query:  QYSQLMTLLNNHLQAATTAPITTATAITHTSGSFPTNDDWQG
        QY QL+ LLN+      TA +   + I   SG  P  +DW+G
Subjt:  QYSQLMTLLNNHLQAATTAPITTATAITHTSGSFPTNDDWQG

XP_020221438.1 uncharacterized protein LOC109804079 [Cajanus cajan]6.39e-9346.57Show/hide
Query:  DAQLNPYFIHHSLGPTAAIVTQPLTGAINYTSWSRAMLMAISGRNKAGFITGKIQKP-SDGVLLDAWICNNDILASWILNSVSKEIAASIIYIGSIKEIW
        D   NPYF+H S  P A IV+QPL G  NY SWSRA+LMA+  +NK GF+ G I KP        +W  NN+I+ASW+LN +SK++ AS+IY  S   IW
Subjt:  DAQLNPYFIHHSLGPTAAIVTQPLTGAINYTSWSRAMLMAISGRNKAGFITGKIQKP-SDGVLLDAWICNNDILASWILNSVSKEIAASIIYIGSIKEIW

Query:  DELRQRFKQSNGPSIYQLRKEFVTLRQGNLTIETYYTKLKTIWQNLNEYRFTNDCTCGGLKPFIDHLESEYIMAFLMGLNDSYAAVRAQILLMQPLPSIN
        ++LR RF+Q NGP ++QLR++ VTL+QG+L I  Y+TK+K +W+ L EY+ ++ CTCGG+KP+IDH +SEY M FLMGLN+ Y+ +R QILLM P+P I 
Subjt:  DELRQRFKQSNGPSIYQLRKEFVTLRQGNLTIETYYTKLKTIWQNLNEYRFTNDCTCGGLKPFIDHLESEYIMAFLMGLNDSYAAVRAQILLMQPLPSIN

Query:  TVFSLLIQEEQQRSAGILTPPID-PVALNIASSM-AISTDRNRKKERPTCSYCGIKGHIADKCYKKHGYPPGYKPRNSNSITTAPDTSKTNNVANTNSAA
          FSL++QEE+Q+  GI T   D P A    S   A S   N  KERP C +CG  GHI DKC+K HGYP   K  NSN+          N V++ ++ A
Subjt:  TVFSLLIQEEQQRSAGILTPPID-PVALNIASSM-AISTDRNRKKERPTCSYCGIKGHIADKCYKKHGYPPGYKPRNSNSITTAPDTSKTNNVANTNSAA

Query:  ---ANQQYSQLMTLLNNHLQA--ATTAPITTATAITHTSGSFPTNDDWQG
              QY Q+++LL N   +    + PI         +G  P  DDW G
Subjt:  ---ANQQYSQLMTLLNNHLQA--ATTAPITTATAITHTSGSFPTNDDWQG

XP_022145891.1 uncharacterized protein LOC111015239 [Momordica charantia]3.04e-12551.83Show/hide
Query:  TAANSQNDAQLNPYFIHHSLGPTAAIVTQPLTGAINYTSWSRAMLMAISGRNKAGFITGKIQKPSDGVLLDAWICNNDILASWILNSVSKEIAASIIYIG
        TA  +  ++QLNPY IHHS  PT  +VTQ L GA NY SW R+ML+A+SG+NK GFI G I+KP+ G LL AW CNNDI+ SWI+NSVSKEIAASIIY G
Subjt:  TAANSQNDAQLNPYFIHHSLGPTAAIVTQPLTGAINYTSWSRAMLMAISGRNKAGFITGKIQKPSDGVLLDAWICNNDILASWILNSVSKEIAASIIYIG

Query:  SIKEIWDELRQRFKQSNGPSIYQLRKEFVTLRQGNLTIETYYTKLKTIWQNLNEYRFTNDCTCGGLKPFIDHLESEYIMAFLMGLNDSYAAVRAQILLMQ
        S K+IWDEL++RF+QS+ P I+QLRKE VT  QG L+IE YYTKLKT+WQ L +YR T DCTC GLK   +  +SEY+M FLMGLN+SYA +RAQILLM 
Subjt:  SIKEIWDELRQRFKQSNGPSIYQLRKEFVTLRQGNLTIETYYTKLKTIWQNLNEYRFTNDCTCGGLKPFIDHLESEYIMAFLMGLNDSYAAVRAQILLMQ

Query:  PLPSINTVFSLLIQEEQQRSAGILTPPIDPVALNIA--SSMAISTDRNRKKERPTCSYCGIKGHIADKCYKKHGYPPGYKPRNSNSITTAPDTSKTNNVA
        P+P +N VFSLLIQEE+QR+ G + PP+  +A+ +A  S    +T   RK  R  C++CG++GH+ DKCYK HGYPPGY  R +N           N  +
Subjt:  PLPSINTVFSLLIQEEQQRSAGILTPPIDPVALNIA--SSMAISTDRNRKKERPTCSYCGIKGHIADKCYKKHGYPPGYKPRNSNSITTAPDTSKTNNVA

Query:  NTNSAAANQ-------------------------------QYSQLMTLLNNHLQAATTAPITTATAITHTSGSFPTNDDWQG
        ++N   ANQ                               QYSQLM +L +HLQAA    IT    + H +G     +DWQG
Subjt:  NTNSAAANQ-------------------------------QYSQLMTLLNNHLQAATTAPITTATAITHTSGSFPTNDDWQG

TrEMBL top hitse value%identityAlignment
A0A5A7VE66 Cysteine-rich RLK (Receptor-like protein kinase) 88.1e-19793.68Show/hide
Query:  MENQSSTNGSRFDPLTAANSQNDAQLNPYFIHHSLGPTAAIVTQPLTGAINYTSWSRAMLMAISGRNKAGFITGKIQKPSDGVLLDAWICNNDILASWIL
        MENQSSTNGSRFDPLTAANSQNDAQLNPYFIHHSLGPTAAIVTQPLTGAINYTSWSRAMLMAISGRNKAGFITGKIQKPSDGVLLDAWICNNDILASWIL
Subjt:  MENQSSTNGSRFDPLTAANSQNDAQLNPYFIHHSLGPTAAIVTQPLTGAINYTSWSRAMLMAISGRNKAGFITGKIQKPSDGVLLDAWICNNDILASWIL

Query:  NSVSKEIAASIIYIGSIKEIWDELRQRFKQSNGPSIYQLRKEFVTLRQGNLTIETYYTKLKTIWQNLNEYRFTNDCTCGGLKPFIDHLESEYIMAFLMGL
        NSVSKEIAASIIYIGSIKEIWDELRQRFKQSNGPSIYQLRKEFVTLRQGNLTIETYYTKLKTIWQNLNEYRFTNDCTCGGLKPFIDHLESEYIMAFLMGL
Subjt:  NSVSKEIAASIIYIGSIKEIWDELRQRFKQSNGPSIYQLRKEFVTLRQGNLTIETYYTKLKTIWQNLNEYRFTNDCTCGGLKPFIDHLESEYIMAFLMGL

Query:  NDSYAAVRAQILLMQPLPSINTVFSLLIQEEQQRSAGILTPPIDPVALNIASSMAISTDRNRKKERPTCSYCGIKGHIADKCYKKHGYPPGYKPRNSNSI
        NDSYAAVRAQILLMQPLPSINTVFSLLIQEEQQRSAGILTPPIDPVALNIASSMAISTDRNRKKERPTCSYCGIKGHIADKCYKKHGYPPGYKPRNSNSI
Subjt:  NDSYAAVRAQILLMQPLPSINTVFSLLIQEEQQRSAGILTPPIDPVALNIASSMAISTDRNRKKERPTCSYCGIKGHIADKCYKKHGYPPGYKPRNSNSI

Query:  TTAPDTSKTNNVANTNSAAAN-----------QQYSQLMTLLNNHLQAATTAPITTATAITHTSGSF-------PTNDDW
        TTAPDTSKTNNVANTNSAAAN           +QYSQLMTLLNNHLQAATTAPITTATAITHTSG F        ++D+W
Subjt:  TTAPDTSKTNNVANTNSAAAN-----------QQYSQLMTLLNNHLQAATTAPITTATAITHTSGSF-------PTNDDW

A0A5D3E5P0 No apical meristem (NAM) protein1.0e-9058.22Show/hide
Query:  LNPYFIHHSLGPTAAIVTQPLTGAINYTSWSRAMLMAISGRNKAGFITGKIQKPSDGVLLDAWICNNDILASWILNSVSKEIAASIIYIGSIKEIWDELR
        +NPYFIHHSLGPTAAIV QPLTGAINY SWSRAMLMAISGRNKAGFITGKIQKPSDGVLLDAWICNNDILASWILNSVSKEIAASIIY GS         
Subjt:  LNPYFIHHSLGPTAAIVTQPLTGAINYTSWSRAMLMAISGRNKAGFITGKIQKPSDGVLLDAWICNNDILASWILNSVSKEIAASIIYIGSIKEIWDELR

Query:  QRFKQSNGPSIYQLRKEFVTLRQGNLTIETYYTKLKTIWQNLNEYRFTNDCTCGGLKPFIDHLESEYIMAFLMGLNDSYAAVRAQILLMQPLPSINTVFS
             +N P                     Y T           + F   C                                                S
Subjt:  QRFKQSNGPSIYQLRKEFVTLRQGNLTIETYYTKLKTIWQNLNEYRFTNDCTCGGLKPFIDHLESEYIMAFLMGLNDSYAAVRAQILLMQPLPSINTVFS

Query:  LLIQEEQQRSAGILTPPIDPVAL----NIASSMAISTDRNRKKERPTCSYCGIKGHIADKCYKKHGYPPGYKPRNSNSITTAPDTSKTNNVANTNSAAAN
        LLIQEEQQRSAGILTPPIDPVAL    NIA +MAISTDRNRKKERPTC YCGIKGHIADKCYKKHGYPPGYKPRNSN IT   +TSKTN VANTNS AAN
Subjt:  LLIQEEQQRSAGILTPPIDPVAL----NIASSMAISTDRNRKKERPTCSYCGIKGHIADKCYKKHGYPPGYKPRNSNSITTAPDTSKTNNVANTNSAAAN

Query:  -----------QQYSQLMTLLNNHLQAATTAPITTATAITHTSGSF-------PTNDDW
                   +QYSQLMTLLNNHLQAA TAPIT  T ITHT   F        ++D+W
Subjt:  -----------QQYSQLMTLLNNHLQAATTAPITTATAITHTSGSF-------PTNDDW

A0A5J5B2C5 Uncharacterized protein7.8e-7544.16Show/hide
Query:  MENQSSTNGSRFDPLTAAN-SQNDAQLNPYFIHHSLGPTAAIVTQPLTGAINYTSWSRAMLMAISGRNKAGFITGKIQKP--SDGVLLDAWICNNDILAS
        M   SS + S     T +N S  +   NPY++HHS  P   +V+Q LTG  NYT+WSRAML+A+S +NK GF+ G I +P  +D  LLD+WI NN+I+ S
Subjt:  MENQSSTNGSRFDPLTAAN-SQNDAQLNPYFIHHSLGPTAAIVTQPLTGAINYTSWSRAMLMAISGRNKAGFITGKIQKP--SDGVLLDAWICNNDILAS

Query:  WILNSVSKEIAASIIYIGSIKEIWDELRQRFKQSNGPSIYQLRKEFVTLRQGNLTIETYYTKLKTIWQNLNEYR---FTNDCTCGGLKPFIDHLESEYIM
        WILNS+SKEI+ASII+    +EIW +LR RF+Q NGP I+QL++E + LRQ   ++  Y+TK+KTIW+ L+ YR       C CGG+K   D+ ++EYIM
Subjt:  WILNSVSKEIAASIIYIGSIKEIWDELRQRFKQSNGPSIYQLRKEFVTLRQGNLTIETYYTKLKTIWQNLNEYR---FTNDCTCGGLKPFIDHLESEYIM

Query:  AFLMGLNDSYAAVRAQILLMQPLPSINTVFSLLIQEEQQRSAGILTPPIDP-------VALNIASS----------MAISTDRNRKKERPTCSYCGIKGH
        +FLMGL+DS++ V  Q+LLM  +P IN VFSL++QEEQQR   + +   +        V  ++A S             S  +N+K++RP C++C I GH
Subjt:  AFLMGLNDSYAAVRAQILLMQPLPSINTVFSLLIQEEQQRSAGILTPPIDP-------VALNIASS----------MAISTDRNRKKERPTCSYCGIKGH

Query:  IADKCYKKHGYPPGYKPRNSNS-------ITTAPDTSKTNN-----VANTNSAAANQQYSQLMTLLNNHLQAA---TTAPITTAT
          D+CYK HGYPPGYK R++N+       ++T+ D S  +N     V N NS     QY QLM++L+ HL ++   T AP  + T
Subjt:  IADKCYKKHGYPPGYKPRNSNS-------ITTAPDTSKTNN-----VANTNSAAANQQYSQLMTLLNNHLQAA---TTAPITTAT

A0A6J1CXR2 uncharacterized protein LOC1110152391.9e-10052.11Show/hide
Query:  TAANSQNDAQLNPYFIHHSLGPTAAIVTQPLTGAINYTSWSRAMLMAISGRNKAGFITGKIQKPSDGVLLDAWICNNDILASWILNSVSKEIAASIIYIG
        TA  +  ++QLNPY IHHS  PT  +VTQ L GA NY SW R+ML+A+SG+NK GFI G I+KP +G LL AW CNNDI+ SWI+NSVSKEIAASIIY G
Subjt:  TAANSQNDAQLNPYFIHHSLGPTAAIVTQPLTGAINYTSWSRAMLMAISGRNKAGFITGKIQKPSDGVLLDAWICNNDILASWILNSVSKEIAASIIYIG

Query:  SIKEIWDELRQRFKQSNGPSIYQLRKEFVTLRQGNLTIETYYTKLKTIWQNLNEYRFTNDCTCGGLKPFIDHLESEYIMAFLMGLNDSYAAVRAQILLMQ
        S K+IWDEL++RF+QS+ P I+QLRKE VT  QG L+IE YYTKLKT+WQ L +YR T DCTC GLK   +  +SEY+M FLMGLN+SYA +RAQILLM 
Subjt:  SIKEIWDELRQRFKQSNGPSIYQLRKEFVTLRQGNLTIETYYTKLKTIWQNLNEYRFTNDCTCGGLKPFIDHLESEYIMAFLMGLNDSYAAVRAQILLMQ

Query:  PLPSINTVFSLLIQEEQQRSAGILTPPIDPVALNIA--SSMAISTDRNRKKERPTCSYCGIKGHIADKCYKKHGYPPGYKPRN----------------S
        P+P +N VFSLLIQEE+QR+ G + PP+  +A+ +A  S    +T   RK  R  C++CG++GH+ DKCYK HGYPPGY+  N                S
Subjt:  PLPSINTVFSLLIQEEQQRSAGILTPPIDPVALNIA--SSMAISTDRNRKKERPTCSYCGIKGHIADKCYKKHGYPPGYKPRN----------------S

Query:  NSITTAPDTSKTNNVAN-------TNSAAA------NQQYSQLMTLLNNHLQAATTAPITTATAITHTSGSFPTNDDWQG
        N +     + K  ++ +       +NS+ A      + QYSQLM +L +HLQAA      T T + H +G     +DWQG
Subjt:  NSITTAPDTSKTNNVAN-------TNSAAA------NQQYSQLMTLLNNHLQAATTAPITTATAITHTSGSFPTNDDWQG

A0A7J0FKC9 Haloacid dehalogenase-like hydrolase (HAD) superfamily protein1.1e-7642.67Show/hide
Query:  DAQLNPYFIHHSLGPTAAIVTQPLTGAINYTSWSRAMLMAISGRNKAGFITGKIQKP--SDGVLLDAWICNNDILASWILNSVSKEIAASIIYIGSIKEI
        D   +PYF+HHS GP   +V+Q LTG  NY SW+RAM++A+S +NK GFI G I KP  +D  LL++WI NN+++ SWILNSVSKEI+ASII+  S  EI
Subjt:  DAQLNPYFIHHSLGPTAAIVTQPLTGAINYTSWSRAMLMAISGRNKAGFITGKIQKP--SDGVLLDAWICNNDILASWILNSVSKEIAASIIYIGSIKEI

Query:  WDELRQRFKQSNGPSIYQLRKEFVTLRQGNLTIETYYTKLKTIWQNLNEYR---FTNDCTCGGLKPFIDHLESEYIMAFLMGLNDSYAAVRAQILLMQPL
        W +L+ RF+QSNGP I+QLR+E +   Q    +  Y+TKLKTIW+ LN YR      +CTCGG+K    H + EYIM+FLM L+ S+A +R Q+LLM PL
Subjt:  WDELRQRFKQSNGPSIYQLRKEFVTLRQGNLTIETYYTKLKTIWQNLNEYR---FTNDCTCGGLKPFIDHLESEYIMAFLMGLNDSYAAVRAQILLMQPL

Query:  PSINTVFSLLIQEEQQRSAGI----LTPPIDPVALNIAS--------------------SMAISTDRNRKKERPTCSYCGIKGHIADKCYKKHGYPPGYK
        P IN VFSL+ QEE QR  GI    ++   D +A  I +                    +   ++ + +KK+R  C++C   GH  +KCYK+HGYPPG+K
Subjt:  PSINTVFSLLIQEEQQRSAGI----LTPPIDPVALNIAS--------------------SMAISTDRNRKKERPTCSYCGIKGHIADKCYKKHGYPPGYK

Query:  PRNSNSITTAPDTSKTNNVANTNSAAA------------------NQQYSQLMTLLNNHLQAATTAPITTATAITHTSGSFPTNDDWQG
        PR+ ++ TT+   +  N V+N + + +                  + QY QLM +L+NH+ A++        + ++T+G     DDWQG
Subjt:  PRNSNSITTAPDTSKTNNVANTNSAAA------------------NQQYSQLMTLLNNHLQAATTAPITTATAITHTSGSFPTNDDWQG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).3.7e-2430.88Show/hide
Query:  NPYF----IHHSLGPTAAIVTQPLTGAINYTSWSRAMLMAISGRNKAGFITGKIQKPSD-GVLLDAWICNNDILASWILNSVSKEIAASIIYIGSIKEIW
        +PY+    IHH   P+   + +      NY +W       +    K GFI G + KP     L   W   N ++  W++NS++ ++  S++Y  +  ++W
Subjt:  NPYF----IHHSLGPTAAIVTQPLTGAINYTSWSRAMLMAISGRNKAGFITGKIQKPSD-GVLLDAWICNNDILASWILNSVSKEIAASIIYIGSIKEIW

Query:  DELRQRFKQSNGPSIYQLRKEFVTLRQGNLTIETYYTKLKTIWQNLNEYRFTNDCTCGG-----LKPFIDHLESEYIMAFLMG--LNDSYAAVRAQILLM
        ++LR+ F       IYQLR+   TLRQG  ++E Y+ KL  +W  L+EY    +C CGG      K   +  E E    FLMG  LN  + AV  +I+  
Subjt:  DELRQRFKQSNGPSIYQLRKEFVTLRQGNLTIETYYTKLKTIWQNLNEYRFTNDCTCGG-----LKPFIDHLESEYIMAFLMG--LNDSYAAVRAQILLM

Query:  QPLPSINTVFSLLIQEE
        +P PS++  F+++   E
Subjt:  QPLPSINTVFSLLIQEE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGAACCAATCCAGCACAAATGGAAGTCGATTCGATCCACTTACTGCTGCGAACTCCCAAAATGATGCCCAACTCAATCCCTACTTCATTCATCATTCGCTTGGTCC
AACGGCTGCAATAGTCACGCAACCTTTGACCGGAGCTATCAACTACACCTCATGGAGTCGTGCGATGTTAATGGCGATTTCTGGACGAAATAAGGCCGGGTTCATCACCG
GAAAAATCCAGAAACCTTCTGATGGCGTCTTACTCGATGCCTGGATCTGCAACAATGATATTCTAGCCTCATGGATTCTCAACTCTGTTTCAAAGGAAATTGCAGCAAGC
ATTATCTACATAGGATCAATTAAAGAAATATGGGATGAATTACGCCAAAGATTCAAACAATCAAATGGTCCCAGCATATACCAACTTCGAAAAGAATTTGTCACGTTGCG
GCAAGGAAACCTGACAATTGAAACATACTACACAAAACTCAAAACCATATGGCAGAATCTAAATGAATACCGATTTACAAATGACTGCACATGTGGAGGTTTGAAACCAT
TCATCGATCATCTTGAATCTGAATATATTATGGCCTTCCTGATGGGATTAAATGATTCCTATGCTGCTGTAAGAGCACAAATCCTCCTTATGCAACCTTTACCTTCAATC
AACACTGTATTTTCTTTGCTGATTCAAGAAGAACAACAAAGATCTGCTGGCATTTTAACCCCTCCCATTGATCCTGTGGCTTTAAATATTGCTTCATCCATGGCTATCTC
AACTGATCGAAATCGCAAAAAAGAGCGCCCTACTTGCTCCTACTGTGGAATTAAGGGACACATAGCAGACAAATGTTACAAGAAACATGGATATCCTCCAGGTTACAAGC
CAAGAAACTCAAACTCCATTACTACTGCACCAGATACCTCAAAAACCAACAATGTTGCCAATACCAATTCAGCTGCTGCCAATCAACAATACAGTCAATTGATGACTCTA
CTCAACAACCATCTCCAAGCAGCCACTACAGCACCTATCACTACGGCAACTGCCATAACCCACACTTCAGGATCTTTCCCGACCAATGATGATTGGCAAGGCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAGAACCAATCCAGCACAAATGGAAGTCGATTCGATCCACTTACTGCTGCGAACTCCCAAAATGATGCCCAACTCAATCCCTACTTCATTCATCATTCGCTTGGTCC
AACGGCTGCAATAGTCACGCAACCTTTGACCGGAGCTATCAACTACACCTCATGGAGTCGTGCGATGTTAATGGCGATTTCTGGACGAAATAAGGCCGGGTTCATCACCG
GAAAAATCCAGAAACCTTCTGATGGCGTCTTACTCGATGCCTGGATCTGCAACAATGATATTCTAGCCTCATGGATTCTCAACTCTGTTTCAAAGGAAATTGCAGCAAGC
ATTATCTACATAGGATCAATTAAAGAAATATGGGATGAATTACGCCAAAGATTCAAACAATCAAATGGTCCCAGCATATACCAACTTCGAAAAGAATTTGTCACGTTGCG
GCAAGGAAACCTGACAATTGAAACATACTACACAAAACTCAAAACCATATGGCAGAATCTAAATGAATACCGATTTACAAATGACTGCACATGTGGAGGTTTGAAACCAT
TCATCGATCATCTTGAATCTGAATATATTATGGCCTTCCTGATGGGATTAAATGATTCCTATGCTGCTGTAAGAGCACAAATCCTCCTTATGCAACCTTTACCTTCAATC
AACACTGTATTTTCTTTGCTGATTCAAGAAGAACAACAAAGATCTGCTGGCATTTTAACCCCTCCCATTGATCCTGTGGCTTTAAATATTGCTTCATCCATGGCTATCTC
AACTGATCGAAATCGCAAAAAAGAGCGCCCTACTTGCTCCTACTGTGGAATTAAGGGACACATAGCAGACAAATGTTACAAGAAACATGGATATCCTCCAGGTTACAAGC
CAAGAAACTCAAACTCCATTACTACTGCACCAGATACCTCAAAAACCAACAATGTTGCCAATACCAATTCAGCTGCTGCCAATCAACAATACAGTCAATTGATGACTCTA
CTCAACAACCATCTCCAAGCAGCCACTACAGCACCTATCACTACGGCAACTGCCATAACCCACACTTCAGGATCTTTCCCGACCAATGATGATTGGCAAGGCTAG
Protein sequenceShow/hide protein sequence
MENQSSTNGSRFDPLTAANSQNDAQLNPYFIHHSLGPTAAIVTQPLTGAINYTSWSRAMLMAISGRNKAGFITGKIQKPSDGVLLDAWICNNDILASWILNSVSKEIAAS
IIYIGSIKEIWDELRQRFKQSNGPSIYQLRKEFVTLRQGNLTIETYYTKLKTIWQNLNEYRFTNDCTCGGLKPFIDHLESEYIMAFLMGLNDSYAAVRAQILLMQPLPSI
NTVFSLLIQEEQQRSAGILTPPIDPVALNIASSMAISTDRNRKKERPTCSYCGIKGHIADKCYKKHGYPPGYKPRNSNSITTAPDTSKTNNVANTNSAAANQQYSQLMTL
LNNHLQAATTAPITTATAITHTSGSFPTNDDWQG