; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0005140 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0005140
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr6:11119663..11126104
RNA-Seq ExpressionLag0005140
SyntenyLag0005140
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0065480.1 Cysteine-rich RLK (receptor-like protein kinase) 8 [Cucumis melo var. makuwa]1.6e-9747.83Show/hide
Query:  GSSSSSSSISTASIEAQLNPYFLHHSFGSASVLVSQPLLGAISITHPGAVPCVWSSLGRKNLASSTTRFRNQRKMALCLKPGNATMI-------YIVASW
        GS     + + +  +AQLNPYF+HHS G  + +V+QPL GAI+ T        WS   R  L + + R +         KP +  ++        I+ASW
Subjt:  GSSSSSSSISTASIEAQLNPYFLHHSFGSASVLVSQPLLGAISITHPGAVPCVWSSLGRKNLASSTTRFRNQRKMALCLKPGNATMI-------YIVASW

Query:  ILNSISKEIAASIVYTGSVKAIWDELRDRFKQSNGPSIYQLCKDLVTLRQGTMSVGTYYTKMKTIWQDLSDHHPITSCPCSGLKPFLDHLDSEYVMTFLM
        ILNS+SKEIAASI+Y GS+K IWDELR RFKQSNGPSIYQL K+ VTLRQG +++ TYYTK+KTIWQ+L+++     C C GLKPF+DHL+SEY+M FLM
Subjt:  ILNSISKEIAASIVYTGSVKAIWDELRDRFKQSNGPSIYQLCKDLVTLRQGTMSVGTYYTKMKTIWQDLSDHHPITSCPCSGLKPFLDHLDSEYVMTFLM

Query:  GLNESYSAIRAQILLMKPLPSRGA------SKNRKTDKPNRTKPIQWFG-----------------SRPICSHCGIRGHVVDRCYKLHGYPLGYKFRSSP
        GLN+SY+A+RAQILLM+PLPS          + ++      T PI                      RP CS+CGI+GH+ D+CYK HGYP GYK R+S 
Subjt:  GLNESYSAIRAQILLMKPLPSRGA------SKNRKTDKPNRTKPIQWFG-----------------SRPICSHCGIRGHVVDRCYKLHGYPLGYKFRSSP

Query:  TSDASANPATPKPLTAAN---SAVSTNPDFFSSLNSSQYSQLMDLHLLSSHLQAAKTDPITVATVVSHVTGICSMASPSVTNLDDCWIIDSGASRHICHT
         +  +  P T K    AN   +A + +PDFFSSLNS QYSQLM   LL++HLQAA T PIT AT ++H +GI ++ S +  + D+ WIIDSGASRHICH 
Subjt:  TSDASANPATPKPLTAAN---SAVSTNPDFFSSLNSSQYSQLMDLHLLSSHLQAAKTDPITVATVVSHVTGICSMASPSVTNLDDCWIIDSGASRHICHT

Query:  RSAFHNWHSIEPIYVTLPTSHRVLVEYAG-VSVSSSL
        +S F NW     ++V LP  HR+ V+  G + ++ SL
Subjt:  RSAFHNWHSIEPIYVTLPTSHRVLVEYAG-VSVSSSL

KAA8523936.1 hypothetical protein F0562_010359 [Nyssa sinensis]2.0e-6041.41Show/hide
Query:  IVASWILNSISKEIAASIVYTGSVKAIWDELRDRFKQSNGPSIYQLCKDLVTLRQGTMSVGTYYTKMKTIWQDLSDHHPITS---CPCSGLKPFLDHLDS
        IV SWILNS+SKEI+ASI++  S + IW +LRDRF+Q NGP I+QL ++L+ LRQ   SV  Y+TK+KTIW++LS+  P  S   C C G+K   DH   
Subjt:  IVASWILNSISKEIAASIVYTGSVKAIWDELRDRFKQSNGPSIYQLCKDLVTLRQGTMSVGTYYTKMKTIWQDLSDHHPITS---CPCSGLKPFLDHLDS

Query:  EYVMTFLMGLNESYSAIRAQILLMKPLP------------SRGASKNRKTDKPNRTKPIQWF---------GS-------------------RPICSHCG
        EY+M+FLMGL++S+S +R Q+LLM P+P             +    N  +D  N T  + +          GS                   +P C+HC 
Subjt:  EYVMTFLMGLNESYSAIRAQILLMKPLP------------SRGASKNRKTDKPNRTKPIQWF---------GS-------------------RPICSHCG

Query:  IRGHVVDRCYKLHGYPLGYKFRSSPTSDASANPATPKP--LTAANSAVSTNPDFFSSLNSSQYSQLMDLHLLSSHLQAA-KTDPITVATVVSHVTGIC-S
        IRGH VDRCYK+HGYP GYKFRS+   +A+A+  +     L  +NS       F  +LNS+QY QLM   +LS+HL ++ K       +  + + GIC S
Subjt:  IRGHVVDRCYKLHGYPLGYKFRSSPTSDASANPATPKP--LTAANSAVSTNPDFFSSLNSSQYSQLMDLHLLSSHLQAA-KTDPITVATVVSHVTGIC-S

Query:  MASPSVTNLDDCWIIDSGASRHICHTRSAFHNWHSIEPIYVTLPTSHRVLVEYAG
        ++   + + +  WI+DSGA+RHIC   SAF + HSI    VTLP   ++LV +AG
Subjt:  MASPSVTNLDDCWIIDSGASRHICHTRSAFHNWHSIEPIYVTLPTSHRVLVEYAG

KYP31881.1 Putative transposon Ty5-1 protein YCL075W family [Cajanus cajan]2.6e-5536.89Show/hide
Query:  NPYFLHHSFGSASVLVSQPLLGAISITHPGAVPCVWSSLGRKNLASSTTRFRNQRKMALCLKPGNATMIY---IVASWILNSISKEIAASIVYTGSVKAI
        NP FLHHS G    L SQPL      T   A   +  +LG KN                   P  A  I+   +V SW+ NS+SKEI  SI++    K I
Subjt:  NPYFLHHSFGSASVLVSQPLLGAISITHPGAVPCVWSSLGRKNLASSTTRFRNQRKMALCLKPGNATMIY---IVASWILNSISKEIAASIVYTGSVKAI

Query:  WDELRDRFKQSNGPSIYQLCKDLVTLRQGTMSVGTYYTKMKTIWQDLSDHHPITSCPCSGLKPFLDHLDSEYVMTFLMGLNESYSAIRAQILLMKPLPSR
        WD+L+ RF + NGP I+QL + L +L+QGT  V TYYTK+K+IW+DLS + P   C C GL+    + D EYVM+FLMGLN+S+S IR QILL  PLP  
Subjt:  WDELRDRFKQSNGPSIYQLCKDLVTLRQGTMSVGTYYTKMKTIWQDLSDHHPITSCPCSGLKPFLDHLDSEYVMTFLMGLNESYSAIRAQILLMKPLPSR

Query:  G-------------------------ASKNRKTDKPNRTKPIQW------FGSRPICSHCGIRGHVVDRCYKLHGYPLGYKFRSSPTSDASANPATPKPL
        G                          S N   D  + TK             RP C++CG+ GH  D+CYKL GYP  Y F++  T  A+    +P+PL
Subjt:  G-------------------------ASKNRKTDKPNRTKPIQW------FGSRPICSHCGIRGHVVDRCYKLHGYPLGYKFRSSPTSDASANPATPKPL

Query:  TAANSAVSTNPDFFSSLNSSQYSQLMDLHLLSSHLQAAKTDPITVATVVSHVTGICSMASPSVTNLDDCWIIDSGASRHICHTRSAFHNWHSIEPIYVTL
                  PD   +L  +Q  QL  ++ L++ ++    D      V ++VTGIC      + N+   W+IDSGA+ HIC  R+ +H++ S+   Y+ L
Subjt:  TAANSAVSTNPDFFSSLNSSQYSQLMDLHLLSSHLQAAKTDPITVATVVSHVTGICSMASPSVTNLDDCWIIDSGASRHICHTRSAFHNWHSIEPIYVTL

Query:  PTSHRVLVEYAG
        P S +V +E  G
Subjt:  PTSHRVLVEYAG

XP_022145891.1 uncharacterized protein LOC111015239 [Momordica charantia]4.6e-6842.93Show/hide
Query:  SSSSSSSISTASIEAQLNPYFLHHSFGSASVLVSQPLLGAISI-THPGAVPCVWSSLGRKNLASSTTRFRNQRKMALCLKPGNATMIYIVASWILNSISK
        S  +SS+  T +IE+QLNPY +HHS    ++LV+Q LLGA +  +   ++    S   +      T +  N   +A   K  N     I+ SWI+NS+SK
Subjt:  SSSSSSSISTASIEAQLNPYFLHHSFGSASVLVSQPLLGAISI-THPGAVPCVWSSLGRKNLASSTTRFRNQRKMALCLKPGNATMIYIVASWILNSISK

Query:  EIAASIVYTGSVKAIWDELRDRFKQSNGPSIYQLCKDLVTLRQGTMSVGTYYTKMKTIWQDLSDHHPITSCPCSGLKPFLDHLDSEYVMTFLMGLNESYS
        EIAASI+YTGS K IWDEL++RF+QS+ P I+QL K+LVT  QGT+S+  YYTK+KT+WQ+L+D+ P   C CSGLK   +   SEYVMTFLMGLNESY+
Subjt:  EIAASIVYTGSVKAIWDELRDRFKQSNGPSIYQLCKDLVTLRQGTMSVGTYYTKMKTIWQDLSDHHPITSCPCSGLKPFLDHLDSEYVMTFLMGLNESYS

Query:  AIRAQILLMKPLPSRG-------------------------ASKNRKTDKPNRTKPIQWFGSRPICSHCGIRGHVVDRCYKLHGYPLGYKFR--------
         IRAQILLM P+P                            A    +  K N     +   +R  C+HCG+RGHV+D+CYKLHGYP GY+          
Subjt:  AIRAQILLMKPLPSRG-------------------------ASKNRKTDKPNRTKPIQWFGSRPICSHCGIRGHVVDRCYKLHGYPLGYKFR--------

Query:  ------SSPTSDASANPATPKPLTAANSAV-----STNPDFFSSLNSSQYSQLMDLHLLSSHLQAAKTDPITVATVVSHVTG
              +S ++   AN  + K +   +S       +++P FF+SLNSSQYSQLM+  +L SHLQAAK + I   T ++HV G
Subjt:  ------SSPTSDASANPATPKPLTAANSAV-----STNPDFFSSLNSSQYSQLMDLHLLSSHLQAAKTDPITVATVVSHVTG

XP_022856063.1 uncharacterized protein LOC111377235, partial [Olea europaea var. sylvestris]2.8e-5741.08Show/hide
Query:  IVASWILNSISKEIAASIVYTGSVKAIWDELRDRFKQSNGPSIYQLCKDLVTLRQGTMSVGTYYTKMKTIWQDLSDHHPITS---CPCSGLKPFLDHLDS
        IV SWILNS+SKEI+AS++Y+ S   IW +L++RF+Q NGP I+QL ++L+ L QG +SVG Y+TK+KTIW++LS++ PI S   C C   K   +H   
Subjt:  IVASWILNSISKEIAASIVYTGSVKAIWDELRDRFKQSNGPSIYQLCKDLVTLRQGTMSVGTYYTKMKTIWQDLSDHHPITS---CPCSGLKPFLDHLDS

Query:  EYVMTFLMGLNESYSAIRAQILLMKPLPS--------------RGASKN-------------RKTDKPNRTKPIQWF--GSRPICSHCGIRGHVVDRCYK
        EYVM+FLMGLN++++  R Q+LLM P+PS              R  S N              K D    +     F    +PIC++C + GH VD+CYK
Subjt:  EYVMTFLMGLNESYSAIRAQILLMKPLPS--------------RGASKN-------------RKTDKPNRTKPIQWF--GSRPICSHCGIRGHVVDRCYK

Query:  LHGYPLGYKF--RSSPTSDASANPATPKPLTAANSAVSTN---------PDFFSSLNSSQYSQLMDLHLLSSHLQAAKTDPITVATVVSHVTGIC-SMAS
        LHGYP GYK   RS+ T      P       AAN  V  N          DF  SLN +QY QLM  ++L +HL +AKT+        + V+G C S+  
Subjt:  LHGYPLGYKF--RSSPTSDASANPATPKPLTAANSAVSTN---------PDFFSSLNSSQYSQLMDLHLLSSHLQAAKTDPITVATVVSHVTGIC-SMAS

Query:  PSVTNLDDCWIIDSGASRHICHTRSAFHNWHSIEPIYVTLPTSHRVLVEYAGV
            N    W++DSGA+ HIC ++SAF++   IE  YVTLP   R+ V + G+
Subjt:  PSVTNLDDCWIIDSGASRHICHTRSAFHNWHSIEPIYVTLPTSHRVLVEYAGV

TrEMBL top hitse value%identityAlignment
A0A151QNL0 Putative transposon Ty5-1 protein YCL075W family1.3e-5536.89Show/hide
Query:  NPYFLHHSFGSASVLVSQPLLGAISITHPGAVPCVWSSLGRKNLASSTTRFRNQRKMALCLKPGNATMIY---IVASWILNSISKEIAASIVYTGSVKAI
        NP FLHHS G    L SQPL      T   A   +  +LG KN                   P  A  I+   +V SW+ NS+SKEI  SI++    K I
Subjt:  NPYFLHHSFGSASVLVSQPLLGAISITHPGAVPCVWSSLGRKNLASSTTRFRNQRKMALCLKPGNATMIY---IVASWILNSISKEIAASIVYTGSVKAI

Query:  WDELRDRFKQSNGPSIYQLCKDLVTLRQGTMSVGTYYTKMKTIWQDLSDHHPITSCPCSGLKPFLDHLDSEYVMTFLMGLNESYSAIRAQILLMKPLPSR
        WD+L+ RF + NGP I+QL + L +L+QGT  V TYYTK+K+IW+DLS + P   C C GL+    + D EYVM+FLMGLN+S+S IR QILL  PLP  
Subjt:  WDELRDRFKQSNGPSIYQLCKDLVTLRQGTMSVGTYYTKMKTIWQDLSDHHPITSCPCSGLKPFLDHLDSEYVMTFLMGLNESYSAIRAQILLMKPLPSR

Query:  G-------------------------ASKNRKTDKPNRTKPIQW------FGSRPICSHCGIRGHVVDRCYKLHGYPLGYKFRSSPTSDASANPATPKPL
        G                          S N   D  + TK             RP C++CG+ GH  D+CYKL GYP  Y F++  T  A+    +P+PL
Subjt:  G-------------------------ASKNRKTDKPNRTKPIQW------FGSRPICSHCGIRGHVVDRCYKLHGYPLGYKFRSSPTSDASANPATPKPL

Query:  TAANSAVSTNPDFFSSLNSSQYSQLMDLHLLSSHLQAAKTDPITVATVVSHVTGICSMASPSVTNLDDCWIIDSGASRHICHTRSAFHNWHSIEPIYVTL
                  PD   +L  +Q  QL  ++ L++ ++    D      V ++VTGIC      + N+   W+IDSGA+ HIC  R+ +H++ S+   Y+ L
Subjt:  TAANSAVSTNPDFFSSLNSSQYSQLMDLHLLSSHLQAAKTDPITVATVVSHVTGICSMASPSVTNLDDCWIIDSGASRHICHTRSAFHNWHSIEPIYVTL

Query:  PTSHRVLVEYAG
        P S +V +E  G
Subjt:  PTSHRVLVEYAG

A0A438KJI9 Retrovirus-related Pol polyprotein from transposon TNT 1-949.0e-5435.06Show/hide
Query:  STASIEAQLNPYFLHHSFGSASVLVSQPLLGAISITHPGAVPCVWSSLGRKNL--ASSTTRFRNQRKMALCLKPGNATMIYIVASWILNSISKEIAASIV
        S +S+E   +PYFLH+S     VLVS  L GA   T   A+    ++  + +    S      +       ++  N     +V SWILNS+ K+IA S++
Subjt:  STASIEAQLNPYFLHHSFGSASVLVSQPLLGAISITHPGAVPCVWSSLGRKNL--ASSTTRFRNQRKMALCLKPGNATMIYIVASWILNSISKEIAASIV

Query:  YTGSVKAIWDELRDRFKQSNGPSIYQLCKDLVTLRQGTMSVGTYYTKMKTIWQDLSDHHPITSCPCSGLKPFLDHLDSEYVMTFLMGLNESYSAIRAQIL
        Y  +   IW++LRDRF+QSNGP I+Q+ K L+ L QG++ V TYYT++K +W +L    P+  C C  +K +++    EYVM FLMGLNES+   R+QIL
Subjt:  YTGSVKAIWDELRDRFKQSNGPSIYQLCKDLVTLRQGTMSVGTYYTKMKTIWQDLSDHHPITSCPCSGLKPFLDHLDSEYVMTFLMGLNESYSAIRAQIL

Query:  LMKPLP---------------------------SRGASKNRKTDKPN----RTKPIQWFGSRPICSHCGIRGHVVDRCYKLHGYPLGYKFRS-SPTSDAS
        +M+PLP                           S  A+ +  T   +     +KP +    RP CSHCGI GH VD+CYKL+GYP GYKF+S +P +   
Subjt:  LMKPLP---------------------------SRGASKNRKTDKPN----RTKPIQWFGSRPICSHCGIRGHVVDRCYKLHGYPLGYKFRS-SPTSDAS

Query:  ANPATPKPLTAANSAVSTNPDFFSSLNSSQYSQLMDLHLLSSHLQ---AAKTDPITVATVVSHVTGICSMASPSVTNLDD--CWIIDSGASRHICHTRSA
        AN  + +  T   SA + +P   +SL+ +Q  QL  + LLSS L     A  +       VS  +GI S++S S  N  D   W++D G + H+C +  +
Subjt:  ANPATPKPLTAANSAVSTNPDFFSSLNSSQYSQLMDLHLLSSHLQ---AAKTDPITVATVVSHVTGICSMASPSVTNLDD--CWIIDSGASRHICHTRSA

Query:  FHNWHSIEPIYVTLPTSHRVLVEYAG-VSVSSSL----DKRLLTMIDGAKCYHGLYILSDST
        F +        VTLP  H V +   G V +S  +    D     MI   K +  LY+L  S+
Subjt:  FHNWHSIEPIYVTLPTSHRVLVEYAG-VSVSSSL----DKRLLTMIDGAKCYHGLYILSDST

A0A5A7VE66 Cysteine-rich RLK (Receptor-like protein kinase) 87.8e-9847.83Show/hide
Query:  GSSSSSSSISTASIEAQLNPYFLHHSFGSASVLVSQPLLGAISITHPGAVPCVWSSLGRKNLASSTTRFRNQRKMALCLKPGNATMI-------YIVASW
        GS     + + +  +AQLNPYF+HHS G  + +V+QPL GAI+ T        WS   R  L + + R +         KP +  ++        I+ASW
Subjt:  GSSSSSSSISTASIEAQLNPYFLHHSFGSASVLVSQPLLGAISITHPGAVPCVWSSLGRKNLASSTTRFRNQRKMALCLKPGNATMI-------YIVASW

Query:  ILNSISKEIAASIVYTGSVKAIWDELRDRFKQSNGPSIYQLCKDLVTLRQGTMSVGTYYTKMKTIWQDLSDHHPITSCPCSGLKPFLDHLDSEYVMTFLM
        ILNS+SKEIAASI+Y GS+K IWDELR RFKQSNGPSIYQL K+ VTLRQG +++ TYYTK+KTIWQ+L+++     C C GLKPF+DHL+SEY+M FLM
Subjt:  ILNSISKEIAASIVYTGSVKAIWDELRDRFKQSNGPSIYQLCKDLVTLRQGTMSVGTYYTKMKTIWQDLSDHHPITSCPCSGLKPFLDHLDSEYVMTFLM

Query:  GLNESYSAIRAQILLMKPLPSRGA------SKNRKTDKPNRTKPIQWFG-----------------SRPICSHCGIRGHVVDRCYKLHGYPLGYKFRSSP
        GLN+SY+A+RAQILLM+PLPS          + ++      T PI                      RP CS+CGI+GH+ D+CYK HGYP GYK R+S 
Subjt:  GLNESYSAIRAQILLMKPLPSRGA------SKNRKTDKPNRTKPIQWFG-----------------SRPICSHCGIRGHVVDRCYKLHGYPLGYKFRSSP

Query:  TSDASANPATPKPLTAAN---SAVSTNPDFFSSLNSSQYSQLMDLHLLSSHLQAAKTDPITVATVVSHVTGICSMASPSVTNLDDCWIIDSGASRHICHT
         +  +  P T K    AN   +A + +PDFFSSLNS QYSQLM   LL++HLQAA T PIT AT ++H +GI ++ S +  + D+ WIIDSGASRHICH 
Subjt:  TSDASANPATPKPLTAAN---SAVSTNPDFFSSLNSSQYSQLMDLHLLSSHLQAAKTDPITVATVVSHVTGICSMASPSVTNLDDCWIIDSGASRHICHT

Query:  RSAFHNWHSIEPIYVTLPTSHRVLVEYAG-VSVSSSL
        +S F NW     ++V LP  HR+ V+  G + ++ SL
Subjt:  RSAFHNWHSIEPIYVTLPTSHRVLVEYAG-VSVSSSL

A0A5J5A1K4 Retrotrans_gag domain-containing protein9.9e-6141.41Show/hide
Query:  IVASWILNSISKEIAASIVYTGSVKAIWDELRDRFKQSNGPSIYQLCKDLVTLRQGTMSVGTYYTKMKTIWQDLSDHHPITS---CPCSGLKPFLDHLDS
        IV SWILNS+SKEI+ASI++  S + IW +LRDRF+Q NGP I+QL ++L+ LRQ   SV  Y+TK+KTIW++LS+  P  S   C C G+K   DH   
Subjt:  IVASWILNSISKEIAASIVYTGSVKAIWDELRDRFKQSNGPSIYQLCKDLVTLRQGTMSVGTYYTKMKTIWQDLSDHHPITS---CPCSGLKPFLDHLDS

Query:  EYVMTFLMGLNESYSAIRAQILLMKPLP------------SRGASKNRKTDKPNRTKPIQWF---------GS-------------------RPICSHCG
        EY+M+FLMGL++S+S +R Q+LLM P+P             +    N  +D  N T  + +          GS                   +P C+HC 
Subjt:  EYVMTFLMGLNESYSAIRAQILLMKPLP------------SRGASKNRKTDKPNRTKPIQWF---------GS-------------------RPICSHCG

Query:  IRGHVVDRCYKLHGYPLGYKFRSSPTSDASANPATPKP--LTAANSAVSTNPDFFSSLNSSQYSQLMDLHLLSSHLQAA-KTDPITVATVVSHVTGIC-S
        IRGH VDRCYK+HGYP GYKFRS+   +A+A+  +     L  +NS       F  +LNS+QY QLM   +LS+HL ++ K       +  + + GIC S
Subjt:  IRGHVVDRCYKLHGYPLGYKFRSSPTSDASANPATPKP--LTAANSAVSTNPDFFSSLNSSQYSQLMDLHLLSSHLQAA-KTDPITVATVVSHVTGIC-S

Query:  MASPSVTNLDDCWIIDSGASRHICHTRSAFHNWHSIEPIYVTLPTSHRVLVEYAG
        ++   + + +  WI+DSGA+RHIC   SAF + HSI    VTLP   ++LV +AG
Subjt:  MASPSVTNLDDCWIIDSGASRHICHTRSAFHNWHSIEPIYVTLPTSHRVLVEYAG

A0A6J1CXR2 uncharacterized protein LOC1110152392.2e-6842.93Show/hide
Query:  SSSSSSSISTASIEAQLNPYFLHHSFGSASVLVSQPLLGAISI-THPGAVPCVWSSLGRKNLASSTTRFRNQRKMALCLKPGNATMIYIVASWILNSISK
        S  +SS+  T +IE+QLNPY +HHS    ++LV+Q LLGA +  +   ++    S   +      T +  N   +A   K  N     I+ SWI+NS+SK
Subjt:  SSSSSSSISTASIEAQLNPYFLHHSFGSASVLVSQPLLGAISI-THPGAVPCVWSSLGRKNLASSTTRFRNQRKMALCLKPGNATMIYIVASWILNSISK

Query:  EIAASIVYTGSVKAIWDELRDRFKQSNGPSIYQLCKDLVTLRQGTMSVGTYYTKMKTIWQDLSDHHPITSCPCSGLKPFLDHLDSEYVMTFLMGLNESYS
        EIAASI+YTGS K IWDEL++RF+QS+ P I+QL K+LVT  QGT+S+  YYTK+KT+WQ+L+D+ P   C CSGLK   +   SEYVMTFLMGLNESY+
Subjt:  EIAASIVYTGSVKAIWDELRDRFKQSNGPSIYQLCKDLVTLRQGTMSVGTYYTKMKTIWQDLSDHHPITSCPCSGLKPFLDHLDSEYVMTFLMGLNESYS

Query:  AIRAQILLMKPLPSRG-------------------------ASKNRKTDKPNRTKPIQWFGSRPICSHCGIRGHVVDRCYKLHGYPLGYKFR--------
         IRAQILLM P+P                            A    +  K N     +   +R  C+HCG+RGHV+D+CYKLHGYP GY+          
Subjt:  AIRAQILLMKPLPSRG-------------------------ASKNRKTDKPNRTKPIQWFGSRPICSHCGIRGHVVDRCYKLHGYPLGYKFR--------

Query:  ------SSPTSDASANPATPKPLTAANSAV-----STNPDFFSSLNSSQYSQLMDLHLLSSHLQAAKTDPITVATVVSHVTG
              +S ++   AN  + K +   +S       +++P FF+SLNSSQYSQLM+  +L SHLQAAK + I   T ++HV G
Subjt:  ------SSPTSDASANPATPKPLTAANSAV-----STNPDFFSSLNSSQYSQLMDLHLLSSHLQAAKTDPITVATVVSHVTG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).5.1e-1734.53Show/hide
Query:  NATMIYIVASWILNSISKEIAASIVYTGSVKAIWDELRDRFKQSNGPSIYQLCKDLVTLRQGTMSVGTYYTKMKTIWQDLSDHHPITSCPCSG-----LK
        NA ++Y    W++NS++ ++  S++Y  +   +W++LR  F       IYQL + L TLRQG  SV  Y+ K+  +W +LS++ PI  C C G      K
Subjt:  NATMIYIVASWILNSISKEIAASIVYTGSVKAIWDELRDRFKQSNGPSIYQLCKDLVTLRQGTMSVGTYYTKMKTIWQDLSDHHPITSCPCSG-----LK

Query:  PFLDHLDSEYVMTFLMG--LNESYSAIRAQILLMKPLPS
           +  + E    FLMG  LN+ + A+  +I+  KP PS
Subjt:  PFLDHLDSEYVMTFLMG--LNESYSAIRAQILLMKPLPS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGACAAGGTTTCAGGAAGCAACACAGGACCCGGAAGTTCCAGTTCATCTTCTTCGATTTCAACTGCATCGATTGAGGCTCAGTTGAACCCCTACTTTCTTCATCA
TTCCTTTGGTTCCGCTTCGGTTCTTGTCTCCCAGCCATTACTTGGTGCAATATCAATTACACATCCTGGAGCCGTGCCATGCGTATGGTCATCTCTGGGAAGAAAAAACT
TGGCTTCATCAACGACAAGATTTCGAAACCAGAGGAAGATGGCTCTCTGCTTGAAGCCTGGGAATGCAACAATGATATATATAGTAGCATCTTGGATTCTCAATTCCATT
TCCAAGGAGATTGCCGCTAGCATTGTTTACACTGGATCTGTCAAGGCTATTTGGGATGAACTTCGTGATCGATTCAAACAGTCCAATGGCCCGAGTATCTATCAGCTTTG
CAAGGATTTGGTGACTTTACGTCAAGGTACTATGTCCGTGGGGACATATTACACTAAAATGAAGACCATTTGGCAAGATCTCAGTGATCATCATCCTATTACCAGTTGCC
CTTGCAGCGGGTTGAAGCCTTTTCTTGATCATCTGGATTCTGAGTATGTGATGACGTTCCTAATGGGATTAAATGAATCCTACTCTGCAATTAGGGCACAAATCCTCCTG
ATGAAGCCGCTTCCATCTAGGGGTGCAAGCAAAAACCGAAAAACCGACAAACCGAACCGAACCAAACCGATCCAATGGTTTGGTTCGAGACCCATTTGTTCGCATTGTGG
CATTCGTGGTCACGTGGTGGATCGCTGTTACAAGCTGCACGGATATCCCCTGGGATATAAATTTCGATCCTCACCCACATCTGATGCTTCGGCCAATCCTGCTACTCCGA
AGCCCCTCACTGCTGCGAATTCTGCTGTTTCTACCAATCCTGATTTCTTTTCAAGCCTCAATTCTTCGCAATACTCACAATTGATGGACTTGCACTTGCTTAGTTCTCAC
CTTCAAGCCGCGAAGACGGATCCCATTACCGTAGCAACTGTTGTTTCTCACGTCACAGGTATTTGTTCTATGGCTTCCCCCTCTGTTACCAATCTTGATGATTGTTGGAT
TATAGACTCTGGGGCTTCTCGCCATATTTGCCACACTCGATCTGCATTTCATAACTGGCATAGCATTGAACCTATTTATGTCACACTTCCTACATCGCATAGAGTTCTCG
TTGAATATGCTGGGGTGTCAGTTTCTAGTTCTCTGGACAAACGACTCTTGACAATGATTGACGGGGCTAAGTGTTATCACGGTCTTTACATACTTTCTGACTCTACCGCT
TCTGTGATTACACATTCTACCTGGAAATCGAGCTGTGAACGCGATTTCGCCGCCGCGATTTCGTTAGATCTGGAACCAACACGGTCGTTCACGCGATTTCGCCGCCACTA
CTTCGTTAGATCTGCGACCAAAACGGTCATTCACGCGATTTTGCCGTCGCGATTTCATTCAGATCTGGAACCAACACGACCGTTCACACGATTTCGCAGCCGCGATTTTG
TTAGATATGGAAACAACTCGCGATTTCGCGCGCTCGAAGGGGGTGAGGCCGAAAGACTACGAATGGGCGACGAACTTCAGTCTGAACGGGGGGCGAACGGACTACGAACG
GCGGCGAACATAGTTGCAGAGAGAGTGAAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGGACAAGGTTTCAGGAAGCAACACAGGACCCGGAAGTTCCAGTTCATCTTCTTCGATTTCAACTGCATCGATTGAGGCTCAGTTGAACCCCTACTTTCTTCATCA
TTCCTTTGGTTCCGCTTCGGTTCTTGTCTCCCAGCCATTACTTGGTGCAATATCAATTACACATCCTGGAGCCGTGCCATGCGTATGGTCATCTCTGGGAAGAAAAAACT
TGGCTTCATCAACGACAAGATTTCGAAACCAGAGGAAGATGGCTCTCTGCTTGAAGCCTGGGAATGCAACAATGATATATATAGTAGCATCTTGGATTCTCAATTCCATT
TCCAAGGAGATTGCCGCTAGCATTGTTTACACTGGATCTGTCAAGGCTATTTGGGATGAACTTCGTGATCGATTCAAACAGTCCAATGGCCCGAGTATCTATCAGCTTTG
CAAGGATTTGGTGACTTTACGTCAAGGTACTATGTCCGTGGGGACATATTACACTAAAATGAAGACCATTTGGCAAGATCTCAGTGATCATCATCCTATTACCAGTTGCC
CTTGCAGCGGGTTGAAGCCTTTTCTTGATCATCTGGATTCTGAGTATGTGATGACGTTCCTAATGGGATTAAATGAATCCTACTCTGCAATTAGGGCACAAATCCTCCTG
ATGAAGCCGCTTCCATCTAGGGGTGCAAGCAAAAACCGAAAAACCGACAAACCGAACCGAACCAAACCGATCCAATGGTTTGGTTCGAGACCCATTTGTTCGCATTGTGG
CATTCGTGGTCACGTGGTGGATCGCTGTTACAAGCTGCACGGATATCCCCTGGGATATAAATTTCGATCCTCACCCACATCTGATGCTTCGGCCAATCCTGCTACTCCGA
AGCCCCTCACTGCTGCGAATTCTGCTGTTTCTACCAATCCTGATTTCTTTTCAAGCCTCAATTCTTCGCAATACTCACAATTGATGGACTTGCACTTGCTTAGTTCTCAC
CTTCAAGCCGCGAAGACGGATCCCATTACCGTAGCAACTGTTGTTTCTCACGTCACAGGTATTTGTTCTATGGCTTCCCCCTCTGTTACCAATCTTGATGATTGTTGGAT
TATAGACTCTGGGGCTTCTCGCCATATTTGCCACACTCGATCTGCATTTCATAACTGGCATAGCATTGAACCTATTTATGTCACACTTCCTACATCGCATAGAGTTCTCG
TTGAATATGCTGGGGTGTCAGTTTCTAGTTCTCTGGACAAACGACTCTTGACAATGATTGACGGGGCTAAGTGTTATCACGGTCTTTACATACTTTCTGACTCTACCGCT
TCTGTGATTACACATTCTACCTGGAAATCGAGCTGTGAACGCGATTTCGCCGCCGCGATTTCGTTAGATCTGGAACCAACACGGTCGTTCACGCGATTTCGCCGCCACTA
CTTCGTTAGATCTGCGACCAAAACGGTCATTCACGCGATTTTGCCGTCGCGATTTCATTCAGATCTGGAACCAACACGACCGTTCACACGATTTCGCAGCCGCGATTTTG
TTAGATATGGAAACAACTCGCGATTTCGCGCGCTCGAAGGGGGTGAGGCCGAAAGACTACGAATGGGCGACGAACTTCAGTCTGAACGGGGGGCGAACGGACTACGAACG
GCGGCGAACATAGTTGCAGAGAGAGTGAAGTGA
Protein sequenceShow/hide protein sequence
MADKVSGSNTGPGSSSSSSSISTASIEAQLNPYFLHHSFGSASVLVSQPLLGAISITHPGAVPCVWSSLGRKNLASSTTRFRNQRKMALCLKPGNATMIYIVASWILNSI
SKEIAASIVYTGSVKAIWDELRDRFKQSNGPSIYQLCKDLVTLRQGTMSVGTYYTKMKTIWQDLSDHHPITSCPCSGLKPFLDHLDSEYVMTFLMGLNESYSAIRAQILL
MKPLPSRGASKNRKTDKPNRTKPIQWFGSRPICSHCGIRGHVVDRCYKLHGYPLGYKFRSSPTSDASANPATPKPLTAANSAVSTNPDFFSSLNSSQYSQLMDLHLLSSH
LQAAKTDPITVATVVSHVTGICSMASPSVTNLDDCWIIDSGASRHICHTRSAFHNWHSIEPIYVTLPTSHRVLVEYAGVSVSSSLDKRLLTMIDGAKCYHGLYILSDSTA
SVITHSTWKSSCERDFAAAISLDLEPTRSFTRFRRHYFVRSATKTVIHAILPSRFHSDLEPTRPFTRFRSRDFVRYGNNSRFRALEGGEAERLRMGDELQSERGANGLRT
AANIVAERVK