; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc07G06250 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc07G06250
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionIntegrase catalytic domain-containing protein
Genome locationClcChr07:11387140..11394868
RNA-Seq ExpressionClc07G06250
SyntenyClc07G06250
Gene Ontology termsGO:0016020 - membrane (cellular component)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR006564 - Zinc finger, PMZ-type
IPR007527 - Zinc finger, SWIM-type
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KZV57287.1 hypothetical protein F511_33476, partial [Dorcoceras hygrometricum]1.8e-3834.51Show/hide
Query:  WSHAMILGLTIENKLGFVDGTL--PRQDGDMKNSWIICNSVVTTWLLNSLLKEISTSVSISDSARDIWLDLQQRYQRQRLKFHARAIPVVFFGYLPGIKG
        WS AMI+ LT +NKLGF+D ++  PR D  +  SW  CNS+V +W+LNS+ ++I+ S+    +AR++W+D+  R       FH    P            
Subjt:  WSHAMILGLTIENKLGFVDGTL--PRQDGDMKNSWIICNSVVTTWLLNSLLKEISTSVSISDSARDIWLDLQQRYQRQRLKFHARAIPVVFFGYLPGIKG

Query:  YRLYDIGKQHMFISRDVTFHEDMFRFHTIIVQDEVMDNVSNMVLPKSSSSALPNFSIVDCSLNATIEPKDHVHVDIDNSIGYADNVDCSNDNTDVPTENP
         R+Y I K    + +      D+  ++T                       LP  +I  C L+                                     
Subjt:  YRLYDIGKQHMFISRDVTFHEDMFRFHTIIVQDEVMDNVSNMVLPKSSSSALPNFSIVDCSLNATIEPKDHVHVDIDNSIGYADNVDCSNDNTDVPTENP

Query:  TIGMNIYGLKQASRQWFDKFSNASLLLGFSQSKSDYSLFTRVSGSHFLVLLVYVDGIIITGASVHDITKLSIRLDKTFKLKDLGVLRYFLGLELARPSKG
            ++YGLKQASRQWF KFS+  L +GFSQS +  SLF +V     LVLLVYVD I+I        ++L + LD  FKLKDLG L+YFLG+E+AR S+G
Subjt:  TIGMNIYGLKQASRQWFDKFSNASLLLGFSQSKSDYSLFTRVSGSHFLVLLVYVDGIIITGASVHDITKLSIRLDKTFKLKDLGVLRYFLGLELARPSKG

Query:  IFLSQRHYTLQLLEDTCFLSAKSCSLPLDPNLKLQYDKG
        I + QR+Y + LL +   L  K  S P+D N+KL  D G
Subjt:  IFLSQRHYTLQLLEDTCFLSAKSCSLPLDPNLKLQYDKG

XP_022132680.1 uncharacterized protein LOC111005480 [Momordica charantia]2.9e-3642.11Show/hide
Query:  VDIDNSIGYADNVDCSNDNTDVPTENPTIGMNIYGLKQASRQWFDKFSNASLLLGFSQSKSDYSLFTRVSGSHFLVLLVYVDGIIITGASVHDITKLSIR
        V +D  +GY  N+  S     V      +  ++YGLKQASRQWFDKFS A L L F QSKSDYSLFTR +G  F+ LLVYVD II+T AS   + +L   
Subjt:  VDIDNSIGYADNVDCSNDNTDVPTENPTIGMNIYGLKQASRQWFDKFSNASLLLGFSQSKSDYSLFTRVSGSHFLVLLVYVDGIIITGASVHDITKLSIR

Query:  LDKTFKLKDLGVLRYFLGLELARPSKGIFLSQRHYTLQLLEDTCFLSAKSCSLPLDPNLKLQYDKGSNI-VPILYGGNFNEL---------DNFVNYKVA
        LD  FKLKDLG LRYFLGLELAR S GI LSQR+Y L L ED+  L++K  +LP+DP +KL    G  +  P  Y      L           FV +K++
Subjt:  LDKTFKLKDLGVLRYFLGLELARPSKGIFLSQRHYTLQLLEDTCFLSAKSCSLPLDPNLKLQYDKGSNI-VPILYGGNFNEL---------DNFVNYKVA

Query:  EIALYSGMSFAELQQL-------VFDK---TYRIINFENVDIYACIGSKDFIKKDVIISKDNDVKW
        +   +   +F  LQ L       +F K   ++ +  F + D+ +C+ S+       I   D+ V W
Subjt:  EIALYSGMSFAELQQL-------VFDK---TYRIINFENVDIYACIGSKDFIKKDVIISKDNDVKW

XP_022154974.1 uncharacterized protein LOC111022118 [Momordica charantia]9.8e-3765.91Show/hide
Query:  NIYGLKQASRQWFDKFSNASLLLGFSQSKSDYSLFTRVSGSHFLVLLVYVDGIIITGASVHDITKLSIRLDKTFKLKDLGVLRYFLGLELARPSKGIFLS
        ++YGLKQASRQWF KFS+  L LGFSQSKSDYSLF R SGS F+ LLVYVD IIITGAS+  I  L   L+  FKLKDLG L+YFL L+LAR S GI +S
Subjt:  NIYGLKQASRQWFDKFSNASLLLGFSQSKSDYSLFTRVSGSHFLVLLVYVDGIIITGASVHDITKLSIRLDKTFKLKDLGVLRYFLGLELARPSKGIFLS

Query:  QRHYTLQLLEDTCFLSAKSCSLPLDPNLKLQY
        QR+YTLQLLED  FL+ K   +P+DP LKL++
Subjt:  QRHYTLQLLEDTCFLSAKSCSLPLDPNLKLQY

XP_022887164.1 uncharacterized protein LOC111403045 [Olea europaea var. sylvestris]1.7e-3655.83Show/hide
Query:  IGYADNVDCSNDNTDVPTENPTIGMNIYGLKQASRQWFDKFSNASLLLGFSQSKSDYSLFTRVSGSHFLVLLVYVDGIIITGASVHDITKLSIRLDKTFK
        +GY+ + D +     V      +  +IYGLKQASRQWF KFS   +L GF QSKSDYSLFT+ SG+ F+VLLVYVD IIITG +++ I  L   L   FK
Subjt:  IGYADNVDCSNDNTDVPTENPTIGMNIYGLKQASRQWFDKFSNASLLLGFSQSKSDYSLFTRVSGSHFLVLLVYVDGIIITGASVHDITKLSIRLDKTFK

Query:  LKDLGVLRYFLGLELARPSKGIFLSQRHYTLQLLEDTCFLSAKSCSLPLDPNLKLQYDKGSNI
        LKDLGVL+YFLGLE+AR   GIFLSQR+YTLQLLED  FL  K  +LP++P  +L   +G  I
Subjt:  LKDLGVLRYFLGLELARPSKGIFLSQRHYTLQLLEDTCFLSAKSCSLPLDPNLKLQYDKGSNI

XP_022899321.1 uncharacterized protein LOC111412620 [Olea europaea var. sylvestris]3.4e-3763.31Show/hide
Query:  NIYGLKQASRQWFDKFSNASLLLGFSQSKSDYSLFTRVSGSHFLVLLVYVDGIIITGASVHDITKLSIRLDKTFKLKDLGVLRYFLGLELARPSKGIFLS
        +IYGLKQASRQWF KFS+A +  GF+QSKSDYSLFT+ +G+ F+ LLVYVD I+IT +S  +I +L   L   FKLKDLG L+YFL LE+AR  KGIFLS
Subjt:  NIYGLKQASRQWFDKFSNASLLLGFSQSKSDYSLFTRVSGSHFLVLLVYVDGIIITGASVHDITKLSIRLDKTFKLKDLGVLRYFLGLELARPSKGIFLS

Query:  QRHYTLQLLEDTCFLSAKSCSLPLDPNLKLQYDKGSNIV
        QR YTLQLLEDT FL++K  +LP+DP LKL   +G  IV
Subjt:  QRHYTLQLLEDTCFLSAKSCSLPLDPNLKLQYDKGSNIV

TrEMBL top hitse value%identityAlignment
A0A2N9HKN6 Uncharacterized protein1.1e-3632.72Show/hide
Query:  QSSTQSSLLDQYLNPYFLHHSNSTNLILVLDLL-----MSWSHAMILGLTIENKLGFVDGTL--PRQDGD-MKNSWIICNSVVTTWLLNSLLKEISTSVS
        Q++T S  ++   +PY+L++ +   + +V D L      SW  +M   L+ +NKLGFV+GT+  P  + D +   W  CN +V +W+ N L ++I ++V 
Subjt:  QSSTQSSLLDQYLNPYFLHHSNSTNLILVLDLL-----MSWSHAMILGLTIENKLGFVDGTL--PRQDGD-MKNSWIICNSVVTTWLLNSLLKEISTSVS

Query:  ISDSARDIWLDLQQRYQRQR--LKFHARAIPVVFFGYLPGIKGYRLYDIGKQHMFISRDVTFHEDMFRFHTIIVQDEVMDNVSNMVLPKSSSSALPNFSI
           +A+++W DLQQRY +       H + IP    G              K    +SR             I++  +  D V + ++    S A     I
Subjt:  ISDSARDIWLDLQQRYQRQR--LKFHARAIPVVFFGYLPGIKGYRLYDIGKQHMFISRDVTFHEDMFRFHTIIVQDEVMDNVSNMVLPKSSSSALPNFSI

Query:  VDCSLNATIEPKDHVHVDIDNSIGYADNVDCSNDNTDVPTENPTIGMNIYGLKQASRQWFDKFSNASLLLGFSQSKSDYSLFTRVSGSHFLVLLVYVDGI
        +   +         +   +DNS+         N     P    T G        AS QWF KFS+  +  GF QS SDYSLFTR  GS F+ LLVYVD I
Subjt:  VDCSLNATIEPKDHVHVDIDNSIGYADNVDCSNDNTDVPTENPTIGMNIYGLKQASRQWFDKFSNASLLLGFSQSKSDYSLFTRVSGSHFLVLLVYVDGI

Query:  IITGASVHDITKLSIRLDKTFKLKDLGVLRYFLGLELARPSKGIFLSQRHYTLQLLEDTCFLSAKSCSLPLDPNLKLQYDKG
        ++       +  L   L   FKLKDLG L++FLGLE+AR +KGI L QR Y L +L D+  L +K  + P++ NLKL    G
Subjt:  IITGASVHDITKLSIRLDKTFKLKDLGVLRYFLGLELARPSKGIFLSQRHYTLQLLEDTCFLSAKSCSLPLDPNLKLQYDKG

A0A2N9J5N4 Uncharacterized protein1.7e-3929.29Show/hide
Query:  DQYLNPYFLHHSNSTNLILVL-----DLLMSWSHAMILGLTIENKLGFVDGTLPR---QDGDMKNSWIICNSVVTTWLLNSLLKEISTSVSISDSARDIW
        D+  N +FLHH +S   +LV      D   +WS +MI+ LT +NK+GF++GT+     Q     N W  CN++V +WLLNS+ KEI++SV  +++A+++W
Subjt:  DQYLNPYFLHHSNSTNLILVL-----DLLMSWSHAMILGLTIENKLGFVDGTLPR---QDGDMKNSWIICNSVVTTWLLNSLLKEISTSVSISDSARDIW

Query:  LDLQQRYQR---QRLKFHARAIPVV---------FFGYLPG----IKGYRLYDI-----------GKQHMFISRDVTFHEDMF-----------------
         DL++R+ +    R+    +AI  +         +F  L      +  YR +              KQH  + + +    D F                 
Subjt:  LDLQQRYQR---QRLKFHARAIPVV---------FFGYLPG----IKGYRLYDI-----------GKQHMFISRDVTFHEDMF-----------------

Query:  RFHTIIVQDEVMDNVSNMVLPKSSSS-------------------------------------------ALPN---------------------------
        +  +++VQ+E   ++    L  S+ S                                           +LP+                           
Subjt:  RFHTIIVQDEVMDNVSNMVLPKSSSS-------------------------------------------ALPN---------------------------

Query:  -----------------------FSIVDCSLN-ATIEPKDHVHVDIDNSI--GYAD-----NVDCSNDNTDVPTENPTI---GMNIYGLKQASRQWFDKF
                               F  V C L  A ++      +D++N+   G  D     ++     +   P  + T+     ++YGLKQASRQWF KF
Subjt:  -----------------------FSIVDCSLN-ATIEPKDHVHVDIDNSI--GYAD-----NVDCSNDNTDVPTENPTI---GMNIYGLKQASRQWFDKF

Query:  SNASLLLGFSQSKSDYSLFTRVSGSHFLVLLVYVDGIIITGASVHDITKLSIRLDKTFKLKDLGVLRYFLGLELARPSKGIFLSQRHYTLQLLEDTCFLS
        S   L  GF+QSK+DYSLFT+  GS F+ LLVYVD I+I   S  D+  L   LD  FKLKDLG +RYFLGLE+AR SKGI +SQR Y L+++ED   L 
Subjt:  SNASLLLGFSQSKSDYSLFTRVSGSHFLVLLVYVDGIIITGASVHDITKLSIRLDKTFKLKDLGVLRYFLGLELARPSKGIFLSQRHYTLQLLEDTCFLS

Query:  AKSCSLPLDPNLKLQYDKG
         K    P+D NLKL   +G
Subjt:  AKSCSLPLDPNLKLQYDKG

A0A2Z7DDR0 Uncharacterized protein (Fragment)8.7e-3934.51Show/hide
Query:  WSHAMILGLTIENKLGFVDGTL--PRQDGDMKNSWIICNSVVTTWLLNSLLKEISTSVSISDSARDIWLDLQQRYQRQRLKFHARAIPVVFFGYLPGIKG
        WS AMI+ LT +NKLGF+D ++  PR D  +  SW  CNS+V +W+LNS+ ++I+ S+    +AR++W+D+  R       FH    P            
Subjt:  WSHAMILGLTIENKLGFVDGTL--PRQDGDMKNSWIICNSVVTTWLLNSLLKEISTSVSISDSARDIWLDLQQRYQRQRLKFHARAIPVVFFGYLPGIKG

Query:  YRLYDIGKQHMFISRDVTFHEDMFRFHTIIVQDEVMDNVSNMVLPKSSSSALPNFSIVDCSLNATIEPKDHVHVDIDNSIGYADNVDCSNDNTDVPTENP
         R+Y I K    + +      D+  ++T                       LP  +I  C L+                                     
Subjt:  YRLYDIGKQHMFISRDVTFHEDMFRFHTIIVQDEVMDNVSNMVLPKSSSSALPNFSIVDCSLNATIEPKDHVHVDIDNSIGYADNVDCSNDNTDVPTENP

Query:  TIGMNIYGLKQASRQWFDKFSNASLLLGFSQSKSDYSLFTRVSGSHFLVLLVYVDGIIITGASVHDITKLSIRLDKTFKLKDLGVLRYFLGLELARPSKG
            ++YGLKQASRQWF KFS+  L +GFSQS +  SLF +V     LVLLVYVD I+I        ++L + LD  FKLKDLG L+YFLG+E+AR S+G
Subjt:  TIGMNIYGLKQASRQWFDKFSNASLLLGFSQSKSDYSLFTRVSGSHFLVLLVYVDGIIITGASVHDITKLSIRLDKTFKLKDLGVLRYFLGLELARPSKG

Query:  IFLSQRHYTLQLLEDTCFLSAKSCSLPLDPNLKLQYDKG
        I + QR+Y + LL +   L  K  S P+D N+KL  D G
Subjt:  IFLSQRHYTLQLLEDTCFLSAKSCSLPLDPNLKLQYDKG

A0A6J1DNU2 uncharacterized protein LOC1110221184.8e-3765.91Show/hide
Query:  NIYGLKQASRQWFDKFSNASLLLGFSQSKSDYSLFTRVSGSHFLVLLVYVDGIIITGASVHDITKLSIRLDKTFKLKDLGVLRYFLGLELARPSKGIFLS
        ++YGLKQASRQWF KFS+  L LGFSQSKSDYSLF R SGS F+ LLVYVD IIITGAS+  I  L   L+  FKLKDLG L+YFL L+LAR S GI +S
Subjt:  NIYGLKQASRQWFDKFSNASLLLGFSQSKSDYSLFTRVSGSHFLVLLVYVDGIIITGASVHDITKLSIRLDKTFKLKDLGVLRYFLGLELARPSKGIFLS

Query:  QRHYTLQLLEDTCFLSAKSCSLPLDPNLKLQY
        QR+YTLQLLED  FL+ K   +P+DP LKL++
Subjt:  QRHYTLQLLEDTCFLSAKSCSLPLDPNLKLQY

A0A803Q579 Uncharacterized protein2.3e-3939.46Show/hide
Query:  RQRLKFHARAIPVVFFGYLPGIKGYRLYDIGKQHMFISRDVTFHEDMFRFHTIIVQDEVMDN----------VSNMVLPKSSSSALPNFSIVDCSLNATI
        + R KF  R+ P +F GY PG+KGY+L D+ K   FISRDV F+E +F F +      + +N           SN   P ++S         D SL+  +
Subjt:  RQRLKFHARAIPVVFFGYLPGIKGYRLYDIGKQHMFISRDVTFHEDMFRFHTIIVQDEVMDN----------VSNMVLPKSSSSALPNFSIVDCSLNATI

Query:  EPKDHV--------HVDIDNSIGYAD---NVDCSNDNTDVPT-ENP-----TIGMNIYGLKQASRQWFDKFSNASLLLGFSQSKSDYSLFTRVSGSHFLV
            HV         +D++N+  + D   +V  S      P  E P      +  + YGLKQASRQWF KFS+A    GF    +D+SLF +    +F+ 
Subjt:  EPKDHV--------HVDIDNSIGYAD---NVDCSNDNTDVPT-ENP-----TIGMNIYGLKQASRQWFDKFSNASLLLGFSQSKSDYSLFTRVSGSHFLV

Query:  LLVYVDGIIITGASVHDITKLSIRLDKTFKLKDLGVLRYFLGLELARPSKGIFLSQRHYTLQLLEDTCFLSAKSCSLPLDPNLKLQYDKGSNIV-PILY
        LLVYVD +I+   ++ ++  L  RL   F+LKDLG LRYFLGLE+AR  KGI +SQR Y LQLLED  +LS+K  S P++ NLKL  D+   +  P LY
Subjt:  LLVYVDGIIITGASVHDITKLSIRLDKTFKLKDLGVLRYFLGLELARPSKGIFLSQRHYTLQLLEDTCFLSAKSCSLPLDPNLKLQYDKGSNIV-PILY

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.0e-0825.66Show/hide
Query:  VDCSNDNTDVPTENPTIGMNIYGLKQASRQWFDKFSNASLLLGFSQSKSDYSLFTRVSG--SHFLVLLVYVDGIIITGASVHDITKLSIRLDKTFKLKDL
        + C++DN         +   IYGLKQA+R WF+ F  A     F  S  D  ++    G  +  + +L+YVD ++I    +  +      L + F++ DL
Subjt:  VDCSNDNTDVPTENPTIGMNIYGLKQASRQWFDKFSNASLLLGFSQSKSDYSLFTRVSG--SHFLVLLVYVDGIIITGASVHDITKLSIRLDKTFKLKDL

Query:  GVLRYFLGLELARPSKGIFLSQRHYTLQLLEDTCFLSAKSCSLPLDPNLKLQ
          +++F+G+ +      I+LSQ  Y  ++L      +  + S PL   +  +
Subjt:  GVLRYFLGLELARPSKGIFLSQRHYTLQLLEDTCFLSAKSCSLPLDPNLKLQ

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-944.9e-1538.24Show/hide
Query:  IGMNIYGLKQASRQWFDKFSNASLLLGFSQSKSDYSL-FTRVSGSHFLVLLVYVDGIIITGASVHDITKLSIRLDKTFKLKDLGVLRYFLGLELA--RPS
        +  ++YGLKQA RQW+ KF +      + ++ SD  + F R S ++F++LL+YVD ++I G     I KL   L K+F +KDLG  +  LG+++   R S
Subjt:  IGMNIYGLKQASRQWFDKFSNASLLLGFSQSKSDYSL-FTRVSGSHFLVLLVYVDGIIITGASVHDITKLSIRLDKTFKLKDLGVLRYFLGLELA--RPS

Query:  KGIFLSQRHYTLQLLEDTCFLSAKSCSLPLDPNLKL
        + ++LSQ  Y  ++LE     +AK  S PL  +LKL
Subjt:  KGIFLSQRHYTLQLLEDTCFLSAKSCSLPLDPNLKL

P92519 Uncharacterized mitochondrial protein AtMg008106.2e-1041.98Show/hide
Query:  LVLLVYVDGIIITGASVHDITKLSIRLDKTFKLKDLGVLRYFLGLELARPSKGIFLSQRHYTLQLLEDTCFLSAKSCSLPL
        + LL+YVD I++TG+S   +  L  +L  TF +KDLG + YFLG+++     G+FLSQ  Y  Q+L +   L  K  S PL
Subjt:  LVLLVYVDGIIITGASVHDITKLSIRLDKTFKLKDLGVLRYFLGLELARPSKGIFLSQRHYTLQLLEDTCFLSAKSCSLPL

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.5e-1940.69Show/hide
Query:  IYGLKQASRQWFDKFSNASLLLGFSQSKSDYSLFTRVSGSHFLVLLVYVDGIIITGASVHDITKLSIRLDKTFKLKDLGVLRYFLGLELARPSKGIFLSQ
        +YGLKQA R W+ +  N  L +GF  S SD SLF    G   + +LVYVD I+ITG     +      L + F +KD   L YFLG+E  R   G+ LSQ
Subjt:  IYGLKQASRQWFDKFSNASLLLGFSQSKSDYSLFTRVSGSHFLVLLVYVDGIIITGASVHDITKLSIRLDKTFKLKDLGVLRYFLGLELARPSKGIFLSQ

Query:  RHYTLQLLEDTCFLSAKSCSLPLDPNLKLQYDKGSNIV-PILYGG
        R Y L LL  T  ++AK  + P+ P+ KL    G+ +  P  Y G
Subjt:  RHYTLQLLEDTCFLSAKSCSLPLDPNLKLQYDKGSNIV-PILYGG

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.3e-1836.99Show/hide
Query:  IYGLKQASRQWFDKFSNASLLLGFSQSKSDYSLFTRVSGSHFLVLLVYVDGIIITGASVHDITKLSIRLDKTFKLKDLGVLRYFLGLELARPSKGIFLSQ
        IYGLKQA R W+ +     L +GF  S SD SLF    G   + +LVYVD I+ITG     +      L + F +K+   L YFLG+E  R  +G+ LSQ
Subjt:  IYGLKQASRQWFDKFSNASLLLGFSQSKSDYSLFTRVSGSHFLVLLVYVDGIIITGASVHDITKLSIRLDKTFKLKDLGVLRYFLGLELARPSKGIFLSQ

Query:  RHYTLQLLEDTCFLSAKSCSLPLDPNLKLQYDKGSNIV-PILYGGNFNELDNF------VNYKVAEIALYSGM
        R YTL LL  T  L+AK  + P+  + KL    G+ +  P  Y G    L         ++Y V  ++ Y  M
Subjt:  RHYTLQLLEDTCFLSAKSCSLPLDPNLKLQYDKGSNIV-PILYGGNFNELDNF------VNYKVAEIALYSGM

Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).1.7e-0730.39Show/hide
Query:  NPYFL----HHSNSTNLILVL---DLLMSWSHAMILGLTIENKLGFVDGTLPRQD--GDMKNSWIICNSVVTTWLLNSLLKEISTSVSISDSARDIWLDL
        +PY+L    HH +  ++  +    D  ++W       L +  K GF+DGTLP+ D    +   W  CN++V  WL+NS+  ++  SV  +++A  +W DL
Subjt:  NPYFL----HHSNSTNLILVL---DLLMSWSHAMILGLTIENKLGFVDGTLPRQD--GDMKNSWIICNSVVTTWLLNSLLKEISTSVSISDSARDIWLDL

Query:  QQ
        ++
Subjt:  QQ

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 86.2e-2947.48Show/hide
Query:  NIYGLKQASRQWFDKFSNASLLLGFSQSKSDYSLFTRVSGSHFLVLLVYVDGIIITGASVHDITKLSIRLDKTFKLKDLGVLRYFLGLELARPSKGIFLS
        +IYGLKQASRQWF KFS   +  GF QS SD++ F +++ + FL +LVYVD III   +   + +L  +L   FKL+DLG L+YFLGLE+AR + GI + 
Subjt:  NIYGLKQASRQWFDKFSNASLLLGFSQSKSDYSLFTRVSGSHFLVLLVYVDGIIITGASVHDITKLSIRLDKTFKLKDLGVLRYFLGLELARPSKGIFLS

Query:  QRHYTLQLLEDTCFLSAKSCSLPLDPNLKLQYDKGSNIV
        QR Y L LL++T  L  K  S+P+DP++      G + V
Subjt:  QRHYTLQLLEDTCFLSAKSCSLPLDPNLKLQYDKGSNIV

ATMG00810.1 DNA/RNA polymerases superfamily protein4.4e-1141.98Show/hide
Query:  LVLLVYVDGIIITGASVHDITKLSIRLDKTFKLKDLGVLRYFLGLELARPSKGIFLSQRHYTLQLLEDTCFLSAKSCSLPL
        + LL+YVD I++TG+S   +  L  +L  TF +KDLG + YFLG+++     G+FLSQ  Y  Q+L +   L  K  S PL
Subjt:  LVLLVYVDGIIITGASVHDITKLSIRLDKTFKLKDLGVLRYFLGLELARPSKGIFLSQRHYTLQLLEDTCFLSAKSCSLPL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGTTTCAATCCTCAACTCAGTCCTCACTTCTTGACCAATATTTGAATCCTTACTTCCTTCATCATTCTAATAGCACGAACCTCATTCTTGTCTTAGATTTATTGAT
GTCCTGGAGTCACGCTATGATTCTAGGGCTTACTATCGAGAACAAACTTGGATTTGTTGATGGAACTCTACCTCGACAAGATGGAGATATGAAGAATTCATGGATTATTT
GTAACAGTGTTGTTACGACATGGCTTCTAAATTCTCTCTTGAAGGAAATCTCTACGAGTGTAAGTATTTCCGATTCAGCAAGGGATATTTGGCTTGATCTTCAACAGCGC
TATCAACGACAACGATTGAAGTTTCATGCGCGTGCTATTCCTGTTGTTTTCTTCGGATATCTGCCAGGTATTAAAGGATATAGGCTTTATGACATTGGAAAACAACATAT
GTTCATCTCAAGAGATGTTACCTTTCATGAAGACATGTTTCGTTTTCATACTATTATTGTTCAAGATGAGGTTATGGATAATGTTTCAAATATGGTATTGCCAAAGTCTT
CGTCATCTGCATTGCCTAATTTTTCTATTGTGGATTGTTCTTTGAATGCAACCATAGAGCCAAAAGATCATGTTCATGTAGATATTGATAATTCAATTGGGTATGCCGAT
AATGTTGATTGTTCTAATGATAATACTGATGTGCCTACAGAAAATCCAACTATTGGAATGAATATATATGGTTTGAAACAGGCTTCACGCCAATGGTTTGATAAATTTTC
AAATGCATCATTGTTGCTTGGCTTTTCTCAATCAAAATCTGACTACTCTTTGTTCACAAGGGTTTCAGGATCTCATTTTCTTGTTTTGCTTGTCTATGTGGATGGTATTA
TAATTACTGGTGCTTCTGTACATGATATTACTAAACTAAGCATTCGTCTTGACAAGACTTTTAAGCTTAAAGATTTAGGAGTTTTGCGTTATTTTTTGGGGTTGGAACTA
GCAAGGCCTTCAAAAGGGATTTTTTTGTCCCAAAGGCACTACACTTTGCAGCTACTTGAAGATACATGTTTCTTGTCTGCTAAGTCTTGTTCATTGCCTTTGGATCCCAA
TCTTAAGCTACAGTATGATAAAGGATCAAATATTGTCCCAATTCTATATGGTGGCAATTTCAATGAGTTGGATAATTTCGTCAATTACAAAGTAGCTGAAATAGCTTTGT
ATTCAGGAATGTCATTTGCTGAATTGCAACAGTTGGTTTTCGATAAAACATATCGTATAATCAATTTTGAGAATGTTGACATTTATGCTTGTATTGGATCGAAGGATTTT
ATAAAGAAAGACGTCATTATTTCAAAAGACAATGATGTGAAATGGTTGTTACACTCTATTTTCAGTGGTGTTGAAAAACATAAGAGCATAGTGAAAGCAATTGAGCTTGT
AATCCCTAATGCCTACCATTGCATATGTATGGTTAATGCAGTAAACAATATGCAATATGAAGTATTGGATGAAATATTTCAACATGACGTACACTTGCCTTCCAGAAGTT
GTAGTTGCAGGATGTTGAATATTTTGCAGATCTTGTGCTCACATGGATGCGTAGTATTGAGTAAGAAGCACTTATCTATTAAGGAGTATGTGTTGTCGTATTACCTTAAC
AACACTCTTTCATCAGTATACAAAGATTCAATTAATCCTTTGGGTGATCAACGTATGTGGCATGTTCCTGAAGATGTGAGTTCTATAAATATTCTATCACTGAATAGCAA
GCATCCGGTTGGTGACCTAAGAAGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTGTTTCAATCCTCAACTCAGTCCTCACTTCTTGACCAATATTTGAATCCTTACTTCCTTCATCATTCTAATAGCACGAACCTCATTCTTGTCTTAGATTTATTGAT
GTCCTGGAGTCACGCTATGATTCTAGGGCTTACTATCGAGAACAAACTTGGATTTGTTGATGGAACTCTACCTCGACAAGATGGAGATATGAAGAATTCATGGATTATTT
GTAACAGTGTTGTTACGACATGGCTTCTAAATTCTCTCTTGAAGGAAATCTCTACGAGTGTAAGTATTTCCGATTCAGCAAGGGATATTTGGCTTGATCTTCAACAGCGC
TATCAACGACAACGATTGAAGTTTCATGCGCGTGCTATTCCTGTTGTTTTCTTCGGATATCTGCCAGGTATTAAAGGATATAGGCTTTATGACATTGGAAAACAACATAT
GTTCATCTCAAGAGATGTTACCTTTCATGAAGACATGTTTCGTTTTCATACTATTATTGTTCAAGATGAGGTTATGGATAATGTTTCAAATATGGTATTGCCAAAGTCTT
CGTCATCTGCATTGCCTAATTTTTCTATTGTGGATTGTTCTTTGAATGCAACCATAGAGCCAAAAGATCATGTTCATGTAGATATTGATAATTCAATTGGGTATGCCGAT
AATGTTGATTGTTCTAATGATAATACTGATGTGCCTACAGAAAATCCAACTATTGGAATGAATATATATGGTTTGAAACAGGCTTCACGCCAATGGTTTGATAAATTTTC
AAATGCATCATTGTTGCTTGGCTTTTCTCAATCAAAATCTGACTACTCTTTGTTCACAAGGGTTTCAGGATCTCATTTTCTTGTTTTGCTTGTCTATGTGGATGGTATTA
TAATTACTGGTGCTTCTGTACATGATATTACTAAACTAAGCATTCGTCTTGACAAGACTTTTAAGCTTAAAGATTTAGGAGTTTTGCGTTATTTTTTGGGGTTGGAACTA
GCAAGGCCTTCAAAAGGGATTTTTTTGTCCCAAAGGCACTACACTTTGCAGCTACTTGAAGATACATGTTTCTTGTCTGCTAAGTCTTGTTCATTGCCTTTGGATCCCAA
TCTTAAGCTACAGTATGATAAAGGATCAAATATTGTCCCAATTCTATATGGTGGCAATTTCAATGAGTTGGATAATTTCGTCAATTACAAAGTAGCTGAAATAGCTTTGT
ATTCAGGAATGTCATTTGCTGAATTGCAACAGTTGGTTTTCGATAAAACATATCGTATAATCAATTTTGAGAATGTTGACATTTATGCTTGTATTGGATCGAAGGATTTT
ATAAAGAAAGACGTCATTATTTCAAAAGACAATGATGTGAAATGGTTGTTACACTCTATTTTCAGTGGTGTTGAAAAACATAAGAGCATAGTGAAAGCAATTGAGCTTGT
AATCCCTAATGCCTACCATTGCATATGTATGGTTAATGCAGTAAACAATATGCAATATGAAGTATTGGATGAAATATTTCAACATGACGTACACTTGCCTTCCAGAAGTT
GTAGTTGCAGGATGTTGAATATTTTGCAGATCTTGTGCTCACATGGATGCGTAGTATTGAGTAAGAAGCACTTATCTATTAAGGAGTATGTGTTGTCGTATTACCTTAAC
AACACTCTTTCATCAGTATACAAAGATTCAATTAATCCTTTGGGTGATCAACGTATGTGGCATGTTCCTGAAGATGTGAGTTCTATAAATATTCTATCACTGAATAGCAA
GCATCCGGTTGGTGACCTAAGAAGTTGA
Protein sequenceShow/hide protein sequence
MVFQSSTQSSLLDQYLNPYFLHHSNSTNLILVLDLLMSWSHAMILGLTIENKLGFVDGTLPRQDGDMKNSWIICNSVVTTWLLNSLLKEISTSVSISDSARDIWLDLQQR
YQRQRLKFHARAIPVVFFGYLPGIKGYRLYDIGKQHMFISRDVTFHEDMFRFHTIIVQDEVMDNVSNMVLPKSSSSALPNFSIVDCSLNATIEPKDHVHVDIDNSIGYAD
NVDCSNDNTDVPTENPTIGMNIYGLKQASRQWFDKFSNASLLLGFSQSKSDYSLFTRVSGSHFLVLLVYVDGIIITGASVHDITKLSIRLDKTFKLKDLGVLRYFLGLEL
ARPSKGIFLSQRHYTLQLLEDTCFLSAKSCSLPLDPNLKLQYDKGSNIVPILYGGNFNELDNFVNYKVAEIALYSGMSFAELQQLVFDKTYRIINFENVDIYACIGSKDF
IKKDVIISKDNDVKWLLHSIFSGVEKHKSIVKAIELVIPNAYHCICMVNAVNNMQYEVLDEIFQHDVHLPSRSCSCRMLNILQILCSHGCVVLSKKHLSIKEYVLSYYLN
NTLSSVYKDSINPLGDQRMWHVPEDVSSINILSLNSKHPVGDLRS