; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0034661 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0034661
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionProtein FAR-RED ELONGATED HYPOCOTYL 3-like
Genome locationchr3:9530502..9534417
RNA-Seq ExpressionLag0034661
SyntenyLag0034661
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR004332 - Transposase, MuDR, plant
IPR006564 - Zinc finger, PMZ-type
IPR007527 - Zinc finger, SWIM-type
IPR018289 - MULE transposase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039149.1 protein FAR-RED ELONGATED HYPOCOTYL 3-like [Cucumis melo var. makuwa]2.0e-8938.68Show/hide
Query:  VSKMSSNFEICVDDMFSSKLVLQNAIRAIAIRDNFQFKTIKSNRDIFVVQCVVEDCEWILRA--CRFGDDVSPTWVVKRFDSEHTCSIDVVLTDHKQATF
        +SK+ SNF + V DMFSSK + +  I+AI +RDNFQ+  +KSN+++ ++QC +E+C+W LRA  C  GD     WV+ RFD EHT S+DV LTDH+QATF
Subjt:  VSKMSSNFEICVDDMFSSKLVLQNAIRAIAIRDNFQFKTIKSNRDIFVVQCVVEDCEWILRA--CRFGDDVSPTWVVKRFDSEHTCSIDVVLTDHKQATF

Query:  TFINDCIKRKISIATSELPTPKDIISFIRSEYGLRISYQKAWRAREAALNEIRGSPEDSYKMIPSFSHMLNTKN--------PDVDGRFRYFFMEISTCI
          I D IK KIS+A S+L TPK+I+ FI  E+GL ISYQKAWRA + AL++IRGS EDSYKM+P F+++L   N         D D RF YFFM +S  I
Subjt:  TFINDCIKRKISIATSELPTPKDIISFIRSEYGLRISYQKAWRAREAALNEIRGSPEDSYKMIPSFSHMLNTKN--------PDVDGRFRYFFMEISTCI

Query:  SGWQYCRPIVYVDGTQMKNKFAGTLITASTPDVHDQIFPLAFCVVDSENNWSWKWFFQQLSKIVGHHKDIVIVFDRHK----------------------
        SGWQ+C  ++++DGT +KNK+ GTL+++ST D + QIFPLAFCVVDS+N+ SW WF  Q+ +I+G   ++VIV +RHK                      
Subjt:  SGWQYCRPIVYVDGTQMKNKFAGTLITASTPDVHDQIFPLAFCVVDSENNWSWKWFFQQLSKIVGHHKDIVIVFDRHK----------------------

Query:  -----------------------------------------------SIDKA--------VC--------------------------------------
                                                       S++ A        +C                                      
Subjt:  -----------------------------------------------SIDKA--------VC--------------------------------------

Query:  --------EVNPVDNVQFQVIDGSSQHEVYLTARRCSCRMWDILEIPCSHACVVLTFKHLSIKEYISHFYLNILPPNVKCSVGRPKKVRIPSQMEFKKRV
                +VN V+N+++Q ID +SQ+ +YL               P  H     T KHLSIK       ++I+   +   VGRPKKVRIPS+M+FK+R+
Subjt:  --------EVNPVDNVQFQVIDGSSQHEVYLTARRCSCRMWDILEIPCSHACVVLTFKHLSIKEYISHFYLNILPPNVKCSVGRPKKVRIPSQMEFKKRV

Query:  KCGRCGRSGHNRKTCRF
        KCGR GR GHN K   F
Subjt:  KCGRCGRSGHNRKTCRF

XP_008452162.1 PREDICTED: uncharacterized protein LOC103493267 [Cucumis melo]2.4e-7953.71Show/hide
Query:  GRFSHTIDVSKMSSNFEICVDDMFSSKLVLQNAIRAIAIRDNFQFKTIKSNRDIFVVQCVVEDCEWILRA-CRFGDDVSPTWVVKRFDSEHTCSIDVVLT
        GRF   +D+SK+ SNF + V DMFS+K + +  ++ IA+RDNFQ+ T+KSN ++ ++QCV+E+C+W L A C    D S  WV+ RFDSE TCS+D  LT
Subjt:  GRFSHTIDVSKMSSNFEICVDDMFSSKLVLQNAIRAIAIRDNFQFKTIKSNRDIFVVQCVVEDCEWILRA-CRFGDDVSPTWVVKRFDSEHTCSIDVVLT

Query:  DHKQATFTFINDCIKRKISIATSELPTPKDIISFIRSEYGLRISYQKAWRAREAALNEIRGSPEDSYKMIPSFSHMLNTKNPDVDGRFRYFFMEISTCIS
        DH+QATFT I D IK KI +A S+L TPKDI+ FI  E+GL I YQKAWRAR AAL++IR          P+   ++  K  D  GRF YFFM +S  IS
Subjt:  DHKQATFTFINDCIKRKISIATSELPTPKDIISFIRSEYGLRISYQKAWRAREAALNEIRGSPEDSYKMIPSFSHMLNTKNPDVDGRFRYFFMEISTCIS

Query:  GWQYCRPIVYVDGTQMKNKFAGTLITASTPDVHDQIFPLAFCVVDSENNWSWKWFFQQLSKIVGHHKDIVIVFDRHKSIDKAV
        G Q+CR ++ ++ T +KNK+ GTL++ASTPD +D IFPLAFCVVDSEN+ SW WF  QL +I+G   D+VIV DRH+SI KA+
Subjt:  GWQYCRPIVYVDGTQMKNKFAGTLITASTPDVHDQIFPLAFCVVDSENNWSWKWFFQQLSKIVGHHKDIVIVFDRHKSIDKAV

XP_008463110.1 PREDICTED: uncharacterized protein LOC103501336 [Cucumis melo]1.5e-9241.29Show/hide
Query:  VSKMSSNFEICVDDMFSSKLVLQNAIRAIAIRDNFQFKTIKSNRDIFVVQCVVEDCEWILRA--CRFGDDVSPTWVVKRFDSEHTCSIDVVLTDHKQATF
        +SK+ SNF + V DMFSSK + +  I+AI +RDNFQ+  +KSN+++ ++QC +E+C+W LRA  C  GD     WV+ RFD EHT S+DV LTDH+QATF
Subjt:  VSKMSSNFEICVDDMFSSKLVLQNAIRAIAIRDNFQFKTIKSNRDIFVVQCVVEDCEWILRA--CRFGDDVSPTWVVKRFDSEHTCSIDVVLTDHKQATF

Query:  TFINDCIKRKISIATSELPTPKDIISFIRSEYGLRISYQKAWRAREAALNEIRGSPEDSYKMIPSFSHMLNTKN--------PDVDGRFRYFFMEISTCI
          I D IK KIS+A S+L TPK+I+ FI  E+GL ISYQKAWRA + AL++IRGS EDSYKM+P F+++L   N         D D RF YFFM +S  I
Subjt:  TFINDCIKRKISIATSELPTPKDIISFIRSEYGLRISYQKAWRAREAALNEIRGSPEDSYKMIPSFSHMLNTKN--------PDVDGRFRYFFMEISTCI

Query:  SGWQYCRPIVYVDGTQMKNKFAGTLITASTPDVHDQIFPLAFCVVDSENNWSWKWFFQQLSKIVGHHKDIVIVFDRHK----------------------
        SGWQ+C  ++++DGT +KNK+ GTL+++ST D + QIFPLAFCVVDS+N+ SW WF  Q+ +I+G   ++VIV +RHK                      
Subjt:  SGWQYCRPIVYVDGTQMKNKFAGTLITASTPDVHDQIFPLAFCVVDSENNWSWKWFFQQLSKIVGHHKDIVIVFDRHK----------------------

Query:  ----------------SIDKA--------VC----------------------------------------------EVNPVDNVQFQVIDGSSQHEVYL
                        S++ A        +C                                              +VN V+N+++Q ID +SQ+ +YL
Subjt:  ----------------SIDKA--------VC----------------------------------------------EVNPVDNVQFQVIDGSSQHEVYL

Query:  TARRCSCRMWDILEIPCSHACVVLTFKHLSIKEYISHFYLNILPPNVKCSVGRPKKVRIPSQMEFKKRVKCGRCGRSGHNRK
                       P  H     T KHLSIK       ++I+   +   VGRPKKVRIPS+M+FK+R+KCGR GR GHN K
Subjt:  TARRCSCRMWDILEIPCSHACVVLTFKHLSIKEYISHFYLNILPPNVKCSVGRPKKVRIPSQMEFKKRVKCGRCGRSGHNRK

XP_022134813.1 uncharacterized protein LOC111006994 [Momordica charantia]1.7e-8546.76Show/hide
Query:  GRFSHTIDVSKMSSNFEICVDDMFSSKLVLQNAIRAIAIRDNFQFKTIKSNRDIFVVQCVVEDCEWILRACRFGDDVSPTWVVKRFDSEHTCSIDVVLTD
        G+F   IDV+ +S+ F I V+D F  K  LQNA+R++AIR NF F+T+KSNR++ +V C+  +C+W L A +FGD  S  W+VK+F  EHTCS+++VL D
Subjt:  GRFSHTIDVSKMSSNFEICVDDMFSSKLVLQNAIRAIAIRDNFQFKTIKSNRDIFVVQCVVEDCEWILRACRFGDDVSPTWVVKRFDSEHTCSIDVVLTD

Query:  HKQATFTFINDCIKRKISIATSELPTPKDIISFIRSEYGLRISYQKAWRAREAALNEIRGSPEDSYKMIPSFSHMLNTKNP--------DVDGRFRYFFM
        H+QATF+ I + IK +I+   ++LP+ KD IS +  E  + I+YQKA  ARE A+ EIRGSPE SY +IP F HM+  KNP        D +GRF Y FM
Subjt:  HKQATFTFINDCIKRKISIATSELPTPKDIISFIRSEYGLRISYQKAWRAREAALNEIRGSPEDSYKMIPSFSHMLNTKNP--------DVDGRFRYFFM

Query:  EISTCISGWQYCRPIVYVDGTQMKNKFAGTLITASTPDVHDQIFPLAFCVVDSENNWSWKWFFQQLSKIVGHHKDIVIVFDRHKSIDKAVCEVNPVDNVQ
          S+ ISGW+YC P++ VDGT MKNK+AGTLI+A T D + QIFPLAF V DSEN+ SW  FF+QL + +G  +D+VIV DRHKSI              
Subjt:  EISTCISGWQYCRPIVYVDGTQMKNKFAGTLITASTPDVHDQIFPLAFCVVDSENNWSWKWFFQQLSKIVGHHKDIVIVFDRHKSIDKAVCEVNPVDNVQ

Query:  FQVIDGSSQHEVYLTARRCSC--------------RMWDILEIPCSHACVVLTFK
             G S  +V+ TA  C C              ++ D L  PC+ A  V  F+
Subjt:  FQVIDGSSQHEVYLTARRCSC--------------RMWDILEIPCSHACVVLTFK

XP_022155207.1 uncharacterized protein LOC111022347 [Momordica charantia]1.5e-7636.7Show/hide
Query:  GRFSHTIDVSKMSSNFEICVDDMFSSKLVLQNAIRAIAIRDNFQFKTIKSNRDIFVVQCVVEDCEWILRACRFGDDVSPTWVVKRFDSEHTCSIDVVLTD
        G+F ++IDV+ +  NF I V+D F  K  LQNA+R++AIRDNFQF+TIKSNRD+ +V+  +++C+W L A +FGD  S  W+VK+F  EHTCS+++VL D
Subjt:  GRFSHTIDVSKMSSNFEICVDDMFSSKLVLQNAIRAIAIRDNFQFKTIKSNRDIFVVQCVVEDCEWILRACRFGDDVSPTWVVKRFDSEHTCSIDVVLTD

Query:  HKQATFTFINDCIKRKISIATSELPTPKDIISFIRSEYGLRISYQKAWRAREAALNEIRGSPEDSYKMIPSFSHMLNTKNP--------DVDGRFRYFFM
        H+QATF+ I + IK + ++  ++L + KD+IS +R   G+ I+YQKAWR R++A+ EI+GSPE+SY ++PSF HM+  KNP        D  GRF Y FM
Subjt:  HKQATFTFINDCIKRKISIATSELPTPKDIISFIRSEYGLRISYQKAWRAREAALNEIRGSPEDSYKMIPSFSHMLNTKNP--------DVDGRFRYFFM

Query:  EISTCISGWQYCRPIVYVDGTQMKNKFAGTLITA-----STPDVHDQIFPLA-------FCVVDSE-------NNWSWKWFFQQLSKIVGHHKDIVIVFD
         +S+ I G+QYCRP+    G +        L+T      S     +++F  A       +C   SE       + WS+ +        +  +    +   
Subjt:  EISTCISGWQYCRPIVYVDGTQMKNKFAGTLITA-----STPDVHDQIFPLA-------FCVVDSE-------NNWSWKWFFQQLSKIVGHHKDIVIVFD

Query:  RHKSIDKAVCEVNPV-------------DNVQFQVIDGSSQHEVYLTARRCSCRMWDILEIPCSHACVVLTFKHLSIKEYISHFYLN-------------
           +++  +  +  V             ++V FQ+   +   E  L+ +  +CR   +  +        +  KHL  K Y+S +Y N             
Subjt:  RHKSIDKAVCEVNPV-------------DNVQFQVIDGSSQHEVYLTARRCSCRMWDILEIPCSHACVVLTFKHLSIKEYISHFYLN-------------

Query:  -----------------ILPPNVKCSVGRPKKVRIPSQMEFKKRVKCGRCGRSGHNRKTCRFALTQ
                         +L PNVK   GRPKK+RIPS +EFKKRVKC RCGR GHNRK+C+F+LTQ
Subjt:  -----------------ILPPNVKCSVGRPKKVRIPSQMEFKKRVKCGRCGRSGHNRKTCRFALTQ

TrEMBL top hitse value%identityAlignment
A0A1S3BT85 uncharacterized protein LOC1034932671.2e-7953.71Show/hide
Query:  GRFSHTIDVSKMSSNFEICVDDMFSSKLVLQNAIRAIAIRDNFQFKTIKSNRDIFVVQCVVEDCEWILRA-CRFGDDVSPTWVVKRFDSEHTCSIDVVLT
        GRF   +D+SK+ SNF + V DMFS+K + +  ++ IA+RDNFQ+ T+KSN ++ ++QCV+E+C+W L A C    D S  WV+ RFDSE TCS+D  LT
Subjt:  GRFSHTIDVSKMSSNFEICVDDMFSSKLVLQNAIRAIAIRDNFQFKTIKSNRDIFVVQCVVEDCEWILRA-CRFGDDVSPTWVVKRFDSEHTCSIDVVLT

Query:  DHKQATFTFINDCIKRKISIATSELPTPKDIISFIRSEYGLRISYQKAWRAREAALNEIRGSPEDSYKMIPSFSHMLNTKNPDVDGRFRYFFMEISTCIS
        DH+QATFT I D IK KI +A S+L TPKDI+ FI  E+GL I YQKAWRAR AAL++IR          P+   ++  K  D  GRF YFFM +S  IS
Subjt:  DHKQATFTFINDCIKRKISIATSELPTPKDIISFIRSEYGLRISYQKAWRAREAALNEIRGSPEDSYKMIPSFSHMLNTKNPDVDGRFRYFFMEISTCIS

Query:  GWQYCRPIVYVDGTQMKNKFAGTLITASTPDVHDQIFPLAFCVVDSENNWSWKWFFQQLSKIVGHHKDIVIVFDRHKSIDKAV
        G Q+CR ++ ++ T +KNK+ GTL++ASTPD +D IFPLAFCVVDSEN+ SW WF  QL +I+G   D+VIV DRH+SI KA+
Subjt:  GWQYCRPIVYVDGTQMKNKFAGTLITASTPDVHDQIFPLAFCVVDSENNWSWKWFFQQLSKIVGHHKDIVIVFDRHKSIDKAV

A0A1S3CIV7 uncharacterized protein LOC1035013367.1e-9341.29Show/hide
Query:  VSKMSSNFEICVDDMFSSKLVLQNAIRAIAIRDNFQFKTIKSNRDIFVVQCVVEDCEWILRA--CRFGDDVSPTWVVKRFDSEHTCSIDVVLTDHKQATF
        +SK+ SNF + V DMFSSK + +  I+AI +RDNFQ+  +KSN+++ ++QC +E+C+W LRA  C  GD     WV+ RFD EHT S+DV LTDH+QATF
Subjt:  VSKMSSNFEICVDDMFSSKLVLQNAIRAIAIRDNFQFKTIKSNRDIFVVQCVVEDCEWILRA--CRFGDDVSPTWVVKRFDSEHTCSIDVVLTDHKQATF

Query:  TFINDCIKRKISIATSELPTPKDIISFIRSEYGLRISYQKAWRAREAALNEIRGSPEDSYKMIPSFSHMLNTKN--------PDVDGRFRYFFMEISTCI
          I D IK KIS+A S+L TPK+I+ FI  E+GL ISYQKAWRA + AL++IRGS EDSYKM+P F+++L   N         D D RF YFFM +S  I
Subjt:  TFINDCIKRKISIATSELPTPKDIISFIRSEYGLRISYQKAWRAREAALNEIRGSPEDSYKMIPSFSHMLNTKN--------PDVDGRFRYFFMEISTCI

Query:  SGWQYCRPIVYVDGTQMKNKFAGTLITASTPDVHDQIFPLAFCVVDSENNWSWKWFFQQLSKIVGHHKDIVIVFDRHK----------------------
        SGWQ+C  ++++DGT +KNK+ GTL+++ST D + QIFPLAFCVVDS+N+ SW WF  Q+ +I+G   ++VIV +RHK                      
Subjt:  SGWQYCRPIVYVDGTQMKNKFAGTLITASTPDVHDQIFPLAFCVVDSENNWSWKWFFQQLSKIVGHHKDIVIVFDRHK----------------------

Query:  ----------------SIDKA--------VC----------------------------------------------EVNPVDNVQFQVIDGSSQHEVYL
                        S++ A        +C                                              +VN V+N+++Q ID +SQ+ +YL
Subjt:  ----------------SIDKA--------VC----------------------------------------------EVNPVDNVQFQVIDGSSQHEVYL

Query:  TARRCSCRMWDILEIPCSHACVVLTFKHLSIKEYISHFYLNILPPNVKCSVGRPKKVRIPSQMEFKKRVKCGRCGRSGHNRK
                       P  H     T KHLSIK       ++I+   +   VGRPKKVRIPS+M+FK+R+KCGR GR GHN K
Subjt:  TARRCSCRMWDILEIPCSHACVVLTFKHLSIKEYISHFYLNILPPNVKCSVGRPKKVRIPSQMEFKKRVKCGRCGRSGHNRK

A0A5A7TCZ3 Protein FAR-RED ELONGATED HYPOCOTYL 3-like9.6e-9038.68Show/hide
Query:  VSKMSSNFEICVDDMFSSKLVLQNAIRAIAIRDNFQFKTIKSNRDIFVVQCVVEDCEWILRA--CRFGDDVSPTWVVKRFDSEHTCSIDVVLTDHKQATF
        +SK+ SNF + V DMFSSK + +  I+AI +RDNFQ+  +KSN+++ ++QC +E+C+W LRA  C  GD     WV+ RFD EHT S+DV LTDH+QATF
Subjt:  VSKMSSNFEICVDDMFSSKLVLQNAIRAIAIRDNFQFKTIKSNRDIFVVQCVVEDCEWILRA--CRFGDDVSPTWVVKRFDSEHTCSIDVVLTDHKQATF

Query:  TFINDCIKRKISIATSELPTPKDIISFIRSEYGLRISYQKAWRAREAALNEIRGSPEDSYKMIPSFSHMLNTKN--------PDVDGRFRYFFMEISTCI
          I D IK KIS+A S+L TPK+I+ FI  E+GL ISYQKAWRA + AL++IRGS EDSYKM+P F+++L   N         D D RF YFFM +S  I
Subjt:  TFINDCIKRKISIATSELPTPKDIISFIRSEYGLRISYQKAWRAREAALNEIRGSPEDSYKMIPSFSHMLNTKN--------PDVDGRFRYFFMEISTCI

Query:  SGWQYCRPIVYVDGTQMKNKFAGTLITASTPDVHDQIFPLAFCVVDSENNWSWKWFFQQLSKIVGHHKDIVIVFDRHK----------------------
        SGWQ+C  ++++DGT +KNK+ GTL+++ST D + QIFPLAFCVVDS+N+ SW WF  Q+ +I+G   ++VIV +RHK                      
Subjt:  SGWQYCRPIVYVDGTQMKNKFAGTLITASTPDVHDQIFPLAFCVVDSENNWSWKWFFQQLSKIVGHHKDIVIVFDRHK----------------------

Query:  -----------------------------------------------SIDKA--------VC--------------------------------------
                                                       S++ A        +C                                      
Subjt:  -----------------------------------------------SIDKA--------VC--------------------------------------

Query:  --------EVNPVDNVQFQVIDGSSQHEVYLTARRCSCRMWDILEIPCSHACVVLTFKHLSIKEYISHFYLNILPPNVKCSVGRPKKVRIPSQMEFKKRV
                +VN V+N+++Q ID +SQ+ +YL               P  H     T KHLSIK       ++I+   +   VGRPKKVRIPS+M+FK+R+
Subjt:  --------EVNPVDNVQFQVIDGSSQHEVYLTARRCSCRMWDILEIPCSHACVVLTFKHLSIKEYISHFYLNILPPNVKCSVGRPKKVRIPSQMEFKKRV

Query:  KCGRCGRSGHNRKTCRF
        KCGR GR GHN K   F
Subjt:  KCGRCGRSGHNRKTCRF

A0A6J1C328 uncharacterized protein LOC1110069948.4e-8646.76Show/hide
Query:  GRFSHTIDVSKMSSNFEICVDDMFSSKLVLQNAIRAIAIRDNFQFKTIKSNRDIFVVQCVVEDCEWILRACRFGDDVSPTWVVKRFDSEHTCSIDVVLTD
        G+F   IDV+ +S+ F I V+D F  K  LQNA+R++AIR NF F+T+KSNR++ +V C+  +C+W L A +FGD  S  W+VK+F  EHTCS+++VL D
Subjt:  GRFSHTIDVSKMSSNFEICVDDMFSSKLVLQNAIRAIAIRDNFQFKTIKSNRDIFVVQCVVEDCEWILRACRFGDDVSPTWVVKRFDSEHTCSIDVVLTD

Query:  HKQATFTFINDCIKRKISIATSELPTPKDIISFIRSEYGLRISYQKAWRAREAALNEIRGSPEDSYKMIPSFSHMLNTKNP--------DVDGRFRYFFM
        H+QATF+ I + IK +I+   ++LP+ KD IS +  E  + I+YQKA  ARE A+ EIRGSPE SY +IP F HM+  KNP        D +GRF Y FM
Subjt:  HKQATFTFINDCIKRKISIATSELPTPKDIISFIRSEYGLRISYQKAWRAREAALNEIRGSPEDSYKMIPSFSHMLNTKNP--------DVDGRFRYFFM

Query:  EISTCISGWQYCRPIVYVDGTQMKNKFAGTLITASTPDVHDQIFPLAFCVVDSENNWSWKWFFQQLSKIVGHHKDIVIVFDRHKSIDKAVCEVNPVDNVQ
          S+ ISGW+YC P++ VDGT MKNK+AGTLI+A T D + QIFPLAF V DSEN+ SW  FF+QL + +G  +D+VIV DRHKSI              
Subjt:  EISTCISGWQYCRPIVYVDGTQMKNKFAGTLITASTPDVHDQIFPLAFCVVDSENNWSWKWFFQQLSKIVGHHKDIVIVFDRHKSIDKAVCEVNPVDNVQ

Query:  FQVIDGSSQHEVYLTARRCSC--------------RMWDILEIPCSHACVVLTFK
             G S  +V+ TA  C C              ++ D L  PC+ A  V  F+
Subjt:  FQVIDGSSQHEVYLTARRCSC--------------RMWDILEIPCSHACVVLTFK

A0A6J1DNQ8 uncharacterized protein LOC1110223477.1e-7736.7Show/hide
Query:  GRFSHTIDVSKMSSNFEICVDDMFSSKLVLQNAIRAIAIRDNFQFKTIKSNRDIFVVQCVVEDCEWILRACRFGDDVSPTWVVKRFDSEHTCSIDVVLTD
        G+F ++IDV+ +  NF I V+D F  K  LQNA+R++AIRDNFQF+TIKSNRD+ +V+  +++C+W L A +FGD  S  W+VK+F  EHTCS+++VL D
Subjt:  GRFSHTIDVSKMSSNFEICVDDMFSSKLVLQNAIRAIAIRDNFQFKTIKSNRDIFVVQCVVEDCEWILRACRFGDDVSPTWVVKRFDSEHTCSIDVVLTD

Query:  HKQATFTFINDCIKRKISIATSELPTPKDIISFIRSEYGLRISYQKAWRAREAALNEIRGSPEDSYKMIPSFSHMLNTKNP--------DVDGRFRYFFM
        H+QATF+ I + IK + ++  ++L + KD+IS +R   G+ I+YQKAWR R++A+ EI+GSPE+SY ++PSF HM+  KNP        D  GRF Y FM
Subjt:  HKQATFTFINDCIKRKISIATSELPTPKDIISFIRSEYGLRISYQKAWRAREAALNEIRGSPEDSYKMIPSFSHMLNTKNP--------DVDGRFRYFFM

Query:  EISTCISGWQYCRPIVYVDGTQMKNKFAGTLITA-----STPDVHDQIFPLA-------FCVVDSE-------NNWSWKWFFQQLSKIVGHHKDIVIVFD
         +S+ I G+QYCRP+    G +        L+T      S     +++F  A       +C   SE       + WS+ +        +  +    +   
Subjt:  EISTCISGWQYCRPIVYVDGTQMKNKFAGTLITA-----STPDVHDQIFPLA-------FCVVDSE-------NNWSWKWFFQQLSKIVGHHKDIVIVFD

Query:  RHKSIDKAVCEVNPV-------------DNVQFQVIDGSSQHEVYLTARRCSCRMWDILEIPCSHACVVLTFKHLSIKEYISHFYLN-------------
           +++  +  +  V             ++V FQ+   +   E  L+ +  +CR   +  +        +  KHL  K Y+S +Y N             
Subjt:  RHKSIDKAVCEVNPV-------------DNVQFQVIDGSSQHEVYLTARRCSCRMWDILEIPCSHACVVLTFKHLSIKEYISHFYLN-------------

Query:  -----------------ILPPNVKCSVGRPKKVRIPSQMEFKKRVKCGRCGRSGHNRKTCRFALTQ
                         +L PNVK   GRPKK+RIPS +EFKKRVKC RCGR GHNRK+C+F+LTQ
Subjt:  -----------------ILPPNVKCSVGRPKKVRIPSQMEFKKRVKCGRCGRSGHNRKTCRFALTQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G49920.1 MuDR family transposase2.6e-1524.37Show/hide
Query:  KSNRDIFVVQCVVEDCEWILRACRFGDDVSPTWVVKRFDSEHTCSIDVVLTDHKQATFTF-INDCIKRKISIATSELPT-PKDIISFIRSEYGLRISYQK
        ++ +D++VV+C    C+W + A R  +D    + +      H C  +  L D       F I   ++ + +++T+EL    +    F   +     S   
Subjt:  KSNRDIFVVQCVVEDCEWILRACRFGDDVSPTWVVKRFDSEHTCSIDVVLTDHKQATFTF-INDCIKRKISIATSELPT-PKDIISFIRSEYGLRISYQK

Query:  AWRAREAALNEIRGSPEDSYKMIPSFSHMLNTKN-------------PDVDGRFRYFFMEISTCISGWQYCRPIVYVDGTQMKNKFAGTLITASTPDVHD
           A+  A+    G  + S+++IP    +L++ N                   FR  F   S  I G+Q+CRP++ VD   +  K+   L+ AS  D  +
Subjt:  AWRAREAALNEIRGSPEDSYKMIPSFSHMLNTKN-------------PDVDGRFRYFFMEISTCISGWQYCRPIVYVDGTQMKNKFAGTLITASTPDVHD

Query:  QIFPLAFCVVDSENNWSWKWFFQQLSKIVGHHKDIVIV
        Q FPLAF V    +  SW+WF  ++ + V   + I ++
Subjt:  QIFPLAFCVVDSENNWSWKWFFQQLSKIVGHHKDIVIV

AT1G64255.1 MuDR family transposase1.0e-1423.67Show/hide
Query:  RDIFVVQCVVEDCEWILRACRFGDDVSPTWVVKRFDSEHTCSIDVVLTDHKQATFTFINDCIKRKISIATSELPTPKDIISFIRSEYGLRISYQKAWRAR
        +D ++ +C+   C+W L A R          + ++   HTC   +V  D K     F  D I+R +    ++  T  ++  + + + G  +       A+
Subjt:  RDIFVVQCVVEDCEWILRACRFGDDVSPTWVVKRFDSEHTCSIDVVLTDHKQATFTFINDCIKRKISIATSELPTPKDIISFIRSEYGLRISYQKAWRAR

Query:  EAALNEIRGSPEDSYKMIPSFSHMLNTKNP-DVDGRFRYF-----------FMEISTCISGWQYCRPIVYVDGTQMKNKFAGTLITASTPDVHDQIFPLA
        E A+  + G  + S++  P     L + N   VD ++  F           F      I G+Q+CRP++ VD   +  ++   L+ AS  D  ++ FPLA
Subjt:  EAALNEIRGSPEDSYKMIPSFSHMLNTKNP-DVDGRFRYF-----------FMEISTCISGWQYCRPIVYVDGTQMKNKFAGTLITASTPDVHDQIFPLA

Query:  FCVVDSENNWSWKWFFQQLSKIVGHHKDIVIVFDRHKSIDKAVCE
        F V    +   W+WF   + + V   K + ++   H  I   V E
Subjt:  FCVVDSENNWSWKWFFQQLSKIVGHHKDIVIVFDRHKSIDKAVCE

AT1G64260.1 MuDR family transposase3.0e-1922.71Show/hide
Query:  FSSKLVLQNAIRAIAIRDNFQFKTIKSNRDIFVVQCVVEDCEWILRACRFGDDVSPTWVVKRFDSEHTCSIDVVLTDHKQATFTFINDCIKRKISIATSE
        F  +  L+ A+    IR        ++ ++++  +CV   C+W LRA R   +      + ++   HTCS +       +     I   ++ + +++ +E
Subjt:  FSSKLVLQNAIRAIAIRDNFQFKTIKSNRDIFVVQCVVEDCEWILRACRFGDDVSPTWVVKRFDSEHTCSIDVVLTDHKQATFTFINDCIKRKISIATSE

Query:  LPTPKDIISFIRSEYGLRISYQKAWRAREAALNEIRGSPEDSYKMIPSFSHMLNTKN-----------PDVD-GRFRYFFMEISTCISGWQYCRPIVYVD
        L        + + + G  +   K    +   +  + G  + S++++P      ++ N           P+ D   FR  F   S  I G+Q+CRP++ VD
Subjt:  LPTPKDIISFIRSEYGLRISYQKAWRAREAALNEIRGSPEDSYKMIPSFSHMLNTKN-----------PDVD-GRFRYFFMEISTCISGWQYCRPIVYVD

Query:  GTQMKNKFAGTLITASTPDVHDQIFPLAFCVVDSENNWSWKWFFQQLSKIVGHHKDIVIVFDRHKSIDKAVCE
           +  K+   L+ AS  D  ++ FPLAF V    +  SW+WFF ++ + V   KD+ ++    + I   V E
Subjt:  GTQMKNKFAGTLITASTPDVHDQIFPLAFCVVDSENNWSWKWFFQQLSKIVGHHKDIVIVFDRHKSIDKAVCE

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)7.2e-0528.24Show/hide
Query:  TYEEWRVVDQALLGWFFGSMTPSVVCDVIDLK-TSRELWKALEELYGATTKARIIQLRKVMQTTRGSSMKMENYLSFMKQTSEHL
        T + W+  D  +  W +G++T S++  +I +  T+R+LW +LE L+    +AR +Q    ++TT    + +  Y   +K  S+ L
Subjt:  TYEEWRVVDQALLGWFFGSMTPSVVCDVIDLK-TSRELWKALEELYGATTKARIIQLRKVMQTTRGSSMKMENYLSFMKQTSEHL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTACTTATTGTACTTCGAGGACAGAAGGTTGATGGCTATGCTCTTGGAACAAAAGAACAACCACCTGAATTCATCACCACTAGTGAGAAAGAAACTTCTGCACGCAC
AATATCTAATCCGACCTATGAGGAATGGAGAGTTGTTGATCAAGCTCTATTAGGATGGTTCTTCGGATCGATGACACCATCTGTGGTATGTGATGTTATTGACTTAAAGA
CTTCTAGAGAATTGTGGAAGGCACTCGAGGAACTCTATGGTGCAACAACCAAAGCTCGCATTATCCAACTTAGGAAGGTTATGCAAACCACAAGGGGGAGCTCCATGAAA
ATGGAAAACTATCTTAGCTTCATGAAGCAAACTTCAGAACATCTAAAATTGGCTGGTGAAGGGCGTTTTTCCCACACCATTGATGTTTCGAAGATGTCTTCCAATTTTGA
GATTTGCGTAGATGACATGTTCTCCTCGAAGCTTGTGTTACAGAATGCTATTCGAGCTATTGCCATTAGAGACAACTTCCAATTCAAAACAATCAAATCTAACCGTGACA
TTTTTGTTGTGCAATGTGTGGTTGAGGATTGTGAGTGGATCCTTAGAGCATGTAGGTTCGGTGACGATGTTAGTCCCACTTGGGTTGTCAAGAGATTTGACAGTGAACAT
ACATGTTCGATTGATGTTGTATTGACAGATCACAAGCAAGCGACATTTACATTTATTAATGATTGCATTAAGCGGAAGATTAGCATTGCGACTAGTGAATTACCCACTCC
TAAAGACATTATATCATTCATCCGATCGGAATATGGCTTGCGTATTAGTTACCAAAAGGCTTGGCGTGCTCGGGAAGCTGCATTAAATGAGATTAGAGGATCTCCAGAAG
ACTCGTACAAAATGATCCCATCATTTTCACATATGTTGAATACAAAAAATCCAGATGTCGATGGCAGATTTCGTTATTTCTTCATGGAGATATCAACATGCATTTCTGGT
TGGCAATATTGTCGTCCAATTGTTTATGTAGATGGTACACAAATGAAAAACAAGTTTGCCGGAACTCTAATAACAGCTTCAACACCTGATGTGCACGATCAAATATTCCC
TCTAGCATTTTGTGTTGTCGATTCAGAGAATAATTGGTCATGGAAATGGTTCTTTCAGCAGTTAAGTAAAATTGTTGGTCATCATAAAGATATAGTTATTGTGTTTGACA
GGCATAAGAGCATTGACAAAGCAGTTTGCGAAGTAAACCCAGTGGACAATGTGCAATTCCAGGTAATAGATGGTTCTAGTCAACATGAGGTTTATTTGACAGCCAGAAGG
TGCAGCTGCAGGATGTGGGACATCTTGGAGATTCCATGTTCTCATGCTTGTGTTGTTTTGACTTTCAAGCATTTATCCATTAAGGAGTACATATCCCATTTCTACTTGAA
TATTCTCCCACCTAATGTCAAGTGTTCAGTTGGTAGGCCAAAGAAGGTCAGAATCCCTTCACAAATGGAATTCAAAAAGCGTGTGAAATGCGGTCGTTGTGGCAGGAGTG
GACATAATAGGAAAACTTGCAGATTTGCGTTGACCCAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGTACTTATTGTACTTCGAGGACAGAAGGTTGATGGCTATGCTCTTGGAACAAAAGAACAACCACCTGAATTCATCACCACTAGTGAGAAAGAAACTTCTGCACGCAC
AATATCTAATCCGACCTATGAGGAATGGAGAGTTGTTGATCAAGCTCTATTAGGATGGTTCTTCGGATCGATGACACCATCTGTGGTATGTGATGTTATTGACTTAAAGA
CTTCTAGAGAATTGTGGAAGGCACTCGAGGAACTCTATGGTGCAACAACCAAAGCTCGCATTATCCAACTTAGGAAGGTTATGCAAACCACAAGGGGGAGCTCCATGAAA
ATGGAAAACTATCTTAGCTTCATGAAGCAAACTTCAGAACATCTAAAATTGGCTGGTGAAGGGCGTTTTTCCCACACCATTGATGTTTCGAAGATGTCTTCCAATTTTGA
GATTTGCGTAGATGACATGTTCTCCTCGAAGCTTGTGTTACAGAATGCTATTCGAGCTATTGCCATTAGAGACAACTTCCAATTCAAAACAATCAAATCTAACCGTGACA
TTTTTGTTGTGCAATGTGTGGTTGAGGATTGTGAGTGGATCCTTAGAGCATGTAGGTTCGGTGACGATGTTAGTCCCACTTGGGTTGTCAAGAGATTTGACAGTGAACAT
ACATGTTCGATTGATGTTGTATTGACAGATCACAAGCAAGCGACATTTACATTTATTAATGATTGCATTAAGCGGAAGATTAGCATTGCGACTAGTGAATTACCCACTCC
TAAAGACATTATATCATTCATCCGATCGGAATATGGCTTGCGTATTAGTTACCAAAAGGCTTGGCGTGCTCGGGAAGCTGCATTAAATGAGATTAGAGGATCTCCAGAAG
ACTCGTACAAAATGATCCCATCATTTTCACATATGTTGAATACAAAAAATCCAGATGTCGATGGCAGATTTCGTTATTTCTTCATGGAGATATCAACATGCATTTCTGGT
TGGCAATATTGTCGTCCAATTGTTTATGTAGATGGTACACAAATGAAAAACAAGTTTGCCGGAACTCTAATAACAGCTTCAACACCTGATGTGCACGATCAAATATTCCC
TCTAGCATTTTGTGTTGTCGATTCAGAGAATAATTGGTCATGGAAATGGTTCTTTCAGCAGTTAAGTAAAATTGTTGGTCATCATAAAGATATAGTTATTGTGTTTGACA
GGCATAAGAGCATTGACAAAGCAGTTTGCGAAGTAAACCCAGTGGACAATGTGCAATTCCAGGTAATAGATGGTTCTAGTCAACATGAGGTTTATTTGACAGCCAGAAGG
TGCAGCTGCAGGATGTGGGACATCTTGGAGATTCCATGTTCTCATGCTTGTGTTGTTTTGACTTTCAAGCATTTATCCATTAAGGAGTACATATCCCATTTCTACTTGAA
TATTCTCCCACCTAATGTCAAGTGTTCAGTTGGTAGGCCAAAGAAGGTCAGAATCCCTTCACAAATGGAATTCAAAAAGCGTGTGAAATGCGGTCGTTGTGGCAGGAGTG
GACATAATAGGAAAACTTGCAGATTTGCGTTGACCCAGTAG
Protein sequenceShow/hide protein sequence
MVLIVLRGQKVDGYALGTKEQPPEFITTSEKETSARTISNPTYEEWRVVDQALLGWFFGSMTPSVVCDVIDLKTSRELWKALEELYGATTKARIIQLRKVMQTTRGSSMK
MENYLSFMKQTSEHLKLAGEGRFSHTIDVSKMSSNFEICVDDMFSSKLVLQNAIRAIAIRDNFQFKTIKSNRDIFVVQCVVEDCEWILRACRFGDDVSPTWVVKRFDSEH
TCSIDVVLTDHKQATFTFINDCIKRKISIATSELPTPKDIISFIRSEYGLRISYQKAWRAREAALNEIRGSPEDSYKMIPSFSHMLNTKNPDVDGRFRYFFMEISTCISG
WQYCRPIVYVDGTQMKNKFAGTLITASTPDVHDQIFPLAFCVVDSENNWSWKWFFQQLSKIVGHHKDIVIVFDRHKSIDKAVCEVNPVDNVQFQVIDGSSQHEVYLTARR
CSCRMWDILEIPCSHACVVLTFKHLSIKEYISHFYLNILPPNVKCSVGRPKKVRIPSQMEFKKRVKCGRCGRSGHNRKTCRFALTQ