CuGenDBv2

Tan0013561 (gene) of Snake gourd v1 genome

Gene ID: Tan0013561
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Protein FAR1-RELATED SEQUENCE 4-like
Genome location: LG02:87122524..87125419
RNA-Seq Expression: Tan0013561
Synteny: Tan0013561
Gene Ontology terms:
  GO:0006313 - transposition, DNA-mediated (biological process)
  GO:0003677 - DNA binding (molecular function)
  GO:0004803 - transposase activity (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
  IPR001207 - Transposase, mutator type
  IPR001878 - Zinc finger, CCHC-type
  IPR006564 - Zinc finger, PMZ-type
  IPR007527 - Zinc finger, SWIM-type
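
The genome location above follows a `sequence:start..end` pattern. A minimal sketch of reading it programmatically is given below; the names `Location` and `parse_location` are hypothetical helpers, not part of CuGenDB, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval as written in the record (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a string such as 'LG02:87122524..87125419' into a Location."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return Location(seqid, start, end)


loc = parse_location("LG02:87122524..87125419")
print(loc.seqid, loc.start, loc.end, loc.length)  # LG02 87122524 87125419 2896
```

Under the inclusive-coordinate assumption, the gene spans 2896 bp on linkage group LG02.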


Homology
GenBank top hits (columns: e-value, % identity, alignment)
KAA0065296.1 protein FAR1-RELATED SEQUENCE 4-like [Cucumis melo var. makuwa] (e-value: 3.8e-77; identity: 28.66%)
Query:  IPMLYSGSWNQSYNYVSYKVSEISLHYGMTFVELQQRIIKKLGTVGHVDSPDIYVCIG-SQSFIKKDAEINSDTDVDWLVSILTYNMDEYCALI------
        I + +SG W+ + NY++YK   + +   M+F      I++++       S  + + +    + I+   +I  D DV W +S++   +  +  +       
Subjt:  IPMLYSGSWNQSYNYVSYKVSEISLHYGMTFVELQQRIIKKLGTVGHVDSPDIYVCIG-SQSFIKKDAEINSDTDVDWLVSILTYNMDEYCALI------

Query:  ----IDSKNTLSVMLDNIGSTNKTPIVEG--RVYRNIDVTKISSNFTIQIDDIFIGKGVLHNALRDIAIRDSFQYKT-----------------------
            ++  + +   +DN+ S   +  ++   ++  +I V+  SS F ++ +D+F  K +L  +   IAI+++F++KT                       
Subjt:  ----IDSKNTLSVMLDNIGSTNKTPIVEG--RVYRNIDVTKISSNFTIQIDDIFIGKGVLHNALRDIAIRDSFQYKT-----------------------

Query:  --------------------------------------SLINIDGSSL--PTPKEI------------KNYKAWRARELAMNEIRGSPEDSYKMLPSFAH
                                                I  D SS    TP +I              YKAWRA+EL MN + G  ++SY ++P+F  
Subjt:  --------------------------------------SLINIDGSSL--PTPKEI------------KNYKAWRARELAMNEIRGSPEDSYKMLPSFAH

Query:  MLKIKNPGSITELQLDEDGRFKFFFRSFAASINGWKHCRPVIS-------------------------------------NDNSWLWFFTHLRKIIVQQN
         LK  NPGS T  + D +G FK+ F +  A I GWK+CRP IS                                     ND SW WFF +++  +  + 
Subjt:  MLKIKNPGSITELQLDEDGRFKFFFRSFAASINGWKHCRPVIS-------------------------------------NDNSWLWFFTHLRKIIVQQN

Query:  DLVIVSDRHKSIGKAVQLVFPDAFHCICMVHLLKNLKLVYKSKINDAIFYACAKAYNVIDFEHQMRQLELSARGIRNELLTIGLPKWSHAFAPRSRFTMM
        DLV++SDRH SI K+V  VFP+A +C+C+ HLLK+LKL+YK  I D +F+ C KAY V+DFE  MR +E     IR+ L  +   KW+ A+  R R+ MM
Subjt:  DLVIVSDRHKSIGKAVQLVFPDAFHCICMVHLLKNLKLVYKSKINDAIFYACAKAYNVIDFEHQMRQLELSARGIRNELLTIGLPKWSHAFAPRSRFTMM

Query:  TTNISESLNTVIKKARDLPIASMLE-------------------------------------------VNPVDNMEYQVIDGTRQYNVNLPRRSCSCRMW
        TTNISESLN V+K++RDLP+A++L+                                           VN ++++E+QVIDG +Q+ V L  +SC+C +W
Subjt:  TTNISESLNTVIKKARDLPIASMLE-------------------------------------------VNPVDNMEYQVIDGTRQYNVNLPRRSCSCRMW

Query:  DTLQIPCSHAC----------------------------RSVGRPKKNRIPSQMEFKRRVKCGRCGKSGHNRKSCRF
        D  +IPC+HA                             R  GRP+K RI S  E K   +C  C ++GHNR++C+F
Subjt:  DTLQIPCSHAC----------------------------RSVGRPKKNRIPSQMEFKRRVKCGRCGKSGHNRKSCRF

TYK09469.1 protein FAR1-RELATED SEQUENCE 4-like [Cucumis melo var. makuwa] (e-value: 3.8e-77; identity: 28.57%)
Query:  IPMLYSGSWNQSYNYVSYKVSEISLHYGMTFVELQQRIIKKLGTVGHVDSPDIYVCIG-SQSFIKKDAEINSDTDVDWLVSILTYNMDEYCALI------
        I + +SG W+ + +Y++YK + + +   M+F      I++++       S  + + +    + I+   +I  D DV W +S++   +  +  +       
Subjt:  IPMLYSGSWNQSYNYVSYKVSEISLHYGMTFVELQQRIIKKLGTVGHVDSPDIYVCIG-SQSFIKKDAEINSDTDVDWLVSILTYNMDEYCALI------

Query:  ----IDSKNTLSVMLDNIGSTNKTPIVEG--RVYRNIDVTKISSNFTIQIDDIFIGKGVLHNALRDIAIRDSFQYKT-----------------------
            ++  + +   +DN+ S   +  ++   ++  +I V+ +SS F ++  D+F  K +L  +   IAI+++F++KT                       
Subjt:  ----IDSKNTLSVMLDNIGSTNKTPIVEG--RVYRNIDVTKISSNFTIQIDDIFIGKGVLHNALRDIAIRDSFQYKT-----------------------

Query:  --------------------------------------SLINIDGSSL--PTPKEI------------KNYKAWRARELAMNEIRGSPEDSYKMLPSFAH
                                                I  D SS    TP +I              YKAWRA+EL MN + G  ++SY ++P+F  
Subjt:  --------------------------------------SLINIDGSSL--PTPKEI------------KNYKAWRARELAMNEIRGSPEDSYKMLPSFAH

Query:  MLKIKNPGSITELQLDEDGRFKFFFRSFAASINGWKHCRPVIS-------------------------------------NDNSWLWFFTHLRKIIVQQN
         LK  NPGS T  + D +G FK+ F +  A I GWK+CRP IS                                     ND SW WFF +++  +  + 
Subjt:  MLKIKNPGSITELQLDEDGRFKFFFRSFAASINGWKHCRPVIS-------------------------------------NDNSWLWFFTHLRKIIVQQN

Query:  DLVIVSDRHKSIGKAVQLVFPDAFHCICMVHLLKNLKLVYKSKINDAIFYACAKAYNVIDFEHQMRQLELSARGIRNELLTIGLPKWSHAFAPRSRFTMM
        DLV++SDRH SI K+V  VFP+A +C+C+ HLLK+LKL+YK  I D +F+ C KAY V+DFE  MR +E     IR+ L  +   KW+ A+  R R+ MM
Subjt:  DLVIVSDRHKSIGKAVQLVFPDAFHCICMVHLLKNLKLVYKSKINDAIFYACAKAYNVIDFEHQMRQLELSARGIRNELLTIGLPKWSHAFAPRSRFTMM

Query:  TTNISESLNTVIKKARDLPIASMLE-------------------------------------------VNPVDNMEYQVIDGTRQYNVNLPRRSCSCRMW
        TTNISESLN V+K++RDLP+A++L+                                           VN ++++E+QVIDG +Q+ V L  +SC+CR+W
Subjt:  TTNISESLNTVIKKARDLPIASMLE-------------------------------------------VNPVDNMEYQVIDGTRQYNVNLPRRSCSCRMW

Query:  DTLQIPCSHAC------------------------------RSVGRPKKNRIPSQMEFKRRVKCGRCGKSGHNRKSCRF
        D  +IPC+HA                               R  GRP+K RI S  E K   +C  C ++GHNR++C+F
Subjt:  DTLQIPCSHAC------------------------------RSVGRPKKNRIPSQMEFKRRVKCGRCGKSGHNRKSCRF

XP_022134813.1 uncharacterized protein LOC111006994 [Momordica charantia] (e-value: 1.9e-89; identity: 39.88%)
Query:  MTFVELQQRIIKKLGTVGHVDSPDIYVCIGSQSFIKKDAEINSDTDVDWLVSILTYNMDEYCALIIDSKNTLSVMLD----NIGSTNKTPIVE-GRVYRN
        M++  L + I+K LG VG  D PD++ C+G+  FIKKD +I+ D DV WL +I++    + C+L++D +N LS +LD    N  S+    I   G+ YR 
Subjt:  MTFVELQQRIIKKLGTVGHVDSPDIYVCIGSQSFIKKDAEINSDTDVDWLVSILTYNMDEYCALIIDSKNTLSVMLD----NIGSTNKTPIVE-GRVYRN

Query:  IDVTKISSNFTIQIDDIFIGKGVLHNALRDIAIRDSFQYKT-----------------------------------------------------------
        IDV  IS+ F I ++D F GK  L NALR +AIR +F ++T                                                           
Subjt:  IDVTKISSNFTIQIDDIFIGKGVLHNALRDIAIRDSFQYKT-----------------------------------------------------------

Query:  --------SLINIDGSSLPTPKE------------IKNYKAWRARELAMNEIRGSPEDSYKMLPSFAHMLKIKNPGSITELQLDEDGRFKFFFRSFAASI
                  IN  G+ LP+ K+            I   KA  ARE A+ EIRGSPE SY ++P F HM+K KNPGS+ + + D +GRF++ F + ++SI
Subjt:  --------SLINIDGSSLPTPKE------------IKNYKAWRARELAMNEIRGSPEDSYKMLPSFAHMLKIKNPGSITELQLDEDGRFKFFFRSFAASI

Query:  NGWKHCRPVIS-------------------------------------NDNSWLWFFTHLRKIIVQQNDLVIVSDRHKSIGKAVQLVFPDAFHCICMVHL
        +GWK+C PVIS                                     ND SW+ FF  L++ I  + DLVIVSDRHKSIGK+ + VF  A HCIC  HL
Subjt:  NGWKHCRPVIS-------------------------------------NDNSWLWFFTHLRKIIVQQNDLVIVSDRHKSIGKAVQLVFPDAFHCICMVHL

Query:  LKNLKLVYKSKINDAIFYACAKAYNVIDFEHQMRQLELSARGIRNELLTIGLPKWSHAFAPRSRFTMMTTNISESLNTVIKKARDLPIASMLEV
         KNLKL YK K++D +F+ CAKAYNV DFE  MR L+ + RGIR EL  IG  KWS AF+  SR+  MTTNISESLN  +K AR+LPI SMLEV
Subjt:  LKNLKLVYKSKINDAIFYACAKAYNVIDFEHQMRQLELSARGIRNELLTIGLPKWSHAFAPRSRFTMMTTNISESLNTVIKKARDLPIASMLEV

XP_022155207.1 uncharacterized protein LOC111022347 [Momordica charantia] (e-value: 1.8e-79; identity: 31.48%)
Query:  MLYSGSWNQSYNYVSYKVSEISLHYGMTFVELQQRIIKKLGTVGHVDSPDIYVCIGSQSFIKKDAEINSDTDVDWLVSILTYNMDEYCALIIDSKNTLSV
        M + G WN   NYV YKVSE+ +H  M++ +L + I+++LG VG  D PDI+ CIG+  F+ KD +I+ D DV WL +++   + + C+L++D +N LS 
Subjt:  MLYSGSWNQSYNYVSYKVSEISLHYGMTFVELQQRIIKKLGTVGHVDSPDIYVCIGSQSFIKKDAEINSDTDVDWLVSILTYNMDEYCALIIDSKNTLSV

Query:  MLDNIGSTNKTPIVE-----GRVYRNIDVTKISSNFTIQIDDIFIGKGVLHNALRDIAIRDSFQYKTSLINID---------------------------
        +LD +     + I       G+ + +IDVT I  NF I+++D F GK  L NALR +AIRD+FQ++T   N D                           
Subjt:  MLDNIGSTNKTPIVE-----GRVYRNIDVTKISSNFTIQIDDIFIGKGVLHNALRDIAIRDSFQYKTSLINID---------------------------

Query:  ----------------------------------------GSSLPTPKEIKNY------------KAWRARELAMNEIRGSPEDSYKMLPSFAHMLKIKN
                                                G+ L + K++ ++            KAWR R+ A+ EI+GSPE+SY ++PSF HM+K+KN
Subjt:  ----------------------------------------GSSLPTPKEIKNY------------KAWRARELAMNEIRGSPEDSYKMLPSFAHMLKIKN

Query:  PGSITELQLDEDGRFKFFFRSFAASINGWKHCRPVISNDNSWLWFFTHLRKIIVQQNDLVIVSDRHKSIGKAVQLVFPDAFHCICMVHLLKNLKLVYKSK
        PGS+ + ++D  GRF + F + ++SI+G+++CRP              L+  I  + DLV V DRHKSI K+ + VF  A HCIC  +  ++        
Subjt:  PGSITELQLDEDGRFKFFFRSFAASINGWKHCRPVISNDNSWLWFFTHLRKIIVQQNDLVIVSDRHKSIGKAVQLVFPDAFHCICMVHLLKNLKLVYKSK

Query:  INDAIFYACAKAYNVIDFEHQMRQLELSARGIRNELLTIGLPKWSHAFAPRSRFTMMTTNISESLNTVIKKARDLPIASMLEV----------NPVDNME
                                          EL  IG  KWS+A++P SR+  MTTNIS+SLN  +K A +LPI SMLEV             ++++
Subjt:  INDAIFYACAKAYNVIDFEHQMRQLELSARGIRNELLTIGLPKWSHAFAPRSRFTMMTTNISESLNTVIKKARDLPIASMLEV----------NPVDNME

Query:  YQVIDGTRQYNVNLPRRSCSCRMWDTLQI---------------------------------PCSHAC-------------------RSVGRPKKNRIPS
        +Q+   T+     L  +  +CR      +                                 P  H                     R  GRPKK RIPS
Subjt:  YQVIDGTRQYNVNLPRRSCSCRMWDTLQI---------------------------------PCSHAC-------------------RSVGRPKKNRIPS

Query:  QMEFKRRVKCGRCGKSGHNRKSCRFALTK
         +EFK+RVKC RCG+ GHNRKSC+F+LT+
Subjt:  QMEFKRRVKCGRCGKSGHNRKSCRFALTK

XP_022159005.1 uncharacterized protein LOC111025451 [Momordica charantia] (e-value: 1.2e-80; identity: 43.39%)
Query:  IKKLGTVGHVDSPDIYVCIGSQSFIKKDAEINSDTDVDWLVSILTYNMDEYCALIIDSKNTLSVMLD----NIGST--NKTPIVEGRVYRNIDVTKISSN
        +++L  VG  D P+++ CIG+  FIKKD +I  D DV WL ++      + C+L++D +N LS +LD    NI S+  N  P + G+ + +IDV  IS+N
Subjt:  IKKLGTVGHVDSPDIYVCIGSQSFIKKDAEINSDTDVDWLVSILTYNMDEYCALIIDSKNTLSVMLD----NIGST--NKTPIVEGRVYRNIDVTKISSN

Query:  FTIQIDDIFIGKGVLHNALRDIAIRDSFQYKT-------------------------------SLINIDGSSLPTPKE-----------IKNYKAWRARE
        F I+++D F G   L NALR +AIRD+FQ++T                                 INI G+ L + K+           I  +K WRA+E
Subjt:  FTIQIDDIFIGKGVLHNALRDIAIRDSFQYKT-------------------------------SLINIDGSSLPTPKE-----------IKNYKAWRARE

Query:  LAMNEIRGSPEDSYKMLPSFAHMLKIKNPGSITELQLDEDGRFKFFFRSFAASINGWKHCRPVISNDNSWLWFFTHLRKIIVQQNDLVIVSDRHKSIGKA
         A+ EIRGSPE+SY ++ SF HM+K+KNPGS+ + ++D  G            I     C     ND SW+WFF HL+  I  + +LVIVS+RHKSI K+
Subjt:  LAMNEIRGSPEDSYKMLPSFAHMLKIKNPGSITELQLDEDGRFKFFFRSFAASINGWKHCRPVISNDNSWLWFFTHLRKIIVQQNDLVIVSDRHKSIGKA

Query:  VQLVFPDAFHCICMVHLLKNLKLVYKSKINDAIFYACAKAYNVIDFEHQMRQLELSARGIRNELLTIGLPKWSHAFAPRSRFTMMTTNISESLNTVIKKA
        V+ VF  AFHCIC  HL KNLKL YK KI+D +F+ CAKAYN+ DFEH MR L+ + RG+R EL  IG  KWS A++  SR+  MTTNISESLN  +K A
Subjt:  VQLVFPDAFHCICMVHLLKNLKLVYKSKINDAIFYACAKAYNVIDFEHQMRQLELSARGIRNELLTIGLPKWSHAFAPRSRFTMMTTNISESLNTVIKKA

Query:  R
        R
Subjt:  R

TrEMBL top hits (columns: e-value, % identity, alignment)
A0A5A7VG38 Protein FAR1-RELATED SEQUENCE 4-like (e-value: 1.8e-77; identity: 28.66%)
Query:  IPMLYSGSWNQSYNYVSYKVSEISLHYGMTFVELQQRIIKKLGTVGHVDSPDIYVCIG-SQSFIKKDAEINSDTDVDWLVSILTYNMDEYCALI------
        I + +SG W+ + NY++YK   + +   M+F      I++++       S  + + +    + I+   +I  D DV W +S++   +  +  +       
Subjt:  IPMLYSGSWNQSYNYVSYKVSEISLHYGMTFVELQQRIIKKLGTVGHVDSPDIYVCIG-SQSFIKKDAEINSDTDVDWLVSILTYNMDEYCALI------

Query:  ----IDSKNTLSVMLDNIGSTNKTPIVEG--RVYRNIDVTKISSNFTIQIDDIFIGKGVLHNALRDIAIRDSFQYKT-----------------------
            ++  + +   +DN+ S   +  ++   ++  +I V+  SS F ++ +D+F  K +L  +   IAI+++F++KT                       
Subjt:  ----IDSKNTLSVMLDNIGSTNKTPIVEG--RVYRNIDVTKISSNFTIQIDDIFIGKGVLHNALRDIAIRDSFQYKT-----------------------

Query:  --------------------------------------SLINIDGSSL--PTPKEI------------KNYKAWRARELAMNEIRGSPEDSYKMLPSFAH
                                                I  D SS    TP +I              YKAWRA+EL MN + G  ++SY ++P+F  
Subjt:  --------------------------------------SLINIDGSSL--PTPKEI------------KNYKAWRARELAMNEIRGSPEDSYKMLPSFAH

Query:  MLKIKNPGSITELQLDEDGRFKFFFRSFAASINGWKHCRPVIS-------------------------------------NDNSWLWFFTHLRKIIVQQN
         LK  NPGS T  + D +G FK+ F +  A I GWK+CRP IS                                     ND SW WFF +++  +  + 
Subjt:  MLKIKNPGSITELQLDEDGRFKFFFRSFAASINGWKHCRPVIS-------------------------------------NDNSWLWFFTHLRKIIVQQN

Query:  DLVIVSDRHKSIGKAVQLVFPDAFHCICMVHLLKNLKLVYKSKINDAIFYACAKAYNVIDFEHQMRQLELSARGIRNELLTIGLPKWSHAFAPRSRFTMM
        DLV++SDRH SI K+V  VFP+A +C+C+ HLLK+LKL+YK  I D +F+ C KAY V+DFE  MR +E     IR+ L  +   KW+ A+  R R+ MM
Subjt:  DLVIVSDRHKSIGKAVQLVFPDAFHCICMVHLLKNLKLVYKSKINDAIFYACAKAYNVIDFEHQMRQLELSARGIRNELLTIGLPKWSHAFAPRSRFTMM

Query:  TTNISESLNTVIKKARDLPIASMLE-------------------------------------------VNPVDNMEYQVIDGTRQYNVNLPRRSCSCRMW
        TTNISESLN V+K++RDLP+A++L+                                           VN ++++E+QVIDG +Q+ V L  +SC+C +W
Subjt:  TTNISESLNTVIKKARDLPIASMLE-------------------------------------------VNPVDNMEYQVIDGTRQYNVNLPRRSCSCRMW

Query:  DTLQIPCSHAC----------------------------RSVGRPKKNRIPSQMEFKRRVKCGRCGKSGHNRKSCRF
        D  +IPC+HA                             R  GRP+K RI S  E K   +C  C ++GHNR++C+F
Subjt:  DTLQIPCSHAC----------------------------RSVGRPKKNRIPSQMEFKRRVKCGRCGKSGHNRKSCRF

A0A5D3DAW8 Protein FAR1-RELATED SEQUENCE 4-like (e-value: 1.8e-77; identity: 28.57%)
Query:  IPMLYSGSWNQSYNYVSYKVSEISLHYGMTFVELQQRIIKKLGTVGHVDSPDIYVCIG-SQSFIKKDAEINSDTDVDWLVSILTYNMDEYCALI------
        I + +SG W+ + +Y++YK + + +   M+F      I++++       S  + + +    + I+   +I  D DV W +S++   +  +  +       
Subjt:  IPMLYSGSWNQSYNYVSYKVSEISLHYGMTFVELQQRIIKKLGTVGHVDSPDIYVCIG-SQSFIKKDAEINSDTDVDWLVSILTYNMDEYCALI------

Query:  ----IDSKNTLSVMLDNIGSTNKTPIVEG--RVYRNIDVTKISSNFTIQIDDIFIGKGVLHNALRDIAIRDSFQYKT-----------------------
            ++  + +   +DN+ S   +  ++   ++  +I V+ +SS F ++  D+F  K +L  +   IAI+++F++KT                       
Subjt:  ----IDSKNTLSVMLDNIGSTNKTPIVEG--RVYRNIDVTKISSNFTIQIDDIFIGKGVLHNALRDIAIRDSFQYKT-----------------------

Query:  --------------------------------------SLINIDGSSL--PTPKEI------------KNYKAWRARELAMNEIRGSPEDSYKMLPSFAH
                                                I  D SS    TP +I              YKAWRA+EL MN + G  ++SY ++P+F  
Subjt:  --------------------------------------SLINIDGSSL--PTPKEI------------KNYKAWRARELAMNEIRGSPEDSYKMLPSFAH

Query:  MLKIKNPGSITELQLDEDGRFKFFFRSFAASINGWKHCRPVIS-------------------------------------NDNSWLWFFTHLRKIIVQQN
         LK  NPGS T  + D +G FK+ F +  A I GWK+CRP IS                                     ND SW WFF +++  +  + 
Subjt:  MLKIKNPGSITELQLDEDGRFKFFFRSFAASINGWKHCRPVIS-------------------------------------NDNSWLWFFTHLRKIIVQQN

Query:  DLVIVSDRHKSIGKAVQLVFPDAFHCICMVHLLKNLKLVYKSKINDAIFYACAKAYNVIDFEHQMRQLELSARGIRNELLTIGLPKWSHAFAPRSRFTMM
        DLV++SDRH SI K+V  VFP+A +C+C+ HLLK+LKL+YK  I D +F+ C KAY V+DFE  MR +E     IR+ L  +   KW+ A+  R R+ MM
Subjt:  DLVIVSDRHKSIGKAVQLVFPDAFHCICMVHLLKNLKLVYKSKINDAIFYACAKAYNVIDFEHQMRQLELSARGIRNELLTIGLPKWSHAFAPRSRFTMM

Query:  TTNISESLNTVIKKARDLPIASMLE-------------------------------------------VNPVDNMEYQVIDGTRQYNVNLPRRSCSCRMW
        TTNISESLN V+K++RDLP+A++L+                                           VN ++++E+QVIDG +Q+ V L  +SC+CR+W
Subjt:  TTNISESLNTVIKKARDLPIASMLE-------------------------------------------VNPVDNMEYQVIDGTRQYNVNLPRRSCSCRMW

Query:  DTLQIPCSHAC------------------------------RSVGRPKKNRIPSQMEFKRRVKCGRCGKSGHNRKSCRF
        D  +IPC+HA                               R  GRP+K RI S  E K   +C  C ++GHNR++C+F
Subjt:  DTLQIPCSHAC------------------------------RSVGRPKKNRIPSQMEFKRRVKCGRCGKSGHNRKSCRF

A0A6J1C328 uncharacterized protein LOC111006994 (e-value: 9.3e-90; identity: 39.88%)
Query:  MTFVELQQRIIKKLGTVGHVDSPDIYVCIGSQSFIKKDAEINSDTDVDWLVSILTYNMDEYCALIIDSKNTLSVMLD----NIGSTNKTPIVE-GRVYRN
        M++  L + I+K LG VG  D PD++ C+G+  FIKKD +I+ D DV WL +I++    + C+L++D +N LS +LD    N  S+    I   G+ YR 
Subjt:  MTFVELQQRIIKKLGTVGHVDSPDIYVCIGSQSFIKKDAEINSDTDVDWLVSILTYNMDEYCALIIDSKNTLSVMLD----NIGSTNKTPIVE-GRVYRN

Query:  IDVTKISSNFTIQIDDIFIGKGVLHNALRDIAIRDSFQYKT-----------------------------------------------------------
        IDV  IS+ F I ++D F GK  L NALR +AIR +F ++T                                                           
Subjt:  IDVTKISSNFTIQIDDIFIGKGVLHNALRDIAIRDSFQYKT-----------------------------------------------------------

Query:  --------SLINIDGSSLPTPKE------------IKNYKAWRARELAMNEIRGSPEDSYKMLPSFAHMLKIKNPGSITELQLDEDGRFKFFFRSFAASI
                  IN  G+ LP+ K+            I   KA  ARE A+ EIRGSPE SY ++P F HM+K KNPGS+ + + D +GRF++ F + ++SI
Subjt:  --------SLINIDGSSLPTPKE------------IKNYKAWRARELAMNEIRGSPEDSYKMLPSFAHMLKIKNPGSITELQLDEDGRFKFFFRSFAASI

Query:  NGWKHCRPVIS-------------------------------------NDNSWLWFFTHLRKIIVQQNDLVIVSDRHKSIGKAVQLVFPDAFHCICMVHL
        +GWK+C PVIS                                     ND SW+ FF  L++ I  + DLVIVSDRHKSIGK+ + VF  A HCIC  HL
Subjt:  NGWKHCRPVIS-------------------------------------NDNSWLWFFTHLRKIIVQQNDLVIVSDRHKSIGKAVQLVFPDAFHCICMVHL

Query:  LKNLKLVYKSKINDAIFYACAKAYNVIDFEHQMRQLELSARGIRNELLTIGLPKWSHAFAPRSRFTMMTTNISESLNTVIKKARDLPIASMLEV
         KNLKL YK K++D +F+ CAKAYNV DFE  MR L+ + RGIR EL  IG  KWS AF+  SR+  MTTNISESLN  +K AR+LPI SMLEV
Subjt:  LKNLKLVYKSKINDAIFYACAKAYNVIDFEHQMRQLELSARGIRNELLTIGLPKWSHAFAPRSRFTMMTTNISESLNTVIKKARDLPIASMLEV

A0A6J1DNQ8 uncharacterized protein LOC111022347 (e-value: 8.7e-80; identity: 31.48%)
Query:  MLYSGSWNQSYNYVSYKVSEISLHYGMTFVELQQRIIKKLGTVGHVDSPDIYVCIGSQSFIKKDAEINSDTDVDWLVSILTYNMDEYCALIIDSKNTLSV
        M + G WN   NYV YKVSE+ +H  M++ +L + I+++LG VG  D PDI+ CIG+  F+ KD +I+ D DV WL +++   + + C+L++D +N LS 
Subjt:  MLYSGSWNQSYNYVSYKVSEISLHYGMTFVELQQRIIKKLGTVGHVDSPDIYVCIGSQSFIKKDAEINSDTDVDWLVSILTYNMDEYCALIIDSKNTLSV

Query:  MLDNIGSTNKTPIVE-----GRVYRNIDVTKISSNFTIQIDDIFIGKGVLHNALRDIAIRDSFQYKTSLINID---------------------------
        +LD +     + I       G+ + +IDVT I  NF I+++D F GK  L NALR +AIRD+FQ++T   N D                           
Subjt:  MLDNIGSTNKTPIVE-----GRVYRNIDVTKISSNFTIQIDDIFIGKGVLHNALRDIAIRDSFQYKTSLINID---------------------------

Query:  ----------------------------------------GSSLPTPKEIKNY------------KAWRARELAMNEIRGSPEDSYKMLPSFAHMLKIKN
                                                G+ L + K++ ++            KAWR R+ A+ EI+GSPE+SY ++PSF HM+K+KN
Subjt:  ----------------------------------------GSSLPTPKEIKNY------------KAWRARELAMNEIRGSPEDSYKMLPSFAHMLKIKN

Query:  PGSITELQLDEDGRFKFFFRSFAASINGWKHCRPVISNDNSWLWFFTHLRKIIVQQNDLVIVSDRHKSIGKAVQLVFPDAFHCICMVHLLKNLKLVYKSK
        PGS+ + ++D  GRF + F + ++SI+G+++CRP              L+  I  + DLV V DRHKSI K+ + VF  A HCIC  +  ++        
Subjt:  PGSITELQLDEDGRFKFFFRSFAASINGWKHCRPVISNDNSWLWFFTHLRKIIVQQNDLVIVSDRHKSIGKAVQLVFPDAFHCICMVHLLKNLKLVYKSK

Query:  INDAIFYACAKAYNVIDFEHQMRQLELSARGIRNELLTIGLPKWSHAFAPRSRFTMMTTNISESLNTVIKKARDLPIASMLEV----------NPVDNME
                                          EL  IG  KWS+A++P SR+  MTTNIS+SLN  +K A +LPI SMLEV             ++++
Subjt:  INDAIFYACAKAYNVIDFEHQMRQLELSARGIRNELLTIGLPKWSHAFAPRSRFTMMTTNISESLNTVIKKARDLPIASMLEV----------NPVDNME

Query:  YQVIDGTRQYNVNLPRRSCSCRMWDTLQI---------------------------------PCSHAC-------------------RSVGRPKKNRIPS
        +Q+   T+     L  +  +CR      +                                 P  H                     R  GRPKK RIPS
Subjt:  YQVIDGTRQYNVNLPRRSCSCRMWDTLQI---------------------------------PCSHAC-------------------RSVGRPKKNRIPS

Query:  QMEFKRRVKCGRCGKSGHNRKSCRFALTK
         +EFK+RVKC RCG+ GHNRKSC+F+LT+
Subjt:  QMEFKRRVKCGRCGKSGHNRKSCRFALTK

A0A6J1DXF3 uncharacterized protein LOC111025451 (e-value: 6.0e-81; identity: 43.39%)
Query:  IKKLGTVGHVDSPDIYVCIGSQSFIKKDAEINSDTDVDWLVSILTYNMDEYCALIIDSKNTLSVMLD----NIGST--NKTPIVEGRVYRNIDVTKISSN
        +++L  VG  D P+++ CIG+  FIKKD +I  D DV WL ++      + C+L++D +N LS +LD    NI S+  N  P + G+ + +IDV  IS+N
Subjt:  IKKLGTVGHVDSPDIYVCIGSQSFIKKDAEINSDTDVDWLVSILTYNMDEYCALIIDSKNTLSVMLD----NIGST--NKTPIVEGRVYRNIDVTKISSN

Query:  FTIQIDDIFIGKGVLHNALRDIAIRDSFQYKT-------------------------------SLINIDGSSLPTPKE-----------IKNYKAWRARE
        F I+++D F G   L NALR +AIRD+FQ++T                                 INI G+ L + K+           I  +K WRA+E
Subjt:  FTIQIDDIFIGKGVLHNALRDIAIRDSFQYKT-------------------------------SLINIDGSSLPTPKE-----------IKNYKAWRARE

Query:  LAMNEIRGSPEDSYKMLPSFAHMLKIKNPGSITELQLDEDGRFKFFFRSFAASINGWKHCRPVISNDNSWLWFFTHLRKIIVQQNDLVIVSDRHKSIGKA
         A+ EIRGSPE+SY ++ SF HM+K+KNPGS+ + ++D  G            I     C     ND SW+WFF HL+  I  + +LVIVS+RHKSI K+
Subjt:  LAMNEIRGSPEDSYKMLPSFAHMLKIKNPGSITELQLDEDGRFKFFFRSFAASINGWKHCRPVISNDNSWLWFFTHLRKIIVQQNDLVIVSDRHKSIGKA

Query:  VQLVFPDAFHCICMVHLLKNLKLVYKSKINDAIFYACAKAYNVIDFEHQMRQLELSARGIRNELLTIGLPKWSHAFAPRSRFTMMTTNISESLNTVIKKA
        V+ VF  AFHCIC  HL KNLKL YK KI+D +F+ CAKAYN+ DFEH MR L+ + RG+R EL  IG  KWS A++  SR+  MTTNISESLN  +K A
Subjt:  VQLVFPDAFHCICMVHLLKNLKLVYKSKINDAIFYACAKAYNVIDFEHQMRQLELSARGIRNELLTIGLPKWSHAFAPRSRFTMMTTNISESLNTVIKKA

Query:  R
        R
Subjt:  R

SwissProt top hits (columns: e-value, % identity, alignment)
No hits found

Arabidopsis top hits (columns: e-value, % identity, alignment)
AT1G49920.1 MuDR family transposase (e-value: 2.6e-07; identity: 23.26%)
Query:  ARELAMNEIRGSPEDSYKMLPSFAHMLKIKNPGSITELQLD------EDGRFKFFFRSFAASINGWKHCRPVISND------------------------
        A+  A+    G  + S++++P    +L   N G + + Q D      E   F+  F +F+ SI G++HCRP+I  D                        
Subjt:  ARELAMNEIRGSPEDSYKMLPSFAHMLKIKNPGSITELQLD------EDGRFKFFFRSFAASINGWKHCRPVISND------------------------

Query:  -------------NSWLWFFTHLRKIIVQQNDLVIVSDRHKSIGKAV-----QLVFPDAFHCICMVHLLKNL
                     +SW WF T +R+ + Q+  + ++S     I   +     Q   P A+H  C+ HL   L
Subjt:  -------------NSWLWFFTHLRKIIVQQNDLVIVSDRHKSIGKAV-----QLVFPDAFHCICMVHLLKNL

AT1G64260.1 MuDR family transposase (e-value: 2.6e-07; identity: 20.83%)
Query:  TPKEIKNYKAWRARELAMNEIRGSPEDSYKMLPSFAHMLKIKNPGSITELQLD-----EDGRFKFFFRSFAASINGWKHCRPVISND-------------
        T  E++  K    +   +  + G  + S++++P         N G + + Q D     +   F+  F SF+ SI G++HCRP+I  D             
Subjt:  TPKEIKNYKAWRARELAMNEIRGSPEDSYKMLPSFAHMLKIKNPGSITELQLD-----EDGRFKFFFRSFAASINGWKHCRPVISND-------------

Query:  ------------------------NSWLWFFTHLRKIIVQQNDLVIVSDRHKSIGKAVQ-----LVFPDAFHCICMVHLLKNLKLVYKSKINDAIFYACA
                                +SW WFFT +R+ + Q+ DL ++S   + I   V         P A H  C+ HL      V++    +++     
Subjt:  ------------------------NSWLWFFTHLRKIIVQQNDLVIVSDRHKSIGKAVQ-----LVFPDAFHCICMVHLLKNLKLVYKSKINDAIFYACA

Query:  KAYNVIDFEHQMRQLE
              +F+  M  ++
Subjt:  KAYNVIDFEHQMRQLE


Sequences
CDS sequence
ATGCAATTTTACCGTATATTACTTTCTGTAGTTAATCAACATATTTTGGAGAATATATCGTTCATACGTAGTCGTGATATTCCTATGCTTTATAGCGGATCTTGGAACCA
GTCTTACAATTATGTTAGTTACAAAGTATCAGAGATATCATTACACTACGGAATGACATTTGTTGAGTTACAACAACGTATCATCAAAAAGTTGGGTACTGTTGGACATG
TTGACTCTCCAGATATATATGTTTGCATAGGATCTCAGTCTTTTATTAAGAAAGATGCAGAAATCAATTCAGATACTGATGTTGATTGGTTAGTTAGTATATTGACCTAC
AATATGGATGAATATTGTGCCCTAATTATTGACTCAAAGAACACACTATCAGTTATGTTGGACAACATTGGTTCAACCAACAAAACCCCTATTGTAGAAGGGCGTGTATA
TCGCAACATTGACGTTACTAAGATATCATCCAACTTTACCATTCAAATTGATGACATATTTATTGGTAAGGGTGTATTACACAATGCATTACGCGATATTGCAATTAGAG
ATAGTTTCCAATACAAGACTTCTTTGATTAATATAGATGGCAGCTCGTTACCCACTCCGAAAGAAATCAAAAACTACAAAGCTTGGCGTGCACGGGAATTAGCAATGAAT
GAGATTAGAGGTTCACCAGAGGACTCCTATAAAATGCTTCCATCTTTTGCTCACATGTTGAAGATCAAGAATCCAGGTTCAATAACTGAACTTCAACTTGATGAGGATGG
AAGGTTTAAATTTTTCTTTAGGTCATTTGCTGCTAGTATCAATGGGTGGAAACACTGTCGACCAGTGATTTCAAATGATAATTCATGGTTATGGTTCTTCACTCACCTCA
GAAAGATAATCGTACAACAGAATGACCTTGTTATTGTATCTGATAGGCATAAGAGTATTGGTAAGGCAGTTCAACTAGTATTTCCCGATGCATTTCATTGTATTTGCATG
GTCCATCTATTGAAGAACCTTAAGCTCGTGTACAAGTCGAAGATTAATGATGCCATTTTCTATGCATGTGCCAAAGCTTATAATGTTATTGATTTTGAACATCAGATGCG
ACAACTGGAGTTAAGTGCTCGAGGTATTCGTAATGAATTGCTAACCATAGGTTTACCAAAGTGGTCTCATGCCTTTGCACCTCGTAGTCGTTTTACGATGATGACAACTA
ACATTTCTGAAAGTTTGAATACTGTCATTAAGAAAGCACGAGATTTGCCTATTGCATCCATGTTGGAGGTTAATCCCGTAGACAATATGGAATATCAAGTCATAGATGGA
ACAAGACAATACAATGTGAATTTACCCAGAAGGAGTTGCAGTTGTAGAATGTGGGACACATTGCAGATTCCTTGCTCTCATGCATGTCGATCGGTTGGCAGGCCGAAGAA
GAATAGAATTCCTTCACAAATGGAGTTCAAGAGACGTGTTAAATGCGGTCGTTGTGGCAAATCTGGTCATAATCGGAAGTCTTGCAGATTCGCTCTTACAAAATAA
mRNA sequence
ATGCAATTTTACCGTATATTACTTTCTGTAGTTAATCAACATATTTTGGAGAATATATCGTTCATACGTAGTCGTGATATTCCTATGCTTTATAGCGGATCTTGGAACCA
GTCTTACAATTATGTTAGTTACAAAGTATCAGAGATATCATTACACTACGGAATGACATTTGTTGAGTTACAACAACGTATCATCAAAAAGTTGGGTACTGTTGGACATG
TTGACTCTCCAGATATATATGTTTGCATAGGATCTCAGTCTTTTATTAAGAAAGATGCAGAAATCAATTCAGATACTGATGTTGATTGGTTAGTTAGTATATTGACCTAC
AATATGGATGAATATTGTGCCCTAATTATTGACTCAAAGAACACACTATCAGTTATGTTGGACAACATTGGTTCAACCAACAAAACCCCTATTGTAGAAGGGCGTGTATA
TCGCAACATTGACGTTACTAAGATATCATCCAACTTTACCATTCAAATTGATGACATATTTATTGGTAAGGGTGTATTACACAATGCATTACGCGATATTGCAATTAGAG
ATAGTTTCCAATACAAGACTTCTTTGATTAATATAGATGGCAGCTCGTTACCCACTCCGAAAGAAATCAAAAACTACAAAGCTTGGCGTGCACGGGAATTAGCAATGAAT
GAGATTAGAGGTTCACCAGAGGACTCCTATAAAATGCTTCCATCTTTTGCTCACATGTTGAAGATCAAGAATCCAGGTTCAATAACTGAACTTCAACTTGATGAGGATGG
AAGGTTTAAATTTTTCTTTAGGTCATTTGCTGCTAGTATCAATGGGTGGAAACACTGTCGACCAGTGATTTCAAATGATAATTCATGGTTATGGTTCTTCACTCACCTCA
GAAAGATAATCGTACAACAGAATGACCTTGTTATTGTATCTGATAGGCATAAGAGTATTGGTAAGGCAGTTCAACTAGTATTTCCCGATGCATTTCATTGTATTTGCATG
GTCCATCTATTGAAGAACCTTAAGCTCGTGTACAAGTCGAAGATTAATGATGCCATTTTCTATGCATGTGCCAAAGCTTATAATGTTATTGATTTTGAACATCAGATGCG
ACAACTGGAGTTAAGTGCTCGAGGTATTCGTAATGAATTGCTAACCATAGGTTTACCAAAGTGGTCTCATGCCTTTGCACCTCGTAGTCGTTTTACGATGATGACAACTA
ACATTTCTGAAAGTTTGAATACTGTCATTAAGAAAGCACGAGATTTGCCTATTGCATCCATGTTGGAGGTTAATCCCGTAGACAATATGGAATATCAAGTCATAGATGGA
ACAAGACAATACAATGTGAATTTACCCAGAAGGAGTTGCAGTTGTAGAATGTGGGACACATTGCAGATTCCTTGCTCTCATGCATGTCGATCGGTTGGCAGGCCGAAGAA
GAATAGAATTCCTTCACAAATGGAGTTCAAGAGACGTGTTAAATGCGGTCGTTGTGGCAAATCTGGTCATAATCGGAAGTCTTGCAGATTCGCTCTTACAAAATAA
Protein sequence
MQFYRILLSVVNQHILENISFIRSRDIPMLYSGSWNQSYNYVSYKVSEISLHYGMTFVELQQRIIKKLGTVGHVDSPDIYVCIGSQSFIKKDAEINSDTDVDWLVSILTY
NMDEYCALIIDSKNTLSVMLDNIGSTNKTPIVEGRVYRNIDVTKISSNFTIQIDDIFIGKGVLHNALRDIAIRDSFQYKTSLINIDGSSLPTPKEIKNYKAWRARELAMN
EIRGSPEDSYKMLPSFAHMLKIKNPGSITELQLDEDGRFKFFFRSFAASINGWKHCRPVISNDNSWLWFFTHLRKIIVQQNDLVIVSDRHKSIGKAVQLVFPDAFHCICM
VHLLKNLKLVYKSKINDAIFYACAKAYNVIDFEHQMRQLELSARGIRNELLTIGLPKWSHAFAPRSRFTMMTTNISESLNTVIKKARDLPIASMLEVNPVDNMEYQVIDG
TRQYNVNLPRRSCSCRMWDTLQIPCSHACRSVGRPKKNRIPSQMEFKRRVKCGRCGKSGHNRKSCRFALTK
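
The CDS and protein sequences above can be checked against each other by translating the CDS. The sketch below is one way to do that, assuming the standard nuclear genetic code (NCBI translation table 1); the function name `translate` is a hypothetical helper, and only the first and last few codons of the CDS are used here for brevity.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 15 nt of the CDS sequence listed in this record.
print(translate("ATGCAATTTTACCGTATATTACTTTCTGTA"))  # MQFYRILLSV
print(translate("GCTCTTACAAAATAA"))                 # ALTK
```

Both fragments agree with the start (`MQFYRILLSV`) and end (`ALTK`) of the protein sequence listed above; passing the full CDS to `translate` would allow the whole protein to be compared in the same way.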