CuGenDBv2

Tan0001057 (gene) of Snake gourd v1 genome

Gene ID: Tan0001057
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Retrotransposon protein
Genome location: LG06:19348816..19351336
RNA-Seq Expression: Tan0001057
Synteny: Tan0001057
Gene Ontology terms: NA
InterPro domains: IPR024752 - Myb/SANT-like domain
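
The genome location field packs a sequence ID and a coordinate range into one string. A minimal sketch of how it could be split and measured follows. The helper names `parse_location` and `span_length` are hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which the page itself does not state.

```python
import re


def parse_location(loc: str) -> tuple[str, int, int]:
    """Split a 'SEQID:START..END' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the span, assuming 1-based inclusive coordinates (an assumption)."""
    return abs(end - start) + 1


seqid, start, end = parse_location("LG06:19348816..19351336")
print(seqid, start, end, span_length(start, end))
```

Under that coordinate assumption the gene spans 2521 bp on LG06, which is longer than the CDS listed further down the record.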


Homology
GenBank top hits (e value, %identity, alignment)
ADN34114.1 retrotransposon protein [Cucumis melo subsp. melo]; e value 3.2e-86; identity 31.63%
Query:  MDPRELVAILTAFTSTQHQLLLILEIYMNDHRRIEYQSPLLRHQIRQLACFRLIHESDLICRESTRMDRRCFAILCTLLKTSGNLIGTKVVDVAEMVAMF
        MD  EL +I+ AF ++Q QLLL+LE+  ND +RI +     RH+IRQLA FR+IH                         T   L  T+VVDV EMVAMF
Subjt:  MDPRELVAILTAFTSTQHQLLLILEIYMNDHRRIEYQSPLLRHQIRQLACFRLIHESDLICRESTRMDRRCFAILCTLLKTSGNLIGTKVVDVAEMVAMF

Query:  LHIIAHDAKNRMIRRQFVRSGETVSRHFRSILNALLQLHDVLLKKPEPITNSSTDGKWRWFE--------------------------------------
        LHI+AHD K+R+I+R+F+RSGET+SRHF  +L A+++LH+ LLKKP+P+ N  TD +WRWFE                                      
Subjt:  LHIIAHDAKNRMIRRQFVRSGETVSRHFRSILNALLQLHDVLLKKPEPITNSSTDGKWRWFE--------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------VQIISKSNK------------MLGTSRAPKHSWTKHDDAKLV
                                                                  +  I  SN+            M  +SR PKH+WTK ++A LV
Subjt:  ----------------------------------------------------------VQIISKSNK------------MLGTSRAPKHSWTKHDDAKLV

Query:  ECLLSLAQSGGWRSNNDTFRPGYLMKLQHMMVERIPGTPIQSSTIDCRIRTLKKQYRKIAEMLGTGCSGFGWNDEFKCVEVEKETFDGWVKAHPGAKGLR
        ECL+ L  +GGWRS+N TFRPGYL +L  MM  +IPG+ I +STID RI+ +K+ +  +AEM G  CSGFGWNDE KC+  EKE FD W  +HP AKGL 
Subjt:  ECLLSLAQSGGWRSNNDTFRPGYLMKLQHMMVERIPGTPIQSSTIDCRIRTLKKQYRKIAEMLGTGCSGFGWNDEFKCVEVEKETFDGWVKAHPGAKGLR

Query:  NKPFPHFDDLAIVFGKDRATGATAETPTDMASNVGNHMDEDINLDDSQEYYVPIPPFSTQGMDTPHDEMHETPTSRNNTVGASTPKGSKRKRSTYQSEMV
        NK F H+D+L+ VFGKDRATG  AE+  D+ SN  +    D    D+       PP  + G++   D++ ET T+R +    +   GSKRKR  + ++  
Subjt:  NKPFPHFDDLAIVFGKDRATGATAETPTDMASNVGNHMDEDINLDDSQEYYVPIPPFSTQGMDTPHDEMHETPTSRNNTVGASTPKGSKRKRSTYQSEMV

Query:  VMVRDAMDVNASHLKMIAEWPKEKHATEVEVRSQIVDHMYRIPELDIWAKAQVMNILFRGAQASESFLSVPPELQLEYCQILIEKN
         +VR A++     L  IAEWP  +     + R +IV H+  IPEL +  + ++M IL R     ++FL VP  ++  YC +++++N
Subjt:  VMVRDAMDVNASHLKMIAEWPKEKHATEVEVRSQIVDHMYRIPELDIWAKAQVMNILFRGAQASESFLSVPPELQLEYCQILIEKN

KAA0034843.1 retrotransposon protein [Cucumis melo var. makuwa]; e value 3.2e-102; identity 36.43%
Query:  MDPRELVAILTAFTSTQHQLLLILEIYMNDHRRIEYQSPLLRHQIRQLACFRLIHESDLICRESTRMDRRCFAILCTLLKTSGNLIGTKVVDVAEMVAMF
        MD  EL +I+ AF ++Q QLLL+LE+  ND +RI +     RH+IRQLA FR+IH SDL+CR+STRMDRRCFAILC LL+T   L  T+VVDV EMVAMF
Subjt:  MDPRELVAILTAFTSTQHQLLLILEIYMNDHRRIEYQSPLLRHQIRQLACFRLIHESDLICRESTRMDRRCFAILCTLLKTSGNLIGTKVVDVAEMVAMF

Query:  LHIIAHDAKNRMIRRQFVRSGETVSRHFRSILNALLQLHDVLLKKPEPITNSSTDGKWRWFE--------------------------------------
        LHI+AHD KNR+I+R+F+RSGET+SRHF  +L A+++LHD LLKKP+P+ N  TD +WRWFE                                      
Subjt:  LHIIAHDAKNRMIRRQFVRSGETVSRHFRSILNALLQLHDVLLKKPEPITNSSTDGKWRWFE--------------------------------------

Query:  -----------------------VQIISKSNK--------------------------------------------------------------------
                                  +S+ N+                                                                    
Subjt:  -----------------------VQIISKSNK--------------------------------------------------------------------

Query:  ----------------------------------------------MLGTSRAPKHSWTKHDDAKLVECLLSLAQSGGWRSNNDTFRPGYLMKLQHMMVE
                                                      M  +SR PKH+WTK ++A LVE    L  +GGWRS+N TFRPGYL +L  MM  
Subjt:  ----------------------------------------------MLGTSRAPKHSWTKHDDAKLVECLLSLAQSGGWRSNNDTFRPGYLMKLQHMMVE

Query:  RIPGTPIQSSTIDCRIRTLKKQYRKIAEMLGTGCSGFGWNDEFKCVEVEKETFDGWVKAHPGAKGLRNKPFPHFDDLAIVFGKDRATGATAETPTDMASN
        +IPG  I +STID RI+ +K+ +  +AEM G  CSGFGWNDE KC+  EKE FD W  +HP AKGL NK F H+D+L+ VFGKDRATG  AE+  D+ SN
Subjt:  RIPGTPIQSSTIDCRIRTLKKQYRKIAEMLGTGCSGFGWNDEFKCVEVEKETFDGWVKAHPGAKGLRNKPFPHFDDLAIVFGKDRATGATAETPTDMASN

Query:  VGNHMDEDINLDDSQEYYVPIPPFS---TQGMDTPHDEMHETPTSRNNTVGASTPKGSKRKRSTYQSEMVVMVRDAMDVNASHLKMIAEWPKEKHATEVE
          N    D    D+    VP   FS   + G++   D++ ET T+R +    +   GSKRKR  + ++   +VR A++     L  IAEWP  +     +
Subjt:  VGNHMDEDINLDDSQEYYVPIPPFS---TQGMDTPHDEMHETPTSRNNTVGASTPKGSKRKRSTYQSEMVVMVRDAMDVNASHLKMIAEWPKEKHATEVE

Query:  VRSQIVDHMYRIPELDIWAKAQVMNILFRGAQASESFLSVPPELQLEYCQILIEKN
         R +IV  +  IPEL +  + ++M IL R     ++FL VP  ++  YC I++++N
Subjt:  VRSQIVDHMYRIPELDIWAKAQVMNILFRGAQASESFLSVPPELQLEYCQILIEKN

TYK07921.1 hypothetical protein E5676_scaffold265G00330 [Cucumis melo var. makuwa]; e value 2.4e-78; identity 40%
Query:  MDRRCFAILCTLLKTSGNLIGTKVVDVAEMVAMFLHIIAHDAKNRMIRRQFVRSGETVSRHFRSILNALLQLHDVLLKKPEPITNS-STDG---------
        MDRRCF ILCT+L+T G L  T+ VDV EMV +FLHI+AHD KNR+ RR   RSGETVSRHF ++LNA+L+LH++LLK+P+P+T+S + DG         
Subjt:  MDRRCFAILCTLLKTSGNLIGTKVVDVAEMVAMFLHIIAHDAKNRMIRRQFVRSGETVSRHFRSILNALLQLHDVLLKKPEPITNS-STDG---------

Query:  ----KWRW---------------FEVQIISKSN---KMLGT-SRAPKHSWTKHDDAKLVECLLSLAQSGGWRSNNDTFRPGYLMKLQHMMVERIPGTPIQ
            ++R+                 +Q++  S+   +M  T S+A KH WT  +D  LVECLL L + GGWR++N TF+ GYL                 
Subjt:  ----KWRW---------------FEVQIISKSN---KMLGT-SRAPKHSWTKHDDAKLVECLLSLAQSGGWRSNNDTFRPGYLMKLQHMMVERIPGTPIQ

Query:  SSTIDCRIRTLKKQYRKIAEMLGTGCSGFGWNDEFKCVEVEKETFDGWVKAHPGAKGLRNKPFPHFDDLAIVFGKDRATGATAETPTDMASNVGNHMDE-
                    KQY  IAEM+G  CSGFGWN+  KC+EVEK  FD WVK HP A+GL NKPFP+F DL +VFG+DRATG   +TP +M+S      +E 
Subjt:  SSTIDCRIRTLKKQYRKIAEMLGTGCSGFGWNDEFKCVEVEKETFDGWVKAHPGAKGLRNKPFPHFDDLAIVFGKDRATGATAETPTDMASNVGNHMDE-

Query:  --DINLDDSQEYYVPIPPFSTQGMDTPH-DEMHETPTSRNNTVGASTPKGSKRKRSTYQSEMVVMVRDAMDVNASHLKMIAEWPKEKHATEVEVRSQIVD
          DINL+D   + +P P     G++ P  ++M  TPTS  +  G+S P    +KR +Y  +++   R +M   +  +  IA W +EK   E  +  ++  
Subjt:  --DINLDDSQEYYVPIPPFSTQGMDTPH-DEMHETPTSRNNTVGASTPKGSKRKRSTYQSEMVVMVRDAMDVNASHLKMIAEWPKEKHATEVEVRSQIVD

Query:  HMYRIPELDIWAKAQVMNILFRGAQASESFLSVPPELQLE
         +  IP +D+     V   L        +FL  P    +E
Subjt:  HMYRIPELDIWAKAQVMNILFRGAQASESFLSVPPELQLE

TYK26842.1 uncharacterized protein E5676_scaffold260G00340 [Cucumis melo var. makuwa]; e value 3.3e-75; identity 43.86%
Query:  LLKTSGNLIGTKVVDVAEMVAMFLHIIAHDAKNRMIRRQFVRSGETVSRHFRSILNALLQLHDVLLKKPEPITNSSTDGKWRWFEVQIISKSNKMLGTSR
        +L+T G L  T+ VDV EMV +FLHI+AHD KNR+ RR F RSGETVSRHF  +LN +L+LH++LLK+P+ +T+S +  KWRWF++  I+        S+
Subjt:  LLKTSGNLIGTKVVDVAEMVAMFLHIIAHDAKNRMIRRQFVRSGETVSRHFRSILNALLQLHDVLLKKPEPITNSSTDGKWRWFEVQIISKSNKMLGTSR

Query:  APKHSWTKHDDAKLVECLLSLAQSGGWRSNNDTFRPGYLMKLQHMMVERIPGTPIQ-SSTIDCRIRTLKKQYRKIAEMLGTGCSGFGWNDEFKCVEVEKE
          KH WT  +D  LVECLL L + G WR +N TF+PGYL+++Q +M E+I  + IQ +  ++  ++ LKKQY  IAEM+G  CSGF WN E KC+E EK 
Subjt:  APKHSWTKHDDAKLVECLLSLAQSGGWRSNNDTFRPGYLMKLQHMMVERIPGTPIQ-SSTIDCRIRTLKKQYRKIAEMLGTGCSGFGWNDEFKCVEVEKE

Query:  TFDGWVKAHPGAKGLRNKPFPHFDDLAIVFGKDRATGATAETPTDMASNVGNHMDEDINLDDSQEYYVPIPPFSTQGMDTPH-DEMHETPTSRNNTVGAS
          + WVK H  A+ L NKPFP+F DL IVFG+DRATG   +TP +M S      +ED  + + +++ +P P     G++ P  ++M  TPTS  +  G+S
Subjt:  TFDGWVKAHPGAKGLRNKPFPHFDDLAIVFGKDRATGATAETPTDMASNVGNHMDEDINLDDSQEYYVPIPPFSTQGMDTPH-DEMHETPTSRNNTVGAS

Query:  TPKGSKRKRSTYQSEMVVMVR--DAMDVNASHLKMIAEWPKE
         P    +KR +Y  +++   R  +++  + + L    ++P E
Subjt:  TPKGSKRKRSTYQSEMVVMVR--DAMDVNASHLKMIAEWPKE

XP_008441954.1 PREDICTED: uncharacterized protein LOC103485953 [Cucumis melo]; e value 1.4e-81; identity 50.98%
Query:  MLGTSRAPKHSWTKHDDAKLVECLLSLAQSGGWRSNNDTFRPGYLMKLQHMMVERIPGTPIQ-SSTIDCRIRTLKKQYRKIAEMLGTGCSGFGWNDEFKC
        M   SRAPKH+WTK ++ K VECL+ L  SGGWRS+N TF+PGYL +LQ MM E++PGT IQ SSTIDC +++LKK Y  IAEM G  CSGFGWN+EF+C
Subjt:  MLGTSRAPKHSWTKHDDAKLVECLLSLAQSGGWRSNNDTFRPGYLMKLQHMMVERIPGTPIQ-SSTIDCRIRTLKKQYRKIAEMLGTGCSGFGWNDEFKC

Query:  VEVEKETFDGWVKAHPGAKGLRNKPFPHFDDLAIVFGKDRATGATAETPTDMASNVGNHMDEDINLDDSQEYYVPIPPFSTQGMDTPHDEMHETPTSRNN
        +  E++ FD W+K+HP AKGL +K FP++DDL+ VFGKDRATGA +ET  ++ SNV N  ++ I L DS +    IP   +QG+    DEM      + +
Subjt:  VEVEKETFDGWVKAHPGAKGLRNKPFPHFDDLAIVFGKDRATGATAETPTDMASNVGNHMDEDINLDDSQEYYVPIPPFSTQGMDTPHDEMHETPTSRNN

Query:  TVGASTPKGSKRKRSTYQSEMVVMVRDAMDVNASHLKMIAEWPKEKHATEVEVRSQIVDHMYRIPELDIWAKAQVMNILFRGAQASESFLSVPPELQLEY
            +    SKRKR + + E V ++R  M+     LK IA+WPKEK A EVE+R+Q+V  +  IP+L    +A++M ILFR  +A E FLS+P EL+LEY
Subjt:  TVGASTPKGSKRKRSTYQSEMVVMVRDAMDVNASHLKMIAEWPKEKHATEVEVRSQIVDHMYRIPELDIWAKAQVMNILFRGAQASESFLSVPPELQLEY

Query:  CQILIE
        C IL++
Subjt:  CQILIE
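
In the alignment blocks above, the unlabeled middle line follows the usual BLAST pairwise convention: a letter marks an identical residue, `+` marks a conservative substitution, and a blank marks a mismatch or gap. A rough sketch of counting identities from one segment follows. The function name `alignment_identity` is hypothetical, and the database may compute its reported %identity over a different denominator than a single displayed segment.

```python
def alignment_identity(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identical, positive, length) for one aligned segment.

    The midline is padded to the query length because trailing blanks
    are often lost when a page is copied as plain text.
    """
    midline = midline.ljust(len(query))
    identical = sum(ch.isalpha() for ch in midline)
    positive = identical + midline.count("+")
    return identical, positive, len(query)


# Last segment of the XP_008441954.1 alignment shown above.
ident, pos, length = alignment_identity("CQILIE", "C IL++")
print(f"{ident}/{length} identical, {pos}/{length} positive")
```

Summing the counts over every segment of a hit, then dividing by the total aligned length, gives a per-hit figure that can be compared with the listed identity value.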

TrEMBL top hits (e value, %identity, alignment)
A0A1S3B4L3 uncharacterized protein LOC103485953; e value 6.7e-82; identity 50.98%
Query:  MLGTSRAPKHSWTKHDDAKLVECLLSLAQSGGWRSNNDTFRPGYLMKLQHMMVERIPGTPIQ-SSTIDCRIRTLKKQYRKIAEMLGTGCSGFGWNDEFKC
        M   SRAPKH+WTK ++ K VECL+ L  SGGWRS+N TF+PGYL +LQ MM E++PGT IQ SSTIDC +++LKK Y  IAEM G  CSGFGWN+EF+C
Subjt:  MLGTSRAPKHSWTKHDDAKLVECLLSLAQSGGWRSNNDTFRPGYLMKLQHMMVERIPGTPIQ-SSTIDCRIRTLKKQYRKIAEMLGTGCSGFGWNDEFKC

Query:  VEVEKETFDGWVKAHPGAKGLRNKPFPHFDDLAIVFGKDRATGATAETPTDMASNVGNHMDEDINLDDSQEYYVPIPPFSTQGMDTPHDEMHETPTSRNN
        +  E++ FD W+K+HP AKGL +K FP++DDL+ VFGKDRATGA +ET  ++ SNV N  ++ I L DS +    IP   +QG+    DEM      + +
Subjt:  VEVEKETFDGWVKAHPGAKGLRNKPFPHFDDLAIVFGKDRATGATAETPTDMASNVGNHMDEDINLDDSQEYYVPIPPFSTQGMDTPHDEMHETPTSRNN

Query:  TVGASTPKGSKRKRSTYQSEMVVMVRDAMDVNASHLKMIAEWPKEKHATEVEVRSQIVDHMYRIPELDIWAKAQVMNILFRGAQASESFLSVPPELQLEY
            +    SKRKR + + E V ++R  M+     LK IA+WPKEK A EVE+R+Q+V  +  IP+L    +A++M ILFR  +A E FLS+P EL+LEY
Subjt:  TVGASTPKGSKRKRSTYQSEMVVMVRDAMDVNASHLKMIAEWPKEKHATEVEVRSQIVDHMYRIPELDIWAKAQVMNILFRGAQASESFLSVPPELQLEY

Query:  CQILIE
        C IL++
Subjt:  CQILIE

A0A5A7SWD8 Retrotransposon protein; e value 1.5e-102; identity 36.43%
Query:  MDPRELVAILTAFTSTQHQLLLILEIYMNDHRRIEYQSPLLRHQIRQLACFRLIHESDLICRESTRMDRRCFAILCTLLKTSGNLIGTKVVDVAEMVAMF
        MD  EL +I+ AF ++Q QLLL+LE+  ND +RI +     RH+IRQLA FR+IH SDL+CR+STRMDRRCFAILC LL+T   L  T+VVDV EMVAMF
Subjt:  MDPRELVAILTAFTSTQHQLLLILEIYMNDHRRIEYQSPLLRHQIRQLACFRLIHESDLICRESTRMDRRCFAILCTLLKTSGNLIGTKVVDVAEMVAMF

Query:  LHIIAHDAKNRMIRRQFVRSGETVSRHFRSILNALLQLHDVLLKKPEPITNSSTDGKWRWFE--------------------------------------
        LHI+AHD KNR+I+R+F+RSGET+SRHF  +L A+++LHD LLKKP+P+ N  TD +WRWFE                                      
Subjt:  LHIIAHDAKNRMIRRQFVRSGETVSRHFRSILNALLQLHDVLLKKPEPITNSSTDGKWRWFE--------------------------------------

Query:  -----------------------VQIISKSNK--------------------------------------------------------------------
                                  +S+ N+                                                                    
Subjt:  -----------------------VQIISKSNK--------------------------------------------------------------------

Query:  ----------------------------------------------MLGTSRAPKHSWTKHDDAKLVECLLSLAQSGGWRSNNDTFRPGYLMKLQHMMVE
                                                      M  +SR PKH+WTK ++A LVE    L  +GGWRS+N TFRPGYL +L  MM  
Subjt:  ----------------------------------------------MLGTSRAPKHSWTKHDDAKLVECLLSLAQSGGWRSNNDTFRPGYLMKLQHMMVE

Query:  RIPGTPIQSSTIDCRIRTLKKQYRKIAEMLGTGCSGFGWNDEFKCVEVEKETFDGWVKAHPGAKGLRNKPFPHFDDLAIVFGKDRATGATAETPTDMASN
        +IPG  I +STID RI+ +K+ +  +AEM G  CSGFGWNDE KC+  EKE FD W  +HP AKGL NK F H+D+L+ VFGKDRATG  AE+  D+ SN
Subjt:  RIPGTPIQSSTIDCRIRTLKKQYRKIAEMLGTGCSGFGWNDEFKCVEVEKETFDGWVKAHPGAKGLRNKPFPHFDDLAIVFGKDRATGATAETPTDMASN

Query:  VGNHMDEDINLDDSQEYYVPIPPFS---TQGMDTPHDEMHETPTSRNNTVGASTPKGSKRKRSTYQSEMVVMVRDAMDVNASHLKMIAEWPKEKHATEVE
          N    D    D+    VP   FS   + G++   D++ ET T+R +    +   GSKRKR  + ++   +VR A++     L  IAEWP  +     +
Subjt:  VGNHMDEDINLDDSQEYYVPIPPFS---TQGMDTPHDEMHETPTSRNNTVGASTPKGSKRKRSTYQSEMVVMVRDAMDVNASHLKMIAEWPKEKHATEVE

Query:  VRSQIVDHMYRIPELDIWAKAQVMNILFRGAQASESFLSVPPELQLEYCQILIEKN
         R +IV  +  IPEL +  + ++M IL R     ++FL VP  ++  YC I++++N
Subjt:  VRSQIVDHMYRIPELDIWAKAQVMNILFRGAQASESFLSVPPELQLEYCQILIEKN

A0A5A7U0H7 Retrotransposon protein; e value 6.7e-82; identity 50.98%
Query:  MLGTSRAPKHSWTKHDDAKLVECLLSLAQSGGWRSNNDTFRPGYLMKLQHMMVERIPGTPIQ-SSTIDCRIRTLKKQYRKIAEMLGTGCSGFGWNDEFKC
        M   SRAPKH+WTK ++ K VECL+ L  SGGWRS+N TF+PGYL +LQ MM E++PGT IQ SSTIDC +++LKK Y  IAEM G  CSGFGWN+EF+C
Subjt:  MLGTSRAPKHSWTKHDDAKLVECLLSLAQSGGWRSNNDTFRPGYLMKLQHMMVERIPGTPIQ-SSTIDCRIRTLKKQYRKIAEMLGTGCSGFGWNDEFKC

Query:  VEVEKETFDGWVKAHPGAKGLRNKPFPHFDDLAIVFGKDRATGATAETPTDMASNVGNHMDEDINLDDSQEYYVPIPPFSTQGMDTPHDEMHETPTSRNN
        +  E++ FD W+K+HP AKGL +K FP++DDL+ VFGKDRATGA +ET  ++ SNV N  ++ I L DS +    IP   +QG+    DEM      + +
Subjt:  VEVEKETFDGWVKAHPGAKGLRNKPFPHFDDLAIVFGKDRATGATAETPTDMASNVGNHMDEDINLDDSQEYYVPIPPFSTQGMDTPHDEMHETPTSRNN

Query:  TVGASTPKGSKRKRSTYQSEMVVMVRDAMDVNASHLKMIAEWPKEKHATEVEVRSQIVDHMYRIPELDIWAKAQVMNILFRGAQASESFLSVPPELQLEY
            +    SKRKR + + E V ++R  M+     LK IA+WPKEK A EVE+R+Q+V  +  IP+L    +A++M ILFR  +A E FLS+P EL+LEY
Subjt:  TVGASTPKGSKRKRSTYQSEMVVMVRDAMDVNASHLKMIAEWPKEKHATEVEVRSQIVDHMYRIPELDIWAKAQVMNILFRGAQASESFLSVPPELQLEY

Query:  CQILIE
        C IL++
Subjt:  CQILIE

A0A5D3C7T4 Uncharacterized protein; e value 1.2e-78; identity 40%
Query:  MDRRCFAILCTLLKTSGNLIGTKVVDVAEMVAMFLHIIAHDAKNRMIRRQFVRSGETVSRHFRSILNALLQLHDVLLKKPEPITNS-STDG---------
        MDRRCF ILCT+L+T G L  T+ VDV EMV +FLHI+AHD KNR+ RR   RSGETVSRHF ++LNA+L+LH++LLK+P+P+T+S + DG         
Subjt:  MDRRCFAILCTLLKTSGNLIGTKVVDVAEMVAMFLHIIAHDAKNRMIRRQFVRSGETVSRHFRSILNALLQLHDVLLKKPEPITNS-STDG---------

Query:  ----KWRW---------------FEVQIISKSN---KMLGT-SRAPKHSWTKHDDAKLVECLLSLAQSGGWRSNNDTFRPGYLMKLQHMMVERIPGTPIQ
            ++R+                 +Q++  S+   +M  T S+A KH WT  +D  LVECLL L + GGWR++N TF+ GYL                 
Subjt:  ----KWRW---------------FEVQIISKSN---KMLGT-SRAPKHSWTKHDDAKLVECLLSLAQSGGWRSNNDTFRPGYLMKLQHMMVERIPGTPIQ

Query:  SSTIDCRIRTLKKQYRKIAEMLGTGCSGFGWNDEFKCVEVEKETFDGWVKAHPGAKGLRNKPFPHFDDLAIVFGKDRATGATAETPTDMASNVGNHMDE-
                    KQY  IAEM+G  CSGFGWN+  KC+EVEK  FD WVK HP A+GL NKPFP+F DL +VFG+DRATG   +TP +M+S      +E 
Subjt:  SSTIDCRIRTLKKQYRKIAEMLGTGCSGFGWNDEFKCVEVEKETFDGWVKAHPGAKGLRNKPFPHFDDLAIVFGKDRATGATAETPTDMASNVGNHMDE-

Query:  --DINLDDSQEYYVPIPPFSTQGMDTPH-DEMHETPTSRNNTVGASTPKGSKRKRSTYQSEMVVMVRDAMDVNASHLKMIAEWPKEKHATEVEVRSQIVD
          DINL+D   + +P P     G++ P  ++M  TPTS  +  G+S P    +KR +Y  +++   R +M   +  +  IA W +EK   E  +  ++  
Subjt:  --DINLDDSQEYYVPIPPFSTQGMDTPH-DEMHETPTSRNNTVGASTPKGSKRKRSTYQSEMVVMVRDAMDVNASHLKMIAEWPKEKHATEVEVRSQIVD

Query:  HMYRIPELDIWAKAQVMNILFRGAQASESFLSVPPELQLE
         +  IP +D+     V   L        +FL  P    +E
Subjt:  HMYRIPELDIWAKAQVMNILFRGAQASESFLSVPPELQLE

E5GCB5 Retrotransposon protein; e value 1.5e-86; identity 31.63%
Query:  MDPRELVAILTAFTSTQHQLLLILEIYMNDHRRIEYQSPLLRHQIRQLACFRLIHESDLICRESTRMDRRCFAILCTLLKTSGNLIGTKVVDVAEMVAMF
        MD  EL +I+ AF ++Q QLLL+LE+  ND +RI +     RH+IRQLA FR+IH                         T   L  T+VVDV EMVAMF
Subjt:  MDPRELVAILTAFTSTQHQLLLILEIYMNDHRRIEYQSPLLRHQIRQLACFRLIHESDLICRESTRMDRRCFAILCTLLKTSGNLIGTKVVDVAEMVAMF

Query:  LHIIAHDAKNRMIRRQFVRSGETVSRHFRSILNALLQLHDVLLKKPEPITNSSTDGKWRWFE--------------------------------------
        LHI+AHD K+R+I+R+F+RSGET+SRHF  +L A+++LH+ LLKKP+P+ N  TD +WRWFE                                      
Subjt:  LHIIAHDAKNRMIRRQFVRSGETVSRHFRSILNALLQLHDVLLKKPEPITNSSTDGKWRWFE--------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------VQIISKSNK------------MLGTSRAPKHSWTKHDDAKLV
                                                                  +  I  SN+            M  +SR PKH+WTK ++A LV
Subjt:  ----------------------------------------------------------VQIISKSNK------------MLGTSRAPKHSWTKHDDAKLV

Query:  ECLLSLAQSGGWRSNNDTFRPGYLMKLQHMMVERIPGTPIQSSTIDCRIRTLKKQYRKIAEMLGTGCSGFGWNDEFKCVEVEKETFDGWVKAHPGAKGLR
        ECL+ L  +GGWRS+N TFRPGYL +L  MM  +IPG+ I +STID RI+ +K+ +  +AEM G  CSGFGWNDE KC+  EKE FD W  +HP AKGL 
Subjt:  ECLLSLAQSGGWRSNNDTFRPGYLMKLQHMMVERIPGTPIQSSTIDCRIRTLKKQYRKIAEMLGTGCSGFGWNDEFKCVEVEKETFDGWVKAHPGAKGLR

Query:  NKPFPHFDDLAIVFGKDRATGATAETPTDMASNVGNHMDEDINLDDSQEYYVPIPPFSTQGMDTPHDEMHETPTSRNNTVGASTPKGSKRKRSTYQSEMV
        NK F H+D+L+ VFGKDRATG  AE+  D+ SN  +    D    D+       PP  + G++   D++ ET T+R +    +   GSKRKR  + ++  
Subjt:  NKPFPHFDDLAIVFGKDRATGATAETPTDMASNVGNHMDEDINLDDSQEYYVPIPPFSTQGMDTPHDEMHETPTSRNNTVGASTPKGSKRKRSTYQSEMV

Query:  VMVRDAMDVNASHLKMIAEWPKEKHATEVEVRSQIVDHMYRIPELDIWAKAQVMNILFRGAQASESFLSVPPELQLEYCQILIEKN
         +VR A++     L  IAEWP  +     + R +IV H+  IPEL +  + ++M IL R     ++FL VP  ++  YC +++++N
Subjt:  VMVRDAMDVNASHLKMIAEWPKEKHATEVEVRSQIVDHMYRIPELDIWAKAQVMNILFRGAQASESFLSVPPELQLEYCQILIEKN

SwissProt top hits (e value, %identity, alignment)
No hits found

Arabidopsis top hits (e value, %identity, alignment)
AT1G30140.1 unknown protein; e value 7.5e-09; identity 26.48%
Query:  GTSRAPKHSWTKHDDAKLVECLLSLAQSGGWRSNNDTF-RPGYLMKLQHMMVERIPGTPIQSSTIDCRIRTLKKQYRKIAEMLGTGCSGFGWNDEFKCVE
        G  + P + WT  +   L+E +        WR ++    +     KL   + +R+ G          R++ LK  Y+   + L    SGFGW+ E K   
Subjt:  GTSRAPKHSWTKHDDAKLVECLLSLAQSGGWRSNNDTF-RPGYLMKLQHMMVERIPGTPIQSSTIDCRIRTLKKQYRKIAEMLGTGCSGFGWNDEFKCVE

Query:  VEKETFDGWVKAHPGAKGLRNKPFPHFDDLAIVFGKDRATGATAETPTDMAS----NVGNHMD--EDINLDDSQEYYVPIPPFSTQGMDTPHDEMHETPT
           E +  ++KAHP  K ++ +   HF+DL I+FG   ATG+ A   +D        VG      E +N D++ E    +  FS Q   +   E   +P 
Subjt:  VEKETFDGWVKAHPGAKGLRNKPFPHFDDLAIVFGKDRATGATAETPTDMAS----NVGNHMD--EDINLDDSQEYYVPIPPFSTQGMDTPHDEMHETPT

Query:  SRNNTVGASTPKGSKRKRS
        + + T    + K   RKR+
Subjt:  SRNNTVGASTPKGSKRKRS

AT2G24960.1 unknown protein; e value 1.0e-05; identity 27.66%
Query:  GTPIQSSTIDCRIRTLKKQYRKIAEMLGTGCSGFGWNDEFKCVEVEKETFDGWVKAHPGAKGLRNKPFPHFDDLAIVFGKDRATGATAETPTDM
        G+      +  R   L KQY  +  +L  G  GF W+   + V  +   +  ++KAHP A+  + KP  +F DL +++G   A G  + +  D+
Subjt:  GTPIQSSTIDCRIRTLKKQYRKIAEMLGTGCSGFGWNDEFKCVEVEKETFDGWVKAHPGAKGLRNKPFPHFDDLAIVFGKDRATGATAETPTDM

AT2G24960.2 unknown protein; e value 2.2e-05; identity 25.88%
Query:  GTPIQSSTIDCRIRTLKKQYRKIAEMLGTGCSGFGWNDEFKCVEVEKETFDGWVKAHPGAKGLRNKPFPHFDDLAIVFGKDRATG
        G+      +  R + L++ Y  I  +L    +GF W+     V  + + ++ +++AHP A+  R K  P + +L  +FGK+ + G
Subjt:  GTPIQSSTIDCRIRTLKKQYRKIAEMLGTGCSGFGWNDEFKCVEVEKETFDGWVKAHPGAKGLRNKPFPHFDDLAIVFGKDRATG

AT5G41980.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912); e value 5.5e-12; identity 35.58%
Query:  FRLIHESDLICRESTRMDRRCFAILCTLLKTSGNLIGTKVVDVAEMVAMFLHIIAHDAKNRMIRRQFVRSGETVSRHFRSILNALLQLHDVLLKKPEPIT
        +++++  +  C E+ RMD+  F  LC LL+T G L  T  + +   +A+FL II H+ + R ++  F  SGET+SRHF ++LNA++ +        +P +
Subjt:  FRLIHESDLICRESTRMDRRCFAILCTLLKTSGNLIGTKVVDVAEMVAMFLHIIAHDAKNRMIRRQFVRSGETVSRHFRSILNALLQLHDVLLKKPEPIT

Query:  NSST
        NS T
Subjt:  NSST


Sequences
CDS sequence
ATGGACCCGCGAGAACTAGTTGCCATACTGACAGCTTTTACCTCAACCCAACATCAATTACTTCTGATATTAGAGATATATATGAATGACCATAGAAGAATAGAATACCA
ATCACCCTTACTCAGGCATCAAATTAGGCAGCTAGCATGTTTCCGACTCATCCACGAGTCCGACCTAATATGTCGTGAGAGTACAAGGATGGACAGAAGATGTTTCGCCA
TTCTTTGTACCCTACTTAAGACATCTGGCAATTTAATAGGGACAAAAGTTGTAGACGTAGCGGAAATGGTAGCCATGTTCTTACACATCATAGCACATGATGCTAAAAAT
CGAATGATTCGTAGACAATTTGTTAGATCTGGTGAGACCGTTTCAAGACATTTTAGATCCATCCTGAATGCACTTTTGCAACTACATGATGTGCTATTGAAGAAACCAGA
ACCCATCACTAATTCCTCAACGGATGGGAAATGGAGATGGTTTGAGGTACAAATCATTTCTAAATCCAATAAAATGTTAGGTACATCGAGAGCTCCTAAGCACTCGTGGA
CTAAGCATGATGATGCGAAATTGGTGGAGTGTCTTTTGTCATTAGCTCAATCTGGTGGTTGGAGATCGAACAATGACACCTTTCGCCCTGGGTACCTGATGAAACTCCAA
CACATGATGGTGGAGAGGATACCGGGCACTCCAATTCAATCATCCACCATAGATTGCAGGATACGAACTTTAAAAAAACAGTATCGTAAGATTGCAGAGATGCTAGGGAC
AGGATGCAGTGGTTTCGGTTGGAACGATGAATTCAAGTGTGTGGAGGTGGAGAAGGAGACGTTCGATGGTTGGGTAAAGGCTCACCCGGGTGCAAAGGGGTTGCGCAACA
AGCCATTTCCCCATTTTGATGATTTAGCCATTGTCTTTGGTAAAGATCGAGCTACAGGTGCAACTGCGGAAACTCCTACAGATATGGCATCGAATGTAGGAAACCACATG
GACGAAGATATTAACTTGGACGATTCTCAAGAATACTATGTGCCAATACCACCATTTTCAACTCAAGGAATGGACACGCCTCATGATGAGATGCATGAGACACCTACTAG
TCGGAACAACACAGTAGGTGCTTCTACACCTAAAGGTAGTAAAAGGAAGCGCTCCACTTATCAATCTGAGATGGTTGTTATGGTACGAGATGCTATGGATGTGAATGCAT
CTCACCTAAAGATGATTGCAGAGTGGCCCAAAGAAAAGCATGCGACGGAGGTAGAAGTACGGTCACAAATTGTGGACCACATGTATAGGATCCCAGAGTTGGATATTTGG
GCTAAAGCGCAAGTGATGAACATCCTCTTCCGTGGTGCCCAAGCATCAGAAAGTTTCTTATCTGTGCCTCCAGAGTTGCAGCTGGAATATTGTCAGATCTTGATCGAGAA
AAACACCTGA
mRNA sequence
ATGGACCCGCGAGAACTAGTTGCCATACTGACAGCTTTTACCTCAACCCAACATCAATTACTTCTGATATTAGAGATATATATGAATGACCATAGAAGAATAGAATACCA
ATCACCCTTACTCAGGCATCAAATTAGGCAGCTAGCATGTTTCCGACTCATCCACGAGTCCGACCTAATATGTCGTGAGAGTACAAGGATGGACAGAAGATGTTTCGCCA
TTCTTTGTACCCTACTTAAGACATCTGGCAATTTAATAGGGACAAAAGTTGTAGACGTAGCGGAAATGGTAGCCATGTTCTTACACATCATAGCACATGATGCTAAAAAT
CGAATGATTCGTAGACAATTTGTTAGATCTGGTGAGACCGTTTCAAGACATTTTAGATCCATCCTGAATGCACTTTTGCAACTACATGATGTGCTATTGAAGAAACCAGA
ACCCATCACTAATTCCTCAACGGATGGGAAATGGAGATGGTTTGAGGTACAAATCATTTCTAAATCCAATAAAATGTTAGGTACATCGAGAGCTCCTAAGCACTCGTGGA
CTAAGCATGATGATGCGAAATTGGTGGAGTGTCTTTTGTCATTAGCTCAATCTGGTGGTTGGAGATCGAACAATGACACCTTTCGCCCTGGGTACCTGATGAAACTCCAA
CACATGATGGTGGAGAGGATACCGGGCACTCCAATTCAATCATCCACCATAGATTGCAGGATACGAACTTTAAAAAAACAGTATCGTAAGATTGCAGAGATGCTAGGGAC
AGGATGCAGTGGTTTCGGTTGGAACGATGAATTCAAGTGTGTGGAGGTGGAGAAGGAGACGTTCGATGGTTGGGTAAAGGCTCACCCGGGTGCAAAGGGGTTGCGCAACA
AGCCATTTCCCCATTTTGATGATTTAGCCATTGTCTTTGGTAAAGATCGAGCTACAGGTGCAACTGCGGAAACTCCTACAGATATGGCATCGAATGTAGGAAACCACATG
GACGAAGATATTAACTTGGACGATTCTCAAGAATACTATGTGCCAATACCACCATTTTCAACTCAAGGAATGGACACGCCTCATGATGAGATGCATGAGACACCTACTAG
TCGGAACAACACAGTAGGTGCTTCTACACCTAAAGGTAGTAAAAGGAAGCGCTCCACTTATCAATCTGAGATGGTTGTTATGGTACGAGATGCTATGGATGTGAATGCAT
CTCACCTAAAGATGATTGCAGAGTGGCCCAAAGAAAAGCATGCGACGGAGGTAGAAGTACGGTCACAAATTGTGGACCACATGTATAGGATCCCAGAGTTGGATATTTGG
GCTAAAGCGCAAGTGATGAACATCCTCTTCCGTGGTGCCCAAGCATCAGAAAGTTTCTTATCTGTGCCTCCAGAGTTGCAGCTGGAATATTGTCAGATCTTGATCGAGAA
AAACACCTGA
Protein sequence
MDPRELVAILTAFTSTQHQLLLILEIYMNDHRRIEYQSPLLRHQIRQLACFRLIHESDLICRESTRMDRRCFAILCTLLKTSGNLIGTKVVDVAEMVAMFLHIIAHDAKN
RMIRRQFVRSGETVSRHFRSILNALLQLHDVLLKKPEPITNSSTDGKWRWFEVQIISKSNKMLGTSRAPKHSWTKHDDAKLVECLLSLAQSGGWRSNNDTFRPGYLMKLQ
HMMVERIPGTPIQSSTIDCRIRTLKKQYRKIAEMLGTGCSGFGWNDEFKCVEVEKETFDGWVKAHPGAKGLRNKPFPHFDDLAIVFGKDRATGATAETPTDMASNVGNHM
DEDINLDDSQEYYVPIPPFSTQGMDTPHDEMHETPTSRNNTVGASTPKGSKRKRSTYQSEMVVMVRDAMDVNASHLKMIAEWPKEKHATEVEVRSQIVDHMYRIPELDIW
AKAQVMNILFRGAQASESFLSVPPELQLEYCQILIEKNT
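
The CDS and protein sequences above can be cross-checked by translating the CDS with the standard genetic code. The following is a minimal sketch, not the database's own pipeline. The `translate` helper is hypothetical, it assumes the sequence starts in frame, and it stops at the first stop codon.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with
# bases T, C, A, G and the first position varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for unrecognized codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons and the closing codons of the CDS listed above.
print(translate("ATGGACCCGCGAGAACTAGTTGCC"))
print(translate("ATCGAGAAAAACACCTGA"))
```

Passing the full CDS text, line breaks included, to `translate` should reproduce the protein sequence listed in this record if the two are consistent.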