CuGenDBv2

Tan0018498 (gene) of Snake gourd v1 genome

Gene ID: Tan0018498
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Retrotransposon protein
Genome location: LG10:12353638..12356135
RNA-Seq Expression: Tan0018498
Synteny: Tan0018498
Gene Ontology terms: GO:0046872 - metal ion binding (molecular function)
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
KAA0034843.1 retrotransposon protein [Cucumis melo var. makuwa] (e value: 2.6e-63; % identity: 30.86)
Query:  ELFNILTIMTTSQRQMLVNLGILLNNYRRLEYRSSYDRHRIRQMYFFRLIYENDLCCRESTMMDRRTFAILCTMLKTTGGLVPTQYVDIEEMVAMFLHIL
        EL +I+     SQRQ+L+ L +L N+ +R+ +     RHRIRQ+ +FR+I+ +DL CR+ST MDRR FAILC +L+T  GL  T+ VD+EEMVAMFLHIL
Subjt:  ELFNILTIMTTSQRQMLVNLGILLNNYRRLEYRSSYDRHRIRQMYFFRLIYENDLCCRESTMMDRRTFAILCTMLKTTGGLVPTQYVDIEEMVAMFLHIL

Query:  AHDVKNRVIRRQFARSGETVSRHFNSVLNSVLQLHDVLLKKPEPITSTCTD-------------------------------------------------
        AHDVKNRVI+R+F RSGET+SRHFN VL +V++LHD LLKKP+P+ + CTD                                                 
Subjt:  AHDVKNRVIRRQFARSGETVSRHFNSVLNSVLQLHDVLLKKPEPITSTCTD-------------------------------------------------

Query:  ---------------------------------------------------------------------------------------------------A
                                                                                                           A
Subjt:  ---------------------------------------------------------------------------------------------------A

Query:  IMAGTSKH-----------------------------------------SKHTWTKVEDARLVESLVYLVH-NGWRSDNGTFRPGYVQHLQKMLAEKLPN
        I+ G S H                                          KHTWTK E+A LVE    LV+  GWRSDNGTFRPGY+  L +M+A K+P 
Subjt:  IMAGTSKH-----------------------------------------SKHTWTKVEDARLVESLVYLVH-NGWRSDNGTFRPGYVQHLQKMLAEKLPN

Query:  SSLELNTIDCK--------------------------------AENEVFDAWVKSHTNAKGMRNKPFPHYDDLAFVFGKDRATGMGAETPGEMASNAAQQ
         ++  +TID +                                AE EVFD W  SH  AKG+ NK F HYD+L++VFGKDRATG  AE+  ++ SN    
Subjt:  SSLELNTIDCK--------------------------------AENEVFDAWVKSHTNAKGMRNKPFPHYDDLAFVFGKDRATGMGAETPGEMASNAAQQ

Query:  MEEEIRLGSQDLFGTEQRPMENPCTADVGEEELSNTPTSRRNTSGTSSRCTGSKRKRSCFQTEMIDVVRTTMDIQTTHMQRLLSWQKDKYELEAARRKEV
         +      +  +  T+  PM +    ++  ++L  T T+R   S   +  +GSKRKR    T+  D+VRT ++     + R+  W   + +     R+E+
Subjt:  MEEEIRLGSQDLFGTEQRPMENPCTADVGEEELSNTPTSRRNTSGTSSRCTGSKRKRSCFQTEMIDVVRTTMDIQTTHMQRLLSWQKDKYELEAARRKEV

Query:  VDLLYHIEGLTEHDRVSLIDMLVTDIQKTDCFLQVPPQSRRTYCLRLL
        V  L  I  LT  DR  L+ +L+ ++     FL+VP   +  YC  +L
Subjt:  VDLLYHIEGLTEHDRVSLIDMLVTDIQKTDCFLQVPPQSRRTYCLRLL

TYK07921.1 hypothetical protein E5676_scaffold265G00330 [Cucumis melo var. makuwa] (e value: 9.0e-64; % identity: 38.37)
Query:  MDRRTFAILCTMLKTTGGLVPTQYVDIEEMVAMFLHILAHDVKNRVIRRQFARSGETVSRHFNSVLNSVLQLHDVLLKKPEPITSTCT------------
        MDRR F ILCTML+T GGL  TQYVD++EMV +FLHI+AHDVKNRV RR  ARSGETVSRHFN+VLN+VL+LH++LLK+P+P+T +C             
Subjt:  MDRRTFAILCTMLKTTGGLVPTQYVDIEEMVAMFLHILAHDVKNRVIRRQFARSGETVSRHFNSVLNSVLQLHDVLLKKPEPITSTCT------------

Query:  ----------------------------------DAIMAGT-SKHSKHTWTKVEDARLVESLVYLV-HNGWRSDNGTFRPGYVQH---LQKMLAEKLPNS
                                             MA T SK +KH WT +ED  LVE L+ LV   GWR+DNGTF+ GY++    + +M+       
Subjt:  ----------------------------------DAIMAGT-SKHSKHTWTKVEDARLVESLVYLV-HNGWRSDNGTFRPGYVQH---LQKMLAEKLPNS

Query:  SLELNTIDCKAENEVFDAWVKSHTNAKGMRNKPFPHYDDLAFVFGKDRATGMGAETPGEMASNAAQQMEE-EIRLGSQDLFGTEQRPMENPCTADVGEEE
                 + E  VFD WVK H NA+G+ NKPFP++ DL  VFG+DRATG   +TP EM+S  A+  EE ++ +  +D        +E P       E+
Subjt:  SLELNTIDCKAENEVFDAWVKSHTNAKGMRNKPFPHYDDLAFVFGKDRATGMGAETPGEMASNAAQQMEE-EIRLGSQDLFGTEQRPMENPCTADVGEEE

Query:  LSNTPTSRRNTSGTSSRCTGSKRKRSCFQTEMIDVVRTTMDIQTTHMQRLLSWQKDKYELEAARRKEVVDLLYHIEGLTEHDRVSLIDMLVTDIQKTDCF
        + +TPTS  + +G+S       +KR  +  +++D  R +M   +  + ++ +WQ++K E+E++  K +   L  I G+   D + + + L+ D      F
Subjt:  LSNTPTSRRNTSGTSSRCTGSKRKRSCFQTEMIDVVRTTMDIQTTHMQRLLSWQKDKYELEAARRKEVVDLLYHIEGLTEHDRVSLIDMLVTDIQKTDCF

Query:  LQVP
        L  P
Subjt:  LQVP

XP_038887234.1 uncharacterized protein LOC120077425 [Benincasa hispida] (e value: 3.4e-71; % identity: 50.16)
Query:  MAGTSKHSKHTWTKVEDARLVESLVYLVHNGWRSDNGTFRPGYVQHLQKMLAEKLPNSSLELNTIDCK------------------------------AE
        M G SK SKH W+KVEDARLVE+L+YLV  GWRSDNGTFRPGY+QHL+++L EK+P  +L  NTI+CK                               E
Subjt:  MAGTSKHSKHTWTKVEDARLVESLVYLVHNGWRSDNGTFRPGYVQHLQKMLAEKLPNSSLELNTIDCK------------------------------AE

Query:  NEVFDAWVKSHTNAKGMRNKPFPHYDDLAFVFGKDRATGMGAETPGEMASNAAQQMEEEIRLGSQDLFGTEQRPMENPCTADVGEEELSNTPTSRRNTSG
         E+FD WV+SH NAKGM  KPFPHYDDL+ VFGKDRA                            D    E R  E+P   D  +EE +   T R +   
Subjt:  NEVFDAWVKSHTNAKGMRNKPFPHYDDLAFVFGKDRATGMGAETPGEMASNAAQQMEEEIRLGSQDLFGTEQRPMENPCTADVGEEELSNTPTSRRNTSG

Query:  TSSRCTGSKRKRSCFQTEMIDVVRTTMDIQTTHMQRLLSWQKDKYELEAARRKEVVDLLYHIEGLTEHDRVSLIDMLVTDIQKTDCFLQVPPQSRRTYCL
         SSR  GSKRKRS FQ EMID+V++T+++Q+THM RL SWQ +KYELE    KEVV+ +Y+I+ L E+D+V+LID++VTDIQKTDCFL VP  +R+ YCL
Subjt:  TSSRCTGSKRKRSCFQTEMIDVVRTTMDIQTTHMQRLLSWQKDKYELEAARRKEVVDLLYHIEGLTEHDRVSLIDMLVTDIQKTDCFLQVPPQSRRTYCL

Query:  RLLGR
        RLLGR
Subjt:  RLLGR

XP_038895773.1 uncharacterized protein LOC120083935 [Benincasa hispida] (e value: 7.9e-60; % identity: 45)
Query:  KHSKHTWTKVEDARLVESLVYLVHNGWRSDNGTFRPGYVQHLQKMLAEKLPNSSLELNTIDCK------------------------------AENEVFD
        K SKH W+KVEDA+ VE+L+YLV  GWRSDNGTFR  Y+QHL+++  EK+   +L  NTI+CK                               E E+FD
Subjt:  KHSKHTWTKVEDARLVESLVYLVHNGWRSDNGTFRPGYVQHLQKMLAEKLPNSSLELNTIDCK------------------------------AENEVFD

Query:  AWVKSHTNAKGMRNKPFPHYDDLAFVFGKDRATGMGAETPGEMASNAAQQMEEEIRLGSQDLFGTEQRPMENPCTADVGEEELSNTPTSRRNTSGTSSRC
         WV+SH NAKGM NKPFPHYDDL+ VFGK +A G  +E P  M +NA ++ E+EIRLGSQD                       +TP             
Subjt:  AWVKSHTNAKGMRNKPFPHYDDLAFVFGKDRATGMGAETPGEMASNAAQQMEEEIRLGSQDLFGTEQRPMENPCTADVGEEELSNTPTSRRNTSGTSSRC

Query:  TGSKRKRSCFQTEMIDVVRTTMDIQTTHMQRLLSWQKDKYELEAARRKEVVDLLYHIEGLTEHDRVSLIDMLVTDIQKTDCFLQVPPQSRRTYCLRLLGR
                                ++THM RL SWQK+KYELE  RRKEVV+ +Y+I+GL E D+V+LID+LVTDIQKT+CFL VP  +R+ YCLRLLGR
Subjt:  TGSKRKRSCFQTEMIDVVRTTMDIQTTHMQRLLSWQKDKYELEAARRKEVVDLLYHIEGLTEHDRVSLIDMLVTDIQKTDCFLQVPPQSRRTYCLRLLGR

XP_038896380.1 uncharacterized protein LOC120084641 [Benincasa hispida] (e value: 1.0e-67; % identity: 48.85)
Query:  MAGTSKHSKHTWTKVEDARLVESLVYLVHNGWRSDNGTFRPGYVQHLQKMLAEKLPNSSLELNTIDCK------------------------------AE
        MAG+ K SKH W+KVED +LVE+L+YLV  GWRSDNGTFR GY+Q+L+++L EK+P  +L  NTI+CK                               E
Subjt:  MAGTSKHSKHTWTKVEDARLVESLVYLVHNGWRSDNGTFRPGYVQHLQKMLAEKLPNSSLELNTIDCK------------------------------AE

Query:  NEVFDAWVKSHTNAKGMRNKPFPHYDDLAFVFGKDRATGMGAETPGEMASNAAQQMEEEIRLGSQDLFGTEQRPMENPCTADVGEEELSNTPTSRRNTSG
         E+FD WV+SH NAKGM NK F HYDDL+ VFGKDRA      TP                         E    E+P   D  +EE +   T R +   
Subjt:  NEVFDAWVKSHTNAKGMRNKPFPHYDDLAFVFGKDRATGMGAETPGEMASNAAQQMEEEIRLGSQDLFGTEQRPMENPCTADVGEEELSNTPTSRRNTSG

Query:  TSSRCTGSKRKRSCFQTEMIDVVRTTMDIQTTHMQRLLSWQKDKYELEAARRKEVVDLLYHIEGLTEHDRVSLIDMLVTDIQKTDCFLQVPPQSRRTYCL
         SSR  GSKRKR  FQ EMID++R+T+++Q+THM RL SWQK+KYELE  RRKEVV+ +Y I+GL E D+V+ ID+LVTDIQKTDCFL VP  +R+ YCL
Subjt:  TSSRCTGSKRKRSCFQTEMIDVVRTTMDIQTTHMQRLLSWQKDKYELEAARRKEVVDLLYHIEGLTEHDRVSLIDMLVTDIQKTDCFLQVPPQSRRTYCL

Query:  RLLGR
         LL R
Subjt:  RLLGR

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SW62 Myb_DNA-bind_3 domain-containing protein (e value: 1.4e-54; % identity: 41.96)
Query:  MLKTTGGLVPTQYVDIEEMVAMFLHILAHDVKNRVIRRQFARSGETVSRHFNSVLNSVLQLHDVLLKKPEPITSTCTD------AIMAGTSKHSKHTWTK
        ML+T GGL  TQYVD+EEMV +FLHI+AHDVKNRV RR FARSGETVSRHFN VLN VL+LH++LLK+P+ +T +C+        + +  SK +KH WT 
Subjt:  MLKTTGGLVPTQYVDIEEMVAMFLHILAHDVKNRVIRRQFARSGETVSRHFNSVLNSVLQLHDVLLKKPEPITSTCTD------AIMAGTSKHSKHTWTK

Query:  VEDARLVESLVYLVHNG-WRSDNGTFRPGYVQHLQKMLAEKLPNSSLEL---------------NTID------------------CKAENEVFDAWVKS
        +ED  LVE L+ LV  G WR DNGTF+PGY+  +QK++ EK+  S++++                TI                    +AE  V + WVK 
Subjt:  VEDARLVESLVYLVHNG-WRSDNGTFRPGYVQHLQKMLAEKLPNSSLEL---------------NTID------------------CKAENEVFDAWVKS

Query:  HTNAKGMRNKPFPHYDDLAFVFGKDRATGMGAETPGEMASNAAQQMEE-EIRLGSQDLFGTEQRPMENPCTADVGEEELSNTPTSRRNTSGTSSRCTGSK
        H NA+ + NKPFP++ DL  VFG+DRATG   +TP EM S  A+  EE ++ +  +D        +E P       E++ +TPTS  + +G+        
Subjt:  HTNAKGMRNKPFPHYDDLAFVFGKDRATGMGAETPGEMASNAAQQMEE-EIRLGSQDLFGTEQRPMENPCTADVGEEELSNTPTSRRNTSGTSSRCTGSK

Query:  RKRSCFQTEMIDVVRTT
        +KR  +  +++D  R T
Subjt:  RKRSCFQTEMIDVVRTT

A0A5A7SWD8 Retrotransposon protein (e value: 1.3e-63; % identity: 30.86)
Query:  ELFNILTIMTTSQRQMLVNLGILLNNYRRLEYRSSYDRHRIRQMYFFRLIYENDLCCRESTMMDRRTFAILCTMLKTTGGLVPTQYVDIEEMVAMFLHIL
        EL +I+     SQRQ+L+ L +L N+ +R+ +     RHRIRQ+ +FR+I+ +DL CR+ST MDRR FAILC +L+T  GL  T+ VD+EEMVAMFLHIL
Subjt:  ELFNILTIMTTSQRQMLVNLGILLNNYRRLEYRSSYDRHRIRQMYFFRLIYENDLCCRESTMMDRRTFAILCTMLKTTGGLVPTQYVDIEEMVAMFLHIL

Query:  AHDVKNRVIRRQFARSGETVSRHFNSVLNSVLQLHDVLLKKPEPITSTCTD-------------------------------------------------
        AHDVKNRVI+R+F RSGET+SRHFN VL +V++LHD LLKKP+P+ + CTD                                                 
Subjt:  AHDVKNRVIRRQFARSGETVSRHFNSVLNSVLQLHDVLLKKPEPITSTCTD-------------------------------------------------

Query:  ---------------------------------------------------------------------------------------------------A
                                                                                                           A
Subjt:  ---------------------------------------------------------------------------------------------------A

Query:  IMAGTSKH-----------------------------------------SKHTWTKVEDARLVESLVYLVH-NGWRSDNGTFRPGYVQHLQKMLAEKLPN
        I+ G S H                                          KHTWTK E+A LVE    LV+  GWRSDNGTFRPGY+  L +M+A K+P 
Subjt:  IMAGTSKH-----------------------------------------SKHTWTKVEDARLVESLVYLVH-NGWRSDNGTFRPGYVQHLQKMLAEKLPN

Query:  SSLELNTIDCK--------------------------------AENEVFDAWVKSHTNAKGMRNKPFPHYDDLAFVFGKDRATGMGAETPGEMASNAAQQ
         ++  +TID +                                AE EVFD W  SH  AKG+ NK F HYD+L++VFGKDRATG  AE+  ++ SN    
Subjt:  SSLELNTIDCK--------------------------------AENEVFDAWVKSHTNAKGMRNKPFPHYDDLAFVFGKDRATGMGAETPGEMASNAAQQ

Query:  MEEEIRLGSQDLFGTEQRPMENPCTADVGEEELSNTPTSRRNTSGTSSRCTGSKRKRSCFQTEMIDVVRTTMDIQTTHMQRLLSWQKDKYELEAARRKEV
         +      +  +  T+  PM +    ++  ++L  T T+R   S   +  +GSKRKR    T+  D+VRT ++     + R+  W   + +     R+E+
Subjt:  MEEEIRLGSQDLFGTEQRPMENPCTADVGEEELSNTPTSRRNTSGTSSRCTGSKRKRSCFQTEMIDVVRTTMDIQTTHMQRLLSWQKDKYELEAARRKEV

Query:  VDLLYHIEGLTEHDRVSLIDMLVTDIQKTDCFLQVPPQSRRTYCLRLL
        V  L  I  LT  DR  L+ +L+ ++     FL+VP   +  YC  +L
Subjt:  VDLLYHIEGLTEHDRVSLIDMLVTDIQKTDCFLQVPPQSRRTYCLRLL

A0A5A7TI01 Retrotransposon protein (e value: 8.8e-49; % identity: 38.17)
Query:  MDRRTFAILCTMLKTTGGLVPTQYVDIEEMVAMFLHILAHDVKNRVIRRQFARSGETVSRHFNSVLNSVLQLHDVLLKKPEPITSTCTD-----------
        MDRR F ILC +L+T  GL   + VD+EEMVAMFLHI+AHDVKNRVI+R+F RSGET+SRHFN VL  V++LHD LLKKP+P+ + CTD           
Subjt:  MDRRTFAILCTMLKTTGGLVPTQYVDIEEMVAMFLHILAHDVKNRVIRRQFARSGETVSRHFNSVLNSVLQLHDVLLKKPEPITSTCTD-----------

Query:  --AIMAG---------------------------------------------------------------------TSKHSKHTWTKVEDARLVESLVYL
           ++AG                                                                       KHS    TK E+A LVE LV L
Subjt:  --AIMAG---------------------------------------------------------------------TSKHSKHTWTKVEDARLVESLVYL

Query:  VH-NGWRSDNGTFRPGYVQHLQKMLAEKLPNSSLELNTIDCKAENEVFDAWVKSHTNAKGMRNKPFPHYDDLAFVFGKDRATGMGAETPGEMASNAAQQM
        V+  GWRSDNGTF PGY+  L +++A K+              E EVFD WVKSH  AKG+ NK F HYD+L++VFGKDRAT   AE+     SN     
Subjt:  VH-NGWRSDNGTFRPGYVQHLQKMLAEKLPNSSLELNTIDCKAENEVFDAWVKSHTNAKGMRNKPFPHYDDLAFVFGKDRATGMGAETPGEMASNAAQQM

Query:  EEEIRLGSQDLFGTEQRPMENPCTADVGEEELSNTPTSRRNTSGTSSRCTGSKRKRSCFQTEMIDVVRTTMD
        +  I   + D+   +  PM +    ++  ++L  T T+R   S   +  +GSK+KR    T+  D+VRT ++
Subjt:  EEEIRLGSQDLFGTEQRPMENPCTADVGEEELSNTPTSRRNTSGTSSRCTGSKRKRSCFQTEMIDVVRTTMD

A0A5D3C7T4 Uncharacterized protein (e value: 4.4e-64; % identity: 38.37)
Query:  MDRRTFAILCTMLKTTGGLVPTQYVDIEEMVAMFLHILAHDVKNRVIRRQFARSGETVSRHFNSVLNSVLQLHDVLLKKPEPITSTCT------------
        MDRR F ILCTML+T GGL  TQYVD++EMV +FLHI+AHDVKNRV RR  ARSGETVSRHFN+VLN+VL+LH++LLK+P+P+T +C             
Subjt:  MDRRTFAILCTMLKTTGGLVPTQYVDIEEMVAMFLHILAHDVKNRVIRRQFARSGETVSRHFNSVLNSVLQLHDVLLKKPEPITSTCT------------

Query:  ----------------------------------DAIMAGT-SKHSKHTWTKVEDARLVESLVYLV-HNGWRSDNGTFRPGYVQH---LQKMLAEKLPNS
                                             MA T SK +KH WT +ED  LVE L+ LV   GWR+DNGTF+ GY++    + +M+       
Subjt:  ----------------------------------DAIMAGT-SKHSKHTWTKVEDARLVESLVYLV-HNGWRSDNGTFRPGYVQH---LQKMLAEKLPNS

Query:  SLELNTIDCKAENEVFDAWVKSHTNAKGMRNKPFPHYDDLAFVFGKDRATGMGAETPGEMASNAAQQMEE-EIRLGSQDLFGTEQRPMENPCTADVGEEE
                 + E  VFD WVK H NA+G+ NKPFP++ DL  VFG+DRATG   +TP EM+S  A+  EE ++ +  +D        +E P       E+
Subjt:  SLELNTIDCKAENEVFDAWVKSHTNAKGMRNKPFPHYDDLAFVFGKDRATGMGAETPGEMASNAAQQMEE-EIRLGSQDLFGTEQRPMENPCTADVGEEE

Query:  LSNTPTSRRNTSGTSSRCTGSKRKRSCFQTEMIDVVRTTMDIQTTHMQRLLSWQKDKYELEAARRKEVVDLLYHIEGLTEHDRVSLIDMLVTDIQKTDCF
        + +TPTS  + +G+S       +KR  +  +++D  R +M   +  + ++ +WQ++K E+E++  K +   L  I G+   D + + + L+ D      F
Subjt:  LSNTPTSRRNTSGTSSRCTGSKRKRSCFQTEMIDVVRTTMDIQTTHMQRLLSWQKDKYELEAARRKEVVDLLYHIEGLTEHDRVSLIDMLVTDIQKTDCF

Query:  LQVP
        L  P
Subjt:  LQVP

A0A5D3DTL0 Myb_DNA-bind_3 domain-containing protein (e value: 2.8e-55; % identity: 42.27)
Query:  MLKTTGGLVPTQYVDIEEMVAMFLHILAHDVKNRVIRRQFARSGETVSRHFNSVLNSVLQLHDVLLKKPEPITSTCTD------AIMAGTSKHSKHTWTK
        ML+T GGL  TQYVD+EEMV +FLHI+AHDVKNRV RR FARSGETVSRHFN VLN VL+LH++LLK+P+ +T +C+        + +  SK +KH WT 
Subjt:  MLKTTGGLVPTQYVDIEEMVAMFLHILAHDVKNRVIRRQFARSGETVSRHFNSVLNSVLQLHDVLLKKPEPITSTCTD------AIMAGTSKHSKHTWTK

Query:  VEDARLVESLVYLVHNG-WRSDNGTFRPGYVQHLQKMLAEKLPNSSLEL---------------NTID------------------CKAENEVFDAWVKS
        +ED  LVE L+ LV  G WR DNGTF+PGY+  +QK++ EK+  S++++                TI                    +AE  V + WVK 
Subjt:  VEDARLVESLVYLVHNG-WRSDNGTFRPGYVQHLQKMLAEKLPNSSLEL---------------NTID------------------CKAENEVFDAWVKS

Query:  HTNAKGMRNKPFPHYDDLAFVFGKDRATGMGAETPGEMASNAAQQMEE-EIRLGSQDLFGTEQRPMENPCTADVGEEELSNTPTSRRNTSGTSSRCTGSK
        H NA+ + NKPFP++ DL  VFG+DRATG   +TP EM S  A+  EE ++ +  +D        +E P       E++ +TPTS  + +G+S       
Subjt:  HTNAKGMRNKPFPHYDDLAFVFGKDRATGMGAETPGEMASNAAQQMEE-EIRLGSQDLFGTEQRPMENPCTADVGEEELSNTPTSRRNTSGTSSRCTGSK

Query:  RKRSCFQTEMIDVVRTT
        +KR  +  +++D  R T
Subjt:  RKRSCFQTEMIDVVRTT

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT5G41980.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912) (e value: 1.0e-12; % identity: 43.59)
Query:  CRESTMMDRRTFAILCTMLKTTGGLVPTQYVDIEEMVAMFLHILAHDVKNRVIRRQFARSGETVSRHFNSVLNSVLQL
        C E+  MD+  F  LC +L+T G L  T  + IE  +A+FL I+ H+++ R ++  F  SGET+SRHFN+VLN+V+ +
Subjt:  CRESTMMDRRTFAILCTMLKTTGGLVPTQYVDIEEMVAMFLHILAHDVKNRVIRRQFARSGETVSRHFNSVLNSVLQL


Sequences
CDS sequence
ATGGAAACCGTTGAAGAATTATTTAATATCCTTACAATCATGACCACATCTCAAAGACAGATGTTGGTAAACTTAGGAATATTATTGAACAATTACCGGCGCTTAGAGTA
TCGATCGTCATACGATCGACATCGAATTAGGCAGATGTATTTCTTTCGCCTCATTTACGAGAATGATCTATGCTGCCGCGAGAGCACAATGATGGATAGAAGGACCTTCG
CTATCTTGTGTACTATGCTCAAGACCACTGGTGGTTTAGTACCAACACAATATGTTGATATCGAGGAGATGGTTGCTATGTTCCTACACATCCTAGCACATGATGTCAAG
AATCGAGTGATTCGTAGGCAATTCGCAAGGTCTGGTGAGACAGTCTCTAGACACTTCAACTCTGTGTTAAACTCAGTCTTGCAACTACACGACGTATTGTTGAAGAAACC
AGAACCAATCACGAGCACGTGCACTGATGCCATTATGGCAGGTACTTCAAAACACTCCAAGCATACATGGACGAAGGTTGAGGATGCGAGATTGGTCGAGTCATTGGTCT
ATTTGGTACATAATGGGTGGCGGTCAGACAATGGAACATTCAGGCCTGGATATGTCCAACATCTTCAAAAGATGCTAGCAGAGAAACTACCAAATTCATCATTAGAACTG
AATACAATAGATTGCAAAGCAGAGAATGAGGTATTTGATGCATGGGTCAAGAGCCATACAAATGCCAAGGGGATGAGGAACAAACCATTTCCACACTATGATGATCTTGC
ATTTGTATTTGGAAAAGATAGAGCAACGGGGATGGGCGCGGAGACCCCAGGGGAAATGGCCTCTAACGCTGCACAACAAATGGAGGAGGAGATCCGACTGGGATCGCAAG
ACTTATTCGGGACGGAGCAACGACCAATGGAGAATCCATGCACTGCTGATGTAGGGGAGGAAGAATTGTCAAATACTCCTACTAGTAGACGTAATACATCTGGCACGTCT
TCTCGATGTACTGGTAGCAAAAGAAAGAGATCATGCTTCCAAACTGAAATGATTGATGTTGTGCGGACAACAATGGACATCCAAACAACTCACATGCAACGCCTTCTATC
GTGGCAGAAGGATAAGTATGAGTTAGAGGCTGCACGACGGAAGGAAGTGGTTGACCTGTTGTACCATATAGAAGGGTTGACCGAGCATGATCGTGTATCTCTGATAGACA
TGCTTGTCACTGATATACAGAAGACAGACTGCTTCCTACAGGTCCCACCTCAATCGAGGAGGACATACTGTTTGCGTCTCCTAGGTAGGATTGGATGA
mRNA sequence
ATGGAAACCGTTGAAGAATTATTTAATATCCTTACAATCATGACCACATCTCAAAGACAGATGTTGGTAAACTTAGGAATATTATTGAACAATTACCGGCGCTTAGAGTA
TCGATCGTCATACGATCGACATCGAATTAGGCAGATGTATTTCTTTCGCCTCATTTACGAGAATGATCTATGCTGCCGCGAGAGCACAATGATGGATAGAAGGACCTTCG
CTATCTTGTGTACTATGCTCAAGACCACTGGTGGTTTAGTACCAACACAATATGTTGATATCGAGGAGATGGTTGCTATGTTCCTACACATCCTAGCACATGATGTCAAG
AATCGAGTGATTCGTAGGCAATTCGCAAGGTCTGGTGAGACAGTCTCTAGACACTTCAACTCTGTGTTAAACTCAGTCTTGCAACTACACGACGTATTGTTGAAGAAACC
AGAACCAATCACGAGCACGTGCACTGATGCCATTATGGCAGGTACTTCAAAACACTCCAAGCATACATGGACGAAGGTTGAGGATGCGAGATTGGTCGAGTCATTGGTCT
ATTTGGTACATAATGGGTGGCGGTCAGACAATGGAACATTCAGGCCTGGATATGTCCAACATCTTCAAAAGATGCTAGCAGAGAAACTACCAAATTCATCATTAGAACTG
AATACAATAGATTGCAAAGCAGAGAATGAGGTATTTGATGCATGGGTCAAGAGCCATACAAATGCCAAGGGGATGAGGAACAAACCATTTCCACACTATGATGATCTTGC
ATTTGTATTTGGAAAAGATAGAGCAACGGGGATGGGCGCGGAGACCCCAGGGGAAATGGCCTCTAACGCTGCACAACAAATGGAGGAGGAGATCCGACTGGGATCGCAAG
ACTTATTCGGGACGGAGCAACGACCAATGGAGAATCCATGCACTGCTGATGTAGGGGAGGAAGAATTGTCAAATACTCCTACTAGTAGACGTAATACATCTGGCACGTCT
TCTCGATGTACTGGTAGCAAAAGAAAGAGATCATGCTTCCAAACTGAAATGATTGATGTTGTGCGGACAACAATGGACATCCAAACAACTCACATGCAACGCCTTCTATC
GTGGCAGAAGGATAAGTATGAGTTAGAGGCTGCACGACGGAAGGAAGTGGTTGACCTGTTGTACCATATAGAAGGGTTGACCGAGCATGATCGTGTATCTCTGATAGACA
TGCTTGTCACTGATATACAGAAGACAGACTGCTTCCTACAGGTCCCACCTCAATCGAGGAGGACATACTGTTTGCGTCTCCTAGGTAGGATTGGATGA
Protein sequence
METVEELFNILTIMTTSQRQMLVNLGILLNNYRRLEYRSSYDRHRIRQMYFFRLIYENDLCCRESTMMDRRTFAILCTMLKTTGGLVPTQYVDIEEMVAMFLHILAHDVK
NRVIRRQFARSGETVSRHFNSVLNSVLQLHDVLLKKPEPITSTCTDAIMAGTSKHSKHTWTKVEDARLVESLVYLVHNGWRSDNGTFRPGYVQHLQKMLAEKLPNSSLEL
NTIDCKAENEVFDAWVKSHTNAKGMRNKPFPHYDDLAFVFGKDRATGMGAETPGEMASNAAQQMEEEIRLGSQDLFGTEQRPMENPCTADVGEEELSNTPTSRRNTSGTS
SRCTGSKRKRSCFQTEMIDVVRTTMDIQTTHMQRLLSWQKDKYELEAARRKEVVDLLYHIEGLTEHDRVSLIDMLVTDIQKTDCFLQVPPQSRRTYCLRLLGRIG