CuGenDBv2

Tan0020498 (gene) of Snake gourd v1 genome

Gene ID: Tan0020498
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Retrotransposon protein
Genome location: LG09:47410733..47411997
RNA-Seq Expression: Tan0020498
Synteny: Tan0020498
Gene Ontology terms: NA
InterPro domains: IPR027806 - Harbinger transposase-derived nuclease domain
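
The genome location listed above can be split into its linkage group, start, and end for downstream use. The following is a minimal sketch, not part of the database itself: the names `Location` and `parse_location` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed genome location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumes 1-based, inclusive coordinates (an assumption, not confirmed by the page).
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a string of the form 'SEQID:START..END'."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.groups()
    return Location(seqid, int(start), int(end))


loc = parse_location("LG09:47410733..47411997")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the reported span covers 1265 positions on LG09.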


Homology
GenBank top hits (e value, % identity, alignment)
KAA0031556.1 retrotransposon protein [Cucumis melo var. makuwa]; e value 9.3e-90; identity 50.14%
Query:  MQGPRRIDNQSPHVRHQIRQLNFFRIIQENDLACRESTRMDRRTFSILCELLKGLGGLTSTQYVDVEEMVAMFLHIIAHDVKNRVVRRQFVRSGETVSRH
        M   +R+ +  P  RH+IR+L +FR+I E+DL CR+STRMDRRTF+ILC LL+ + GL+ST+ VDVEEMVAMFLH++AHDVKNRV++R+FVRSGETVSRH
Subjt:  MQGPRRIDNQSPHVRHQIRQLNFFRIIQENDLACRESTRMDRRTFSILCELLKGLGGLTSTQYVDVEEMVAMFLHIIAHDVKNRVVRRQFVRSGETVSRH

Query:  FNVVLSGIVRLHSGILSRHLHRFQIRVDERRWRWFE--------------VPRGNE--------LISIKSIHMSDT-------------TCCDNLLLK--
        FN+VL  ++RL+  ++ R +       +++RW+ FE              VP G+          I+   + + DT             +  D+ +L+  
Subjt:  FNVVLSGIVRLHSGILSRHLHRFQIRVDERRWRWFE--------------VPRGNE--------LISIKSIHMSDT-------------TCCDNLLLK--

Query:  ----------SGYYYLCDAGYPNAEGFLAPYRGQRYHLTEWRG-GNPPTSPKELFNMRHSSARNVIERAFGMLKGRWAILRGKSFYPVEVQCRTITACCL
                   GYYYLCDAGYPNAEGFLAPYRGQRYHL EWRG  N PT+ KE FNM+HSSARNVIERAFG+LKGRWAILRGKS+YP++VQCRTI AC L
Subjt:  ----------SGYYYLCDAGYPNAEGFLAPYRGQRYHLTEWRG-GNPPTSPKELFNMRHSSARNVIERAFGMLKGRWAILRGKSFYPVEVQCRTITACCL

Query:  LHNLIIQEMGPDPTFEEAHTSDPD-SNGMNTDNIEFVETTDAWTEWREHIANHMW
        LHNLI +EM      E+    D   +    +++I+++ETT+ W++WR+++A  M+
Subjt:  LHNLIIQEMGPDPTFEEAHTSDPD-SNGMNTDNIEFVETTDAWTEWREHIANHMW

KAA0035620.1 retrotransposon protein [Cucumis melo var. makuwa]; e value 9.3e-90; identity 49.73%
Query:  ATQYQFVSTFTAIMQGPRRIDNQSPHVRHQIRQLNFFRIIQENDLACRESTRMDRRTFSILCELLKGLGGLTSTQYVDVEEMVAMFLHIIAHDVKNRVVR
        A+Q Q +     +    +RI +     RH+IRQL +FR+I  +DL CR+STRMDRR F+ILC LL+ + GLTST+ VDVEEMVAMFLHI+AHDVKNRV++
Subjt:  ATQYQFVSTFTAIMQGPRRIDNQSPHVRHQIRQLNFFRIIQENDLACRESTRMDRRTFSILCELLKGLGGLTSTQYVDVEEMVAMFLHIIAHDVKNRVVR

Query:  RQFVRSGETVSRHFNVVLSGIVRLHSGILSRHLHRFQIRVDERRWRWFE--------------VPRGNEL--------ISIKSIHMSDT-----------
        R+F+RSGET+SRHFN+VL  ++RLH  +L +          ++RWRWFE              VP  +          ++   + + DT           
Subjt:  RQFVRSGETVSRHFNVVLSGIVRLHSGILSRHLHRFQIRVDERRWRWFE--------------VPRGNEL--------ISIKSIHMSDT-----------

Query:  --TCCDNLLLK------------SGYYYLCDAGYPNAEGFLAPYRGQRYHLTEWRG-GNPPTSPKELFNMRHSSARNVIERAFGMLKGRWAILRGKSFYP
          +  D+ +L+             GYYYL DAGYPNAEGFLAPYRGQRYHL EWRG  N P++ KE FNM+HSSARNVIERAFG+LKGRWAILRGKS+YP
Subjt:  --TCCDNLLLK------------SGYYYLCDAGYPNAEGFLAPYRGQRYHLTEWRG-GNPPTSPKELFNMRHSSARNVIERAFGMLKGRWAILRGKSFYP

Query:  VEVQCRTITACCLLHNLIIQEMGPDPTFE-EAHTSDPDSNGMNT--DNIEFVETTDAWTEWREHIANHMW
        VEVQCRTI ACCLLHNLI +EM     F+ E +  + DS    T  D+I ++ET++ W++WR+++A  M+
Subjt:  VEVQCRTITACCLLHNLIIQEMGPDPTFE-EAHTSDPDSNGMNT--DNIEFVETTDAWTEWREHIANHMW

KAA0046727.1 retrotransposon protein [Cucumis melo var. makuwa]; e value 9.3e-90; identity 49.73%
Query:  ATQYQFVSTFTAIMQGPRRIDNQSPHVRHQIRQLNFFRIIQENDLACRESTRMDRRTFSILCELLKGLGGLTSTQYVDVEEMVAMFLHIIAHDVKNRVVR
        A+Q Q +     +    +RI +     RH+IRQL +FR+I  +DL CR+STRMDRR F+ILC LL+ + GLTST+ VDVEEMVAMFLHI+AHDVKNRV++
Subjt:  ATQYQFVSTFTAIMQGPRRIDNQSPHVRHQIRQLNFFRIIQENDLACRESTRMDRRTFSILCELLKGLGGLTSTQYVDVEEMVAMFLHIIAHDVKNRVVR

Query:  RQFVRSGETVSRHFNVVLSGIVRLHSGILSRHLHRFQIRVDERRWRWFE--------------VPRGNEL--------ISIKSIHMSDT-----------
        R+F+RSGET+SRHFN+VL  ++RLH  +L +          ++RWRWFE              VP  +          ++   + + DT           
Subjt:  RQFVRSGETVSRHFNVVLSGIVRLHSGILSRHLHRFQIRVDERRWRWFE--------------VPRGNEL--------ISIKSIHMSDT-----------

Query:  --TCCDNLLLK------------SGYYYLCDAGYPNAEGFLAPYRGQRYHLTEWRG-GNPPTSPKELFNMRHSSARNVIERAFGMLKGRWAILRGKSFYP
          +  D+ +L+             GYYYL DAGYPNAEGFLAPYRGQRYHL EWRG  N P++ KE FNM+HSSARNVIERAFG+LKGRWAILRGKS+YP
Subjt:  --TCCDNLLLK------------SGYYYLCDAGYPNAEGFLAPYRGQRYHLTEWRG-GNPPTSPKELFNMRHSSARNVIERAFGMLKGRWAILRGKSFYP

Query:  VEVQCRTITACCLLHNLIIQEMGPDPTFE-EAHTSDPDSNGMNT--DNIEFVETTDAWTEWREHIANHMW
        VEVQCRTI ACCLLHNLI +EM     F+ E +  + DS    T  D+I ++ET++ W++WR+++A  M+
Subjt:  VEVQCRTITACCLLHNLIIQEMGPDPTFE-EAHTSDPDSNGMNT--DNIEFVETTDAWTEWREHIANHMW

KAA0056561.1 retrotransposon protein [Cucumis melo var. makuwa]; e value 1.6e-89; identity 50.14%
Query:  MQGPRRIDNQSPHVRHQIRQLNFFRIIQENDLACRESTRMDRRTFSILCELLKGLGGLTSTQYVDVEEMVAMFLHIIAHDVKNRVVRRQFVRSGETVSRH
        M   +R+ +  P  RH+IR+L +FR+I E+DL CR+STRMDRRTF+ILC LL+ + GL+ST+ VDVEEMVAMFLH++AHDVKNRV++R+FVRSGETVSRH
Subjt:  MQGPRRIDNQSPHVRHQIRQLNFFRIIQENDLACRESTRMDRRTFSILCELLKGLGGLTSTQYVDVEEMVAMFLHIIAHDVKNRVVRRQFVRSGETVSRH

Query:  FNVVLSGIVRLHSGILSRHLHRFQIRVDERRWRWFE--------------VPRGNE--------LISIKSIHMSDT-------------TCCDNLLLK--
        FN+VL  ++RL+  ++ R +       +++RW+ FE              VP G+          I+   + + DT             +  D+ +L+  
Subjt:  FNVVLSGIVRLHSGILSRHLHRFQIRVDERRWRWFE--------------VPRGNE--------LISIKSIHMSDT-------------TCCDNLLLK--

Query:  ----------SGYYYLCDAGYPNAEGFLAPYRGQRYHLTEWRG-GNPPTSPKELFNMRHSSARNVIERAFGMLKGRWAILRGKSFYPVEVQCRTITACCL
                   GYYYLCDAGYPNAEGFLAPYRGQRYHL EWRG  N PT+ KE FNM+HSSARNVIERAFG+LKGRWAILRGKS+YP++VQCRTI AC L
Subjt:  ----------SGYYYLCDAGYPNAEGFLAPYRGQRYHLTEWRG-GNPPTSPKELFNMRHSSARNVIERAFGMLKGRWAILRGKSFYPVEVQCRTITACCL

Query:  LHNLIIQEMGPDPTFEEAHTSDPD-SNGMNTDNIEFVETTDAWTEWREHIANHMW
        LHNLI +EM      E+    D   +    +++I+++ETT+ W++WR+ +A  M+
Subjt:  LHNLIIQEMGPDPTFEEAHTSDPD-SNGMNTDNIEFVETTDAWTEWREHIANHMW

KAA0056564.1 retrotransposon protein [Cucumis melo var. makuwa]; e value 2.1e-89; identity 49.86%
Query:  MQGPRRIDNQSPHVRHQIRQLNFFRIIQENDLACRESTRMDRRTFSILCELLKGLGGLTSTQYVDVEEMVAMFLHIIAHDVKNRVVRRQFVRSGETVSRH
        M   +R+ +  P  RH+IR+L +FR+I E+DL CR+STRMDRRTF+ILC LL+ + GL+ST+ VDVEEMVAMFLH++AHDVKNRV++R+FVRSGETVSRH
Subjt:  MQGPRRIDNQSPHVRHQIRQLNFFRIIQENDLACRESTRMDRRTFSILCELLKGLGGLTSTQYVDVEEMVAMFLHIIAHDVKNRVVRRQFVRSGETVSRH

Query:  FNVVLSGIVRLHSGILSRHLHRFQIRVDERRWRWFE--------------VPRGNE--------LISIKSIHMSDT-------------TCCDNLLLK--
        FN+VL  ++RL+  ++ R +       +++RW+ FE              VP G+          I+   + + DT             +  D+ +L+  
Subjt:  FNVVLSGIVRLHSGILSRHLHRFQIRVDERRWRWFE--------------VPRGNE--------LISIKSIHMSDT-------------TCCDNLLLK--

Query:  ----------SGYYYLCDAGYPNAEGFLAPYRGQRYHLTEWRG-GNPPTSPKELFNMRHSSARNVIERAFGMLKGRWAILRGKSFYPVEVQCRTITACCL
                   GYYYLCDAGYPNAEGFLAPYRGQRYHL EWRG  N PT+ KE FNM+HSSARNVIERA+G+LKGRWAILRGKS+YP++VQCRTI AC L
Subjt:  ----------SGYYYLCDAGYPNAEGFLAPYRGQRYHLTEWRG-GNPPTSPKELFNMRHSSARNVIERAFGMLKGRWAILRGKSFYPVEVQCRTITACCL

Query:  LHNLIIQEMGPDPTFEEAHTSDPD-SNGMNTDNIEFVETTDAWTEWREHIANHMW
        LHNLI +EM      E+    D   +    +++I+++ETT+ W++WR+++A  M+
Subjt:  LHNLIIQEMGPDPTFEEAHTSDPD-SNGMNTDNIEFVETTDAWTEWREHIANHMW

TrEMBL top hits (e value, % identity, alignment)
A0A5A7TXW1 Retrotransposon protein; e value 4.5e-90; identity 49.73%
Query:  ATQYQFVSTFTAIMQGPRRIDNQSPHVRHQIRQLNFFRIIQENDLACRESTRMDRRTFSILCELLKGLGGLTSTQYVDVEEMVAMFLHIIAHDVKNRVVR
        A+Q Q +     +    +RI +     RH+IRQL +FR+I  +DL CR+STRMDRR F+ILC LL+ + GLTST+ VDVEEMVAMFLHI+AHDVKNRV++
Subjt:  ATQYQFVSTFTAIMQGPRRIDNQSPHVRHQIRQLNFFRIIQENDLACRESTRMDRRTFSILCELLKGLGGLTSTQYVDVEEMVAMFLHIIAHDVKNRVVR

Query:  RQFVRSGETVSRHFNVVLSGIVRLHSGILSRHLHRFQIRVDERRWRWFE--------------VPRGNEL--------ISIKSIHMSDT-----------
        R+F+RSGET+SRHFN+VL  ++RLH  +L +          ++RWRWFE              VP  +          ++   + + DT           
Subjt:  RQFVRSGETVSRHFNVVLSGIVRLHSGILSRHLHRFQIRVDERRWRWFE--------------VPRGNEL--------ISIKSIHMSDT-----------

Query:  --TCCDNLLLK------------SGYYYLCDAGYPNAEGFLAPYRGQRYHLTEWRG-GNPPTSPKELFNMRHSSARNVIERAFGMLKGRWAILRGKSFYP
          +  D+ +L+             GYYYL DAGYPNAEGFLAPYRGQRYHL EWRG  N P++ KE FNM+HSSARNVIERAFG+LKGRWAILRGKS+YP
Subjt:  --TCCDNLLLK------------SGYYYLCDAGYPNAEGFLAPYRGQRYHLTEWRG-GNPPTSPKELFNMRHSSARNVIERAFGMLKGRWAILRGKSFYP

Query:  VEVQCRTITACCLLHNLIIQEMGPDPTFE-EAHTSDPDSNGMNT--DNIEFVETTDAWTEWREHIANHMW
        VEVQCRTI ACCLLHNLI +EM     F+ E +  + DS    T  D+I ++ET++ W++WR+++A  M+
Subjt:  VEVQCRTITACCLLHNLIIQEMGPDPTFE-EAHTSDPDSNGMNT--DNIEFVETTDAWTEWREHIANHMW

A0A5A7UMZ4 Retrotransposon protein; e value 1.0e-89; identity 49.86%
Query:  MQGPRRIDNQSPHVRHQIRQLNFFRIIQENDLACRESTRMDRRTFSILCELLKGLGGLTSTQYVDVEEMVAMFLHIIAHDVKNRVVRRQFVRSGETVSRH
        M   +R+ +  P  RH+IR+L +FR+I E+DL CR+STRMDRRTF+ILC LL+ + GL+ST+ VDVEEMVAMFLH++AHDVKNRV++R+FVRSGETVSRH
Subjt:  MQGPRRIDNQSPHVRHQIRQLNFFRIIQENDLACRESTRMDRRTFSILCELLKGLGGLTSTQYVDVEEMVAMFLHIIAHDVKNRVVRRQFVRSGETVSRH

Query:  FNVVLSGIVRLHSGILSRHLHRFQIRVDERRWRWFE--------------VPRGNE--------LISIKSIHMSDT-------------TCCDNLLLK--
        FN+VL  ++RL+  ++ R +       +++RW+ FE              VP G+          I+   + + DT             +  D+ +L+  
Subjt:  FNVVLSGIVRLHSGILSRHLHRFQIRVDERRWRWFE--------------VPRGNE--------LISIKSIHMSDT-------------TCCDNLLLK--

Query:  ----------SGYYYLCDAGYPNAEGFLAPYRGQRYHLTEWRG-GNPPTSPKELFNMRHSSARNVIERAFGMLKGRWAILRGKSFYPVEVQCRTITACCL
                   GYYYLCDAGYPNAEGFLAPYRGQRYHL EWRG  N PT+ KE FNM+HSSARNVIERA+G+LKGRWAILRGKS+YP++VQCRTI AC L
Subjt:  ----------SGYYYLCDAGYPNAEGFLAPYRGQRYHLTEWRG-GNPPTSPKELFNMRHSSARNVIERAFGMLKGRWAILRGKSFYPVEVQCRTITACCL

Query:  LHNLIIQEMGPDPTFEEAHTSDPD-SNGMNTDNIEFVETTDAWTEWREHIANHMW
        LHNLI +EM      E+    D   +    +++I+++ETT+ W++WR+++A  M+
Subjt:  LHNLIIQEMGPDPTFEEAHTSDPD-SNGMNTDNIEFVETTDAWTEWREHIANHMW

A0A5A7UST8 Retrotransposon protein; e value 7.7e-90; identity 50.14%
Query:  MQGPRRIDNQSPHVRHQIRQLNFFRIIQENDLACRESTRMDRRTFSILCELLKGLGGLTSTQYVDVEEMVAMFLHIIAHDVKNRVVRRQFVRSGETVSRH
        M   +R+ +  P  RH+IR+L +FR+I E+DL CR+STRMDRRTF+ILC LL+ + GL+ST+ VDVEEMVAMFLH++AHDVKNRV++R+FVRSGETVSRH
Subjt:  MQGPRRIDNQSPHVRHQIRQLNFFRIIQENDLACRESTRMDRRTFSILCELLKGLGGLTSTQYVDVEEMVAMFLHIIAHDVKNRVVRRQFVRSGETVSRH

Query:  FNVVLSGIVRLHSGILSRHLHRFQIRVDERRWRWFE--------------VPRGNE--------LISIKSIHMSDT-------------TCCDNLLLK--
        FN+VL  ++RL+  ++ R +       +++RW+ FE              VP G+          I+   + + DT             +  D+ +L+  
Subjt:  FNVVLSGIVRLHSGILSRHLHRFQIRVDERRWRWFE--------------VPRGNE--------LISIKSIHMSDT-------------TCCDNLLLK--

Query:  ----------SGYYYLCDAGYPNAEGFLAPYRGQRYHLTEWRG-GNPPTSPKELFNMRHSSARNVIERAFGMLKGRWAILRGKSFYPVEVQCRTITACCL
                   GYYYLCDAGYPNAEGFLAPYRGQRYHL EWRG  N PT+ KE FNM+HSSARNVIERAFG+LKGRWAILRGKS+YP++VQCRTI AC L
Subjt:  ----------SGYYYLCDAGYPNAEGFLAPYRGQRYHLTEWRG-GNPPTSPKELFNMRHSSARNVIERAFGMLKGRWAILRGKSFYPVEVQCRTITACCL

Query:  LHNLIIQEMGPDPTFEEAHTSDPD-SNGMNTDNIEFVETTDAWTEWREHIANHMW
        LHNLI +EM      E+    D   +    +++I+++ETT+ W++WR+ +A  M+
Subjt:  LHNLIIQEMGPDPTFEEAHTSDPD-SNGMNTDNIEFVETTDAWTEWREHIANHMW

A0A5D3BDX0 Retrotransposon protein; e value 4.5e-90; identity 49.73%
Query:  ATQYQFVSTFTAIMQGPRRIDNQSPHVRHQIRQLNFFRIIQENDLACRESTRMDRRTFSILCELLKGLGGLTSTQYVDVEEMVAMFLHIIAHDVKNRVVR
        A+Q Q +     +    +RI +     RH+IRQL +FR+I  +DL CR+STRMDRR F+ILC LL+ + GLTST+ VDVEEMVAMFLHI+AHDVKNRV++
Subjt:  ATQYQFVSTFTAIMQGPRRIDNQSPHVRHQIRQLNFFRIIQENDLACRESTRMDRRTFSILCELLKGLGGLTSTQYVDVEEMVAMFLHIIAHDVKNRVVR

Query:  RQFVRSGETVSRHFNVVLSGIVRLHSGILSRHLHRFQIRVDERRWRWFE--------------VPRGNEL--------ISIKSIHMSDT-----------
        R+F+RSGET+SRHFN+VL  ++RLH  +L +          ++RWRWFE              VP  +          ++   + + DT           
Subjt:  RQFVRSGETVSRHFNVVLSGIVRLHSGILSRHLHRFQIRVDERRWRWFE--------------VPRGNEL--------ISIKSIHMSDT-----------

Query:  --TCCDNLLLK------------SGYYYLCDAGYPNAEGFLAPYRGQRYHLTEWRG-GNPPTSPKELFNMRHSSARNVIERAFGMLKGRWAILRGKSFYP
          +  D+ +L+             GYYYL DAGYPNAEGFLAPYRGQRYHL EWRG  N P++ KE FNM+HSSARNVIERAFG+LKGRWAILRGKS+YP
Subjt:  --TCCDNLLLK------------SGYYYLCDAGYPNAEGFLAPYRGQRYHLTEWRG-GNPPTSPKELFNMRHSSARNVIERAFGMLKGRWAILRGKSFYP

Query:  VEVQCRTITACCLLHNLIIQEMGPDPTFE-EAHTSDPDSNGMNT--DNIEFVETTDAWTEWREHIANHMW
        VEVQCRTI ACCLLHNLI +EM     F+ E +  + DS    T  D+I ++ET++ W++WR+++A  M+
Subjt:  VEVQCRTITACCLLHNLIIQEMGPDPTFE-EAHTSDPDSNGMNT--DNIEFVETTDAWTEWREHIANHMW

A0A5D3D7X8 Retrotransposon protein; e value 4.5e-90; identity 50.14%
Query:  MQGPRRIDNQSPHVRHQIRQLNFFRIIQENDLACRESTRMDRRTFSILCELLKGLGGLTSTQYVDVEEMVAMFLHIIAHDVKNRVVRRQFVRSGETVSRH
        M   +R+ +  P  RH+IR+L +FR+I E+DL CR+STRMDRRTF+ILC LL+ + GL+ST+ VDVEEMVAMFLH++AHDVKNRV++R+FVRSGETVSRH
Subjt:  MQGPRRIDNQSPHVRHQIRQLNFFRIIQENDLACRESTRMDRRTFSILCELLKGLGGLTSTQYVDVEEMVAMFLHIIAHDVKNRVVRRQFVRSGETVSRH

Query:  FNVVLSGIVRLHSGILSRHLHRFQIRVDERRWRWFE--------------VPRGNE--------LISIKSIHMSDT-------------TCCDNLLLK--
        FN+VL  ++RL+  ++ R +       +++RW+ FE              VP G+          I+   + + DT             +  D+ +L+  
Subjt:  FNVVLSGIVRLHSGILSRHLHRFQIRVDERRWRWFE--------------VPRGNE--------LISIKSIHMSDT-------------TCCDNLLLK--

Query:  ----------SGYYYLCDAGYPNAEGFLAPYRGQRYHLTEWRG-GNPPTSPKELFNMRHSSARNVIERAFGMLKGRWAILRGKSFYPVEVQCRTITACCL
                   GYYYLCDAGYPNAEGFLAPYRGQRYHL EWRG  N PT+ KE FNM+HSSARNVIERAFG+LKGRWAILRGKS+YP++VQCRTI AC L
Subjt:  ----------SGYYYLCDAGYPNAEGFLAPYRGQRYHLTEWRG-GNPPTSPKELFNMRHSSARNVIERAFGMLKGRWAILRGKSFYPVEVQCRTITACCL

Query:  LHNLIIQEMGPDPTFEEAHTSDPD-SNGMNTDNIEFVETTDAWTEWREHIANHMW
        LHNLI +EM      E+    D   +    +++I+++ETT+ W++WR+++A  M+
Subjt:  LHNLIIQEMGPDPTFEEAHTSDPD-SNGMNTDNIEFVETTDAWTEWREHIANHMW

SwissProt top hits (e value, % identity, alignment)
Q6AZB8 Putative nuclease HARBI1; e value 8.1e-04; identity 26.83%
Query:  RYHLTEW--RGGNPPTSPKEL-FNMRHSSARNVIERAFGMLKGRWAIL---RGKSFYPVEVQCRTITACCLLHNLIIQEMGPDPTFEEAHTSDPDSNGMN
        RY L +W       P SP +  +N+ H++   +++R F  ++ R+  L   +G   Y  E     I ACC+LHN+ +Q      TFE    +D     ++
Subjt:  RYHLTEW--RGGNPPTSPKEL-FNMRHSSARNVIERAFGMLKGRWAIL---RGKSFYPVEVQCRTITACCLLHNLIIQEMGPDPTFEEAHTSDPDSNGMN

Query:  TDNIEFVETTDAWTEWREHIANH
          +    +  +A    +E I NH
Subjt:  TDNIEFVETTDAWTEWREHIANH

Arabidopsis top hits (e value, % identity, alignment)
AT1G43722.1 unknown protein; e value 1.4e-22; identity 30.45%
Query:  NFFRIIQENDLACRESTRMDRRTFSILCELLKGLGGLTSTQYVDVEEMVAMFLHIIAHDVKNRVVRRQFVRSGETVSRHFNVVLSGIVRLHSGIL----S
        N +R +Q++  AC +  RM    F+ LC +L+    L  T  + +EE VAMFL I  H+   R V  +F R+ ETV R F  VL+    L    +     
Subjt:  NFFRIIQENDLACRESTRMDRRTFSILCELLKGLGGLTSTQYVDVEEMVAMFLHIIAHDVKNRVVRRQFVRSGETVSRHFNVVLSGIVRLHSGIL----S

Query:  RHLHRF--QIRVDERRWRWFEVPRG------------------------NELISIKSI-----------HMSDTTCCDNLLLK-------------SGYY
        + L+R   +++VD+R W +F    G                        N  ++I +I           + +  +C D  +L+             S  Y
Subjt:  RHLHRF--QIRVDERRWRWFEVPRG------------------------NELISIKSI-----------HMSDTTCCDNLLLK-------------SGYY

Query:  YLCDAGYPNAEGFLAPYRGQ-----RYHLTEWRGGNPPTSPKELFNMRHSSARNVIERAFGMLKGR
        YL D+GYPN +G LAPYR       RYH++++  G  P +  ELFN  H+S R+VIER F + K +
Subjt:  YLCDAGYPNAEGFLAPYRGQ-----RYHLTEWRGGNPPTSPKELFNMRHSSARNVIERAFGMLKGR

AT4G10890.1 unknown protein; e value 8.0e-15; identity 32.78%
Query:  GLTSTQYVDVEEMVAMFLHIIAHDVKNRVVR---RQFVRSGETVSRHFNVVLSGIVRLHSGIL-SRHLHRFQIRVDERRWRWFEVPRGNELISIKSIHMS
        GL   + V +EE VAMFL  +    KNR VR    ++ +S   V R  + VLS +++  +  L S+  H  ++          +    NE     S H S
Subjt:  GLTSTQYVDVEEMVAMFLHIIAHDVKNRVVR---RQFVRSGETVSRHFNVVLSGIVRLHSGIL-SRHLHRFQIRVDERRWRWFEVPRGNELISIKSIHMS

Query:  DTTCCDNLLLKSGYYYLCDAGYPNAEGFLAPYRGQRYHLTEWRGGNPPTSPKELFNMRHSSARNVIERAFGMLKGRWAIL
        +             YYL ++ YP   G+L P+R   YHL ++  G PP + +ELFN +H   R+VI+R FG+ K +W IL
Subjt:  DTTCCDNLLLKSGYYYLCDAGYPNAEGFLAPYRGQRYHLTEWRGGNPPTSPKELFNMRHSSARNVIERAFGMLKGRWAIL

AT5G28730.1 unknown protein; e value 1.1e-11; identity 29.63%
Query:  IQENDLACRESTRMDRRTFSILCELLKGLGGLTSTQYVDVEEMVAMFLHIIAHDVKNRVVRRQFVRSGETVSRHFNVVLSGIVRLH-SGILSRHLHRF--
        I  N+++C+   RM    F+ LCE+L G  GL S+  + ++E VA+FL I A +   R +  +F  + ET+ R F+ VL  + RL    I  R +     
Subjt:  IQENDLACRESTRMDRRTFSILCELLKGLGGLTSTQYVDVEEMVAMFLHIIAHDVKNRVVRRQFVRSGETVSRHFNVVLSGIVRLH-SGILSRHLHRF--

Query:  ---QIRVDERRWRWFEVPRG---NELISIKSIHMSDTTCC--------DNLLLKSGY-------------YYLCDAGYPNAEGFLAPYR
           +++ D R W +     G     +++I  + M  T C         D  +L +               YYL D+GY N  G+LAPYR
Subjt:  ---QIRVDERRWRWFEVPRG---NELISIKSIHMSDTTCC--------DNLLLKSGY-------------YYLCDAGYPNAEGFLAPYR

AT5G35695.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912); e value 4.7e-23; identity 38.31%
Query:  YYLCDAGYPNAEGFLAPYRGQRYHLTEWRG-GNPPTSPKELFNMRHSSARNVIERAFGMLKGRWAILRGKSFYPVEVQCRTITACCLLHNLIIQEMG---
        +YL D G+ N   FLAP+RG RYHL E+ G    P +P ELFN+RH S RNVIER FG+ K R+AI +    +  + Q   +  C  LHN + +E     
Subjt:  YYLCDAGYPNAEGFLAPYRGQRYHLTEWRG-GNPPTSPKELFNMRHSSARNVIERAFGMLKGRWAILRGKSFYPVEVQCRTITACCLLHNLIIQEMG---

Query:  ---PDPTFEEAHTSDPDSNGMNTDNIEFVETTDAWTE-------WREHIANHMW
           PD    E    + + N MNT+ I+  E  +A  +       WR+ +A  MW
Subjt:  ---PDPTFEEAHTSDPDSNGMNTDNIEFVETTDAWTE-------WREHIANHMW

AT5G41980.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912); e value 7.5e-29; identity 27.57%
Query:  FRIIQENDLACRESTRMDRRTFSILCELLKGLGGLTSTQYVDVEEMVAMFLHIIAHDVKNRVVRRQFVRSGETVSRHFNVVLSGIVRLHSGILSRHLHRF
        ++I+   +  C E+ RMD+  F  LC+LL+  G L  T  + +E  +A+FL II H+++ R V+  F  SGET+SRHFN VL+ ++ +       + +  
Subjt:  FRIIQENDLACRESTRMDRRTFSILCELLKGLGGLTSTQYVDVEEMVAMFLHIIAHDVKNRVVRRQFVRSGETVSRHFNVVLSGIVRLHSGILSRHLHRF

Query:  QIRVDERRWR-------WFEVP------------RGNELI-------------------------SIKSIHMSDTTCCDNLLLKSGYYYLCDAGYPNAEG
         +  D+  ++        F +P             GN L+                         S + +  +  T  + L +  G YY+ D  YPN  G
Subjt:  QIRVDERRWR-------WFEVP------------RGNELI-------------------------SIKSIHMSDTTCCDNLLLKSGYYYLCDAGYPNAEG

Query:  FLAPYRGQRYHLTEWRGGNPPTSPKELFNMRHSSARNVIERAFGMLKGRWAILRGKSFYPVEVQCRTITACCLLHNLIIQEMGPDPTF----EE--AHTS
        F+APY G           N     KE+FN RH      I R FG LK R+ IL     YP++ Q + + A C LHN +  E   D  F    EE  A   
Subjt:  FLAPYRGQRYHLTEWRGGNPPTSPKELFNMRHSSARNVIERAFGMLKGRWAILRGKSFYPVEVQCRTITACCLLHNLIIQEMGPDPTF----EE--AHTS

Query:  DPDSNGMNTDNIEFV--------ETTDAWTEWREHIANHMW
        +     +  + +E V        E  +     R+ IA+ +W
Subjt:  DPDSNGMNTDNIEFV--------ETTDAWTEWREHIANHMW


Sequences
CDS sequence
ATGATATGTGCAACTCAATATCAATTTGTATCAACATTCACTGCAATTATGCAAGGGCCACGAAGAATTGACAACCAATCCCCCCATGTAAGACACCAAATCAGACAATT
AAACTTTTTTCGTATTATACAAGAAAATGATCTAGCTTGTCGCGAGAGCACACGGATGGATAGGAGAACATTTTCCATCCTCTGTGAATTGCTTAAAGGATTAGGGGGGT
TGACGTCTACTCAATACGTAGATGTGGAAGAAATGGTTGCCATGTTCCTTCACATCATTGCGCATGATGTGAAGAACAGAGTAGTGAGAAGACAATTTGTGCGGTCTGGC
GAGACTGTGTCCAGACACTTCAATGTCGTCCTAAGTGGGATAGTAAGACTTCACTCTGGTATTCTATCAAGGCACCTTCACCGATTTCAAATACGTGTCGACGAGCGGCG
ATGGCGATGGTTCGAGGTCCCAAGGGGTAATGAACTGATTTCAATTAAAAGTATACATATGTCAGACACGACATGTTGTGATAACCTGCTTTTGAAATCAGGGTATTACT
ACTTGTGCGACGCTGGCTATCCCAACGCGGAGGGATTCCTAGCCCCGTATAGAGGACAACGTTACCACTTAACTGAATGGCGAGGAGGGAATCCACCTACCAGTCCAAAA
GAGTTGTTCAACATGCGACATTCATCTGCTCGAAATGTAATTGAGAGGGCATTCGGAATGTTGAAGGGTCGATGGGCGATATTAAGAGGAAAGTCTTTTTACCCAGTAGA
AGTACAATGTAGAACTATAACTGCATGTTGTTTGCTACATAACTTAATTATACAAGAGATGGGCCCAGACCCCACATTCGAGGAGGCACATACAAGTGACCCTGATTCAA
ATGGGATGAATACAGACAACATTGAGTTTGTAGAGACAACAGATGCCTGGACTGAATGGAGGGAACATATCGCAAACCACATGTGGATGAATTAG
mRNA sequence
ATGATATGTGCAACTCAATATCAATTTGTATCAACATTCACTGCAATTATGCAAGGGCCACGAAGAATTGACAACCAATCCCCCCATGTAAGACACCAAATCAGACAATT
AAACTTTTTTCGTATTATACAAGAAAATGATCTAGCTTGTCGCGAGAGCACACGGATGGATAGGAGAACATTTTCCATCCTCTGTGAATTGCTTAAAGGATTAGGGGGGT
TGACGTCTACTCAATACGTAGATGTGGAAGAAATGGTTGCCATGTTCCTTCACATCATTGCGCATGATGTGAAGAACAGAGTAGTGAGAAGACAATTTGTGCGGTCTGGC
GAGACTGTGTCCAGACACTTCAATGTCGTCCTAAGTGGGATAGTAAGACTTCACTCTGGTATTCTATCAAGGCACCTTCACCGATTTCAAATACGTGTCGACGAGCGGCG
ATGGCGATGGTTCGAGGTCCCAAGGGGTAATGAACTGATTTCAATTAAAAGTATACATATGTCAGACACGACATGTTGTGATAACCTGCTTTTGAAATCAGGGTATTACT
ACTTGTGCGACGCTGGCTATCCCAACGCGGAGGGATTCCTAGCCCCGTATAGAGGACAACGTTACCACTTAACTGAATGGCGAGGAGGGAATCCACCTACCAGTCCAAAA
GAGTTGTTCAACATGCGACATTCATCTGCTCGAAATGTAATTGAGAGGGCATTCGGAATGTTGAAGGGTCGATGGGCGATATTAAGAGGAAAGTCTTTTTACCCAGTAGA
AGTACAATGTAGAACTATAACTGCATGTTGTTTGCTACATAACTTAATTATACAAGAGATGGGCCCAGACCCCACATTCGAGGAGGCACATACAAGTGACCCTGATTCAA
ATGGGATGAATACAGACAACATTGAGTTTGTAGAGACAACAGATGCCTGGACTGAATGGAGGGAACATATCGCAAACCACATGTGGATGAATTAG
Protein sequence
MICATQYQFVSTFTAIMQGPRRIDNQSPHVRHQIRQLNFFRIIQENDLACRESTRMDRRTFSILCELLKGLGGLTSTQYVDVEEMVAMFLHIIAHDVKNRVVRRQFVRSG
ETVSRHFNVVLSGIVRLHSGILSRHLHRFQIRVDERRWRWFEVPRGNELISIKSIHMSDTTCCDNLLLKSGYYYLCDAGYPNAEGFLAPYRGQRYHLTEWRGGNPPTSPK
ELFNMRHSSARNVIERAFGMLKGRWAILRGKSFYPVEVQCRTITACCLLHNLIIQEMGPDPTFEEAHTSDPDSNGMNTDNIEFVETTDAWTEWREHIANHMWMN
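
The CDS and protein sequences listed above can be cross-checked by translating the CDS. The sketch below is one possible way to do that and is not taken from the database: the function name `translate` is hypothetical, and it assumes the standard nuclear genetic code and a CDS that starts in frame. It is shown on the first 30 and last 15 nucleotides of the CDS rather than the full record.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nucleotides and last 15 nucleotides of the CDS shown above.
print(translate("ATGATATGTGCAACTCAATATCAATTTGTA"))
print(translate("ATGTGGATGAATTAG"))
```

The two fragments translate to `MICATQYQFV` and `MWMN`, which match the start and end of the listed protein sequence. Passing the full CDS (with its line breaks) to the same function would allow a check of the whole record.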