; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0011082 (gene) of Snake gourd v1 genome

Gene IDTan0011082
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionTransposase
Genome locationLG07:47980199..47985951
RNA-Seq ExpressionTan0011082
SyntenyTan0011082
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domainsIPR003653 - Ulp1 protease family, C-terminal catalytic domain
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022136076.1 uncharacterized protein LOC111007859 isoform X1 [Momordica charantia]6.3e-6934.35Show/hide
Query:  SGSSEDEVNVSIQMEARHTNRRGLTSMRGLARVRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSIE-------
        S SS+DE +V I  E +   RRG T+M  L  +R  G+R  I+YN+QGQ +G+NA +MQS+IGVCVRQ+IP++Y  WKEVPQELKDKIF+ +E       
Subjt:  SGSSEDEVNVSIQMEARHTNRRGLTSMRGLARVRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSIE-------

Query:  -----------------------------------------------------------------------------------------------ELAED
                                                                                                       +L+ D
Subjt:  -----------------------------------------------------------------------------------------------ELAED

Query:  PSTRATLWIQARKGKNNEYFDDETKQCVGRI-------SGNKMSTDC-----------------SPS-----------------KKSACIGSN----RPK
        PS RA LW +ARKGKNNEYFDD T++C  RI        G  + T+                  SPS                  KS   GSN    + K
Subjt:  PSTRATLWIQARKGKNNEYFDDETKQCVGRI-------SGNKMSTDC-----------------SPS-----------------KKSACIGSN----RPK

Query:  DKEVIDEVEEILVI--IRILWKYCY---HKMHNVHVWRSISHNPWSS-------------------LRSEDAPLPIPIRGEVESLSQSMRNFVAWPRDLV
         KE+++  EEI V    ++  K C+     + N+    +I  N                       +  E A +PIP+RGE+E+L+Q++  FVAWPR LV
Subjt:  DKEVIDEVEEILVI--IRILWKYCY---HKMHNVHVWRSISHNPWSS-------------------LRSEDAPLPIPIRGEVESLSQSMRNFVAWPRDLV

Query:  IFNKGKKVAS-----------------------------PAKHKSDVSL------------IYLHRDDILHYCGMVEIGYFCIVAYIAYLWTVCDYEIIA
        I ++ K ++S                               +H+  V +            IYL R+DI+ YC M+EIGY CI+ YIAYLW V +YEI  
Subjt:  IFNKGKKVAS-----------------------------PAKHKSDVSL------------IYLHRDDILHYCGMVEIGYFCIVAYIAYLWTVCDYEIIA

Query:  KFLLVDQITISNFVKSQETRCLNLANMLEMVNLDLDQQVFIPYNTGYHWMLIVIHPRANTVYVLNSLRSKIEESFQGTINTVLQCFRLWLA
        KFL+VD  TIS +VKSQE R  NLAN LEMVN  L+Q V IPY +G HWMLI+I+ R N VYVL+SLR KI+E +Q  INT L   ++W A
Subjt:  KFLLVDQITISNFVKSQETRCLNLANMLEMVNLDLDQQVFIPYNTGYHWMLIVIHPRANTVYVLNSLRSKIEESFQGTINTVLQCFRLWLA

XP_022136077.1 uncharacterized protein LOC111007859 isoform X2 [Momordica charantia]4.8e-6934.41Show/hide
Query:  SGSSEDEVNVSIQMEARHTNRRGLTSMRGLARVRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSIE-------
        S SS+DE +V I  E +   RRG T+M  L  +R  G+R  I+YN+QGQ +G+NA +MQS+IGVCVRQ+IP++Y  WKEVPQELKDKIF+ +E       
Subjt:  SGSSEDEVNVSIQMEARHTNRRGLTSMRGLARVRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSIE-------

Query:  ----------------------------------------------------------------------------------------------ELAEDP
                                                                                                      +L+ DP
Subjt:  ----------------------------------------------------------------------------------------------ELAEDP

Query:  STRATLWIQARKGKNNEYFDDETKQCVGRI-------SGNKMSTDC-----------------SPS-----------------KKSACIGSN----RPKD
        S RA LW +ARKGKNNEYFDD T++C  RI        G  + T+                  SPS                  KS   GSN    + K 
Subjt:  STRATLWIQARKGKNNEYFDDETKQCVGRI-------SGNKMSTDC-----------------SPS-----------------KKSACIGSN----RPKD

Query:  KEVIDEVEEILVI--IRILWKYCY---HKMHNVHVWRSISHNPWSS-------------------LRSEDAPLPIPIRGEVESLSQSMRNFVAWPRDLVI
        KE+++  EEI V    ++  K C+     + N+    +I  N                       +  E A +PIP+RGE+E+L+Q++  FVAWPR LVI
Subjt:  KEVIDEVEEILVI--IRILWKYCY---HKMHNVHVWRSISHNPWSS-------------------LRSEDAPLPIPIRGEVESLSQSMRNFVAWPRDLVI

Query:  FNKGKKVAS-----------------------------PAKHKSDVSL------------IYLHRDDILHYCGMVEIGYFCIVAYIAYLWTVCDYEIIAK
         ++ K ++S                               +H+  V +            IYL R+DI+ YC M+EIGY CI+ YIAYLW V +YEI  K
Subjt:  FNKGKKVAS-----------------------------PAKHKSDVSL------------IYLHRDDILHYCGMVEIGYFCIVAYIAYLWTVCDYEIIAK

Query:  FLLVDQITISNFVKSQETRCLNLANMLEMVNLDLDQQVFIPYNTGYHWMLIVIHPRANTVYVLNSLRSKIEESFQGTINTVLQCFRLWLA
        FL+VD  TIS +VKSQE R  NLAN LEMVN  L+Q V IPY +G HWMLI+I+ R N VYVL+SLR KI+E +Q  INT L   ++W A
Subjt:  FLLVDQITISNFVKSQETRCLNLANMLEMVNLDLDQQVFIPYNTGYHWMLIVIHPRANTVYVLNSLRSKIEESFQGTINTVLQCFRLWLA

XP_022136080.1 uncharacterized protein LOC111007859 isoform X4 [Momordica charantia]6.3e-6934.35Show/hide
Query:  SGSSEDEVNVSIQMEARHTNRRGLTSMRGLARVRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSIE-------
        S SS+DE +V I  E +   RRG T+M  L  +R  G+R  I+YN+QGQ +G+NA +MQS+IGVCVRQ+IP++Y  WKEVPQELKDKIF+ +E       
Subjt:  SGSSEDEVNVSIQMEARHTNRRGLTSMRGLARVRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSIE-------

Query:  -----------------------------------------------------------------------------------------------ELAED
                                                                                                       +L+ D
Subjt:  -----------------------------------------------------------------------------------------------ELAED

Query:  PSTRATLWIQARKGKNNEYFDDETKQCVGRI-------SGNKMSTDC-----------------SPS-----------------KKSACIGSN----RPK
        PS RA LW +ARKGKNNEYFDD T++C  RI        G  + T+                  SPS                  KS   GSN    + K
Subjt:  PSTRATLWIQARKGKNNEYFDDETKQCVGRI-------SGNKMSTDC-----------------SPS-----------------KKSACIGSN----RPK

Query:  DKEVIDEVEEILVI--IRILWKYCY---HKMHNVHVWRSISHNPWSS-------------------LRSEDAPLPIPIRGEVESLSQSMRNFVAWPRDLV
         KE+++  EEI V    ++  K C+     + N+    +I  N                       +  E A +PIP+RGE+E+L+Q++  FVAWPR LV
Subjt:  DKEVIDEVEEILVI--IRILWKYCY---HKMHNVHVWRSISHNPWSS-------------------LRSEDAPLPIPIRGEVESLSQSMRNFVAWPRDLV

Query:  IFNKGKKVAS-----------------------------PAKHKSDVSL------------IYLHRDDILHYCGMVEIGYFCIVAYIAYLWTVCDYEIIA
        I ++ K ++S                               +H+  V +            IYL R+DI+ YC M+EIGY CI+ YIAYLW V +YEI  
Subjt:  IFNKGKKVAS-----------------------------PAKHKSDVSL------------IYLHRDDILHYCGMVEIGYFCIVAYIAYLWTVCDYEIIA

Query:  KFLLVDQITISNFVKSQETRCLNLANMLEMVNLDLDQQVFIPYNTGYHWMLIVIHPRANTVYVLNSLRSKIEESFQGTINTVLQCFRLWLA
        KFL+VD  TIS +VKSQE R  NLAN LEMVN  L+Q V IPY +G HWMLI+I+ R N VYVL+SLR KI+E +Q  INT L   ++W A
Subjt:  KFLLVDQITISNFVKSQETRCLNLANMLEMVNLDLDQQVFIPYNTGYHWMLIVIHPRANTVYVLNSLRSKIEESFQGTINTVLQCFRLWLA

XP_038895921.1 uncharacterized protein LOC120084092 isoform X1 [Benincasa hispida]8.8e-6333.17Show/hide
Query:  SGSSEDEVNVSIQMEARHTNRRGLTSMRGLARVRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSIE-------
        S SS+DE NV I+ E + T RRG T M  L  +R +GER  I+YN+ GQ VG+NA +MQS+IGVCVRQQIPL+YK+WK VPQELKD IFD I+       
Subjt:  SGSSEDEVNVSIQMEARHTNRRGLTSMRGLARVRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSIE-------

Query:  -----------------------------------------------------------------------------------------------ELAED
                                                                                                       EL+ D
Subjt:  -----------------------------------------------------------------------------------------------ELAED

Query:  PSTRATLWIQARKGKNNEYFDDETKQCVGRI---------------------------------------------------SGNKMSTDCSPSKKSACI
        P  RATLW +ARK KNNEY D  T++C  RI                                                   S N+  T  S  K     
Subjt:  PSTRATLWIQARKGKNNEYFDDETKQCVGRI---------------------------------------------------SGNKMSTDCSPSKKSACI

Query:  GSNR--------------------------------PKDKEVIDEVEEILVIIRILWKYCYHKMHNVH-------------VWRSISHNPWSS-------
          ++                                PK K V+ + EEIL  I      C+  + +V                 SI+  P          
Subjt:  GSNR--------------------------------PKDKEVIDEVEEILVIIRILWKYCYHKMHNVH-------------VWRSISHNPWSS-------

Query:  --LRSEDAPLPIPIRGEVESLSQSMRNFVAWPRDLVIFNKGKKVASPAKHKS--------DVSL---------------------------------IYL
          +  ED  LPIP + ++++L Q++ NFVAWPR LVI  K KK  SP   KS        DV +                                 IYL
Subjt:  --LRSEDAPLPIPIRGEVESLSQSMRNFVAWPRDLVIFNKGKKVASPAKHKS--------DVSL---------------------------------IYL

Query:  HRDDILHYCGMVEIGYFCIVAYIAYLWTVCDYEIIAKFLLVDQITISNFVKSQETRCLNLANMLEMVNLDLDQQVFIPYNTG-YHWMLIVIHPRANTVYV
         RDDI+ YCGM EIGY CI+AYIA LW  CD EI  KF++VDQ TIS+ VK QE R  NL N LEMV+  LDQ V IPYNTG  HW+LI+I+ + N VYV
Subjt:  HRDDILHYCGMVEIGYFCIVAYIAYLWTVCDYEIIAKFLLVDQITISNFVKSQETRCLNLANMLEMVNLDLDQQVFIPYNTG-YHWMLIVIHPRANTVYV

Query:  LNSLRSKIEESFQGTINTVLQ
        ++SLRSKI E FQG INT L+
Subjt:  LNSLRSKIEESFQGTINTVLQ

XP_038895930.1 uncharacterized protein LOC120084092 isoform X2 [Benincasa hispida]3.6e-6433.23Show/hide
Query:  SGSSEDEVNVSIQMEARHTNRRGLTSMRGLARVRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSIE-------
        S SS+DE NV I+ E + T RRG T M  L  +R +GER  I+YN+ GQ VG+NA +MQS+IGVCVRQQIPL+YK+WK VPQELKD IFD I+       
Subjt:  SGSSEDEVNVSIQMEARHTNRRGLTSMRGLARVRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSIE-------

Query:  -----------------------------------------------------------------------------------------------ELAED
                                                                                                       EL+ D
Subjt:  -----------------------------------------------------------------------------------------------ELAED

Query:  PSTRATLWIQARKGKNNEYFDDETKQCVGRI---------------------------------------------------SGNKMSTDCSPSKKSACI
        P  RATLW +ARK KNNEY D  T++C  RI                                                   S N+  T  S  K     
Subjt:  PSTRATLWIQARKGKNNEYFDDETKQCVGRI---------------------------------------------------SGNKMSTDCSPSKKSACI

Query:  GSNR--------------------------------PKDKEVIDEVEEILVIIRILWKYCYHKMHNVH-------------VWRSISHNPWSS-------
          ++                                PK K V+ + EEIL  I      C+  + +V                 SI+  P          
Subjt:  GSNR--------------------------------PKDKEVIDEVEEILVIIRILWKYCYHKMHNVH-------------VWRSISHNPWSS-------

Query:  --LRSEDAPLPIPIRGEVESLSQSMRNFVAWPRDLVIFNKGKKVASPAKHKS--------DVSL---------------------------------IYL
          +  ED  LPIP + ++++L Q++ NFVAWPR LVI  K KK  SP   KS        DV +                                 IYL
Subjt:  --LRSEDAPLPIPIRGEVESLSQSMRNFVAWPRDLVIFNKGKKVASPAKHKS--------DVSL---------------------------------IYL

Query:  HRDDILHYCGMVEIGYFCIVAYIAYLWTVCDYEIIAKFLLVDQITISNFVKSQETRCLNLANMLEMVNLDLDQQVFIPYNTGYHWMLIVIHPRANTVYVL
         RDDI+ YCGM EIGY CI+AYIA LW  CD EI  KF++VDQ TIS+ VK QE R  NL N LEMV+  LDQ V IPYNTG HW+LI+I+ + N VYV+
Subjt:  HRDDILHYCGMVEIGYFCIVAYIAYLWTVCDYEIIAKFLLVDQITISNFVKSQETRCLNLANMLEMVNLDLDQQVFIPYNTGYHWMLIVIHPRANTVYVL

Query:  NSLRSKIEESFQGTINTVLQ
        +SLRSKI E FQG INT L+
Subjt:  NSLRSKIEESFQGTINTVLQ

TrEMBL top hitse value%identityAlignment
A0A1S3BRX5 uncharacterized protein LOC103493028 isoform X11.5e-6032.06Show/hide
Query:  SSEDEVNVSIQMEARHTNRRGLTSMRGLARVRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSIE---------
        SS+DE NV I+ E + T RRG T M  L  +R +GER  I+YN++GQ VG+NA +MQS+IGVCVRQQIP++Y +WKEVPQELKD IFD I+         
Subjt:  SSEDEVNVSIQMEARHTNRRGLTSMRGLARVRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSIE---------

Query:  ---------------------------------------------------------------------------------------------ELAEDPS
                                                                                                     EL+ DP 
Subjt:  ---------------------------------------------------------------------------------------------ELAEDPS

Query:  TRATLWIQARKGKNNEYFDDETKQCVGRIS------------------------------------------GNKMSTDCSPSK----------------
         RATLW +ARK KNN  FDD T++CV RI                                           GN   +  S  K                
Subjt:  TRATLWIQARKGKNNEYFDDETKQCVGRIS------------------------------------------GNKMSTDCSPSK----------------

Query:  ----------------------------KSACIGSNRPKDKEVIDEVEEILVIIRILWK--------------------YCYHKMHNVHVWRSISHN-PW
                                    K    G   PK K V+ E EE L  +++L +                        KM    V     H  P 
Subjt:  ----------------------------KSACIGSNRPKDKEVIDEVEEILVIIRILWK--------------------YCYHKMHNVHVWRSISHN-PW

Query:  SS---------LRSEDAPLPIPIRGEVESLSQSMRNFVAWPRDLVIFNKGKKVASPAKHKS-------------------------------DVSL----
         +            ED  LPIP++G++E+L+Q++ NFVAWPR LVI  K KK  S    +S                                +SL    
Subjt:  SS---------LRSEDAPLPIPIRGEVESLSQSMRNFVAWPRDLVIFNKGKKVASPAKHKS-------------------------------DVSL----

Query:  ------IYLHRDDILHYCGMVEIGYFCIVAYIAYLWTVCDYEIIAKFLLVDQITISNFVKSQETRCLNLANMLEMVNLDLDQQVFIPYNTG-YHWMLIVI
              IYL RDDI+ YCGM EIGY CI+ YIA LW VC+ EI  +F+LVDQ TIS+ +KSQE R  NL N LEM N  LDQ V IPYNTG  HW+LI+I
Subjt:  ------IYLHRDDILHYCGMVEIGYFCIVAYIAYLWTVCDYEIIAKFLLVDQITISNFVKSQETRCLNLANMLEMVNLDLDQQVFIPYNTG-YHWMLIVI

Query:  HPRANTVYVLNSLRSKIEESFQGTINTVLQ
          + N VYV++ LRSKI   FQG IN  L+
Subjt:  HPRANTVYVLNSLRSKIEESFQGTINTVLQ

A0A5D3CYL9 ULP_PROTEASE domain-containing protein1.5e-6032.06Show/hide
Query:  SSEDEVNVSIQMEARHTNRRGLTSMRGLARVRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSIE---------
        SS+DE NV I+ E + T RRG T M  L  +R +GER  I+YN++GQ VG+NA +MQS+IGVCVRQQIP++Y +WKEVPQELKD IFD I+         
Subjt:  SSEDEVNVSIQMEARHTNRRGLTSMRGLARVRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSIE---------

Query:  ---------------------------------------------------------------------------------------------ELAEDPS
                                                                                                     EL+ DP 
Subjt:  ---------------------------------------------------------------------------------------------ELAEDPS

Query:  TRATLWIQARKGKNNEYFDDETKQCVGRIS------------------------------------------GNKMSTDCSPSK----------------
         RATLW +ARK KNN  FDD T++CV RI                                           GN   +  S  K                
Subjt:  TRATLWIQARKGKNNEYFDDETKQCVGRIS------------------------------------------GNKMSTDCSPSK----------------

Query:  ----------------------------KSACIGSNRPKDKEVIDEVEEILVIIRILWK--------------------YCYHKMHNVHVWRSISHN-PW
                                    K    G   PK K V+ E EE L  +++L +                        KM    V     H  P 
Subjt:  ----------------------------KSACIGSNRPKDKEVIDEVEEILVIIRILWK--------------------YCYHKMHNVHVWRSISHN-PW

Query:  SS---------LRSEDAPLPIPIRGEVESLSQSMRNFVAWPRDLVIFNKGKKVASPAKHKS-------------------------------DVSL----
         +            ED  LPIP++G++E+L+Q++ NFVAWPR LVI  K KK  S    +S                                +SL    
Subjt:  SS---------LRSEDAPLPIPIRGEVESLSQSMRNFVAWPRDLVIFNKGKKVASPAKHKS-------------------------------DVSL----

Query:  ------IYLHRDDILHYCGMVEIGYFCIVAYIAYLWTVCDYEIIAKFLLVDQITISNFVKSQETRCLNLANMLEMVNLDLDQQVFIPYNTG-YHWMLIVI
              IYL RDDI+ YCGM EIGY CI+ YIA LW VC+ EI  +F+LVDQ TIS+ +KSQE R  NL N LEM N  LDQ V IPYNTG  HW+LI+I
Subjt:  ------IYLHRDDILHYCGMVEIGYFCIVAYIAYLWTVCDYEIIAKFLLVDQITISNFVKSQETRCLNLANMLEMVNLDLDQQVFIPYNTG-YHWMLIVI

Query:  HPRANTVYVLNSLRSKIEESFQGTINTVLQ
          + N VYV++ LRSKI   FQG IN  L+
Subjt:  HPRANTVYVLNSLRSKIEESFQGTINTVLQ

A0A6J1C2H7 uncharacterized protein LOC111007859 isoform X13.0e-6934.35Show/hide
Query:  SGSSEDEVNVSIQMEARHTNRRGLTSMRGLARVRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSIE-------
        S SS+DE +V I  E +   RRG T+M  L  +R  G+R  I+YN+QGQ +G+NA +MQS+IGVCVRQ+IP++Y  WKEVPQELKDKIF+ +E       
Subjt:  SGSSEDEVNVSIQMEARHTNRRGLTSMRGLARVRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSIE-------

Query:  -----------------------------------------------------------------------------------------------ELAED
                                                                                                       +L+ D
Subjt:  -----------------------------------------------------------------------------------------------ELAED

Query:  PSTRATLWIQARKGKNNEYFDDETKQCVGRI-------SGNKMSTDC-----------------SPS-----------------KKSACIGSN----RPK
        PS RA LW +ARKGKNNEYFDD T++C  RI        G  + T+                  SPS                  KS   GSN    + K
Subjt:  PSTRATLWIQARKGKNNEYFDDETKQCVGRI-------SGNKMSTDC-----------------SPS-----------------KKSACIGSN----RPK

Query:  DKEVIDEVEEILVI--IRILWKYCY---HKMHNVHVWRSISHNPWSS-------------------LRSEDAPLPIPIRGEVESLSQSMRNFVAWPRDLV
         KE+++  EEI V    ++  K C+     + N+    +I  N                       +  E A +PIP+RGE+E+L+Q++  FVAWPR LV
Subjt:  DKEVIDEVEEILVI--IRILWKYCY---HKMHNVHVWRSISHNPWSS-------------------LRSEDAPLPIPIRGEVESLSQSMRNFVAWPRDLV

Query:  IFNKGKKVAS-----------------------------PAKHKSDVSL------------IYLHRDDILHYCGMVEIGYFCIVAYIAYLWTVCDYEIIA
        I ++ K ++S                               +H+  V +            IYL R+DI+ YC M+EIGY CI+ YIAYLW V +YEI  
Subjt:  IFNKGKKVAS-----------------------------PAKHKSDVSL------------IYLHRDDILHYCGMVEIGYFCIVAYIAYLWTVCDYEIIA

Query:  KFLLVDQITISNFVKSQETRCLNLANMLEMVNLDLDQQVFIPYNTGYHWMLIVIHPRANTVYVLNSLRSKIEESFQGTINTVLQCFRLWLA
        KFL+VD  TIS +VKSQE R  NLAN LEMVN  L+Q V IPY +G HWMLI+I+ R N VYVL+SLR KI+E +Q  INT L   ++W A
Subjt:  KFLLVDQITISNFVKSQETRCLNLANMLEMVNLDLDQQVFIPYNTGYHWMLIVIHPRANTVYVLNSLRSKIEESFQGTINTVLQCFRLWLA

A0A6J1C2V2 uncharacterized protein LOC111007859 isoform X43.0e-6934.35Show/hide
Query:  SGSSEDEVNVSIQMEARHTNRRGLTSMRGLARVRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSIE-------
        S SS+DE +V I  E +   RRG T+M  L  +R  G+R  I+YN+QGQ +G+NA +MQS+IGVCVRQ+IP++Y  WKEVPQELKDKIF+ +E       
Subjt:  SGSSEDEVNVSIQMEARHTNRRGLTSMRGLARVRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSIE-------

Query:  -----------------------------------------------------------------------------------------------ELAED
                                                                                                       +L+ D
Subjt:  -----------------------------------------------------------------------------------------------ELAED

Query:  PSTRATLWIQARKGKNNEYFDDETKQCVGRI-------SGNKMSTDC-----------------SPS-----------------KKSACIGSN----RPK
        PS RA LW +ARKGKNNEYFDD T++C  RI        G  + T+                  SPS                  KS   GSN    + K
Subjt:  PSTRATLWIQARKGKNNEYFDDETKQCVGRI-------SGNKMSTDC-----------------SPS-----------------KKSACIGSN----RPK

Query:  DKEVIDEVEEILVI--IRILWKYCY---HKMHNVHVWRSISHNPWSS-------------------LRSEDAPLPIPIRGEVESLSQSMRNFVAWPRDLV
         KE+++  EEI V    ++  K C+     + N+    +I  N                       +  E A +PIP+RGE+E+L+Q++  FVAWPR LV
Subjt:  DKEVIDEVEEILVI--IRILWKYCY---HKMHNVHVWRSISHNPWSS-------------------LRSEDAPLPIPIRGEVESLSQSMRNFVAWPRDLV

Query:  IFNKGKKVAS-----------------------------PAKHKSDVSL------------IYLHRDDILHYCGMVEIGYFCIVAYIAYLWTVCDYEIIA
        I ++ K ++S                               +H+  V +            IYL R+DI+ YC M+EIGY CI+ YIAYLW V +YEI  
Subjt:  IFNKGKKVAS-----------------------------PAKHKSDVSL------------IYLHRDDILHYCGMVEIGYFCIVAYIAYLWTVCDYEIIA

Query:  KFLLVDQITISNFVKSQETRCLNLANMLEMVNLDLDQQVFIPYNTGYHWMLIVIHPRANTVYVLNSLRSKIEESFQGTINTVLQCFRLWLA
        KFL+VD  TIS +VKSQE R  NLAN LEMVN  L+Q V IPY +G HWMLI+I+ R N VYVL+SLR KI+E +Q  INT L   ++W A
Subjt:  KFLLVDQITISNFVKSQETRCLNLANMLEMVNLDLDQQVFIPYNTGYHWMLIVIHPRANTVYVLNSLRSKIEESFQGTINTVLQCFRLWLA

A0A6J1C4J7 uncharacterized protein LOC111007859 isoform X22.3e-6934.41Show/hide
Query:  SGSSEDEVNVSIQMEARHTNRRGLTSMRGLARVRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSIE-------
        S SS+DE +V I  E +   RRG T+M  L  +R  G+R  I+YN+QGQ +G+NA +MQS+IGVCVRQ+IP++Y  WKEVPQELKDKIF+ +E       
Subjt:  SGSSEDEVNVSIQMEARHTNRRGLTSMRGLARVRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSIE-------

Query:  ----------------------------------------------------------------------------------------------ELAEDP
                                                                                                      +L+ DP
Subjt:  ----------------------------------------------------------------------------------------------ELAEDP

Query:  STRATLWIQARKGKNNEYFDDETKQCVGRI-------SGNKMSTDC-----------------SPS-----------------KKSACIGSN----RPKD
        S RA LW +ARKGKNNEYFDD T++C  RI        G  + T+                  SPS                  KS   GSN    + K 
Subjt:  STRATLWIQARKGKNNEYFDDETKQCVGRI-------SGNKMSTDC-----------------SPS-----------------KKSACIGSN----RPKD

Query:  KEVIDEVEEILVI--IRILWKYCY---HKMHNVHVWRSISHNPWSS-------------------LRSEDAPLPIPIRGEVESLSQSMRNFVAWPRDLVI
        KE+++  EEI V    ++  K C+     + N+    +I  N                       +  E A +PIP+RGE+E+L+Q++  FVAWPR LVI
Subjt:  KEVIDEVEEILVI--IRILWKYCY---HKMHNVHVWRSISHNPWSS-------------------LRSEDAPLPIPIRGEVESLSQSMRNFVAWPRDLVI

Query:  FNKGKKVAS-----------------------------PAKHKSDVSL------------IYLHRDDILHYCGMVEIGYFCIVAYIAYLWTVCDYEIIAK
         ++ K ++S                               +H+  V +            IYL R+DI+ YC M+EIGY CI+ YIAYLW V +YEI  K
Subjt:  FNKGKKVAS-----------------------------PAKHKSDVSL------------IYLHRDDILHYCGMVEIGYFCIVAYIAYLWTVCDYEIIAK

Query:  FLLVDQITISNFVKSQETRCLNLANMLEMVNLDLDQQVFIPYNTGYHWMLIVIHPRANTVYVLNSLRSKIEESFQGTINTVLQCFRLWLA
        FL+VD  TIS +VKSQE R  NLAN LEMVN  L+Q V IPY +G HWMLI+I+ R N VYVL+SLR KI+E +Q  INT L   ++W A
Subjt:  FLLVDQITISNFVKSQETRCLNLANMLEMVNLDLDQQVFIPYNTGYHWMLIVIHPRANTVYVLNSLRSKIEESFQGTINTVLQCFRLWLA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTGGATCAAGTGAAGATGAAGTGAACGTATCGATCCAAATGGAGGCTAGGCATACTAATCGACGTGGTCTCACTAGTATGCGTGGTCTGGCACGCGTAAGGACTAC
AGGAGAACGCTTAGTCATCCAATACAACAATCAAGGCCAGAGTGTTGGTGATAATGCAAACCAAATGCAAAGTTATATTGGAGTTTGCGTTAGGCAACAAATTCCATTAA
GTTACAAGACTTGGAAAGAAGTTCCCCAAGAATTGAAAGATAAAATTTTTGATTCTATAGAGGAATTGGCGGAAGATCCTTCCACTCGTGCCACCTTATGGATACAGGCA
CGAAAAGGAAAAAATAATGAATATTTCGATGATGAAACTAAACAATGCGTTGGTCGAATCTCTGGCAACAAAATGTCGACTGATTGCTCACCGTCCAAAAAGTCTGCATG
CATAGGCAGTAATCGTCCAAAAGACAAGGAGGTCATTGACGAGGTGGAAGAAATTTTAGTGATAATTAGGATTTTGTGGAAATATTGTTATCATAAGATGCACAATGTAC
ACGTCTGGCGCTCAATTTCCCACAATCCATGGAGTTCCCTTAGGAGTGAAGATGCTCCATTACCAATTCCTATACGGGGAGAAGTAGAGTCCCTGAGTCAATCAATGAGA
AATTTTGTGGCATGGCCTCGTGACCTTGTCATTTTTAATAAGGGGAAAAAGGTGGCTTCTCCAGCAAAACATAAGTCGGATGTTTCTTTAATTTATTTACATCGCGATGA
TATCCTGCATTATTGTGGGATGGTGGAGATAGGGTACTTCTGCATAGTCGCATACATTGCGTATCTTTGGACTGTATGTGACTATGAAATAATCGCCAAGTTCTTGCTAG
TTGATCAAATAACCATTTCTAATTTTGTTAAAAGTCAAGAAACACGTTGTCTAAATCTGGCTAACATGTTAGAAATGGTTAATTTGGACTTGGATCAACAAGTTTTCATC
CCATATAATACTGGATATCATTGGATGTTGATCGTTATCCATCCGCGGGCAAACACCGTTTATGTCTTAAACTCGTTGAGGAGTAAGATCGAAGAAAGTTTTCAAGGAAC
AATAAATACTGTTCTGCAATGTTTCAGGTTATGGTTAGCTGATGTTCTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGTGGATCAAGTGAAGATGAAGTGAACGTATCGATCCAAATGGAGGCTAGGCATACTAATCGACGTGGTCTCACTAGTATGCGTGGTCTGGCACGCGTAAGGACTAC
AGGAGAACGCTTAGTCATCCAATACAACAATCAAGGCCAGAGTGTTGGTGATAATGCAAACCAAATGCAAAGTTATATTGGAGTTTGCGTTAGGCAACAAATTCCATTAA
GTTACAAGACTTGGAAAGAAGTTCCCCAAGAATTGAAAGATAAAATTTTTGATTCTATAGAGGAATTGGCGGAAGATCCTTCCACTCGTGCCACCTTATGGATACAGGCA
CGAAAAGGAAAAAATAATGAATATTTCGATGATGAAACTAAACAATGCGTTGGTCGAATCTCTGGCAACAAAATGTCGACTGATTGCTCACCGTCCAAAAAGTCTGCATG
CATAGGCAGTAATCGTCCAAAAGACAAGGAGGTCATTGACGAGGTGGAAGAAATTTTAGTGATAATTAGGATTTTGTGGAAATATTGTTATCATAAGATGCACAATGTAC
ACGTCTGGCGCTCAATTTCCCACAATCCATGGAGTTCCCTTAGGAGTGAAGATGCTCCATTACCAATTCCTATACGGGGAGAAGTAGAGTCCCTGAGTCAATCAATGAGA
AATTTTGTGGCATGGCCTCGTGACCTTGTCATTTTTAATAAGGGGAAAAAGGTGGCTTCTCCAGCAAAACATAAGTCGGATGTTTCTTTAATTTATTTACATCGCGATGA
TATCCTGCATTATTGTGGGATGGTGGAGATAGGGTACTTCTGCATAGTCGCATACATTGCGTATCTTTGGACTGTATGTGACTATGAAATAATCGCCAAGTTCTTGCTAG
TTGATCAAATAACCATTTCTAATTTTGTTAAAAGTCAAGAAACACGTTGTCTAAATCTGGCTAACATGTTAGAAATGGTTAATTTGGACTTGGATCAACAAGTTTTCATC
CCATATAATACTGGATATCATTGGATGTTGATCGTTATCCATCCGCGGGCAAACACCGTTTATGTCTTAAACTCGTTGAGGAGTAAGATCGAAGAAAGTTTTCAAGGAAC
AATAAATACTGTTCTGCAATGTTTCAGGTTATGGTTAGCTGATGTTCTTTAA
Protein sequenceShow/hide protein sequence
MSGSSEDEVNVSIQMEARHTNRRGLTSMRGLARVRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSIEELAEDPSTRATLWIQA
RKGKNNEYFDDETKQCVGRISGNKMSTDCSPSKKSACIGSNRPKDKEVIDEVEEILVIIRILWKYCYHKMHNVHVWRSISHNPWSSLRSEDAPLPIPIRGEVESLSQSMR
NFVAWPRDLVIFNKGKKVASPAKHKSDVSLIYLHRDDILHYCGMVEIGYFCIVAYIAYLWTVCDYEIIAKFLLVDQITISNFVKSQETRCLNLANMLEMVNLDLDQQVFI
PYNTGYHWMLIVIHPRANTVYVLNSLRSKIEESFQGTINTVLQCFRLWLADVL