; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0001619 (gene) of Snake gourd v1 genome

Gene IDTan0001619
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionTransposase
Genome locationLG10:21591935..21607973
RNA-Seq ExpressionTan0001619
SyntenyTan0001619
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domainsIPR003653 - Ulp1 protease family, C-terminal catalytic domain
IPR004264 - Transposase, Tnp1/En/Spm-like
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022136076.1 uncharacterized protein LOC111007859 isoform X1 [Momordica charantia]5.6e-8437.09Show/hide
Query:  SGSSDDEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVE-------
        S SS DE +V I  E +   RRG TTM  L  +R  G+R  I+YN+QGQ +G+NA +MQS+IGVCVRQ+IP++Y  WKEVPQELKDKIF+ VE       
Subjt:  SGSSDDEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVE-------

Query:  -----------------------------------------------------------------------------------------------ELAED
                                                                                                       +L+ D
Subjt:  -----------------------------------------------------------------------------------------------ELAED

Query:  PSTRATLWIQGRKGKNNEYFDEDTKQCAGQI-------SGNKMSTDC-----------------SPS-----------------KKSTSIGTN----HPK
        PS RA LW + RKGKNNEYFD+ T++CA +I        G  + T+                  SPS                  KST+ G+N      K
Subjt:  PSTRATLWIQGRKGKNNEYFDEDTKQCAGQI-------SGNKMSTDC-----------------SPS-----------------KKSTSIGTN----HPK

Query:  DKEVIDEVEEI-------LEGTPCHLAIGSKDNVVAVGIMYTSDAQFLTVHGVPLGVENIRVVVDMIVGENAPLPIPIRGEVESLSQSMRNFVAWPRDLV
         KE+++  EEI       +EG PCHLA+ S DN+VAVG ++ ++ Q  TVHGVPLGV+N+RV+VD+++ E A +PIP+RGE+E+L+Q++  FVAWPR LV
Subjt:  DKEVIDEVEEI-------LEGTPCHLAIGSKDNVVAVGIMYTSDAQFLTVHGVPLGVENIRVVVDMIVGENAPLPIPIRGEVESLSQSMRNFVAWPRDLV

Query:  -----------------ASPAKHKSDVYL---------------------------------IYLHRDDILHYCGMVKIGYSCIVAYIAYLWTVCDYEII
                            +KH +DV++                                 IYL R+DI+ YC M++IGYSCI+ YIAYLW V +YEI 
Subjt:  -----------------ASPAKHKSDVYL---------------------------------IYLHRDDILHYCGMVKIGYSCIVAYIAYLWTVCDYEII

Query:  AKFLLVDQIIISNFIKSQETLCINLANRLKMIYLYLDQQVFIPYNTGYHWMLIVIHPRANAVYVLNSLRSKIEESFQGTINIKHK
         KFL+VD   IS ++KSQE    NLANRL+M+   L+Q V IPY +G HWMLI+I+ R N VYVL+SLR KI+E +Q  IN   K
Subjt:  AKFLLVDQIIISNFIKSQETLCINLANRLKMIYLYLDQQVFIPYNTGYHWMLIVIHPRANAVYVLNSLRSKIEESFQGTINIKHK

XP_022136077.1 uncharacterized protein LOC111007859 isoform X2 [Momordica charantia]4.3e-8437.16Show/hide
Query:  SGSSDDEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVE-------
        S SS DE +V I  E +   RRG TTM  L  +R  G+R  I+YN+QGQ +G+NA +MQS+IGVCVRQ+IP++Y  WKEVPQELKDKIF+ VE       
Subjt:  SGSSDDEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVE-------

Query:  ----------------------------------------------------------------------------------------------ELAEDP
                                                                                                      +L+ DP
Subjt:  ----------------------------------------------------------------------------------------------ELAEDP

Query:  STRATLWIQGRKGKNNEYFDEDTKQCAGQI-------SGNKMSTDC-----------------SPS-----------------KKSTSIGTN----HPKD
        S RA LW + RKGKNNEYFD+ T++CA +I        G  + T+                  SPS                  KST+ G+N      K 
Subjt:  STRATLWIQGRKGKNNEYFDEDTKQCAGQI-------SGNKMSTDC-----------------SPS-----------------KKSTSIGTN----HPKD

Query:  KEVIDEVEEI-------LEGTPCHLAIGSKDNVVAVGIMYTSDAQFLTVHGVPLGVENIRVVVDMIVGENAPLPIPIRGEVESLSQSMRNFVAWPRDLV-
        KE+++  EEI       +EG PCHLA+ S DN+VAVG ++ ++ Q  TVHGVPLGV+N+RV+VD+++ E A +PIP+RGE+E+L+Q++  FVAWPR LV 
Subjt:  KEVIDEVEEI-------LEGTPCHLAIGSKDNVVAVGIMYTSDAQFLTVHGVPLGVENIRVVVDMIVGENAPLPIPIRGEVESLSQSMRNFVAWPRDLV-

Query:  ----------------ASPAKHKSDVYL---------------------------------IYLHRDDILHYCGMVKIGYSCIVAYIAYLWTVCDYEIIA
                           +KH +DV++                                 IYL R+DI+ YC M++IGYSCI+ YIAYLW V +YEI  
Subjt:  ----------------ASPAKHKSDVYL---------------------------------IYLHRDDILHYCGMVKIGYSCIVAYIAYLWTVCDYEIIA

Query:  KFLLVDQIIISNFIKSQETLCINLANRLKMIYLYLDQQVFIPYNTGYHWMLIVIHPRANAVYVLNSLRSKIEESFQGTINIKHK
        KFL+VD   IS ++KSQE    NLANRL+M+   L+Q V IPY +G HWMLI+I+ R N VYVL+SLR KI+E +Q  IN   K
Subjt:  KFLLVDQIIISNFIKSQETLCINLANRLKMIYLYLDQQVFIPYNTGYHWMLIVIHPRANAVYVLNSLRSKIEESFQGTINIKHK

XP_022136080.1 uncharacterized protein LOC111007859 isoform X4 [Momordica charantia]5.6e-8437.09Show/hide
Query:  SGSSDDEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVE-------
        S SS DE +V I  E +   RRG TTM  L  +R  G+R  I+YN+QGQ +G+NA +MQS+IGVCVRQ+IP++Y  WKEVPQELKDKIF+ VE       
Subjt:  SGSSDDEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVE-------

Query:  -----------------------------------------------------------------------------------------------ELAED
                                                                                                       +L+ D
Subjt:  -----------------------------------------------------------------------------------------------ELAED

Query:  PSTRATLWIQGRKGKNNEYFDEDTKQCAGQI-------SGNKMSTDC-----------------SPS-----------------KKSTSIGTN----HPK
        PS RA LW + RKGKNNEYFD+ T++CA +I        G  + T+                  SPS                  KST+ G+N      K
Subjt:  PSTRATLWIQGRKGKNNEYFDEDTKQCAGQI-------SGNKMSTDC-----------------SPS-----------------KKSTSIGTN----HPK

Query:  DKEVIDEVEEI-------LEGTPCHLAIGSKDNVVAVGIMYTSDAQFLTVHGVPLGVENIRVVVDMIVGENAPLPIPIRGEVESLSQSMRNFVAWPRDLV
         KE+++  EEI       +EG PCHLA+ S DN+VAVG ++ ++ Q  TVHGVPLGV+N+RV+VD+++ E A +PIP+RGE+E+L+Q++  FVAWPR LV
Subjt:  DKEVIDEVEEI-------LEGTPCHLAIGSKDNVVAVGIMYTSDAQFLTVHGVPLGVENIRVVVDMIVGENAPLPIPIRGEVESLSQSMRNFVAWPRDLV

Query:  -----------------ASPAKHKSDVYL---------------------------------IYLHRDDILHYCGMVKIGYSCIVAYIAYLWTVCDYEII
                            +KH +DV++                                 IYL R+DI+ YC M++IGYSCI+ YIAYLW V +YEI 
Subjt:  -----------------ASPAKHKSDVYL---------------------------------IYLHRDDILHYCGMVKIGYSCIVAYIAYLWTVCDYEII

Query:  AKFLLVDQIIISNFIKSQETLCINLANRLKMIYLYLDQQVFIPYNTGYHWMLIVIHPRANAVYVLNSLRSKIEESFQGTINIKHK
         KFL+VD   IS ++KSQE    NLANRL+M+   L+Q V IPY +G HWMLI+I+ R N VYVL+SLR KI+E +Q  IN   K
Subjt:  AKFLLVDQIIISNFIKSQETLCINLANRLKMIYLYLDQQVFIPYNTGYHWMLIVIHPRANAVYVLNSLRSKIEESFQGTINIKHK

XP_038895921.1 uncharacterized protein LOC120084092 isoform X1 [Benincasa hispida]8.3e-8034.87Show/hide
Query:  SGSSDDEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVE-------
        S SS DE NV I+ E + T RRG T M  L  +R +GER  I+YN+ GQ VG+NA +MQS+IGVCVRQQIPL+YK+WK VPQELKD IFD ++       
Subjt:  SGSSDDEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVE-------

Query:  -----------------------------------------------------------------------------------------------ELAED
                                                                                                       EL+ D
Subjt:  -----------------------------------------------------------------------------------------------ELAED

Query:  PSTRATLWIQGRKGKNNEYFDEDTKQCAGQI---------------------------------------------------SGNKMSTDCSPSKKSTSI
        P  RATLW + RK KNNEY D  T++CA +I                                                   S N+  T  S  K  T  
Subjt:  PSTRATLWIQGRKGKNNEYFDEDTKQCAGQI---------------------------------------------------SGNKMSTDCSPSKKSTSI

Query:  GTNH--------------------------------PKDKEVIDEVEEILEGTPCHLAIGSKDNVVAVGIMYTSDAQFLTVHGVPLGVENIRVVVDMIVG
          +                                 PK K V+ + EEILEG PCHLAIGS DN+VAVG M+ SDAQ  +++ +PLG +N+R +VD+++G
Subjt:  GTNH--------------------------------PKDKEVIDEVEEILEGTPCHLAIGSKDNVVAVGIMYTSDAQFLTVHGVPLGVENIRVVVDMIVG

Query:  ENAPLPIPIRGEVESLSQSMRNFVAWPRDLV--------ASPAKHKS--------DVYL---------------------------------IYLHRDDI
        E+  LPIP + ++++L Q++ NFVAWPR LV         SP   KS        DV++                                 IYL RDDI
Subjt:  ENAPLPIPIRGEVESLSQSMRNFVAWPRDLV--------ASPAKHKS--------DVYL---------------------------------IYLHRDDI

Query:  LHYCGMVKIGYSCIVAYIAYLWTVCDYEIIAKFLLVDQIIISNFIKSQETLCINLANRLKMIYLYLDQQVFIPYNTG-YHWMLIVIHPRANAVYVLNSLR
        + YCGM +IGYSCI+AYIA LW  CD EI  KF++VDQ  IS+ +K QE    NL NRL+M+   LDQ V IPYNTG  HW+LI+I+ + N VYV++SLR
Subjt:  LHYCGMVKIGYSCIVAYIAYLWTVCDYEIIAKFLLVDQIIISNFIKSQETLCINLANRLKMIYLYLDQQVFIPYNTG-YHWMLIVIHPRANAVYVLNSLR

Query:  SKIEESFQGTIN--IKHKYNQDELDEIRVEL
        SKI E FQG IN  +K+   +  L++ R  +
Subjt:  SKIEESFQGTIN--IKHKYNQDELDEIRVEL

XP_038895930.1 uncharacterized protein LOC120084092 isoform X2 [Benincasa hispida]3.4e-8134.92Show/hide
Query:  SGSSDDEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVE-------
        S SS DE NV I+ E + T RRG T M  L  +R +GER  I+YN+ GQ VG+NA +MQS+IGVCVRQQIPL+YK+WK VPQELKD IFD ++       
Subjt:  SGSSDDEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVE-------

Query:  -----------------------------------------------------------------------------------------------ELAED
                                                                                                       EL+ D
Subjt:  -----------------------------------------------------------------------------------------------ELAED

Query:  PSTRATLWIQGRKGKNNEYFDEDTKQCAGQI---------------------------------------------------SGNKMSTDCSPSKKSTSI
        P  RATLW + RK KNNEY D  T++CA +I                                                   S N+  T  S  K  T  
Subjt:  PSTRATLWIQGRKGKNNEYFDEDTKQCAGQI---------------------------------------------------SGNKMSTDCSPSKKSTSI

Query:  GTNH--------------------------------PKDKEVIDEVEEILEGTPCHLAIGSKDNVVAVGIMYTSDAQFLTVHGVPLGVENIRVVVDMIVG
          +                                 PK K V+ + EEILEG PCHLAIGS DN+VAVG M+ SDAQ  +++ +PLG +N+R +VD+++G
Subjt:  GTNH--------------------------------PKDKEVIDEVEEILEGTPCHLAIGSKDNVVAVGIMYTSDAQFLTVHGVPLGVENIRVVVDMIVG

Query:  ENAPLPIPIRGEVESLSQSMRNFVAWPRDLV--------ASPAKHKS--------DVYL---------------------------------IYLHRDDI
        E+  LPIP + ++++L Q++ NFVAWPR LV         SP   KS        DV++                                 IYL RDDI
Subjt:  ENAPLPIPIRGEVESLSQSMRNFVAWPRDLV--------ASPAKHKS--------DVYL---------------------------------IYLHRDDI

Query:  LHYCGMVKIGYSCIVAYIAYLWTVCDYEIIAKFLLVDQIIISNFIKSQETLCINLANRLKMIYLYLDQQVFIPYNTGYHWMLIVIHPRANAVYVLNSLRS
        + YCGM +IGYSCI+AYIA LW  CD EI  KF++VDQ  IS+ +K QE    NL NRL+M+   LDQ V IPYNTG HW+LI+I+ + N VYV++SLRS
Subjt:  LHYCGMVKIGYSCIVAYIAYLWTVCDYEIIAKFLLVDQIIISNFIKSQETLCINLANRLKMIYLYLDQQVFIPYNTGYHWMLIVIHPRANAVYVLNSLRS

Query:  KIEESFQGTIN--IKHKYNQDELDEIRVEL
        KI E FQG IN  +K+   +  L++ R  +
Subjt:  KIEESFQGTIN--IKHKYNQDELDEIRVEL

TrEMBL top hitse value%identityAlignment
A0A1S3BRX5 uncharacterized protein LOC103493028 isoform X13.2e-7734.86Show/hide
Query:  SSDDEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVE---------
        SS DE NV I+ E + T RRG T M  L  +R +GER  I+YN++GQ VG+NA +MQS+IGVCVRQQIP++Y +WKEVPQELKD IFD ++         
Subjt:  SSDDEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVE---------

Query:  ---------------------------------------------------------------------------------------------ELAEDPS
                                                                                                     EL+ DP 
Subjt:  ---------------------------------------------------------------------------------------------ELAEDPS

Query:  TRATLWIQGRKGKNNEYFDEDTKQCA--------------------------GQISG-------------------------------------------
         RATLW + RK KNN  FD+ T++C                           G+I G                                           
Subjt:  TRATLWIQGRKGKNNEYFDEDTKQCA--------------------------GQISG-------------------------------------------

Query:  ----NKMSTDCSPSKKSTSI-------------GTNHPKDKEVIDEVEEILE------------GTPCHLAIGSKDNVVAVGIMYTSDAQFLTVHGVPLG
             + S D + +++S S              G   PK K V+ E EE LE            G PCHLAIGS DNVVAVG M+ SD Q  T+HG+PLG
Subjt:  ----NKMSTDCSPSKKSTSI-------------GTNHPKDKEVIDEVEEILE------------GTPCHLAIGSKDNVVAVGIMYTSDAQFLTVHGVPLG

Query:  VENIRVVVDMIVGENAPLPIPIRGEVESLSQSMRNFVAWPRDLV------------ASPAKHKSDVY---------------------------------
         ENIRV VD+ + E+  LPIP++G++E+L+Q++ NFVAWPR LV            AS +  +S  Y                                 
Subjt:  VENIRVVVDMIVGENAPLPIPIRGEVESLSQSMRNFVAWPRDLV------------ASPAKHKSDVY---------------------------------

Query:  ----LIYLHRDDILHYCGMVKIGYSCIVAYIAYLWTVCDYEIIAKFLLVDQIIISNFIKSQETLCINLANRLKMIYLYLDQQVFIPYNTG-YHWMLIVIH
             IYL RDDI+ YCGM +IGYSCI+ YIA LW VC+ EI  +F+LVDQ  IS+ IKSQE    NL NRL+M    LDQ V IPYNTG  HW+LI+I 
Subjt:  ----LIYLHRDDILHYCGMVKIGYSCIVAYIAYLWTVCDYEIIAKFLLVDQIIISNFIKSQETLCINLANRLKMIYLYLDQQVFIPYNTG-YHWMLIVIH

Query:  PRANAVYVLNSLRSKIEESFQGTINIKHKYNQDE
         + N VYV++ LRSKI   FQG IN   K+ Q E
Subjt:  PRANAVYVLNSLRSKIEESFQGTINIKHKYNQDE

A0A5D3CYL9 ULP_PROTEASE domain-containing protein3.2e-7734.86Show/hide
Query:  SSDDEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVE---------
        SS DE NV I+ E + T RRG T M  L  +R +GER  I+YN++GQ VG+NA +MQS+IGVCVRQQIP++Y +WKEVPQELKD IFD ++         
Subjt:  SSDDEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVE---------

Query:  ---------------------------------------------------------------------------------------------ELAEDPS
                                                                                                     EL+ DP 
Subjt:  ---------------------------------------------------------------------------------------------ELAEDPS

Query:  TRATLWIQGRKGKNNEYFDEDTKQCA--------------------------GQISG-------------------------------------------
         RATLW + RK KNN  FD+ T++C                           G+I G                                           
Subjt:  TRATLWIQGRKGKNNEYFDEDTKQCA--------------------------GQISG-------------------------------------------

Query:  ----NKMSTDCSPSKKSTSI-------------GTNHPKDKEVIDEVEEILE------------GTPCHLAIGSKDNVVAVGIMYTSDAQFLTVHGVPLG
             + S D + +++S S              G   PK K V+ E EE LE            G PCHLAIGS DNVVAVG M+ SD Q  T+HG+PLG
Subjt:  ----NKMSTDCSPSKKSTSI-------------GTNHPKDKEVIDEVEEILE------------GTPCHLAIGSKDNVVAVGIMYTSDAQFLTVHGVPLG

Query:  VENIRVVVDMIVGENAPLPIPIRGEVESLSQSMRNFVAWPRDLV------------ASPAKHKSDVY---------------------------------
         ENIRV VD+ + E+  LPIP++G++E+L+Q++ NFVAWPR LV            AS +  +S  Y                                 
Subjt:  VENIRVVVDMIVGENAPLPIPIRGEVESLSQSMRNFVAWPRDLV------------ASPAKHKSDVY---------------------------------

Query:  ----LIYLHRDDILHYCGMVKIGYSCIVAYIAYLWTVCDYEIIAKFLLVDQIIISNFIKSQETLCINLANRLKMIYLYLDQQVFIPYNTG-YHWMLIVIH
             IYL RDDI+ YCGM +IGYSCI+ YIA LW VC+ EI  +F+LVDQ  IS+ IKSQE    NL NRL+M    LDQ V IPYNTG  HW+LI+I 
Subjt:  ----LIYLHRDDILHYCGMVKIGYSCIVAYIAYLWTVCDYEIIAKFLLVDQIIISNFIKSQETLCINLANRLKMIYLYLDQQVFIPYNTG-YHWMLIVIH

Query:  PRANAVYVLNSLRSKIEESFQGTINIKHKYNQDE
         + N VYV++ LRSKI   FQG IN   K+ Q E
Subjt:  PRANAVYVLNSLRSKIEESFQGTINIKHKYNQDE

A0A6J1C2H7 uncharacterized protein LOC111007859 isoform X12.7e-8437.09Show/hide
Query:  SGSSDDEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVE-------
        S SS DE +V I  E +   RRG TTM  L  +R  G+R  I+YN+QGQ +G+NA +MQS+IGVCVRQ+IP++Y  WKEVPQELKDKIF+ VE       
Subjt:  SGSSDDEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVE-------

Query:  -----------------------------------------------------------------------------------------------ELAED
                                                                                                       +L+ D
Subjt:  -----------------------------------------------------------------------------------------------ELAED

Query:  PSTRATLWIQGRKGKNNEYFDEDTKQCAGQI-------SGNKMSTDC-----------------SPS-----------------KKSTSIGTN----HPK
        PS RA LW + RKGKNNEYFD+ T++CA +I        G  + T+                  SPS                  KST+ G+N      K
Subjt:  PSTRATLWIQGRKGKNNEYFDEDTKQCAGQI-------SGNKMSTDC-----------------SPS-----------------KKSTSIGTN----HPK

Query:  DKEVIDEVEEI-------LEGTPCHLAIGSKDNVVAVGIMYTSDAQFLTVHGVPLGVENIRVVVDMIVGENAPLPIPIRGEVESLSQSMRNFVAWPRDLV
         KE+++  EEI       +EG PCHLA+ S DN+VAVG ++ ++ Q  TVHGVPLGV+N+RV+VD+++ E A +PIP+RGE+E+L+Q++  FVAWPR LV
Subjt:  DKEVIDEVEEI-------LEGTPCHLAIGSKDNVVAVGIMYTSDAQFLTVHGVPLGVENIRVVVDMIVGENAPLPIPIRGEVESLSQSMRNFVAWPRDLV

Query:  -----------------ASPAKHKSDVYL---------------------------------IYLHRDDILHYCGMVKIGYSCIVAYIAYLWTVCDYEII
                            +KH +DV++                                 IYL R+DI+ YC M++IGYSCI+ YIAYLW V +YEI 
Subjt:  -----------------ASPAKHKSDVYL---------------------------------IYLHRDDILHYCGMVKIGYSCIVAYIAYLWTVCDYEII

Query:  AKFLLVDQIIISNFIKSQETLCINLANRLKMIYLYLDQQVFIPYNTGYHWMLIVIHPRANAVYVLNSLRSKIEESFQGTINIKHK
         KFL+VD   IS ++KSQE    NLANRL+M+   L+Q V IPY +G HWMLI+I+ R N VYVL+SLR KI+E +Q  IN   K
Subjt:  AKFLLVDQIIISNFIKSQETLCINLANRLKMIYLYLDQQVFIPYNTGYHWMLIVIHPRANAVYVLNSLRSKIEESFQGTINIKHK

A0A6J1C2V2 uncharacterized protein LOC111007859 isoform X42.7e-8437.09Show/hide
Query:  SGSSDDEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVE-------
        S SS DE +V I  E +   RRG TTM  L  +R  G+R  I+YN+QGQ +G+NA +MQS+IGVCVRQ+IP++Y  WKEVPQELKDKIF+ VE       
Subjt:  SGSSDDEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVE-------

Query:  -----------------------------------------------------------------------------------------------ELAED
                                                                                                       +L+ D
Subjt:  -----------------------------------------------------------------------------------------------ELAED

Query:  PSTRATLWIQGRKGKNNEYFDEDTKQCAGQI-------SGNKMSTDC-----------------SPS-----------------KKSTSIGTN----HPK
        PS RA LW + RKGKNNEYFD+ T++CA +I        G  + T+                  SPS                  KST+ G+N      K
Subjt:  PSTRATLWIQGRKGKNNEYFDEDTKQCAGQI-------SGNKMSTDC-----------------SPS-----------------KKSTSIGTN----HPK

Query:  DKEVIDEVEEI-------LEGTPCHLAIGSKDNVVAVGIMYTSDAQFLTVHGVPLGVENIRVVVDMIVGENAPLPIPIRGEVESLSQSMRNFVAWPRDLV
         KE+++  EEI       +EG PCHLA+ S DN+VAVG ++ ++ Q  TVHGVPLGV+N+RV+VD+++ E A +PIP+RGE+E+L+Q++  FVAWPR LV
Subjt:  DKEVIDEVEEI-------LEGTPCHLAIGSKDNVVAVGIMYTSDAQFLTVHGVPLGVENIRVVVDMIVGENAPLPIPIRGEVESLSQSMRNFVAWPRDLV

Query:  -----------------ASPAKHKSDVYL---------------------------------IYLHRDDILHYCGMVKIGYSCIVAYIAYLWTVCDYEII
                            +KH +DV++                                 IYL R+DI+ YC M++IGYSCI+ YIAYLW V +YEI 
Subjt:  -----------------ASPAKHKSDVYL---------------------------------IYLHRDDILHYCGMVKIGYSCIVAYIAYLWTVCDYEII

Query:  AKFLLVDQIIISNFIKSQETLCINLANRLKMIYLYLDQQVFIPYNTGYHWMLIVIHPRANAVYVLNSLRSKIEESFQGTINIKHK
         KFL+VD   IS ++KSQE    NLANRL+M+   L+Q V IPY +G HWMLI+I+ R N VYVL+SLR KI+E +Q  IN   K
Subjt:  AKFLLVDQIIISNFIKSQETLCINLANRLKMIYLYLDQQVFIPYNTGYHWMLIVIHPRANAVYVLNSLRSKIEESFQGTINIKHK

A0A6J1C4J7 uncharacterized protein LOC111007859 isoform X22.1e-8437.16Show/hide
Query:  SGSSDDEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVE-------
        S SS DE +V I  E +   RRG TTM  L  +R  G+R  I+YN+QGQ +G+NA +MQS+IGVCVRQ+IP++Y  WKEVPQELKDKIF+ VE       
Subjt:  SGSSDDEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVE-------

Query:  ----------------------------------------------------------------------------------------------ELAEDP
                                                                                                      +L+ DP
Subjt:  ----------------------------------------------------------------------------------------------ELAEDP

Query:  STRATLWIQGRKGKNNEYFDEDTKQCAGQI-------SGNKMSTDC-----------------SPS-----------------KKSTSIGTN----HPKD
        S RA LW + RKGKNNEYFD+ T++CA +I        G  + T+                  SPS                  KST+ G+N      K 
Subjt:  STRATLWIQGRKGKNNEYFDEDTKQCAGQI-------SGNKMSTDC-----------------SPS-----------------KKSTSIGTN----HPKD

Query:  KEVIDEVEEI-------LEGTPCHLAIGSKDNVVAVGIMYTSDAQFLTVHGVPLGVENIRVVVDMIVGENAPLPIPIRGEVESLSQSMRNFVAWPRDLV-
        KE+++  EEI       +EG PCHLA+ S DN+VAVG ++ ++ Q  TVHGVPLGV+N+RV+VD+++ E A +PIP+RGE+E+L+Q++  FVAWPR LV 
Subjt:  KEVIDEVEEI-------LEGTPCHLAIGSKDNVVAVGIMYTSDAQFLTVHGVPLGVENIRVVVDMIVGENAPLPIPIRGEVESLSQSMRNFVAWPRDLV-

Query:  ----------------ASPAKHKSDVYL---------------------------------IYLHRDDILHYCGMVKIGYSCIVAYIAYLWTVCDYEIIA
                           +KH +DV++                                 IYL R+DI+ YC M++IGYSCI+ YIAYLW V +YEI  
Subjt:  ----------------ASPAKHKSDVYL---------------------------------IYLHRDDILHYCGMVKIGYSCIVAYIAYLWTVCDYEIIA

Query:  KFLLVDQIIISNFIKSQETLCINLANRLKMIYLYLDQQVFIPYNTGYHWMLIVIHPRANAVYVLNSLRSKIEESFQGTINIKHK
        KFL+VD   IS ++KSQE    NLANRL+M+   L+Q V IPY +G HWMLI+I+ R N VYVL+SLR KI+E +Q  IN   K
Subjt:  KFLLVDQIIISNFIKSQETLCINLANRLKMIYLYLDQQVFIPYNTGYHWMLIVIHPRANAVYVLNSLRSKIEESFQGTINIKHK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTGGATCAAGTGACGATGAAGTGAACGTATCGATCCAAATGGAGGCTAGGCATACTAATCGACGTGGTCTCACTACGATGCGTGGTCTGGCACGCGTAAGGACTAC
AGGAGAACGCTTGGTCATCCAATACAACAATCAAGGCCAAAGTGTTGGTGATAATGCAAACCAAATGCAAAGTTATATAGGAGTTTGCGTTAGGCAACAAATTCCATTAA
GTTACAAGACTTGGAAAGAAGTTCCCCAAGAATTGAAAGATAAAATTTTTGATTCTGTAGAGGAATTGGCGGAAGATCCTTCCACTCGTGCCACCTTATGGATACAAGGA
CGAAAAGGAAAAAATAATGAATACTTCGATGAAGACACCAAACAATGTGCTGGTCAAATCTCTGGAAACAAAATGTCGACTGATTGCTCACCTTCCAAAAAGTCTACAAG
CATAGGCACTAATCATCCAAAAGACAAGGAGGTCATTGACGAGGTGGAAGAAATTTTAGAGGGAACTCCATGCCATCTAGCAATAGGATCAAAGGATAATGTGGTTGCTG
TAGGCATAATGTACACGTCTGACGCTCAATTTCTCACAGTCCATGGAGTTCCCTTAGGAGTTGAAAATATTAGAGTGGTAGTGGACATGATCGTAGGTGAAAATGCTCCA
TTACCAATTCCTATACGGGGAGAAGTAGAGTCTCTGAGTCAATCAATGAGAAATTTTGTGGCATGGCCTCGGGACCTTGTGGCTTCTCCAGCAAAACATAAGTCGGATGT
TTATTTAATTTATTTACATCGCGATGATATCCTGCATTACTGTGGGATGGTGAAGATAGGGTACTCCTGCATAGTCGCATACATTGCGTATCTTTGGACTGTATGTGACT
ATGAAATAATCGCCAAGTTCTTGCTAGTTGATCAAATAATCATTTCTAATTTTATTAAAAGTCAAGAAACACTTTGTATAAATCTGGCTAACAGGTTAAAAATGATTTAT
TTGTACTTGGATCAACAAGTTTTCATCCCATATAATACTGGATATCATTGGATGTTGATCGTTATCCATCCGCGGGCGAACGCCGTTTATGTCTTAAACTCGTTGAGGAG
TAAGATCGAAGAAAGTTTTCAAGGAACAATAAATATAAAACATAAGTACAACCAAGATGAGTTGGATGAGATTAGAGTTGAGCTATGTGAATTTGTATCTCAGTACTTAT
GA
mRNA sequenceShow/hide mRNA sequence
ATGAGTGGATCAAGTGACGATGAAGTGAACGTATCGATCCAAATGGAGGCTAGGCATACTAATCGACGTGGTCTCACTACGATGCGTGGTCTGGCACGCGTAAGGACTAC
AGGAGAACGCTTGGTCATCCAATACAACAATCAAGGCCAAAGTGTTGGTGATAATGCAAACCAAATGCAAAGTTATATAGGAGTTTGCGTTAGGCAACAAATTCCATTAA
GTTACAAGACTTGGAAAGAAGTTCCCCAAGAATTGAAAGATAAAATTTTTGATTCTGTAGAGGAATTGGCGGAAGATCCTTCCACTCGTGCCACCTTATGGATACAAGGA
CGAAAAGGAAAAAATAATGAATACTTCGATGAAGACACCAAACAATGTGCTGGTCAAATCTCTGGAAACAAAATGTCGACTGATTGCTCACCTTCCAAAAAGTCTACAAG
CATAGGCACTAATCATCCAAAAGACAAGGAGGTCATTGACGAGGTGGAAGAAATTTTAGAGGGAACTCCATGCCATCTAGCAATAGGATCAAAGGATAATGTGGTTGCTG
TAGGCATAATGTACACGTCTGACGCTCAATTTCTCACAGTCCATGGAGTTCCCTTAGGAGTTGAAAATATTAGAGTGGTAGTGGACATGATCGTAGGTGAAAATGCTCCA
TTACCAATTCCTATACGGGGAGAAGTAGAGTCTCTGAGTCAATCAATGAGAAATTTTGTGGCATGGCCTCGGGACCTTGTGGCTTCTCCAGCAAAACATAAGTCGGATGT
TTATTTAATTTATTTACATCGCGATGATATCCTGCATTACTGTGGGATGGTGAAGATAGGGTACTCCTGCATAGTCGCATACATTGCGTATCTTTGGACTGTATGTGACT
ATGAAATAATCGCCAAGTTCTTGCTAGTTGATCAAATAATCATTTCTAATTTTATTAAAAGTCAAGAAACACTTTGTATAAATCTGGCTAACAGGTTAAAAATGATTTAT
TTGTACTTGGATCAACAAGTTTTCATCCCATATAATACTGGATATCATTGGATGTTGATCGTTATCCATCCGCGGGCGAACGCCGTTTATGTCTTAAACTCGTTGAGGAG
TAAGATCGAAGAAAGTTTTCAAGGAACAATAAATATAAAACATAAGTACAACCAAGATGAGTTGGATGAGATTAGAGTTGAGCTATGTGAATTTGTATCTCAGTACTTAT
GA
Protein sequenceShow/hide protein sequence
MSGSSDDEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEELAEDPSTRATLWIQG
RKGKNNEYFDEDTKQCAGQISGNKMSTDCSPSKKSTSIGTNHPKDKEVIDEVEEILEGTPCHLAIGSKDNVVAVGIMYTSDAQFLTVHGVPLGVENIRVVVDMIVGENAP
LPIPIRGEVESLSQSMRNFVAWPRDLVASPAKHKSDVYLIYLHRDDILHYCGMVKIGYSCIVAYIAYLWTVCDYEIIAKFLLVDQIIISNFIKSQETLCINLANRLKMIY
LYLDQQVFIPYNTGYHWMLIVIHPRANAVYVLNSLRSKIEESFQGTINIKHKYNQDELDEIRVELCEFVSQYL