; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0021476 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0021476
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionTransposase
Genome locationchr7:8147878..8152698
RNA-Seq ExpressionLag0021476
SyntenyLag0021476
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domainsIPR003653 - Ulp1 protease family, C-terminal catalytic domain
IPR004252 - Probable transposase, Ptta/En/Spm, plant
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008451868.1 PREDICTED: uncharacterized protein LOC103493028 isoform X1 [Cucumis melo]3.3e-23059.22Show/hide
Query:  SNSDDEGHVAIHMEARPPVGRGLTTMRELAGVRNSGQRLVVEYNSQGQAVGTNANKMQSFIGVCVRQQIPLTYDHWNKIPQELKDKIFDCIEMSFIVDPR
        S+S DEG+V I  E +    RG T M EL  +RNSG+R  +EYN +GQ VG NA KMQSFIGVCVRQQIP+TY+ W ++PQELKD IFDCI+MSF+VD  
Subjt:  SNSDDEGHVAIHMEARPPVGRGLTTMRELAGVRNSGQRLVVEYNSQGQAVGTNANKMQSFIGVCVRQQIPLTYDHWNKIPQELKDKIFDCIEMSFIVDPR

Query:  SKHAILQSASKKFRNFRYNLTQKYIIPFMNAPELLQRPPEKYSHIDQQQWIEFVNSRLSEEWKALSGLQKERREKLKYNHHMSRKGYANLAKELELSDDP
        SKH ILQSASKKFR+F+  LTQ YI+P+ + P  LQ PPEKYSHID++QW  FV +RLSEEW+  S  Q+ERR K  YNHH+SRKGYANLA+ELELS DP
Subjt:  SKHAILQSASKKFRNFRYNLTQKYIIPFMNAPELLQRPPEKYSHIDQQQWIEFVNSRLSEEWKALSGLQKERREKLKYNHHMSRKGYANLAKELELSDDP

Query:  SNRATLWKEARKGKNKEYFDDDTRERANRIDELAATNQGQDILTEALGTPEHRGRVRGVGEFVSPSVYFNLPRTS-NLGPQSQSKGKTEGSGSKMST---
         NRATLWKEARK KN   FDD TRE   RIDELAA  +GQDILTEALGTPEHRGR+RGVGEFVSP+++ N+ R +  L  QSQ K +T+ S  +  T   
Subjt:  SNRATLWKEARKGKNKEYFDDDTRERANRIDELAATNQGQDILTEALGTPEHRGRVRGVGEFVSPSVYFNLPRTS-NLGPQSQSKGKTEGSGSKMST---

Query:  ----------------KVESNSSKTKTKGKKIVEEPE------EVFESEEVLEVRNFWDSQFYVVMLPKFSVHSVGTPCRLAVNSVDNIVAIGTMYESSV
                        +  S+ S+ KTKGKK+ +  +       V ESEE LEV+          +L +    S G PC LA+ S+DN+VA+G M+ES V
Subjt:  ----------------KVESNSSKTKTKGKKIVEEPE------EVFESEEVLEVRNFWDSQFYVVMLPKFSVHSVGTPCRLAVNSVDNIVAIGTMYESSV

Query:  GCPTIHGVPLGANNVRVVVDMITGEDVLIPIPVVGEIETLSQAKGSFVAWPRKLVILNNKKKVSSPAKPTRNVSVAHSSERTDVHVTIKLLNRYVVLSMG
         CPTIHG+PLGA N+RV VD+   EDV +PIP+ G+IETL+QA G+FVAWPRKLVI+  +KK  S    T + S   SS+ TDVHVTIKLLNRY + +M 
Subjt:  GCPTIHGVPLGANNVRVVVDMITGEDVLIPIPVVGEIETLSQAKGSFVAWPRKLVILNNKKKVSSPAKPTRNVSVAHSSERTDVHVTIKLLNRYVVLSMG

Query:  EDDTVPINLSDAIFGVDKTIFLHRDDIMQYCGMVEIGYSCILVYITYLWTVCDNEITSKFLIVDPGTISSFVKCQETRCRNLANRLDMVNLNQLVIIPYN
         +D + I+LS+ IFG +KTI+L RDDI+QYCGM EIGYSCIL YI  LW VC++EIT +F++VD  TISS +K QE R RNL NRL+M NL+QLV+IPYN
Subjt:  EDDTVPINLSDAIFGVDKTIFLHRDDIMQYCGMVEIGYSCILVYITYLWTVCDNEITSKFLIVDPGTISSFVKCQETRCRNLANRLDMVNLNQLVIIPYN

Query:  TG-CHWMLIVINPGENTVYVLNSLRSKIEESFQGIINTSLRMWQAKNSLPQYRSNIIWKLIKCPRQLGSVECGYYVQKYIREIVHNSSTSISNLFNTKTA
        TG CHW+LI+I+  EN VYV++ LRSKI   FQG+IN SL+ WQ ++S   YRS I WK IKCPR LGS+ECGYYVQKY+RE+V N++T I+NLFNT  A
Subjt:  TG-CHWMLIVINPGENTVYVLNSLRSKIEESFQGIINTSLRMWQAKNSLPQYRSNIIWKLIKCPRQLGSVECGYYVQKYIREIVHNSSTSISNLFNTKTA

Query:  YTQEEIDEVRIEWAGFVGTFV
        Y QEEID VR+EWA FV  FV
Subjt:  YTQEEIDEVRIEWAGFVGTFV

XP_022136076.1 uncharacterized protein LOC111007859 isoform X1 [Momordica charantia]9.8e-24363.02Show/hide
Query:  TSNSDDEGHVAIHMEARPPVGRGLTTMRELAGVRNSGQRLVVEYNSQGQAVGTNANKMQSFIGVCVRQQIPLTYDHWNKIPQELKDKIFDCIEMSFIVDP
        +S+S DE  V IH E +    RG TTM EL  +RN G+R  +EYN QGQ +G NA KMQSFIGVCVRQ+IP+TY+HW ++PQELKDKIF+C+E SF++D 
Subjt:  TSNSDDEGHVAIHMEARPPVGRGLTTMRELAGVRNSGQRLVVEYNSQGQAVGTNANKMQSFIGVCVRQQIPLTYDHWNKIPQELKDKIFDCIEMSFIVDP

Query:  RSKHAILQSASKKFRNFRYNLTQKYIIPFMNAPELLQRPPEKYSHIDQQQWIEFVNSRLSEEWKALSGLQKERREKLKYNHHMSRKGYANLAKELELSDD
        RSKH ILQSASKKFR F+  LT+ YI+PF + P  LQ PPEKY HIDQ+QW  FVN+RLSEEW+ LS   KE R K  YNHH+SRKGYANLA+EL+LS D
Subjt:  RSKHAILQSASKKFRNFRYNLTQKYIIPFMNAPELLQRPPEKYSHIDQQQWIEFVNSRLSEEWKALSGLQKERREKLKYNHHMSRKGYANLAKELELSDD

Query:  PSNRATLWKEARKGKNKEYFDDDTRERANRIDELAATNQGQDILTEALGTPEHRGRVRGVGEFVSPSVYFNLPRTSNLGPQSQSKGKTEGSGSKMSTKVE
        PSNRA LWKEARKGKN EYFDD TRE A RIDELAA ++G+DILTEALGT EH GRVRGVGEFVSPS+YFN+ +          K KT+      ST   
Subjt:  PSNRATLWKEARKGKNKEYFDDDTRERANRIDELAATNQGQDILTEALGTPEHRGRVRGVGEFVSPSVYFNLPRTSNLGPQSQSKGKTEGSGSKMSTKVE

Query:  SNSSKTKTKGKKIVEEPEEVFESEEVLEVRNFWDSQFYVVMLPKFSVHSVGTPCRLAVNSVDNIVAIGTMYESSVGCPTIHGVPLGANNVRVVVDMITGE
        SN SK K+KGK+IV   EE++ ++E                         G PC LAV SVDNIVA+GT+++++V CPT+HGVPLG +NVRV+VD++  E
Subjt:  SNSSKTKTKGKKIVEEPEEVFESEEVLEVRNFWDSQFYVVMLPKFSVHSVGTPCRLAVNSVDNIVAIGTMYESSVGCPTIHGVPLGANNVRVVVDMITGE

Query:  DVLIPIPVVGEIETLSQAKGSFVAWPRKLVILNNKKKVSSPAKPTRNVSVAHSSERTDVHVTIKLLNRYVVLSMGEDDTVPINLSDAIFGVDKTIFLHRD
           IPIPV GEIETL+Q  G FVAWPR+LVIL+ +K +SS         +   S+ TDVHV+IKLLNRYV+LSM  +DTV INLS  IFG +K I+L R+
Subjt:  DVLIPIPVVGEIETLSQAKGSFVAWPRKLVILNNKKKVSSPAKPTRNVSVAHSSERTDVHVTIKLLNRYVVLSMGEDDTVPINLSDAIFGVDKTIFLHRD

Query:  DIMQYCGMVEIGYSCILVYITYLWTVCDNEITSKFLIVDPGTISSFVKCQETRCRNLANRLDMVNLNQLVIIPYNTGCHWMLIVINPGENTVYVLNSLRS
        DIMQYC M+EIGYSCIL YI YLW V + EIT KFLIVDP TIS +VK QE R RNLANRL+MVNL QLV+IPY +GCHWMLI+IN  EN VYVL+SLR 
Subjt:  DIMQYCGMVEIGYSCILVYITYLWTVCDNEITSKFLIVDPGTISSFVKCQETRCRNLANRLDMVNLNQLVIIPYNTGCHWMLIVINPGENTVYVLNSLRS

Query:  KIEESFQGIINTSLRMWQAKNSLPQYRSNIIWKLIKCPRQLGSVECGYYVQKYIREIVHNSSTSISNLFNTKTAYTQEEIDEVRIEWAGFVGTFV
        KI+E +Q +INTSL++WQAK+S+ +YR+N IWK IKCP Q+GSVECGYYVQKYIREIV N+ST ISN+FNTK AY QEEIDEVRIEWA FVG  V
Subjt:  KIEESFQGIINTSLRMWQAKNSLPQYRSNIIWKLIKCPRQLGSVECGYYVQKYIREIVHNSSTSISNLFNTKTAYTQEEIDEVRIEWAGFVGTFV

XP_022136077.1 uncharacterized protein LOC111007859 isoform X2 [Momordica charantia]1.8e-24163.02Show/hide
Query:  TSNSDDEGHVAIHMEARPPVGRGLTTMRELAGVRNSGQRLVVEYNSQGQAVGTNANKMQSFIGVCVRQQIPLTYDHWNKIPQELKDKIFDCIEMSFIVDP
        +S+S DE  V IH E +    RG TTM EL  +RN G+R  +EYN QGQ +G NA KMQSFIGVCVRQ+IP+TY+HW ++PQELKDKIF+C+E SF++D 
Subjt:  TSNSDDEGHVAIHMEARPPVGRGLTTMRELAGVRNSGQRLVVEYNSQGQAVGTNANKMQSFIGVCVRQQIPLTYDHWNKIPQELKDKIFDCIEMSFIVDP

Query:  RSKHAILQSASKKFRNFRYNLTQKYIIPFMNAPELLQRPPEKYSHIDQQQWIEFVNSRLSEEWKALSGLQKERREKLKYNHHMSRKGYANLAKELELSDD
        RSKH ILQSASKKFR F+  LT+ YI+PF + P  LQ PPEKY HIDQ+QW  FVN+RLSEEW+ LS   KE R K  YNHH+SRKGYANLA+EL+LS D
Subjt:  RSKHAILQSASKKFRNFRYNLTQKYIIPFMNAPELLQRPPEKYSHIDQQQWIEFVNSRLSEEWKALSGLQKERREKLKYNHHMSRKGYANLAKELELSDD

Query:  PSNRATLWKEARKGKNKEYFDDDTRERANRIDELAATNQGQDILTEALGTPEHRGRVRGVGEFVSPSVYFNLPRTSNLGPQSQSKGKTEGSGSKMSTKVE
        PSNRA LWKEARKGKN EYFDD TRE A RIDELAA ++G+DILTEALGT EH GRVRGVGEFVSPS+YFN+ +          K KT+      ST   
Subjt:  PSNRATLWKEARKGKNKEYFDDDTRERANRIDELAATNQGQDILTEALGTPEHRGRVRGVGEFVSPSVYFNLPRTSNLGPQSQSKGKTEGSGSKMSTKVE

Query:  SNSSKTKTKGKKIVEEPEEVFESEEVLEVRNFWDSQFYVVMLPKFSVHSVGTPCRLAVNSVDNIVAIGTMYESSVGCPTIHGVPLGANNVRVVVDMITGE
        SN SK K+KGK+IV   EE++ ++E                         G PC LAV SVDNIVA+GT+++++V CPT+HGVPLG +NVRV+VD++  E
Subjt:  SNSSKTKTKGKKIVEEPEEVFESEEVLEVRNFWDSQFYVVMLPKFSVHSVGTPCRLAVNSVDNIVAIGTMYESSVGCPTIHGVPLGANNVRVVVDMITGE

Query:  DVLIPIPVVGEIETLSQAKGSFVAWPRKLVILNNKKKVSSPAKPTRNVSVAHSSERTDVHVTIKLLNRYVVLSMGEDDTVPINLSDAIFGVDKTIFLHRD
           IPIPV GEIETL+Q  G FVAWPR+LVIL+ +K +SS         +   S+ TDVHV+IKLLNRYV+LSM  +DTV INLS  IFG +K I+L R+
Subjt:  DVLIPIPVVGEIETLSQAKGSFVAWPRKLVILNNKKKVSSPAKPTRNVSVAHSSERTDVHVTIKLLNRYVVLSMGEDDTVPINLSDAIFGVDKTIFLHRD

Query:  DIMQYCGMVEIGYSCILVYITYLWTVCDNEITSKFLIVDPGTISSFVKCQETRCRNLANRLDMVNLNQLVIIPYNTGCHWMLIVINPGENTVYVLNSLRS
        DIMQYC M+EIGYSCIL YI YLW V + EIT KFLIVDP TIS +VK QE R RNLANRL+MVNL QLV+IPY +GCHWMLI+IN  EN VYVL+SLR 
Subjt:  DIMQYCGMVEIGYSCILVYITYLWTVCDNEITSKFLIVDPGTISSFVKCQETRCRNLANRLDMVNLNQLVIIPYNTGCHWMLIVINPGENTVYVLNSLRS

Query:  KIEESFQGIINTSLRMWQAKNSLPQYRSNIIWKLIKCPRQLGSVECGYYVQKYIREIVHNSSTSISNLFNTKTAYTQEEIDEVRIEWAGFVGTFV
        KI+E +Q +INTSL++WQAK+S+ +YR+N IWK IKCP Q+GSVECGYYVQKYIREIV N+ST ISN+FNTK AY QEEIDEVRIEWA FVG  V
Subjt:  KIEESFQGIINTSLRMWQAKNSLPQYRSNIIWKLIKCPRQLGSVECGYYVQKYIREIVHNSSTSISNLFNTKTAYTQEEIDEVRIEWAGFVGTFV

XP_038895921.1 uncharacterized protein LOC120084092 isoform X1 [Benincasa hispida]1.7e-23461.23Show/hide
Query:  TSNSDDEGHVAIHMEARPPVGRGLTTMRELAGVRNSGQRLVVEYNSQGQAVGTNANKMQSFIGVCVRQQIPLTYDHWNKIPQELKDKIFDCIEMSFIVDP
        +S+S DEG+V I  E +    RG T M EL  +RNSG+R  +EYN  GQ VG NA KMQSFIGVCVRQQIPLTY  W  +PQELKD IFDCI+MSF+VD 
Subjt:  TSNSDDEGHVAIHMEARPPVGRGLTTMRELAGVRNSGQRLVVEYNSQGQAVGTNANKMQSFIGVCVRQQIPLTYDHWNKIPQELKDKIFDCIEMSFIVDP

Query:  RSKHAILQSASKKFRNFRYNLTQKYIIPFMNAPELLQRPPEKYSHIDQQQWIEFVNSRLSEEWKALSGLQKERREKLKYNHHMSRKGYANLAKELELSDD
         SKH ILQSASKKFR F+  LTQ+YI+P+ + P  LQ PPEKYSHID++QW  FV +RLSEEW+  S  Q+ERR K  YNHH+SRKGYANLA+ELELS D
Subjt:  RSKHAILQSASKKFRNFRYNLTQKYIIPFMNAPELLQRPPEKYSHIDQQQWIEFVNSRLSEEWKALSGLQKERREKLKYNHHMSRKGYANLAKELELSDD

Query:  PSNRATLWKEARKGKNKEYFDDDTRERANRIDELAATNQGQDILTEALGTPEHRGRVRGVGEFVSPSVYFNLPRTS-NLGPQSQSKGKTEGSGSKMSTKV
        P NRATLWKEARK KN EY D  TRE A RIDELAA  +GQDILTEALGTPEHRGR+RGVGEFVSP++++N+ +    L  +SQ++ +T+ S  K  T+ 
Subjt:  PSNRATLWKEARKGKNKEYFDDDTRERANRIDELAATNQGQDILTEALGTPEHRGRVRGVGEFVSPSVYFNLPRTS-NLGPQSQSKGKTEGSGSKMSTKV

Query:  ESNSSKTKTKGKKIVEE--PEEVFESEEVLEVRNFWDSQFYVVMLPKFSVHSVGTPCRLAVNSVDNIVAIGTMYESSVGCPTIHGVPLGANNVRVVVDMI
          +  +T+     +VE+    +  +    ++ R        VV  P+  +   G PC LA+ SVDNIVA+GTM+ES   CP+I+ +PLG +NVR +VD++
Subjt:  ESNSSKTKTKGKKIVEE--PEEVFESEEVLEVRNFWDSQFYVVMLPKFSVHSVGTPCRLAVNSVDNIVAIGTMYESSVGCPTIHGVPLGANNVRVVVDMI

Query:  TGEDVLIPIPVVGEIETLSQAKGSFVAWPRKLVILNNKKKVSSPAKPTRNVSVAHSSERTDVHVTIKLLNRYVVLSMGEDDTVPINLSDAIFGVDKTIFL
         GEDV +PIP   +I+TL QA G+FVAWPRKLVI   +KK  S   PT + S+A SS+ TDVHVTIKLLNRY + SM  DD + INLS+ I G +KTI+L
Subjt:  TGEDVLIPIPVVGEIETLSQAKGSFVAWPRKLVILNNKKKVSSPAKPTRNVSVAHSSERTDVHVTIKLLNRYVVLSMGEDDTVPINLSDAIFGVDKTIFL

Query:  HRDDIMQYCGMVEIGYSCILVYITYLWTVCDNEITSKFLIVDPGTISSFVKCQETRCRNLANRLDMVNLNQLVIIPYNTG-CHWMLIVINPGENTVYVLN
         RDDI+QYCGM EIGYSCIL YI  LW  CD+EIT KF+IVD  TISS VK QE R +NL NRL+MV+L+QLV+IPYNTG CHW+LI+IN  EN VYV++
Subjt:  HRDDIMQYCGMVEIGYSCILVYITYLWTVCDNEITSKFLIVDPGTISSFVKCQETRCRNLANRLDMVNLNQLVIIPYNTG-CHWMLIVINPGENTVYVLN

Query:  SLRSKIEESFQGIINTSLRMWQAKNSLPQYRSNIIWKLIKCPRQLGSVECGYYVQKYIREIVHNSSTSISNLFNTKTAYTQEEIDEVRIEWAGFVGTFV
        SLRSKI E FQG+INTSL+ WQAK+SL QYR+ I WK IKCPRQ G++ECGYYVQKYIREIV NS+T ISNLFNT+ AY Q+EID VR+EWA FV  FV
Subjt:  SLRSKIEESFQGIINTSLRMWQAKNSLPQYRSNIIWKLIKCPRQLGSVECGYYVQKYIREIVHNSSTSISNLFNTKTAYTQEEIDEVRIEWAGFVGTFV

XP_038895930.1 uncharacterized protein LOC120084092 isoform X2 [Benincasa hispida]6.8e-23661.32Show/hide
Query:  TSNSDDEGHVAIHMEARPPVGRGLTTMRELAGVRNSGQRLVVEYNSQGQAVGTNANKMQSFIGVCVRQQIPLTYDHWNKIPQELKDKIFDCIEMSFIVDP
        +S+S DEG+V I  E +    RG T M EL  +RNSG+R  +EYN  GQ VG NA KMQSFIGVCVRQQIPLTY  W  +PQELKD IFDCI+MSF+VD 
Subjt:  TSNSDDEGHVAIHMEARPPVGRGLTTMRELAGVRNSGQRLVVEYNSQGQAVGTNANKMQSFIGVCVRQQIPLTYDHWNKIPQELKDKIFDCIEMSFIVDP

Query:  RSKHAILQSASKKFRNFRYNLTQKYIIPFMNAPELLQRPPEKYSHIDQQQWIEFVNSRLSEEWKALSGLQKERREKLKYNHHMSRKGYANLAKELELSDD
         SKH ILQSASKKFR F+  LTQ+YI+P+ + P  LQ PPEKYSHID++QW  FV +RLSEEW+  S  Q+ERR K  YNHH+SRKGYANLA+ELELS D
Subjt:  RSKHAILQSASKKFRNFRYNLTQKYIIPFMNAPELLQRPPEKYSHIDQQQWIEFVNSRLSEEWKALSGLQKERREKLKYNHHMSRKGYANLAKELELSDD

Query:  PSNRATLWKEARKGKNKEYFDDDTRERANRIDELAATNQGQDILTEALGTPEHRGRVRGVGEFVSPSVYFNLPRTS-NLGPQSQSKGKTEGSGSKMSTKV
        P NRATLWKEARK KN EY D  TRE A RIDELAA  +GQDILTEALGTPEHRGR+RGVGEFVSP++++N+ +    L  +SQ++ +T+ S  K  T+ 
Subjt:  PSNRATLWKEARKGKNKEYFDDDTRERANRIDELAATNQGQDILTEALGTPEHRGRVRGVGEFVSPSVYFNLPRTS-NLGPQSQSKGKTEGSGSKMSTKV

Query:  ESNSSKTKTKGKKIVEE--PEEVFESEEVLEVRNFWDSQFYVVMLPKFSVHSVGTPCRLAVNSVDNIVAIGTMYESSVGCPTIHGVPLGANNVRVVVDMI
          +  +T+     +VE+    +  +    ++ R        VV  P+  +   G PC LA+ SVDNIVA+GTM+ES   CP+I+ +PLG +NVR +VD++
Subjt:  ESNSSKTKTKGKKIVEE--PEEVFESEEVLEVRNFWDSQFYVVMLPKFSVHSVGTPCRLAVNSVDNIVAIGTMYESSVGCPTIHGVPLGANNVRVVVDMI

Query:  TGEDVLIPIPVVGEIETLSQAKGSFVAWPRKLVILNNKKKVSSPAKPTRNVSVAHSSERTDVHVTIKLLNRYVVLSMGEDDTVPINLSDAIFGVDKTIFL
         GEDV +PIP   +I+TL QA G+FVAWPRKLVI   +KK  S   PT + S+A SS+ TDVHVTIKLLNRY + SM  DD + INLS+ I G +KTI+L
Subjt:  TGEDVLIPIPVVGEIETLSQAKGSFVAWPRKLVILNNKKKVSSPAKPTRNVSVAHSSERTDVHVTIKLLNRYVVLSMGEDDTVPINLSDAIFGVDKTIFL

Query:  HRDDIMQYCGMVEIGYSCILVYITYLWTVCDNEITSKFLIVDPGTISSFVKCQETRCRNLANRLDMVNLNQLVIIPYNTGCHWMLIVINPGENTVYVLNS
         RDDI+QYCGM EIGYSCIL YI  LW  CD+EIT KF+IVD  TISS VK QE R +NL NRL+MV+L+QLV+IPYNTGCHW+LI+IN  EN VYV++S
Subjt:  HRDDIMQYCGMVEIGYSCILVYITYLWTVCDNEITSKFLIVDPGTISSFVKCQETRCRNLANRLDMVNLNQLVIIPYNTGCHWMLIVINPGENTVYVLNS

Query:  LRSKIEESFQGIINTSLRMWQAKNSLPQYRSNIIWKLIKCPRQLGSVECGYYVQKYIREIVHNSSTSISNLFNTKTAYTQEEIDEVRIEWAGFVGTFV
        LRSKI E FQG+INTSL+ WQAK+SL QYR+ I WK IKCPRQ G++ECGYYVQKYIREIV NS+T ISNLFNT+ AY Q+EID VR+EWA FV  FV
Subjt:  LRSKIEESFQGIINTSLRMWQAKNSLPQYRSNIIWKLIKCPRQLGSVECGYYVQKYIREIVHNSSTSISNLFNTKTAYTQEEIDEVRIEWAGFVGTFV

TrEMBL top hitse value%identityAlignment
A0A1S3BRX5 uncharacterized protein LOC103493028 isoform X11.6e-23059.22Show/hide
Query:  SNSDDEGHVAIHMEARPPVGRGLTTMRELAGVRNSGQRLVVEYNSQGQAVGTNANKMQSFIGVCVRQQIPLTYDHWNKIPQELKDKIFDCIEMSFIVDPR
        S+S DEG+V I  E +    RG T M EL  +RNSG+R  +EYN +GQ VG NA KMQSFIGVCVRQQIP+TY+ W ++PQELKD IFDCI+MSF+VD  
Subjt:  SNSDDEGHVAIHMEARPPVGRGLTTMRELAGVRNSGQRLVVEYNSQGQAVGTNANKMQSFIGVCVRQQIPLTYDHWNKIPQELKDKIFDCIEMSFIVDPR

Query:  SKHAILQSASKKFRNFRYNLTQKYIIPFMNAPELLQRPPEKYSHIDQQQWIEFVNSRLSEEWKALSGLQKERREKLKYNHHMSRKGYANLAKELELSDDP
        SKH ILQSASKKFR+F+  LTQ YI+P+ + P  LQ PPEKYSHID++QW  FV +RLSEEW+  S  Q+ERR K  YNHH+SRKGYANLA+ELELS DP
Subjt:  SKHAILQSASKKFRNFRYNLTQKYIIPFMNAPELLQRPPEKYSHIDQQQWIEFVNSRLSEEWKALSGLQKERREKLKYNHHMSRKGYANLAKELELSDDP

Query:  SNRATLWKEARKGKNKEYFDDDTRERANRIDELAATNQGQDILTEALGTPEHRGRVRGVGEFVSPSVYFNLPRTS-NLGPQSQSKGKTEGSGSKMST---
         NRATLWKEARK KN   FDD TRE   RIDELAA  +GQDILTEALGTPEHRGR+RGVGEFVSP+++ N+ R +  L  QSQ K +T+ S  +  T   
Subjt:  SNRATLWKEARKGKNKEYFDDDTRERANRIDELAATNQGQDILTEALGTPEHRGRVRGVGEFVSPSVYFNLPRTS-NLGPQSQSKGKTEGSGSKMST---

Query:  ----------------KVESNSSKTKTKGKKIVEEPE------EVFESEEVLEVRNFWDSQFYVVMLPKFSVHSVGTPCRLAVNSVDNIVAIGTMYESSV
                        +  S+ S+ KTKGKK+ +  +       V ESEE LEV+          +L +    S G PC LA+ S+DN+VA+G M+ES V
Subjt:  ----------------KVESNSSKTKTKGKKIVEEPE------EVFESEEVLEVRNFWDSQFYVVMLPKFSVHSVGTPCRLAVNSVDNIVAIGTMYESSV

Query:  GCPTIHGVPLGANNVRVVVDMITGEDVLIPIPVVGEIETLSQAKGSFVAWPRKLVILNNKKKVSSPAKPTRNVSVAHSSERTDVHVTIKLLNRYVVLSMG
         CPTIHG+PLGA N+RV VD+   EDV +PIP+ G+IETL+QA G+FVAWPRKLVI+  +KK  S    T + S   SS+ TDVHVTIKLLNRY + +M 
Subjt:  GCPTIHGVPLGANNVRVVVDMITGEDVLIPIPVVGEIETLSQAKGSFVAWPRKLVILNNKKKVSSPAKPTRNVSVAHSSERTDVHVTIKLLNRYVVLSMG

Query:  EDDTVPINLSDAIFGVDKTIFLHRDDIMQYCGMVEIGYSCILVYITYLWTVCDNEITSKFLIVDPGTISSFVKCQETRCRNLANRLDMVNLNQLVIIPYN
         +D + I+LS+ IFG +KTI+L RDDI+QYCGM EIGYSCIL YI  LW VC++EIT +F++VD  TISS +K QE R RNL NRL+M NL+QLV+IPYN
Subjt:  EDDTVPINLSDAIFGVDKTIFLHRDDIMQYCGMVEIGYSCILVYITYLWTVCDNEITSKFLIVDPGTISSFVKCQETRCRNLANRLDMVNLNQLVIIPYN

Query:  TG-CHWMLIVINPGENTVYVLNSLRSKIEESFQGIINTSLRMWQAKNSLPQYRSNIIWKLIKCPRQLGSVECGYYVQKYIREIVHNSSTSISNLFNTKTA
        TG CHW+LI+I+  EN VYV++ LRSKI   FQG+IN SL+ WQ ++S   YRS I WK IKCPR LGS+ECGYYVQKY+RE+V N++T I+NLFNT  A
Subjt:  TG-CHWMLIVINPGENTVYVLNSLRSKIEESFQGIINTSLRMWQAKNSLPQYRSNIIWKLIKCPRQLGSVECGYYVQKYIREIVHNSSTSISNLFNTKTA

Query:  YTQEEIDEVRIEWAGFVGTFV
        Y QEEID VR+EWA FV  FV
Subjt:  YTQEEIDEVRIEWAGFVGTFV

A0A5D3CYL9 ULP_PROTEASE domain-containing protein1.6e-23059.22Show/hide
Query:  SNSDDEGHVAIHMEARPPVGRGLTTMRELAGVRNSGQRLVVEYNSQGQAVGTNANKMQSFIGVCVRQQIPLTYDHWNKIPQELKDKIFDCIEMSFIVDPR
        S+S DEG+V I  E +    RG T M EL  +RNSG+R  +EYN +GQ VG NA KMQSFIGVCVRQQIP+TY+ W ++PQELKD IFDCI+MSF+VD  
Subjt:  SNSDDEGHVAIHMEARPPVGRGLTTMRELAGVRNSGQRLVVEYNSQGQAVGTNANKMQSFIGVCVRQQIPLTYDHWNKIPQELKDKIFDCIEMSFIVDPR

Query:  SKHAILQSASKKFRNFRYNLTQKYIIPFMNAPELLQRPPEKYSHIDQQQWIEFVNSRLSEEWKALSGLQKERREKLKYNHHMSRKGYANLAKELELSDDP
        SKH ILQSASKKFR+F+  LTQ YI+P+ + P  LQ PPEKYSHID++QW  FV +RLSEEW+  S  Q+ERR K  YNHH+SRKGYANLA+ELELS DP
Subjt:  SKHAILQSASKKFRNFRYNLTQKYIIPFMNAPELLQRPPEKYSHIDQQQWIEFVNSRLSEEWKALSGLQKERREKLKYNHHMSRKGYANLAKELELSDDP

Query:  SNRATLWKEARKGKNKEYFDDDTRERANRIDELAATNQGQDILTEALGTPEHRGRVRGVGEFVSPSVYFNLPRTS-NLGPQSQSKGKTEGSGSKMST---
         NRATLWKEARK KN   FDD TRE   RIDELAA  +GQDILTEALGTPEHRGR+RGVGEFVSP+++ N+ R +  L  QSQ K +T+ S  +  T   
Subjt:  SNRATLWKEARKGKNKEYFDDDTRERANRIDELAATNQGQDILTEALGTPEHRGRVRGVGEFVSPSVYFNLPRTS-NLGPQSQSKGKTEGSGSKMST---

Query:  ----------------KVESNSSKTKTKGKKIVEEPE------EVFESEEVLEVRNFWDSQFYVVMLPKFSVHSVGTPCRLAVNSVDNIVAIGTMYESSV
                        +  S+ S+ KTKGKK+ +  +       V ESEE LEV+          +L +    S G PC LA+ S+DN+VA+G M+ES V
Subjt:  ----------------KVESNSSKTKTKGKKIVEEPE------EVFESEEVLEVRNFWDSQFYVVMLPKFSVHSVGTPCRLAVNSVDNIVAIGTMYESSV

Query:  GCPTIHGVPLGANNVRVVVDMITGEDVLIPIPVVGEIETLSQAKGSFVAWPRKLVILNNKKKVSSPAKPTRNVSVAHSSERTDVHVTIKLLNRYVVLSMG
         CPTIHG+PLGA N+RV VD+   EDV +PIP+ G+IETL+QA G+FVAWPRKLVI+  +KK  S    T + S   SS+ TDVHVTIKLLNRY + +M 
Subjt:  GCPTIHGVPLGANNVRVVVDMITGEDVLIPIPVVGEIETLSQAKGSFVAWPRKLVILNNKKKVSSPAKPTRNVSVAHSSERTDVHVTIKLLNRYVVLSMG

Query:  EDDTVPINLSDAIFGVDKTIFLHRDDIMQYCGMVEIGYSCILVYITYLWTVCDNEITSKFLIVDPGTISSFVKCQETRCRNLANRLDMVNLNQLVIIPYN
         +D + I+LS+ IFG +KTI+L RDDI+QYCGM EIGYSCIL YI  LW VC++EIT +F++VD  TISS +K QE R RNL NRL+M NL+QLV+IPYN
Subjt:  EDDTVPINLSDAIFGVDKTIFLHRDDIMQYCGMVEIGYSCILVYITYLWTVCDNEITSKFLIVDPGTISSFVKCQETRCRNLANRLDMVNLNQLVIIPYN

Query:  TG-CHWMLIVINPGENTVYVLNSLRSKIEESFQGIINTSLRMWQAKNSLPQYRSNIIWKLIKCPRQLGSVECGYYVQKYIREIVHNSSTSISNLFNTKTA
        TG CHW+LI+I+  EN VYV++ LRSKI   FQG+IN SL+ WQ ++S   YRS I WK IKCPR LGS+ECGYYVQKY+RE+V N++T I+NLFNT  A
Subjt:  TG-CHWMLIVINPGENTVYVLNSLRSKIEESFQGIINTSLRMWQAKNSLPQYRSNIIWKLIKCPRQLGSVECGYYVQKYIREIVHNSSTSISNLFNTKTA

Query:  YTQEEIDEVRIEWAGFVGTFV
        Y QEEID VR+EWA FV  FV
Subjt:  YTQEEIDEVRIEWAGFVGTFV

A0A6J1C2H7 uncharacterized protein LOC111007859 isoform X14.7e-24363.02Show/hide
Query:  TSNSDDEGHVAIHMEARPPVGRGLTTMRELAGVRNSGQRLVVEYNSQGQAVGTNANKMQSFIGVCVRQQIPLTYDHWNKIPQELKDKIFDCIEMSFIVDP
        +S+S DE  V IH E +    RG TTM EL  +RN G+R  +EYN QGQ +G NA KMQSFIGVCVRQ+IP+TY+HW ++PQELKDKIF+C+E SF++D 
Subjt:  TSNSDDEGHVAIHMEARPPVGRGLTTMRELAGVRNSGQRLVVEYNSQGQAVGTNANKMQSFIGVCVRQQIPLTYDHWNKIPQELKDKIFDCIEMSFIVDP

Query:  RSKHAILQSASKKFRNFRYNLTQKYIIPFMNAPELLQRPPEKYSHIDQQQWIEFVNSRLSEEWKALSGLQKERREKLKYNHHMSRKGYANLAKELELSDD
        RSKH ILQSASKKFR F+  LT+ YI+PF + P  LQ PPEKY HIDQ+QW  FVN+RLSEEW+ LS   KE R K  YNHH+SRKGYANLA+EL+LS D
Subjt:  RSKHAILQSASKKFRNFRYNLTQKYIIPFMNAPELLQRPPEKYSHIDQQQWIEFVNSRLSEEWKALSGLQKERREKLKYNHHMSRKGYANLAKELELSDD

Query:  PSNRATLWKEARKGKNKEYFDDDTRERANRIDELAATNQGQDILTEALGTPEHRGRVRGVGEFVSPSVYFNLPRTSNLGPQSQSKGKTEGSGSKMSTKVE
        PSNRA LWKEARKGKN EYFDD TRE A RIDELAA ++G+DILTEALGT EH GRVRGVGEFVSPS+YFN+ +          K KT+      ST   
Subjt:  PSNRATLWKEARKGKNKEYFDDDTRERANRIDELAATNQGQDILTEALGTPEHRGRVRGVGEFVSPSVYFNLPRTSNLGPQSQSKGKTEGSGSKMSTKVE

Query:  SNSSKTKTKGKKIVEEPEEVFESEEVLEVRNFWDSQFYVVMLPKFSVHSVGTPCRLAVNSVDNIVAIGTMYESSVGCPTIHGVPLGANNVRVVVDMITGE
        SN SK K+KGK+IV   EE++ ++E                         G PC LAV SVDNIVA+GT+++++V CPT+HGVPLG +NVRV+VD++  E
Subjt:  SNSSKTKTKGKKIVEEPEEVFESEEVLEVRNFWDSQFYVVMLPKFSVHSVGTPCRLAVNSVDNIVAIGTMYESSVGCPTIHGVPLGANNVRVVVDMITGE

Query:  DVLIPIPVVGEIETLSQAKGSFVAWPRKLVILNNKKKVSSPAKPTRNVSVAHSSERTDVHVTIKLLNRYVVLSMGEDDTVPINLSDAIFGVDKTIFLHRD
           IPIPV GEIETL+Q  G FVAWPR+LVIL+ +K +SS         +   S+ TDVHV+IKLLNRYV+LSM  +DTV INLS  IFG +K I+L R+
Subjt:  DVLIPIPVVGEIETLSQAKGSFVAWPRKLVILNNKKKVSSPAKPTRNVSVAHSSERTDVHVTIKLLNRYVVLSMGEDDTVPINLSDAIFGVDKTIFLHRD

Query:  DIMQYCGMVEIGYSCILVYITYLWTVCDNEITSKFLIVDPGTISSFVKCQETRCRNLANRLDMVNLNQLVIIPYNTGCHWMLIVINPGENTVYVLNSLRS
        DIMQYC M+EIGYSCIL YI YLW V + EIT KFLIVDP TIS +VK QE R RNLANRL+MVNL QLV+IPY +GCHWMLI+IN  EN VYVL+SLR 
Subjt:  DIMQYCGMVEIGYSCILVYITYLWTVCDNEITSKFLIVDPGTISSFVKCQETRCRNLANRLDMVNLNQLVIIPYNTGCHWMLIVINPGENTVYVLNSLRS

Query:  KIEESFQGIINTSLRMWQAKNSLPQYRSNIIWKLIKCPRQLGSVECGYYVQKYIREIVHNSSTSISNLFNTKTAYTQEEIDEVRIEWAGFVGTFV
        KI+E +Q +INTSL++WQAK+S+ +YR+N IWK IKCP Q+GSVECGYYVQKYIREIV N+ST ISN+FNTK AY QEEIDEVRIEWA FVG  V
Subjt:  KIEESFQGIINTSLRMWQAKNSLPQYRSNIIWKLIKCPRQLGSVECGYYVQKYIREIVHNSSTSISNLFNTKTAYTQEEIDEVRIEWAGFVGTFV

A0A6J1C2V2 uncharacterized protein LOC111007859 isoform X41.1e-22862.26Show/hide
Query:  TSNSDDEGHVAIHMEARPPVGRGLTTMRELAGVRNSGQRLVVEYNSQGQAVGTNANKMQSFIGVCVRQQIPLTYDHWNKIPQELKDKIFDCIEMSFIVDP
        +S+S DE  V IH E +    RG TTM EL  +RN G+R  +EYN QGQ +G NA KMQSFIGVCVRQ+IP+TY+HW ++PQELKDKIF+C+E SF++D 
Subjt:  TSNSDDEGHVAIHMEARPPVGRGLTTMRELAGVRNSGQRLVVEYNSQGQAVGTNANKMQSFIGVCVRQQIPLTYDHWNKIPQELKDKIFDCIEMSFIVDP

Query:  RSKHAILQSASKKFRNFRYNLTQKYIIPFMNAPELLQRPPEKYSHIDQQQWIEFVNSRLSEEWKALSGLQKERREKLKYNHHMSRKGYANLAKELELSDD
        RSKH ILQSASKKFR F+  LT+ YI+PF + P  LQ PPEKY HIDQ+QW  FVN+RLSEEW+ LS   KE R K  YNHH+SRKGYANLA+EL+LS D
Subjt:  RSKHAILQSASKKFRNFRYNLTQKYIIPFMNAPELLQRPPEKYSHIDQQQWIEFVNSRLSEEWKALSGLQKERREKLKYNHHMSRKGYANLAKELELSDD

Query:  PSNRATLWKEARKGKNKEYFDDDTRERANRIDELAATNQGQDILTEALGTPEHRGRVRGVGEFVSPSVYFNLPRTSNLGPQSQSKGKTEGSGSKMSTKVE
        PSNRA LWKEARKGKN EYFDD TRE A RIDELAA ++G+DILTEALGT EH GRVRGVGEFVSPS+YFN+ +          K KT+      ST   
Subjt:  PSNRATLWKEARKGKNKEYFDDDTRERANRIDELAATNQGQDILTEALGTPEHRGRVRGVGEFVSPSVYFNLPRTSNLGPQSQSKGKTEGSGSKMSTKVE

Query:  SNSSKTKTKGKKIVEEPEEVFESEEVLEVRNFWDSQFYVVMLPKFSVHSVGTPCRLAVNSVDNIVAIGTMYESSVGCPTIHGVPLGANNVRVVVDMITGE
        SN SK K+KGK+IV   EE++ ++E                         G PC LAV SVDNIVA+GT+++++V CPT+HGVPLG +NVRV+VD++  E
Subjt:  SNSSKTKTKGKKIVEEPEEVFESEEVLEVRNFWDSQFYVVMLPKFSVHSVGTPCRLAVNSVDNIVAIGTMYESSVGCPTIHGVPLGANNVRVVVDMITGE

Query:  DVLIPIPVVGEIETLSQAKGSFVAWPRKLVILNNKKKVSSPAKPTRNVSVAHSSERTDVHVTIKLLNRYVVLSMGEDDTVPINLSDAIFGVDKTIFLHRD
           IPIPV GEIETL+Q  G FVAWPR+LVIL+ +K +SS         +   S+ TDVHV+IKLLNRYV+LSM  +DTV INLS  IFG +K I+L R+
Subjt:  DVLIPIPVVGEIETLSQAKGSFVAWPRKLVILNNKKKVSSPAKPTRNVSVAHSSERTDVHVTIKLLNRYVVLSMGEDDTVPINLSDAIFGVDKTIFLHRD

Query:  DIMQYCGMVEIGYSCILVYITYLWTVCDNEITSKFLIVDPGTISSFVKCQETRCRNLANRLDMVNLNQLVIIPYNTGCHWMLIVINPGENTVYVLNSLRS
        DIMQYC M+EIGYSCIL YI YLW V + EIT KFLIVDP TIS +VK QE R RNLANRL+MVNL QLV+IPY +GCHWMLI+IN  EN VYVL+SLR 
Subjt:  DIMQYCGMVEIGYSCILVYITYLWTVCDNEITSKFLIVDPGTISSFVKCQETRCRNLANRLDMVNLNQLVIIPYNTGCHWMLIVINPGENTVYVLNSLRS

Query:  KIEESFQGIINTSLRMWQAKNSLPQYRSNIIWKLIKCPRQLGSVECGYYVQKYIREIVHNSSTSI
        KI+E +Q +INTSL++WQAK+S+ +YR+N IWK IKCP Q+GSVECGYYVQKYIREIV N+ST I
Subjt:  KIEESFQGIINTSLRMWQAKNSLPQYRSNIIWKLIKCPRQLGSVECGYYVQKYIREIVHNSSTSI

A0A6J1C4J7 uncharacterized protein LOC111007859 isoform X28.9e-24263.02Show/hide
Query:  TSNSDDEGHVAIHMEARPPVGRGLTTMRELAGVRNSGQRLVVEYNSQGQAVGTNANKMQSFIGVCVRQQIPLTYDHWNKIPQELKDKIFDCIEMSFIVDP
        +S+S DE  V IH E +    RG TTM EL  +RN G+R  +EYN QGQ +G NA KMQSFIGVCVRQ+IP+TY+HW ++PQELKDKIF+C+E SF++D 
Subjt:  TSNSDDEGHVAIHMEARPPVGRGLTTMRELAGVRNSGQRLVVEYNSQGQAVGTNANKMQSFIGVCVRQQIPLTYDHWNKIPQELKDKIFDCIEMSFIVDP

Query:  RSKHAILQSASKKFRNFRYNLTQKYIIPFMNAPELLQRPPEKYSHIDQQQWIEFVNSRLSEEWKALSGLQKERREKLKYNHHMSRKGYANLAKELELSDD
        RSKH ILQSASKKFR F+  LT+ YI+PF + P  LQ PPEKY HIDQ+QW  FVN+RLSEEW+ LS   KE R K  YNHH+SRKGYANLA+EL+LS D
Subjt:  RSKHAILQSASKKFRNFRYNLTQKYIIPFMNAPELLQRPPEKYSHIDQQQWIEFVNSRLSEEWKALSGLQKERREKLKYNHHMSRKGYANLAKELELSDD

Query:  PSNRATLWKEARKGKNKEYFDDDTRERANRIDELAATNQGQDILTEALGTPEHRGRVRGVGEFVSPSVYFNLPRTSNLGPQSQSKGKTEGSGSKMSTKVE
        PSNRA LWKEARKGKN EYFDD TRE A RIDELAA ++G+DILTEALGT EH GRVRGVGEFVSPS+YFN+ +          K KT+      ST   
Subjt:  PSNRATLWKEARKGKNKEYFDDDTRERANRIDELAATNQGQDILTEALGTPEHRGRVRGVGEFVSPSVYFNLPRTSNLGPQSQSKGKTEGSGSKMSTKVE

Query:  SNSSKTKTKGKKIVEEPEEVFESEEVLEVRNFWDSQFYVVMLPKFSVHSVGTPCRLAVNSVDNIVAIGTMYESSVGCPTIHGVPLGANNVRVVVDMITGE
        SN SK K+KGK+IV   EE++ ++E                         G PC LAV SVDNIVA+GT+++++V CPT+HGVPLG +NVRV+VD++  E
Subjt:  SNSSKTKTKGKKIVEEPEEVFESEEVLEVRNFWDSQFYVVMLPKFSVHSVGTPCRLAVNSVDNIVAIGTMYESSVGCPTIHGVPLGANNVRVVVDMITGE

Query:  DVLIPIPVVGEIETLSQAKGSFVAWPRKLVILNNKKKVSSPAKPTRNVSVAHSSERTDVHVTIKLLNRYVVLSMGEDDTVPINLSDAIFGVDKTIFLHRD
           IPIPV GEIETL+Q  G FVAWPR+LVIL+ +K +SS         +   S+ TDVHV+IKLLNRYV+LSM  +DTV INLS  IFG +K I+L R+
Subjt:  DVLIPIPVVGEIETLSQAKGSFVAWPRKLVILNNKKKVSSPAKPTRNVSVAHSSERTDVHVTIKLLNRYVVLSMGEDDTVPINLSDAIFGVDKTIFLHRD

Query:  DIMQYCGMVEIGYSCILVYITYLWTVCDNEITSKFLIVDPGTISSFVKCQETRCRNLANRLDMVNLNQLVIIPYNTGCHWMLIVINPGENTVYVLNSLRS
        DIMQYC M+EIGYSCIL YI YLW V + EIT KFLIVDP TIS +VK QE R RNLANRL+MVNL QLV+IPY +GCHWMLI+IN  EN VYVL+SLR 
Subjt:  DIMQYCGMVEIGYSCILVYITYLWTVCDNEITSKFLIVDPGTISSFVKCQETRCRNLANRLDMVNLNQLVIIPYNTGCHWMLIVINPGENTVYVLNSLRS

Query:  KIEESFQGIINTSLRMWQAKNSLPQYRSNIIWKLIKCPRQLGSVECGYYVQKYIREIVHNSSTSISNLFNTKTAYTQEEIDEVRIEWAGFVGTFV
        KI+E +Q +INTSL++WQAK+S+ +YR+N IWK IKCP Q+GSVECGYYVQKYIREIV N+ST ISN+FNTK AY QEEIDEVRIEWA FVG  V
Subjt:  KIEESFQGIINTSLRMWQAKNSLPQYRSNIIWKLIKCPRQLGSVECGYYVQKYIREIVHNSSTSISNLFNTKTAYTQEEIDEVRIEWAGFVGTFV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCCGCCCTTAAAAGTAGTCACTTTATTGCCCGACGTTGTCTCCTTTTCTCTCTCCCCGACATTTCCCCTCTTCTTAGTTTCTCCGCCCAGGTCTCCTTATTTTCTCT
CTGCCCATCTCAGCTCGCCGTCGTTCTCGCCGTCGTTCTCGCCGTCGTTCTCGTCGTTCTCGCCGTCGTTCTCGCCGTCGTGCTCGCAAGGTCTTCTCGTCGTCGTTTGT
CGCAGCCCCGTCATTCGGCGTCGTTCGCCGCCAGCAAGGCTGTTCTCGTGGGTTTTCTCGCCGCCGGCATTTTCGTGGGGGTGTCGCTGCCGGCGTGGTCGTGTGTCTGC
TCGCCGGTCTCCGTAGCTCCAAAAAGTGGTGGACCATTTGCTAAATATAGTTTCAATTTTACAAAATCAAGGCTGTTCTCGTGGGTTTTCTCGCCGCCGGCATTTTCGTG
GGGTGTCGCTGCCGGCGTGGTCGTGTGTCTTGTCGCCGGTCTCCGTAGCTCCAAAGCTGTGGGTTTTTCTCCAAAGCACAGTGGGTTTTTGTTGCTTTCCGCTCCATTGA
CAAGTAACAGTGATGATGAAGGCCACGTTGCTATTCATATGGAGGCCAGACCGCCTGTTGGACGAGGTCTCACCACTATGCGTGAGTTGGCAGGTGTACGAAATTCTGGA
CAACGCTTGGTTGTTGAATACAACAGTCAAGGTCAGGCCGTTGGTACAAATGCAAACAAAATGCAAAGTTTCATCGGAGTTTGTGTCAGACAACAAATTCCACTGACTTA
TGACCATTGGAACAAAATTCCACAGGAGTTGAAAGACAAGATATTTGATTGTATAGAGATGTCATTCATCGTGGACCCCAGGTCCAAACATGCAATCCTTCAATCAGCAT
CAAAGAAGTTTCGAAATTTTCGGTACAATTTGACTCAGAAGTATATAATTCCATTCATGAATGCACCAGAACTGTTGCAGAGACCTCCTGAGAAATATTCACATATTGAT
CAACAACAGTGGATTGAGTTTGTTAATTCAAGATTATCTGAGGAGTGGAAGGCACTTAGTGGTCTCCAAAAAGAAAGAAGGGAGAAACTTAAATACAATCATCATATGTC
TCGTAAGGGATATGCTAACCTGGCCAAAGAACTAGAATTGTCAGATGATCCTAGCAACCGAGCCACTCTATGGAAGGAAGCAAGAAAAGGAAAAAATAAGGAATATTTTG
ATGACGACACTAGAGAACGCGCTAATCGAATTGACGAGCTAGCTGCGACAAATCAAGGTCAAGATATACTTACTGAAGCATTAGGCACGCCAGAACATAGAGGGCGTGTT
AGAGGAGTGGGTGAGTTTGTTTCACCATCTGTCTACTTCAATCTTCCTAGGACATCAAACTTAGGTCCACAATCCCAAAGCAAGGGTAAAACGGAAGGCAGTGGTTCAAA
AATGTCAACAAAGGTAGAAAGTAATTCTTCAAAGACAAAAACAAAAGGAAAGAAGATTGTTGAAGAACCAGAAGAGGTGTTCGAGTCAGAAGAAGTGTTAGAGGTGAGAA
ATTTTTGGGATAGTCAGTTTTACGTCGTTATGCTGCCGAAATTTTCGGTGCACTCGGTTGGTACACCATGTCGCTTGGCTGTAAATTCAGTGGATAACATTGTTGCCATA
GGCACAATGTATGAATCAAGTGTCGGATGTCCAACAATCCATGGAGTACCACTAGGAGCCAATAATGTTCGAGTGGTGGTGGATATGATCACAGGCGAAGATGTTCTCAT
ACCAATTCCTGTGGTTGGAGAAATAGAGACGCTTAGTCAAGCAAAGGGTAGCTTTGTGGCGTGGCCTCGCAAGCTTGTGATTCTAAATAACAAGAAAAAGGTATCTTCTC
CCGCAAAACCTACAAGGAATGTGTCTGTTGCACATTCTTCCGAACGTACAGATGTCCACGTTACTATCAAGTTGTTGAATCGATATGTCGTTCTGTCCATGGGAGAGGAT
GACACAGTTCCTATCAACTTGAGTGACGCCATATTTGGAGTTGATAAAACAATTTTCCTACATCGTGATGACATCATGCAGTATTGCGGGATGGTTGAAATAGGGTACTC
ATGTATACTAGTGTACATTACGTATCTATGGACTGTATGTGACAATGAAATAACCAGCAAATTTCTGATAGTTGATCCAGGAACCATCTCTTCATTTGTAAAGTGTCAAG
AAACTCGTTGCAGAAATCTAGCCAACCGGCTAGACATGGTTAATTTGAATCAACTAGTCATCATCCCGTACAATACTGGGTGTCATTGGATGTTGATTGTGATCAATCCT
GGAGAGAATACCGTCTATGTGTTGAACTCATTACGTAGTAAGATTGAAGAAAGTTTTCAAGGAATTATCAATACATCATTGAGAATGTGGCAAGCAAAGAACTCACTCCC
ACAATATCGCTCGAACATAATTTGGAAACTTATAAAGTGCCCCCGTCAATTGGGTTCTGTAGAGTGTGGATATTATGTGCAAAAGTATATTCGAGAAATAGTACACAACT
CATCTACGTCTATAAGTAATCTTTTTAACACGAAAACAGCATATACGCAAGAAGAAATTGACGAGGTTCGGATAGAATGGGCAGGGTTCGTGGGAACATTTGTGTAG
mRNA sequenceShow/hide mRNA sequence
ATGCCCGCCCTTAAAAGTAGTCACTTTATTGCCCGACGTTGTCTCCTTTTCTCTCTCCCCGACATTTCCCCTCTTCTTAGTTTCTCCGCCCAGGTCTCCTTATTTTCTCT
CTGCCCATCTCAGCTCGCCGTCGTTCTCGCCGTCGTTCTCGCCGTCGTTCTCGTCGTTCTCGCCGTCGTTCTCGCCGTCGTGCTCGCAAGGTCTTCTCGTCGTCGTTTGT
CGCAGCCCCGTCATTCGGCGTCGTTCGCCGCCAGCAAGGCTGTTCTCGTGGGTTTTCTCGCCGCCGGCATTTTCGTGGGGGTGTCGCTGCCGGCGTGGTCGTGTGTCTGC
TCGCCGGTCTCCGTAGCTCCAAAAAGTGGTGGACCATTTGCTAAATATAGTTTCAATTTTACAAAATCAAGGCTGTTCTCGTGGGTTTTCTCGCCGCCGGCATTTTCGTG
GGGTGTCGCTGCCGGCGTGGTCGTGTGTCTTGTCGCCGGTCTCCGTAGCTCCAAAGCTGTGGGTTTTTCTCCAAAGCACAGTGGGTTTTTGTTGCTTTCCGCTCCATTGA
CAAGTAACAGTGATGATGAAGGCCACGTTGCTATTCATATGGAGGCCAGACCGCCTGTTGGACGAGGTCTCACCACTATGCGTGAGTTGGCAGGTGTACGAAATTCTGGA
CAACGCTTGGTTGTTGAATACAACAGTCAAGGTCAGGCCGTTGGTACAAATGCAAACAAAATGCAAAGTTTCATCGGAGTTTGTGTCAGACAACAAATTCCACTGACTTA
TGACCATTGGAACAAAATTCCACAGGAGTTGAAAGACAAGATATTTGATTGTATAGAGATGTCATTCATCGTGGACCCCAGGTCCAAACATGCAATCCTTCAATCAGCAT
CAAAGAAGTTTCGAAATTTTCGGTACAATTTGACTCAGAAGTATATAATTCCATTCATGAATGCACCAGAACTGTTGCAGAGACCTCCTGAGAAATATTCACATATTGAT
CAACAACAGTGGATTGAGTTTGTTAATTCAAGATTATCTGAGGAGTGGAAGGCACTTAGTGGTCTCCAAAAAGAAAGAAGGGAGAAACTTAAATACAATCATCATATGTC
TCGTAAGGGATATGCTAACCTGGCCAAAGAACTAGAATTGTCAGATGATCCTAGCAACCGAGCCACTCTATGGAAGGAAGCAAGAAAAGGAAAAAATAAGGAATATTTTG
ATGACGACACTAGAGAACGCGCTAATCGAATTGACGAGCTAGCTGCGACAAATCAAGGTCAAGATATACTTACTGAAGCATTAGGCACGCCAGAACATAGAGGGCGTGTT
AGAGGAGTGGGTGAGTTTGTTTCACCATCTGTCTACTTCAATCTTCCTAGGACATCAAACTTAGGTCCACAATCCCAAAGCAAGGGTAAAACGGAAGGCAGTGGTTCAAA
AATGTCAACAAAGGTAGAAAGTAATTCTTCAAAGACAAAAACAAAAGGAAAGAAGATTGTTGAAGAACCAGAAGAGGTGTTCGAGTCAGAAGAAGTGTTAGAGGTGAGAA
ATTTTTGGGATAGTCAGTTTTACGTCGTTATGCTGCCGAAATTTTCGGTGCACTCGGTTGGTACACCATGTCGCTTGGCTGTAAATTCAGTGGATAACATTGTTGCCATA
GGCACAATGTATGAATCAAGTGTCGGATGTCCAACAATCCATGGAGTACCACTAGGAGCCAATAATGTTCGAGTGGTGGTGGATATGATCACAGGCGAAGATGTTCTCAT
ACCAATTCCTGTGGTTGGAGAAATAGAGACGCTTAGTCAAGCAAAGGGTAGCTTTGTGGCGTGGCCTCGCAAGCTTGTGATTCTAAATAACAAGAAAAAGGTATCTTCTC
CCGCAAAACCTACAAGGAATGTGTCTGTTGCACATTCTTCCGAACGTACAGATGTCCACGTTACTATCAAGTTGTTGAATCGATATGTCGTTCTGTCCATGGGAGAGGAT
GACACAGTTCCTATCAACTTGAGTGACGCCATATTTGGAGTTGATAAAACAATTTTCCTACATCGTGATGACATCATGCAGTATTGCGGGATGGTTGAAATAGGGTACTC
ATGTATACTAGTGTACATTACGTATCTATGGACTGTATGTGACAATGAAATAACCAGCAAATTTCTGATAGTTGATCCAGGAACCATCTCTTCATTTGTAAAGTGTCAAG
AAACTCGTTGCAGAAATCTAGCCAACCGGCTAGACATGGTTAATTTGAATCAACTAGTCATCATCCCGTACAATACTGGGTGTCATTGGATGTTGATTGTGATCAATCCT
GGAGAGAATACCGTCTATGTGTTGAACTCATTACGTAGTAAGATTGAAGAAAGTTTTCAAGGAATTATCAATACATCATTGAGAATGTGGCAAGCAAAGAACTCACTCCC
ACAATATCGCTCGAACATAATTTGGAAACTTATAAAGTGCCCCCGTCAATTGGGTTCTGTAGAGTGTGGATATTATGTGCAAAAGTATATTCGAGAAATAGTACACAACT
CATCTACGTCTATAAGTAATCTTTTTAACACGAAAACAGCATATACGCAAGAAGAAATTGACGAGGTTCGGATAGAATGGGCAGGGTTCGTGGGAACATTTGTGTAG
Protein sequenceShow/hide protein sequence
MPALKSSHFIARRCLLFSLPDISPLLSFSAQVSLFSLCPSQLAVVLAVVLAVVLVVLAVVLAVVLARSSRRRLSQPRHSASFAASKAVLVGFLAAGIFVGVSLPAWSCVC
SPVSVAPKSGGPFAKYSFNFTKSRLFSWVFSPPAFSWGVAAGVVVCLVAGLRSSKAVGFSPKHSGFLLLSAPLTSNSDDEGHVAIHMEARPPVGRGLTTMRELAGVRNSG
QRLVVEYNSQGQAVGTNANKMQSFIGVCVRQQIPLTYDHWNKIPQELKDKIFDCIEMSFIVDPRSKHAILQSASKKFRNFRYNLTQKYIIPFMNAPELLQRPPEKYSHID
QQQWIEFVNSRLSEEWKALSGLQKERREKLKYNHHMSRKGYANLAKELELSDDPSNRATLWKEARKGKNKEYFDDDTRERANRIDELAATNQGQDILTEALGTPEHRGRV
RGVGEFVSPSVYFNLPRTSNLGPQSQSKGKTEGSGSKMSTKVESNSSKTKTKGKKIVEEPEEVFESEEVLEVRNFWDSQFYVVMLPKFSVHSVGTPCRLAVNSVDNIVAI
GTMYESSVGCPTIHGVPLGANNVRVVVDMITGEDVLIPIPVVGEIETLSQAKGSFVAWPRKLVILNNKKKVSSPAKPTRNVSVAHSSERTDVHVTIKLLNRYVVLSMGED
DTVPINLSDAIFGVDKTIFLHRDDIMQYCGMVEIGYSCILVYITYLWTVCDNEITSKFLIVDPGTISSFVKCQETRCRNLANRLDMVNLNQLVIIPYNTGCHWMLIVINP
GENTVYVLNSLRSKIEESFQGIINTSLRMWQAKNSLPQYRSNIIWKLIKCPRQLGSVECGYYVQKYIREIVHNSSTSISNLFNTKTAYTQEEIDEVRIEWAGFVGTFV