; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0015926 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0015926
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionTransposase
Genome locationchr12:29400458..29405770
RNA-Seq ExpressionLag0015926
SyntenyLag0015926
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domainsIPR003653 - Ulp1 protease family, C-terminal catalytic domain
IPR004252 - Probable transposase, Ptta/En/Spm, plant
IPR021109 - Aspartic peptidase domain superfamily
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008451868.1 PREDICTED: uncharacterized protein LOC103493028 isoform X1 [Cucumis melo]6.7e-17858.66Show/hide
Query:  PELLQRPPEKYSHIDQQQWIEFVNSRLSEEWKALSGLQKERREKLKYNHHMSRKGYANLAKELELSDDPSNRATLWKEARKGKNKEYFDDDTRERANRID
        P  LQ PPEKYSHID++QW  FV +RLSEEW+  S  Q+ERR K  YNHH+SRKGYANLA+ELELS DP NRATLWKEARK KN   FDD TRE   RID
Subjt:  PELLQRPPEKYSHIDQQQWIEFVNSRLSEEWKALSGLQKERREKLKYNHHMSRKGYANLAKELELSDDPSNRATLWKEARKGKNKEYFDDDTRERANRID

Query:  ELAATNQGQDILTEALGTPEHRGRVRGVGEFVSPSVYFNLPRTS-NLGPQSQSKGKTKGSGSKMST-------------------KVESNSSKTKTKGKK
        ELAA  +GQDILTEALGTPEHRGR+RGVGEFVSP+++ N+ R +  L  QSQ K +T+ S  +  T                   +  S+ S+ KTKGKK
Subjt:  ELAATNQGQDILTEALGTPEHRGRVRGVGEFVSPSVYFNLPRTS-NLGPQSQSKGKTKGSGSKMST-------------------KVESNSSKTKTKGKK

Query:  IVEEPE------EVFESEEVLEVRNFWDSQFYVVMLPKFSVHSVGTPCRLAVNSVDNIVAIGTMYESSVGCPTIHGVPLGANNVRVVVDMITGEDVLIPI
        + +  +       V ESEE LEV+          +L +    S G PC LA+ S+DN+VA+G M+ES V CPTIHG+PLGA N+RV VD+   EDV +PI
Subjt:  IVEEPE------EVFESEEVLEVRNFWDSQFYVVMLPKFSVHSVGTPCRLAVNSVDNIVAIGTMYESSVGCPTIHGVPLGANNVRVVVDMITGEDVLIPI

Query:  PVVGEIETLSQAKGSFVAWPRKLVILNNKKKVSSPAKPTRNVSVAHSSERTDVHVTIKLLNRYAVLSMGEDDTVPINLSDAIFGVDKTIFLHRDDIMQYC
        P+ G+IETL+QA G+FVAWPRKLVI+  +KK  S    T + S   SS+ TDVHVTIKLLNRYA+ +M  +D + I+LS+ IFG +KTI+L RDDI+QYC
Subjt:  PVVGEIETLSQAKGSFVAWPRKLVILNNKKKVSSPAKPTRNVSVAHSSERTDVHVTIKLLNRYAVLSMGEDDTVPINLSDAIFGVDKTIFLHRDDIMQYC

Query:  GMVEIGYSCILVYITYLWTVCDNEITSKFLIVDPGTISSFVKCQETRCRNLANRLDMVNLNQLVIIPYNTG-CHWMLIVINPGENTVYVLNSLRSKIEES
        GM EIGYSCIL YI  LW VC++EIT +F++VD  TISS +K QE R RNL NRL+M NL+QLV+IPYNTG CHW+LI+I+  EN VYV++ LRSKI   
Subjt:  GMVEIGYSCILVYITYLWTVCDNEITSKFLIVDPGTISSFVKCQETRCRNLANRLDMVNLNQLVIIPYNTG-CHWMLIVINPGENTVYVLNSLRSKIEES

Query:  FQGIINTSLRMWQAKNSLPQYRSNIIWKLIKCPRQLGSVECGYYVQKYIREIVHNSSTSISNLFNTKTAYTQEEIDEVRIEWA
        FQG+IN SL+ WQ ++S   YRS I WK IKCPR LGS+ECGYYVQKY+RE+V N++T I+NLFNT  AY QEEID VR+EWA
Subjt:  FQGIINTSLRMWQAKNSLPQYRSNIIWKLIKCPRQLGSVECGYYVQKYIREIVHNSSTSISNLFNTKTAYTQEEIDEVRIEWA

XP_022136076.1 uncharacterized protein LOC111007859 isoform X1 [Momordica charantia]2.9e-18963.04Show/hide
Query:  PELLQRPPEKYSHIDQQQWIEFVNSRLSEEWKALSGLQKERREKLKYNHHMSRKGYANLAKELELSDDPSNRATLWKEARKGKNKEYFDDDTRERANRID
        P  LQ PPEKY HIDQ+QW  FVN+RLSEEW+ LS   KE R K  YNHH+SRKGYANLA+EL+LS DPSNRA LWKEARKGKN EYFDD TRE A RID
Subjt:  PELLQRPPEKYSHIDQQQWIEFVNSRLSEEWKALSGLQKERREKLKYNHHMSRKGYANLAKELELSDDPSNRATLWKEARKGKNKEYFDDDTRERANRID

Query:  ELAATNQGQDILTEALGTPEHRGRVRGVGEFVSPSVYFNLPRTSNLGPQSQSKGKTKGSGSKMSTKVESNSSKTKTKGKKIVEEPEEVFESEEVLEVRNF
        ELAA ++G+DILTEALGT EH GRVRGVGEFVSPS+YFN+ +          K KT+      ST   SN SK K+KGK+IV   EE++ ++E       
Subjt:  ELAATNQGQDILTEALGTPEHRGRVRGVGEFVSPSVYFNLPRTSNLGPQSQSKGKTKGSGSKMSTKVESNSSKTKTKGKKIVEEPEEVFESEEVLEVRNF

Query:  WDSQFYVVMLPKFSVHSVGTPCRLAVNSVDNIVAIGTMYESSVGCPTIHGVPLGANNVRVVVDMITGEDVLIPIPVVGEIETLSQAKGSFVAWPRKLVIL
                          G PC LAV SVDNIVA+GT+++++V CPT+HGVPLG +NVRV+VD++  E   IPIPV GEIETL+Q  G FVAWPR+LVIL
Subjt:  WDSQFYVVMLPKFSVHSVGTPCRLAVNSVDNIVAIGTMYESSVGCPTIHGVPLGANNVRVVVDMITGEDVLIPIPVVGEIETLSQAKGSFVAWPRKLVIL

Query:  NNKKKVSSPAKPTRNVSVAHSSERTDVHVTIKLLNRYAVLSMGEDDTVPINLSDAIFGVDKTIFLHRDDIMQYCGMVEIGYSCILVYITYLWTVCDNEIT
        + +K +SS         +   S+ TDVHV+IKLLNRY +LSM  +DTV INLS  IFG +K I+L R+DIMQYC M+EIGYSCIL YI YLW V + EIT
Subjt:  NNKKKVSSPAKPTRNVSVAHSSERTDVHVTIKLLNRYAVLSMGEDDTVPINLSDAIFGVDKTIFLHRDDIMQYCGMVEIGYSCILVYITYLWTVCDNEIT

Query:  SKFLIVDPGTISSFVKCQETRCRNLANRLDMVNLNQLVIIPYNTGCHWMLIVINPGENTVYVLNSLRSKIEESFQGIINTSLRMWQAKNSLPQYRSNIIW
         KFLIVDP TIS +VK QE R RNLANRL+MVNL QLV+IPY +GCHWMLI+IN  EN VYVL+SLR KI+E +Q +INTSL++WQAK+S+ +YR+N IW
Subjt:  SKFLIVDPGTISSFVKCQETRCRNLANRLDMVNLNQLVIIPYNTGCHWMLIVINPGENTVYVLNSLRSKIEESFQGIINTSLRMWQAKNSLPQYRSNIIW

Query:  KLIKCPRQLGSVECGYYVQKYIREIVHNSSTSISNLFNTKTAYTQEEIDEVRIEWAGKLG
        K IKCP Q+GSVECGYYVQKYIREIV N+ST ISN+FNTK AY QEEIDEVRIEWA  +G
Subjt:  KLIKCPRQLGSVECGYYVQKYIREIVHNSSTSISNLFNTKTAYTQEEIDEVRIEWAGKLG

XP_022136077.1 uncharacterized protein LOC111007859 isoform X2 [Momordica charantia]2.9e-18963.04Show/hide
Query:  PELLQRPPEKYSHIDQQQWIEFVNSRLSEEWKALSGLQKERREKLKYNHHMSRKGYANLAKELELSDDPSNRATLWKEARKGKNKEYFDDDTRERANRID
        P  LQ PPEKY HIDQ+QW  FVN+RLSEEW+ LS   KE R K  YNHH+SRKGYANLA+EL+LS DPSNRA LWKEARKGKN EYFDD TRE A RID
Subjt:  PELLQRPPEKYSHIDQQQWIEFVNSRLSEEWKALSGLQKERREKLKYNHHMSRKGYANLAKELELSDDPSNRATLWKEARKGKNKEYFDDDTRERANRID

Query:  ELAATNQGQDILTEALGTPEHRGRVRGVGEFVSPSVYFNLPRTSNLGPQSQSKGKTKGSGSKMSTKVESNSSKTKTKGKKIVEEPEEVFESEEVLEVRNF
        ELAA ++G+DILTEALGT EH GRVRGVGEFVSPS+YFN+ +          K KT+      ST   SN SK K+KGK+IV   EE++ ++E       
Subjt:  ELAATNQGQDILTEALGTPEHRGRVRGVGEFVSPSVYFNLPRTSNLGPQSQSKGKTKGSGSKMSTKVESNSSKTKTKGKKIVEEPEEVFESEEVLEVRNF

Query:  WDSQFYVVMLPKFSVHSVGTPCRLAVNSVDNIVAIGTMYESSVGCPTIHGVPLGANNVRVVVDMITGEDVLIPIPVVGEIETLSQAKGSFVAWPRKLVIL
                          G PC LAV SVDNIVA+GT+++++V CPT+HGVPLG +NVRV+VD++  E   IPIPV GEIETL+Q  G FVAWPR+LVIL
Subjt:  WDSQFYVVMLPKFSVHSVGTPCRLAVNSVDNIVAIGTMYESSVGCPTIHGVPLGANNVRVVVDMITGEDVLIPIPVVGEIETLSQAKGSFVAWPRKLVIL

Query:  NNKKKVSSPAKPTRNVSVAHSSERTDVHVTIKLLNRYAVLSMGEDDTVPINLSDAIFGVDKTIFLHRDDIMQYCGMVEIGYSCILVYITYLWTVCDNEIT
        + +K +SS         +   S+ TDVHV+IKLLNRY +LSM  +DTV INLS  IFG +K I+L R+DIMQYC M+EIGYSCIL YI YLW V + EIT
Subjt:  NNKKKVSSPAKPTRNVSVAHSSERTDVHVTIKLLNRYAVLSMGEDDTVPINLSDAIFGVDKTIFLHRDDIMQYCGMVEIGYSCILVYITYLWTVCDNEIT

Query:  SKFLIVDPGTISSFVKCQETRCRNLANRLDMVNLNQLVIIPYNTGCHWMLIVINPGENTVYVLNSLRSKIEESFQGIINTSLRMWQAKNSLPQYRSNIIW
         KFLIVDP TIS +VK QE R RNLANRL+MVNL QLV+IPY +GCHWMLI+IN  EN VYVL+SLR KI+E +Q +INTSL++WQAK+S+ +YR+N IW
Subjt:  SKFLIVDPGTISSFVKCQETRCRNLANRLDMVNLNQLVIIPYNTGCHWMLIVINPGENTVYVLNSLRSKIEESFQGIINTSLRMWQAKNSLPQYRSNIIW

Query:  KLIKCPRQLGSVECGYYVQKYIREIVHNSSTSISNLFNTKTAYTQEEIDEVRIEWAGKLG
        K IKCP Q+GSVECGYYVQKYIREIV N+ST ISN+FNTK AY QEEIDEVRIEWA  +G
Subjt:  KLIKCPRQLGSVECGYYVQKYIREIVHNSSTSISNLFNTKTAYTQEEIDEVRIEWAGKLG

XP_038895921.1 uncharacterized protein LOC120084092 isoform X1 [Benincasa hispida]2.0e-18261.07Show/hide
Query:  PELLQRPPEKYSHIDQQQWIEFVNSRLSEEWKALSGLQKERREKLKYNHHMSRKGYANLAKELELSDDPSNRATLWKEARKGKNKEYFDDDTRERANRID
        P  LQ PPEKYSHID++QW  FV +RLSEEW+  S  Q+ERR K  YNHH+SRKGYANLA+ELELS DP NRATLWKEARK KN EY D  TRE A RID
Subjt:  PELLQRPPEKYSHIDQQQWIEFVNSRLSEEWKALSGLQKERREKLKYNHHMSRKGYANLAKELELSDDPSNRATLWKEARKGKNKEYFDDDTRERANRID

Query:  ELAATNQGQDILTEALGTPEHRGRVRGVGEFVSPSVYFNLPRTS-NLGPQSQSKGKTKGSGSKMSTKVESNSSKTKTKGKKIVEE--PEEVFESEEVLEV
        ELAA  +GQDILTEALGTPEHRGR+RGVGEFVSP++++N+ +    L  +SQ++ +T+ S  K  T+   +  +T+     +VE+    +  +    ++ 
Subjt:  ELAATNQGQDILTEALGTPEHRGRVRGVGEFVSPSVYFNLPRTS-NLGPQSQSKGKTKGSGSKMSTKVESNSSKTKTKGKKIVEE--PEEVFESEEVLEV

Query:  RNFWDSQFYVVMLPKFSVHSVGTPCRLAVNSVDNIVAIGTMYESSVGCPTIHGVPLGANNVRVVVDMITGEDVLIPIPVVGEIETLSQAKGSFVAWPRKL
        R        VV  P+  +   G PC LA+ SVDNIVA+GTM+ES   CP+I+ +PLG +NVR +VD++ GEDV +PIP   +I+TL QA G+FVAWPRKL
Subjt:  RNFWDSQFYVVMLPKFSVHSVGTPCRLAVNSVDNIVAIGTMYESSVGCPTIHGVPLGANNVRVVVDMITGEDVLIPIPVVGEIETLSQAKGSFVAWPRKL

Query:  VILNNKKKVSSPAKPTRNVSVAHSSERTDVHVTIKLLNRYAVLSMGEDDTVPINLSDAIFGVDKTIFLHRDDIMQYCGMVEIGYSCILVYITYLWTVCDN
        VI   +KK  S   PT + S+A SS+ TDVHVTIKLLNRYA+ SM  DD + INLS+ I G +KTI+L RDDI+QYCGM EIGYSCIL YI  LW  CD+
Subjt:  VILNNKKKVSSPAKPTRNVSVAHSSERTDVHVTIKLLNRYAVLSMGEDDTVPINLSDAIFGVDKTIFLHRDDIMQYCGMVEIGYSCILVYITYLWTVCDN

Query:  EITSKFLIVDPGTISSFVKCQETRCRNLANRLDMVNLNQLVIIPYNTG-CHWMLIVINPGENTVYVLNSLRSKIEESFQGIINTSLRMWQAKNSLPQYRS
        EIT KF+IVD  TISS VK QE R +NL NRL+MV+L+QLV+IPYNTG CHW+LI+IN  EN VYV++SLRSKI E FQG+INTSL+ WQAK+SL QYR+
Subjt:  EITSKFLIVDPGTISSFVKCQETRCRNLANRLDMVNLNQLVIIPYNTG-CHWMLIVINPGENTVYVLNSLRSKIEESFQGIINTSLRMWQAKNSLPQYRS

Query:  NIIWKLIKCPRQLGSVECGYYVQKYIREIVHNSSTSISNLFNTKTAYTQEEIDEVRIEWA
         I WK IKCPRQ G++ECGYYVQKYIREIV NS+T ISNLFNT+ AY Q+EID VR+EWA
Subjt:  NIIWKLIKCPRQLGSVECGYYVQKYIREIVHNSSTSISNLFNTKTAYTQEEIDEVRIEWA

XP_038895930.1 uncharacterized protein LOC120084092 isoform X2 [Benincasa hispida]8.2e-18461.18Show/hide
Query:  PELLQRPPEKYSHIDQQQWIEFVNSRLSEEWKALSGLQKERREKLKYNHHMSRKGYANLAKELELSDDPSNRATLWKEARKGKNKEYFDDDTRERANRID
        P  LQ PPEKYSHID++QW  FV +RLSEEW+  S  Q+ERR K  YNHH+SRKGYANLA+ELELS DP NRATLWKEARK KN EY D  TRE A RID
Subjt:  PELLQRPPEKYSHIDQQQWIEFVNSRLSEEWKALSGLQKERREKLKYNHHMSRKGYANLAKELELSDDPSNRATLWKEARKGKNKEYFDDDTRERANRID

Query:  ELAATNQGQDILTEALGTPEHRGRVRGVGEFVSPSVYFNLPRTS-NLGPQSQSKGKTKGSGSKMSTKVESNSSKTKTKGKKIVEE--PEEVFESEEVLEV
        ELAA  +GQDILTEALGTPEHRGR+RGVGEFVSP++++N+ +    L  +SQ++ +T+ S  K  T+   +  +T+     +VE+    +  +    ++ 
Subjt:  ELAATNQGQDILTEALGTPEHRGRVRGVGEFVSPSVYFNLPRTS-NLGPQSQSKGKTKGSGSKMSTKVESNSSKTKTKGKKIVEE--PEEVFESEEVLEV

Query:  RNFWDSQFYVVMLPKFSVHSVGTPCRLAVNSVDNIVAIGTMYESSVGCPTIHGVPLGANNVRVVVDMITGEDVLIPIPVVGEIETLSQAKGSFVAWPRKL
        R        VV  P+  +   G PC LA+ SVDNIVA+GTM+ES   CP+I+ +PLG +NVR +VD++ GEDV +PIP   +I+TL QA G+FVAWPRKL
Subjt:  RNFWDSQFYVVMLPKFSVHSVGTPCRLAVNSVDNIVAIGTMYESSVGCPTIHGVPLGANNVRVVVDMITGEDVLIPIPVVGEIETLSQAKGSFVAWPRKL

Query:  VILNNKKKVSSPAKPTRNVSVAHSSERTDVHVTIKLLNRYAVLSMGEDDTVPINLSDAIFGVDKTIFLHRDDIMQYCGMVEIGYSCILVYITYLWTVCDN
        VI   +KK  S   PT + S+A SS+ TDVHVTIKLLNRYA+ SM  DD + INLS+ I G +KTI+L RDDI+QYCGM EIGYSCIL YI  LW  CD+
Subjt:  VILNNKKKVSSPAKPTRNVSVAHSSERTDVHVTIKLLNRYAVLSMGEDDTVPINLSDAIFGVDKTIFLHRDDIMQYCGMVEIGYSCILVYITYLWTVCDN

Query:  EITSKFLIVDPGTISSFVKCQETRCRNLANRLDMVNLNQLVIIPYNTGCHWMLIVINPGENTVYVLNSLRSKIEESFQGIINTSLRMWQAKNSLPQYRSN
        EIT KF+IVD  TISS VK QE R +NL NRL+MV+L+QLV+IPYNTGCHW+LI+IN  EN VYV++SLRSKI E FQG+INTSL+ WQAK+SL QYR+ 
Subjt:  EITSKFLIVDPGTISSFVKCQETRCRNLANRLDMVNLNQLVIIPYNTGCHWMLIVINPGENTVYVLNSLRSKIEESFQGIINTSLRMWQAKNSLPQYRSN

Query:  IIWKLIKCPRQLGSVECGYYVQKYIREIVHNSSTSISNLFNTKTAYTQEEIDEVRIEWA
        I WK IKCPRQ G++ECGYYVQKYIREIV NS+T ISNLFNT+ AY Q+EID VR+EWA
Subjt:  IIWKLIKCPRQLGSVECGYYVQKYIREIVHNSSTSISNLFNTKTAYTQEEIDEVRIEWA

TrEMBL top hitse value%identityAlignment
A0A1S3BRX5 uncharacterized protein LOC103493028 isoform X13.2e-17858.66Show/hide
Query:  PELLQRPPEKYSHIDQQQWIEFVNSRLSEEWKALSGLQKERREKLKYNHHMSRKGYANLAKELELSDDPSNRATLWKEARKGKNKEYFDDDTRERANRID
        P  LQ PPEKYSHID++QW  FV +RLSEEW+  S  Q+ERR K  YNHH+SRKGYANLA+ELELS DP NRATLWKEARK KN   FDD TRE   RID
Subjt:  PELLQRPPEKYSHIDQQQWIEFVNSRLSEEWKALSGLQKERREKLKYNHHMSRKGYANLAKELELSDDPSNRATLWKEARKGKNKEYFDDDTRERANRID

Query:  ELAATNQGQDILTEALGTPEHRGRVRGVGEFVSPSVYFNLPRTS-NLGPQSQSKGKTKGSGSKMST-------------------KVESNSSKTKTKGKK
        ELAA  +GQDILTEALGTPEHRGR+RGVGEFVSP+++ N+ R +  L  QSQ K +T+ S  +  T                   +  S+ S+ KTKGKK
Subjt:  ELAATNQGQDILTEALGTPEHRGRVRGVGEFVSPSVYFNLPRTS-NLGPQSQSKGKTKGSGSKMST-------------------KVESNSSKTKTKGKK

Query:  IVEEPE------EVFESEEVLEVRNFWDSQFYVVMLPKFSVHSVGTPCRLAVNSVDNIVAIGTMYESSVGCPTIHGVPLGANNVRVVVDMITGEDVLIPI
        + +  +       V ESEE LEV+          +L +    S G PC LA+ S+DN+VA+G M+ES V CPTIHG+PLGA N+RV VD+   EDV +PI
Subjt:  IVEEPE------EVFESEEVLEVRNFWDSQFYVVMLPKFSVHSVGTPCRLAVNSVDNIVAIGTMYESSVGCPTIHGVPLGANNVRVVVDMITGEDVLIPI

Query:  PVVGEIETLSQAKGSFVAWPRKLVILNNKKKVSSPAKPTRNVSVAHSSERTDVHVTIKLLNRYAVLSMGEDDTVPINLSDAIFGVDKTIFLHRDDIMQYC
        P+ G+IETL+QA G+FVAWPRKLVI+  +KK  S    T + S   SS+ TDVHVTIKLLNRYA+ +M  +D + I+LS+ IFG +KTI+L RDDI+QYC
Subjt:  PVVGEIETLSQAKGSFVAWPRKLVILNNKKKVSSPAKPTRNVSVAHSSERTDVHVTIKLLNRYAVLSMGEDDTVPINLSDAIFGVDKTIFLHRDDIMQYC

Query:  GMVEIGYSCILVYITYLWTVCDNEITSKFLIVDPGTISSFVKCQETRCRNLANRLDMVNLNQLVIIPYNTG-CHWMLIVINPGENTVYVLNSLRSKIEES
        GM EIGYSCIL YI  LW VC++EIT +F++VD  TISS +K QE R RNL NRL+M NL+QLV+IPYNTG CHW+LI+I+  EN VYV++ LRSKI   
Subjt:  GMVEIGYSCILVYITYLWTVCDNEITSKFLIVDPGTISSFVKCQETRCRNLANRLDMVNLNQLVIIPYNTG-CHWMLIVINPGENTVYVLNSLRSKIEES

Query:  FQGIINTSLRMWQAKNSLPQYRSNIIWKLIKCPRQLGSVECGYYVQKYIREIVHNSSTSISNLFNTKTAYTQEEIDEVRIEWA
        FQG+IN SL+ WQ ++S   YRS I WK IKCPR LGS+ECGYYVQKY+RE+V N++T I+NLFNT  AY QEEID VR+EWA
Subjt:  FQGIINTSLRMWQAKNSLPQYRSNIIWKLIKCPRQLGSVECGYYVQKYIREIVHNSSTSISNLFNTKTAYTQEEIDEVRIEWA

A0A5D3CYL9 ULP_PROTEASE domain-containing protein3.2e-17858.66Show/hide
Query:  PELLQRPPEKYSHIDQQQWIEFVNSRLSEEWKALSGLQKERREKLKYNHHMSRKGYANLAKELELSDDPSNRATLWKEARKGKNKEYFDDDTRERANRID
        P  LQ PPEKYSHID++QW  FV +RLSEEW+  S  Q+ERR K  YNHH+SRKGYANLA+ELELS DP NRATLWKEARK KN   FDD TRE   RID
Subjt:  PELLQRPPEKYSHIDQQQWIEFVNSRLSEEWKALSGLQKERREKLKYNHHMSRKGYANLAKELELSDDPSNRATLWKEARKGKNKEYFDDDTRERANRID

Query:  ELAATNQGQDILTEALGTPEHRGRVRGVGEFVSPSVYFNLPRTS-NLGPQSQSKGKTKGSGSKMST-------------------KVESNSSKTKTKGKK
        ELAA  +GQDILTEALGTPEHRGR+RGVGEFVSP+++ N+ R +  L  QSQ K +T+ S  +  T                   +  S+ S+ KTKGKK
Subjt:  ELAATNQGQDILTEALGTPEHRGRVRGVGEFVSPSVYFNLPRTS-NLGPQSQSKGKTKGSGSKMST-------------------KVESNSSKTKTKGKK

Query:  IVEEPE------EVFESEEVLEVRNFWDSQFYVVMLPKFSVHSVGTPCRLAVNSVDNIVAIGTMYESSVGCPTIHGVPLGANNVRVVVDMITGEDVLIPI
        + +  +       V ESEE LEV+          +L +    S G PC LA+ S+DN+VA+G M+ES V CPTIHG+PLGA N+RV VD+   EDV +PI
Subjt:  IVEEPE------EVFESEEVLEVRNFWDSQFYVVMLPKFSVHSVGTPCRLAVNSVDNIVAIGTMYESSVGCPTIHGVPLGANNVRVVVDMITGEDVLIPI

Query:  PVVGEIETLSQAKGSFVAWPRKLVILNNKKKVSSPAKPTRNVSVAHSSERTDVHVTIKLLNRYAVLSMGEDDTVPINLSDAIFGVDKTIFLHRDDIMQYC
        P+ G+IETL+QA G+FVAWPRKLVI+  +KK  S    T + S   SS+ TDVHVTIKLLNRYA+ +M  +D + I+LS+ IFG +KTI+L RDDI+QYC
Subjt:  PVVGEIETLSQAKGSFVAWPRKLVILNNKKKVSSPAKPTRNVSVAHSSERTDVHVTIKLLNRYAVLSMGEDDTVPINLSDAIFGVDKTIFLHRDDIMQYC

Query:  GMVEIGYSCILVYITYLWTVCDNEITSKFLIVDPGTISSFVKCQETRCRNLANRLDMVNLNQLVIIPYNTG-CHWMLIVINPGENTVYVLNSLRSKIEES
        GM EIGYSCIL YI  LW VC++EIT +F++VD  TISS +K QE R RNL NRL+M NL+QLV+IPYNTG CHW+LI+I+  EN VYV++ LRSKI   
Subjt:  GMVEIGYSCILVYITYLWTVCDNEITSKFLIVDPGTISSFVKCQETRCRNLANRLDMVNLNQLVIIPYNTG-CHWMLIVINPGENTVYVLNSLRSKIEES

Query:  FQGIINTSLRMWQAKNSLPQYRSNIIWKLIKCPRQLGSVECGYYVQKYIREIVHNSSTSISNLFNTKTAYTQEEIDEVRIEWA
        FQG+IN SL+ WQ ++S   YRS I WK IKCPR LGS+ECGYYVQKY+RE+V N++T I+NLFNT  AY QEEID VR+EWA
Subjt:  FQGIINTSLRMWQAKNSLPQYRSNIIWKLIKCPRQLGSVECGYYVQKYIREIVHNSSTSISNLFNTKTAYTQEEIDEVRIEWA

A0A6J1C2H7 uncharacterized protein LOC111007859 isoform X11.4e-18963.04Show/hide
Query:  PELLQRPPEKYSHIDQQQWIEFVNSRLSEEWKALSGLQKERREKLKYNHHMSRKGYANLAKELELSDDPSNRATLWKEARKGKNKEYFDDDTRERANRID
        P  LQ PPEKY HIDQ+QW  FVN+RLSEEW+ LS   KE R K  YNHH+SRKGYANLA+EL+LS DPSNRA LWKEARKGKN EYFDD TRE A RID
Subjt:  PELLQRPPEKYSHIDQQQWIEFVNSRLSEEWKALSGLQKERREKLKYNHHMSRKGYANLAKELELSDDPSNRATLWKEARKGKNKEYFDDDTRERANRID

Query:  ELAATNQGQDILTEALGTPEHRGRVRGVGEFVSPSVYFNLPRTSNLGPQSQSKGKTKGSGSKMSTKVESNSSKTKTKGKKIVEEPEEVFESEEVLEVRNF
        ELAA ++G+DILTEALGT EH GRVRGVGEFVSPS+YFN+ +          K KT+      ST   SN SK K+KGK+IV   EE++ ++E       
Subjt:  ELAATNQGQDILTEALGTPEHRGRVRGVGEFVSPSVYFNLPRTSNLGPQSQSKGKTKGSGSKMSTKVESNSSKTKTKGKKIVEEPEEVFESEEVLEVRNF

Query:  WDSQFYVVMLPKFSVHSVGTPCRLAVNSVDNIVAIGTMYESSVGCPTIHGVPLGANNVRVVVDMITGEDVLIPIPVVGEIETLSQAKGSFVAWPRKLVIL
                          G PC LAV SVDNIVA+GT+++++V CPT+HGVPLG +NVRV+VD++  E   IPIPV GEIETL+Q  G FVAWPR+LVIL
Subjt:  WDSQFYVVMLPKFSVHSVGTPCRLAVNSVDNIVAIGTMYESSVGCPTIHGVPLGANNVRVVVDMITGEDVLIPIPVVGEIETLSQAKGSFVAWPRKLVIL

Query:  NNKKKVSSPAKPTRNVSVAHSSERTDVHVTIKLLNRYAVLSMGEDDTVPINLSDAIFGVDKTIFLHRDDIMQYCGMVEIGYSCILVYITYLWTVCDNEIT
        + +K +SS         +   S+ TDVHV+IKLLNRY +LSM  +DTV INLS  IFG +K I+L R+DIMQYC M+EIGYSCIL YI YLW V + EIT
Subjt:  NNKKKVSSPAKPTRNVSVAHSSERTDVHVTIKLLNRYAVLSMGEDDTVPINLSDAIFGVDKTIFLHRDDIMQYCGMVEIGYSCILVYITYLWTVCDNEIT

Query:  SKFLIVDPGTISSFVKCQETRCRNLANRLDMVNLNQLVIIPYNTGCHWMLIVINPGENTVYVLNSLRSKIEESFQGIINTSLRMWQAKNSLPQYRSNIIW
         KFLIVDP TIS +VK QE R RNLANRL+MVNL QLV+IPY +GCHWMLI+IN  EN VYVL+SLR KI+E +Q +INTSL++WQAK+S+ +YR+N IW
Subjt:  SKFLIVDPGTISSFVKCQETRCRNLANRLDMVNLNQLVIIPYNTGCHWMLIVINPGENTVYVLNSLRSKIEESFQGIINTSLRMWQAKNSLPQYRSNIIW

Query:  KLIKCPRQLGSVECGYYVQKYIREIVHNSSTSISNLFNTKTAYTQEEIDEVRIEWAGKLG
        K IKCP Q+GSVECGYYVQKYIREIV N+ST ISN+FNTK AY QEEIDEVRIEWA  +G
Subjt:  KLIKCPRQLGSVECGYYVQKYIREIVHNSSTSISNLFNTKTAYTQEEIDEVRIEWAGKLG

A0A6J1C2V2 uncharacterized protein LOC111007859 isoform X41.0e-17662.29Show/hide
Query:  PELLQRPPEKYSHIDQQQWIEFVNSRLSEEWKALSGLQKERREKLKYNHHMSRKGYANLAKELELSDDPSNRATLWKEARKGKNKEYFDDDTRERANRID
        P  LQ PPEKY HIDQ+QW  FVN+RLSEEW+ LS   KE R K  YNHH+SRKGYANLA+EL+LS DPSNRA LWKEARKGKN EYFDD TRE A RID
Subjt:  PELLQRPPEKYSHIDQQQWIEFVNSRLSEEWKALSGLQKERREKLKYNHHMSRKGYANLAKELELSDDPSNRATLWKEARKGKNKEYFDDDTRERANRID

Query:  ELAATNQGQDILTEALGTPEHRGRVRGVGEFVSPSVYFNLPRTSNLGPQSQSKGKTKGSGSKMSTKVESNSSKTKTKGKKIVEEPEEVFESEEVLEVRNF
        ELAA ++G+DILTEALGT EH GRVRGVGEFVSPS+YFN+ +          K KT+      ST   SN SK K+KGK+IV   EE++ ++E       
Subjt:  ELAATNQGQDILTEALGTPEHRGRVRGVGEFVSPSVYFNLPRTSNLGPQSQSKGKTKGSGSKMSTKVESNSSKTKTKGKKIVEEPEEVFESEEVLEVRNF

Query:  WDSQFYVVMLPKFSVHSVGTPCRLAVNSVDNIVAIGTMYESSVGCPTIHGVPLGANNVRVVVDMITGEDVLIPIPVVGEIETLSQAKGSFVAWPRKLVIL
                          G PC LAV SVDNIVA+GT+++++V CPT+HGVPLG +NVRV+VD++  E   IPIPV GEIETL+Q  G FVAWPR+LVIL
Subjt:  WDSQFYVVMLPKFSVHSVGTPCRLAVNSVDNIVAIGTMYESSVGCPTIHGVPLGANNVRVVVDMITGEDVLIPIPVVGEIETLSQAKGSFVAWPRKLVIL

Query:  NNKKKVSSPAKPTRNVSVAHSSERTDVHVTIKLLNRYAVLSMGEDDTVPINLSDAIFGVDKTIFLHRDDIMQYCGMVEIGYSCILVYITYLWTVCDNEIT
        + +K +SS         +   S+ TDVHV+IKLLNRY +LSM  +DTV INLS  IFG +K I+L R+DIMQYC M+EIGYSCIL YI YLW V + EIT
Subjt:  NNKKKVSSPAKPTRNVSVAHSSERTDVHVTIKLLNRYAVLSMGEDDTVPINLSDAIFGVDKTIFLHRDDIMQYCGMVEIGYSCILVYITYLWTVCDNEIT

Query:  SKFLIVDPGTISSFVKCQETRCRNLANRLDMVNLNQLVIIPYNTGCHWMLIVINPGENTVYVLNSLRSKIEESFQGIINTSLRMWQAKNSLPQYRSNIIW
         KFLIVDP TIS +VK QE R RNLANRL+MVNL QLV+IPY +GCHWMLI+IN  EN VYVL+SLR KI+E +Q +INTSL++WQAK+S+ +YR+N IW
Subjt:  SKFLIVDPGTISSFVKCQETRCRNLANRLDMVNLNQLVIIPYNTGCHWMLIVINPGENTVYVLNSLRSKIEESFQGIINTSLRMWQAKNSLPQYRSNIIW

Query:  KLIKCPRQLGSVECGYYVQKYIREIVHNSSTSI
        K IKCP Q+GSVECGYYVQKYIREIV N+ST I
Subjt:  KLIKCPRQLGSVECGYYVQKYIREIVHNSSTSI

A0A6J1C4J7 uncharacterized protein LOC111007859 isoform X21.4e-18963.04Show/hide
Query:  PELLQRPPEKYSHIDQQQWIEFVNSRLSEEWKALSGLQKERREKLKYNHHMSRKGYANLAKELELSDDPSNRATLWKEARKGKNKEYFDDDTRERANRID
        P  LQ PPEKY HIDQ+QW  FVN+RLSEEW+ LS   KE R K  YNHH+SRKGYANLA+EL+LS DPSNRA LWKEARKGKN EYFDD TRE A RID
Subjt:  PELLQRPPEKYSHIDQQQWIEFVNSRLSEEWKALSGLQKERREKLKYNHHMSRKGYANLAKELELSDDPSNRATLWKEARKGKNKEYFDDDTRERANRID

Query:  ELAATNQGQDILTEALGTPEHRGRVRGVGEFVSPSVYFNLPRTSNLGPQSQSKGKTKGSGSKMSTKVESNSSKTKTKGKKIVEEPEEVFESEEVLEVRNF
        ELAA ++G+DILTEALGT EH GRVRGVGEFVSPS+YFN+ +          K KT+      ST   SN SK K+KGK+IV   EE++ ++E       
Subjt:  ELAATNQGQDILTEALGTPEHRGRVRGVGEFVSPSVYFNLPRTSNLGPQSQSKGKTKGSGSKMSTKVESNSSKTKTKGKKIVEEPEEVFESEEVLEVRNF

Query:  WDSQFYVVMLPKFSVHSVGTPCRLAVNSVDNIVAIGTMYESSVGCPTIHGVPLGANNVRVVVDMITGEDVLIPIPVVGEIETLSQAKGSFVAWPRKLVIL
                          G PC LAV SVDNIVA+GT+++++V CPT+HGVPLG +NVRV+VD++  E   IPIPV GEIETL+Q  G FVAWPR+LVIL
Subjt:  WDSQFYVVMLPKFSVHSVGTPCRLAVNSVDNIVAIGTMYESSVGCPTIHGVPLGANNVRVVVDMITGEDVLIPIPVVGEIETLSQAKGSFVAWPRKLVIL

Query:  NNKKKVSSPAKPTRNVSVAHSSERTDVHVTIKLLNRYAVLSMGEDDTVPINLSDAIFGVDKTIFLHRDDIMQYCGMVEIGYSCILVYITYLWTVCDNEIT
        + +K +SS         +   S+ TDVHV+IKLLNRY +LSM  +DTV INLS  IFG +K I+L R+DIMQYC M+EIGYSCIL YI YLW V + EIT
Subjt:  NNKKKVSSPAKPTRNVSVAHSSERTDVHVTIKLLNRYAVLSMGEDDTVPINLSDAIFGVDKTIFLHRDDIMQYCGMVEIGYSCILVYITYLWTVCDNEIT

Query:  SKFLIVDPGTISSFVKCQETRCRNLANRLDMVNLNQLVIIPYNTGCHWMLIVINPGENTVYVLNSLRSKIEESFQGIINTSLRMWQAKNSLPQYRSNIIW
         KFLIVDP TIS +VK QE R RNLANRL+MVNL QLV+IPY +GCHWMLI+IN  EN VYVL+SLR KI+E +Q +INTSL++WQAK+S+ +YR+N IW
Subjt:  SKFLIVDPGTISSFVKCQETRCRNLANRLDMVNLNQLVIIPYNTGCHWMLIVINPGENTVYVLNSLRSKIEESFQGIINTSLRMWQAKNSLPQYRSNIIW

Query:  KLIKCPRQLGSVECGYYVQKYIREIVHNSSTSISNLFNTKTAYTQEEIDEVRIEWAGKLG
        K IKCP Q+GSVECGYYVQKYIREIV N+ST ISN+FNTK AY QEEIDEVRIEWA  +G
Subjt:  KLIKCPRQLGSVECGYYVQKYIREIVHNSSTSISNLFNTKTAYTQEEIDEVRIEWAGKLG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).1.6e-0430.14Show/hide
Query:  KEGFIDGTIIKPTS-TKMAKDWKCNNDIIASWIMNSVSKEIAASIVYTDCVKDVWDEL-------ADSKVFSL
        K GFIDGT+ KP   + + + W+  N ++  W+MNS++ ++  S++Y +    +W++L        D K++ L
Subjt:  KEGFIDGTIIKPTS-TKMAKDWKCNNDIIASWIMNSVSKEIAASIVYTDCVKDVWDEL-------ADSKVFSL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATGCACCAGAACTGTTGCAGAGACCTCCTGAGAAATATTCACATATTGATCAACAACAGTGGATTGAGTTTGTTAATTCAAGATTATCTGAGGAGTGGAAGGCACT
TAGTGGTCTCCAAAAAGAAAGAAGGGAGAAACTTAAATACAATCATCATATGTCTCGTAAGGGATATGCTAACCTGGCCAAAGAACTAGAATTGTCAGATGATCCTAGCA
ACCGAGCCACTCTATGGAAGGAAGCAAGAAAAGGAAAAAATAAGGAATATTTTGATGACGACACTAGAGAACGCGCTAATCGAATTGACGAGCTAGCTGCGACAAATCAA
GGTCAAGATATACTTACTGAAGCATTAGGCACGCCAGAACATAGAGGGCGTGTTAGAGGAGTGGGTGAGTTTGTTTCACCATCTGTCTACTTCAATCTTCCTAGGACATC
AAACTTAGGTCCACAATCCCAAAGCAAGGGTAAAACGAAAGGCAGTGGTTCAAAAATGTCAACAAAGGTAGAAAGTAATTCTTCAAAGACAAAAACAAAAGGAAAGAAGA
TTGTTGAAGAACCAGAAGAGGTGTTCGAGTCAGAAGAAGTGTTAGAGGTGAGAAATTTTTGGGATAGTCAGTTTTACGTCGTTATGCTGCCGAAATTTTCGGTGCACTCG
GTTGGTACACCATGTCGCTTGGCTGTAAATTCAGTGGACAACATTGTTGCCATAGGCACAATGTATGAATCAAGTGTCGGATGTCCAACAATCCATGGAGTACCACTAGG
AGCCAATAATGTTCGAGTGGTGGTGGATATGATCACAGGCGAAGATGTTCTCATACCAATTCCTGTGGTTGGAGAAATAGAGACGCTTAGTCAAGCAAAGGGTAGCTTTG
TGGCGTGGCCTCGCAAGCTTGTGATTCTAAATAACAAGAAAAAGGTATCTTCTCCCGCAAAACCTACAAGGAATGTGTCTGTTGCACATTCTTCCGAACGTACAGATGTC
CACGTTACTATCAAGTTGTTGAATCGATATGCCGTTCTGTCCATGGGAGAGGATGACACAGTTCCTATCAACTTGAGTGACGCCATATTTGGAGTTGATAAAACAATTTT
CCTACATCGTGATGACATCATGCAGTATTGCGGGATGGTTGAAATAGGGTACTCATGTATACTAGTGTACATTACGTATCTATGGACTGTATGTGACAATGAAATAACCA
GTAAATTTCTGATAGTTGATCCAGGAACCATCTCTTCATTTGTAAAGTGTCAAGAAACTCGTTGCAGAAATCTAGCCAACCGGCTAGACATGGTTAATTTGAATCAACTA
GTCATCATCCCGTACAATACTGGGTGTCATTGGATGTTGATTGTGATCAATCCTGGAGAGAATACCGTCTATGTGTTGAACTCATTACGTAGTAAGATTGAAGAAAGTTT
TCAAGGAATTATCAATACATCATTGAGAATGTGGCAAGCAAAGAACTCACTCCCACAATATCGCTCGAACATAATTTGGAAACTTATAAAGTGCCCCCGTCAATTGGGTT
CTGTAGAGTGTGGATATTATGTGCAAAAGTATATTCGAGAAATAGTACACAACTCATCTACGTCTATAAGTAATCTTTTTAACACGAAAACAGCATATACGCAAGAAGAA
ATTGACGAGGTTCGGATAGAATGGGCAGGGAAACTTGGTTTTGGGAAATTAAAGGCAATCACAATTGTTCTGCAATTGGCAGATCATTCGATGACATACCCTAAAGGTGT
GTTAGAGGATGTTCTGGTCAACGTTGACAAATTCATATTCCCTACAAACTTTGTGGTGTTAGACATGGAAGAAGACCCTGAAGTCCCAATCATTCTTGGGAGACCATTTC
TGGCAACAGCTCACGCTTTGATTGAGGGCAGTGATGATGGCTCTCTCCGGAAAAACAAGGAAGGTTTTATCGATGGTACCATCATCAAGCCTACTTCGACTAAAATGGCA
AAGGATTGGAAGTGTAACAACGACATCATAGCCTCGTGGATTATGAACTCTGTTTCGAAAGAAATCGCAGCTAGCATTGTCTATACTGATTGTGTTAAAGACGTATGGGA
TGAACTTGCCGATAGCAAGGTCTTTTCCCTAGTTATTCAAGAGGAACGTCAAAGGAATGTTGGAATTCCAATCTCACAACAAGATCTTGTTGCCCTAATTGCTGCTAGTG
GTTCGAAGAAGAATTCTCCTAATAACTCCAATACTAATTCTTCTTTTCGTAAAAAGGAACACACCAGAGATGATCAAAGGCCAATTTGTTCTCATTGTGGAATCAAAGGC
CATACCGTGGACAAATGCTACAAGATCCACAGATACCCACCTGGTTATCGATCACGAAATCAGAATACAAGCACCAATGTTAATGGCAATAATTCTCTAAGTATTGTTGG
CACCAAATCTGTTTCTCAGGTTCCACAGCCTTCTTCAATCGGAGGGAATTTCTTTGCAAGCCTCAATGCCAACCAGTGTTCTCAACTGATGGAGCTCTTAACTTCTCAAC
TCCAAGCTGCTAAAACTGATTCTATCACAGTGGCAATCTCTGCGGTTCATGCCACAGGTATATGCTCTACTCTATCTTCACCTATTTCATCTGATGTATGGATTGTTGAC
TCGAGTGCTTCACGACATATCTCTCATCGTTATCATTTATTTCACAACTGGCTTCGTGTATATGATGTCTCTATTGTTCTGTGA
mRNA sequenceShow/hide mRNA sequence
ATGAATGCACCAGAACTGTTGCAGAGACCTCCTGAGAAATATTCACATATTGATCAACAACAGTGGATTGAGTTTGTTAATTCAAGATTATCTGAGGAGTGGAAGGCACT
TAGTGGTCTCCAAAAAGAAAGAAGGGAGAAACTTAAATACAATCATCATATGTCTCGTAAGGGATATGCTAACCTGGCCAAAGAACTAGAATTGTCAGATGATCCTAGCA
ACCGAGCCACTCTATGGAAGGAAGCAAGAAAAGGAAAAAATAAGGAATATTTTGATGACGACACTAGAGAACGCGCTAATCGAATTGACGAGCTAGCTGCGACAAATCAA
GGTCAAGATATACTTACTGAAGCATTAGGCACGCCAGAACATAGAGGGCGTGTTAGAGGAGTGGGTGAGTTTGTTTCACCATCTGTCTACTTCAATCTTCCTAGGACATC
AAACTTAGGTCCACAATCCCAAAGCAAGGGTAAAACGAAAGGCAGTGGTTCAAAAATGTCAACAAAGGTAGAAAGTAATTCTTCAAAGACAAAAACAAAAGGAAAGAAGA
TTGTTGAAGAACCAGAAGAGGTGTTCGAGTCAGAAGAAGTGTTAGAGGTGAGAAATTTTTGGGATAGTCAGTTTTACGTCGTTATGCTGCCGAAATTTTCGGTGCACTCG
GTTGGTACACCATGTCGCTTGGCTGTAAATTCAGTGGACAACATTGTTGCCATAGGCACAATGTATGAATCAAGTGTCGGATGTCCAACAATCCATGGAGTACCACTAGG
AGCCAATAATGTTCGAGTGGTGGTGGATATGATCACAGGCGAAGATGTTCTCATACCAATTCCTGTGGTTGGAGAAATAGAGACGCTTAGTCAAGCAAAGGGTAGCTTTG
TGGCGTGGCCTCGCAAGCTTGTGATTCTAAATAACAAGAAAAAGGTATCTTCTCCCGCAAAACCTACAAGGAATGTGTCTGTTGCACATTCTTCCGAACGTACAGATGTC
CACGTTACTATCAAGTTGTTGAATCGATATGCCGTTCTGTCCATGGGAGAGGATGACACAGTTCCTATCAACTTGAGTGACGCCATATTTGGAGTTGATAAAACAATTTT
CCTACATCGTGATGACATCATGCAGTATTGCGGGATGGTTGAAATAGGGTACTCATGTATACTAGTGTACATTACGTATCTATGGACTGTATGTGACAATGAAATAACCA
GTAAATTTCTGATAGTTGATCCAGGAACCATCTCTTCATTTGTAAAGTGTCAAGAAACTCGTTGCAGAAATCTAGCCAACCGGCTAGACATGGTTAATTTGAATCAACTA
GTCATCATCCCGTACAATACTGGGTGTCATTGGATGTTGATTGTGATCAATCCTGGAGAGAATACCGTCTATGTGTTGAACTCATTACGTAGTAAGATTGAAGAAAGTTT
TCAAGGAATTATCAATACATCATTGAGAATGTGGCAAGCAAAGAACTCACTCCCACAATATCGCTCGAACATAATTTGGAAACTTATAAAGTGCCCCCGTCAATTGGGTT
CTGTAGAGTGTGGATATTATGTGCAAAAGTATATTCGAGAAATAGTACACAACTCATCTACGTCTATAAGTAATCTTTTTAACACGAAAACAGCATATACGCAAGAAGAA
ATTGACGAGGTTCGGATAGAATGGGCAGGGAAACTTGGTTTTGGGAAATTAAAGGCAATCACAATTGTTCTGCAATTGGCAGATCATTCGATGACATACCCTAAAGGTGT
GTTAGAGGATGTTCTGGTCAACGTTGACAAATTCATATTCCCTACAAACTTTGTGGTGTTAGACATGGAAGAAGACCCTGAAGTCCCAATCATTCTTGGGAGACCATTTC
TGGCAACAGCTCACGCTTTGATTGAGGGCAGTGATGATGGCTCTCTCCGGAAAAACAAGGAAGGTTTTATCGATGGTACCATCATCAAGCCTACTTCGACTAAAATGGCA
AAGGATTGGAAGTGTAACAACGACATCATAGCCTCGTGGATTATGAACTCTGTTTCGAAAGAAATCGCAGCTAGCATTGTCTATACTGATTGTGTTAAAGACGTATGGGA
TGAACTTGCCGATAGCAAGGTCTTTTCCCTAGTTATTCAAGAGGAACGTCAAAGGAATGTTGGAATTCCAATCTCACAACAAGATCTTGTTGCCCTAATTGCTGCTAGTG
GTTCGAAGAAGAATTCTCCTAATAACTCCAATACTAATTCTTCTTTTCGTAAAAAGGAACACACCAGAGATGATCAAAGGCCAATTTGTTCTCATTGTGGAATCAAAGGC
CATACCGTGGACAAATGCTACAAGATCCACAGATACCCACCTGGTTATCGATCACGAAATCAGAATACAAGCACCAATGTTAATGGCAATAATTCTCTAAGTATTGTTGG
CACCAAATCTGTTTCTCAGGTTCCACAGCCTTCTTCAATCGGAGGGAATTTCTTTGCAAGCCTCAATGCCAACCAGTGTTCTCAACTGATGGAGCTCTTAACTTCTCAAC
TCCAAGCTGCTAAAACTGATTCTATCACAGTGGCAATCTCTGCGGTTCATGCCACAGGTATATGCTCTACTCTATCTTCACCTATTTCATCTGATGTATGGATTGTTGAC
TCGAGTGCTTCACGACATATCTCTCATCGTTATCATTTATTTCACAACTGGCTTCGTGTATATGATGTCTCTATTGTTCTGTGA
Protein sequenceShow/hide protein sequence
MNAPELLQRPPEKYSHIDQQQWIEFVNSRLSEEWKALSGLQKERREKLKYNHHMSRKGYANLAKELELSDDPSNRATLWKEARKGKNKEYFDDDTRERANRIDELAATNQ
GQDILTEALGTPEHRGRVRGVGEFVSPSVYFNLPRTSNLGPQSQSKGKTKGSGSKMSTKVESNSSKTKTKGKKIVEEPEEVFESEEVLEVRNFWDSQFYVVMLPKFSVHS
VGTPCRLAVNSVDNIVAIGTMYESSVGCPTIHGVPLGANNVRVVVDMITGEDVLIPIPVVGEIETLSQAKGSFVAWPRKLVILNNKKKVSSPAKPTRNVSVAHSSERTDV
HVTIKLLNRYAVLSMGEDDTVPINLSDAIFGVDKTIFLHRDDIMQYCGMVEIGYSCILVYITYLWTVCDNEITSKFLIVDPGTISSFVKCQETRCRNLANRLDMVNLNQL
VIIPYNTGCHWMLIVINPGENTVYVLNSLRSKIEESFQGIINTSLRMWQAKNSLPQYRSNIIWKLIKCPRQLGSVECGYYVQKYIREIVHNSSTSISNLFNTKTAYTQEE
IDEVRIEWAGKLGFGKLKAITIVLQLADHSMTYPKGVLEDVLVNVDKFIFPTNFVVLDMEEDPEVPIILGRPFLATAHALIEGSDDGSLRKNKEGFIDGTIIKPTSTKMA
KDWKCNNDIIASWIMNSVSKEIAASIVYTDCVKDVWDELADSKVFSLVIQEERQRNVGIPISQQDLVALIAASGSKKNSPNNSNTNSSFRKKEHTRDDQRPICSHCGIKG
HTVDKCYKIHRYPPGYRSRNQNTSTNVNGNNSLSIVGTKSVSQVPQPSSIGGNFFASLNANQCSQLMELLTSQLQAAKTDSITVAISAVHATGICSTLSSPISSDVWIVD
SSASRHISHRYHLFHNWLRVYDVSIVL